Feb 21, 2025·8 min read

AI assistant usage policy engineers will actually use

Q: How should we mark AI-assisted pull requests?

Mark the pull request in one simple way, like an `[AI-assisted]` prefix or one checkbox in the template. Reviewers need a clear signal so they know to read the diff carefully and check what the tool changed.

Build an AI assistant usage policy that sets data rules, review duty, and tool limits in plain language your engineering team will follow.

Why teams ignore AI policies

Teams usually do not ignore an AI policy because they hate rules. They ignore it because the policy slows them down.

A developer is fixing a bug and wants to know if they can paste an error log into an approved tool. Instead of a clear answer, they get two pages of soft language. Words like "sensitive," "internal," and "limited use" sound careful, but they leave too much room for debate. People stop reading and start guessing.

Length makes that worse. Engineers work fast. If a policy takes ten minutes to decode, they ask a teammate, copy whatever the rest of the team does, or skip the document altogether. That is how one team ends up with five different habits and no real standard.

Tool confusion is another common problem. Many policies say AI is allowed, but never name which tools are approved, which ones are blocked, and what data can go into each one. Then one person uses a public chatbot for code, another uses an enterprise plan, and both think they followed the rules.

Review rules often collapse in the same way. A policy says engineers must "review outputs," but never explains what review means. Does the author test every change? Does a second engineer inspect security-sensitive code? Who checks prompts that touched customer data? If the document does not answer those questions, responsibility gets blurry fast.

Copied vendor language does not help. Legal text from a tool provider rarely matches how a product team actually ships software. A startup team needs direct answers to real questions: can we paste stack traces into the tool, can we use AI for test code, can it help with production migrations, who signs off on AI-written code, and which data is always off-limits?

Good policies read like team habits, not legal homework. A person should be able to scan the page, find their case, and know what to do in under a minute.

Start with a small scope

The first version should stay narrow on purpose. A policy works better when it covers the places people touch every day instead of trying to solve every edge case at once.

Start with the work where people already copy, paste, summarize, and draft. For most product teams, that means source code and pull requests, internal docs and specs, tickets and bug reports, support notes, and team chat. That already covers most day-to-day AI use without dragging in HR files, finance records, or legal contracts.

The first page should also say who the rule applies to. Keep it simple: employees, contractors, interns, and vendors who can access company systems or customer data. If someone can paste a stack trace, a customer email, or a roadmap note into an AI tool, the rule applies to them.

Be specific about systems. Do not write "internal tools" and assume everyone will interpret it the same way. Name the actual places where sensitive material lives: the code repository, ticketing system, docs workspace, chat, file storage, CRM, analytics warehouse, and production database. People make fewer mistakes when they can match the rule to the screen in front of them.

Do not settle every rare case on day one. Edge cases bloat the document, and almost nobody remembers them later. If one team handles regulated records or merger documents, add a short follow-up note for that group when you actually need it.

A one-page policy can feel almost too short. That is usually a good sign. Engineers will read one page. They will not read six pages of vague warnings. If a sentence does not change what someone does in code, docs, tickets, or chat, cut it.

Use simple data classes

Three or four labels are enough. If you create seven, people stop checking and start guessing.

Use the same labels across the company. Engineering should not call something "internal" while legal calls the same thing "restricted" and support calls it "private." One label per type of data keeps decisions quick and avoids useless arguments.

A simple model usually works:

Public: anything you could share openly, such as marketing copy, public docs, open-source snippets, or sample code with fake data.
Internal: normal company material that is not public but does not expose customers or secrets, such as architecture notes, backlog text, or code patterns without real credentials or user data.
Confidential: real business data, such as source code from private repositories, logs with customer details, incident notes, or support tickets with names, emails, and account history.
Regulated: data with legal or strict contract limits, such as payment details, health records, government IDs, private keys, and customer data covered by contract.

Examples matter more than definitions. A stack trace with fake values might be internal. The same trace becomes confidential if it includes a customer email, IP address, or token. A support ticket about a login bug may stay internal if it uses test data, then jump to a higher class the moment it includes a real person or a regulated record.

Keep those labels consistent in code reviews, ticket templates, logging rules, and support workflows. People can learn one system once. They will ignore four slightly different versions.

List approved tools and limits

A short tool list does more work than a long policy. If engineers have to guess which assistant they can use, they will pick whatever is fastest.

Do not throw every AI product into one bucket. A coding assistant in an IDE, a browser chat app, and an API used in a backend all carry different risks. Name the exact tools you allow, including plugins and APIs. "AI coding tools" is too vague to help anyone.

Give each tool a clear boundary

For every approved tool, list the same basic facts: the exact product name and plan, where teams may use it, which data class it may handle, whether it stores prompts or logs, and whether people must use a company account or company API key.

That last point matters more than many teams expect. Personal accounts should stay out of work completely. The moment someone pastes customer data into a personal chatbot, the company loses control over access, offboarding, logging, and retention.

Be plain about limits. You might allow one assistant for code suggestions in company repositories, another for document drafting with internal notes, and block both from customer secrets or regulated data. Most engineers will follow rules when the boundary fits real work and the words are clear.

A small example helps more than a generic policy line. You might allow one coding assistant in the company IDE for internal repositories, allow one API for test generation in CI, and ban browser chat tools for anything beyond public code snippets. That is much easier to follow than a broad statement about "approved AI tools."

One person should own the list. Make an engineering manager, staff engineer, or CTO review it every month. Vendors change retention settings, plugins appear, and safe defaults do not stay safe forever.

Set review duty for AI-written work

Classify Data Without Confusion

Map public, internal, confidential, and regulated data to real team workflows.

Plan With Oleg

If a tool writes part of the code, a person owns the result. Put that rule near the top.

No AI-written change should go to production without human review, and no pull request should merge because the code "looks fine." Keep the rule simple: every AI-assisted change needs a reviewer before merge, and high-risk changes need a named approver before release.

Reviewers should know what to check. In most teams, that means security risks like secrets, authentication, permissions, and unsafe dependencies; test coverage that matches the risk; licensing concerns if the change adds packages, snippets, models, or generated assets; and basic correctness, especially around edge cases and silent failures.

Mark AI-assisted pull requests in one easy way. A title prefix like "[AI-assisted]" or a single checkbox in the PR template is enough. Do not turn it into paperwork. Reviewers just need a clear signal to slow down and inspect the work a little more carefully.

Rubber-stamp reviews need an explicit ban. Say that reviewers must read the code, run checks that fit the risk, and leave at least one comment showing what they verified. That sounds strict, but it prevents the usual failure mode where everyone assumes someone else looked closely.

High-risk changes need stronger approval. If code touches payments, customer data, access control, infrastructure, legal terms, or production migrations, name the approver role in the policy. For example, an engineering lead can approve product logic, a security owner can approve auth or data handling, and the CTO or head of engineering can approve production infrastructure changes.

This does not need a big process. A startup can say it in three lines: label AI-assisted PRs, require human review before merge, and require named approval for risky changes.

Draft the first version step by step

Your first policy should fit on one page. Write it like an internal README: plain words, short sections, a few examples.

Start with the purpose in two sentences. For example: "We use AI tools to speed up routine work without exposing sensitive data or skipping human review. This policy says which tools we approve, what data people can share, and what work a person must check before it ships."

Then add a small table for data classes:

Data class	Allowed in approved AI tools	Example
Public	Yes	Marketing copy, public docs, open source code
Internal	Yes, with care	Roadmaps, internal docs, non-secret code
Sensitive	No	Secrets, customer data, production dumps, contracts

After that, list approved tools and their limits. Name the tool, say who can use it, and state what people must not paste into it. A short line like "Use only company accounts" prevents a lot of bad habits.

Review rules need the same plain style. If AI writes code, a reviewer reads the diff, runs tests, and checks for unsafe calls, odd logic, and copied code. If AI writes docs, the document owner checks facts, commands, and examples before publishing.

Do not assign questions to a mailbox or a vague "engineering team." Name one person. A staff engineer, tech lead, or engineering manager is enough. People need to know who can answer an edge case in one message.

You also do not need a complicated rollout. Publish the draft where the team already works, give people a week to point out unclear rules and missing cases, fix the wording that trips them up, resolve disagreements in writing, and publish version one. Then update it after people have actually used it.

A policy people can remember beats a perfect policy nobody opens twice.

A simple example from a product team

Cut Policy Friction Fast

Replace soft wording with direct rules that fit how your team ships.

Book Policy Help

A checkout test starts failing after a small refactor. The developer wants a quick second opinion, but does not paste real orders, names, or payment details into the prompt.

Instead, they use mock data: a fake order ID, sample prices, and a short stack trace. They ask the assistant why a rounding test might fail after a tax change, and they do it through the company account already approved for this kind of work.

The assistant points to a likely cause: the code now rounds one value earlier than the test expects. That still does not make the answer correct. The developer checks the code, updates the test fixture, and runs the full suite.

The change still needs human review. Before merge, another engineer reads the diff line by line and verifies four things: the prompt used fake data only, the account and tool matched team rules, the code change fixed the real bug, and no extra logic slipped in with the suggested edit.

That review matters more than the prompt itself. AI code review rules should make it clear that the assistant can explain, draft, and suggest, but a human still decides what ships.

After the fix goes in, the team saves a short version of the case in the policy. Not a long write-up. Just a few lines that show what happened, what data was safe to use, which tool was allowed, and what review the team required.

This is why examples work better than abstract warnings. Engineers remember a real case like this and can copy the pattern the next time a test fails.

Mistakes that make the policy useless

Most weak drafts fail for the same reason: they try to reduce risk with blanket bans. That can look safe on paper, but engineers still need help with debugging, test generation, and small research tasks. If the rule says "never use AI," people move to personal accounts and private browser tabs. Then the company loses review, logging, and any real control.

Dense legal writing causes the same problem. If engineers have to read three pages to learn whether they can paste a stack trace, they will guess and move on. The policy should read like a team rule, not a contract fight.

Another common mistake is mixing policy with setup guides. Data classes for AI use should stay separate from installation steps, prompt tips, or IDE instructions. The policy answers what is allowed. Supporting docs explain how to do it.

Teams also forget people. They write rules for full-time engineers and leave out contractors, interns, support staff, and vendors. Those people may still handle source code, tickets, logs, and customer data. If they are outside the policy, the policy has a hole in the middle.

Tool lists go stale fast. A page of approved tools means very little if it still includes expired pilots, old vendors, or products nobody has checked in months. That trains people to ignore the document.

Most bad policies share the same symptoms. They block normal work instead of guiding it, use legal language for everyday decisions, mix permanent rules with temporary setup notes, forget non-employees who still handle data, and leave old tools on the list long after the team stopped using them.

If the policy is short, current, and clear about review duty, engineers will use it. If it feels fake, they will work around it.

Quick checks before you publish

Review Your AI Tool Rules

Get a practical second pass on approved tools, data limits, and account setup.

Book Review

A good policy should survive a five-minute read. If a new engineer needs a meeting to decode it, trim it down. Drop obvious definitions, move rare edge cases out of the main flow, and keep the real rules near the top.

Test it on someone who did not help write it. Give them five minutes, take the document away, and ask what they understood. They should be able to tell you which AI tools they may use, what data they may share, and what work still needs human review.

Fuzzy language around customer data is the most common failure. Do not say "use caution" or "avoid sensitive data." Name the data classes people may put into approved AI tools, and name the classes they must keep out. If someone cannot tell whether a support ticket, crash log, screenshot, or code snippet with secrets is allowed, the policy is not ready.

Review flow needs the same clarity. A reviewer should spot AI-assisted work quickly and know what to check. That can be as simple as a PR note, a commit tag, or one line in the template. If reviewers need detective work, they will miss things.

A short pre-publish checklist helps. A new engineer should be able to explain the rules in plain words after one read. People should know where customer data may go and where it may not. Reviewers should be able to spot AI-assisted code or content without guessing. Managers should know the approved tools from memory. The team should be able to find the current version in one obvious place, not buried in chat history.

One last test is worth doing. Ask two people on different teams to read it on their own. If they give different answers, keep editing.

What to do after rollout

A policy only works when it meets real work. In the first team meeting after launch, bring two real prompts and walk through them together. Pick one that is clearly safe, like a test case draft, and one that sits closer to the line, like a bug report with customer details.

That exercise does two useful things quickly. It shows whether engineers interpret the data classes the same way, and it exposes tool rules that looked clear on paper but break down in practice.

During the first month, pay more attention to friction than to compliance scores. If people keep asking the same question, the policy needs a rewrite, not another reminder.

A simple pattern works well: note the exceptions people make, record where the data class was unclear, track the cases where review caught an AI mistake, and turn repeated questions into short examples.

Exceptions deserve special attention. If an engineer used an unapproved tool during an outage, do not let that story vanish in chat history. Add a short note to the policy or companion doc that explains what happened, what data was involved, and what the team should do next time.

Short examples help more than long rules. One paragraph that says "You can paste a fake customer record into Tool A, but not a real support ticket" will save more time than a page of general warnings.

After about a month, revise the parts that caused hesitation. If people keep arguing about words like "sensitive," replace them with names the team already uses, such as production logs, contracts, invoices, or source code from private repositories.

Keep the policy short as the stack changes. Review the approved AI tools list on a regular schedule, remove tools nobody uses, and add limits for new tools before they spread across the team. A short document gets opened. A long one gets ignored.

Some teams also want an outside review before the rules drift. In that case, Oleg Sotnikov at oleg.is can review the tool list, data rules, and rollout plan as part of Fractional CTO or startup advisory work. That kind of second pass is most useful when a team is changing how it builds, reviews, and ships software.

If the policy still fits on one screen and answers real questions, keep it. If it grows every time someone makes a mistake, cut it back and replace the extra text with a better example.

Frequently Asked Questions

Why do engineers ignore AI policies?

Teams ignore them when the rules slow normal work. If people cannot tell in under a minute whether they can use a tool or paste a specific snippet, they stop reading and start guessing.

How long should the policy be?

Keep version one to one page if you can. A short policy works better because people can scan it during real work and find an answer fast.

What should the first version cover?

Start with daily work like code, pull requests, docs, tickets, bug reports, and chat. Name who the rule covers and which systems hold sensitive material so people can match the rule to the screen in front of them.

How many data classes do we need?

Use three or four labels, then stop there. Most teams do well with public, internal, confidential, and regulated because people can learn them quickly and apply them the same way across engineering, support, and ops.

Can engineers paste logs or stack traces into AI tools?

Yes, but only when the data fits the allowed class for that tool. A stack trace with fake values may be fine, while the same trace turns off-limits if it includes a real email, token, IP address, or account detail.

Should we allow personal AI accounts for work?

No. Work should stay inside company accounts and company API keys so the company controls access, offboarding, logging, and retention.

How should we mark AI-assisted pull requests?

Mark the pull request in one simple way, like an [AI-assisted] prefix or one checkbox in the template. Reviewers need a clear signal so they know to read the diff carefully and check what the tool changed.

What review does AI-written code need?

A person owns the result every time. The reviewer should read the code, run the checks that fit the risk, and verify correctness, security, tests, and any copied code or package changes before merge.

Who should approve high-risk AI-assisted changes?

Name the approver role in the policy. Risky work like payments, customer data, access control, production migrations, or infra changes should go to an engineering lead, security owner, CTO, or another named person your team already trusts.

What should we do after rollout?

Run two real examples with the team during the first month, then fix the parts that confuse people. If the same question keeps coming up, rewrite the policy, trim stale tool entries, and add a short example instead of more vague text.