Jul 21, 2024·8 min read

Promotion in AI teams: defining growth beyond execution

Promotion in AI teams should reflect decision scope, risk ownership, and clarity across teams, not raw output. Use a simple framework to judge growth.

When output stops telling the whole story

AI can draft code, docs, and tests in minutes. That speed helps, but it also makes raw output a weak signal. A dashboard full of closed tasks no longer tells you who is ready for more responsibility.

Two people can ship the same amount of work and still operate at very different levels. One uses AI to move faster on clear tasks. Another uses the same tools, then stops to question assumptions, catch edge cases, and change direction before the team builds the wrong thing. The numbers look similar. The judgment does not.

That is why promotion decisions get blurry in AI teams. Many managers still reward what they can count: tickets closed, pull requests merged, bugs fixed, pages written. Those measures were never perfect. They matter even less when AI handles much of the first draft.

A simple example makes the gap obvious. Imagine two engineers shipping a checkout update. Both finish on time. One accepts AI suggestions, cleans up the code, and moves on. The other notices that the new flow will confuse support, break an analytics report, and create a refund issue for finance. That second person did more than execute. They reduced risk outside their own task.

The best promotion questions now sound different:

Who makes sound calls when the brief is incomplete?
Who catches problems before customers or other teams feel them?
Who helps the team move with less confusion and less rework?

Those signals are harder to count, but they are much closer to real growth. When AI handles more execution, the bigger step is not typing faster. It is carrying broader decisions, owning tougher trade-offs, and keeping work clear enough that other people can move without guessing.

What changes when AI handles most execution

AI compresses routine work. Drafts appear in seconds, tests get written faster, and basic research starts to look similar across people using the same tools. That changes what good performance looks like.

You can no longer judge growth by asking who produced the most tickets, pages, or code. AI helps many people reach a similar level of raw output. The bigger difference appears earlier, in the choices people make before the work becomes visible.

One person asks a better question, cuts a weak idea, or spots a dependency before it blocks three other people. Another pushes ahead, finishes fast, and leaves design, support, or operations to clean up the mess later. Both may look productive on paper. Only one makes the work easier for the whole team.

That is why promotions shift away from pure execution. The real gap shows up in judgment, trade-offs, and timing. Who picks the simpler path when speed matters? Who slows down when a rushed release could hurt customers? Who notices that a small request will spill into pricing, onboarding, or support?

People with more scope reduce avoidable problems before they spread. They turn fuzzy requests into clear work. They pull in the right people early. They notice risk while the cost of fixing it is still low. That kind of contribution often saves more time than finishing ten extra tasks.

Finished work still counts, but it tells less of the story than it used to. A stronger signal is how someone shapes the work around them.

When two people use the same AI tools, one may simply move faster. The other may help the whole team move with fewer resets, fewer surprises, and better timing. That second person is usually carrying the larger role, even if their output count looks ordinary.

Decision scope shows how much ground someone covers

When AI writes drafts, suggests code, and prepares options, task count stops being a clean measure of growth. A better signal is the size of the decisions a person can make and still get a solid result.

Start with independence. Can this person choose an approach on their own, or do they still wait for approval every time speed, cost, and quality pull in different directions? Someone can finish tickets faster with AI and still need help on the hard calls.

The wider the decision, the more ground that person covers. At one level, they decide how to complete a task. At the next level, they shape a whole feature. After that, they make choices for a broader area where one call affects product, engineering, support, and budget at the same time.

This is why decision scope matters so much. Two people can look equally productive when AI handles much of the execution. One completes assigned work. The other decides what should happen, which trade-off makes sense, and when the team can move without another approval step.

You can usually spot the difference in a few habits. People operating at a wider level choose between speed, cost, and quality on their own. They explain the downside of each option in plain language. They set direction for a feature, not just the next task. They know when to act and when a choice needs wider input. Most importantly, they move work forward without creating more approval loops.

A small product team makes this easy to see. One engineer asks, "Which option do you want?" Another says, "Option B ships this week, costs less to run, and creates a support issue we can accept for now." Both may use the same AI tools. Only one is covering a wider decision area.

That wider area is often the real promotion line. It shows trust, judgment, and the ability to carry a bigger part of the business without constant supervision.

Risk ownership shows who carries the harder load

When AI handles much of the building, the harder work often shifts from doing to judging. That is why promotion decisions should pay close attention to risk ownership, not just output.

A person with more scope does not simply ship more. They spot where a launch can fail before customers feel it. They ask plain questions early: What breaks if this model gives the wrong answer? How will we know? Who gets paged? How do we roll back quickly?

That is the difference between someone who completes tasks and someone who protects the business. The second person carries more weight, even if their name appears on fewer tickets.

One strong signal is how often they name failure points before launch. They do not wait for a surprise in production. They point out weak data, missing checks, unclear ownership, or a support gap while the work is still easy to change.

Another signal is whether they plan for the bad day, not just the happy path. In AI products, that often means thinking through rollback steps, monitoring for drift or bad responses, support handoff when users get confusing results, budget impact if usage spikes, and deadline risk if one dependency slips.

You can also see risk ownership in how people raise concerns. Strong candidates do not turn every doubt into drama, but they also do not stay quiet just to look easygoing. They bring up small problems while they are still cheap to fix.

That habit protects customer trust. It also protects budgets and schedules, because the team spends less time cleaning up preventable mistakes.

Say two people use AI to ship the same feature in three days. One celebrates the speed and moves on. The other notices that support has no script for likely failures, alerts are missing, and the rollback plan depends on one engineer being online. The second person is carrying the harder load.

That is often what growth looks like now. Less proof that you can produce. More proof that you can keep the product safe when production gets messy.

Clarity across teams keeps work moving

Make AI Transition Practical

Plan team structure, workflows, and AI use with experienced CTO help.

Plan With Oleg

When AI writes a first draft of almost everything, confusion gets easier to hide. Product may think a feature is ready, design may still have open questions, and engineering may build from a different assumption. The person ready for more scope closes those gaps early.

They write goals in plain language. Not broad slogans, and not a wall of detail. A good goal says who the change is for, what should happen, and what counts as done.

Before work starts, they make ownership clear. They do not leave people guessing who decides copy, who approves edge cases, or who answers customer impact questions. That sounds simple, but teams lose days when nobody names the owner until a problem appears.

Often, a short handoff note does more than a long meeting. It should cover the goal, who owns each decision, what is still open, and when the team needs an answer.

People who are ready for promotion do this without making it feel heavy. They turn messy discussion into a shared plan that product, design, and engineering can all use. After a meeting, fewer loose ends remain. Someone knows the next step. Someone knows the deadline. Someone knows what does not need more debate.

This matters most when teams move fast. AI can produce mockups, copy, code, and test cases in hours. If nobody clears up meaning between teams, that speed just creates more rework. A designer revises the wrong flow. An engineer builds the wrong edge case. A product manager rewrites the ticket after development has already started.

A simple example helps. A team wants to add onboarding prompts. One person says, "We want users to activate faster." That is too vague. A stronger teammate rewrites it as, "New users should reach their first completed action in under five minutes," then names product as owner of the success metric, design as owner of the prompt flow, and engineering as owner of event tracking. Now the work can move.

Clear teams are not always calmer. They are simply easier to trust because people know what they own and what happens next.

How to assess promotion step by step

When AI writes code, drafts specs, and handles routine tasks, raw output stops being a clean way to judge growth. You need a process that looks at judgment, ownership, and how work moves through the company.

Start by defining the role before you judge the person. A higher level role should own a wider set of decisions. Write those decisions down in plain language. Should this person choose between speed and reliability? Decide when to cut scope? Settle conflicts between product and engineering? If you cannot name the decisions, the role is probably still fuzzy.

Then write down the risks that come with that role. Every promotion should carry more downside if the call is wrong. That might mean shipping something unstable, delaying a launch, creating support pain, or locking the team into a bad technical choice. If the role carries no real risk, the scope may not be different enough yet.

A simple review process works well:

Define the decisions the higher level role owns.
Name the risks attached to those decisions.
Ask for two recent examples where the person faced real trade-offs.
Review how they kept product, design, engineering, support, or sales aligned.
Compare the evidence across a full delivery cycle, from planning to launch to follow-up.

Those examples matter a lot. Ask what options they had, what they chose, what they gave up, and what happened after. Good answers sound specific. "I used AI to finish faster" is weak. "I cut one feature, warned support about likely complaints, and avoided a risky release" is much stronger.

Cross-team clarity often separates a solid promotion case from a shaky one. Someone may move fast alone and still create confusion for everyone else. Look for clear handoffs, fewer repeated questions, and fewer late surprises.

Finally, judge the pattern over time. One strong week is not enough. Watch a full cycle: planning, execution, launch, and cleanup. If the person makes sound calls, carries the risk, and keeps other teams clear through that whole arc, the case is usually strong.

A simple example from a small product team

Assess Risk Ownership

Look at who spots failure points early and protects launch quality.

Book Consultation

A small SaaS team needs to add a new billing option before the next sales push. Two engineers get similar tasks. Both use AI every day, and both ship real work.

The first engineer moves fast. In two days, they use AI to draft the API changes, the checkout screen, and the admin toggle. The demo works. The ticket closes. On paper, that looks strong.

The second engineer moves slower. They still use AI for code, but they spend extra time on the messy parts: prorated charges, failed renewals, duplicate invoices, refunds, and what happens if a customer downgrades halfway through a cycle. They ask support what customers usually complain about. They check with finance on how the invoice should read. They add a short internal note so support knows what to say when billing goes wrong.

Both engineers produced output. Only one took on more business risk.

If the release goes badly, the first engineer can say, "The code worked in staging." The second engineer owns a wider slice of the result. They tried to prevent angry tickets, billing mistakes, and confused handoffs between teams. That is a bigger job, even if it created fewer lines of code and took an extra day.

This is where promotions often get misread. When AI helps everyone draft faster, speed stops being a clean signal. Scope is a better one. Who made decisions that protected revenue, support time, and customer trust? Who reduced the chance of a painful surprise after launch?

In this example, the stronger promotion case belongs to the second engineer. Not because they were vaguely "more careful," but because they covered more ground and owned harder consequences.

Fast drafting still matters. Teams need it. But promotion should follow the person who can ship and also think through what the business will have to live with a week later.

Mistakes that lead to weak promotion calls

Promotion decisions get fuzzy when leaders still reward the old signals. Fast output is easier to buy now. Good judgment is not. If a manager does not separate the two, promotions start to feel random.

One common mistake is promoting the person who closes the most tasks. AI can help someone write more code, draft more tickets, and move faster through small requests. That does not tell you whether they picked the right work, spotted a bad trade-off, or protected the team from a bigger problem. Volume is easy to count, so teams lean on it too much.

Confidence fools people too. Some team members speak clearly, answer fast, and sound certain in meetings. That can look like seniority. Sound judgment usually appears in quieter ways: asking one sharp question before a risky release, slowing down a rushed decision, or admitting when the data is thin. The calm person who avoids a bad call often matters more than the loud person who wins the room.

Another weak signal is invisible prevention work. Many promotion cases focus on visible wins: launches, demos, shipped features. But mature people often spend time on the less glamorous work that keeps everyone else moving. They tighten a review step, catch a security gap, fix a flaky deployment, or clean up a handoff between teams. Nothing dramatic happens because they did the work well. That still counts.

Tool skill causes another bad call. Someone may know every shortcut in Claude, GPT, or code assistants and still not be ready for more scope. Tool skill helps. It is not seniority by itself. A senior person uses AI to make better decisions, reduce avoidable risk, and leave clearer paths for others.

The last mistake is waiting until review season to define what growth means. By then, people argue from memory and recent wins. Expectations need names much earlier. A manager should say, in plain language, what the next level looks like in daily work.

A strong promotion call usually answers five questions. Did this person improve decisions, not just output? Did they take responsibility when the risk was real? Did they prevent problems others never saw? Did they make work clearer across teams? Did they do this more than once, under pressure? If those answers stay vague, the case is weak, even when the task count looks great.

Quick checks before you make the call

Bring In a Fractional CTO

Use practical CTO advice to define growth beyond ticket count.

Talk to Oleg

Raw output is now a weak signal. A person can ship a lot with good tools and still need close guidance. The stronger signal is judgment under pressure.

Start with how they explain decisions. Ask them to walk through one real choice they made last month. Good answers stay plain and concrete. They should say what they chose, what they gave up, and why that trade made sense for the team, the customer, or the budget.

Then look at clarity across teams. If product, design, support, or sales leave a meeting with different ideas about who owns the next step, that person is not operating at a wider level yet. People ready for more scope reduce confusion. They name owners, set boundaries, and catch gaps before work stalls.

Risk tells you even more. Promotions should not go to people who keep work looking calm by hiding bad news. Trust the person who says, "This may slip by three days unless we cut this part," early enough for others to respond. That kind of honesty saves time and money.

A quick check can fit on one page:

They explain trade-offs without jargon or fog.
Other teams can say who owns what after working with them.
They raise risks early, even when the news is awkward.
They keep moving when requirements stay fuzzy, and they write down assumptions.
You would trust their judgment on a larger customer promise or a larger spend.

That last point matters more than many managers admit. A promotion is not a reward for effort alone. It is a decision to widen decision scope and accept more risk ownership. If you would still want to approve every customer commitment or every extra dollar they spend, wait.

When two or more of these checks fail, do not guess. Give the person one clearer area to own, then watch what happens over the next cycle. Growth usually shows up there first: cleaner decisions, fewer surprises, and less confusion for everyone around them.

Next steps for a team that wants clearer growth

Teams get stuck when promotion talks still reward volume, even after AI handles much of the drafting, coding, testing, and documentation. The career ladder has to describe what grows when output no longer tells the full story. In most teams, that means broader decision scope, heavier risk ownership, and clearer coordination across functions.

Start with the level definitions. If a level still says someone writes more code or closes more tickets, rewrite it. A stronger description says what decisions they make, what failures they prevent, and how they keep product, design, engineering, and support aligned when work gets messy.

One practical fix is a short audit before the next review cycle:

Pull your last few promotion decisions and score each person on scope, risk, and cross-team clarity.
Look for places where speed got rewarded more than judgment.
Add real examples from your own team to each level so managers can compare like with like.
Review those examples together until different managers reach similar conclusions.

Shared examples matter more than polished wording. "Led delivery" is too vague to help. "Chose the rollout plan, spotted a failure risk early, and got engineering and support aligned before launch" gives managers something concrete to judge.

Small startups often need an outside view here. Founders usually know who works hard, but they can blur effort, loyalty, and level. A Fractional CTO can separate those things and tighten the ladder. Oleg Sotnikov at oleg.is does this kind of work with startups and small businesses, drawing on experience across software engineering, CTO leadership, startup advising, and AI-first operations.

Do one pass, use it in a real promotion cycle, then revise it again. If managers still debate the same cases for the same reasons, the framework is still too fuzzy. Fix that before the next round.