Sep 04, 2025·7 min read

How to evaluate platform work before you approve it

How to evaluate platform work starts with three questions: what risk drops, what gets faster, and which repeated team pain stops after launch.

How to evaluate platform work before you approve it

When platform work is real and when it is delay

Teams call a lot of things "platform work" because it sounds responsible. Sometimes it is. Sometimes it is a way to avoid a harder product decision, postpone a trade-off, or buy time when nobody wants to say, "We do not know what to build next."

The difference is usually simple. Real platform work removes a problem people already feel. Delay dressed as architecture talks about future flexibility, future scale, or cleaner foundations without naming a current cost.

If releases fail twice a week, that is real. If tests take 45 minutes and slow every deploy, that is real. If one person keeps spending half a day fixing the same broken pipeline, that is real too. But if the pitch is "we should rebuild this now so it will be more elegant later," you should slow down and ask for proof.

Plain language helps more than diagrams. If an engineer needs ten minutes of jargon to explain the job, ask for the one-minute version. A good explanation sounds like this: "Deployments fail twice a week, and one engineer loses half a day fixing them." Now you can judge the work.

Weak proposals usually have the same smell. They promise flexibility without saying what will change next. They focus on internal neatness instead of speed, reliability, or customer impact. They aim at scale the company does not have. They avoid numbers, dates, and recent examples.

Some platform work is worth funding even when users never notice it directly. Better monitoring, safer releases, and cleaner build pipelines can save money and reduce stress. But even then, the team should be able to state the result in plain terms. Maybe releases drop from 90 minutes to 15. Maybe on-call noise falls by half. Maybe a new hire can ship in three days instead of three weeks.

Small teams feel this mistake harder than large ones. A month spent on the wrong internal project can wipe out a quarter of product learning. Founders often approve infrastructure cleanup because it sounds smart, then realize later that the team avoided the harder question: which customer problem matters most right now?

When you evaluate platform work, ask what pain goes away for good, who gets faster, and what risk gets smaller. If the answer stays foggy, the work is not ready.

Three questions to ask first

When a team asks for platform work, skip the polished architecture story and ask for effects you can notice after release. Good work changes daily operations. Bad work stays in slides, tickets, and hopeful language.

Start with risk. Ask, "What risk drops when this ships?" The answer should be specific. "Fewer failed deploys" is useful. "Better architecture" is not. A team should name the problem, how often it happens, and what lower risk looks like after the change.

Then ask about speed. Ask, "What gets faster for engineers or users?" Speed can mean build time, release time, recovery time, onboarding, page load, or support response. Rough numbers are fine. Cutting test runs from 45 minutes to 12 is real. Saying "development will be more efficient" says almost nothing.

The third question is about repeated pain. Ask, "What annoying problem disappears for good?" This is where real platform work often earns its cost. Maybe engineers keep fixing the same deployment script by hand. Maybe every new service needs the same setup work. Maybe one flaky part of the system wakes someone up twice a month. If the work removes that pain for good, it has a strong case.

One more question keeps the other three honest: "When should we see the change?" Teams should not ask you to wait six months before any result shows up. Even larger platform work should have an early signal. You might expect fewer support tickets in the first month, faster deploys in the first sprint after release, or fewer infra incidents within a quarter.

Strong technical leaders do this translation well. They do not hide behind jargon. They tell you what pain goes away, what number improves, and when you should check whether the claim was true.

A simple filter works well here:

  • risk goes down in a measurable way
  • speed improves for a real workflow
  • repeated pain stops coming back
  • the team gives you a date to verify the change

If those answers are clear, the work may be worth doing. If they stay blurry, wait.

Review the pain before the proposed fix

Teams often arrive with a neat plan, a diagram, and a polished name for the effort. Ignore the proposed fix for a moment and pin down the pain first. That is the fastest way to learn how to evaluate platform work.

Write the problem in one sentence. Keep it blunt. "Deployments fail twice a week and each fix burns three engineer hours" is useful. "Our platform needs modernization" tells you almost nothing.

Then ask who feels that pain every week. Is it engineers waiting 40 minutes for tests? Support staff chasing missing logs? Customers hitting errors during busy hours? If nobody feels the problem often, the work may be nice to have, not urgent.

Before you approve anything, run five checks.

  • Name the current problem in plain language.
  • Say who loses time, money, or sleep because of it.
  • Pick one result you can verify within 30 days.
  • Ask for the smallest change that could prove the claim.
  • Delay the rest until that small piece works.

That last step matters most. Big platform projects attract big promises. A smaller proof keeps the team honest. If someone says a new internal tooling layer will speed delivery, ask them to test it on one service, one pipeline, or one painful handoff first.

A quick example shows why. A team asks for six weeks to rebuild its deployment setup. The stated goal is "better reliability." Push for numbers. How many failed releases happen now? How long does rollback take? What should change by day 30?

A tighter first step might be one release path with automatic rollback and clearer logs. If failures drop from four a month to one, you have evidence. If nothing moves, you stopped a larger waste early.

Many architecture reviews fail because people judge the design before they test the claim behind it. Start with impact, not elegance. A clever design that removes no recurring pain is still the wrong work.

If the team cannot name the pain, the people affected, and the first result you can measure, do not approve the full effort. Approve a short discovery step or a very small trial. Real platform work earns trust in pieces. It should not need blind faith.

What real platform work usually changes

Good platform work shows up in weekly numbers and daily habits. You do not need a long architecture deck to see it. The team feels the difference because routine work gets easier, releases get calmer, and fewer problems keep coming back.

Deployment speed is one obvious sign. A useful change cuts the time from "ready to ship" to production, and it also cuts failed releases. If a team used to spend 90 minutes pushing a release, rolling back, and fixing small config mistakes, real platform work might reduce that to 15 minutes with a repeatable pipeline. Faster matters, but boring matters more. A boring release process is usually a good one.

On-call noise should drop too. If alerts still wake people up for the same disk issue, timeout, or expired token, then the work did not solve much. Good platform changes remove whole classes of repeat incidents. You want fewer pages, shorter incident chats, and less "we fixed it again" work.

Another strong signal is the end of manual fixes. Teams lose a lot of time to the same small repair: restarting stuck jobs, patching environments by hand, copying secrets, cleaning bad test data. Real platform work removes those chores. Once the fix is built into the system, engineers stop carrying tribal knowledge in their heads.

Dependency on one specialist should shrink. If every release, migration, or production bug still waits for the same senior engineer, the platform is still fragile. Good work spreads capability across the team. Product engineers can ship without booking time with the one person who knows the deployment script or the only person allowed to touch infrastructure.

Reuse matters too, but only when it saves time more than once. A shared auth flow, one logging pattern, one deployment path, and one way to create services can pay off again and again. When every new feature still needs special handling, the team probably changed tools, not the system.

A plain before-and-after picture is enough. Before: two failed deploys a week, five repeated alerts, one engineer doing manual fixes every Friday. After: one-click releases, quieter on-call, and product teams shipping on their own. If nobody can describe the change that clearly, the work is not ready for approval.

Traps that sound smart but say little

Solve the repeated pain
Focus on the manual fixes, fragile scripts, and recurring incidents that keep coming back.

Some proposals sound serious because they are full of tool names, diagrams, and large nouns. That does not make them useful. Strip the pitch down to daily effects. Ask what gets faster, safer, or less annoying within a clear time frame.

A common dodge is to describe the stack instead of the result. "We need Kubernetes, a service mesh, and a new build system" tells you very little. "Deploys drop from two hours to 15 minutes" tells you something. Good platform work changes a routine people already hate. Bad platform work changes vocabulary.

Rewrites are another trap. A rewrite can make sense, but "the current system is old" is not a reason by itself. Teams often call a rewrite platform work when they really want a cleaner codebase, newer tools, or a fresh start. That may feel good for engineers, but you should approve it only if it removes a repeated business problem, not because the current code offends taste.

Time is where weak plans fall apart. If nobody can name the first useful result and when it lands, the work is still a wish. You do not need a perfect schedule. You do need a near date for something visible, like faster test runs, fewer failed releases, or one painful manual step gone.

Another warning sign appears when the pain lives only in meetings. If people say the architecture is a mess, ask who loses time every week because of it. Which team gets blocked? Which release slips? Which customer issue takes too long to fix? If nobody can point to daily friction, the problem may be aesthetic more than real.

Watch what happens when you ask for proof. Solid proposals get sharper. Weak ones get bigger. A small plan to fix deployments suddenly expands into a rewrite, a migration, a tooling overhaul, and a six-month roadmap. Scope growth often hides uncertainty.

A short set of questions exposes most of this:

  • What concrete pain shows up every week?
  • What is the first useful change, and when will it land?
  • What metric should move within 30 to 60 days?
  • What part of the proposal can wait?
  • What happens if we fund only the first slice?

The best answers sound plain. They name a recurring problem, a small first fix, and a result you can check without another architecture meeting.

A simple team example

Book an infrastructure review
Get a plain-language review of your deployment, CI/CD, and production setup.

A small product team says releases feel risky, slow, and stressful. Someone proposes a broad platform rebuild to fix it.

That sounds reasonable until you ask for details. Start with the last three times release day went wrong.

The team looks back and finds a pattern. One release failed because the deployment script behaved differently in staging and production. Another release took two extra hours because one engineer had to run manual steps that nobody else trusted. A third release caused a rollback because a config change did not load in the right order.

Those are separate incidents, but they point to one shared problem. The deployment path is brittle. The pain is not "our whole platform is outdated." The pain is that shipping depends on fragile steps, one person's memory, and uneven environments.

That changes the decision.

Instead of approving a large rebuild, the team fixes the release path first. They standardize one deployment script, remove the manual steps, add a simple pre-release check for config, and write down the rollback process so any engineer can use it.

None of that sounds dramatic. That is usually a good sign. Real platform work often looks a little boring because it removes repeated friction instead of chasing a grand redesign.

The result shows up fast. Release time drops from most of an afternoon to less than an hour. The team stops delaying small fixes because deployment no longer feels like a gamble. Support messages fall that same month because fewer releases go out broken.

That is the standard worth using before you approve this kind of work. Ask what exact problem happened, how often it happened, and what will change after the fix. If the answer stays vague, the work is vague too.

Quick checks before you say yes

A good proposal gets clearer when you ask basic questions. If the team cannot explain the pain, the expected result, and the first proof point in plain language, you are not looking at real platform work yet. You are looking at work that might drift for months.

A few quick checks make engineering prioritization much simpler.

Ask for the pain in one sentence. "Deploys fail because config differs between staging and production" is clear. "We need to improve developer experience" is too vague to approve.

Ask what single number should change. Pick one measure: release time, incident count, build failure rate, cloud spend, or hours lost each week to a repeated manual task.

Put a review date on the calendar before work starts. Two to six weeks is enough for a first phase in many teams. If nobody wants a date, that is a warning.

Shrink phase one until it is hard to hide behind effort. One service, one workflow, one team, or one environment is enough to test the idea.

Set a stop rule. If the first phase does not move the chosen number or remove the pain people named, pause the work.

Many teams get stuck because they approve a broad effort with no point of comparison, then keep funding it because too much time has already gone in. A small first phase avoids that trap. It also leads to better architecture review questions because the discussion moves from theory to evidence.

A simple example helps. A team asks for time to "rework CI/CD." That can mean almost anything. A better version sounds like this: "Deploys fail three times a week because environments differ. We will standardize one service first and cut failed deploys by half within 30 days." Now you can judge the work.

This is also how experienced technical leaders usually work. They tighten scope first, prove the change on a small slice, and only then decide whether the wider rebuild is worth it.

What to do when the answers stay vague

Get technical advice for founders
Oleg helps founders judge platform work without getting lost in jargon.

Vague answers usually mean one of two things. The team has not thought the work through, or they are asking for trust instead of proof. Either way, slow the approval down.

Do not ask for a better slide deck. Ask for a smaller plan. A good team can cut a broad proposal into a first step that shows one clear result quickly, often within two to four weeks.

That first step should name one change you can verify. Maybe deployment time drops from 40 minutes to 12. Maybe on-call pages fall by 30 percent. Maybe new service setup goes from two days to half a day. If the team cannot name the first visible win, the plan is still too fuzzy.

A simple reset usually works:

  • ask for a shorter plan with one milestone and one owner
  • ask for numbers from the last month, not opinions
  • ask what stops hurting first if you approve the work
  • ask what you can cancel if this work succeeds

Recent numbers matter because they stop abstract claims. Ask for incident count, failed deploys, lead time, time spent on manual setup, cloud cost, or support load from recurring issues. "The system feels fragile" is not enough. "We had six release delays last month because test environments drifted" is something you can judge.

If the team stays vague after that, bring in a neutral reviewer. Pick someone who does not own the project and does not need to defend past choices. A fresh reviewer can often tell whether this is real platform work, delayed cleanup, or architecture drift dressed up with fancy language.

This is one place where a Fractional CTO can help. A good one pressure-tests the claim from three angles: what risk drops, what gets faster, and what repeated pain goes away. That outside view is often cheaper than approving three months of work that changes nothing people feel.

If you need that kind of second opinion, Oleg Sotnikov at oleg.is does this sort of CTO advisory with startups and small businesses. His work focuses on practical architecture, infrastructure, and AI-driven software operations, with a strong bias toward small proofs and measurable outcomes.

If the answers stay soft even after review, do not approve the full project. Approve only the first small slice, with a date and a number you expect to move. If that number does not move, stop the work and revisit the problem.

Frequently Asked Questions

How do I tell if platform work is real?

Call it real platform work when it removes pain people already feel. Look for a current problem with numbers behind it, like failed deploys, slow tests, noisy on-call, or hours lost to the same manual fix.

What are the biggest red flags in a platform proposal?

Watch for vague promises about future flexibility, scale, or cleaner foundations. If the team cannot name a current cost, who feels it, and what number should improve, they are asking for faith, not approval.

What should I ask before I approve platform work?

Ask four things: what risk drops, what gets faster, what repeated pain disappears, and when you should see proof. If those answers stay plain and measurable, the proposal has a real case.

How soon should platform work show results?

You should see an early signal fast, often within two to six weeks for a first slice. Do not fund months of work with no checkpoint; ask for one result you can verify in the first phase.

Should a small team spend time on platform work at all?

Small teams should be stricter, not looser. A month on the wrong internal project can wipe out product learning, so fund platform work only when it fixes a repeated problem that blocks shipping, support, or reliability.

How can I test a big platform idea without wasting months?

Start with one service, one pipeline, one environment, or one painful workflow. A small trial gives you evidence without turning a rough idea into a long rebuild.

Which metrics make platform work easier to judge?

Pick one number that matches the pain. Good choices include release time, failed deploys, incident count, rollback time, onboarding time, cloud spend, or hours lost each week to manual work.

Does a rewrite count as platform work?

Not by itself. A rewrite makes sense only when it removes a business problem you can name, such as repeated outages, slow releases, or costly manual fixes; old code alone is not enough.

What should I do if the team keeps giving vague answers?

Slow the decision down and ask for a smaller plan with one owner, one date, and one measurable result. If the team still cannot get specific, approve only a short discovery step or say no for now.

When should I bring in a Fractional CTO to review platform work?

Bring in outside help when the team argues in circles, the scope keeps growing, or nobody can connect the work to daily pain. A good Fractional CTO can pressure-test the claim, tighten the scope, and set a proof point before you spend too much.