84% of Engineers Report AI Productivity Gains — Here's What They Have in Common
84% of engineers report AI productivity gains, but the data shows they're gaining time in verification and review, not code writing — here's what separates the engineers seeing real results from those just feeling busy.

tl;dr
Engineers reporting real AI productivity gains have shifted their focus away from writing code and toward verifying, testing, and reviewing it. The 84% figure comes from self-reported perception, and objective data tells a messier story. What separates engineers who genuinely benefit is not which AI tool they use, but how they've repositioned their own judgment in the workflow.
The engineers getting the most out of AI coding tools are spending less time writing code. That sounds wrong until you look at what they're actually doing with the hours they've recovered.
Annie Vella's 2026 study found that 84% of engineers perceived meaningful productivity gains from AI, with participants describing tasks that once took days collapsing into hours. That's striking. It's also self-reported, and Vella herself is careful to note that people "generally aren't great at estimating time." Perception is a data point, not a measurement.
The useful part of that finding is what the engineers who reported gains were actually doing differently.
The Shift Nobody Advertised
The popular pitch for AI coding tools is that they write code faster, so you write more code. The actual pattern among engineers reporting gains looks different. They're using AI to generate first drafts and boilerplate, then spending their own time on the parts AI gets wrong: edge cases, integration behaviour, security assumptions, test coverage.
Research from MIT's Initiative on the Digital Economy found that developers using GitHub Copilot increased their time spent on coding by 12.4 percentage points, while time on project management tasks dropped by nearly 25 percentage points. That's a meaningful reallocation. It says nothing about whether the code being produced was better or safer, only that engineers were writing more of it.
The Google DORA 2024 State of DevOps report found delivery stability down 7.2%, throughput down 1.5%, code churn doubled, and code duplication up 4x, all in the same period that AI began writing 41% of code. More output, lower stability. That's the gap between writing speed and engineering quality.
The engineers reporting genuine gains appear to have understood this gap and built their workflows around closing it.
The engineers winning with AI have made verification the high-value work, not an afterthought.
What High Performers Actually Do

The pattern across teams and individual engineers reporting real gains comes down to one decision: treating AI as a fast but unreliable first-pass generator, and reserving human judgement for the work that determines whether what gets shipped is sound.
In practice, that means:
- Writing tests before or alongside AI-generated code, not after it's been accepted
- Reviewing AI suggestions with explicit attention to security and state management, not just syntax
- Using AI heavily for scaffolding, documentation, and repetitive patterns where errors are cheap and obvious
- Avoiding AI on critical business logic or integration points without rigorous review protocols
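The first bullet, tests before acceptance, can be sketched in miniature. The example below is hypothetical (the `parse_duration` helper and its rules are invented for illustration): the assertions exist first, and the AI-generated draft is only accepted once it passes them, including the empty-input and missing-unit edge cases that first drafts commonly skip.

```python
def parse_duration(text: str) -> int:
    """AI-generated first draft: convert '2h30m'-style strings to minutes."""
    text = text.strip()
    if not text:
        raise ValueError("empty duration")
    total = 0
    number = ""
    for ch in text:
        if ch.isdigit():
            number += ch
        elif ch == "h" and number:
            total += int(number) * 60
            number = ""
        elif ch == "m" and number:
            total += int(number)
            number = ""
        else:
            raise ValueError(f"unexpected character: {ch!r}")
    if number:  # trailing digits with no unit, e.g. "90"
        raise ValueError("duration missing a unit")
    return total

# These tests were written before the draft above; the draft is only
# accepted into the codebase once all of them pass.
assert parse_duration("2h30m") == 150
assert parse_duration("45m") == 45
assert parse_duration("3h") == 180
for bad in ("", "h", "2x", "90"):
    try:
        parse_duration(bad)
        raise AssertionError(f"{bad!r} should have been rejected")
    except ValueError:
        pass
```

The point is the ordering: the edge cases are pinned down before the generated code is trusted, not discovered after it ships.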
One of the clearest examples of this in practice comes from Codurance. A single engineer rebuilt their entire production timesheet system using AI-assisted tools.
what they did
Used modern LLMs to generate full features including calendar entry, user controls, business rules, and reporting. Applied engineering guardrails throughout: code reviews, test coverage, and validation at each stage.
outcome
Rebuilt to near feature parity in approximately 3 days against an original estimate of 60 developer-days, while maintaining code quality.
The speed came from AI. The quality came from the engineer not treating AI output as finished work.
The Junior-Senior Asymmetry

There's a split in the data that rarely gets enough attention. Junior engineers appear to see speed gains of 27-39% with AI tools, largely because AI works as an on-demand mentor: suggesting patterns, completing syntax, explaining unfamiliar APIs. Seniors, by contrast, see 8-16% gains while spending significantly more time reviewing AI suggestions, reportedly around 4.3 minutes per AI-generated suggestion compared to 1.2 minutes for human-written code.
That's a hidden cost that doesn't show up in lines of code per hour. Senior engineers are absorbing review overhead that doesn't exist when they write code themselves. Whether that trade is worth it depends entirely on the volume and complexity of what needs building.
The implication for teams is direct: if you're measuring AI's value by how fast junior engineers ship features, you may be loading review debt onto your seniors without realising it. Track review time, not just commit frequency.
Measuring AI productivity by commit frequency misses the review overhead it creates upstream.
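One way to make that review overhead visible is to tag pull requests by how they were produced and compare average review time across the two groups. A minimal sketch with hypothetical PR records follows; in practice the `ai_assisted` flag would have to come from commit trailers, tool telemetry, or author self-reporting, none of which this sketch assumes you already have.

```python
from statistics import mean

# Hypothetical PR records; the "ai_assisted" flag and review times
# are invented for illustration.
prs = [
    {"id": 101, "ai_assisted": True,  "review_minutes": 38},
    {"id": 102, "ai_assisted": False, "review_minutes": 12},
    {"id": 103, "ai_assisted": True,  "review_minutes": 51},
    {"id": 104, "ai_assisted": False, "review_minutes": 15},
    {"id": 105, "ai_assisted": True,  "review_minutes": 22},
]

def avg_review_minutes(prs, ai_assisted):
    """Average review time for PRs matching the given ai_assisted flag."""
    times = [p["review_minutes"] for p in prs if p["ai_assisted"] == ai_assisted]
    return mean(times) if times else 0.0

ai_avg = avg_review_minutes(prs, True)      # 37.0
human_avg = avg_review_minutes(prs, False)  # 13.5
print(f"AI-assisted PRs: {ai_avg:.1f} min avg review")
print(f"Human-only PRs:  {human_avg:.1f} min avg review")
print(f"Review overhead ratio: {ai_avg / human_avg:.1f}x")
```

Even a crude ratio like this surfaces the cost that commit counts hide: if AI-assisted changes take several times longer to review, the speedup claim needs that overhead subtracted from it.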
Perception Needs a Check
The METR study, referenced across several analyses of AI coding tools, found that developers predicted a 24% speedup from AI and felt 20% faster after using it, but were measured at a 19% slowdown under initial conditions (later revised to -4% with a wide confidence interval, partly because 30-50% of participants refused to complete tasks without AI). The gap between feeling productive and being productive is not a rounding error.
This doesn't mean the 84% who reported gains in Vella's study are wrong. It means the gains are real in specific contexts and fragile in others. Engineers who've locked in consistent benefits tend to be disciplined about which tasks they hand to AI and which they keep. They're also the ones most likely to notice when AI has introduced a subtle regression, because they haven't outsourced their understanding of the codebase.
If you're trying to figure out where you actually stand, the question to ask is: "What's happened to my defect rate and review cycle time since I started using AI tools?"
What to Do Tomorrow
If your team is already using AI coding tools, the highest-leverage change is probably auditing what happens to AI-generated code after it's written. Does it go through the same review process as human-written code? Do your tests cover the edge cases AI is likely to miss? Are your seniors being asked to review more without a corresponding reduction in their own build workload?
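An audit like that can start as a simple script before it becomes process. The sketch below is hypothetical: the change records and their field names (`ai_generated`, `reviewed`, `has_tests`) are invented for illustration, and how you populate them depends entirely on your own tooling. The point is the shape of the report, not the data source.

```python
# Hypothetical change records; field names are assumptions, not a real schema.
changes = [
    {"id": "a1", "ai_generated": True,  "reviewed": True,  "has_tests": True},
    {"id": "a2", "ai_generated": True,  "reviewed": True,  "has_tests": False},
    {"id": "a3", "ai_generated": True,  "reviewed": False, "has_tests": False},
    {"id": "h1", "ai_generated": False, "reviewed": True,  "has_tests": True},
]

def audit(changes):
    """Report AI-generated changes that bypassed the standard quality gates."""
    ai = [c for c in changes if c["ai_generated"]]
    return {
        "ai_changes": len(ai),
        "skipped_review": [c["id"] for c in ai if not c["reviewed"]],
        "missing_tests": [c["id"] for c in ai if not c["has_tests"]],
    }

report = audit(changes)
print(report)
```

If the `skipped_review` and `missing_tests` lists are non-empty, AI-generated code is getting a lighter gate than human-written code, which is exactly the asymmetry worth closing first.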
If you're evaluating tools or approaches for the first time, the framework that works is simple: use AI on the parts of your work where errors are cheap to catch, and preserve your sharpest attention for the parts where they're not. When comparing the best AI tools for productivity across different engineering contexts, the differentiator is almost never the feature set. It's how well the tool fits into a workflow where human review remains the quality gate.
The engineers in Vella's 84% didn't just adopt AI. They restructured their attention around it.
verdict
The productivity gains from AI coding tools are real, but they belong to engineers who've made verification and review the centre of their workflow, not a box-ticking step at the end. Teams that measure success by output volume will likely find the DORA numbers: more code, lower stability. The actual skill shift AI demands from engineers is not learning to prompt better. It's learning to trust less and verify faster.

Alec Chambers
Founder, ToolsForHumans
I've been building things online since I was 12 — 18 years of shipping products, picking tools, and finding out what actually works after the launch noise dies down. ToolsForHumans started as the research I kept needing: what practitioners are still recommending months after launch, and whether the search data backs it up. Since 2022 it's helped 600,000+ people find software that actually fits how they work.