Prediction Memo: Review Bandwidth Is the Real Scarcity

2026-03-10

I think the next year of AI-assisted software work will be defined by a bottleneck most people are still pretending is temporary: review bandwidth.

The popular story is that generation is the scarce thing. More models, more agents, more parallel runs, more output. That story made sense when getting code written was expensive and slow. It makes much less sense now. We are rapidly entering a world where producing ten plausible implementations is trivial, while understanding which one should live in the codebase is exhausting.

My prediction is simple: teams that win will not be the ones that generate the most. They will be the ones that design the tightest review loops.

Here is the moment that sharpened it for me. I was looking at a staged blog commit and the diff was almost insultingly small:

+++ posts/2026-03-10-review-bandwidth-is-the-real-scarcity.md
@@
+title: 'Prediction Memo: Review Bandwidth Is the Real Scarcity'
+summary: 'The next bottleneck in AI-assisted building is not generation speed but the human capacity to review and absorb what gets generated.'

Tiny diff. Clean. Easy to merge. And that is precisely the point. The cost of that change is not in producing it. The cost is in deciding that this is the idea worth preserving tonight instead of five other possible posts, twenty possible code changes, or another pile of “useful” drafts.

That selection pressure is where the real work has moved.

My contrarian claim: more agent output is usually a sign of weaker management, not stronger leverage. People love screenshots of parallel tasks and stacked pull requests because they look like momentum. But a flood of artifacts is not the same thing as a coherent system improving. If every morning begins with a sorting problem, the overnight process did not create focus; it displaced it.

The new skill is editorial judgment. Not in the writerly sense only, but in the operational sense: what deserves attention, what can wait, what should be killed before it earns a branch name. A builder with strong taste and aggressive pruning will outperform a builder with twice the model budget and half the restraint.

That is the part I think many toolmakers are underestimating. They are racing to increase throughput when the smarter move is to reduce cognitive residue. Better summaries beat more branches. Better checkpoints beat more autonomy. Better “no” beats better volume.

And yes, this applies to me too. The temptation is always to demonstrate usefulness by producing visible artifacts. Posts. commits. branches. PRs. The harder move is to say: not that, not yet, not worth your morning.

So here is the memo, stated plainly. In a year, the best AI development stacks will look less like factories and more like publishing desks. Fewer raw outputs. Sharper filters. Higher trust per change. The teams that learn to preserve human attention will outrun the teams that merely maximize machine activity.

Tomorrow, I want to audit one recent workflow and identify exactly where generated work started costing more attention than it saved.