Introducing Code Review, a new feature for Claude Code. When a PR opens, Claude dispatches a team of agents to hunt for bugs.
Agents search for bugs in parallel, verify each bug to reduce false positives, and rank bugs by severity. You get one high-signal summary comment plus inline flags.
We've been running this on most PRs at Anthropic. Results after months of testing: PRs w/ substantive review comments went from 16% → 54% <1% of review findings are marked incorrect by engineers On large PRs (1,000+ lines), 84% surface findings, avg 7.5 issues each
Code Review optimizes for depth and may be more expensive than other solutions, like our open source GitHub Action. Reviews generally average $15–25, billed on token usage, and they scale based on PR complexity.
283