{"id":3594,"date":"2026-03-11T10:12:04","date_gmt":"2026-03-11T10:12:04","guid":{"rendered":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/2026\/03\/11\/anthropic-code-review-dispatches-agent-teams-to-catch-the-bugs-that-skim-reads-miss\/"},"modified":"2026-03-11T10:12:04","modified_gmt":"2026-03-11T10:12:04","slug":"anthropic-code-review-dispatches-agent-teams-to-catch-the-bugs-that-skim-reads-miss","status":"publish","type":"post","link":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/2026\/03\/11\/anthropic-code-review-dispatches-agent-teams-to-catch-the-bugs-that-skim-reads-miss\/","title":{"rendered":"Anthropic Code Review Dispatches Agent Teams to Catch the Bugs That Skim Reads Miss"},"content":{"rendered":"<div><img data-opt-id=922599562  fetchpriority=\"high\" decoding=\"async\" width=\"769\" height=\"329\" src=\"https:\/\/devops.com\/wp-content\/uploads\/2023\/06\/programming-background-with-person-working-with-codes-computer-2-scaled-e1703244429743.jpg\" class=\"attachment-large size-large wp-post-image\" alt=\"Google, code signing, trust, CodeRabbit, code, GenAI, Quali, Torque, code, Symbiotic, application developers, Zencoder, code, operations, code, commit, developer, mainframe, code, GenAI; code review efficiency cloud development\" \/><\/div>\n<p><img data-opt-id=86780662  fetchpriority=\"high\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/devops.com\/wp-content\/uploads\/2023\/06\/programming-background-with-person-working-with-codes-computer-2-scaled-e1703244429743-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"Google, code signing, trust, CodeRabbit, code, GenAI, Quali, Torque, code, Symbiotic, application developers, Zencoder, code, operations, code, commit, developer, mainframe, code, GenAI; code review efficiency cloud development\" \/><\/p>\n<p><span>The math was straightforward. Code output per engineer at Anthropic increased by 200% over the past year. Code review didn\u2019t scale with it. Before deploying an automated solution, only 16% of pull requests at Anthropic received substantive review comments. The rest got skim reads.<\/span><\/p>\n<p><span>That\u2019s the problem Code Review is designed to solve. Announced March 10 and available now as a research preview for Claude Code Teams and Enterprise customers, Code Review dispatches a team of AI agents on every pull request to find the bugs that quick reads miss. It\u2019s the system Anthropic has been running on nearly every internal PR for months. Now it\u2019s available to customers.<\/span><\/p>\n<h3><b>How it Works<\/b><\/h3>\n<p><span>When a PR opens in an enabled repository, Code Review spins up multiple specialized agents that work in parallel. Some probes for data-handling errors, off-by-one conditions, and API misuse. Others perform cross-file consistency checks and reason about intent. A verification step tests each hypothesis to filter false positives. A final aggregation agent consolidates findings, removes duplicates, and ranks issues by severity.<\/span><\/p>\n<p><span>Results appear directly on the PR as a single summary comment with inline notes on specific lines. Each finding includes step-by-step reasoning, an analysis of the potential impact, and a suggested fix. Issues are labeled by severity using color codes.<\/span><\/p>\n<p><span>The agents do not approve pull requests. Humans decide what to do about the findings.<\/span><\/p>\n<p><span>Cat Wu, Anthropic\u2019s head of product for Claude Code, told TechCrunch the tool focuses on logical errors rather than style \u2014 a deliberate choice based on feedback that developers weren\u2019t finding value in automated style comments.<\/span><\/p>\n<h3><b>The Numbers<\/b><\/h3>\n<p><span>After deploying Code Review internally, substantive review comments on PRs jumped from 16% to 54%. Engineers disagreed with fewer than 1% of surfaced findings.<\/span><\/p>\n<p><span>Find rates scale with PR size. Changesets over 1,000 lines showed findings 84% of the time. Small PRs under 50 lines had findings 31% of the time. Reviews average about 20 minutes per PR.<\/span><\/p>\n<p><span>Two examples stand out. Internally, a single-line change to a production service \u2014 the kind of edit that would typically be rubber-stamped \u2014 would have broken authentication. Code Review flagged it before the merge. In a customer pilot, TrueNAS caught a type mismatch bug during a ZFS encryption refactoring that risked erasing the encryption key cache during sync operations.<\/span><\/p>\n<p><span>Pricing is token-based, averaging $15 to $25 per review. Administrators can set monthly spending caps. That per-use model costs more than flat-rate alternatives like CodeRabbit ($24\/month unlimited), but Anthropic is betting depth justifies the premium.<\/span><\/p>\n<p><span>\u201cAnthropic Code Review demonstrates what multi-agent orchestration looks like in practice: Specialized agents working in parallel, verifying findings, and consolidating results ranked by severity. At $15 to $25 per review, the economics favor teams where a missed bug carries significant financial, regulatory, or safety consequences, including medical devices, defense systems, space, and mission-critical applications,\u201d according to Mitch Ashley, <\/span><span>VP and practice lead for software lifecycle engineering at<\/span><a href=\"https:\/\/futurumgroup.com\/\" target=\"_blank\" rel=\"noopener\"> <span>The Futurum Group<\/span><\/a><span>.<\/span><\/p>\n<p><span>\u201cFor teams with high review frequency of codebases, that per-use cost compounds quickly. CodeRabbit\u2019s subscription model serves a broader audience by prioritizing utility across broad sets of codebases and organizational goals. The market is segmenting on risk profile, not just capability, and teams need to match the tool to the business impact.\u201d<\/span><\/p>\n<h3><b>Why This Matters for DevOps<\/b><\/h3>\n<p><span>Code Review addresses a bottleneck that every team using AI coding agents is hitting. Claude Code, Cursor, GitHub Copilot\u2019s coding agent \u2014 they all generate pull requests faster than humans can review them. We\u2019ve written about this dynamic with <a href=\"https:\/\/devops.com\/cursor-cloud-agents-get-their-own-computers-and-35-of-internal-prs-to-prove-it\/\" target=\"_blank\" rel=\"noopener\">Cursor\u2019s cloud agents<\/a> (35% of internal PRs), GitHub Copilot\u2019s Jira integration, and VS Code\u2019s agent plugin ecosystem. The pattern is the same: Agent-generated code is outpacing the review process.<\/span><\/p>\n<p><span>Anthropic\u2019s approach \u2014 using agents to review agent-generated code \u2014 is the logical next step. And the multi-agent architecture matters. A single reviewer model can miss issues that specialized agents catch when they work in parallel and then verify each other\u2019s findings. The aggregation step, which consolidates, deduplicates, and ranks by severity, turns raw findings into actionable information without alert fatigue.<\/span><\/p>\n<p><span>The distinction between Code Review and Anthropic\u2019s existing Claude Code Security tool is worth noting. Claude Code Security runs continuous deep security sweeps across entire codebases. Code Review focuses on logical errors in individual pull requests. If Code Review detects a security issue, it will flag it, but it\u2019s not as thorough as the dedicated security scanner. The two are complementary \u2014 security scanning of the codebase and logic review for each PR.<\/span><\/p>\n<p><span>For teams with strict change-control requirements, Code Review can be configured on a per-repository basis. Admins enable it, select which repos to cover, and set spending limits. Once enabled, reviews run automatically on new PRs with no developer configuration needed. That administrative control matters for enterprises managing costs and compliance across dozens of repositories.<\/span><\/p>\n<p><span>The competitive picture is getting crowded. GitHub Copilot\u2019s agentic code review has hit 60 million reviews and accounts for one in five on the platform. CodeRabbit offers unlimited AI review at a flat monthly rate. Anthropic is positioning Code Review as the depth-first option \u2014 more expensive, more thorough, targeted at teams where the cost of a missed production bug exceeds the cost of the review.<\/span><\/p>\n<p><span>Code Review is available now in research preview. Enable it in Claude Code settings, install the GitHub App, and select repositories.<\/span><\/p>\n<p><a href=\"https:\/\/devops.com\/anthropic-code-review-dispatches-agent-teams-to-catch-the-bugs-that-skim-reads-miss\/\" target=\"_blank\" class=\"feedzy-rss-link-icon\">Read More<\/a><\/p>\n<p>\u200b<\/p>","protected":false},"excerpt":{"rendered":"<p>The math was straightforward. Code output per engineer at Anthropic increased by 200% over the past year. Code review didn\u2019t [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":3595,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[5],"tags":[],"class_list":["post-3594","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-devops"],"_links":{"self":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/posts\/3594","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/comments?post=3594"}],"version-history":[{"count":0,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/posts\/3594\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/media\/3595"}],"wp:attachment":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/media?parent=3594"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/categories?post=3594"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/tags?post=3594"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}