{"id":4259,"date":"2026-06-08T13:50:16","date_gmt":"2026-06-08T13:50:16","guid":{"rendered":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/2026\/06\/08\/the-hidden-cost-of-ai-code-keeping-quality-up-with-production\/"},"modified":"2026-06-08T13:50:16","modified_gmt":"2026-06-08T13:50:16","slug":"the-hidden-cost-of-ai-code-keeping-quality-up-with-production","status":"publish","type":"post","link":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/2026\/06\/08\/the-hidden-cost-of-ai-code-keeping-quality-up-with-production\/","title":{"rendered":"The Hidden Cost of AI Code: Keeping Quality Up With Production\u00a0"},"content":{"rendered":"<div><img data-opt-id=1179980539  fetchpriority=\"high\" decoding=\"async\" width=\"770\" height=\"329\" src=\"https:\/\/devops.com\/wp-content\/uploads\/2022\/02\/coding-gb646cb77a_1280-e1644931732205.jpg\" class=\"attachment-large size-large wp-post-image\" alt=\"AI coding, teams, vibecoding, shadow, vibecoding vibe, coding, GitHub, agents, Gemini, Canvas, Gemini, code, Augment Code, code, kernel compliance-as-code software secure software Terraform infrastructure\" \/><\/div>\n<p><img data-opt-id=536178619  fetchpriority=\"high\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/devops.com\/wp-content\/uploads\/2022\/02\/coding-gb646cb77a_1280-e1644931732205-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"AI coding, teams, vibecoding, shadow, vibecoding vibe, coding, GitHub, agents, Gemini, Canvas, Gemini, code, Augment Code, code, kernel compliance-as-code software secure software Terraform infrastructure\" \/><\/p>\n<p><span data-contrast=\"none\">AI maturity is fundamentally about expanding the delegation boundary. You start by letting AI assist with code completion, then with features. Eventually, agents write pull requests from requirements with minimal human involvement. Each step hands more responsibility to machines.<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">But here\u2019s what people skip over: Tests are the primary mechanism for making that delegation safe.<\/span><b><span data-contrast=\"none\">\u00a0<\/span><\/b><span data-contrast=\"none\">You can\u2019t let agents operate autonomously if you can\u2019t verify what they produce. Low test coverage is the single biggest barrier to advancing along the AI maturity curve and most organizations haven\u2019t come to terms with that. They want the AI productivity gains without doing the unglamorous work of building test infrastructure to make those gains trustworthy.<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\"><a href=\"https:\/\/devops.com\/what-vibe-coding-means-for-the-enterprise-fast-code-real-considerations\/\" target=\"_blank\" rel=\"noopener\">Coding assistants make delivery so fast<\/a> that most QA organizations are underwater. Teams face a choice nobody wants to make: Attempt ten times the testing work, or test selectively and accept less certainty about whether what\u2019s running is safe. Neither option is sustainable.<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">It\u2019s time to shift the conversation. The question isn\u2019t how fast teams can ship. It\u2019s whether they can prove, with evidence, that what they shipped works.<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<h3><b><span data-contrast=\"none\">What\u2019s Changed<\/span><\/b><span data-ccp-props='{\"201341983\":0,\"335559738\":320,\"335559739\":80,\"335559740\":240}'>\u00a0<\/span><\/h3>\n<p><span data-contrast=\"none\">AI accelerated code delivery, but the bigger change is in who\u2019s building software, how much is being built and what it takes to trust it.\u202f<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">People without training in architectures, APIs, or security can now generate functioning applications. That\u2019s a real unlock \u2013 and a risk multiplier. A junior developer hard-coding plain-text credentials used to be an isolated incident you\u2019d catch in code review. Now, with AI assistants putting production-grade tools in everyone\u2019s hands, that rookie mistake can show up across dozens of repos before anyone notices.<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">Throwing more people at the problem is not realistic. Companies have downsized teams. Even if fully staffed, QA has been defined by code-level inspection and automation. Inspection is a tick-box verification of functions, integrations and outputs under predefined conditions. Automation mechanizes that inspection. Neither was designed to keep pace with AI-generated code arriving at this volume.<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">The other response is to fight AI-generated code with AI-generated tests. Sounds logical. In practice, it creates a different problem because nobody is sure which of those tests matter, which are redundant and what\u2019s missing. Unthoughtful AI-generated tests are just noise.<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">So if you can\u2019t hire your way out and you can\u2019t generate your way out, what\u2019s left? It\u2019s a governance and intelligence problem. When an agent generates fifty variations of a login test, who decides whether that\u2019s useful or busy work? When AI creates a test suite, who ensures it traces back to requirements? Enterprise teams are demanding answers. They want test cases generated from issue trackers, linked back to originating requirements, organized and traceable, all triggered from within existing workflows.\u202f<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">The organizations that build this layer of trust will scale AI-driven development. The rest will find out the hard way what happens when you ship fast without proof.<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<h3><b><span data-contrast=\"none\">From Instructions to Outcomes<\/span><\/b><span data-ccp-props='{\"201341983\":0,\"335559738\":320,\"335559739\":80,\"335559740\":240}'>\u00a0<\/span><\/h3>\n<p><span data-contrast=\"none\">QA has historically been about verifying what an application\u00a0<\/span><i><span data-contrast=\"none\">is<\/span><\/i><span data-contrast=\"none\">, not what it\u00a0<\/span><i><span data-contrast=\"none\">does<\/span><\/i><span data-contrast=\"none\">. That approach worked when humans wrote code and applications changed on schedule. It falls apart when AI regenerates parts of the codebase continuously.\u202f<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">The gap between what someone intended the software to do and what it does is where releases go sideways. That gap is widening. Seven of 10 software executives are concerned that application quality is suffering as AI speeds code development, new\u00a0<\/span><a href=\"https:\/\/www.businesswire.com\/news\/home\/20260318326658\/en\/SmartBear-Survey-70-of-Software-Experts-Concerned-that-Application-Quality-is-Suffering-Given-Faster-AI-Code-Development\" target=\"_blank\" rel=\"noopener\"><span data-contrast=\"none\">research<\/span><\/a><span data-contrast=\"none\">\u00a0shows and just as many are concerned about the impact going forward.\u202f<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">This means the focus on quality needs to catch up to the focus on AI development speed. Product managers and engineers define the requirements. QA must validate that requirements are met. But most QA practices are still oriented around inspecting code artifacts rather than confirming outcomes. The shift needs to be from \u201cdoes this component work as coded?\u201d to \u201cdoes this application behave the way the business expects it to?\u201d We call this \u201cApplication Integrity,\u201d continuous, measurable assurance that software works as intended with the governance to operate at AI speed and scale.\u202f<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">One pain point I hear from QA teams is about the manual\u00a0<\/span><i><span data-contrast=\"none\">creation<\/span><\/i><span data-contrast=\"none\"> of tests, especially for each variation of applications. That\u2019s where agents should be doing the heavy lifting. But AI-driven QA will only help if applied thoughtfully. You need agents that can test based on user behavior, adapt as interfaces change, and prioritize what\u2019s most likely to break rather than what\u2019s easiest to test. The difference between useful AI-driven testing and expensive noise comes down to whether the tools are built with that kind of judgment baked in, or whether they\u2019re just fast.<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">Maintaining quality when everything is moving this fast requires application integrity, meaning ongoing evidence that the software behaves as intended. Continuous assurance. Can you show, at any given moment, that your software is doing what you said it would? To do that, teams need to get a lot clearer about what they\u2019re measuring. Some of the metrics teams track today are still the right ones. Others need to be added.<\/span><br \/>\n<span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/p>\n<ul>\n<li data-leveltext=\"\uf0b7\" data-font=\"Symbol\" data-listid=\"1\" data-list-defn-props='{\"335552541\":1,\"335559685\":720,\"335559991\":360,\"469769226\":\"Symbol\",\"469769242\":[8226],\"469777803\":\"left\",\"469777804\":\"\uf0b7\",\"469777815\":\"multilevel\"}' data-aria-posinset=\"1\" data-aria-level=\"1\"><b><span data-contrast=\"none\">Deployment frequency<\/span><\/b><span data-contrast=\"none\">\u00a0and\u00a0<\/span><b><span data-contrast=\"none\">lead time for changes<\/span><\/b><span data-contrast=\"none\"> aren\u2019t going away. AI should make shipping faster and release cycles shorter. If your quality practice is slowing them down, fix it. Those are throughput metrics. On their own, they don\u2019t tell you whether the things you\u2019re shipping actually work.<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/li>\n<\/ul>\n<ul>\n<li data-leveltext=\"\uf0b7\" data-font=\"Symbol\" data-listid=\"2\" data-list-defn-props='{\"335552541\":1,\"335559685\":720,\"335559991\":360,\"469769226\":\"Symbol\",\"469769242\":[8226],\"469777803\":\"left\",\"469777804\":\"\uf0b7\",\"469777815\":\"multilevel\"}' data-aria-posinset=\"1\" data-aria-level=\"1\"><span data-contrast=\"none\">That\u2019s where\u00a0<\/span><b><span data-contrast=\"none\">change failure rate<\/span><\/b><span data-contrast=\"none\">\u00a0comes in.\u202f This is becoming the most important metric. What percentage of deployments cause incidents, rollbacks, or hotfixes? When code volume goes up but verification doesn\u2019t keep pace, this number climbs. Driving this percentage down is the core challenge for quality teams and probably the one with the most direct financial impact.<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/li>\n<\/ul>\n<ul>\n<li data-leveltext=\"\uf0b7\" data-font=\"Symbol\" data-listid=\"3\" data-list-defn-props='{\"335552541\":1,\"335559685\":720,\"335559991\":360,\"469769226\":\"Symbol\",\"469769242\":[8226],\"469777803\":\"left\",\"469777804\":\"\uf0b7\",\"469777815\":\"multilevel\"}' data-aria-posinset=\"1\" data-aria-level=\"1\"><b><span data-contrast=\"none\">Mean time to resolution<\/span><\/b><span data-contrast=\"none\">\u00a0deserves more attention. If deployments are landing twice as often but recovery time hasn\u2019t improved, your exposure window just doubled.\u202f<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/li>\n<\/ul>\n<ul>\n<li data-leveltext=\"\uf0b7\" data-font=\"Symbol\" data-listid=\"4\" data-list-defn-props='{\"335552541\":1,\"335559685\":720,\"335559991\":360,\"469769226\":\"Symbol\",\"469769242\":[8226],\"469777803\":\"left\",\"469777804\":\"\uf0b7\",\"469777815\":\"multilevel\"}' data-aria-posinset=\"1\" data-aria-level=\"1\"><span data-contrast=\"none\">Invest in\u00a0<\/span><b><span data-contrast=\"none\">functional coverage<\/span><\/b><span data-contrast=\"none\">: What percentage of your app\u2019s functionality and API surface is covered by tests. And <\/span><b><span data-contrast=\"none\">traceability coverage<\/span><\/b><span data-contrast=\"none\">: what percentage of tests and API contracts link back to a requirement or spec? This separates teams who can\u00a0<\/span><i><span data-contrast=\"none\">demonstrate<\/span><\/i><span data-contrast=\"none\">\u00a0their software works from teams who are pretty sure it does.<\/span><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/li>\n<\/ul>\n<h3><b><span data-contrast=\"none\">A New Standard<\/span><\/b><span data-ccp-props='{\"201341983\":0,\"335559739\":0,\"335559740\":240}'>\u00a0<\/span><\/h3>\n<p><span data-contrast=\"none\">Code is being delivered by more people, faster than ever. Now, the new standard is proving the applications work.\u202f<\/span><\/p>\n<p><a href=\"https:\/\/devops.com\/the-hidden-cost-of-ai-code-keeping-quality-up-with-production\/\" target=\"_blank\" class=\"feedzy-rss-link-icon\">Read More<\/a><\/p>\n<p>\u200b<\/p>","protected":false},"excerpt":{"rendered":"<p>AI maturity is fundamentally about expanding the delegation boundary. You start by letting AI assist with code completion, then with [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":4260,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","ast-disable-related-posts":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[5],"tags":[],"class_list":["post-4259","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-devops"],"_links":{"self":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/posts\/4259","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/comments?post=4259"}],"version-history":[{"count":0,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/posts\/4259\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/media\/4260"}],"wp:attachment":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/media?parent=4259"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/categories?post=4259"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/tags?post=4259"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}