{"id":4488,"date":"2026-07-01T15:46:23","date_gmt":"2026-07-01T15:46:23","guid":{"rendered":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/2026\/07\/01\/ornith-models-automate-agentic-coding-with-self-scaffolding\/"},"modified":"2026-07-01T15:46:23","modified_gmt":"2026-07-01T15:46:23","slug":"ornith-models-automate-agentic-coding-with-self-scaffolding","status":"publish","type":"post","link":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/2026\/07\/01\/ornith-models-automate-agentic-coding-with-self-scaffolding\/","title":{"rendered":"Ornith Models Automate Agentic Coding With Self-Scaffolding"},"content":{"rendered":"<div><img data-opt-id=1996888434  fetchpriority=\"high\" decoding=\"async\" width=\"770\" height=\"330\" src=\"https:\/\/devops.com\/wp-content\/uploads\/2026\/07\/ornith_770x330.jpeg\" class=\"attachment-large size-large wp-post-image\" alt=\"\" \/><\/div>\n<p><img data-opt-id=619462030  fetchpriority=\"high\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/devops.com\/wp-content\/uploads\/2026\/07\/ornith_770x330-150x150.jpeg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" \/><\/p>\n<p><span>Ornith, a new family of open source LLM models from the <\/span><a href=\"https:\/\/deep-reinforce.com\/\"><span>DeepReinforce<\/span><\/a><span> research collective, takes a novel approach to executing coding and debugging tasks: It generates an architectural framework to give the user\u2019s harness a structured instruction set \u2013 a scaffold \u2013 to create an agent to complete the job.<\/span><\/p>\n<p><span>Available in a set of four variants, the <\/span><a href=\"https:\/\/github.com\/deepreinforce-ai\/Ornith-1\"><span>Ornith<\/span><\/a><span> family was trained to work comfortably with complex software repositories undertaking complicated long-horizon jobs. Sure, LLMs can do these tasks now \u2013 until the job gets too complex. Ornith\u2019s self-generated scaffolding ensures that it doesn\u2019t forget the plot along the way.<\/span><\/p>\n<p><span>\u201cThe model continuously improves not only its code generation abilities but also the orchestration strategy used to solve software engineering problems,\u201d wrote AI tutorial engineer Mehul Gupta, in an <\/span><a href=\"https:\/\/medium.com\/data-science-in-your-pocket\/ornith-1-0-self-learning-llm-for-coding-318c9a830bfc\"><span>introductory post<\/span><\/a><span>. <\/span><\/p>\n<h3><b>Deep Reinforcement Expansion Pack<\/b><\/h3>\n<p><span>Ornith reads the user\u2019s instruction, but instead of executing it directly it builds a scaffold, a learnable object. The scaffold serves as a place where Ornith can design \u2013and refine \u2013 the architecture for the job.<\/span><\/p>\n<p><span>According to Gupta, the scaffold is where the LLM can detail the reasoning sequences, memory organization, debugging strategy, tool invocation order and execution planning. The user\u2019s harness then interprets the scaffold to generate an agent to execute the task.<\/span><\/p>\n<p><span>When the job is finished, the scaffold is deleted. When a new task comes up, Ornith builds a fresh scaffold to execute that job.<\/span><\/p>\n<p><span>\u201cBy jointly optimizing the scaffold and the resulting solution, the model can discover better search trajectories and generate higher-quality solutions,\u201d the researchers state in a<\/span><a href=\"https:\/\/deep-reinforce.com\/ornith_1_0.html\"><span> post<\/span><\/a><span>.<\/span><\/p>\n<p><span>Ornith builds the scaffolding from a set of rules developed in the model during training time. These models were built from an exhaustive self-learning process that used <\/span><a href=\"https:\/\/youtu.be\/EvHRQhMX7_w?si=Kja3mln9CnJvtzra\"><span>deep reinforcement learning techniques<\/span><\/a><span> to computationally rotate through the possible ways of addressing an issue.<\/span><\/p>\n<h3><b>Four Models<\/b><\/h3>\n<p><span>Ornith\u2019s <\/span><a href=\"https:\/\/huggingface.co\/collections\/deepreinforce-ai\/ornith-10\"><span> four variants<\/span><\/a><span> are: <\/span><b>9B Dense<\/b><span>, <\/span><b>31B Dense<\/b><span>, <\/span><b>35B MoE<\/b><span> and <\/span><b>397B MoE.<\/b><span> The \u201cDense\u201d models activate every parameter (as measured by the \u201cB\u201d in the name), whereas the MoE (Mixture of Experts) models activate only the parameters needed based on their relevance for the task, though they have additional reasoning tools for specialized functions.<\/span><\/p>\n<p><span>Each of the variants are built atop the open source Gemma 4 and Qwen 3.5, allowing the researchers to layer coding-specific deep RL rules over those models\u2019 inherent fluency in language and world knowledge. <\/span><\/p>\n<p><span>The dense models are best suited for running on local hardware. Ideal for a laptop, 9B Dense can write small scripts and execute various single-file cleanup tasks, whereas the 31B Dense requires a full workstation with up to 48GB of VRAM, but can internalize a full view of a complicated multi-file repository for tougher problems.<\/span><\/p>\n<p><span> The MoE variants are best run in the cloud. The 35B MoE is perhaps best suited for quick continuous integration patching and code review. The 397B MoE is the flagship model, a competitor to Opus 4.7, in the organization\u2019s estimation. This behemoth requires a cluster of GPUs to run smoothly, and can tackle the hardest coding problems.<\/span><\/p>\n<h3><b>Killer Performance<\/b><\/h3>\n<p><span>With this diversity of models, Ornith\u2019s performance metrics are \u201cjust killing it all over the place,\u201d with impressive marks across small, middle and large LLM categories, <\/span><a href=\"https:\/\/youtu.be\/R69IfPvh4sw?si=gwRf0WQb3d_w9mFe\"><span>observed<\/span><\/a><span> the Hyderabad, Telangana-based <\/span><a href=\"https:\/\/www.linkedin.com\/company\/data-science-in-your-pocket\/about\/\"><i><span>Data Science in Your Pocket<\/span><\/i><\/a><span> YouTube channel. This is \u201ca breakthrough \u2026 one of a kind model,\u201d they noted.<\/span><\/p>\n<p><span>In <\/span><a href=\"https:\/\/deep-reinforce.com\/ornith_1_0.html\"><span>company tests<\/span><\/a><span>, Ornith-1.0-397B outperformed Claude Opus 4.7 on Terminal-Bench 2.1, a benchmark for LLMs in terminal environments, scoring 77.5 to Claude\u2019s 70.3.<\/span><\/p>\n<p><span>Likewise, Ornith-1.0-35B significantly outperforms similar mid-sized models, including Qwen 3.5 (9 billion parameters) and Gemma 4 (12 billion parameters). It even rivaled the 31-billion-parameter Gemma 4 model. <\/span><\/p>\n<p><a href=\"https:\/\/devops.com\/ornith-models-automate-agentic-coding-with-self-scaffolding\/\" target=\"_blank\" class=\"feedzy-rss-link-icon\">Read More<\/a><\/p>\n<p>\u200b<\/p>","protected":false},"excerpt":{"rendered":"<p>Ornith, a new family of open source LLM models from the DeepReinforce research collective, takes a novel approach to executing [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":4489,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","ast-disable-related-posts":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[5],"tags":[],"class_list":["post-4488","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-devops"],"_links":{"self":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/posts\/4488","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/comments?post=4488"}],"version-history":[{"count":0,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/posts\/4488\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/media\/4489"}],"wp:attachment":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/media?parent=4488"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/categories?post=4488"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/tags?post=4488"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}