{"id":3754,"date":"2026-03-31T18:12:18","date_gmt":"2026-03-31T18:12:18","guid":{"rendered":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/2026\/03\/31\/run-and-iterate-on-llms-faster-with-docker-model-runner-on-dgx-station\/"},"modified":"2026-03-31T18:12:18","modified_gmt":"2026-03-31T18:12:18","slug":"run-and-iterate-on-llms-faster-with-docker-model-runner-on-dgx-station","status":"publish","type":"post","link":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/2026\/03\/31\/run-and-iterate-on-llms-faster-with-docker-model-runner-on-dgx-station\/","title":{"rendered":"Run and Iterate on LLMs Faster with Docker Model Runner on DGX Station"},"content":{"rendered":"<p>Back in October, we showed how <a href=\"https:\/\/www.docker.com\/blog\/new-nvidia-dgx-spark-docker-model-runner\/\">Docker Model Runner on the NVIDIA DGX Spark<\/a> makes it remarkably easy to run large AI models locally with the same familiar Docker experience developers already trust. That post struck a chord: hundreds of developers discovered that a compact desktop system paired with Docker Model Runner could replace complex GPU setups and cloud API calls.<\/p>\n<p>Recently at NVIDIA GTC 2026, NVIDIA is raising the bar with <a href=\"https:\/\/www.nvidia.com\/en-us\/products\/workstations\/dgx-station\/\" rel=\"nofollow noopener\" target=\"_blank\">NVIDIA DGX Station <\/a>and we\u2019re excited to add support for it in Docker Model Runner!\u00a0 The new DGX Station brings serious performance, and Model Runner helps make it practical to use day to day. With Model Runner, you can run and iterate on larger models on a DGX Station, using the same intuitive Docker experience you already know and trust.<\/p>\n<h2 class=\"wp-block-heading\">From NVIDIA DGX Spark to DGX Station: What has changed and why does this matter?<\/h2>\n<p><a href=\"https:\/\/www.nvidia.com\/en-us\/products\/workstations\/dgx-spark\/\" rel=\"nofollow noopener\" target=\"_blank\">NVIDIA DGX Spark<\/a>, powered by the GB10 Grace Blackwell Superchip, gave developers 128GB of unified memory and petaflop-class AI performance in a compact form factor. A fantastic entry point for running models.<\/p>\n<p>NVIDIA DGX Station is a different beast entirely. Built around the <a href=\"https:\/\/www.nvidia.com\/en-us\/data-center\/gb300-nvl72\/\" rel=\"nofollow noopener\" target=\"_blank\">NVIDIA GB300 Grace Blackwell Ultra Desktop Superchip<\/a>, it connects a 72-core NVIDIA Grace CPU and NVIDIA Blackwell Ultra GPU through NVIDIA NVLink-C2C, creating a unified, high-bandwidth architecture built for frontier AI workloads. It brings data-center-class performance to a deskside form factor. Here are the headline specs:<\/p>\n<div class=\"wp-block-ponyo-table\" data-highlighted-columns=\"null\" data-highlighted-rows=\"null\">\n<table class=\"responsive-table\">\n<tbody class=\"wp-block-ponyo-table-body\" data-highlighted-columns=\"[]\" data-highlighted-rows=\"[]\">\n<tr class=\"wp-block-ponyo-table-header\">\n<th class=\"wp-block-ponyo-cell empty\">\n<\/th>\n<th class=\"wp-block-ponyo-cell\" data-responsive-table-heading=\"DGX Spark (GB10)\">\n<p>DGX Spark (GB10)<\/p>\n<\/th>\n<th class=\"wp-block-ponyo-cell\" data-responsive-table-heading=\"DGX Station (GB300)\">\n<p>DGX Station (GB300)<\/p>\n<\/th>\n<\/tr>\n<tr class=\"wp-block-ponyo-table-row\">\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>GPU Memory<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>128 GB unified<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>252 GB<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<\/tr>\n<tr class=\"wp-block-ponyo-table-row\">\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>GPU Memory Bandwidth<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>273 GB\/s<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>7.1 TB\/s<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<\/tr>\n<tr class=\"wp-block-ponyo-table-row\">\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>Total Coherent Memory<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>128 GB<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>748 GB<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<\/tr>\n<tr class=\"wp-block-ponyo-table-row\">\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>Networking<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>200 Gb\/s<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>800 Gb\/s<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<\/tr>\n<tr class=\"wp-block-ponyo-table-row\">\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>GPU Architecture<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>Blackwell (5th-gen Tensor Cores, FP4)<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<td class=\"wp-block-ponyo-cell\">\n                    <span class=\"responsive-table-label\"><\/span>\n<p>                    <span class=\"responsive-table-value\"><br \/>\n                                                    <span class=\"responsive-table-value-content\"><\/span><\/span><\/p>\n<p>Blackwell Ultra (5th-gen Tensor Cores, FP4)<\/p>\n<p>                    <br \/>\n                                            \n            <\/p><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>With 252GB of GPU memory at 7.1 TB\/s of bandwidth and a total of 748GB of coherent memory, the DGX Station doesn\u2019t just let you run frontier models,\u00a0 it lets you run trillion-parameter models, fine-tune massive architectures, and serve multiple models simultaneously, all from your desk.<\/p>\n<p>Here\u2019s what 748GB of coherent memory and 7.1 TB\/s of bandwidth unlock in practice:<\/p>\n<ul class=\"wp-block-list\">\n<li><strong>Run the largest open models without quantization<\/strong>. DGX Station can run the largest open 1T parameter models with quantization.<\/li>\n<li><strong>Serve a team, not just yourself<\/strong>. <a href=\"https:\/\/www.nvidia.com\/en-us\/technologies\/multi-instance-gpu\/\" rel=\"nofollow noopener\" target=\"_blank\">NVIDIA Multi-Instance GPU <\/a>(MIG) technology lets you partition <a href=\"https:\/\/www.nvidia.com\/en-us\/data-center\/technologies\/blackwell-architecture\/\" rel=\"nofollow noopener\" target=\"_blank\">NVIDIA Blackwell Ultra GPUs<\/a> into up to seven isolated instances. Combined with Docker Model Runner\u2019s containerized architecture, a single DGX Station can serve as a shared AI development node for an entire team \u2014 each member getting their own sandboxed model endpoint.<\/li>\n<li><strong>Faster iteration on agentic workflows<\/strong>. Agentic AI pipelines often require multiple models running concurrently \u2014 a reasoning model, a code generation model, a vision model. With 7.1 TB\/s of memory bandwidth, switching between and serving these models is dramatically faster than anything a desktop system has offered before.<\/li>\n<\/ul>\n<p><strong>Bottom line<\/strong>: The DGX Spark made that fast. The DGX Station makes it transformative. And raw hardware is only half the story. With Docker Model Runner, the setup stays effortless and the developer experience stays smooth, no matter how powerful the machine underneath becomes.<\/p>\n<h2 class=\"wp-block-heading\">Getting Started: It\u2019s the Same Docker Experience<\/h2>\n<p>For the full step-by-step walkthrough check out our <a href=\"https:\/\/www.docker.com\/blog\/new-nvidia-dgx-spark-docker-model-runner\/\">guide for DGX Spark<\/a>. Every instruction applies to the DGX Station as well.<\/p>\n<p>NVIDIA\u2019s new DGX Station puts data-center-class AI on your desk with 252GB of GPU memory, 7.1 TB\/s bandwidth, and 748GB of total coherent memory. Docker Model Runner makes all of that power accessible with the same familiar commands developers already use on the DGX Spark. Pull a trillion-parameter model, serve a whole team, and iterate on agentic workflows. No cloud required, no new tools to learn.<\/p>\n<h2 class=\"wp-block-heading\">How You Can Get Involved<\/h2>\n<p>The strength of Docker Model Runner lies in its community, and there\u2019s always room to grow. To get involved:<\/p>\n<ul class=\"wp-block-list\">\n<li><strong>Star the repository:<\/strong> Show your support by starring the <a href=\"https:\/\/github.com\/docker\/model-runner\" rel=\"nofollow noopener\" target=\"_blank\">Docker Model Runner repo<\/a>.<\/li>\n<\/ul>\n<ul class=\"wp-block-list\">\n<li><strong>Contribute your ideas:<\/strong> Create an issue or submit a pull request. We\u2019re excited to see what ideas you have!<\/li>\n<li><strong>Spread the word:<\/strong> Tell your friends and colleagues who might be interested in running AI models with Docker.<\/li>\n<\/ul>\n<p>Learn More<\/p>\n<ul class=\"wp-block-list\">\n<li>Read our original post on <a href=\"https:\/\/www.docker.com\/blog\/new-nvidia-dgx-spark-docker-model-runner\/\">Docker Model Runner + DGX Spark<\/a>\u00a0<\/li>\n<li>Check out the Docker Model Runner General Availability <a href=\"https:\/\/www.docker.com\/blog\/announcing-docker-model-runner-ga\/\">announcement<\/a><\/li>\n<li>Visit our <a href=\"https:\/\/github.com\/docker\/model-runner\" rel=\"nofollow noopener\" target=\"_blank\">Model Runner GitHub repo<\/a><\/li>\n<li>Get started with a simple <a href=\"https:\/\/github.com\/docker\/hello-genai\" rel=\"nofollow noopener\" target=\"_blank\">hello GenAI application<\/a><\/li>\n<\/ul>","protected":false},"excerpt":{"rendered":"<p>Back in October, we showed how Docker Model Runner on the NVIDIA DGX Spark makes it remarkably easy to run [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":94,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","ast-disable-related-posts":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[4],"tags":[],"class_list":["post-3754","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-docker"],"_links":{"self":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/posts\/3754","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/comments?post=3754"}],"version-history":[{"count":0,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/posts\/3754\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/media\/94"}],"wp:attachment":[{"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/media?parent=3754"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/categories?post=3754"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/rssfeedtelegrambot.bnaya.co.il\/index.php\/wp-json\/wp\/v2\/tags?post=3754"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}