For most of its life, Cursor has been an IDE. A very good one. But with the public beta of the Cursor SDK, the company is making a different kind of move — one that should get the attention of DevOps teams.
The Cursor SDK is a TypeScript library that gives engineers programmatic access to the same runtime, models, and agent harness that power Cursor’s desktop app, CLI, and web interface. In short, the agents that used to live inside an editor can now be invoked from anywhere in your stack.
That’s a meaningful shift in how AI coding tools fit into software delivery pipelines.
From the Editor to the Pipeline
If you’ve used Cursor before, the workflow is familiar — you interact with an agent in real time, asking it to write functions, fix bugs, or review code. The SDK breaks that dependency on interactive use. Now you can call those same agents programmatically, from a CI/CD trigger, a backend service, or embedded inside another tool.
Getting started is a single install command: npm install @cursor/sdk. From there, you create an agent instance, send it a task, and stream the response back — all in TypeScript. You point the agent at a local directory or a cloud environment, and it goes to work.
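That create-task-stream flow can be sketched in TypeScript. Everything below is illustrative, not the documented SDK surface: the `createAgent` name, its options, and the event shapes are assumptions, and a local stub stands in for the real `@cursor/sdk` so the sketch runs offline.

```typescript
// Hypothetical event and agent shapes; the real SDK's types may differ.
type AgentEvent = { type: "text"; text: string } | { type: "done" };

interface Agent {
  run(prompt: string): Iterable<AgentEvent>;
}

// Stub factory standing in for whatever the SDK's agent constructor is.
function createAgent(opts: { model: string; cwd: string }): Agent {
  return {
    *run(prompt: string): Generator<AgentEvent> {
      // A real agent would index the repo at opts.cwd and call the model;
      // the stub just echoes the task back as a streamed event.
      yield { type: "text", text: `[${opts.model}] handling: ${prompt}` };
      yield { type: "done" };
    },
  };
}

// The basic flow: create an agent, send it a task, consume the stream.
const agent = createAgent({ model: "composer-2", cwd: "." });
const output: string[] = [];
for (const ev of agent.run("Add input validation to the API handlers")) {
  if (ev.type === "text") output.push(ev.text);
}
```

The same shape works whether the caller is a CI job, a backend service, or a one-off script; only the prompt and the target directory change.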
The key detail is what “same runtime” actually means here. The SDK doesn’t just expose a raw LLM call. It includes the full supporting infrastructure: codebase indexing and semantic search so the agent retrieves relevant context before generating code; MCP (Model Context Protocol) server support for connecting external tools and data sources; reusable skill definitions the agent picks up from a project directory; and hooks that let you observe and control the agent loop across cloud, self-hosted, and local environments. That last piece matters a lot for teams that need logging, guardrails, or custom orchestration.
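A hook-based guardrail might look something like the following sketch. The `onToolCall` name, the verdict values, and the tool-call shape are all assumptions rather than the SDK's documented hook API; the point is the observe-and-control pattern, with every tool call logged and risky ones blocked before they run.

```typescript
// Hypothetical tool-call and hook shapes for illustration only.
type ToolCall = { tool: string; args: Record<string, unknown> };
type Verdict = "allow" | "deny";

interface Hooks {
  onToolCall?: (call: ToolCall) => Verdict;
}

const auditLog: string[] = [];

const hooks: Hooks = {
  onToolCall(call) {
    // Observe: record every tool call for later audit.
    auditLog.push(`${call.tool}(${JSON.stringify(call.args)})`);
    // Control: deny shell commands that mention production resources.
    if (call.tool === "shell" && String(call.args.cmd ?? "").includes("prod")) {
      return "deny";
    }
    return "allow";
  },
};

// Simulate the agent loop consulting the hook before each tool call.
const readVerdict = hooks.onToolCall!({ tool: "readFile", args: { path: "src/index.ts" } });
const shellVerdict = hooks.onToolCall!({ tool: "shell", args: { cmd: "cat prod.env" } });
```

This is the kind of logging-plus-guardrail logic the article's "observe and control the agent loop" phrasing points at, expressed as a single policy function.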
There’s also built-in support for subagents — the main agent can delegate subtasks to named subagents with their own prompts and models, enabling multi-agent workflows without having to write custom orchestration code from scratch.
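The delegation idea can be sketched as follows. The `SubagentDef` shape and the hand-written `delegate` router are illustrative assumptions; in the SDK, the main agent performs this routing itself rather than requiring caller-side code.

```typescript
// Hypothetical subagent definition: a name plus its own prompt and model.
interface SubagentDef {
  name: string;
  model: string;
  prompt: string;
}

const subagents: Record<string, SubagentDef> = {
  reviewer: {
    name: "reviewer",
    model: "composer-2",
    prompt: "Review diffs for correctness and style.",
  },
  testWriter: {
    name: "testWriter",
    model: "composer-2",
    prompt: "Write unit tests for the given module.",
  },
};

// Toy router: the main agent hands each subtask to a named subagent.
function delegate(kind: "review" | "test", payload: string): string {
  const sub = kind === "review" ? subagents.reviewer : subagents.testWriter;
  return `${sub.name}[${sub.model}]: ${payload}`;
}

const routed = delegate("review", "diff touching src/auth.ts");
```

Because each subagent carries its own prompt and model, a cheap model can handle routine subtasks while a stronger one handles the hard ones.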
Why This Matters for DevOps Teams
Building a coding agent that actually works in production is harder than it sounds. You need secure sandboxing, durable session state, environment setup, and context management — and every time a new model ships, teams often have to rework their agent loops to take advantage of it.
The Cursor SDK is designed to absorb that complexity. Teams can focus on what the agent should do rather than on maintaining the underlying infrastructure.
Mitch Ashley, VP and practice lead for software lifecycle engineering at The Futurum Group, says, “As the battle for the agent control plane continues, the Cursor SDK turns IDE-based coding agents into deployable infrastructure and positions Cursor as a contender. Exposing runtime, sandboxing, MCP integration, and execution hooks programmatically puts Cursor against CD platforms, observability vendors, and cloud providers competing to own how coding agents run inside enterprise pipelines.”
Ashley continues, “Enterprise buyers now evaluate Cursor as infrastructure. Platform teams will press on worker isolation, agent telemetry, and policy enforcement before these agents enter CI/CD. Teams deferring that evaluation inherit governance debt the moment agents start opening pull requests unattended.”
For teams with strict security requirements, the SDK supports self-hosted workers, where both code and execution remain within the organization’s network. That’s a practical requirement for many enterprise environments, not a nice-to-have.
Cloud Execution: Persistent and Resumable
One of the more useful features is cloud execution. When configured to run on Cursor’s cloud, each agent gets its own sandboxed VM with a clone of the target repository and a fully configured development environment. The agent keeps running even if the machine that kicked it off goes offline. You can reconnect later and stream the conversation from where it left off.
Cloud agents also integrate with Cursor’s existing web interface, so a task started programmatically can be inspected or manually taken over in Cursor. When the agent finishes, it can open a pull request, push a branch, or attach output artifacts. That makes them practical for async, unattended workflows — the kind that fit naturally into a CI/CD pipeline.
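The launch-and-resume pattern might look like this sketch, where an in-memory map stands in for Cursor's cloud. The `launch` and `resume` names and the session-id format are assumptions for illustration; the durable part in practice is Cursor's sandboxed VM, not a local map.

```typescript
// Stand-in for the cloud's durable session store.
const sessions = new Map<string, string[]>();

// Launch a task and return an id the caller can persist before exiting.
function launch(task: string): string {
  const id = `sess-${sessions.size + 1}`;
  sessions.set(id, [`started: ${task}`]);
  return id;
}

// Reattach later, possibly from a different machine, and replay the
// conversation transcript from where it left off.
function resume(id: string): string[] {
  const transcript = sessions.get(id);
  if (!transcript) throw new Error(`unknown session: ${id}`);
  return transcript;
}

const id = launch("Upgrade dependencies and open a pull request");
// ...the launching machine can go offline here...
const transcript = resume(id);
```

The useful property for pipelines is that the session id is the only state the launching process needs to keep.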
Model Flexibility and Composer 2
The SDK exposes every model supported in Cursor. Switching models is a single field change in the configuration. Cursor’s own Composer 2 — a specialized coding model the company describes as delivering strong performance at a fraction of the cost of general-purpose models — is positioned as the default recommendation for most coding-agent tasks.
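The single-field switch can be sketched as plain configuration. The `AgentConfig` shape and the alternate model name are assumptions; the point is that nothing else about the agent definition has to change.

```typescript
// Hypothetical config shape; only the fields shown here are illustrated.
interface AgentConfig {
  model: string;
  cwd: string;
}

const base: AgentConfig = { model: "composer-2", cwd: "." };

// Same agent definition, different model: exactly one field changes.
const alt: AgentConfig = { ...base, model: "another-supported-model" };
```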
For teams already managing AI model costs, token-based pricing lets you track and control spending on a per-task basis. That’s a more predictable cost structure than many AI tools offer today.
Getting Started
Cursor has published a public cookbook repository on GitHub with four starter projects: a minimal quickstart for local agents, a web-based scaffolding tool, an agent-powered kanban board that automatically opens PRs when cards are moved, and a terminal CLI for spawning agents from the command line. There’s also a Cursor SDK plugin in the Cursor Marketplace.
The SDK is in public beta now. For DevOps teams looking to wire AI coding capabilities into their existing pipelines — rather than asking developers to switch contexts to an IDE — this is worth a close look.