Menu
InfoQ Architecture·March 20, 2026

Stripe's Minions: Autonomous Agents for Automated Code Generation and Software Delivery

Stripe has implemented "Minions," autonomous coding agents that leverage LLMs and internal tooling to automate end-to-end software development tasks, generating thousands of production-ready pull requests weekly. This system demonstrates how AI agents can be integrated into CI/CD pipelines to augment developer productivity while maintaining stringent quality controls through human review, automated tests, and static analysis.

Read original on InfoQ Architecture

Overview of Stripe's Minions System

Stripe's Minions are autonomous agents designed to complete software development tasks from a single instruction, generating production-ready code, tests, and documentation. This system integrates large language models (LLMs) with Stripe's existing developer tooling and CI/CD pipelines. A key architectural aspect is the Minions' ability to operate across complex dependencies, including financial institutions and regulatory frameworks, while handling significant transaction volumes (over $1 trillion annually).

Agent Workflow and Blueprints

Unlike interactive coding assistants, Minions execute one-shot, end-to-end tasks. A task description, originating from sources like Slack or bug reports, is sent to a Minion. The Minion then orchestrates the work using blueprints. Blueprints are code-defined workflows that specify how tasks are broken down into subtasks, which can be handled by either deterministic routines or the agent itself. This hybrid approach ensures both efficiency for routine tasks and adaptability for more complex scenarios.

📌

Example Minion Task Types

Minions are particularly effective for well-defined tasks such as: - Configuration adjustments - Dependency upgrades - Minor refactoring

Reliability and Quality Assurance

Given the sensitive nature of Stripe's operations, reliability and correctness are paramount. All generated changes by Minions undergo human review, but critically, they contain no human-written code. Quality is further enforced through robust CI/CD pipelines, extensive automated tests, and static analysis tools, ensuring that all automatically generated changes adhere to Stripe's engineering standards before deployment.

  • All generated code is human-reviewed.
  • Integrated into existing CI/CD pipelines.
  • Automated tests ensure correctness.
  • Static analysis enforces code quality.

The system's evolution from an internal fork of Block's Goose agent highlights the customization required to fit specific LLM infrastructure and stringent operational requirements. This case demonstrates a significant trend in agent-driven software development, where LLM-based agents are deeply integrated into the entire development lifecycle to boost productivity while maintaining high quality.

AI agentsLLMautomated code generationdeveloper productivityCI/CDsoftware deliveryStripeautonomous systems

Comments

Loading comments...