InfoQ Architecture·March 20, 2026

Stripe's Minions: Autonomous Agents for Automated Code Generation and Software Delivery

Stripe has implemented "Minions," autonomous coding agents that leverage LLMs and internal tooling to automate end-to-end software development tasks, generating thousands of production-ready pull requests weekly. This system demonstrates how AI agents can be integrated into CI/CD pipelines to augment developer productivity while maintaining stringent quality controls through human review, automated tests, and static analysis.

DevOps & SRE AI & ML Infrastructure Tools & Frameworks

Read original on InfoQ Architecture

Overview of Stripe's Minions System

Stripe's Minions are autonomous agents designed to complete software development tasks from a single instruction, generating production-ready code, tests, and documentation. This system integrates large language models (LLMs) with Stripe's existing developer tooling and CI/CD pipelines. A key architectural aspect is the Minions' ability to operate across complex dependencies, including financial institutions and regulatory frameworks, while handling significant transaction volumes (over $1 trillion annually).

Agent Workflow and Blueprints

Unlike interactive coding assistants, Minions execute one-shot, end-to-end tasks. A task description, originating from sources like Slack or bug reports, is sent to a Minion. The Minion then orchestrates the work using blueprints. Blueprints are code-defined workflows that specify how tasks are broken down into subtasks, which can be handled by either deterministic routines or the agent itself. This hybrid approach ensures both efficiency for routine tasks and adaptability for more complex scenarios.

📌

Example Minion Task Types

Minions are particularly effective for well-defined tasks such as: - Configuration adjustments - Dependency upgrades - Minor refactoring

Reliability and Quality Assurance

Given the sensitive nature of Stripe's operations, reliability and correctness are paramount. All generated changes by Minions undergo human review, but critically, they contain no human-written code. Quality is further enforced through robust CI/CD pipelines, extensive automated tests, and static analysis tools, ensuring that all automatically generated changes adhere to Stripe's engineering standards before deployment.

All generated code is human-reviewed.
Integrated into existing CI/CD pipelines.
Automated tests ensure correctness.
Static analysis enforces code quality.

The system's evolution from an internal fork of Block's Goose agent highlights the customization required to fit specific LLM infrastructure and stringent operational requirements. This case demonstrates a significant trend in agent-driven software development, where LLM-based agents are deeply integrated into the entire development lifecycle to boost productivity while maintaining high quality.

AI agentsLLMautomated code generationdeveloper productivityCI/CDsoftware deliveryStripeautonomous systems

Comments

Loading comments...

Architecture Design

Design this yourself

Design a system for autonomous code generation and deployment using LLM-powered agents that integrates with existing CI/CD pipelines and version control. The system must support generating production-ready code, tests, and documentation from high-level instructions, ensure high reliability through automated testing and human review, and handle deployments in a highly regulated environment.

Practice Interview

Focus: autonomous code generation and deployment system using LLM-powered agents

Other design angles

· Design a developer platform that incorporates AI coding assistants and autonomous agents to accelerate feature development and maintenance while maintaining strict code quality and security standards.· Architect a secure and scalable internal platform for deploying and managing custom large language models, including their integration with existing developer tooling and automated workflows for code generation.· Design an enterprise-grade CI/CD pipeline that integrates autonomous AI agents for automated code review, testing, and dependency management, focusing on minimizing human intervention while ensuring compliance and stability.