The Pragmatic Engineer · June 17, 2026

Key Principles and Practices in CI/CD and Software Delivery at Scale

This article summarizes a discussion with Robert Erez, a principal engineer at Octopus Deploy, on CI/CD, deployment systems, and software delivery. It covers best practices, common pitfalls, and future trends, including the impact of AI, with a view to designing robust deployment pipelines and platform engineering initiatives. The discussion emphasizes the trade-offs between deployment approaches, from GitOps to progressive delivery, and notes the ongoing relevance of on-premise solutions for specific industries.



Strategic Deployment: Roll Forwards vs. Rollbacks

A fundamental principle in highly stateful systems, especially those backed by databases, is to prefer roll-forward deployments over rollbacks. If a new version (v2) introduced database changes, rolling back to the previous version (v1) after a failure can cause schema mismatches and data inconsistencies, because v1 was never written to work with the v2 schema. The recommended strategy is instead to quickly prepare and deploy a v3 that contains the fix, which keeps the system moving forward and avoids complex state reconciliation. This approach favors rapid iteration and recovery over reverting to a potentially incompatible state.
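The reasoning above can be sketched as a small decision helper. This is only an illustration: the `Release` type, its `schema_version` field, and the `recovery_plan` function are hypothetical names, and the rule that a schema change is the deciding factor is a simplifying assumption rather than anything prescribed in the discussion.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Release:
    version: str
    # Hypothetical field: the database schema version this release expects.
    schema_version: int


def recovery_plan(previous: Release, failed: Release) -> str:
    """Suggest a recovery action after `failed` breaks in production."""
    if failed.schema_version != previous.schema_version:
        # The database has already moved to the new schema, so redeploying
        # the old binaries would run them against a schema they do not know.
        return "roll-forward"
    # No schema change: reverting would not hit a schema mismatch
    # (assumption: no other shared state changed either).
    return "rollback-possible"


v1 = Release("1.0.0", schema_version=7)
v2 = Release("2.0.0", schema_version=8)

print(recovery_plan(v1, v2))  # roll-forward: ship a v3 with the fix
```

In this sketch, a failing v2 that changed the schema always leads to "ship v3", which mirrors the roll-forward preference described above.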


Feature Flags as a Safety Net

Feature flags offer a better safety mechanism than traditional rollbacks. When a production issue appears, the flag guarding the problematic functionality can be switched off immediately, "stopping the bleeding" without a full redeployment. This takes the pressure off during incidents and allows calmer diagnosis and resolution.
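A minimal sketch of the kill-switch idea follows, assuming an in-memory flag store. The `FeatureFlags` class, the `new-discount-engine` flag, and the `checkout_total` function are all hypothetical; a production system would typically read flags from a shared flag service so they can be flipped without a deploy.

```python
from typing import Dict, List


class FeatureFlags:
    """In-memory flag store, standing in for a shared flag service."""

    def __init__(self) -> None:
        self._flags: Dict[str, bool] = {}

    def set(self, name: str, enabled: bool) -> None:
        self._flags[name] = enabled

    def is_enabled(self, name: str) -> bool:
        # Unknown flags default to off, so new code paths stay dark
        # until someone explicitly enables them.
        return self._flags.get(name, False)


def checkout_total(prices_cents: List[int], flags: FeatureFlags) -> int:
    """Return the order total in cents."""
    total = sum(prices_cents)
    if flags.is_enabled("new-discount-engine"):
        # New behavior guarded by the flag: a flat 10% discount.
        return total * 90 // 100
    # Old, known-good behavior.
    return total


flags = FeatureFlags()
flags.set("new-discount-engine", True)
print(checkout_total([1000, 2000], flags))  # 2700

# Incident: switch the new path off without redeploying anything.
flags.set("new-discount-engine", False)
print(checkout_total([1000, 2000], flags))  # 3000
```

The old code path stays in place behind the flag, which is what makes switching off the new behavior safe while the fix is being prepared.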

Rethinking GitOps and Scalability Challenges

While widely adopted, the term "GitOps" can be misleading. Its four core pillars (declarative configuration, versioned and immutable artifacts, pull-based deployments, and continuous reconciliation) do not inherently mandate Git. The industry's dogmatic focus on Git can lead to anti-patterns, such as storing sensitive information (like secrets) in repositories. Furthermore, at extreme scale, with thousands of independent Kubernetes clusters all pulling from it, a centralized Git repository can become a bottleneck, leading to throttling and requiring complex workarounds. Pull-based GitOps is powerful, but it does not scale indefinitely without careful architectural consideration.
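To illustrate why the pillars do not depend on Git, here is a rough sketch of a pull-based reconciliation loop in which the desired state comes from any callable. The names `State`, `reconcile`, and `agent_tick` are hypothetical, and the "cluster" is just a dictionary; a real agent would talk to the Kubernetes API and to a versioned store of its choosing.

```python
from typing import Callable, Dict, List

# Hypothetical shape of declarative state: service name -> image tag.
State = Dict[str, str]


def reconcile(desired: State, actual: State) -> List[str]:
    """Mutate `actual` to match `desired`; return the actions taken."""
    actions: List[str] = []
    for name, tag in desired.items():
        if actual.get(name) != tag:
            actions.append(f"deploy {name}:{tag}")
            actual[name] = tag
    for name in list(actual):
        if name not in desired:
            actions.append(f"remove {name}")
            del actual[name]
    return actions


def agent_tick(fetch_desired: Callable[[], State], actual: State) -> List[str]:
    # Pull-based: the cluster-side agent fetches the desired state itself.
    # `fetch_desired` could read Git, an OCI registry, or an object store.
    return reconcile(fetch_desired(), actual)


cluster: State = {"api": "v2", "worker": "v1"}
source = lambda: {"api": "v3", "web": "v1"}

print(agent_tick(source, cluster))
# ['deploy api:v3', 'deploy web:v1', 'remove worker']
print(agent_tick(source, cluster))  # [] because the cluster has converged
```

Running `agent_tick` on a timer gives continuous reconciliation. With thousands of clusters each calling `fetch_desired` against one central repository, the throttling problem described above becomes easy to picture.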

Modern CI/CD Practices and Evolving Environments

Continuous Delivery vs. Continuous Deployment: Full continuous deployment (shipping every change to production) may be overkill for many organizations. Continuous delivery, where changes are validated and ready for deployment but triggered manually, often provides more practical value and control.
Ephemeral Environments: The shift from static test/staging environments to ephemeral, per-feature-branch environments is accelerating feedback loops. These temporary environments, spun up for validation and torn down post-merge, significantly improve development efficiency.
AI's Impact on CI/CD: AI-driven development agents will likely shift the focus of CI/CD from optimizing for build speed (to unblock human developers) to minimizing risk. Longer, more thorough testing will become acceptable if AI agents can manage and monitor pipelines without human context-switching, prioritizing bug prevention over sheer velocity.
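The ephemeral-environment lifecycle mentioned in the list above could be sketched roughly as follows. Everything here is an assumption for illustration: the `preview-` prefix, the `env_name` helper, and the `EphemeralEnvironments` class are hypothetical, and the 63-character limit is chosen because Kubernetes namespace names must be valid DNS labels.

```python
import re
from typing import Set


def env_name(branch: str, max_len: int = 63) -> str:
    """Derive a DNS-label-safe environment name from a branch name."""
    slug = re.sub(r"[^a-z0-9]+", "-", branch.lower()).strip("-")
    # Truncate to the length limit, then drop any trailing hyphen.
    return f"preview-{slug}"[:max_len].rstrip("-")


class EphemeralEnvironments:
    """Tracks which per-branch environments currently exist."""

    def __init__(self) -> None:
        self.active: Set[str] = set()

    def on_branch_pushed(self, branch: str) -> str:
        # Spin up (or reuse) an environment for validating this branch.
        name = env_name(branch)
        self.active.add(name)
        return name

    def on_branch_merged(self, branch: str) -> None:
        # Tear the environment down once the branch has been merged.
        self.active.discard(env_name(branch))


envs = EphemeralEnvironments()
print(envs.on_branch_pushed("feature/ABC-123_add-login"))
# preview-feature-abc-123-add-login
envs.on_branch_merged("feature/ABC-123_add-login")
print(envs.active)  # set()
```

In practice the two handlers would be wired to CI events such as a push and a merge, and would create and delete real infrastructure rather than entries in a set.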

The article also touches on the enduring need for on-premise solutions in highly regulated industries such as finance and government, which demand full control over hardware and upgrades. Finally, it discusses the rise of platform engineering teams in larger organizations, which standardize and streamline development workflows so that multiple projects and teams can stay consistent and focused.


