This article details a complex bug in Cloudflare's QUIC implementation of the CUBIC congestion control algorithm. It highlights how a Linux kernel optimization for handling idle TCP connections led to a critical issue in QUIC where the congestion window permanently collapsed, severely impacting data transfer rates. The case study provides insights into the intricacies of congestion control, the challenges of porting kernel-level optimizations to user-space implementations, and systematic debugging of performance-critical network protocols.
Congestion control algorithms (CCAs) like CUBIC are fundamental to how TCP and QUIC connections manage network bandwidth, detect loss, and recover from congestion. They determine the congestion window (cwnd) — the maximum amount of data in flight at any given moment. A larger cwnd enables higher throughput, while a smaller one throttles data transfer. Loss-based algorithms increase the sending rate when the network is healthy and decrease it upon detecting packet loss, assuming congestion. This article explores a specific bug where CUBIC's cwnd gets permanently pinned at its minimum, leading to connection stalls.
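The loss-based dynamic described above can be sketched as follows. This is a minimal illustration, not quiche's actual CUBIC code: the `Cwnd` struct, `on_ack`, and `on_loss` are hypothetical names, and the growth rule is a simplified additive-increase rather than the real cubic curve.

```rust
// Minimal sketch of a loss-based congestion controller (illustrative only).
const MAX_DATAGRAM_SIZE: usize = 1350;
const MINIMUM_WINDOW: usize = 2 * MAX_DATAGRAM_SIZE; // two full-size packets

struct Cwnd {
    bytes: usize,
    ssthresh: usize,
}

impl Cwnd {
    // While the network is healthy, ACKs grow the window: slow start
    // doubles it per RTT; congestion avoidance adds roughly one packet
    // per RTT (simplified; CUBIC uses a cubic function of time instead).
    fn on_ack(&mut self, acked: usize) {
        if self.bytes < self.ssthresh {
            self.bytes += acked; // slow start
        } else {
            self.bytes += MAX_DATAGRAM_SIZE * acked / self.bytes; // avoidance
        }
    }

    // Loss is taken as a congestion signal: halve the window,
    // but never shrink below the two-packet minimum.
    fn on_loss(&mut self) {
        self.ssthresh = (self.bytes / 2).max(MINIMUM_WINDOW);
        self.bytes = self.ssthresh;
    }
}

fn main() {
    let mut cc = Cwnd { bytes: MINIMUM_WINDOW, ssthresh: usize::MAX };
    cc.on_ack(MINIMUM_WINDOW); // one RTT of slow start: window doubles
    assert_eq!(cc.bytes, 2 * MINIMUM_WINDOW);
    cc.on_loss(); // loss detected: window halves back to the minimum
    assert_eq!(cc.bytes, MINIMUM_WINDOW);
}
```

The key invariant for this article is the floor: no matter how much loss occurs, cwnd never drops below two full-size packets — which is exactly where the bug pins it.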
Cloudflare observed unexpected failures in integration tests for their `quiche` QUIC implementation. Specifically, in scenarios with heavy early packet loss, CUBIC failed to recover, with downloads timing out 60% of the time. Investigation revealed that after the loss stopped, the cwnd remained at its minimum (two full-size packets), and the congestion state oscillated between "recovery" and "congestion avoidance" once per RTT. This behavior contradicted CUBIC's core logic: in the absence of loss, the cwnd should grow to utilize available bandwidth.
The bug originated from a 2017 Linux kernel optimization for TCP CUBIC, designed to prevent cwnd inflation after an application goes idle. The fix adjusted the `epoch_start` timestamp, which CUBIC uses to anchor its growth curve, shifting it forward by the idle duration instead of resetting it. This preserved the shape of the growth curve. When this optimization was ported to `quiche`'s user-space QUIC implementation, a subtle difference in how `bytes_in_flight == 0` was handled (in `on_packet_sent` vs. the kernel's `CA_EVENT_TX_START` callback) exposed a flaw. A follow-up kernel fix that addressed this edge case was missed during the port.
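The idle-shift idea can be sketched as below. This is a simplified illustration under assumed names (`Cubic`, `on_first_packet_after_idle`, `last_sent_time` are hypothetical), not the kernel's or quiche's actual code: on the first send after an idle period, `epoch_start` is advanced by the idle gap so the cubic growth curve resumes where it left off rather than restarting or jumping ahead.

```rust
use std::time::{Duration, Instant};

// Sketch of the idle-handling optimization (names are illustrative).
struct Cubic {
    epoch_start: Option<Instant>,    // anchor of the cubic growth curve
    last_sent_time: Option<Instant>, // when we last transmitted anything
}

impl Cubic {
    // Invoked when sending with nothing in flight (the connection was
    // idle): shift epoch_start forward by the idle duration instead of
    // resetting it, preserving the shape of the growth curve.
    fn on_first_packet_after_idle(&mut self, now: Instant) {
        if let (Some(epoch), Some(last)) = (self.epoch_start, self.last_sent_time) {
            let idle = now.saturating_duration_since(last);
            self.epoch_start = Some(epoch + idle);
        }
        self.last_sent_time = Some(now);
    }
}

fn main() {
    let t0 = Instant::now();
    let mut c = Cubic { epoch_start: Some(t0), last_sent_time: Some(t0) };

    // After 5 seconds of idle, epoch_start moves forward by the same
    // 5 seconds, so elapsed time on the curve is unchanged.
    let now = t0 + Duration::from_secs(5);
    c.on_first_packet_after_idle(now);
    assert_eq!(c.epoch_start, Some(t0 + Duration::from_secs(5)));
}
```

The hazard is in deciding what counts as "idle": in the kernel this hooks a dedicated transmit-start event, while a user-space port keying off `bytes_in_flight == 0` in `on_packet_sent` can misclassify ordinary ACK-clocked sends at a tiny cwnd as idle periods.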
The Self-Perpetuating Recovery Trap
The flaw caused `congestion_recovery_start_time` to be pushed into the future during ACK processing. At the minimum cwnd (two packets), every ACK cycle triggered a false idle detection, incorrectly advancing `congestion_recovery_start_time` again. This created a "death spiral": the connection constantly re-entered a recovery state, preventing cwnd growth and permanently pinning it at the minimum.
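Why a future timestamp is so damaging can be seen in the standard recovery check. The sketch below assumes an RFC 9002-style predicate (the function name and signature are illustrative, not quiche's exact API): a packet is treated as sent during recovery if its send time is at or before the recovery start time, and ACKs for such packets do not grow the window.

```rust
use std::time::{Duration, Instant};

// RFC 9002-style check (illustrative): a packet counts as in-recovery
// if it was sent at or before the recovery episode began. ACKs for
// in-recovery packets must not increase cwnd.
fn in_congestion_recovery(sent_time: Instant, recovery_start: Option<Instant>) -> bool {
    match recovery_start {
        Some(start) => sent_time <= start,
        None => false,
    }
}

fn main() {
    let t0 = Instant::now();
    let now = t0 + Duration::from_secs(10);

    // Healthy state: recovery began in the past, so a packet sent now
    // is not in recovery and its ACK may grow cwnd.
    assert!(!in_congestion_recovery(now, Some(t0 + Duration::from_secs(5))));

    // Buggy state: the false idle shift advanced recovery_start past
    // "now", so even a brand-new packet compares as in-recovery and
    // window growth is suppressed forever.
    assert!(in_congestion_recovery(now, Some(t0 + Duration::from_secs(20))));
}
```

With the start time perpetually ahead of the clock, every packet the connection ever sends looks like a recovery-era packet, which is why the cwnd stays pinned at two packets indefinitely.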