Slack Engineering·March 31, 2026

Scalable HTTP/3 Network Probing with Prometheus Blackbox Exporter

Slack faced a significant observability gap when rolling out HTTP/3 due to existing monitoring tools lacking QUIC support. This article details their solution: extending the Prometheus Blackbox Exporter to natively support HTTP/3 probing, enabling scalable, unified monitoring, and improved alerting across their infrastructure. It highlights the importance of observability in new protocol adoption and open-source contributions.

DevOps & SRE Performance & Scaling Tools & Frameworks

Read original on Slack Engineering

The Challenge: HTTP/3 Observability Gap

When Slack began rolling out HTTP/3 support, built on the QUIC transport protocol (which uses UDP instead of TCP), they encountered a critical observability gap. Their existing commercial SaaS offerings and custom internal tools, including the Prometheus Blackbox Exporter, lacked the capability to probe HTTP/3 endpoints. This meant they couldn't get client-side visibility for metrics like round-trip times or regressions, essential for monitoring the health and performance of their new infrastructure.

ℹ️

Why HTTP/3 and QUIC Pose a Monitoring Challenge

HTTP/3's reliance on QUIC (Quick UDP Internet Connections) fundamentally changes how network communication works at the transport layer. Traditional monitoring tools are often designed with TCP in mind, making them incompatible with UDP-based QUIC without specific support. This highlights a common system design challenge: adopting new protocols requires re-evaluating and potentially re-architecting supporting infrastructure like monitoring systems.

Solution: Extending Prometheus Blackbox Exporter

To address this, Slack chose to extend their existing Prometheus Blackbox Exporter (BBE). The core of the solution involved integrating a QUIC-capable HTTP client, specifically `quic-go` due to its widespread adoption and Go language support, into BBE's HTTP client structure. This allowed BBE to send probes over HTTP/3 and collect metrics from these endpoints.

http3Transport := &http3.Transport{
    TLSClientConfig: tlsConfig,
    QUICConfig: &quic.Config{},
}
client = &http.Client{
    Transport: http3Transport,
}

Unified Monitoring: A single pane of glass in Grafana for HTTP/1.1, HTTP/2, and HTTP/3 metrics, improving correlation and debugging.
Reliable Alerts: Ability to create more accurate and reliable alerts based on HTTP/3 endpoint health and performance.
Open Source Contribution: The HTTP/3 probing functionality was open-sourced, benefiting the broader Prometheus community and future-proofing observability for QUIC adoption.

Key Takeaways for System Design

Monitor First, Migrate Second: Emphasizes the critical importance of robust observability before migrating to new protocols or infrastructure. Proving the new system's stability and performance is crucial for long-term investment.
Open Source Contributions Pay Dividends: Contributing to open-source projects not only benefits the community by filling critical gaps (like HTTP/3 support in monitoring tools) but also ensures that the organization's specific needs are met and can be maintained within a shared ecosystem.
Architect for Extensibility: The ability to extend an existing tool like Prometheus Blackbox Exporter with new transport protocol support demonstrates the value of designing systems with clear architectural patterns that allow for future adaptations and new feature integration without a complete overhaul.

PrometheusObservabilityMonitoringHTTP/3QUICBlackbox ExporterNetwork ProbingOpen Source

Comments

Loading comments...

Architecture Design

Design this yourself

Design a distributed network observability platform capable of monitoring a hybrid infrastructure (on-premise and cloud) that includes both traditional TCP-based HTTP/1.1/2 services and new HTTP/3 (QUIC/UDP) endpoints. The platform should unify metrics collection, provide a single pane of glass for performance and health, and enable proactive alerting, leveraging a black-box probing approach.

Practice Interview

Focus: HTTP/3 network probing and observability

Other design angles

· Design a dedicated HTTP/3 client-side performance monitoring system that can detect and report regressions to HTTP/2 and provide accurate round-trip measurements.· Design a multi-tenant API gateway and microservices architecture, focusing on how you would integrate comprehensive HTTP/3 observability into its traffic management and monitoring components.· Design a robust alerting system for a global CDN service, specifically detailing how it would leverage both internal and external HTTP/3 probes to detect edge network issues and certificate mismatches.