InfoQ Architecture·June 20, 2026

Atlassian Forge Billing: Distributed Usage Tracking Architecture

This article details Atlassian's Forge billing architecture, focusing on how it tracks and processes distributed usage signals at scale for its serverless platform. The system handles high-volume events from various services, ensuring validation, attribution, deduplication, and aggregation for accurate financial records and near real-time visibility.

Distributed Systems Cloud & Infrastructure Performance & Scaling

Read original on InfoQ Architecture

Atlassian's Forge platform, a serverless extensibility ecosystem for products like Jira and Confluence, evolved to support usage-based pricing. This shift necessitated a robust billing architecture capable of accurately tracking and processing high-volume, distributed usage events. The core challenge involved collecting fine-grained signals (e.g., function invocations, storage consumption) from independent services, ensuring their financial correctness, and transforming them into billing-ready records without loss or duplication.

Architectural Overview

The Forge billing architecture comprises several key layers: Forge services that emit usage events, a centralized ingestion and streaming layer, a Usage Tracking Service (UTS), and downstream billing/commerce systems. This decoupled design allows for independent scaling and resilience. Services use a shared schema for events to ensure consistent interpretation across the pipeline.

Usage Tracking Service (UTS)

The UTS acts as the "nervous system" for Forge Billing. Its responsibilities include validating, normalizing, enriching, and preparing incoming usage data. Crucially, UTS ensures that each event is correctly attributed to the right entitlement or subscription context before persistence and further processing. This attribution is a key complexity in multi-tenant, distributed billing systems.

ℹ️

Key System Design Challenges

Building such a system involves critical challenges like ensuring financial correctness (no double counting, no lost events), handling distributed consistency (idempotency, out-of-order events), and supporting high volume and scale with near real-time visibility.

Event Ingestion and Processing

Events flow through a Kafka-based streaming infrastructure, which provides schema validation and reliable delivery. The tracking layer then performs validation, normalization, enrichment, deduplication, and ordering. Idempotent event design and time-based aggregation are used to prevent double-counting and correctly incorporate late-arriving events through windowed processing. A stream processing engine aggregates raw usage events into metrics for billing and analytics.

Idempotent Event Design: Crucial for distributed systems to prevent side effects from duplicate deliveries.
Time-Based Aggregation: Handles out-of-order and late-arriving events, ensuring data completeness.
Kafka-based Streaming: Provides decoupling, fault tolerance, and scalability for high-volume event ingestion.
Layered Storage: Immutable long-term storage for auditability and a low-latency analytical layer for dashboards.

billing systemevent-driven architecturemicroserviceskafkausage trackingdata pipelinemulti-tenancyscalability

Comments

Loading comments...

Architecture Design

View Architecture

Design a distributed, multi-tenant billing platform for a serverless application ecosystem, focusing on accurate usage tracking at scale. The system must ingest high-volume, fine-grained events from various independent services, ensure correct attribution to tenants, handle eventual consistency with idempotent processing, and provide aggregated metrics for billing and analytics. Detail the event ingestion pipeline, the usage processing layer (validation, enrichment, deduplication, aggregation), and storage strategies for auditability and real-time visibility.

Practice Interview

Focus: distributed usage tracking and billing system

Other design angles

· Design only the core Usage Tracking Service (UTS) responsible for processing raw usage events into billing-ready records, assuming an existing event ingestion layer.· Design a system for near real-time cost visibility dashboards for developers, leveraging the aggregated usage data generated by the billing system.· Design the API and data model for defining usage-based pricing plans and how they integrate with the distributed usage tracking system.