Cloudflare evolved its Threat Intelligence Platform (TIP) to be highly scalable and real-time by eliminating traditional ETL. The architecture leverages a sharded, SQLite-backed design distributed across Cloudflare's edge network using Durable Objects and Workers, enabling sub-second query latency for billions of events. This approach allows for real-time threat detection, correlation, and automated response directly at the edge.
Read original on Cloudflare BlogCloudflare's Threat Intelligence Platform (TIP) addresses the challenge of providing actionable insights from vast amounts of security telemetry by moving away from centralized, monolithic databases and complex ETL pipelines. The core architectural innovation lies in its distributed, edge-native design, leveraging Cloudflare's serverless platform.
The platform is built on a sharded architecture, where threat events are distributed across thousands of logical shards. Each shard is implemented using a Cloudflare Durable Object, which provides a consistent, transactional interface to its own private SQLite database. This design prevents a single database from becoming a bottleneck during high-volume ingestion and complex queries. Data ingestion uses Cloudflare Queues for asynchronous processing, storing "hot" index data in Durable Object SQLite and "cold" data in R2 for long-term retention.
// A conceptual look at fanning out a query to multiple shards
async function fetchFromShards(shards, query) {
const promises = shards.map(shardId => {
const stub = TELEMETRY_DO.get(shardId);
return stub.querySQLite(query); // Calling the DO's storage method
});
// Parallel execution across the Cloudflare network
const results = await Promise.all(promises);
return results.flat();
}A key aspect is running compute directly at the edge. GraphQL endpoints, also implemented using Cloudflare Workers, fan out queries in parallel to multiple Durable Objects across the global network. This approach minimizes latency by executing SQL queries where the data resides, eliminating the need to backhaul data to a central datacenter. Optimizations like Smart Placement ensure query-handling Workers are physically close to their Durable Objects.
Eliminating ETL Overhead
By storing structured data directly in edge-located SQLite databases and providing GraphQL access via Workers, Cloudflare effectively eliminates traditional ETL processes. Data is live upon ingestion, reducing delays between telemetry capture and its availability for analysis and action.