Cloudflare has launched a comprehensive, full-stack platform for building and running AI agents, featuring six distinct infrastructure primitives. This architecture enables high concurrency and low latency for AI agent workloads by leveraging dedicated containers, optimized state management, and an edge-distributed global network. The platform addresses common challenges in AI agent development, such as compute, orchestration, memory, and browsing capabilities.
Read original on InfoQ ArchitectureCloudflare's Browser Run, a key component for AI agents requiring browser interaction, underwent a significant architectural overhaul. Initially sharing infrastructure with Browser Isolation (BISO), it struggled with the distinct, spiky usage patterns of AI agents compared to BISO's long, steady human sessions. This led to a redesign focusing on dedicated resources and optimized state management to handle high demand and achieve better performance metrics.
Key Improvements for Browser Run
The rebuild delivered 4x higher concurrency (120 simultaneous browsers), 50% faster response times for quick actions, and added support for WebGL and WebMCP. This was achieved by moving to dedicated Containers with regional pools of pre-warmed browsers and migrating state management for better consistency and batch processing.
A crucial architectural decision involved migrating state management. Previously, Browser Run used Workers KV, which, due to its eventual consistency model, introduced race conditions with the rapid, transactional needs of AI agents. The platform shifted to D1 (Cloudflare's serverless database built on SQLite) combined with Queues for transactional assignment and batch writes. This change allows for atomic operations and efficient handling of up to 500,000 containers per location, improving reliability and scalability.
Cloudflare's platform for AI agents is built on six distinct infrastructure primitives, providing a full-stack solution:
This vertically integrated, edge-distributed architecture aims to offer a competitive alternative to hyperscaler solutions by providing a cohesive platform where every layer is optimized for dynamic, per-tenant operations, running on Cloudflare's global network.