Menu
Cloudflare Blog·July 1, 2026

Cloudflare's Enhanced Bot Management for AI Traffic

Cloudflare is evolving its bot management capabilities to provide website owners with more granular control over AI traffic. This update introduces a pragmatic taxonomy to classify bots based on their behavior (Search, Agent, Training) and content use, moving beyond a simple block/allow mechanism. The new features enable finer tuning of access for various AI activities, aiming to balance content protection with discoverability and fair compensation for creators.

Read original on Cloudflare Blog

Cloudflare has significantly updated its bot management platform to address the evolving landscape of AI-driven web traffic. Recognizing that a simple "block all AI bots" approach is insufficient, the new system provides a more nuanced way for website owners to manage how AI crawlers interact with their content. This is crucial for maintaining content independence while ensuring discoverability and potential compensation.

Pragmatic Taxonomy for AI Bot Behavior

The core of Cloudflare's new approach is a pragmatic taxonomy that classifies bots based on their *behavior* rather than just a generic "AI" label. This allows for more precise control and transparency regarding what bots are doing on a website. Website owners can now configure rules based on these distinct behaviors:

  • Search: Bots that collect or index content to answer questions later, aiming to drive referral traffic.
  • Agent: Automated behavior acting on a person's behalf, often in real-time, to complete a job (e.g., chat fetch bots, browser-use agents).
  • Training: Crawlers taking content to train or fine-tune AI models, where data is permanently absorbed.
ℹ️

Architectural Transparency

Cloudflare encourages bot operators to separate their crawlers for different purposes (Search, Agent, Training) to increase transparency. This architectural decision allows website owners to better understand and manage access, ensuring that multi-purpose crawlers are treated according to all their declared behaviors, with defaults leaning towards the most restrictive rules.

Granular Control with BotBase and Content Use Signals

For enterprise customers, Cloudflare introduces BotBase, a comprehensive database tracking all known bots and their classifications. This provides an unprecedented level of visibility into automated traffic. Additionally, new capabilities are being built to allow customers to block or allow bots based on their *content use* levels, which indicate what a bot may keep and reshare after crawling:

  • immediate: Interact, but store and reuse nothing.
  • reference (default): Index, excerpt, and link back.
  • full: Summarize and reproduce content.

This granular control is extended with a new `use` signal in `robots.txt`, allowing website owners to express preferences for content use, which Cloudflare supports by tracking and verifying bot adherence. Bots that abuse these signals will lose their "Verified" status, impacting their ability to crawl effectively. This system design empowers site owners with more precise tools to manage their digital assets in the age of AI.

bot managementAI trafficcontent independencecrawler classificationCloudflareDDoS mitigationweb securityrobots.txt

Comments

Loading comments...