Cloudflare accuses Perplexity of ignoring crawl limits

Cloudflare has stated in a blog post that AI search company Perplexity may be actively crawling websites without respecting the applicable guidelines and restrictions for bots.

According to the network service provider, Perplexity uses techniques to circumvent detection and gain access to content that is normally protected from automated traffic.

The suspicions focus on so-called stealth crawling. Perplexity initially crawls under its own name but switches to other methods as soon as that traffic is blocked. Cloudflare found that the crawlers changed their identity by pretending to be regular browsers, such as Chrome on macOS. They also rotated IP addresses and sent traffic from different autonomous systems (identified by their ASNs) to circumvent firewall rules.
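To illustrate why the tactics described above can slip past a simple firewall rule, here is a minimal sketch in Python. It is not Cloudflare's detection logic. The user-agent token `PerplexityBot`, the example user-agent strings, and the IP ranges (reserved documentation addresses) are all assumptions chosen for illustration.

```python
import ipaddress
from dataclasses import dataclass

# Assumed values for illustration only: the token and the network below are
# placeholders, not Perplexity's real identifiers or IP ranges.
BLOCKED_UA_TOKEN = "PerplexityBot"
BLOCKED_NETWORKS = [ipaddress.ip_network("192.0.2.0/24")]


@dataclass(frozen=True)
class Request:
    user_agent: str
    source_ip: str


def is_blocked(request: Request) -> bool:
    """Block a request if its declared user agent or source IP is on the list."""
    ua_match = BLOCKED_UA_TOKEN.lower() in request.user_agent.lower()
    ip = ipaddress.ip_address(request.source_ip)
    ip_match = any(ip in network for network in BLOCKED_NETWORKS)
    return ua_match or ip_match


# A crawler that identifies itself and comes from a listed range.
declared = Request("Mozilla/5.0 (compatible; PerplexityBot/1.0)", "192.0.2.10")

# The same kind of request with a generic browser user agent and an unlisted IP.
disguised = Request(
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/124.0.0.0 Safari/537.36",
    "198.51.100.7",
)

print(is_blocked(declared))   # True
print(is_blocked(disguised))  # False
```

A rule of this kind depends entirely on what the client declares about itself and where it connects from, which is why a change of user agent and network is enough to evade it.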

To verify these findings, Cloudflare set up a test environment with new domains on which restrictions specifically targeted Perplexity bots. According to the company, the crawlers were initially recognizable as coming from Perplexity. Once blocked, however, they switched to generic user agents commonly associated with human users. In addition, the requests came from IP addresses outside Perplexity’s known ranges, and the ASNs varied.

According to Cloudflare, this behavior deviates from generally accepted standards on the internet, such as the robots.txt protocol. This protocol allows website owners to specify which parts of a site are accessible to automated systems and which are not. Cloudflare emphasizes in the blog post that transparency about the purpose and identity of crawlers is essential, especially in light of the increasing use of AI in information processing.
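As a rough illustration of how robots.txt rules are evaluated, the sketch below uses the `urllib.robotparser` module from Python's standard library. The robots.txt content, the crawler token `PerplexityBot`, and the URLs are hypothetical examples, not rules taken from Cloudflare's test domains.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: one named crawler is barred from the whole site,
# while every other agent is only barred from /private/.
ROBOTS_TXT = """\
User-agent: PerplexityBot
Disallow: /

User-agent: *
Disallow: /private/
"""

# Example of a generic browser user agent (assumed for illustration).
BROWSER_UA = (
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/124.0.0.0 Safari/537.36"
)


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# The named crawler is disallowed everywhere.
print(parser.can_fetch("PerplexityBot", "https://example.com/articles/1"))  # False

# A client presenting a browser user agent falls under the "*" group instead.
print(parser.can_fetch(BROWSER_UA, "https://example.com/articles/1"))  # True
print(parser.can_fetch(BROWSER_UA, "https://example.com/private/x"))   # False
```

The rules are matched against the name a client declares, and compliance is voluntary. That is why the protocol depends on crawlers identifying themselves honestly.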

Perplexity offers an AI search engine that provides users with summaries and answers in natural language, based on web content. Crawling plays an important role in this, as the underlying models depend on access to up-to-date online information.

Millions of requests per day

According to Cloudflare, the scale of the activity detected is significant. It involves millions of requests per day, spread across tens of thousands of domains. The company states that this pattern is not incidental and says it has taken measures in response. Perplexity has been removed from the list of verified bots, and additional network rules have been activated to block this type of traffic.

The Verge quotes Perplexity spokesperson Jesse Dwyer, who calls Cloudflare’s report mainly a publicity stunt that contains many misunderstandings about how the company works. Perplexity denies any intentional deception or technical tricks such as changing user agents or IP infrastructure. Although it questions the conclusions of the investigation, it does not address Cloudflare’s specific technical findings, such as the modified user agents or the varying autonomous systems.

Perplexity has previously faced criticism over its collection of web content, including questions about how the company handles robots.txt restrictions. At the time, management indicated that certain scraping activities may have originated from test bots outside its own infrastructure.

Clear guidelines necessary

In its blog post, Cloudflare reiterates its call for the industry to agree on clear guidelines for AI crawling. According to the company, crawlers must be recognizable, adhere to website preferences, and collect information only in an ethical and transparent manner.
