
Perplexity Accused of Stealth Bot Evasion, Threatening Web Norms

Perplexity, AI Search Engine, Robots.txt, Web Crawling, Cloudflare, Internet Norms, Data Scraping
August 04, 2025
Viqus Verdict: 8
Norms Under Siege
Media Hype 7/10
Real Impact 8/10

Article Summary

Perplexity AI is under scrutiny for allegedly using stealth bots to circumvent websites’ no-crawl directives, a practice that could undermine established internet norms. Cloudflare researchers found that when Perplexity’s known crawlers were blocked by robots.txt files or firewall rules, the company’s stealth bots masked their activity by spreading requests across multiple IP addresses and ASNs and rotating them in response to restrictions. This activity spanned more than 10,000 domains and millions of requests per day, which suggests a deliberate effort to access content despite website protections. The alleged behavior runs counter to the Robots Exclusion Protocol, introduced in 1994, which gives site owners a standard way to tell crawlers which content is off limits. The controversy highlights a broader tension between AI-driven data access and website owners’ control over their content. Several publishers, including Forbes and Ars Technica, have voiced similar concerns about Perplexity’s aggressive crawling tactics. Perplexity’s lack of response to these allegations further fuels the debate. The situation raises significant questions about the ethics of how large language models source their data and about the potential for disruption to the internet ecosystem.
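For readers unfamiliar with how a no-crawl directive works in practice, here is a minimal sketch of the check a compliant crawler is expected to run before fetching a page. It uses Python's standard `urllib.robotparser` module. The robots.txt contents, the example.com URLs, the `SomeOtherBot` name, and the `is_allowed` helper are all hypothetical and chosen for illustration; this is not Cloudflare's detection logic or Perplexity's code.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt (an assumption for illustration): it bars one
# named crawler from the whole site and bars all others from /private/.
ROBOTS_TXT = """\
User-agent: PerplexityBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file as a list of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    checks = [
        ("PerplexityBot", "https://example.com/article"),
        ("SomeOtherBot", "https://example.com/article"),
        ("SomeOtherBot", "https://example.com/private/report"),
    ]
    for agent, url in checks:
        print(agent, url, is_allowed(ROBOTS_TXT, agent, url))
```

The check is keyed on the user-agent name the crawler declares, and nothing in the protocol enforces the result. A bot that identifies itself under a different name, or that never runs the check, is not stopped by robots.txt alone, which is why the allegations center on undeclared crawlers.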

Key Points

  • Perplexity AI is accused of using stealth bots to bypass website restrictions.
  • Researchers found that Perplexity’s bots masked their activity by rotating IP addresses and ASNs.
  • The controversy raises ethical questions about AI data sourcing and potential disruption to the internet.

Why It Matters

The allegations matter because they represent a fundamental challenge to the established norms of the internet. The Robots Exclusion Protocol has been a cornerstone of web governance for decades, giving website owners control over how their content is accessed and used. If confirmed, Perplexity’s actions could set a dangerous precedent, encouraging widespread disregard for these norms and creating significant legal and ethical challenges for website operators. The case also has broader implications for AI development and deployment, forcing a critical discussion about responsible data access practices.
