Perplexity's Stealth Bots Evade Website Restrictions, Sparking Controversy

Perplexity, AI Search Engine, Robots.txt, Web Crawling, Cloudflare, Internet Norms, Data Scraping
August 04, 2025
Viqus Verdict: 8
Data Governance in the Age of AI
Media Hype 7/10
Real Impact 8/10

Article Summary

Perplexity, the AI search engine, faces allegations that it circumvents website restrictions using stealth bots. Cloudflare researchers reported that when Perplexity’s crawlers were blocked by robots.txt rules or firewalls, they switched to a different bot that used multiple IP addresses and ASNs to mask its activity and get past those protections. The behavior, observed across more than 10,000 domains and millions of requests, runs against a long-standing norm of the internet: the Robots Exclusion Protocol, introduced by Martijn Koster in 1994, which lets website owners declare which bots may access their content. The allegations extend beyond technical violations. Forbes and Wired have also accused Perplexity of plagiarism, noting suspicious traffic patterns and manipulated bot ID strings. These issues highlight growing tensions between AI development and the established infrastructure of the web, raising questions about data rights, website accessibility, and the ethical use of AI. Perplexity’s refusal to respond to these concerns further fuels the controversy.
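
To make the mechanism concrete, here is a minimal sketch of how a compliant crawler might consult robots.txt before fetching a page, using Python's standard `urllib.robotparser`. The bot name `ExampleAIBot`, the rules, and the URLs are hypothetical and chosen only for illustration. They are not taken from Cloudflare's findings or from any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: blocks one named bot entirely and keeps
# /private/ off limits for everyone else. Names and paths are illustrative.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines directly, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/article"
    # The declared bot is blocked by its own rule.
    print(is_allowed("ExampleAIBot", page))  # False
    # The same request under a browser-like identity only hits the "*" rule.
    print(is_allowed("Mozilla/5.0 (Macintosh) Chrome/124.0", page))  # True
```

The check depends entirely on the name the crawler reports for itself. That is why robots.txt works only as a voluntary convention, and why a bot that changes its ID string or network origin is not stopped by it.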

Key Points

  • Perplexity is allegedly using stealth bots to bypass website restrictions outlined in the Robots Exclusion Protocol.
  • Researchers found that Perplexity’s bots rotate IP addresses and use different ASNs to evade website blocks.
  • The company faces accusations of plagiarism from multiple publications, compounding the ethical concerns.

Why It Matters

This news matters because it exposes a fundamental conflict between rapid advances in AI and the established rules governing how information is accessed on the internet. The Robots Exclusion Protocol has been a cornerstone of web architecture for decades, giving website owners a way to control access to their content. The alleged behavior, if proven, undermines this system and raises broader questions about data ownership, algorithmic transparency, and the long-term sustainability of the web. It is a critical issue for web developers, content creators, and anyone concerned about the future of online information ecosystems.
