Viqus Logo Viqus Logo
Home
Categories
Language Models Generative Imagery Hardware & Chips Business & Funding Ethics & Society Science & Robotics
Resources
AI Glossary Academy CLI Tool Labs
About Contact
Back to all news ETHICS & SOCIETY

Perplexity Accused of Stealth Bots and Robots.txt Evasion

Perplexity AI Search Engine Robots.txt Web Crawling Cloudflare Internet Norms Data Scraping
August 04, 2025
Viqus Verdict Logo Viqus Verdict Logo 8
Norms Under Threat
Media Hype 7/10
Real Impact 8/10

Article Summary

Perplexity AI is under scrutiny for allegedly circumventing website security measures through the use of stealth bots. Cloudflare researchers discovered that when Perplexity's known crawlers were blocked by robots.txt files or firewall rules, Perplexity deployed bots that masked their activity by utilizing multiple IP addresses and rotating them in response to detection. This activity spanned tens of thousands of domains and millions of requests daily, a clear violation of the established Robots Exclusion Protocol, which dates back to 1994 and gained formal standardization in 2022. The allegations stem from a pattern of behavior mirroring concerns raised by other publishers, including Forbes and Wired, who accuse Perplexity of aggressively scraping their content. This behavior, coupled with manipulated bot ID strings, highlights a serious breach of trust within the web ecosystem and raises questions about the ethical implications of AI-powered search engines.

Key Points

  • Perplexity AI is accused of using stealth bots to bypass website security measures, specifically robots.txt directives.
  • Cloudflare researchers identified a pattern of Perplexity deploying bots that masked their activity by rotating IP addresses and using different ASNs.
  • This activity violates established internet norms and raises concerns about the ethical sourcing of information by AI search engines, mirroring previous accusations from publications like Forbes and Wired.

Why It Matters

This news is significant because it exposes a potential vulnerability in the way AI search engines operate and highlights the ongoing tension between accessibility of information and the rights of content creators. The use of stealth tactics undermines the fundamental principles of the internet’s architecture, established over three decades, and has broader implications for data privacy and the future of web scraping. It forces a conversation about the responsibility of AI developers and the ethical considerations surrounding information access in an increasingly automated world. Professionals in cybersecurity, web development, and data governance need to understand these emerging threats.

You might also be interested in