Perplexity Accused of Stealth Bot Evasion, Threatening Web Norms
8
What is the Viqus Verdict?
We evaluate each news story based on its real impact versus its media hype to offer a clear and objective perspective.
AI Analysis:
While the specific impact is still unfolding, the underlying issue – the potential undermining of fundamental internet protocols – has the capacity for significant systemic disruption, justifying a high impact score. The level of media attention and social discussion surrounding the accusations also warrants a high hype score.
Article Summary
Perplexity AI is under scrutiny for allegedly employing sophisticated stealth bots to circumvent websites’ no-crawl directives, a practice that could undermine established internet norms. Cloudflare researchers discovered that when Perplexity’s known crawlers were blocked by robots.txt files or firewall rules, the company’s stealth bots masked their activity by utilizing multiple IP addresses and rotating them in response to restrictions. This activity spanned over 10,000 domains and millions of requests per day, demonstrating a targeted effort to access content despite website protections. The accusation harkens back to the 1994 Robots Exclusion Protocol, which provides a standard for informing crawlers of restricted access. The controversy highlights a broader tension between AI-driven data access and website owners’ control over their content. Several publishers, including Forbes and Ars Technica, have voiced similar concerns about Perplexity’s aggressive crawling tactics. Perplexity’s lack of response to these allegations further fuels the debate. This situation raises significant questions about the ethical implications of large language models’ data sourcing methods and the potential for disruption to the internet ecosystem.
Key Points
- Perplexity AI is accused of using stealth bots to bypass website restrictions.
- Researchers found that Perplexity’s bots masked their activity by rotating IP addresses and ASNs.
- The controversy raises ethical questions about AI data sourcing and potential disruption to the internet.
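
For readers unfamiliar with the Robots Exclusion Protocol mentioned in the summary, the following is a minimal sketch of how a compliant crawler would consult a robots.txt file before fetching a page, using Python's standard `urllib.robotparser` module. The robots.txt contents, the `ExampleAIBot` and `OtherBot` user-agent names, the `example.com` URLs, and the `is_allowed` helper are all hypothetical and chosen only for illustration; they do not describe any real crawler or site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents (assumption, for illustration only):
# one named crawler is blocked from the whole site, while every other
# crawler is blocked only from /private/.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if the given robots.txt rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() accepts the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent, url in [
        ("ExampleAIBot", "https://example.com/article"),
        ("OtherBot", "https://example.com/article"),
        ("OtherBot", "https://example.com/private/page"),
    ]:
        print(agent, url, is_allowed(agent, url))
```

The check runs entirely on the crawler's side: robots.txt only states the site's wishes, and a crawler that skips the check or identifies itself under a different user agent is not stopped by the file. That is why the summary mentions firewall rules alongside robots.txt as a second line of defense.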

