AI Scraping Content Stealthily

When a website puts up a “no AI bots allowed” sign, Perplexity apparently sees it as more of a suggestion.

Cloudflare’s research team caught the AI company red-handed, scraping sites that explicitly blocked AI crawlers. The scale? Tens of thousands of domains. Millions of requests daily. That’s not an accident.

Perplexity’s tactics read like a spy thriller. They’re changing user-agent strings to pretend they’re regular browsers. Switching between different networks to dodge IP blocks. Even using third-party scraping APIs like Crawlbase that handle the dirty work – rotating IPs, bypassing CAPTCHAs, rendering JavaScript. Some scrapers turn to solutions like Bright Data’s Web Unlocker API to break through HTTP request barriers and 403 Forbidden errors.
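Why does swapping a user-agent string work? A rough sketch, not Cloudflare's or Perplexity's actual code: the "no AI bots" sign is usually a robots.txt rule keyed to the crawler's declared name, and compliance is voluntary. The robots.txt contents, the example.com URL, the browser string, and the `is_allowed` helper below are all assumptions made up for illustration, using Python's standard `urllib.robotparser`.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one declared AI crawler, allow everyone else.
ROBOTS_TXT = """\
User-agent: PerplexityBot
Disallow: /

User-agent: *
Allow: /
"""

# Illustrative values only; neither comes from the Cloudflare report.
URL = "https://example.com/article"
BROWSER_UA = "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36"


def is_allowed(user_agent: str, url: str) -> bool:
    """Return True if the robots.txt rules permit this user agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())  # parse in memory, no network request
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # The declared crawler name matches the Disallow rule.
    print("Declared bot allowed:", is_allowed("PerplexityBot", URL))
    # A browser-like string only matches the wildcard rule.
    print("Browser-like UA allowed:", is_allowed(BROWSER_UA, URL))
```

The rule only bites if the crawler announces itself honestly and bothers to check. Send a browser-like string instead and the name-based rule never matches, which is why defenders end up looking at network behavior rather than labels.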

Clever? Sure. Ethical? That’s another story. The company’s CEO couldn’t even define plagiarism when asked during an interview, raising questions about the company’s content ethics.

To spot the disguised traffic, Cloudflare had to break out machine learning and network analysis.
