When a website puts up a “no AI bots allowed” sign, Perplexity apparently sees it as more of a suggestion.

Cloudflare’s research team caught the AI company red-handed, scraping sites that explicitly blocked AI crawlers. The scale? Tens of thousands of domains. Millions of requests daily. That’s not an accident.

Perplexity’s tactics read like a spy thriller. They’re changing user-agent strings to pretend they’re regular browsers. Switching between different networks to dodge IP blocks. Even using third-party scraping APIs like Crawlbase that handle the dirty work – rotating IPs, bypassing CAPTCHAs, rendering JavaScript. Some scrapers turn to solutions like Bright Data’s Web Unlocker API to break through HTTP request barriers and 403 Forbidden errors.
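
Why does swapping a user-agent string work so well? Because robots.txt rules are matched against whatever name the client announces. Here is a minimal sketch using Python's standard `urllib.robotparser`, assuming a site blocks a crawler that declares itself as `PerplexityBot`. The robots.txt contents, the `example.com` URL, the browser string, and the `is_allowed` helper are all illustrative, not taken from Cloudflare's report.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one declared AI crawler, allow everyone else.
ROBOTS_TXT = """\
User-agent: PerplexityBot
Disallow: /

User-agent: *
Allow: /
"""

# Illustrative browser-style user-agent string (an assumption for this sketch).
BROWSER_UA = "Mozilla/5.0 (Windows NT 10.0; Win64; x64)"


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://example.com/article"
    print("Declared crawler allowed:", is_allowed("PerplexityBot", url))
    print("Browser-like client allowed:", is_allowed(BROWSER_UA, url))
```

The rule only binds a client that identifies itself honestly. A request that claims to be a regular browser falls under the catch-all entry, which is why name-based blocking alone cannot stop a crawler that hides its identity.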

Clever? Sure. Ethical? That’s another story. The company’s CEO couldn’t even define plagiarism when asked during an interview, raising questions about its content ethics.

Cloudflare had to break out machine learning and network analysis.
