
When a website puts up a “no AI bots allowed” sign, Perplexity apparently sees it as more of a suggestion.

Cloudflare’s research team caught the AI company red-handed, scraping sites that explicitly blocked AI crawlers. The scale? Tens of thousands of domains. Millions of requests daily. That’s not an accident.

Perplexity’s tactics read like a spy thriller. They’re changing user-agent strings to pretend they’re regular browsers. Switching between different networks to dodge IP blocks. Even using third-party scraping APIs like Crawlbase that handle the dirty work – rotating IPs, bypassing CAPTCHAs, rendering JavaScript. Some scrapers turn to solutions like Bright Data’s Web Unlocker API to break through HTTP request barriers and 403 Forbidden errors.
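Why does swapping the user-agent string work so well? A rough sketch, assuming the site's "no AI bots" sign is a robots.txt rule keyed on the crawler's self-declared name. The crawler name `ExampleAIBot`, the robots.txt contents, and the URL below are hypothetical and chosen only for illustration; the check uses Python's standard `urllib.robotparser`.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: blocks one self-declared AI crawler, allows everyone else.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Allow: /
"""

# A generic desktop-browser user-agent string, of the kind a disguised crawler might send.
BROWSER_UA = (
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) "
    "AppleWebKit/537.36 (KHTML, like Gecko) Chrome/124.0.0.0 Safari/537.36"
)


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())  # parse in memory, no network request
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/article"
    print("declared bot allowed:", is_allowed("ExampleAIBot", page))
    print("browser-like UA allowed:", is_allowed(BROWSER_UA, page))
```

The rule only binds a crawler that announces itself honestly. The same request sent under a browser-like name passes the check, so name-based blocking alone cannot stop a crawler that chooses to disguise itself.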


Clever? Sure. Ethical? That’s another story. The company’s CEO couldn’t even define plagiarism when asked during an interview, raising questions about their content ethics.

To pick the disguised traffic out from real visitors, Cloudflare had to break out machine learning and network analysis.
