ai scraping content stealthily

When a website puts up a “no AI bots allowed” sign, Perplexity apparently sees it as more of a suggestion.

Cloudflare’s research team caught the AI company red-handed, scraping sites that explicitly blocked AI crawlers. The scale? Tens of thousands of domains. Millions of requests daily. That’s not an accident.

Perplexity’s tactics read like a spy thriller. They’re changing user-agent strings to pretend they’re regular browsers. Switching between different networks to dodge IP blocks. Even using third-party scraping APIs like Crawlbase that handle the dirty work – rotating IPs, bypassing CAPTCHAs, rendering JavaScript. Some scrapers turn to solutions like Bright Data’s Web Unlocker API to break through HTTP request barriers and 403 Forbidden errors.

They’re masquerading as regular browsers, switching networks, using third-party scraping APIs to bypass blocks.

Clever? Sure. Ethical? That’s another story. The company’s CEO couldn’t even define plagiarism when asked during an interview, raising questions about their content ethics.

Cloudflare had to break out machine learning and network analysis.

References

You May Also Like

Facebook’s AI Takeover: The Terrifying Future of Digital Censorship

Meta’s AI now controls 90% of Facebook’s content decisions—what they’re hiding about automated censorship will change how you see social media forever.

This City’s Bold AI Experiment Is Reading Residents’ Minds

This controversial experiment reads citizens’ minds using AI while officials defend its benefits. Privacy advocates warn we’re crossing a line. Are your thoughts really private anymore?

AI ‘Death Panels’ Could Soon Decide Your Medicare Coverage Under Trump Plan

Medicare’s AI revolution promises faster care but hides a disturbing truth: companies profit more when machines deny your treatment.

The Silent War: AI Training Models Weaponized as Political Propaganda Machines

AI propaganda machines now match human persuasiveness, eroding democracy while 43% fall for their lies. Truth is vanishing before our eyes.