#web-scraping

[ follow ]
fromeLearning Industry
5 days ago

How Web Scraping Fuels Competitive Intelligence In eLearning?

Web scraping delivers competitive intelligence for online course providers, giving them a strategic edge in the market. This intelligence helps in building stronger courses that can sell better.
E-Commerce
#ai-ethics
fromTechCrunch
1 week ago
Privacy technologies

Some people are defending Perplexity after Cloudflare 'named and shamed' it | TechCrunch

fromTechCrunch
1 week ago
Privacy professionals

Perplexity accused of scraping websites that explicitly blocked AI scraping | TechCrunch

fromZDNET
2 months ago
Artificial intelligence

Reddit sues Anthropic for scraping its users' content without consent

fromTechCrunch
1 week ago
Privacy technologies

Some people are defending Perplexity after Cloudflare 'named and shamed' it | TechCrunch

fromTechCrunch
1 week ago
Privacy professionals

Perplexity accused of scraping websites that explicitly blocked AI scraping | TechCrunch

Artificial intelligence
fromZDNET
2 months ago

Reddit sues Anthropic for scraping its users' content without consent

Reddit sues Anthropic for breaching user privacy by scraping content without consent, amid increasing legal challenges to AI content usage.
#cloudflare
fromZDNET
1 week ago
Privacy professionals

Perplexity is sneaking onto websites to scrape blocked content, says Cloudflare

fromZDNET
1 week ago
Privacy professionals

Perplexity is sneaking onto websites to scrape blocked content, says Cloudflare

Privacy professionals
fromArs Technica
1 week ago

AI site Perplexity uses "stealth tactics" to flout no-crawl edicts, Cloudflare says

Perplexity allegedly uses stealth bots to bypass websites' no-crawl directives, violating established Internet norms.
fromRealpython
1 week ago

Introduction to Web Scraping With Python Quiz - Real Python

This quiz consists of seven questions that test knowledge on core concepts of web scraping using Python, focusing on Beautiful Soup and MechanicalSoup.
Web development
fromRaymondcamden
3 weeks ago

Using AgentQL and Pipedream to Fix Missing RSS Feeds

AgentQL efficiently transforms web page data into structured formats, facilitating tasks like creating RSS feeds from blogs without existing feeds.
fromHackernoon
1 year ago

The TechBeat: Downside Liquidity: A Hypothesis on Short Pools for EVM (7/22/2025) | HackerNoon

AI-powered web scrapers can automate HTML fetching, XPath generation, and script writing within IDEs.
fromHackernoon
1 year ago

The TechBeat: Welcome to the Museum of AI Hallucinations (7/20/2025) | HackerNoon

Targeted campaigns, effective publisher selection, and real-time optimization can drive scalable growth for crypto brands, showcasing the importance of strategic marketing in this sector.
Tech industry
fromHackernoon
2 years ago

The HackerNoon Newsletter: Outsmarting Akamais Bot Detection with JA3Proxy (7/19/2025) | HackerNoon

The Machine Economy represents not just process optimization but a profound shift in the underlying forces that drive economics, as machines take more control over economic functions.
Tech industry
fromHackernoon
2 years ago

Teaching Your AI to Read: A Guide to Scraping, RAG, and Smart Data Insights | HackerNoon

Large Language Models are reshaping data analysis by allowing natural language queries instead of traditional Business Intelligence tools.
fromHackernoon
2 years ago

Scrape Smarter, Not Harder: Let MCP and AI Write Your Next Scraper for You | HackerNoon

The Model Context Protocol (MCP) is an open standard that enables large language models to interact with external tools and data through a standardized interface.
Web development
fromHackernoon
1 month ago

Kasada Anti-Bot Bypass Techniques: Save Money with These Open-Source Solutions | HackerNoon

The easiest way to detect when a website is using Kasada is by asking it for Wappalyzer, which has a browser extension you can use while visiting a website to detect its tech stack.
E-Commerce
fromZDNET
1 month ago

This open-source bot blocker shields your site from pesky AI scrapers - here's how

Anubis is an open-source program designed to protect websites from harmful AI bots.
fromBusiness Matters
1 month ago

Antidetect Browser + Automation: A Safe Setup for Web Scraping and Botting

The integration of antidetect browsers with automation frameworks is essential to counteract advanced web scraping barriers implemented by websites.
fromZDNET
1 month ago

This proxy provider I tested is the best for web scraping - and it's not IPRoyal or MarsProxies

Oxylabs provides a robust and ethical web scraping service powered by a vast network of residential proxies.
#nodejs
Privacy technologies
fromZDNET
1 month ago

Paid proxy servers vs free proxies: Is paying for a proxy service worth it?

Proxy servers serve as gateways, providing anonymity and various functionalities for both individuals and businesses.
fromNature
2 months ago

Web-scraping AI bots cause disruption for scientific databases and journals

It's the wild west at the moment, the biggest issue is the sheer volume of requests [to access a website], which is causing strain on their systems. It costs money and causes disruption to genuine users.
Artificial intelligence
Artificial intelligence
fromHackernoon
3 years ago

Behind the Scenes of Using Web Scraping and AI in Investigative Journalism | HackerNoon

Web scraping is essential for journalists to extract public information and hold authorities accountable.
fromHackernoon
4 months ago

AI and Proxies: Are They Connected? | HackerNoon

Proxies are crucial for overcoming data collection barriers in machine learning.
E-Commerce
fromEntrepreneur
3 months ago

How Web Data Helps You Stay Ahead of the Competition | Entrepreneur

Ecommerce businesses need to leverage public web data for better decision-making across industries.
fromHackernoon
2 years ago

What Does Your AI Agent Need to Conquer the Web? | HackerNoon

AI agents must evolve to outperform humans in speed and accuracy.
Real-time data extraction is crucial for AI agents to succeed online.
[ Load more ]