#web-scraping

[ follow ]
#cloud-storage
fromHackernoon
1 year ago
Tech industry

The TechBeat: Downside Liquidity: A Hypothesis on Short Pools for EVM (7/22/2025) | HackerNoon

fromHackernoon
1 year ago
Tech industry

The TechBeat: Downside Liquidity: A Hypothesis on Short Pools for EVM (7/22/2025) | HackerNoon

fromHackernoon
2 years ago

The HackerNoon Newsletter: Outsmarting Akamais Bot Detection with JA3Proxy (7/19/2025) | HackerNoon

The Machine Economy represents not just process optimization but a profound shift in the underlying forces that drive economics, as machines take more control over economic functions.
Tech industry
#data-extraction
#large-language-models
fromHackernoon
2 years ago
Web development

Scrape Smarter, Not Harder: Let MCP and AI Write Your Next Scraper for You | HackerNoon

fromHackernoon
2 years ago
Business intelligence

Teaching Your AI to Read: A Guide to Scraping, RAG, and Smart Data Insights | HackerNoon

fromHackernoon
2 years ago
Web development

Scrape Smarter, Not Harder: Let MCP and AI Write Your Next Scraper for You | HackerNoon

fromHackernoon
2 years ago
Business intelligence

Teaching Your AI to Read: A Guide to Scraping, RAG, and Smart Data Insights | HackerNoon

fromHackernoon
1 week ago

Kasada Anti-Bot Bypass Techniques: Save Money with These Open-Source Solutions | HackerNoon

The easiest way to detect when a website is using Kasada is by asking it for Wappalyzer, which has a browser extension you can use while visiting a website to detect its tech stack.
E-Commerce
#internet-security
fromZDNET
1 month ago
Privacy technologies

Paid proxy servers vs free proxies: Is paying for a proxy service worth it?

fromZDNET
1 month ago
Privacy technologies

Paid proxy servers vs free proxies: Is paying for a proxy service worth it?

fromZDNET
1 week ago

This open-source bot blocker shields your site from pesky AI scrapers - here's how

F5 reports that over half of all web visits originate from data scrapers like OpenAI and Google, raising concerns about the impact of AI on online resources.
Privacy technologies
#cloudflare
#data-collection
Artificial intelligence
fromHackernoon
3 years ago

Behind the Scenes of Using Web Scraping and AI in Investigative Journalism | HackerNoon

Web scraping is essential for journalists to extract public information and hold authorities accountable.
Artificial intelligence
fromHackernoon
3 years ago

Behind the Scenes of Using Web Scraping and AI in Investigative Journalism | HackerNoon

Web scraping is essential for journalists to extract public information and hold authorities accountable.
fromZDNET
4 weeks ago

This proxy provider I tested is the best for web scraping - and it's not IPRoyal or MarsProxies

Oxylabs offers a significantly larger pool of residential proxy machines, boasting over 175 million proxies compared to competitors who have fewer than 1 million.
Marketing tech
#nodejs
Artificial intelligence
fromZDNET
1 month ago

Reddit sues Anthropic for scraping its users' content without consent

Reddit sues Anthropic for breaching user privacy by scraping content without consent, amid increasing legal challenges to AI content usage.
#bots
fromNature
1 month ago
Artificial intelligence

Web-scraping AI bots cause disruption for scientific databases and journals

fromNature
1 month ago
Artificial intelligence

Web-scraping AI bots cause disruption for scientific databases and journals

#data-analysis
Bootstrapping
fromHackernoon
2 years ago

How to Build a No-Limits Stock Market Scraper with Python | HackerNoon

Building a custom web scraping solution allows for unrestricted access to financial data without the limitations of traditional APIs.
E-Commerce
fromEntrepreneur
2 months ago

How Web Data Helps You Stay Ahead of the Competition | Entrepreneur

Ecommerce businesses need to leverage public web data for better decision-making across industries.
Bootstrapping
fromHackernoon
2 years ago

How to Build a No-Limits Stock Market Scraper with Python | HackerNoon

Building a custom web scraping solution allows for unrestricted access to financial data without the limitations of traditional APIs.
E-Commerce
fromEntrepreneur
2 months ago

How Web Data Helps You Stay Ahead of the Competition | Entrepreneur

Ecommerce businesses need to leverage public web data for better decision-making across industries.
#ai
Artificial intelligence
fromTheregister
3 months ago

Wikimedia Foundation bemoans AI bot bandwidth burden

Web-scraping bots are straining Wikimedia's resources, increasing bandwidth usage by 50% since January 2024, heavily impacting project sustainability.
Artificial intelligence
fromTheregister
3 months ago

Wikimedia Foundation bemoans AI bot bandwidth burden

Web-scraping bots are straining Wikimedia's resources, increasing bandwidth usage by 50% since January 2024, heavily impacting project sustainability.
#cryptocurrency
fromHackernoon
1 year ago
Cryptocurrency

The TechBeat: Bybit's $1.5 Billion Hack Proves Crypto's Biggest Flaw Isn't the Blockchain (4/7/2025) | HackerNoon

fromHackernoon
1 year ago
Cryptocurrency

The TechBeat: Bybit's $1.5 Billion Hack Proves Crypto's Biggest Flaw Isn't the Blockchain (4/7/2025) | HackerNoon

EU data protection
fromHackernoon
3 months ago

A Guide on How to Legally Web Scrape EU Data | HackerNoon

The Markup emphasizes the importance of web scraping for data journalism while navigating legal risks, especially in the EU.
Privacy technologies
fromArs Technica
3 months ago

AI bots strain Wikimedia as bandwidth surges 50%

AI crawlers are circumventing established rules, creating challenges for content platforms.
Wikimedia is focusing on a systemic initiative to address scraping issues and protect its infrastructure.
Marketing tech
fromForbes
4 months ago

New Data Shows Just How Badly OpenAI And Perplexity Are Screwing Over Publishers

AI-powered search engines are sending significantly less referral traffic to news sites compared to traditional search engines.
fromHackernoon
2 years ago

The HackerNoon Newsletter: Managing Stress May Be A Lot Simpler Than You Think (12/17/2024) | HackerNoon

Tech today emphasizes the significance of managing stress effectively and leveraging adaptable tools like Bluesky's API to enhance productivity and technical engagements.
Miscellaneous
[ Load more ]