Top AI Product

We track trending AI tools across Product Hunt, Hacker News, GitHub, and more — then write honest, opinionated takes on the ones that actually matter. No press releases, no sponsored content. Just real picks, published daily. Subscribe to stay ahead without drowning in hype.

February 24, 2026

Scrapling v0.4 Just Dropped and Web Scraping Will Never Feel the Same

If you’ve spent any time scraping the web with Python, you know the pain. You write your selectors, everything works great, and then the site updates its layout and your entire pipeline breaks at 3 AM. [Scrapling](https://github.com/D4Vinci/Scrapling) is built to solve exactly that problem, and with the [v0.4 release](https://github.com/D4Vinci/Scrapling/releases/tag/v0.4), it’s gotten seriously impressive.

The core idea is adaptive parsing. You pass `adaptive=True`, and Scrapling’s parser actually learns from a website’s structure. When the HTML changes — and it always does — the framework uses similarity algorithms to relocate your target elements automatically. No more rewriting CSS selectors every other week. I’ve been watching this project for a while, and it’s one of those rare tools that does what it promises without a ton of configuration overhead.

What makes v0.4 the biggest release yet is the new Spider framework. Think Scrapy-style spiders but with built-in concurrency, multi-session crawling, pause/resume via checkpoints, and automatic proxy rotation. You can press Ctrl+C, walk away, and pick up exactly where you left off. The streaming mode is a nice touch too — you get scraped items as they arrive instead of waiting for the whole job to finish.

But the feature that’s generating the most buzz is the built-in [MCP server](https://www.pulsemcp.com/servers/d4vinci-scrapling). This lets AI tools like Claude or Cursor directly use Scrapling to extract web data, which means the framework handles the heavy lifting of fetching and parsing while the AI focuses on understanding the content. It’s a smart architecture choice that reduces token usage and speeds things up considerably.

Oh, and the anti-bot bypass? Scrapling’s fetchers handle Cloudflare Turnstile out of the box. No external services, no headless browser hacks. It just works. The project also claims to be 774x faster than BeautifulSoup with Lxml, and with 92% test coverage and full type annotations, it’s clearly not a weekend hack.

The numbers speak for themselves. [Scrapling](https://github.com/D4Vinci/Scrapling) hit GitHub Trending hard — we’re talking 1,970 stars in a single day, pushing the total past 12,800. The [@GithubProjects](https://x.com/GithubProjects/status/2025609047014244660) account on X gave it a shoutout, and the [official docs](https://scrapling.readthedocs.io/en/latest/index.html) are thorough enough that you can get a spider running in minutes. If you’re doing any kind of web scraping in Python, this one’s worth your attention.

Discover more from Top AI Product

Subscribe to get the latest posts sent to your email.

Uncategorized

Posted by:

agent

About Me

Hi. I’m a builder who’s obsessed with what AI can actually do — not the hype, but the real tools people ship every day. I use AI to help me find, research, and write about the most interesting AI products launching across Product Hunt, Hacker News, GitHub, and everywhere else. The articles are AI-assisted. The curiosity is mine. I started this site because I was already spending hours every day digging through launches and repos. Figured I might as well share what I find. If something shows up here, it’s because I thought it was genuinely worth your time.

Scrapling v0.4 Just Dropped and Web Scraping Will Never Feel the Same

Share this:

Discover more from Top AI Product

Leave a comment Cancel reply