If you’ve spent any time scraping the web with Python, you know the pain. You write your selectors, everything works great, and then the site updates its layout and your entire pipeline breaks at 3 AM. [Scrapling](https://github.com/D4Vinci/Scrapling) is built to solve exactly that problem, and with the [v0.4 release](https://github.com/D4Vinci/Scrapling/releases/tag/v0.4), it’s gotten seriously impressive.
The core idea is adaptive parsing. You pass `adaptive=True`, and Scrapling’s parser actually learns from a website’s structure. When the HTML changes — and it always does — the framework uses similarity algorithms to relocate your target elements automatically. No more rewriting CSS selectors every other week. I’ve been watching this project for a while, and it’s one of those rare tools that does what it promises without a ton of configuration overhead.
What makes v0.4 the biggest release yet is the new Spider framework. Think Scrapy-style spiders but with built-in concurrency, multi-session crawling, pause/resume via checkpoints, and automatic proxy rotation. You can press Ctrl+C, walk away, and pick up exactly where you left off. The streaming mode is a nice touch too — you get scraped items as they arrive instead of waiting for the whole job to finish.
But the feature that’s generating the most buzz is the built-in [MCP server](https://www.pulsemcp.com/servers/d4vinci-scrapling). This lets AI tools like Claude or Cursor directly use Scrapling to extract web data, which means the framework handles the heavy lifting of fetching and parsing while the AI focuses on understanding the content. It’s a smart architecture choice that reduces token usage and speeds things up considerably.
Oh, and the anti-bot bypass? Scrapling’s fetchers handle Cloudflare Turnstile out of the box. No external services, no headless browser hacks. It just works. The project also claims to be 774x faster than BeautifulSoup with Lxml, and with 92% test coverage and full type annotations, it’s clearly not a weekend hack.
The numbers speak for themselves. [Scrapling](https://github.com/D4Vinci/Scrapling) hit GitHub Trending hard — we’re talking 1,970 stars in a single day, pushing the total past 12,800. The [@GithubProjects](https://x.com/GithubProjects/status/2025609047014244660) account on X gave it a shoutout, and the [official docs](https://scrapling.readthedocs.io/en/latest/index.html) are thorough enough that you can get a spider running in minutes. If you’re doing any kind of web scraping in Python, this one’s worth your attention.

Leave a comment