Lightning-Fast, Adaptive Web Scraping for Python
BSD-3-CLAUSE License
querySelector that can pierce Shadow DOM roots without knowing the path through nested shadow roo...
Retrieve XPath and CSS selectors from elements selected in Playwright
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In ...
Generates dynamic prototype methods for JavaScript objects (classes) by supporting method definit...
dude uncomplicated data extraction: A simple framework for writing web scrapers using Python deco...
Fluent jquery integration for puppeteer
🎭 Playwright integration for Scrapy
Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps...
🚀 Web scraping for humans
Swiss-army tool for scraping and extracting data from online assets, made for hackers
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extr...
Turn any webpage into structured data using LLMs
Turn unstructured HTML pages into structured data. The OpenScraping library can extract informati...
The web scraping open project repository aims to share knowledge and experiences about web scrapi...