SimFin's open source PDF crawler
Dynamic file detection tool based on crawler 基于爬虫的动态敏感文件探测工具
An open source webapp for scraping: towards a public service for webscraping
pylinkvalidator is a standalone and pure python link validator and crawler that traverses a web s...
Google, Naver multiprocess image web crawler (Selenium)
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extr...
Dead simple web crawler for Python
Community maintained fork of pdfminer - we fathom PDF
A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). In...