Dead simple web crawler for Python
GPL-3.0 License
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extr...
Enhanced cookiecutter template for Python libraries.
A collection of self-using anime-related crawlers.
Syntactic patterns of HTTP user-agents used by bots / robots / crawlers / scrapers / spiders. pul...
pylinkvalidator is a standalone and pure python link validator and crawler that traverses a web s...
实战🐍多种网站、电商数据爬虫🕷。包含🕸:淘宝商品、微信公众号、大众点评、企查查、招聘网站、闲鱼、阿里任务、博客园、微博、百度贴吧、豆瓣电影、包图网、全景网、豆瓣音乐、某省药监局、搜狐新闻、机器学...
An open source webapp for scraping: towards a public service for webscraping
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Google, Naver multiprocess image web crawler (Selenium)
A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). In...
Python3网络爬虫实战:淘宝、京东、网易云、B站、12306、抖音、笔趣阁、漫画小说下载、音乐电影下载等
Easy-to-use Web archiver