crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

APACHE-2.0 License

Downloads
1.9M
Stars
15.2K
Committers
94

Commit Statistics

Past Year

All Time

Total Commits
781
4,146
Total Committers
36
102
Avg. Commits Per Committer
21.69
40.65
Bot Commits
350
677

Issue Statistics

Past Year

All Time

Total Pull Requests
360
609
Merged Pull Requests
275
459
Total Issues
186
903
Time to Close Issues
21 days
4 months
Package Rankings
Top 1.07% on Npmjs.org
Top 4.76% on Proxy.golang.org
Related Projects