dude

dude uncomplicated data extraction: A simple framework for writing web scrapers using Python decorators

AGPL-3.0 License

Downloads
272
Stars
413
Committers
3
dude - 🐛 Fix incompatible types when sorting

Published by roniemartinez over 2 years ago

What's Changed

Full Changelog: https://github.com/roniemartinez/dude/compare/0.4.0...0.4.1

dude - ✨ New group() decorator and improvements to select() decorator

Published by roniemartinez over 2 years ago

What's Changed

Full Changelog: https://github.com/roniemartinez/dude/compare/0.3.0...0.4.0

dude - 📚 Project Documentation (0.3.0)

Published by roniemartinez over 2 years ago

What's Changed

Documentation: https://roniemartinez.github.io/dude/
Full Changelog: https://github.com/roniemartinez/dude/compare/0.2.0...0.3.0

dude - ✨ Initial BeautifulSoup4 support (0.2.0)

Published by roniemartinez over 2 years ago

What's Changed

Full Changelog: https://github.com/roniemartinez/dude/compare/0.1.0...0.2.0

dude - 🎉 First Release (Version 0.1.0)

Published by roniemartinez over 2 years ago

What's New

  • Simple Flask-inspired design - build a scraper with decorators.
  • Uses Playwright's API - run your scraper in Chrome, Firefox and Webkit and leverage Playwright's powerful selector engine.
  • Data grouping - group related scraping data.
  • URL pattern matching - run functions on specific URLs.
  • Priority - reorder functions based on priority.
  • Setup function - enable setup steps (clicking dialogs or login).
  • Navigate function - enable navigation steps to move to other pages.
  • Custom storage - option to save data to other formats or database.
  • Async support - write async handlers.

Full Changelog: https://github.com/roniemartinez/dude/commits/0.1.0