Scrapling

Lightning-Fast, Adaptive Web Scraping for Python

BSD-3-CLAUSE License

Downloads
324
Stars
48

Bot releases are visible (Hide)

Scrapling - v0.1.2 Latest Release

Published by D4Vinci 3 days ago

Changelog:

  • Fixed a bug where the keep_comments argument is not working as intended.
  • Adjusted the text function to automatically remove HTML comments from elements before extracting its text to prevent Lxml different behavior, for example:
    >>> page = Adaptor('<span>CONDITION: <!-- -->Excellent</span>', keep_comments=True)
    >>> page.css('span::text')
    ['CONDITION: ', 'Excellent']
    
    previously would result in this because of Lxml default behavior but now it would return the full text 'CONDITION: Excellent'
    This behavior is known with parsel\scrapy as well so wanted to handle it here.
  • Fixed a bug where the SQLite db file created by the library is not deleted when doing pip uninstall scrapling or similar.
Scrapling - v0.1.1

Published by D4Vinci 5 days ago

Minor fixes

Scrapling - v0.1

Published by D4Vinci 6 days ago