graby

Graby helps you extract article content from web pages

MIT License

Downloads
282.7K
Stars
365
Committers
21

Bot releases are hidden (Show)

graby - 1.14.0

Published by j0k3r about 6 years ago

  • Add support for cookie in site_config #165 (mostly for GDPR pages)
graby - 1.13.6

Published by j0k3r about 6 years ago

  • Add more logs & fix errors when trimming authors. #164
graby - 1.13.5

Published by j0k3r about 6 years ago

  • Body can’t be a DOMAttr #160
  • Fix tests #161
graby - 1.13.4

Published by j0k3r over 6 years ago

Fix “Invalid predicate” error on DOMXPath::query #154

graby - 1.13.3

Published by j0k3r over 6 years ago

Remove empty nodes #141 (& #153)

graby - 1.13.2

Published by j0k3r over 6 years ago

Keep the old way to remove conditional comment #152

graby - 1.13.1

Published by j0k3r over 6 years ago

  • Fix unique authors list #148
  • Fixing tests #149
  • Better remove conditional comment for IE #150
graby - 1.13.0

Published by j0k3r over 6 years ago

  • Add src_lazy_load_attr in siteconfig #144
graby - 1.12.1

Published by j0k3r over 6 years ago

  • Switch to newer HTMLawed #138
  • Improve PHP 7.2 compatibility #139
graby - 1.12.0

Published by j0k3r almost 7 years ago

  • Require tidy extension for tests #130
  • Backport some changes from FTRSS 3.5 #131
  • Add a small message about Tidy #132
  • Remove unused $res #134
  • Move lazy load attributes to config #136
  • Add support of strip_attr #133
graby - 1.11.1

Published by j0k3r almost 7 years ago

  • makeAbsoluteAttr: keep anchor links untouched #127
graby - 1.11.0

Published by j0k3r almost 7 years ago

  • Update pdf deps #122
  • Add a Gitter chat badge to README.md #123
  • Retrieve information from JSON-LD schema #124
graby - 1.10.1

Published by j0k3r about 7 years ago

  • Jump to HTMLawed 1.2 #118
  • Small update on LICENCE file
graby - 1.10.0

Published by j0k3r about 7 years ago

  • Add body length in debug #115
  • Add Monolog Handler & Formatter #116
graby - 1.9.3

Published by j0k3r about 7 years ago

  • If request fail try to fetch content anyway #113
  • Rise log level when catching exception #114
graby - 1.9.2

Published by j0k3r over 7 years ago

  • Fix utf8 encoding for plain text page #110
graby - 1.9.1

Published by j0k3r over 7 years ago

  • Take only the first og:image #109
graby - 1.9.0

Published by j0k3r over 7 years ago

  • Ability to clean given HTML #105
  • Update README.md #104
graby - 1.8.2

Published by j0k3r over 7 years ago

  • ng-controller should not trigger escaped fragment #101
graby - 1.8.1

Published by j0k3r over 7 years ago

  • Switch to MIT #99 🎉
  • Fix url extraction when url contains the symbol '+' #100
Package Rankings
Top 2.11% on Packagist.org
Badges
Extracted from project README
Join the chat at https://gitter.im/j0k3r/graby Coverage Status Total Downloads License