graby

Graby helps you extract article content from web pages

MIT License

Downloads: 282.7K
Stars: 365
Committers: 21


graby - 2.4.5 Latest Release

Published by j0k3r 10 months ago

What's Changed

Full Changelog: https://github.com/j0k3r/graby/compare/2.4.4...2.4.5

graby - 2.4.4

Published by j0k3r over 1 year ago

What's Changed

Full Changelog: https://github.com/j0k3r/graby/compare/2.4.3...2.4.4

graby - 2.4.3

Published by j0k3r over 1 year ago

What's Changed

Full Changelog: https://github.com/j0k3r/graby/compare/2.4.2...2.4.3

graby - 2.4.2

Published by j0k3r almost 2 years ago

What's Changed

Full Changelog: https://github.com/j0k3r/graby/compare/2.4.1...2.4.2

graby - 2.4.1

Published by j0k3r over 2 years ago

An empty release: 2.4.0 was created on the wrong branch (master instead of 2.x).
This release, made from the correct branch, should fix that.

graby - 2.4.0

Published by j0k3r over 2 years ago

Addition

To extract content from a page you have already fetched outside of Graby, call setContentAsPrefetched() before calling fetchContent(), e.g.:

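use Graby\Graby;

// URL of the article the HTML belongs to
$article = 'http://www.bbc.com/news/entertainment-arts-32547474';
// HTML fetched outside of Graby
$input = '<html>[...]</html>';

$graby = new Graby();
// Register the pre-fetched HTML, then extract the article from it
$graby->setContentAsPrefetched($input);
$result = $graby->fetchContent($article);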

What's Changed

Full Changelog: https://github.com/j0k3r/graby/compare/2.3.3...2.4.0

graby - 2.3.5

Published by j0k3r almost 3 years ago

What's Changed

Full Changelog: https://github.com/j0k3r/graby/compare/2.3.3...2.3.5

graby - 2.3.4

Published by j0k3r almost 3 years ago

What's Changed

New Contributors

Full Changelog: https://github.com/j0k3r/graby/compare/2.3.3...2.3.4

graby - 2.3.3

Published by j0k3r almost 3 years ago

What's Changed

Full Changelog: https://github.com/j0k3r/graby/compare/2.3.2...2.3.3

graby - 2.3.2

Published by j0k3r almost 3 years ago

What's Changed

Full Changelog: https://github.com/j0k3r/graby/compare/2.3.1...2.3.2

graby - 2.3.1

Published by j0k3r about 3 years ago

What's Changed

Full Changelog: https://github.com/j0k3r/graby/compare/2.3.0...2.3.1

graby - 2.3.0

Published by j0k3r over 3 years ago

  • Add support of wrap_in #262
  • Update PHP-CS-Fixer to 3.0 #263

graby - 2.2.7

Published by j0k3r over 3 years ago

  • Use new pdfparser version #259

graby - 2.2.6

Published by j0k3r over 3 years ago

Kudos to @Simounet for all these PRs 🤝

  • Fix test case with real content (bjori site updated) #257
  • Case where an attribute is empty overriding a filled in value #255
  • Fix wrong URL exception handling #256

graby - 2.2.5

Published by j0k3r over 3 years ago

  • Fix and refactor makeAbsoluteStr #249
  • Update CS-Fixer #250
  • ContentExtractor: fix handling of data-srcset #248
  • Rebuild siteconfig if host for singlepage is not the same #246

graby - 2.2.4

Published by j0k3r almost 4 years ago

  • Fix CS issues (following PHP CS Fixer update) #243
  • Allow new patch/fix from PDFParser #247

graby - 2.2.3

Published by j0k3r almost 4 years ago

  • Fix the way to detect a request factory #238
  • Test Guzzle7 #239
  • Ditch Travis to use GitHub Actions #241
  • Improve Composer install in GitHub Actions #242

graby - 2.2.2

Published by j0k3r about 4 years ago

  • Cast date_parse result #233
  • Use new Scrutinizer engine #234
  • Allow installing smalot/pdfparser newer version #235

graby - 2.2.1

Published by j0k3r over 4 years ago

  • Add support for Monolog v2 #230
  • Define lowest working deps #231
graby - 2.2.0

Published by j0k3r over 4 years ago

  • Remove wallabag's fork of tcpdf (it wasn't used) #225
  • Handle error using preg_* function when html is really too big #226
  • Add support for refresh header #227
  • Cleanup Travis #228
  • Allow Symfony 5 installation #229

Package Rankings

Top 2.11% on Packagist.org