Graby helps you extract article content from web pages
MIT License
Bot releases are visible (Hide)
Full Changelog: https://github.com/j0k3r/graby/compare/2.4.4...2.4.5
Published by j0k3r over 1 year ago
Full Changelog: https://github.com/j0k3r/graby/compare/2.4.3...2.4.4
Published by j0k3r over 1 year ago
Full Changelog: https://github.com/j0k3r/graby/compare/2.4.2...2.4.3
Published by j0k3r almost 2 years ago
guzzlehttp/psr7
2.0 & Symfony 6.0 by @jtojnar in https://github.com/j0k3r/graby/pull/301
Full Changelog: https://github.com/j0k3r/graby/compare/2.4.1...2.4.2
Published by j0k3r over 2 years ago
Just an empty release because 2.4.0 was created on the wrong branch (master
instead of 2.x
).
It should be fine by now.
Published by j0k3r over 2 years ago
If you want to extract content from a page you already fetched outside of Graby, you can call setContentAsPrefetched()
before calling fetchContent()
, e.g.:
use Graby\Graby;
$article = 'http://www.bbc.com/news/entertainment-arts-32547474';
$input = '<html>[...]</html>';
$graby = new Graby();
$graby->setContentAsPrefetched($input);
$result = $graby->fetchContent($article);
Full Changelog: https://github.com/j0k3r/graby/compare/2.3.3...2.4.0
Published by j0k3r almost 3 years ago
find_string
& replace_string
size are equal by @j0k3r in https://github.com/j0k3r/graby/pull/281
Full Changelog: https://github.com/j0k3r/graby/compare/2.3.3...2.3.5
Published by j0k3r almost 3 years ago
site_config
are kept by @j0k3r in https://github.com/j0k3r/graby/pull/280
Full Changelog: https://github.com/j0k3r/graby/compare/2.3.3...2.3.4
Published by j0k3r almost 3 years ago
Full Changelog: https://github.com/j0k3r/graby/compare/2.3.2...2.3.3
Published by j0k3r almost 3 years ago
Full Changelog: https://github.com/j0k3r/graby/compare/2.3.1...2.3.2
Published by j0k3r about 3 years ago
Full Changelog: https://github.com/j0k3r/graby/compare/2.3.0...2.3.1
Published by j0k3r over 3 years ago
wrap_in
#262Published by j0k3r over 3 years ago
Published by j0k3r over 3 years ago
Kudos to @Simounet for all these PRs 🤝
Published by j0k3r over 3 years ago
makeAbsoluteStr
#249Published by j0k3r almost 4 years ago
Published by j0k3r almost 4 years ago
Published by j0k3r about 4 years ago
date_parse
result #233smalot/pdfparser
newer version #235Published by j0k3r over 4 years ago
Published by j0k3r over 4 years ago
preg_*
function when html is really too big #226