A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs.
MIT License
Bot releases are visible (Hide)
Published by danburzo almost 4 years ago
Added support for hyphenation with Hyphenopoly, based on the explicit document language or best guess with franc. Hyphenation is enabled by default for PDF, and disabled for EPUB and HTML. The --hyphenate
and --no-hyphenate
flags let you explicitly opt in or out of the feature.
Thanks @yashha for this feature!
<pre>
elements in <figure>
elements to make sure Readability doesn't strip them out (#66);await
for the EPUB file to be fully generated in the epub()
call (Thanks @pascalw!).nodejs-percollate
AUR package (Thanks @pedrolucasp!)uuid/v1
function (https://github.com/danburzo/percollate/pull/111)Note: I botched v1.1.1 — never release at midnight!
This release sees a few new features and improvements from @yashha:
pdf-lib
to add a title and author to the PDF metadata (#88);--author
option to the CLI to use for populating PDF and EPUB metadata (#104);@mozilla/readability
npm package instead; (thanks @gijsk!)file://
protocol, and absolute/relative paths to files on disk (See #34)srcset
attributesCLI changes:
--cover
flag is enabled implicitly when using the --title
option or when bundling more than one item; disable the cover page with --no-cover
;--toc
flag is enabled implicitly when bundling more than one item; disable the ToC with --no-toc
;Improvements:
data-lazy-
prefix)Published by danburzo almost 6 years ago
Prefer the AMP version of an article, if available. Details here.
Support for lazy-loaded images. (#71)
Increased Puppeteer navigation timeout to 2 minutes. (#80, thanks @butu5!). Also added a --debug
flag to print more information about the process.
Fixed URL encoding before fetching it. (#83, thanks @ncsing!)
Generate a Table of Contents page (#81, thanks @guybedo!) when using the --toc
option.