article-extractor

To extract main article from given URL with Node.js

MIT License

Downloads
14.8K
Stars
1.6K
Committers
13

Bot releases are hidden (Show)

article-extractor - v5.0.0rc6

Published by ndaidong almost 3 years ago

  • Simplify extract() method, now it accepts only one parameter. Please use addQueryRules() to specify custom selector.
article-extractor - v5.0.0rc5

Published by ndaidong almost 3 years ago

article-extractor - v5.0.0rc4

Published by ndaidong almost 3 years ago

  • Refactor main workflow
  • No longer extracting oembed data
  • No longer extracting with selector rule by default, let's call it when needed!
  • Stop using lru-cache
  • Change a few default config values
  • Replace got with axios
  • Update README & test scripts

Breaking changes

  • extract(String url | String html) --> extract(String url | String html [, querySelector])
  • setFetchOptions(Object opts) --> setRequestOptions(Object opts)
  • getFetchOptions() --> getRequestOptions()
article-extractor - v5.0.0rc3

Published by ndaidong almost 3 years ago

  • Improve parsing logic
  • Re-add lru cache to save network resource
  • Improve test cases
  • Remove unused files
article-extractor - v5.0.0rc2

Published by ndaidong almost 3 years ago

  • Improve parsing flow
  • Stop parsing invalid HTML input
  • Simplify url absolutification
  • Cleanify retrieving logic
  • Add more test scripts
article-extractor - v5.0.0rc1

Published by ndaidong almost 3 years ago

  • Change parser engine to @mozilla/readability
  • Optimize result while extracting with rules
  • Fix issues: #169 #173 #176 #184 #185 #186 #187 #189
  • Stop support node older than v14
article-extractor - v4.2.12

Published by ndaidong almost 3 years ago

  • Update dependencies
  • Fix wrong property name from URL instance
  • Use https in test scripts
article-extractor - v4.2.11

Published by ndaidong almost 3 years ago

  • Change coding style to standardjs
  • Replace cross-fetch with got
  • Update dependencies
article-extractor - v4.2.10

Published by ndaidong about 3 years ago

  • merge pr #182 by @yakimka
  • update dependencies
article-extractor - v4.2.9

Published by ndaidong over 3 years ago

  • Merge PR #171
  • Resolve issue #168
  • Update dependencies
article-extractor - v4.2.8

Published by ndaidong over 3 years ago

  • replace html-minifier with html-minifier-terser (#157)
  • add rule to people.com
  • update dependencies
article-extractor - v4.2.7

Published by ndaidong over 3 years ago

  • Improve rule based logic
  • Resolve issues #150, #151
article-extractor - v4.2.6

Published by ndaidong over 3 years ago

v4.2.6 #144

  • Merge pr #142 (fix issue #139)
  • Update dependencies
article-extractor - v4.2.5

Published by ndaidong almost 4 years ago

  • Add types definition
  • Update dependencies
  • Replace Travis CI with GitHub Actions
  • Enable SonarCloud integration
  • Fix a few potential issues
article-extractor - v4.2.4rc1

Published by ndaidong almost 4 years ago

  • Add type definitions
  • Update dependencies & travis config

(#123 #132 )

article-extractor - v4.2.3

Published by ndaidong about 4 years ago

  • Fix issue #128 (also remove function isHtmlDoc)
  • Update dependencies
article-extractor - v4.2.2

Published by ndaidong about 4 years ago

  • Merge pr #115 & fix coding style
article-extractor - v4.2.1

Published by ndaidong over 4 years ago

  • Fix some stuff on test command
  • Update README
article-extractor - v4.2.0

Published by ndaidong over 4 years ago

  • Replace es6-readability with readabilitySAX
  • Remove es6 module syntax to avoid depending on esm
  • Replace tap with jest, update unit tests script
article-extractor - v4.0.5

Published by ndaidong almost 5 years ago

  • Fully refactor