node-website-scraper

Download website to local directory (including all css, images, js, etc.)

MIT License

Downloads
44.5K
Stars
1.6K
Committers
19

Bot releases are hidden (Show)

node-website-scraper - v5.3.1 Latest Release

Published by s0ph1e about 2 years ago

node-website-scraper - v5.3.0

Published by s0ph1e about 2 years ago

What's Changed

Full Changelog: https://github.com/website-scraper/node-website-scraper/compare/v5.2.0...v5.3.0

node-website-scraper - v5.2.0

Published by s0ph1e over 2 years ago

What's Changed

New Contributors

Full Changelog: https://github.com/website-scraper/node-website-scraper/compare/v5.1.0...v5.2.0

node-website-scraper - v5.1.0 [deprecated]

Published by s0ph1e over 2 years ago

What's Changed

⚠️ Version 5.1.0 was deprecated because of bugs introduced by encoding changes https://github.com/website-scraper/node-website-scraper/issues/493, failed tests in website-scraper-puppeteer

Full Changelog: https://github.com/website-scraper/node-website-scraper/compare/v5.0.0...v5.1.0

node-website-scraper - v5.0.0

Published by s0ph1e almost 3 years ago

💥 Breaking changes

  • module requires nodejs version >= 14.14
  • module is now pure ESM and cannot be required from CommonJS module. Read more about ESM modules
  • options.request may need to be changed, underlying http library was updates from request(deprecated) to got
  • scrape.defaults now can be imported with import defaultOptions from 'website-scraper/defaultOptions';
  • scrape.plugins now can be imported with import * as plugins from 'website-scraper/plugins';
  • callback usage support was removed, now only promises and async/await are supported

🐛 Bug fixes

  • urlFilter is no longer applied for root resources #460
  • issue with wrong quotes in generated html is fixed #355

🌵 Other changes

  • cheerio (underlying dependency for html parsing) was updated from version 0.22 to 1.0.0-rc.10
node-website-scraper - v4.2.3

Published by s0ph1e about 4 years ago

node-website-scraper - v4.2.2

Published by s0ph1e over 4 years ago

  • db206b705341e31c3598b8bf0463028068008697 Scrape resources with query strings when using bySiteStructure
node-website-scraper - v4.2.1

Published by s0ph1e over 4 years ago

  • c565e14fcea79a983a1e1f1557254d764b121a39 fix attributes with html-entity encoded quotes
node-website-scraper - v4.2.0

Published by s0ph1e about 5 years ago

  • 7cc60dbec87912f5d54d6ee3503fff3302b937c3 add responseData to generateFilename action params
node-website-scraper - v4.1.0

Published by s0ph1e about 5 years ago

  • 6988b86b42c441a9348cc0dc21e523c1aa9809b8 Fix issue with transforming attribute names to lowercase #369
node-website-scraper - v4.0.1

Published by s0ph1e over 5 years ago

  • Update dependencies
  • Update readme
node-website-scraper - v4.0.0

Published by s0ph1e almost 6 years ago

Breaking changes

  • drop support for nodejs versions < 8
  • remove options:
    • resourceSaver
    • updateMissingSources
    • updateSources
    • filenameGenerator with function (string with name is supported)
    • httpResponseHandler
    • request property with function (object is supported)
    • onResourceSaved
    • onResourceError
      Each removed option functionality can be implemented using plugins, see Migration guide
  • option ignoreErrors is false by default

New functionality

  • plugins
node-website-scraper - v3.4.0

Published by s0ph1e almost 6 years ago

  • 5afb782e2f43dadf8cb1002f50103bf3091158df - add updateSources option
node-website-scraper - v3.3.7

Published by s0ph1e about 6 years ago

  • e1422032a960ac192e28227b26b93ba06a7e08fa - fix debug breaking change
node-website-scraper - v3.3.6

Published by s0ph1e about 6 years ago

  • 7531dc27903381c27bfaaf04a754fc5c97f13463 encode relative path when update url for downloaded resource
node-website-scraper - v3.3.5

Published by s0ph1e about 6 years ago

Same as 3.3.4

node-website-scraper - v3.3.4

Published by s0ph1e about 6 years ago

This version was unpublished, 3.3.5 is the same as 3.3.4

  • 5ed0f624a34ff09ffe38f759fe88c8aad8ca1535 - add js type for Resource, resolve it from mime
node-website-scraper - v3.3.3

Published by s0ph1e over 6 years ago

update dependencies

node-website-scraper - v3.3.2

Published by s0ph1e about 7 years ago

  • 5a1b991 - Fix malformed url error
node-website-scraper - v3.3.1

Published by s0ph1e about 7 years ago

  • 539b48dbca3b1965408681d8d9cafbb6ca1c1de8 - Fix relative resource urls in redirected page