LinkedIn Jobs Scraper running in Node.js that uses Puppeteer and RxJS to scrape job offers from LinkedIn.
MIT License
LinkedIn Jobs Scraper running in Node.js that uses Puppeteer and RxJS to scrape job offers from LinkedIn.
IMPORTANT: Web scraping can frequently violate the terms of service of a website. Always review and respect a website's robots.txt file and its Terms of Service. In this instance, this code should be used ONLY for teaching and hobby purposes. LinkedIn specifically prohibits any data extraction from its website; you can read more here: https://www.linkedin.com/legal/crawling-terms.
/data
folderI wrote a blog explaining the code written in this repo with all the steps involved. You can find it here
Node version >= 12 and NPM >= 6
# clone the repo.
git clone https://github.com/your-username/linkedin-jobs-scraper.git
# go to the repo
cd linkedin-jobs-scraper
# install the dependencies via npm
npm install
# start scraping
npm run start
npm run start
- runs with puppeteer in headless mode.npm run start:debug
- runs with puppeteer in non-headless mode.npm run clean:data
- removes the folder /data