This repository contains a Puppeteer-based script for scraping product details from Internubel's website.
MIT License
This repository contains a Puppeteer-based script for scraping product data from Internubel's website (https://www.internubel.be).
The script logs into the site, navigates through product groups, and extracts product details including title, image, nutrition score, and nutritional information.
Data is structured and saved into JSON files categorized by product groups, sub-groups, and sub-sub-groups.
Requirements for the script:
git clone https://github.com/Jihefel/Internubel-website-scraping.git
npm install
Create your configuration file .env
in the root directory as the following to store your credentials.
LOGIN_EMAIL=your_email
LOGIN_PASSWORD=your_password
Replace your_email
and your_password
with your Internubel login credentials.
Run the scraping script using Node.js in your terminal:
node internubel.js
And wait for a moment...
The script will launch a Puppeteer-controlled browser, log into Internubel using provided credentials, and scrape product data into structured JSON files stored in the data directory.
Distributed under the MIT License. See MIT License for more information.