articles-extractor

Python app utilizing Beautiful Soup, Docker, Bash, Raspberry Pi, and cron job to automate article extraction from preferred websites and organize them into Google Sheets.

MIT License

Stars
2

Article Extractor

This application is created to retrieve articles from freeCodeCamp and Substack, and subsequently transfer all pertinent article details to a Google Sheet.

Tech Stacks

Getting Started

Please refer to the Wiki

What I have learned

I employed Python's generator feature for enhanced efficiency. I used this feature to send article information to the Sheets individually. There is no need to store the entire sequence of articles in memory at once. Previously, articles were stored in the array named “all_articles” from various providers. And then I had to loop through the array to send articles to the Sheets.

The generator is a neat way to simplify and streamline the process. This eliminates the need to store the sequence in the memory before sending it to the Sheets