Skip to content

Python app utilizing Beautiful Soup, Docker, Bash, Raspberry Pi, and cron job to automate article extraction from preferred websites and organize them into Google Sheets.

License

Notifications You must be signed in to change notification settings

victoriacheng15/articles-extractor

Repository files navigation

Article Extractor

This application is created to retrieve articles from freeCodeCamp and Substack, and subsequently transfer all pertinent article details to a Google Sheet.

Tech Stacks

Python Google Sheet API docker Raspberry PI Bash

Getting Started

Please refer to the Wiki

What I have learned

I employed Python's generator feature for enhanced efficiency. I used this feature to send article information to the Sheets individually. There is no need to store the entire sequence of articles in memory at once. Previously, articles were stored in the array named “all_articles” from various providers. And then I had to loop through the array to send articles to the Sheets.

The generator is a neat way to simplify and streamline the process. This eliminates the need to store the sequence in the memory before sending it to the Sheets

About

Python app utilizing Beautiful Soup, Docker, Bash, Raspberry Pi, and cron job to automate article extraction from preferred websites and organize them into Google Sheets.

Topics

Resources

License

Stars

Watchers

Forks