whrss

scrape White House Blog to generate RSS until it starts working again

Stars
9

During the Trump administration the White House Website disabled the RSS feed for its Wordpress site. During the four long years of the Trump administration I ran this script (whrss.py) from cron every 30 minutes to scrape the White House website and create an RSS feed that was available at:

https://inkdroid.org/rss/whitehouse.xml

On January 20, 2021 whitehouse.gov began providing an RSS feed again, and so the script was retired, and the old RSS URL permanently redirected to the new location. I took a quick look in the Apache logs and saw that it was being used about 1,500 times a day by close to 1,000 different clients (IP addresses).

https://whitehouse.gov/feed/

The main impetus for doing this was to use diffengine to publish whitehouse_diff. But maybe you'll find the RSS useful for other things too?