From 25fddaef9d988527da7253edd7f3ed0c530a8a4c Mon Sep 17 00:00:00 2001
From: Alessio
Date: Sat, 8 Jan 2022 19:50:14 -0500
Subject: [PATCH] Update CHANGELOG.txt for v0.0.4

---
 CHANGELOG.txt |  8 ++++++++
 doc/TODO.txt  | 22 ++++++++++++++++++++++
 2 files changed, 30 insertions(+)
 create mode 100644 doc/TODO.txt

diff --git a/CHANGELOG.txt b/CHANGELOG.txt
index 59f8f22..5c60832 100644
--- a/CHANGELOG.txt
+++ b/CHANGELOG.txt
@@ -27,3 +27,11 @@ v0.0.3
 - Improve user image scraping for rando users (download less eagerly) to speed up browsing and save disk space:
     - Download tiny versions of profile images (48x48 pixels)
     - Don't download banner images at all
+
+v0.0.4
+------
+
+- Escape HTML entities in tweet text
+- Add support for banned users
+- Improve tombstone recognition (quote-tweets)
+- Fix spacing issue around tiny profile images
diff --git a/doc/TODO.txt b/doc/TODO.txt
new file mode 100644
index 0000000..20eb13c
--- /dev/null
+++ b/doc/TODO.txt
@@ -0,0 +1,22 @@
+TODO user-images-extract-to-table:
+User profile images and banner images => extract to table. Add `is_downloaded` column. Add test for double-downloading a user, shouldn't change the write-times on these files
+
+TODO update-all:
+For each user in userslist.txt:
+- update user
+- download new images if necessary
+- fetch tweets since last update
+
+TODO implement-search
+
+TODO fetch-older-tweets:
+get tweets older than 3200 ago
+
+
+TODO https://twitter.com/michaelmalice/status/1461031030278742020 => "account no longer exists" tombstone
+
+TODO cleanup-main: The `cmd/twitter/main.go` file is becoming a disaster, it needs a cleanup badly. lots of copy-pastes that should be functions, code that should be moved to the scraper / persistence modules, etc
+
+TODO broadcast-cards
+panic: Unknown card type: 3691233323:periscope_broadcast
+panic: Unknown card type: 745291183405076480:broadcast