14 Commits

Author SHA1 Message Date
Alessio
1b3c5d0ed3 Add timeout error handling for scraper requests to the request body download as well (rather than just headers) 2024-04-13 16:10:23 -07:00
ca7cf613f9 Remove unnecessary import 2024-03-18 21:16:38 -07:00
3967367eed Fix more lint errors 2024-03-18 21:15:27 -07:00
69e0a35e57 Handle HTTP request timeouts 2024-03-16 19:55:05 -07:00
73c5803a47 Add downloading of DM embedded images, videos and links 2024-03-11 21:12:38 -07:00
0ad3cf8fb8 Fix lint errors 2024-03-11 14:08:07 -07:00
aa05708e20 Move media downloader from persistence to scraper package; add 429 Rate Limited error type 2024-03-11 12:57:58 -07:00
1ba4f91463 REFACTOR: replace 'log.Debug(fmt.Sprintf(...))' with 'log.Debugf(...)' and remove 'scraper.' prefix in utils_test.go 2024-03-10 19:14:27 -07:00
73ffb90f63 Move API login flow to its own file; add support for secondary verification challenges 2024-03-02 15:43:02 -08:00
8aca7d4ebe Add manual re-scrape for user feeds and quote-tweets stat on tweets 2023-08-27 22:55:40 -03:00
655a47ec21 Remove debugging panic 2023-08-27 22:00:58 -03:00
8349ca4ae3 Add background scraping of the logged-in user's home timeline 2023-08-27 21:05:09 -03:00
eaa01a2360 Fix fetching users and search
- Add is_deleted field on Users
- Fix fetching of tombstoned users including deleted users
- Fix "verified" bluechecks not being scraped anymore
- Fix search to use new graphql endpoint (old one got taken down)
2023-08-22 20:07:32 -03:00
a061decd0f REFACTOR: Rename go module to 'gitlab.com/offline-twitter/twitter_offline_engine' in accordance with 'go get' conventions
- also restructure project to use a 'pkg' directory for reusable packages
2023-07-30 14:20:07 -03:00