111 Commits

Author SHA1 Message Date
Alessio
4ea15f10af Add scraper function to get user by ID 2024-09-15 22:56:59 -07:00
Alessio
9c0f9504f6 "ConvertToAPIUser() now returns an error indicating a "not-found" response, which propagates through "GetUser" API calls 2024-09-15 17:59:25 -07:00
Alessio
b9e025589e Update CHANGELOG.txt for v0.6.16 2024-09-03 18:36:23 -07:00
Alessio
9c7174a84a Add way to scrape unread count; add db query to count unread notifs since a given sort index 2024-09-02 15:02:27 -07:00
Alessio
0c620621a6 Add scraping notiication detail 2024-08-28 19:22:09 -07:00
Alessio
dc816c6f28 Track retweet lists in notifications alongside tweets 2024-08-28 19:20:29 -07:00
Alessio
b8db01c0ab Add test file for previous commit 2024-08-26 00:19:24 -07:00
Alessio
2c7c35e23f Add pagination to notifications
- handle notifications end-of-feed
- add support for "quoted" and "mentioned" notifications
2024-08-26 00:07:10 -07:00
Alessio
b77612c66f Add SQL queries for notifications 2024-08-25 22:54:18 -07:00
Alessio
e94327b153 Add current user ID to notifications' UserID field (notification recient) when parsing
- also make an enum for notification types instead of magic numbers in the code
2024-08-25 22:48:09 -07:00
Alessio
fae23cc243 Fix lint errors 2024-08-25 19:01:52 -07:00
Alessio
eafdbf5ffb Add Notifications parser 2024-08-25 16:27:49 -07:00
Alessio
c45b8e7ad8 Add --delay flag to force a delay between requests in a large paginated scrape 2024-08-19 18:20:12 -07:00
Alessio
4990e7913d Fix lint error 2024-08-19 17:29:54 -07:00
Alessio
e2ca9a975a Remove scraper singleton pattern entirely 2024-08-19 17:27:54 -07:00
Alessio
08e5a726a8 Remove most uses of the singleton pattern in 'scraper' package 2024-08-19 16:45:32 -07:00
Alessio
91f722b7fa Scraper requests now report invalidated or expired sessions 2024-08-18 16:22:37 -07:00
Alessio
24129c4852 REFACTOR: reduce technical debt, particularly that caused by singleton pattern in pkg/scraper
- ensure all scraper functions have a `api.XYZ` version and a package-level convenience function
	- isolate `the_api` to top-level convenience functions, in preparation for removal
- move a bunch of scraper functions around to be nearby their related functions
- new ErrLoginRequired
- remove obsolete APIv1 stuff (Feed, TweetDetail)
- rename scraper function GetUserFeedGraphqlFor => GetUserFeed
- fix go.mod Go version incorrectly claiming it's compatible with Go 1.16 (should be Go 1.17)
2024-08-09 19:48:50 -07:00
Alessio
b4384f9cc4 Enable sending emoji reactions in DMs 2024-07-28 14:09:41 -07:00
Alessio
8aca12695b Handle media download 404s gracefully 2024-07-28 12:50:00 -07:00
Alessio
d0c23b392b Don't crash when receiving a Grok card 2024-07-28 08:07:33 -07:00
Alessio
98015ddf0c REFACTOR: remove DMTrove type, merge it into TweetTrove 2024-07-28 08:01:58 -07:00
Alessio
ef15e8a306 Handle guest token / session initialization when not connected to internet 2024-07-14 13:20:44 -07:00
Alessio
0fd17f1af0 Fix getting tweets from deleted accounts 2024-07-14 12:52:07 -07:00
Alessio
d2045028a7 Improve closing message when a scrape ends due to rate-limiting 2024-06-11 12:32:58 -07:00
Alessio
a1faacaf6b Don't throw away all previous results if you get rate-limited, just save them 2024-06-10 19:59:08 -07:00
Alessio
cac87ec973 Save tweets with unknown user as <UNKNOWN USER> instead of UserID 0, which causes foreign key errors 2024-06-10 15:47:13 -07:00
Alessio
14a7f88cdb Add scraping and saving of bookmarks 2024-05-28 21:55:42 -07:00
Alessio
42bf8ec06a Fix a bug sending empty POST bodies 2024-05-11 10:58:33 -07:00
Alessio
f927507089 Enable marking DMs as read 2024-05-10 22:09:48 -07:00
Alessio
5cbf96f379 Fix unusual bug when scraping tweets which only happens if the tweet has a link and the text is exactly 22 chars long 2024-05-05 11:59:58 -07:00
Alessio
5d727487ba Fix lint error 2024-05-05 11:23:06 -07:00
Alessio
71230e3886 Fix 3 bugs in DMs
- escape HTML entities in messages so "<" doesn't scrape as "&lt;"
- escape quotes in sent messages, so they no longer fail to send
- fix the web UI receiving the last 50 messages after every sent message instead of just the new ones
2024-05-05 10:58:39 -07:00
Alessio
341ddaa5c2 Add Participants view to group chats 2024-05-04 21:41:15 -07:00
Alessio
1b3c5d0ed3 Add timeout error handling for scraper requests to the request body download as well (rather than just headers) 2024-04-13 16:10:23 -07:00
158710df8d Update User Feed graphql endpoint so it can get long tweets 2024-03-22 21:39:13 -07:00
750ad3031a Fix lint error 2024-03-18 22:32:30 -07:00
898ad163ae Add parsing of pinned tweets on scraped user feeds 2024-03-18 22:20:36 -07:00
ca7cf613f9 Remove unnecessary import 2024-03-18 21:16:38 -07:00
3967367eed Fix more lint errors 2024-03-18 21:15:27 -07:00
69e0a35e57 Handle HTTP request timeouts 2024-03-16 19:55:05 -07:00
73c5803a47 Add downloading of DM embedded images, videos and links 2024-03-11 21:12:38 -07:00
0ad3cf8fb8 Fix lint errors 2024-03-11 14:08:07 -07:00
aa05708e20 Move media downloader from persistence to scraper package; add 429 Rate Limited error type 2024-03-11 12:57:58 -07:00
1ba4f91463 REFACTOR: replace 'log.Debug(fmt.Sprintf(...))' with 'log.Debugf(...)' and remove 'scraper.' prefix in utils_test.go 2024-03-10 19:14:27 -07:00
07233a7153 Fix parsing of other media items in DMs as 'urls' 2024-03-09 15:11:33 -08:00
f18a2e529c Add test files for previous commit 2024-03-08 17:30:27 -08:00
0d12166966 Add parsing of DM images, videos and URLs 2024-03-08 17:22:52 -08:00
2d14621cc0 FIXUP: Add test file for previous commit 2024-03-03 14:58:41 -08:00
b076f4d2f2 Scraper: handle videos being geo-blocked, which was panicking the downloader 2024-03-03 14:56:36 -08:00