63 Commits (5ca90c3b7d59dedcd244177d46693a57584b2158)

Author SHA1 Message Date
  JustAnotherArchivist 5ca90c3b7d Update tmux session commands 4 years ago
  JustAnotherArchivist 679923d37d Add support for Twitter hashtag extraction 4 years ago
  JustAnotherArchivist 663383830c Add support for lists 4 years ago
  JustAnotherArchivist d85d142def Handle parameters on Twitter URLs 5 years ago
  JustAnotherArchivist 5984565417 Handle Twitter URLs with trailing slash 5 years ago
  JustAnotherArchivist 8647ccaa8f Support subdomain-less Facebook URLs 5 years ago
  JustAnotherArchivist 66ec0c93c4 Handle more Facebook URLs 5 years ago
  JustAnotherArchivist baa8a566bd Add script for scraping MEP links from europarl.europa.eu 5 years ago
  JustAnotherArchivist c2413b2c4f Add ArchiveBot wiki list helper 5 years ago
  JustAnotherArchivist 72818019bc Extract external links from Twitter 5 years ago
  JustAnotherArchivist b262d893da Silence by default 5 years ago
  JustAnotherArchivist 6fb9587a2b More flexible normalisation 5 years ago
  JustAnotherArchivist 06be216f4c Print Instagram ignore immediately after upload instead of at the end 5 years ago
  JustAnotherArchivist 1be4ed829b Add helper for AB/chromebot-ing YouTube channels and users 5 years ago
  JustAnotherArchivist 2a7a4ea6dc Fix HTTPS handling 5 years ago
  JustAnotherArchivist a812cb5fc2 More snscrape helper tools 5 years ago
  JustAnotherArchivist 3ee3ffc340 Generate commands for Blogspot 5 years ago
  JustAnotherArchivist 5090a8ad02 Enumerate users on a Mastodon instance 5 years ago
  JustAnotherArchivist 0000d8ffd9 Add script to queue derive on IA 5 years ago
  JustAnotherArchivist 6dc711c54e Further helper scripts for snscrape: normalising usernames and extracting them from a list of URLs 5 years ago
  JustAnotherArchivist e3a37455ba Add uniqify 5 years ago
  JustAnotherArchivist 321067819c Proper script for tracking size of uploaded data 5 years ago
  JustAnotherArchivist 5c654cb16b Split out size formatting 5 years ago
  JustAnotherArchivist de2cdc0aae curl with ArchiveBot UA 5 years ago
  JustAnotherArchivist 89ccd68b59 Helper tools for snscrape and the wiki pages 5 years ago
  JustAnotherArchivist f2e836d2e9 Add support for differently formatted digests 5 years ago
  JustAnotherArchivist 94c4f76570 Fix crash when a digest is missing from a record 5 years ago
  JustAnotherArchivist ef78a3318c Colour only the header field names but not the values 5 years ago
  JustAnotherArchivist 9ce4653094 Document colouring and usage 5 years ago
  JustAnotherArchivist e7c5d82254 Coloured WARCs?! 5 years ago
  JustAnotherArchivist 70b413f5c1 Better events: include raw WARC header data and separate HTTP requests into headers and body 5 years ago
  JustAnotherArchivist 641bc7a207 Fix infinite loop at end of WARC 5 years ago
  JustAnotherArchivist a700e8e2fe Add tcp-closer command 5 years ago
  JustAnotherArchivist 859c75a591 Add tool for WARC verification and extraction 5 years ago
  JustAnotherArchivist e867a2327f Replace urlencoded @ symbol 5 years ago
  JustAnotherArchivist cbd952024b Workaround for hash no longer needed with current transfer.sh code 5 years ago
  JustAnotherArchivist 61431c2054 Add VK scraping helper 5 years ago
  JustAnotherArchivist d6ff566c4d Instagram always uses lower-case usernames 5 years ago
  JustAnotherArchivist 138c2a2d39 Get rid of post-processing now that snscrape (dev version) has clean URLs 5 years ago
  JustAnotherArchivist 27b0d2da75 Better username capitalisation extraction method 5 years ago
  JustAnotherArchivist 3aa828a0ac transfer.kiska.pw -> transfer.notkiska.pw 5 years ago
  JustAnotherArchivist 63f4a8b3d3 transfer.sh -> transfer.kiska.pw 5 years ago
  JustAnotherArchivist 0168d50f62 Automatically fix capitalisation of Facebook and Twitter usernames 5 years ago
  JustAnotherArchivist db0104b3c8 Get correct capitalisation for a Facebook username 5 years ago
  JustAnotherArchivist 4a1a9a10e0 Allow overriding the "remote filename" 5 years ago
  JustAnotherArchivist 769f95808e Add ix.io upload script 5 years ago
  JustAnotherArchivist c79721337b +x 5 years ago
  JustAnotherArchivist c30dcf5985 Finding outdated Mastodon instances 5 years ago
  JustAnotherArchivist 1748a6b607 Better workaround for the 5000 results limit; works for FoolFuuka 2.0.1 and up 5 years ago
  JustAnotherArchivist fd680551df Add Bing, Reddit/Pushshift, and FoolFuuka scrapers 5 years ago