JustAnotherArchivist
0168d50f62
Automatically fix capitalisation of Facebook and Twitter usernames
5 vuotta sitten
JustAnotherArchivist
db0104b3c8
Get correct capitalisation for a Facebook username
5 vuotta sitten
JustAnotherArchivist
4a1a9a10e0
Allow overriding the "remote filename"
5 vuotta sitten
JustAnotherArchivist
769f95808e
Add ix.io upload script
5 vuotta sitten
JustAnotherArchivist
c79721337b
+x
5 vuotta sitten
JustAnotherArchivist
c30dcf5985
Finding outdated Mastodon instances
5 vuotta sitten
JustAnotherArchivist
1748a6b607
Better workaround for the 5000 results limit; works for FoolFuuka 2.0.1 and up
5 vuotta sitten
JustAnotherArchivist
fd680551df
Add Bing, Reddit/Pushshift, and FoolFuuka scrapers
5 vuotta sitten
JustAnotherArchivist
ede77ad142
Filter Twitter hashtag scrapes based on account scrapes
5 vuotta sitten
JustAnotherArchivist
57ef544c6c
Fix line endings
5 vuotta sitten
JustAnotherArchivist
07c3e7baaa
Add snscrape helpers
5 vuotta sitten
JustAnotherArchivist
b7e3a703d8
Monitor how a pipeline's wget processes are faring
5 vuotta sitten
JustAnotherArchivist
168f61b39a
Quote filename so it works with any weird characters in the paths
(Last reconstructed commit from text file full of different versions)
5 vuotta sitten
JustAnotherArchivist
8f77c8c72a
xargs -r flag to not run the second find if the first produces no results (GNU extension)
5 vuotta sitten
JustAnotherArchivist
9d7a4096f9
Pipe into second find directly
5 vuotta sitten
JustAnotherArchivist
e3a4bf6a47
Replace slow lsof with procfs access
5 vuotta sitten
JustAnotherArchivist
4a83a54616
Print host for each stuck request
5 vuotta sitten
JustAnotherArchivist
2b2c65f034
Print PID
5 vuotta sitten
JustAnotherArchivist
fadb70e297
Fixed version which handles multiple roots correctly
5 vuotta sitten
JustAnotherArchivist
d10a1d3675
First set of little things
5 vuotta sitten
JustAnotherArchivist
a00607f28e
Initial commit
5 vuotta sitten
JustAnotherArchivist
2a41f169c5
Add -c option to cast the return value of shutdown(2) to int explicitly on broken machines
6 vuotta sitten
JustAnotherArchivist
8ffb48fb1b
Remove set -e/errexit, which causes the script to silently fail when no process is found with -j
6 vuotta sitten
JustAnotherArchivist
632fbcb4d0
Replace kill with ps in process existence check
kill returns the same status whether a process doesn't exist or the current user doesn't have permission to kill, so the script returned a confusing error message in the latter case.
6 vuotta sitten
JustAnotherArchivist
4f3cfc6e56
Add check for ptrace scope
6 vuotta sitten
JustAnotherArchivist
96a329578e
Refactor
6 vuotta sitten
JustAnotherArchivist
1e7ec4a56e
Executable bit
6 vuotta sitten
JustAnotherArchivist
73877ecb96
Initial commit
6 vuotta sitten
JustAnotherArchivist
10715f1d3a
Rewrite GDB command to stop on the first error, e.g. if lsof is broken.
The use of call("echo 'string'") instead of print('string') or sys.stdout.write('string') is due to the latter two not reliably reporting back whether they were successful or not: print doesn't return anything (and actually can't be chained like this), and the return value of sys.stdout.write depends on the Python version (None on Python 2, number of bytes written on Python 3).
6 vuotta sitten
JustAnotherArchivist
103640a311
Make kill-wpull-connections executable
6 vuotta sitten
JustAnotherArchivist
f7dc46991c
Check whether lsof and gdb are available
6 vuotta sitten
JustAnotherArchivist
64e815b9a5
Better way of finding the PID for ArchiveBot jobs
6 vuotta sitten
JustAnotherArchivist
290a4bf518
Filter out the script from the PID list when using -j
6 vuotta sitten
JustAnotherArchivist
2787d9cd51
Initial commit
Imported from https://gist.github.com/JustAnotherArchivist/d1be04b4afec99f512ea9c3a7ffcb055
6 vuotta sitten