JustAnotherArchivist
761606a5be
Add options to pass the URL context out through warc-dump-responses and http-response-bodies
1 rok temu
JustAnotherArchivist
9228c23ae6
Fix off-by-one error on WARC-Type parsing
1 rok temu
JustAnotherArchivist
a79291e081
Fix debug output for small/empty buffer
1 rok temu
JustAnotherArchivist
1737842841
Add http-response-bodies
1 rok temu
JustAnotherArchivist
9d60d7d3d7
Add comment about spec compliance
1 rok temu
JustAnotherArchivist
2e579964f9
More debug output
1 rok temu
JustAnotherArchivist
a432631d9b
Replace memmove with pointer arithmetic
1 rok temu
JustAnotherArchivist
27950fdc52
Check state at the input end
1 rok temu
JustAnotherArchivist
ead56c14a6
Remove dead code
1 rok temu
JustAnotherArchivist
882343eee4
Fix missing trailing LF on errors
1 rok temu
JustAnotherArchivist
dfc809abb4
Fix make exiting 1 if test script is missing
1 rok temu
JustAnotherArchivist
acd2fab899
Add warc-dump-responses
1 rok temu
JustAnotherArchivist
512ced5ebd
Make test script optional
1 rok temu
JustAnotherArchivist
67b12f645f
Fix exit statuses of ia-upload-stream and ia-wait-item-tasks
1 rok temu
JustAnotherArchivist
6a76814ec5
Add crude in-progress upload listing
1 rok temu
JustAnotherArchivist
34a3c9d0f3
Use _type instead of key check hack
1 rok temu
JustAnotherArchivist
ec20f38c82
Handle nested playlists
1 rok temu
JustAnotherArchivist
8386d33323
Add wpull2-log-colourise
1 rok temu
JustAnotherArchivist
a4e05d8932
Fix TypeError
1 rok temu
JustAnotherArchivist
0435954e65
Print net queue size
1 rok temu
JustAnotherArchivist
9f31ba8828
Add archivebot-fix-queue-counters
1 rok temu
JustAnotherArchivist
8d267c7f46
Add bencode2json
1 rok temu
JustAnotherArchivist
98adc6cfac
Exclude backslashes in channel patterns
1 rok temu
JustAnotherArchivist
a07c2b2374
Fix handling of invalid UTF-8 input
1 rok temu
JustAnotherArchivist
725db7d05d
Fix confusing output for skipped lines
1 rok temu
JustAnotherArchivist
3fca23c0a0
Fix pagination on users
1 rok temu
JustAnotherArchivist
c2f6f5054c
Handle actual 429
1 rok temu
JustAnotherArchivist
ccf4d678fb
Allow negative offsets to peek near the end of the file
2 lat temu
JustAnotherArchivist
4798154e98
Fix URLs without a path
2 lat temu
JustAnotherArchivist
1830d67283
Add ia-cdx-search-subdomains
2 lat temu
JustAnotherArchivist
565be7bf1b
Fix
2 lat temu
JustAnotherArchivist
e2085e6c81
Add cloudflare-email-decode
2 lat temu
JustAnotherArchivist
73f35f5591
Fix infinite loop when file ends with something that is not a WARC record
2 lat temu
JustAnotherArchivist
06d60a798c
Bump read size
2 lat temu
JustAnotherArchivist
3e0b70be6b
Handle processes with too many open connections
2 lat temu
JustAnotherArchivist
df7b25c2db
Error on unknown options
2 lat temu
JustAnotherArchivist
4bd4f5a30c
Fix 'Argument list too long' error when using --urls-from-stdin with many URLs
2 lat temu
JustAnotherArchivist
e20d35a553
Fix crash on 429
2 lat temu
JustAnotherArchivist
cef61434a0
Add --urls-from-stdin
2 lat temu
JustAnotherArchivist
b5cf04947b
Add Wasabi
2 lat temu
JustAnotherArchivist
d2afd1309d
Add s3-bucket-find-direct-url
2 lat temu
JustAnotherArchivist
95988466ec
Make S3 response pattern matching more flexible (so it also works on Scaleway)
2 lat temu
JustAnotherArchivist
a9a03d3a00
Add urlsort
2 lat temu
JustAnotherArchivist
9798cc1188
Typo
2 lat temu
JustAnotherArchivist
d193637e5e
Add kill-connections
2 lat temu
JustAnotherArchivist
6cfe8e51ba
Make job a global variable in --pyfilter expressions so it can be used in genexps
2 lat temu
JustAnotherArchivist
a4627fa1c6
Queue derives with `ia tasks` instead of this manual curl rubbish
2 lat temu
JustAnotherArchivist
c68b310afc
Always print the parts value if there is an upload ID
Previously, parts wouldn't be printed if it was an empty list. This made resuming uploads that crashed in the first part harder than necessary.
2 lat temu
JustAnotherArchivist
fdc3c3d69e
Support float values for --partsize with M or G suffix
2 lat temu
JustAnotherArchivist
002c1eb7ae
Wait until item exists
IA doesn't immediately create the item on CreateMultipartUpload, so if it didn't already exist, UploadPart would fail for a while and we'd waste bandwidth.
2 lat temu