ca206e1
(HEAD -> master)
Disable deletion by default by
2024-04-08 03:10:07 +0000
6959835
Fix single-part upload by
2024-04-07 03:51:33 +0000
dfd0156
Fix org listings not including archived repos by
2024-04-07 03:29:42 +0000
6b6cbcf
Reuse existing BytesIO object by
2024-04-07 01:55:57 +0000
5994af0
Rename --partsize to --part-size for consistency with other options by
2024-04-07 01:53:41 +0000
81bba9a
Add --size-hint option by
2024-02-29 22:37:42 +0000
8c31df9
Add support for redesigned org repo list by
2024-02-14 23:04:31 +0000
a5bdbe6
Add uniqify-recent by
2024-02-07 02:15:06 +0000
2ccf28e
Add moinmoin-url-list by
2023-12-24 05:59:02 +0000
08059f7
Replace archivebot-high-memory with more capable archivebot-high-resources by
2023-12-19 22:10:39 +0000
c16bd0a
Add archivebot-compress-db by
2023-12-07 16:26:30 +0000
f3bec23
Remove filtering of onsite URLs because it's unreliable by
2023-12-06 17:53:50 +0000
53535b9
Add wpull2-extract-ignored-offsite and extract-urls-for-archiveteam-projects by
2023-12-06 17:43:59 +0000
0432bd0
Avoid float roundtrip for integer values by
2023-11-18 08:00:11 +0000
a596778
Add archivebot-pipelines-count-jobs by
2023-11-14 05:17:07 +0000
18a96ba
Add test for warc-dump-responses by
2023-11-04 04:40:01 +0000
bf79252
Fix error when the terminating CRLFCRLF of a record is truncated by
2023-11-04 04:38:24 +0000
c192e0c
Fix false positive warning about possibly uninitialised record_bytes_read by
2023-11-04 04:37:47 +0000
d3eb01a
Fix extraction of search results by
2023-10-16 21:27:24 +0000
7e45845
Add support for PermanentRedirect error responses by
2023-10-16 11:15:37 +0000
5ac5aac
Enable line buffering on list URLs FD by
2023-09-18 14:40:56 +0000
8d48785
Fix extra LF between chunks by
2023-07-15 21:41:25 +0000
4ff212e
Fix empty files being considered valid WARCs by
2023-07-11 16:38:47 +0000
828dae2
Raise an error when verification fails by
2023-07-11 10:05:11 +0000
ddc9dc6
Handle and by
2023-06-06 21:36:01 +0000
a85ffe7
Filter out lines with invalid UTF-8 by
2023-06-06 21:08:40 +0000
7f08092
Catch urljoin exceptions (e.g. invalid IPv6) by
2023-06-06 20:54:08 +0000
44260ed
Fix in-progress upload listing by
2023-05-24 04:02:50 +0000
e184bd5
Fix unused argc and argv error by
2023-05-16 02:27:36 +0000
b78820f
Add dir-to-ia by
2023-05-06 23:24:10 +0000
7e01aae
Handle error tasks by exiting non-zero by
2023-05-06 23:23:11 +0000
b51a7c9
Fall back to response length when there is neither Content-Length nor Transfer-Encoding in an HTTP/1.1 response by
2023-05-03 05:04:41 +0000
93e4140
Add support for malformed LF HTTP responses by
2023-05-03 04:56:04 +0000
9879db1
Proper HTTP/1.0 support by
2023-05-03 04:18:11 +0000
a1e2e26
Fix warning by
2023-04-29 22:39:26 +0000
2163d74
Fix warning by
2023-04-29 22:38:25 +0000
ccafb1e
Only search within headers by
2023-04-29 21:35:02 +0000
427884a
Fix warnings by
2023-04-29 21:34:36 +0000
38d8be5
Warnings are bad, mmkay? by
2023-04-29 21:33:45 +0000
b644d3f
Rebuild when .make-and-exec changes by
2023-04-29 21:33:34 +0000
448e624
Fix UB in memcasemem when no match is found by
2023-04-29 21:06:47 +0000
90616b0
Improve debug compilation options by
2023-04-29 21:05:45 +0000
887c063
Add support for non-standard header capitalisation by
2023-04-29 05:26:48 +0000
af25c10
Add support for HTTP 1.0 by
2023-04-29 03:50:43 +0000
6bc6c13
Get rid of Makefile for more control; add proper debug build support by
2023-04-29 03:44:56 +0000
06d8155
Fix --no-derive and --clobber options not working for single-part uploads by
2023-04-02 17:03:07 +0000
ebfc78e
More retries on item existence check by
2023-03-30 06:30:13 +0000
62cee00
Upload files smaller than a single part without using the multipart API by
2023-03-30 06:05:51 +0000
3db8841
Clear line before completion message of progress bar by
2023-03-28 21:28:55 +0000
edf1dd4
Add timeouts by
2023-03-28 21:21:33 +0000
0933c2a
Print progress less frequently by
2023-03-28 20:37:12 +0000
69c718a
Not-so-new new ArchiveBot domain by
2023-03-25 04:16:48 +0000
3378969
Add support for IA_S3_{ACCESS,SECRET} environment variables by
2023-03-11 02:20:57 +0000
5a8bab3
Fix negative ints by
2023-02-22 23:36:30 +0000
232a430
Fix single-file torrents by
2023-02-22 23:02:38 +0000
568cf9a
Add files mode by
2023-02-22 22:54:32 +0000
3b0201c
Fix infohash by
2023-02-22 22:27:29 +0000
1977b23
Fix random BrokenPipeError on exiting Python processes by
2023-02-07 20:48:54 +0000
e3380e6
Fix 'binary' lines by
2023-02-07 20:48:44 +0000
2d4546f
Fix errors on sscanf by
2023-02-07 20:16:20 +0000
8d2b04c
Add torrent-tiny by
2023-02-02 09:04:08 +0000
5eae0c4
Add header mode (e.g. for tasks API) by
2023-02-01 05:38:59 +0000
0f8a22f
Add curl-ia by
2023-02-01 05:26:16 +0000
c9bf3a9
Filter out lines without an attribute value by
2023-01-24 08:29:44 +0000
98ebc66
Silence BrokenPipeError by
2023-01-24 07:43:46 +0000
511405b
Fix case sensitivity on img srcset processing by
2023-01-24 07:41:34 +0000
6acea5d
Add html-extract-stupid by
2023-01-24 07:29:45 +0000
3440da3
Fix output sometimes appearing after prompt by
2023-01-24 04:10:43 +0000
75999e9
Make --name a normal mode by
2023-01-23 07:22:26 +0000
9c1f803
Get rid of shell quoting and print name/fullname on separate lines instead by
2023-01-23 07:21:47 +0000
5ba7d26
Fix error when no arguments are provided by
2023-01-17 18:46:40 +0000
ea27e35
Add optional username and fullname extraction by
2023-01-17 18:09:22 +0000
65a47d5
Fix header matches potentially occurring in the record body by
2023-01-10 19:37:42 +0000
10c7ab0
Fix off-by-one error for non-chunked responses by
2023-01-10 18:00:40 +0000
761606a
Add options to pass the URL context out through warc-dump-responses and http-response-bodies by
2023-01-09 20:55:10 +0000
9228c23
Fix off-by-one error on WARC-Type parsing by
2023-01-09 20:53:51 +0000
a79291e
Fix debug output for small/empty buffer by
2023-01-09 20:52:56 +0000
1737842
Add http-response-bodies by
2023-01-09 18:36:39 +0000
9d60d7d
Add comment about spec compliance by
2023-01-02 00:09:11 +0000
2e57996
More debug output by
2023-01-02 00:00:00 +0000
a432631
Replace memmove with pointer arithmetic by
2023-01-01 23:59:27 +0000
27950fd
Check state at the input end by
2023-01-01 23:57:21 +0000
ead56c1
Remove dead code by
2023-01-01 23:56:43 +0000
882343e
Fix missing trailing LF on errors by
2023-01-01 23:55:43 +0000
dfc809a
Fix make exiting 1 if test script is missing by
2023-01-01 23:16:15 +0000
acd2fab
Add warc-dump-responses by
2023-01-01 23:00:24 +0000
512ced5
Make test script optional by
2023-01-01 22:59:43 +0000
67b12f6
Fix exit statuses of ia-upload-stream and ia-wait-item-tasks by
2023-01-01 22:56:15 +0000
6a76814
Add crude in-progress upload listing by
2022-12-20 06:59:27 +0000
34a3c9d
Use _type instead of key check hack by
2022-12-03 23:01:39 +0000
ec20f38
Handle nested playlists by
2022-12-03 19:22:44 +0000
8386d33
Add wpull2-log-colourise by
2022-11-28 19:29:51 +0000
a4e05d8
Fix TypeError by
2022-11-28 00:18:10 +0000
0435954
Print net queue size by
2022-11-28 00:17:24 +0000
9f31ba8
Add archivebot-fix-queue-counters by
2022-11-15 03:37:31 +0000
8d267c7
Add bencode2json by
2022-10-01 18:53:10 +0000
98adc6c
Exclude backslashes in channel patterns by
2022-09-15 05:18:42 +0000
a07c2b2
Fix handling of invalid UTF-8 input by
2022-09-15 05:18:21 +0000
725db7d
Fix confusing output for skipped lines by
2022-09-01 09:00:35 +0000
3fca23c
Fix pagination on users by
2022-08-29 02:08:39 +0000