JustAnotherArchivist
bf79252001
Fix error when the terminating CRLFCRLF of a record is truncated
7 months ago
JustAnotherArchivist
c192e0c5d3
Fix false positive warning about possibly uninitialised record_bytes_read
7 months ago
JustAnotherArchivist
d3eb01afc3
Fix extraction of search results
8 months ago
JustAnotherArchivist
7e458457d6
Add support for PermanentRedirect error responses
8 months ago
JustAnotherArchivist
5ac5aacd04
Enable line buffering on list URLs FD
8 months ago
JustAnotherArchivist
8d48785caf
Fix extra LF between chunks
11 months ago
JustAnotherArchivist
4ff212eb20
Fix empty files being considered valid WARCs
11 months ago
JustAnotherArchivist
828dae2597
Raise an error when verification fails
11 months ago
JustAnotherArchivist
ddc9dc6b44
Handle and
1 year ago
JustAnotherArchivist
a85ffe791b
Filter out lines with invalid UTF-8
1 year ago
JustAnotherArchivist
7f0809270b
Catch urljoin exceptions (e.g. invalid IPv6)
1 year ago
JustAnotherArchivist
44260ed92e
Fix in-progress upload listing
1 year ago
JustAnotherArchivist
e184bd50fb
Fix unused argc and argv error
1 year ago
JustAnotherArchivist
b78820f8ef
Add dir-to-ia
1 year ago
JustAnotherArchivist
7e01aaefe2
Handle error tasks by exiting non-zero
1 year ago
JustAnotherArchivist
b51a7c9514
Fall back to response length when there is neither Content-Length nor Transfer-Encoding in an HTTP/1.1 response
1 year ago
JustAnotherArchivist
93e4140295
Add support for malformed LF HTTP responses
1 year ago
JustAnotherArchivist
9879db1195
Proper HTTP/1.0 support
HTTP/1.0 does not mandate a Content-Length header in responses since keep-alive connections aren't a thing; the connection closure then signals the end of the response.
This change requires the URL metadata line for processing HTTP/1.0 data, so it plays well with `warc-dump-responses --meta`.
1 year ago
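The body-length rule described in the "Proper HTTP/1.0 support" commit can be illustrated with a small Python sketch. This is not the repository's implementation; the function name `extract_body` and its handling of edge cases are assumptions made for illustration. It takes one recorded response, honours `Content-Length` when present (matching header names case-insensitively), and otherwise treats everything up to the end of the recorded data as the body, since connection closure marks the end of the response.

```python
def extract_body(raw: bytes) -> bytes:
    """Return the body of a single recorded HTTP response (hypothetical helper)."""
    head, sep, rest = raw.partition(b"\r\n\r\n")
    if not sep:
        raise ValueError("header block is not terminated by CRLFCRLF")

    # Collect headers with lower-cased names so capitalisation does not matter.
    headers = {}
    for line in head.split(b"\r\n")[1:]:  # skip the status line
        name, colon, value = line.partition(b":")
        if colon:
            headers[name.strip().lower()] = value.strip()

    if b"transfer-encoding" in headers:
        # Chunked decoding is deliberately left out of this sketch.
        raise NotImplementedError("Transfer-Encoding is not handled here")

    if b"content-length" in headers:
        length = int(headers[b"content-length"].decode("ascii"))
        return rest[:length]

    # No length information: the body runs until the connection was closed,
    # which for a recorded response means everything that is left.
    return rest
```

For example, `extract_body(b"HTTP/1.0 200 OK\r\nServer: x\r\n\r\nhello")` yields the full remaining payload because no length header is present.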
JustAnotherArchivist
a1e2e26a3f
Fix warning
1 year ago
JustAnotherArchivist
2163d745fd
Fix warning
1 year ago
JustAnotherArchivist
ccafb1eb51
Only search within headers
1 year ago
JustAnotherArchivist
427884af5e
Fix warnings
1 year ago
JustAnotherArchivist
38d8be57f2
Warnings are bad, mmkay?
1 year ago
JustAnotherArchivist
b644d3f454
Rebuild when .make-and-exec changes
1 year ago
JustAnotherArchivist
448e624b65
Fix UB in memcasemem when no match is found
1 year ago
JustAnotherArchivist
90616b0d5f
Improve debug compilation options
1 year ago
JustAnotherArchivist
887c063533
Add support for non-standard header capitalisation
1 year ago
JustAnotherArchivist
af25c108ba
Add support for HTTP 1.0
1 year ago
JustAnotherArchivist
6bc6c13427
Get rid of Makefile for more control; add proper debug build support
1 year ago
JustAnotherArchivist
06d8155a10
Fix --no-derive and --clobber options not working for single-part uploads
1 year ago
JustAnotherArchivist
ebfc78ef3a
More retries on item existence check
1 year ago
JustAnotherArchivist
62cee00ebe
Upload files smaller than a single part without using the multipart API
1 year ago
JustAnotherArchivist
3db8841ed1
Clear line before completion message of progress bar
1 year ago
JustAnotherArchivist
edf1dd417c
Add timeouts
1 year ago
JustAnotherArchivist
0933c2adc0
Print progress less frequently
1 year ago
JustAnotherArchivist
69c718a2b2
Not-so-new new ArchiveBot domain
1 year ago
JustAnotherArchivist
337896958a
Add support for IA_S3_{ACCESS,SECRET} environment variables
1 year ago
JustAnotherArchivist
5a8bab3a01
Fix negative ints
1 year ago
JustAnotherArchivist
232a430946
Fix single-file torrents
1 year ago
JustAnotherArchivist
568cf9ade6
Add files mode
1 year ago
JustAnotherArchivist
3b0201cdeb
Fix infohash
1 year ago
JustAnotherArchivist
1977b237e5
Fix random BrokenPipeError on exiting Python processes
Just catching the BrokenPipeError is not sufficient. There might still be data in the stdout buffer, which then causes the following error when Python shuts down and flushes it:
Exception ignored in: <_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>
BrokenPipeError: [Errno 32] Broken pipe
To handle this, redirect stdout to /dev/null in the exception handler before exiting: https://docs.python.org/3.11/library/signal.html#note-on-sigpipe
1 year ago
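The approach described in the BrokenPipeError commit follows the pattern from the linked Python documentation. Below is a minimal sketch of it, not the repository's actual code; the helper names `emit_lines`, `silence_stdout` and `main` are hypothetical. Output is flushed inside the `try` block so the error surfaces there, and on failure file descriptor 1 is pointed at `/dev/null` so the flush performed at interpreter shutdown has nowhere to fail.

```python
import os
import sys


def emit_lines(lines, stream):
    """Write each line to stream; return False if the reader closed the pipe."""
    try:
        for line in lines:
            stream.write(line + "\n")
        # Flush inside the try block so a broken pipe surfaces here,
        # not later during interpreter shutdown.
        stream.flush()
    except BrokenPipeError:
        return False
    return True


def silence_stdout():
    """Redirect stdout's file descriptor to /dev/null before exiting."""
    devnull = os.open(os.devnull, os.O_WRONLY)
    os.dup2(devnull, sys.stdout.fileno())


def main():
    # Hypothetical workload: print many lines, e.g. when piped into `head`.
    if not emit_lines((str(i) for i in range(100000)), sys.stdout):
        silence_stdout()
        sys.exit(1)
```

A script that calls `main()` and is piped into something like `head -n 1` should then exit quietly with status 1 instead of printing the "Exception ignored" message.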
JustAnotherArchivist
e3380e6e2a
Fix 'binary' lines
1 year ago
JustAnotherArchivist
2d4546f274
Fix errors on sscanf
%n takes a signed int so is incompatible with size_t; further, sscanf could in theory return a negative value (EOF).
1 year ago
JustAnotherArchivist
8d2b04c56f
Add torrent-tiny
1 year ago
JustAnotherArchivist
5eae0c45c2
Add header mode (e.g. for tasks API)
1 year ago
JustAnotherArchivist
0f8a22f035
Add curl-ia
1 year ago
JustAnotherArchivist
c9bf3a93af
Filter out lines without an attribute value
1 year ago
JustAnotherArchivist
98ebc667f2
Silence BrokenPipeError
1 year ago
JustAnotherArchivist
511405bbca
Fix case sensitivity on img srcset processing
1 year ago