JustAnotherArchivist
ca206e162e
Disable deletion by default
pirms 2 nedēļām
JustAnotherArchivist
6959835012
Fix single-part upload
pirms 2 nedēļām
JustAnotherArchivist
dfd01567cb
Fix org listings not including archived repos
While the 'All' listing currently only excludes archived repos, this is a more general solution that, as long as they don't redesign the repository list yet again, should work even with further category display changes.
pirms 2 nedēļām
JustAnotherArchivist
6b6cbcf840
Reuse existing BytesIO object
pirms 2 nedēļām
JustAnotherArchivist
5994af0019
Rename --partsize to --part-size for consistency with other options
pirms 2 nedēļām
JustAnotherArchivist
81bba9a631
Add --size-hint option
pirms 1 mēnesi
JustAnotherArchivist
8c31df93a0
Add support for redesigned org repo list
pirms 2 mēnešiem
JustAnotherArchivist
a5bdbe6b57
Add uniqify-recent
pirms 2 mēnešiem
JustAnotherArchivist
2ccf28eb43
Add moinmoin-url-list
pirms 4 mēnešiem
JustAnotherArchivist
08059f7441
Replace archivebot-high-memory with more capable archivebot-high-resources
pirms 4 mēnešiem
JustAnotherArchivist
c16bd0a477
Add archivebot-compress-db
pirms 4 mēnešiem
JustAnotherArchivist
f3bec23348
Remove filtering of onsite URLs because it's unreliable
It also erroneously filters out offsite URLs that contain the root domain, and this isn't fixable without using regex, which isn't always available in the SQLite CLI before version 3.36.0.
pirms 4 mēnešiem
JustAnotherArchivist
53535b925a
Add wpull2-extract-ignored-offsite and extract-urls-for-archiveteam-projects
pirms 4 mēnešiem
JustAnotherArchivist
0432bd00c2
Avoid float roundtrip for integer values
pirms 5 mēnešiem
JustAnotherArchivist
a596778f79
Add archivebot-pipelines-count-jobs
pirms 5 mēnešiem
JustAnotherArchivist
18a96ba246
Add test for warc-dump-responses
pirms 5 mēnešiem
JustAnotherArchivist
bf79252001
Fix error when the terminating CRLFCRLF of a record is truncated
pirms 5 mēnešiem
JustAnotherArchivist
c192e0c5d3
Fix false positive warning about possibly uninitialised record_bytes_read
pirms 5 mēnešiem
JustAnotherArchivist
d3eb01afc3
Fix extraction of search results
pirms 6 mēnešiem
JustAnotherArchivist
7e458457d6
Add support for PermanentRedirect error responses
pirms 6 mēnešiem
JustAnotherArchivist
5ac5aacd04
Enable line buffering on list URLs FD
pirms 7 mēnešiem
JustAnotherArchivist
8d48785caf
Fix extra LF between chunks
pirms 9 mēnešiem
JustAnotherArchivist
4ff212eb20
Fix empty files being considered valid WARCs
pirms 9 mēnešiem
JustAnotherArchivist
828dae2597
Raise an error when verification fails
pirms 9 mēnešiem
JustAnotherArchivist
ddc9dc6b44
Handle and
pirms 10 mēnešiem
JustAnotherArchivist
a85ffe791b
Filter out lines with invalid UTF-8
pirms 10 mēnešiem
JustAnotherArchivist
7f0809270b
Catch urljoin exceptions (e.g. invalid IPv6)
pirms 10 mēnešiem
JustAnotherArchivist
44260ed92e
Fix in-progress upload listing
pirms 11 mēnešiem
JustAnotherArchivist
e184bd50fb
Fix unused argc and argv error
pirms 11 mēnešiem
JustAnotherArchivist
b78820f8ef
Add dir-to-ia
pirms 11 mēnešiem
JustAnotherArchivist
7e01aaefe2
Handle error tasks by exiting non-zero
pirms 11 mēnešiem
JustAnotherArchivist
b51a7c9514
Fall back to response length when there is neither Content-Length nor Transfer-Encoding in an HTTP/1.1 response
pirms 11 mēnešiem
JustAnotherArchivist
93e4140295
Add support for malformed LF HTTP responses
pirms 11 mēnešiem
JustAnotherArchivist
9879db1195
Proper HTTP/1.0 support
HTTP/1.0 does not mandate a Content-Length header in responses since keep-alive connections aren't a thing; the connection closure then signals the end of the response.
This change requires the URL metadata line for processing HTTP/1.0 data, so it plays well with `warc-dump-responses --meta`.
pirms 11 mēnešiem
JustAnotherArchivist
a1e2e26a3f
Fix warning
pirms 1 gada
JustAnotherArchivist
2163d745fd
Fix warning
pirms 1 gada
JustAnotherArchivist
ccafb1eb51
Only search within headers
pirms 1 gada
JustAnotherArchivist
427884af5e
Fix warnings
pirms 1 gada
JustAnotherArchivist
38d8be57f2
Warnings are bad, mmkay?
pirms 1 gada
JustAnotherArchivist
b644d3f454
Rebuild when .make-and-exec changes
pirms 1 gada
JustAnotherArchivist
448e624b65
Fix UB in memcasemem when no match is found
pirms 1 gada
JustAnotherArchivist
90616b0d5f
Improve debug compilation options
pirms 1 gada
JustAnotherArchivist
887c063533
Add support for non-standard header capitalisation
pirms 1 gada
JustAnotherArchivist
af25c108ba
Add support for HTTP 1.0
pirms 1 gada
JustAnotherArchivist
6bc6c13427
Get rid of Makefile for more control; add proper debug build support
pirms 1 gada
JustAnotherArchivist
06d8155a10
Fix --no-derive and --clobber options not working for single-part uploads
pirms 1 gada
JustAnotherArchivist
ebfc78ef3a
More retries on item existence check
pirms 1 gada
JustAnotherArchivist
62cee00ebe
Upload files smaller than a single part without using the multipart API
pirms 1 gada
JustAnotherArchivist
3db8841ed1
Clear line before completion message of progress bar
pirms 1 gada
JustAnotherArchivist
edf1dd417c
Add timeouts
pirms 1 gada