31 Révisions (master)

Auteur SHA1 Message Date
  JustAnotherArchivist 4ff212eb20 Fix empty files being considered valid WARCs il y a 10 mois
  JustAnotherArchivist 828dae2597 Raise an error when verification fails il y a 10 mois
  JustAnotherArchivist 73f35f5591 Fix infinite loop when file ends with something that is not a WARC record il y a 2 ans
  JustAnotherArchivist 06d60a798c Bump read size il y a 2 ans
  JustAnotherArchivist 74485c399b Require decompressed WARCs with warc-tiny il y a 2 ans
  JustAnotherArchivist fe0b020352 Add support for reading from stdin il y a 2 ans
  JustAnotherArchivist 5b731fbde1 Fix compatibility with wpull 2.x il y a 3 ans
  JustAnotherArchivist 743e0582ba Fix confusing error message when lxml is not installed il y a 3 ans
  JustAnotherArchivist 491a80a04b Add warc-tiny scrape command for parsing HTTP responses using wpull and extracting links il y a 3 ans
  JustAnotherArchivist 01274e461a Prevent constantly moving bytes around for better performance on large chunked records il y a 3 ans
  JustAnotherArchivist 4c90bacaed Shield values in colons with angled brackets il y a 3 ans
  JustAnotherArchivist f51adccd3f Add --meta mode for dump-responses which prefixes each line with information about the file and record il y a 3 ans
  JustAnotherArchivist 9cc1f41917 Pass the filename in NewFile events il y a 3 ans
  JustAnotherArchivist a38efc31b6 Introduce a way to provide additional arguments to processors il y a 3 ans
  JustAnotherArchivist 49376db51b Decode HTTP request bodies il y a 4 ans
  JustAnotherArchivist 34c1a58034 Fix detection of multiple transfer encodings il y a 4 ans
  JustAnotherArchivist 5982e131a4 Stop gracefully when encountering a SIGPIPE il y a 4 ans
  JustAnotherArchivist c13a1150df Add support for WARC/1.1 il y a 4 ans
  JustAnotherArchivist 376cde7b8c Fix broken block digest calculation on malformed HTTP responses il y a 4 ans
  JustAnotherArchivist b121cbd958 Write all log messages to stderr il y a 4 ans
  JustAnotherArchivist ed1270d988 Add support for upper-cased chunk lengths il y a 4 ans
  JustAnotherArchivist d4826abde2 Add record ID to log messages il y a 4 ans
  JustAnotherArchivist 552a4147c2 Fix not returning complete body for non-chunked responses il y a 4 ans
  JustAnotherArchivist f2e836d2e9 Add support for differently formatted digests il y a 5 ans
  JustAnotherArchivist 94c4f76570 Fix crash when a digest is missing from a record il y a 5 ans
  JustAnotherArchivist ef78a3318c Colour only the header field names but not the values il y a 5 ans
  JustAnotherArchivist 9ce4653094 Document colouring and usage il y a 5 ans
  JustAnotherArchivist e7c5d82254 Coloured WARCs?! il y a 5 ans
  JustAnotherArchivist 70b413f5c1 Better events: include raw WARC header data and separate HTTP requests into headers and body il y a 5 ans
  JustAnotherArchivist 641bc7a207 Fix infinite loop at end of WARC il y a 5 ans
  JustAnotherArchivist 859c75a591 Add tool for WARC verification and extraction il y a 5 ans