A framework for quick web archiving
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
JustAnotherArchivist 1678075a89 Log traceback on exceptions raised from an item 1 month ago
qwarc Log traceback on exceptions raised from an item 1 month ago
LICENSE Add LICENSE and README 10 months ago
README.md Add LICENSE and README 10 months ago
setup.py Python 3.7 compatibility 6 months ago

README.md

qwarc

qwarc is a framework for rapidly archiving a large number of URLs with little overhead. This is achieved primarily by using many parallel connections (including across multiple processes) and not employing any HTML parsing or other processing.

Use qwarc responsibly. It can easily overwhelm web servers.

License

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see https://www.gnu.org/licenses/.