Archiving web sites
Archiving web sites
Posted Sep 25, 2018 14:48 UTC (Tue) by anarcat (subscriber, #66354)Parent article: Archiving web sites
As usual, here's the list of issues and patches generated while researching this article:
- fix broken link to specification in the WARC website
- sample Apache configuration for pywb
- make job status less chatty in ArchiveBot
- Debian packaging of the
ia
commandline tool
Posted Oct 4, 2018 17:58 UTC (Thu)
by anarcat (subscriber, #66354)
[Link]
The Pamplemousse crawl is now available on the Internet
Archive, it might end up in the wayback machine at some point if
the Archive curators think it is worth it.
As it turns out, I couldn't stop working on this topic and opened two more PRs upstream after submitting WARC files to the internet archive:
Archiving web sites
ia
documentationia