🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump
😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...
Right now, there seems to be no way to list all of the available archives by timestamp. Unless I am missing something, the only options are to call .oldest() or .newest(), or to request an archive .near() a specific timestamp. It would be great if it were possible to get a list of all available archives, which could then be individually fetched as desired with .get(), printed to get the archi
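As a rough illustration of the kind of listing being requested, here is a minimal sketch. It assumes the archives in question are Wayback Machine snapshots and bypasses the library entirely, querying the public CDX endpoint instead. The function names (build_cdx_url, parse_cdx_rows, list_archives) are hypothetical and not part of any existing library.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the archives are Wayback Machine snapshots, listed via the CDX API.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_url(page_url, limit=None):
    """Build a CDX query that returns timestamp and original URL as JSON."""
    params = {"url": page_url, "output": "json", "fl": "timestamp,original"}
    if limit is not None:
        params["limit"] = str(limit)
    return CDX_ENDPOINT + "?" + urllib.parse.urlencode(params)


def parse_cdx_rows(rows):
    """Turn decoded CDX JSON (header row first) into sorted (timestamp, snapshot_url) pairs."""
    if not rows:
        return []  # the API returns an empty list when nothing is archived
    header, *records = rows
    ts_idx = header.index("timestamp")
    orig_idx = header.index("original")
    snapshots = [
        (r[ts_idx], f"https://web.archive.org/web/{r[ts_idx]}/{r[orig_idx]}")
        for r in records
    ]
    # Timestamps are fixed-width YYYYMMDDhhmmss strings, so string order is chronological.
    return sorted(snapshots)


def list_archives(page_url, limit=None):
    """Fetch and parse every available snapshot for page_url (needs network access)."""
    with urllib.request.urlopen(build_cdx_url(page_url, limit), timeout=30) as resp:
        return parse_cdx_rows(json.load(resp))
```

Calling list_archives("example.com", limit=5) would return up to five (timestamp, snapshot_url) pairs, each of which could then be fetched individually.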
JavaScript/Node.js wrapper around Mozilla's Readability library so that ArchiveBox can call it as a one-shot CLI command to extract each page's article text.