🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Updated Jun 14, 2023 - Python
Wayback Machine API interface & a command-line tool
Navigator for Web Archive
Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)
JavaScript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a one-shot CLI command to extract each page's article text.
Home of the official Docker image for ArchiveBox
Scrape posts and threads from forums, news aggregators, and mail archives; export to JSONL, mailbox, or WARC
Homebrew formula for the ArchiveBox self-hosted internet archiving solution.
Home of the official apt/deb package for Ubuntu/Debian-based systems.
Official Python package for ArchiveBox, the self-hosted internet archiving solution.
Source for the GitHub Wiki / ReadTheDocs documentation for ArchiveBox, the self-hosted internet archiving solution.
Pick a date and explore websites from the early days of the internet to now all in an easy-to-use browser format!
Submit URLs listed inside a file to website archival services
Repository of scripts for capturing MyConvento newsroom press releases from the MyConvento PR management suite. The README analyzes the MyConvento URL architecture for users who want to develop their own solution.