The Wayback Machine - https://web.archive.org/web/20220617133721/https://github.com/topics/internet-archiving
Here are
17 public repositories
matching this topic...
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Updated
Jun 10, 2022
Python
🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump
Updated
Apr 7, 2021
Shell
😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...
Wayback Machine API interface & a command-line tool
Updated
Mar 29, 2022
Python
Navigator for Web Archive
Updated
Jun 1, 2022
JavaScript
Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)
Updated
Jun 17, 2022
JavaScript
🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.
Updated
Oct 19, 2020
JavaScript
Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.
Updated
May 10, 2022
JavaScript
Homebrew formula for the ArchiveBox self-hosted internet archiving solution.
Updated
Dec 23, 2021
Ruby
Home of the official docker image for ArchiveBox
Updated
Dec 23, 2021
Dockerfile
Home of the official apt/deb package for Ubuntu/Debian-based systems.
Updated
Sep 22, 2021
Python
Official Python package for ArchiveBox, the self-hosted internet archiving solution.
Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.
Updated
Sep 30, 2021
Python
Submit URLs listed inside a file to website archival services
Updated
Aug 26, 2021
Python
Pick a date and explore websites from the early days of the internet to now all in an easy-to-use browser format! 💻
Updated
Jun 15, 2022
Swift
Pick a date and explore websites from the early days of the internet to now all in an easy-to-use browser format! 💻
Updated
Jun 14, 2022
Python
Repository for collecting scripts to help capture MyConvento newsroom press-releases from the MyConvento PR management suite. The README provides an analysis of the MyConvento URL architecture for users hoping to develop a solution for themselves.
Updated
Jan 5, 2022
Python
Improve this page
Add a description, image, and links to the
internet-archiving
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
internet-archiving
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.