The Wayback Machine - https://web.archive.org/web/20221206171114/https://github.com/topics/web-archiving
Here are 84 public repositories matching this topic...
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Updated Dec 5, 2022 · Python
Collect and revisit web pages.
Updated Dec 6, 2022 · Python
Core Python Web Archiving Toolkit for replay and recording of web archives
Updated Dec 6, 2022 · JavaScript
InterPlanetary Wayback: A distributed and persistent archive replay system using IPFS
Updated Dec 6, 2022 · Python
Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder)
Updated Sep 17, 2020 · JavaScript
A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!
Updated Dec 3, 2022 · JavaScript
WarcDB: Web crawl data as SQLite databases.
Updated Nov 15, 2022 · Python
Archiveror will help you preserve the webpages you love. 💾
Updated Oct 18, 2019 · JavaScript
Updated Dec 5, 2022 · JavaScript
A Tool To Push Web Resources Into Web Archives
Updated Feb 14, 2021 · Python
🐋 Web Archiving Integration Layer: One-Click User Instigated Preservation
Serverless Web Archive Replay directly in the browser
Updated Dec 4, 2022 · JavaScript
Streaming WARC/ARC library for fast web archive IO
Updated Jun 26, 2022 · Python
Wayback Machine API interface & a command-line tool
Updated Nov 17, 2022 · Python
Chrome extension to "Create WARC files from any webpage"
Updated May 31, 2022 · JavaScript
Social Feed Manager user interface application.
Updated Dec 2, 2022 · Python
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Updated Oct 8, 2021 · Scala
Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)
Updated Nov 8, 2022 · JavaScript
A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine
Updated Mar 28, 2022 · Python
🐋 One-Click User Instigated Preservation
Updated Feb 3, 2019 · JavaScript