🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
-
Updated
May 19, 2025 - Python
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Core Python Web Archiving Toolkit for replay and recording of web archives
Collect and revisit web pages.
A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!
CLI tool for saving a faithful copy of a complete web page in a single HTML file (based on SingleFile)
Automatically archive links to videos, images, and social media content from Google Sheets (and more).
Free web archiving and sharing service based on Cloudflare. 跑在 Cloudflare 上的免费网页归档和分享工具。
Run a high-fidelity browser-based web archiving crawler in a single Docker container
Serverless replay of web archives directly in the browser
InterPlanetary Wayback: A distributed and persistent archive replay system using IPFS
Wayback Machine API interface & a command-line tool
Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder)
Archiveror will help you preserve the webpages you love. 💾
Streaming WARC/ARC library for fast web archive IO
A Tool To Push Web Resources Into Web Archives
🐋 Web Archiving Integration Layer: One-Click User Instigated Preservation
Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
Add a description, image, and links to the web-archiving topic page so that developers can more easily learn about it.
To associate your repository with the web-archiving topic, visit your repo's landing page and select "manage topics."