@ArchiveBox

Top repositories

1

ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Python
20,613
star
2

good-karma-kit

😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...
299
star
3

archivebox-browser-extension

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.
TypeScript
200
star
4

electron-archivebox

Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)
JavaScript
175
star
5

docker-archivebox

Home of the official docker image for ArchiveBox
Dockerfile
45
star
6

readability-extractor

JavaScript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a one-shot CLI command to extract each page's article text.
JavaScript
34
star
7

homebrew-archivebox

Homebrew formula for the ArchiveBox self-hosted internet archiving solution.
Ruby
26
star
8

debian-archivebox

Home of the official apt/deb package for Ubuntu/Debian-based systems.
Python
17
star
9

docs

Source for the GitHub Wiki / ReadTheDocs documentation for ArchiveBox, the self-hosted internet archiving solution.
CSS
14
star
10

DigestBox

DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.
HTML
13
star
11

pip-archivebox

Official Python package for ArchiveBox, the self-hosted internet archiving solution.
13
star
12

archivebox-proxy

Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.
Python
11
star
13

pydantic-pkgr

A modern Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.
Python
7
star
14

community

A wiki of the broader Web Archiving Community: important organizations, alternative projects, blog posts, and more.
4
star
15

archivebox-spreadsheet-bot

A bot that integrates ArchiveBox with Google Sheets for new URL ingestion, archived URL management, and automated QA (optionally AI-powered).
1
star