-
The National Archives
- London
-
00:42
(UTC)
Starred repositories
Data pipeline to harvest, transform, reconcile, enrich and export Linked Art data for LUX (or other system)
W3C web annotation search using the IIIF content search API
Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.
Next-Generation full text search library for Browser and Node.js
Container runtimes on macOS (and Linux) with minimal setup
Curating, georeferencing and exploring for IIIF maps
OCR, layout analysis, reading order, table recognition in 90+ languages
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
Typesafe IIIF presentation v3 parsing without external dependencies
A list of AI agents and robots to block.
Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.
A pure JavaScript implementation of git for node and browsers!
TIFY is a slim and mobile-friendly IIIF document viewer.
a Mirador 3 plugin that adds annotation creation tools to the user interface
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Fast and efficient DOM-less OCR parser for use in browsers (including Workers) and Node
WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections.