Library of Congress (@LibraryOfCongress)

Top repositories

1

api.congress.gov

congress.gov API
Java
624
star
2

newspaper-navigator

Jupyter Notebook
225
star
3

bagit-python

Work with BagIt packages from Python.
Python
210
star
4

data-exploration

Tutorials for working with Library of Congress collections data
Jupyter Notebook
179
star
5

concordia

Crowdsourcing platform for full text transcription and tagging. https://crowd.loc.gov
Python
154
star
6

bagger

The Bagger application packages data files according to the BagIt specification.
Java
120
star
7

bagit-java

Java library to support the BagIt specification.
Java
71
star
8

citizen-dj

JavaScript
70
star
9

chronam

This software project is no longer being actively developed at the Library of Congress. Consider using the Open-ONI (https://github.com/open-oni) fork of the chronam software. Project mailing list: http://listserv.loc.gov/archives/chronam-users.html.
Python
70
star
10

viewshare

A web application developed by Zepheira for the Library of Congress National Digital Information Infrastructure and Preservation Program (NDIIPP) which allows users to create and share embeddable interfaces to digital cultural heritage collections. A project of the Library of Congress; the project was retired in March 2018. Note: project members may work on both official Library of Congress projects and non-LC projects.
JavaScript
45
star
11

bagger-js

Upload BagIt-format deliveries to S3 entirely in the browser
JavaScript
32
star
12

coding-standards

Library of Congress coding standards
Python
27
star
13

labs-ai-framework

Planning Framework used by LC Labs for planning AI experiments towards responsible implementation
CSS
24
star
14

gazetteer

A historical gazetteer project of the Library of Congress. Note: project members may work on both official Library of Congress projects and non-LC projects.
Python
23
star
15

wdl-viewer

A fast, responsive HTML5 viewer for scanned items, developed for the World Digital Library. A project of the Library of Congress. Note: project members may work on both official Library of Congress projects and non-LC projects.
JavaScript
22
star
16

speech-to-text-viewer

AWS Transcribe evaluation pipeline: bulk-process audio files and view the results
Python
17
star
17

django-tabular-export

Utilities used to export data into spreadsheets from Django applications. Currently used internally at the Library of Congress in the WDL cataloging application.
Python
15
star
18

Exploring-ML-with-Project-Aida

Jupyter Notebook
13
star
19

bagit-conformance-suite

Test cases for validating BagIt implementations
Python
10
star
20

premis-v3-0

PREMIS schemas are written in XML. They are open source community tools that allow PREMIS users to validate PREMIS records against a version of the PREMIS schema.
10
star
21

mods2bibframe

mods2bibframe XSLT
XSLT
8
star
22

MarcMods3.6xsl

MARC>MODS--the mappings and corresponding XSLTs are open source community tools developed by NDMSO at LC.
XSLT
7
star
23

hitl

Code and documentation for Humans in the Loop (HITL), an LC Labs sponsored collaboration with metadata solutions provider AVP. The experiment explores a framework and considerations for integrating crowdsourcing and machine learning in ways that are ethical, engaging, and useful.
JavaScript
7
star
24

embARC

embARC (โ€œmetadata embedded for archival contentโ€) manages internal file metadata including embedding and validation. Created by FADGI (Federal Agencies Digital Guidelines Initiative), in conjunction with AVP and PortalMedia, embARC enables users to audit and correct embedded metadata of a subset of MXF files, as well as both individual DPX files or an entire DPX sequence, while not impacting the image data.
HTML
7
star
25

speculative-annotation

Speculative Annotation is a web browser application written in Javascript and built with React, FabricJS, IIIF, OpenSeaDragon, and ChakraUI. Source images are hosted locally. The application uses the OpenSeadragon Viewer to render images, so your source images can be a combination of locally hosted images (within the application), or externally hosted images (for example, served from a IIIF image server).Application metadata is represented by a combination of local IIIF Presentation API 3.0 manifest files, and Library of Congress hosted IIIF manifest files. The application allows users to annotate select free to use items from the Library of Congress, save to browser or download locally.
JavaScript
7
star
26

pimtoolbox

The Library of Congress and the Florida Center for Library Automation developed the PREMIS in METS (PiM) Toolbox. The project provides PREMIS:METS conversion and validation tools that support the implementation of PREMIS in the METS container format.
Ruby
6
star
27

inside-baseball

Explore baseball collections from the Library of Congress and the National Museum of African American History and Culture
Python
6
star
28

iptables-gem

A project of the Library of Congress. Note: project members may work on both official Library of Congress projects and non-LC projects.
Ruby
5
star
29

sanborn-navigator

Jupyter Notebook
5
star
30

ADCTest

ADCTest is a desktop application, written in C++, that provides provides simple pass-fail reporting for the tests detailed in the FADGI Low Cost ADC Performance Testing Guidelines as well as more detailed results
C++
5
star
31

MarcMods3.5xsl

MARC>MODS 3.5--the mapping and corresponding XSLT are open source community tools developed by NDMSO at LC.
XSLT
4
star
32

pairtree

A project of the Library of Congress. Note: project members may work on both official Library of Congress projects and non-LC projects.
CSS
4
star
33

simple-artifact-uploader

A plugin for the Gradle build management tool that allows us to automatically upload completed binaries to the Artifactory deployment server.
Java
3
star
34

a-search-for-the-heart

HTML
3
star
35

seeing-lost-enclaves

Seeing Lost Enclaves is an initiative by Jeffrey Yoo Warren as part of the 2023 Innovator in Residence Program at the Library of Congress.
HTML
2
star
36

DVV

The Digital Viewer and Validator (DVV) tool is developed at the Library of Congress for use by National Digital Newspaper Program (NDNP) participants.
1
star
37

LC_Labs

1
star
38

viewshare_site

Site specific project retired Library of Congress instance of the Viewshare project
Python
1
star
39

marc2mads20

MARC>MADS--the mappings and corresponding XSLTs are open source community tools developed by NDMSO at LC.
1
star
40

CCHC

Computing Cultural Heritage in the Cloud (CCHC) is our Andrew W. Mellon-funded experiment for piloting cloud solutions to enable research including data analysis and reduction on large-scale digital collections. Three non-LC staff contracted researchers will analyze large collection datasets that are stored in and accessible from AWS, likely as JSON. The contracted research experts' code will demonstrate how the datasets are gathered, transformed, and manipulated to demonstrate the needs of computational analysis. Languages used in this code may include Python and JavaScript. Code will undergo security review as it is submitted as deliverables during the contract window, with final versions to be made available in GitHub repository by the end of Q2 FY 2022.
1
star
41

btp-data

This Python tutorial demonstrates how to process and visualize the Library of Congress' By the People transcription data using natural language processing.
Jupyter Notebook
1
star