There are no reviews yet. Be the first to send feedback to the community and the maintainers!
htrc-feature-reader
Tools for working with HTRC Feature Extraction filesHTRC-WorksetToolkit
Python SDK for Data API and Solr API accessHTRC-Portal
HTRC PortalACS-TT
ACS: The Trace of TheoryHTRC-Useful-Datasets
JSTOR-FeatureExtractor
Tool for performing feature extraction for TDM JSTOR documentsBibframe-Transform
Tramsform MARC records to BIBFRAME and add linksHTRC-EF-RsyncGenerator
Generates a shell script that allows one to download the extracted features files for a given set of volume ids.htrc.github.io
HathiTrust Research Centerscwared
Home for general documentation about HTRC’s Mellon-funded SCWAReD project.HTRC-Security-OAuth2
userinfoHTRC-FeatureExtractor
Extracts features (token counts, POS tags, etc.) from a list of HT volumes, to aid in non-consumptive research.torchlite-backend
Backend API service for Torchlite web dashboardHTRC-JWTServletFilter
Servlet filter for processing and validating HTTP requests with JWT tokensHTRC-Access-EF
A servlet-based system that provides fine-grained access to the HTRC Extracted Features files through an APIHTRC-RightsAPI
HTRC-Tools-UserManager
HTRC-Client-SolrAPI-Java
HTRC-DevEnvironment
Vagrant based development environment for HTRCHTRC-DataAPI-Client-Scala
Scala client for retrieving volumes from the HTRC DataAPI.HTRC-Cassandra-Ingester
TDM-DataAPI
Web service providing access to the TDM EF datasettorchlite-handbook
Hackathon Handbookef-workshop
Materials for the DH 2016 workshop on text analysis with the HTRC Feature ReaderHTRC-Public-Worksets-API
API for working with worksets in Virtuosocode-of-conduct
This is a repository to provide a space to draft and allow for dialog in the creation of a code of conduct for the HTRC UnCamp and other public related events.HTRC-Commons
Commons libraries used by HTRC related projects.HTRC-CulturalScaleModels
HTRC-Solr-EF-Ingester
Code that uses Spark to ingest the Extracted Feature JSON files (bundled as SequenceFiles), and stream the necessary files over to an Solr cloud installationtorchlite-frontend
Torchlite web interfaceHTRC-DevEnvCassandra
HTRC-Agent
HTRC job submission and job management module.HTRC-RegistryExtension
Workset, file and job persistence API on top of WSO2 Governance RegistryHTRC-Tools-ScalaUtils
Set of utility functions and routines that reduce the boilerplate needed to accomplish some common tasks in Scala.HTRC-Tools-PairtreeToText
Extracts full text from a HT volume stored in Pairtree by concatenating the pages in the correct order, performing optional post-processing to remove hyphenation, empty lines, headers/footers, etc.HTRC-Solr-EF-Cloud
The setup and configuration files necessary to spin up a cloud-based Solr installation that HTRC-Solr-EF-Ingester can stream its output to for indexingHTRC-Tools-RunningHeaders-Python
Library for detecting running headers/footers in pages of textHTRC-EF-Identifier-Info
Provides a browser-accessible endpoint that resolves JSON-LD EF identifier URLs, displaying useful info for themTDM-DataAPI-Aggregator
HTRC-JupyterNotebooks
scwared-spanish-american-fiction
HTRC-Alg-TokenCountTagCloud
Counts tokens and generates a tag cloud for a given list of HT volume idsHTRC-Tools-SparkUtils
Library that adds useful error handling and non-serializable object management capabilities to Apache Spark applications.JGoodwin-Topic-Browser-in-a-Data-Capsule
Notes on how, and scripts to help automate, installing and running JGoodwin's Topic Browser using an Ubuntu-16 Data-CapsuleLove Open Source and this site? Check out how you can help us