• Stars
    star
    1
  • Language
    Java
  • Created over 7 years ago
  • Updated 9 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A servlet-based system that provides fine-grained access to the HTRC Extracted Features files through an API

More Repositories

1

htrc-feature-reader

Tools for working with HTRC Feature Extraction files
Python
39
star
2

HTRC-WorksetToolkit

Python SDK for Data API and Solr API access
Python
10
star
3

HTRC-Portal

HTRC Portal
Java
7
star
4

ACS-TT

ACS: The Trace of Theory
Jupyter Notebook
5
star
5

HTRC-DataCapsules

Secure environment for text analysis at scale of sensitive digitized content
Java
4
star
6

HTRC-Useful-Datasets

Jupyter Notebook
4
star
7

JSTOR-FeatureExtractor

Tool for performing feature extraction for TDM JSTOR documents
Java
2
star
8

Bibframe-Transform

Tramsform MARC records to BIBFRAME and add links
Python
2
star
9

HTRC-EF-RsyncGenerator

Generates a shell script that allows one to download the extracted features files for a given set of volume ids.
Scala
2
star
10

htrc.github.io

HathiTrust Research Center
HTML
2
star
11

scwared

Home for general documentation about HTRC’s Mellon-funded SCWAReD project.
2
star
12

HTRC-Security-OAuth2

userinfo
Java
2
star
13

HTRC-FeatureExtractor

Extracts features (token counts, POS tags, etc.) from a list of HT volumes, to aid in non-consumptive research.
Scala
2
star
14

torchlite-backend

Backend API service for Torchlite web dashboard
Python
2
star
15

HTRC-JWTServletFilter

Servlet filter for processing and validating HTTP requests with JWT tokens
Java
1
star
16

HTRC-RightsAPI

Java
1
star
17

HTRC-Tools-UserManager

Java
1
star
18

HTRC-Client-SolrAPI-Java

Java
1
star
19

HTRC-DevEnvironment

Vagrant based development environment for HTRC
HTML
1
star
20

HTRC-DataAPI-Client-Scala

Scala client for retrieving volumes from the HTRC DataAPI.
Scala
1
star
21

HTRC-Cassandra-Ingester

Java
1
star
22

TDM-DataAPI

Web service providing access to the TDM EF dataset
Scala
1
star
23

torchlite-handbook

Hackathon Handbook
Jupyter Notebook
1
star
24

ef-workshop

Materials for the DH 2016 workshop on text analysis with the HTRC Feature Reader
Jupyter Notebook
1
star
25

HTRC-Public-Worksets-API

API for working with worksets in Virtuoso
JavaScript
1
star
26

code-of-conduct

This is a repository to provide a space to draft and allow for dialog in the creation of a code of conduct for the HTRC UnCamp and other public related events.
1
star
27

HTRC-Commons

Commons libraries used by HTRC related projects.
Java
1
star
28

HTRC-CulturalScaleModels

Java
1
star
29

HTRC-Solr-EF-Ingester

Code that uses Spark to ingest the Extracted Feature JSON files (bundled as SequenceFiles), and stream the necessary files over to an Solr cloud installation
Java
1
star
30

torchlite-frontend

Torchlite web interface
TypeScript
1
star
31

HTRC-DevEnvCassandra

Shell
1
star
32

HTRC-Agent

HTRC job submission and job management module.
Scala
1
star
33

HTRC-RegistryExtension

Workset, file and job persistence API on top of WSO2 Governance Registry
Java
1
star
34

HTRC-Tools-ScalaUtils

Set of utility functions and routines that reduce the boilerplate needed to accomplish some common tasks in Scala.
Scala
1
star
35

HTRC-Tools-PairtreeToText

Extracts full text from a HT volume stored in Pairtree by concatenating the pages in the correct order, performing optional post-processing to remove hyphenation, empty lines, headers/footers, etc.
Scala
1
star
36

HTRC-Solr-EF-Cloud

The setup and configuration files necessary to spin up a cloud-based Solr installation that HTRC-Solr-EF-Ingester can stream its output to for indexing
Shell
1
star
37

HTRC-Tools-RunningHeaders-Python

Library for detecting running headers/footers in pages of text
Python
1
star
38

HTRC-EF-Identifier-Info

Provides a browser-accessible endpoint that resolves JSON-LD EF identifier URLs, displaying useful info for them
CSS
1
star
39

TDM-DataAPI-Aggregator

Scala
1
star
40

HTRC-JupyterNotebooks

Jupyter Notebook
1
star
41

scwared-spanish-american-fiction

1
star
42

HTRC-Alg-TokenCountTagCloud

Counts tokens and generates a tag cloud for a given list of HT volume ids
Scala
1
star
43

HTRC-Tools-SparkUtils

Library that adds useful error handling and non-serializable object management capabilities to Apache Spark applications.
Scala
1
star
44

JGoodwin-Topic-Browser-in-a-Data-Capsule

Notes on how, and scripts to help automate, installing and running JGoodwin's Topic Browser using an Ubuntu-16 Data-Capsule
HTML
1
star