  • Stars: 1
  • Language: Scala
  • Created about 6 years ago
  • Updated almost 6 years ago

Repository Details

Web service providing access to the TDM EF dataset

More Repositories

1

htrc-feature-reader

Tools for working with HTRC Feature Extraction files
Python
39
star
2

HTRC-WorksetToolkit

Python SDK for Data API and Solr API access
Python
10
star
3

HTRC-Portal

HTRC Portal
Java
7
star
4

ACS-TT

ACS: The Trace of Theory
Jupyter Notebook
5
star
5

HTRC-DataCapsules

Secure environment for text analysis at scale of sensitive digitized content
Java
4
star
6

HTRC-Useful-Datasets

Jupyter Notebook
4
star
7

JSTOR-FeatureExtractor

Tool for performing feature extraction for TDM JSTOR documents
Java
2
star
8

Bibframe-Transform

Transform MARC records to BIBFRAME and add links
Python
2
star
9

HTRC-EF-RsyncGenerator

Generates a shell script that allows one to download the extracted features files for a given set of volume ids.
Scala
2
star
10

htrc.github.io

HathiTrust Research Center
HTML
2
star
11

scwared

Home for general documentation about HTRC’s Mellon-funded SCWAReD project.
2
star
12

HTRC-Security-OAuth2

userinfo
Java
2
star
13

HTRC-FeatureExtractor

Extracts features (token counts, POS tags, etc.) from a list of HT volumes, to aid in non-consumptive research.
Scala
2
star
14

torchlite-backend

Backend API service for Torchlite web dashboard
Python
2
star
15

HTRC-JWTServletFilter

Servlet filter for processing and validating HTTP requests with JWT tokens
Java
1
star
16

HTRC-Access-EF

A servlet-based system that provides fine-grained access to the HTRC Extracted Features files through an API
Java
1
star
17

HTRC-RightsAPI

Java
1
star
18

HTRC-Tools-UserManager

Java
1
star
19

HTRC-Client-SolrAPI-Java

Java
1
star
20

HTRC-DevEnvironment

Vagrant-based development environment for HTRC
HTML
1
star
21

HTRC-DataAPI-Client-Scala

Scala client for retrieving volumes from the HTRC DataAPI.
Scala
1
star
22

HTRC-Cassandra-Ingester

Java
1
star
23

torchlite-handbook

Hackathon Handbook
Jupyter Notebook
1
star
24

ef-workshop

Materials for the DH 2016 workshop on text analysis with the HTRC Feature Reader
Jupyter Notebook
1
star
25

HTRC-Public-Worksets-API

API for working with worksets in Virtuoso
JavaScript
1
star
26

code-of-conduct

A space for drafting and discussing a code of conduct for the HTRC UnCamp and other related public events.
1
star
27

HTRC-Commons

Commons libraries used by HTRC-related projects.
Java
1
star
28

HTRC-CulturalScaleModels

Java
1
star
29

HTRC-Solr-EF-Ingester

Code that uses Spark to ingest the Extracted Feature JSON files (bundled as SequenceFiles) and stream the necessary files over to a Solr cloud installation
Java
1
star
30

torchlite-frontend

Torchlite web interface
TypeScript
1
star
31

HTRC-DevEnvCassandra

Shell
1
star
32

HTRC-Agent

HTRC job submission and job management module.
Scala
1
star
33

HTRC-RegistryExtension

Workset, file and job persistence API on top of WSO2 Governance Registry
Java
1
star
34

HTRC-Tools-ScalaUtils

Set of utility functions and routines that reduce the boilerplate needed to accomplish some common tasks in Scala.
Scala
1
star
35

HTRC-Tools-PairtreeToText

Extracts full text from a HT volume stored in Pairtree by concatenating the pages in the correct order, performing optional post-processing to remove hyphenation, empty lines, headers/footers, etc.
Scala
1
star
36

HTRC-Solr-EF-Cloud

The setup and configuration files necessary to spin up a cloud-based Solr installation that HTRC-Solr-EF-Ingester can stream its output to for indexing
Shell
1
star
37

HTRC-Tools-RunningHeaders-Python

Library for detecting running headers/footers in pages of text
Python
1
star
38

HTRC-EF-Identifier-Info

Provides a browser-accessible endpoint that resolves JSON-LD EF identifier URLs, displaying useful info for them
CSS
1
star
39

TDM-DataAPI-Aggregator

Scala
1
star
40

HTRC-JupyterNotebooks

Jupyter Notebook
1
star
41

scwared-spanish-american-fiction

1
star
42

HTRC-Alg-TokenCountTagCloud

Counts tokens and generates a tag cloud for a given list of HT volume ids
Scala
1
star
43

HTRC-Tools-SparkUtils

Library that adds useful error handling and non-serializable object management capabilities to Apache Spark applications.
Scala
1
star
44

JGoodwin-Topic-Browser-in-a-Data-Capsule

Notes on how, and scripts to help automate, installing and running JGoodwin's Topic Browser using an Ubuntu-16 Data-Capsule
HTML
1
star