• Stars
    star
    3
  • Rank 3,963,521 (Top 79 %)
  • Language
    Python
  • License
    Apache License 2.0
  • Created over 5 years ago
  • Updated over 5 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Tools for working with Hathi Trust Research Center extracted features files

More Repositories

1

Mallet

MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
Java
981
star
2

jsLDA

An implementation of latent Dirichlet allocation in javascript
JavaScript
183
star
3

info3300-spr2015

Notes and in-class problems for ML+d3 course
HTML
46
star
4

RMallet

R package wrapping Mallet
R
38
star
5

anchor

Mallet-compatible anchor-based topic model
Java
37
star
6

jsLBFGS

A javascript implementation of limited-memory BFGS
JavaScript
26
star
7

info3300-spr2017

Course materials for Data-Driven Web Applications
HTML
24
star
8

info3300-spr2016

Notes and pre-class work for INFO/CS 3300 and INFO 5100
HTML
16
star
9

PyMallet

Python tools for text
Jupyter Notebook
15
star
10

info3300-spr2018

Course materials for Data-Driven Web Applications
HTML
13
star
11

TidyMallet

A tidy-native LDA implementation in Rcpp
C++
12
star
12

info6150-fall2018

Resources for Advanced Topic Modeling (Fall 2018)
Python
9
star
13

info-3350-fall-2017

Materials for "Text Mining for History and Literature"
Python
8
star
14

admixture-ppc

Posterior predictive checks for genetic admixture models
Java
5
star
15

info-3350-fall-2019

Jupyter Notebook
4
star
16

arxivtopics

Python
4
star
17

CulturalAnalytics

Articles from CA
Python
3
star
18

GRMM

Mallet-compatible graphical model toolkit
Java
3
star
19

MalletPPC

Posterior predictive checks for Mallet state files
Python
3
star
20

naivebayes

in-browser classification and analysis
2
star
21

ota

Creative commons texts from Oxford Text Archive
2
star
22

TwelveMedievalGhostStories

Stories transcribed by M.R. James from a manuscript from Byland Abbey
2
star
23

info-3350-fall-2015

Python
2
star
24

networks

Poisson network community model
Java
1
star
25

tada2022

Text as Data 2022
1
star