• Stars
    star
    13
  • Rank 1,512,713 (Top 30 %)
  • Language
    Java
  • License
    GNU General Publi...
  • Created about 9 years ago
  • Updated over 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser

More Repositories

1

nllb-serve

Meta's "No Language Left Behind" models served as web app and REST API
Python
177
star
2

mtdata

A tool that locates, downloads, and extracts machine translation corpora
Python
147
star
3

tensorflow-grpc-java

Tensorflow grpc java client for image recognition serving inception model
Java
39
star
4

charliebot

The ALICE/ALICEBOT/CHARLIE/CHARLIEBOT
Java
18
star
5

tika-dl4j-spark-imgrec

Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika
Java
14
star
6

006-many-to-eng

Machine translation of many to English
Jupyter Notebook
9
star
7

005-nmt-imbalance

Finding the Optimal Vocabulary for NMT
Jupyter Notebook
6
star
8

solr-similarity

Finding similarity of documents by making use of vector space model
Java
5
star
9

ML101

Machine learning crash course for absolute beginners.
HTML
5
star
10

autoextractor

A toolkit for clustering web pages based on various similarity measures.
Java
4
star
11

notes

A place to dump all my homeworks and practice scribblings.
Jupyter Notebook
4
star
12

awkg

awkg is an awk-like text-processing tool powered by python language
Python
4
star
13

572-hw2

Home Work 2 Indexing + NER
Java
3
star
14

virtchar

A Dialog toolkit for making favorite TV characters as chatbots
Python
3
star
15

parser-indexer

Metadata Parser and Solr Indexer
Java
3
star
16

image-forensics-MFSec17

Automated Large Scale Image Forensics using Tika and Tensorflow
TeX
3
star
17

pdf-extractor

A tool to extract text from PDF files with OCR
Java
2
star
18

202010-pytorch

Pytorch PPT for ISI
Jupyter Notebook
2
star
19

unmass

Unsupervised NMT based on Masked Seq-to-Seq
Python
2
star
20

dialog-data

Dialog Datasets
2
star
21

016-many-eng-v2

Many-English v2
Python
2
star
22

007-mt-eval-macro

MT evaluation, use macro-average because rare types are important too.
Jupyter Notebook
2
star
23

572-hw3

A web-app for displaying indexed weapons data from Solr
JavaScript
2
star
24

tweeter-hackathon

Twitter analysis for hackathon
XSLT
1
star
25

011-imb-learn

Imbalanced Learning
Python
1
star
26

realigner

Re-aligner tool for aligning parallel sentences from comparable documents
Python
1
star
27

summary

Research Summaries
CSS
1
star
28

junkdetect

Junk-not-junk, a detector that supports 100 natural languages.
Python
1
star
29

image-resize

ImageMagick based resizer script for android projects.
Shell
1
star
30

java-plugin-demo

This project demonstrates plugin system which includes an SDK, sample plugins and a plugin app which registers plugins at run time
Java
1
star
31

kannada-osx-keylayout

Kannada keyboard layout setting for OSX
1
star
32

014-udhr-dataset

Parallel dataset, aligned from United Nations' Universal Declaration of Human Rights (UDHR)
Jupyter Notebook
1
star
33

572-hw1

CSCI 572 Assignment 1
Python
1
star