• Stars
    star
    3
  • Rank 3,944,522 (Top 79 %)
  • Language
    Java
  • Created almost 9 years ago
  • Updated over 8 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Metadata Parser and Solr Indexer

More Repositories

1

nllb-serve

Meta's "No Language Left Behind" models served as web app and REST API
Python
154
star
2

mtdata

A tool that locates, downloads, and extracts machine translation corpora
Python
146
star
3

tensorflow-grpc-java

Tensorflow grpc java client for image recognition serving inception model
Java
39
star
4

charliebot

The ALICE/ALICEBOT/CHARLIE/CHARLIEBOT
Java
18
star
5

tika-dl4j-spark-imgrec

Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika
Java
14
star
6

tika-ner-corenlp

Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser
Java
13
star
7

006-many-to-eng

Machine translation of many to English
Jupyter Notebook
9
star
8

005-nmt-imbalance

Finding the Optimal Vocabulary for NMT
Jupyter Notebook
6
star
9

solr-similarity

Finding similarity of documents by making use of vector space model
Java
5
star
10

ML101

Machine learning crash course for absolute beginners.
HTML
5
star
11

autoextractor

A toolkit for clustering web pages based on various similarity measures.
Java
4
star
12

notes

A place to dump all my homeworks and practice scribblings.
Jupyter Notebook
4
star
13

awkg

awkg is an awk-like text-processing tool powered by python language
Python
4
star
14

572-hw2

Home Work 2 Indexing + NER
Java
3
star
15

virtchar

A Dialog toolkit for making favorite TV characters as chatbots
Python
3
star
16

image-forensics-MFSec17

Automated Large Scale Image Forensics using Tika and Tensorflow
TeX
3
star
17

pdf-extractor

A tool to extract text from PDF files with OCR
Java
2
star
18

202010-pytorch

Pytorch PPT for ISI
Jupyter Notebook
2
star
19

unmass

Unsupervised NMT based on Masked Seq-to-Seq
Python
2
star
20

dialog-data

Dialog Datasets
2
star
21

016-many-eng-v2

Many-English v2
Python
2
star
22

007-mt-eval-macro

MT evaluation, use macro-average because rare types are important too.
Jupyter Notebook
2
star
23

572-hw3

A web-app for displaying indexed weapons data from Solr
JavaScript
2
star
24

tweeter-hackathon

Twitter analysis for hackathon
XSLT
1
star
25

011-imb-learn

Imbalanced Learning
Python
1
star
26

realigner

Re-aligner tool for aligning parallel sentences from comparable documents
Python
1
star
27

summary

Research Summaries
CSS
1
star
28

junkdetect

Junk-not-junk, a detector that supports 100 natural languages.
Python
1
star
29

image-resize

ImageMagick based resizer script for android projects.
Shell
1
star
30

kannada-osx-keylayout

Kannada keyboard layout setting for OSX
1
star
31

014-udhr-dataset

Parallel dataset, aligned from United Nations' Universal Declaration of Human Rights (UDHR)
Jupyter Notebook
1
star
32

572-hw1

CSCI 572 Assignment 1
Python
1
star