• Stars
    star
    1
  • Language
    Jupyter Notebook
  • Created about 3 years ago
  • Updated almost 3 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Parallel dataset, aligned from United Nations' Universal Declaration of Human Rights (UDHR)

More Repositories

1

nllb-serve

Meta's "No Language Left Behind" models served as web app and REST API
Python
177
star
2

mtdata

A tool that locates, downloads, and extracts machine translation corpora
Python
147
star
3

tensorflow-grpc-java

Tensorflow grpc java client for image recognition serving inception model
Java
39
star
4

charliebot

The ALICE/ALICEBOT/CHARLIE/CHARLIEBOT
Java
18
star
5

tika-dl4j-spark-imgrec

Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika
Java
14
star
6

tika-ner-corenlp

Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser
Java
13
star
7

006-many-to-eng

Machine translation of many to English
Jupyter Notebook
9
star
8

005-nmt-imbalance

Finding the Optimal Vocabulary for NMT
Jupyter Notebook
6
star
9

solr-similarity

Finding similarity of documents by making use of vector space model
Java
5
star
10

ML101

Machine learning crash course for absolute beginners.
HTML
5
star
11

autoextractor

A toolkit for clustering web pages based on various similarity measures.
Java
4
star
12

notes

A place to dump all my homeworks and practice scribblings.
Jupyter Notebook
4
star
13

awkg

awkg is an awk-like text-processing tool powered by python language
Python
4
star
14

572-hw2

Home Work 2 Indexing + NER
Java
3
star
15

virtchar

A Dialog toolkit for making favorite TV characters as chatbots
Python
3
star
16

parser-indexer

Metadata Parser and Solr Indexer
Java
3
star
17

image-forensics-MFSec17

Automated Large Scale Image Forensics using Tika and Tensorflow
TeX
3
star
18

pdf-extractor

A tool to extract text from PDF files with OCR
Java
2
star
19

202010-pytorch

Pytorch PPT for ISI
Jupyter Notebook
2
star
20

unmass

Unsupervised NMT based on Masked Seq-to-Seq
Python
2
star
21

dialog-data

Dialog Datasets
2
star
22

016-many-eng-v2

Many-English v2
Python
2
star
23

007-mt-eval-macro

MT evaluation, use macro-average because rare types are important too.
Jupyter Notebook
2
star
24

572-hw3

A web-app for displaying indexed weapons data from Solr
JavaScript
2
star
25

tweeter-hackathon

Twitter analysis for hackathon
XSLT
1
star
26

011-imb-learn

Imbalanced Learning
Python
1
star
27

realigner

Re-aligner tool for aligning parallel sentences from comparable documents
Python
1
star
28

summary

Research Summaries
CSS
1
star
29

junkdetect

Junk-not-junk, a detector that supports 100 natural languages.
Python
1
star
30

image-resize

ImageMagick based resizer script for android projects.
Shell
1
star
31

java-plugin-demo

This project demonstrates plugin system which includes an SDK, sample plugins and a plugin app which registers plugins at run time
Java
1
star
32

kannada-osx-keylayout

Kannada keyboard layout setting for OSX
1
star
33

572-hw1

CSCI 572 Assignment 1
Python
1
star