There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Tatoeba-Challenge
Opus-MT
Open neural machine translation models and web servicesOPUS-MT-train
Training open neural machine translation modelsprosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from TextOpusFilter
OpusFilter - Parallel corpus processing toolkitHBMP
Sentence Embeddings in NLI with Iterative Refinement EncodersOPUS-CAT
OPUS-CAT is a collection of software which make it possible to OPUS-MT neural machine translation models in professional translation. OPUS-CAT includes a local offline MT engine and a collection of CAT tool plugins.OpusTools
XED
XED multilingual emotion datasetsUkrainianLT
A collection of links to Ukrainian language toolsOPUS-translator
Translation demonstratormammoth
MAMMOTH: MAssively Multilingual Modular Open Translation @ HelsinkiMuCoW
Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translationsubalign
sentimentator
Tool for sentiment analysis annotationOPUS-MT-testsets
benchmarks for evaluating MT modelsOpusTools-perl
neural-search-tutorials
Additional Notebooks for the Building NLP Applications courseOPUS-interface
OPUS repository interfaceOPUS-ingest
LanguageCodes
shroom
nli-data-sanity-check
Data and scripts for a diagnostics test suite which allows to assess whether an NLU dataset constitutes a good testbed for evaluating the models' meaning understanding capabilities.OPUS-repository
doclevel-MT-benchmark
Document-level Machine Translation BenchmarkUplug
americasnlp2021-st
AmericasNLP 2021 shared taskGeometry
shared-info
LSDC
Low-Saxon Dialect Classificationpdf2xml
Syntactic_Debiasing
OpusTranslationService
Translation service based on LibreTranslatemurre24
Manually annotated dataset of Finnish varieties in the Suomi24, the largest Finnish internet forum, the id's of automatically annotated dialectal messages and the scripts used for classification and evaluation.OPUS-index
Index of resources in OPUSOpusFilter-hub
A hub of OpusFilter configurationsNLU-Course-2020
SELF-FEIL
Emotion Lexicons for Finnishndc-aligned
Word-aligned version of the Norwegian Dialect CorpusOPUS-MT-dashboard
External-MT-leaderboard
Leaderboards for external MT modelsnlu-dataset-diagnostics
This repository contains data and scripts to reproduce the results from our paper: How Does Data Corruption Affect Natural Language Understanding Models? A Study on GLUE datasets.en-fi-testsuite
WMT18 Testsuite for Finnish morphologyfinlandsvensk-AI
OPUS-website
OPUS website filesOPUS-MT-leaderboard-recipes
Makefile recipes shared between all leaderboard reposOPUS-MT-leaderboard
murreviikko
Dialectologically annotated and normalized dataset of dialectal Finnish tweetsSami-MT
machine translation for Sámi languageslm-vs-mt
Two Stacks Are Better Than One: A Comparison of Language Modeling and Translation as Multilingual Pretraining ObjectivesOPUS-API
API for searching corpora from OPUSdialect-topic-model
Scripts and metadata for the paper "Corpus-based dialectometry with topic models"Love Open Source and this site? Check out how you can help us