• Stars
    star
    25
  • Rank 957,573 (Top 19 %)
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created over 1 year ago
  • Updated 11 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

The multilingual language model for Switzerland

More Repositories

1

mbr

Minimum Bayes Risk Decoding for Hugging Face Transformers
Python
51
star
2

ContraDecode

The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding"
Python
33
star
3

xstance

A Multilingual Multi-Target Dataset for Stance Detection
Python
33
star
4

nmtscore

A library of translation-based text similarity measures
Python
25
star
5

ContraPro

Contrastive evaluation of pronoun translation in neural machine translation
Perl
24
star
6

multilingual-instruction-tuning

Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"
Jupyter Notebook
23
star
7

coverage-contrastive-conditioning

Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive Conditioning" (ACL 2022)
Python
20
star
8

ContraWSD

Word sense disambiguation test sets for NMT
Python
19
star
9

understanding-mbr

Shell
17
star
10

domain-robustness

Shell
12
star
11

segtest

A Test Suite for Morphological Phenomena in Neural Machine Translation
Shell
7
star
12

mtrain

Training automation for neural and statistical machine translation engines
Python
7
star
13

mbr-sensitivity

Data and code for the paper "Identifying Weaknesses in Machine Translation Metrics Through Minimum Bayes Risk Decoding: A Case Study for COMET"
Python
6
star
14

sdg_swisstext_2024_sharedtask

Repository for data and evaluation of 2024 Shared Task on SDG classification held by the Swiss Text Conference.
Python
5
star
15

BLESS

Code for the EMNLP 2023 paper "BLESS: Benchmarking Large Language Models on Sentence Simplification"
Jupyter Notebook
5
star
16

emnlp2018-imitation-learning-for-neural-morphology

Code for Paper "Imitation Learning for Neural Morphological String Transduction" by Peter Makarov and Simon Clematide. 2018. EMNLP
Python
4
star
17

monotonicity_loss

PLSQL
4
star
18

romanesco

Simple recurrent neural network (RNN) language model
Python
4
star
19

translation-direction-detection

Unsupervised translation direction detection using NMT systems
Python
4
star
20

mt-parity-assessment-data

experimental data for paper "A Set of Recommendations for Assessing Humanโ€“Machine Parity in Language Translation"
HTML
3
star
21

acl2020-historical-text-normalization

Code for the ACL 2020 paper "Semi-supervised Contextual Historical Text Normalization" by Peter Makarov and Simon Clematide
Python
3
star
22

contrastive-conditioning

Code and data accompanying the paper "Contrastive Conditioning for Assessing Disambiguation in MT: A Case Study of Distilled Bias"
Python
3
star
23

coling2018-neural-transition-based-morphology

Code repository for COLING 2018 paper by Makarov and Clematide
Python
3
star
24

distil-lingeval

Data and code accompanying the paper "On the Limits of Minimal Pairs in Contrastive Evaluation"
Python
3
star
25

20Minuten

Jupyter Notebook
3
star
26

MultiPivotNMT

The implementation of "Investigating Multi-Pivot Ensembling with Massively Multilingual Machine Translation Models"
Python
3
star
27

multilingual-lemma-disambiguation-gold-standard

A Multilingual Lemma Disambiguation Gold Standard for German, Finnish, French and Italian (as described in the MA thesis )
2
star
28

specific_hospo_respo

Code for hospitality review response generation
Jupyter Notebook
2
star
29

voting-booklet-bias

Code for the paper "Voting Booklet Bias: Stance Detection in Swiss Federal Communication"
Jupyter Notebook
2
star
30

recognizing-semantic-differences

Code for the paper "Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents"
Python
2
star
31

swiss-german-text-encoders

Code for the paper "Modular Adaptation of Multilingual Encoders to Written Swiss German Dialect"
Python
2
star
32

CoNTra_corpora

Collection of corpora built in the project Rich Context in Neural Machine Translation (2017-2020)
1
star
33

RANLP2021-German-ATS

Shell
1
star
34

daikon

Simple encoder-decoder neural machine translation written in tensorflow
Python
1
star
35

SockUeye

Vue
1
star
36

RumantschCorpora

1
star
37

SockAPeye

Python
1
star
38

understanding-ctx-aug

Code for the 2023 ACL Findings paper, Uncovering Hidden Consequences of Pre-training Objectives in Sequence-to-Sequence Models (Kew & Sennrich, 2023)
Jupyter Notebook
1
star
39

romanisation-transfer

Code for the Paper "On Romanization for Model Transfer Between Scripts in Neural Machine Translation"
Mathematica
1
star