@THUNLP-MT
  • Stars
    star
    4,217
  • Global Org. Rank 5,207 (Top 2 %)
  • Registered over 5 years ago
  • Most used languages
    Python
    87.1 %
    TeX
    6.5 %
    C
    3.2 %
    JavaScript
    3.2 %
  • Location 🇨🇳 China
  • Country Total Rank 1,350
  • Country Ranking
    TeX
    13
    Python
    553
    C
    6,100

Top repositories

1

MT-Reading-List

A machine translation reading list maintained by Tsinghua Natural Language Processing Group
TeX
2,410
star
2

THUMT

An open-source neural machine translation toolkit developed by Tsinghua Natural Language Processing Group
Python
691
star
3

TG-Reading-List

A text generation reading list maintained by Tsinghua Natural Language Processing Group.
TeX
444
star
4

Document-Transformer

Improving the Transformer translation model with document-level context
Python
171
star
5

dyMEAN

This repo contains the codes for our paper "End-to-End Full-Atom Antibody Design"
Python
79
star
6

MEAN

This repo contains the codes for our paper Conditional Antibody Design as 3D Equivariant Graph Translation.
Python
74
star
7

Mask-Align

Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021
Python
58
star
8

THUCC

An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group
Python
48
star
9

PS-VAE

This repo contains the codes for our paper: Molecule Generation by Principal Subgraph Mining and Assembling.
Python
29
star
10

Template-NMT

Python
22
star
11

PLM4MT

Code for our work "MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators" in ACL 2022
Python
20
star
12

UCE4BT

Python
19
star
13

MT-Toolkit-List

A list of machine translation open-source toolkits maintained by Tsinghua Natural Language Processing Group
13
star
14

PR4NMT

Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization
Python
12
star
15

L2Copy4APE

Learning to Copy for Automatic Post-Editing (EMNLP 2019)
Python
11
star
16

TRICE

Code for our paper "Transfer Learning for Sequence Generation: from Single-source to Multi-source" in ACL 2021.
Python
11
star
17

DirectQuote

A Dataset for Direct Quotation Extraction and Attribution in News Articles.
11
star
18

SKR

Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)
Python
11
star
19

UBiLexAT

An Unsupervised Bilingual Lexicon Inducer From Non-Parallel Data by Adversarial Training
Python
8
star
20

PromptGating4MCTG

This is the repo for our work “An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation” (ACL 2023).
Python
8
star
21

PGRA

Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks
Python
7
star
22

DBKD-PLM

Codebase for ACL 2023 conference long paper Bridging the Gap between Decision and Logits in Decision-based Knowledge Distillation for Pre-trained Language Models.
Python
6
star
23

FIIG

Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions (EMNLP 2023 Findings)
6
star
24

BiLex

A Bilingual Lexicon Inducer From Non-Parallel Data
C
5
star
25

UBiLexEMD

An Unsupervised Bilingual Lexicon Inducer From Non-Parallel Data by Earth Mover's Distance Minimization
Python
5
star
26

SelfSupervisedQE

Self-Supervised Quality Estimation for Machine Translation
Python
5
star
27

symbol2language

Speak It Out: Solving Symbol-Related Problems with Symbol-to-Language Conversion for Language Models
5
star
28

TRAN

This is the repo for our work “Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation” (EMNLP 2023).
Python
5
star
29

Voting4SC

Modeling Voting for System Combination in Machine Translation (IJCAI 2020)
Python
4
star
30

ktnmt

Python
4
star
31

CODIS

Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".
JavaScript
4
star
32

ModelCompose

3
star
33

MT-Dataset-List

A list machine translation datasets maintained by Tsinghua Natural Language Processing Group
2
star
34

MetaRanking

Official code repo for our work "Meta Ranking: Less Capable Language Models are Capable for Single Response Judgement".
2
star
35

DEEM

2
star
36

ROGO

This repo contains the codes for our work “Restricted orthogonal gradient projection for continual learning”.
Python
1
star
37

Brote

Python
1
star
38

Transformer-DMB

Codes for our paper "Dynamic Multi-Branch Layers for On-Device Neural Machine Translation" in TASLP
Python
1
star
39

CKD

Continual Knowledge Distillation for Neural Machine Translation
Python
1
star