Discover ictnlp/PTE-NMT Open Source project by ICTNLP (@ictnlp)

Stars
15
Rank 1,371,379 (Top 28 %)
Language
Python
License
Other
Created over 3 years ago
Updated over 3 years ago

ictnlp

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Source code for the NAACL 2021 paper: Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation

LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python

1,889

StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Python

882

BayLing

“百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型，具有优越的英语/中文能力，在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced language alignment, showing superior capability in English/Chinese generation, instruction following and multi-turn interaction.

Python

294

TruthX

Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"

Python

105

awesome-transformer

A collection of transformer's guides, implementations and variants.

102

DialoFlow

Code for ACL 2021 main conference paper "Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances".

Python

NAST-S2x

A fast speech-to-any translation model that supports simultaneous decoding and offers 28× speedup.

Python

DASpeech

Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".

Python

DSTC8-AVSD

We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD) paper "Bridging Text and Video: A Universal Multimodal Transformer for Video-Audio Scene-Aware Dialog".

Python

OR-NMT

Source Code for ACL2019 paper <Bridging the Gap between Training and Inference for Neural Machine Translation>

Python

STEMM

Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

Python

DiSeg

Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"

Python

BoN-NAT

Python

Seq-NAT

Source code for <Sequence-Level Training for Non-Autoregressive Neural Machine Translation>.

Python

PLUVR

Code for ACL 2022 main conference paper "Neural Machine Translation with Phrase-Level Universal Visual Representations".

Python

HMT

Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"

Python

ComSpeech

Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".

Python

TLAT-NMT

Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>.

Python

NMLA-NAT

Code for NeurIPS 2022 Spotlight paper " Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"

Python

AIH

Code for Findings of ACL 2021 paper "Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain Chatbot Consistency".

Python

RSI-NAT

Source code for "Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation"

Python

DiverseNMT

Source code for the AAAI 2020 long paper <Modeling Fluency and Faithfulness for Diverse Neural Machine Translation>.

Python

CMOT

Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"

Python

LNMT-CA

Code for EMNLP 2022 main conference paper "Low-resource Neural Machine Translation with Cross-modal Alignment".

Jupyter Notebook

CRESS

Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".

Python

ITST

Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"

Python

SiLLM

SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a traditional SiMT model for policy-decision to achieve SiMT through collaboration.

Python

TACS

Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts

Python

BT4ST

Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".

Python

Dual-Path

Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"

Python

NA-MNMT

Source code for "Importance-based Neuron Allocation for Multilingual Neural Machine Translation"

Python

Convex-Learning

Code for NeurIPS 2023 paper "Beyond MLE: Convex Learning for Text Generation"

Python

GMA

Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"

Python

PCFG-NAT

Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".

Cuda

NAST

Official implementation for EMNLP 2023 paper "Non-autoregressive Streaming Transformer for Simultaneous Translation"

Python

COKD

Code for ACL 2022 main conference paper "Overcoming Catastrophic Forgetting beyond Continual Learning: Balanced Training for Neural Machine Translation".

JavaScript

MoE-Waitk

Code for EMNLP 2021 oral paper "Universal Simultaneous Machine Translation with Mixture-of-Experts Wait-k Policy"

Python

DDRS-NAT

Code for NAACL2022 main conference paper "One Reference Is Not Enough: Diverse Distillation with Reference Selection for Non-Autoregressive Translation"

Python

SU4MT

Code for EMNLP 2023 paper "Enhancing Neural Machine Translation with Semantic Units"

Python

SeerForcingNMT

Source code for "Guiding Teacher Forcing with Seer Forcing for Neural Machine Translation"

Zero-MNMT

Python

Multiscale-Contextualization

ACL2024 Integrating Multi-scale Contextualized Information for Byte-based Neural Machine Translation

Python

Wait-info

Source code for our EMNLP 2022 paper "Wait-info Policy: Balancing Source and Target at Information Level for Simultaneous Machine Translation"

Python

BS-SiMT

Source code for our ACL 2023 paper "Learning Optimal Policy for Simultaneous Machine Translation via Binary Search"

Python

CTC-S2UT

Code for ACL 2024 findings paper "CTC-based Non-autoregressive Textless Speech-to-Speech Translation"

GS4NMT

source code for "Greedy Search with Probabilistic N-gram Matching for Neural Machine Translation"

Python

DST

DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently

Python

LFR-NMT

Source code for the EMNLP 2022 paper "Continual Learning of Neural Machine Translation within Low Forgetting Risk Regions"

Python

CPDecoder

optimize the decoder of the neural machine translation model by the cube pruning algorithm

Python

nar-tutorial

Slides for NAR tutorial

CAPT

Code for EMNLP 2022 main conference paper "Counterfactual Data Augmentation via Perspective Transition for Open-Domain Dialogues".

PED-SiMT

Code for "Turning Fixed to Adaptive: Integrating Post-Evaluation into Simultaneous Machine Translation"

Python

Glance-SiMT

Python

SAMMT

Code for EMNLP 2023 paper "Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation"

Python

RN4NMT

source code for "Refining Source Representations with Relation Networks for Neural Machine Translation".

PLSQL

Seg2Seg

Tailored-Ref

Code for EMNLP 2023 paper "Simultaneous Machine Translation with Tailored Reference"

SemLing-MNMT

Code for ACL 2024 paper "Improving Multilingual Neural Machine Translation by Utilizing Semantic and Linguistic Features".

Python

ComSpeech-Site

JavaScript

corpus_NKD

data repo for "Knowledge Diffusion for Neural Dialogue Generation"

TruthX-site

HTML

StreamSpeech-site

JavaScript

Rephraser-NAT

Code for AAAI 2023 paper "Rephrasing the Reference for Non-Autoregressive Machine Translation"

Auto-RAG

Python

ictnlp/PTE-NMT

ictnlp

Reviews

Repository Details

More Repositories