• Stars
    star
    157
  • Rank 229,851 (Top 5 %)
  • Language
  • License
    MIT License
  • Created almost 2 years ago
  • Updated 11 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

List of Korean pre-trained language models.

Korean-PLM (Korean Pre-trained Language Models)

곡개된 ν•œκ΅­μ–΄ μ‚¬μ „ν•™μŠ΅ λͺ¨λΈμ„ κΈ°λ‘ν•©λ‹ˆλ‹€. 크게 μ•„λž˜ 3개의 λͺ¨λΈ κ³„μ—΄λ‘œ κ΅¬λΆ„ν–ˆμœΌλ©°, λͺ¨λΈ μ‚¬μ΄μ¦ˆλŠ” μ •ν™•ν•˜μ§€ μ•Šμ„ 수 μžˆμŠ΅λ‹ˆλ‹€.

  • Encoder Model (BERT 계열)
  • Decoder Model (GPT 계열)
  • Encoder-Decoder Model (Seq2seq 계열)

Encoder Model (BERT 계열)

Model Name Size Link
LASSL BERT Small (10M) link
LASSL RoBERTa Small (10M) link
TUNiB ELECTRA (Ko) Small (10M) link
TUNiB ELECTRA (Ko-En) Small (10M) link
KoELECTRA Small (10M) link
KcELECTRA Small (10M) link
Soongsil-BERT Small (10M) link
Ko-CHAR-ELECTRA Small (10M) link
Distill-KoBERT 30M link
KoBERT Base (110M) link
KorBERT Base (110M) link
HanBERT Base (110M) link
KcBERT Base (110M) link
Soongsil-BERT Base (110M) link
KLUE-RoBERTa Base (110M) link
KoELECTRA Base (110M) link
KcELECTRA Base (110M) link
TUNiB-ELECTRA (Ko) Base (110M) link
TUNiB-ELECTRA (Ko-En) Base (110M) link
LMKor Albert Base (110M) link
LMKor Bert Base (110M) link
LMKor Funnel Base (110M) link
LMKor ELECTRA Base (110M) link
KalBERT Base (110M) link
LASSL BERT Base (110M) link
KoBigBird Base (110M) link
Ko-CHAR-BERT Base (110M) link
Ko-CHAR-ELECTRA Base (110M) link
BrainSBERT Base (110M) link
KPFBERT Base (110M) link
KcBERT Large (340M) link
KLUE-RoBERTa Large (340M) link

Decoder Model (GPT 계열)

Model Name Size Link
LASSL GPT2 Small (20M) link
LMKor KoGPT2 Base (110M) link
SKT KoGPT2 Base (110M) link
SKT KoGPT Trinity 1.2B link
EleutherAI Polyglot-Ko 1.3B link
EleutherAI Polyglot-Ko 3.8B link
EleutherAI Polyglot-Ko 5.8B link
KakaoBrain KoGPT 6B link

Encoder-Decoder Model (Seq2seq 계열)

Model Name Size Link
Cosmoquester BART Mini (13M) link
KE-T5 (Ko-En) Small (60M) link
KE-T5 (Ko) Small (60M) link
LMKor T5 Small (60M) link
Cosmoquester BART Small (40M) link
LMKor BERT Shared Base (130M) link
SKT KoBART Base (130M) link
Cosmoquester BART Base (130M) link
Kolang-T5 Base (220M) link
KcT5 Base (220M) link
KE-T5 (Ko-En) Base (220M) link
KE-T5 (Ko) Base (220M) link
Asian BART (Ko) Base (360M) link
pko-t5 (Ko) Base (250M) link
Wisenut KoT5 (Ko) Base (220M) link
mBART Large (610M) link
KE-T5 (Ko-En) Large (770M) link
KE-T5 (Ko) Large (770M) link
pko-t5 (Ko) Large (800M) link

More Repositories

1

conformer

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Python
701
star
2

kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Python
574
star
3

attentions

PyTorch implementation of some attentions for Deep Learning Researchers.
Python
396
star
4

k-startups

List of tech startups in South Korea. (Republic of Korea)
206
star
5

ksponspeech

Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.
Python
76
star
6

pytorch-lr-scheduler

PyTorch implementation of some learning rate schedulers for deep learning researcher.
Python
67
star
7

Speech-Recognition-Tutorial

ν•œκ΅­μ–΄ μŒμ„±μΈμ‹ νŠœν† λ¦¬μ–Ό
60
star
8

nlp-tasks

Natural Language Processing Tasks and Examples.
Python
59
star
9

speech-transformer

Transformer implementation speciaized in speech recognition tasks using Pytorch.
Python
56
star
10

RNN-Transducer

PyTorch implementation of RNN-Transducer(RNN-T).
Python
51
star
11

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Python
42
star
12

End-to-End-Speech-Recognition-Models

PyTorch implementation of automatic speech recognition models.
Python
41
star
13

transformer

A PyTorch Implementation of "Attention Is All You Need"
Python
37
star
14

luna-transformer

A PyTorch Implementation of the Luna: Linear Unified Nested Attention
Python
35
star
15

jasper

PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)
Python
29
star
16

Naver-AI-Hackathon-Speech

2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib
Python
25
star
17

deepspeech2

PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)
Python
19
star
18

seq2seq

PyTorch implementation of the RNN-based sequence-to-sequence architecture.
Python
19
star
19

tacotron2

Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.
Python
17
star
20

speech-paper-review

Review of papers I read
15
star
21

speech-recognition-papers

Awesome Automatic Speech Recognition (ASR) paper collection
15
star
22

Fairseq-Listen-Attend-Spell

A Fairseq implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
Python
14
star
23

char-rnnlm

Character-level Recurrent Neural Network Language Model (rnnlm) implement in Pytorch.
Python
11
star
24

accelerate-asr

Modular and extensible speech recognition library leveraging accelerate and hydra.
Python
10
star
25

sooftware

10
star
26

sooftware.io

My personal blog powered by react (gatsby)
TypeScript
9
star
27

Speech-Note

🎧 Speech study records repository
C
7
star
28

Audio-Signal-Processing

Audio Signal Preocessing: pcm2wav, wav2pcm, feature extraction, augment, delete silence etc
Python
7
star
29

TIL

Today I Learned
Python
6
star
30

generate-sec-dataset

Generate space error correction dataset
Python
6
star
31

sooftware.github.io

SCSS
6
star
32

KoSpeech-Flask

KoSpeech Flask Web Application
Python
3
star