sooftware/attentions

Stars
396
Rank 104,714 (Top 3 %)
Language
Python
License
MIT License
Created about 4 years ago
Updated 11 months ago

sooftware/attentions

sooftware

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

PyTorch implementation of some attentions for Deep Learning Researchers.

An Apache 2.0 PyTorch implementation of some attentions for Deep Learning Researchers.

Intro

attentions provides some attentions used in natural language processing using pytorch.
these attentions can used in neural machine translation, speech recognition, image captioning etc...

attention allows to attend to different parts of the source sentence at each step of the output generation.
Instead of encoding the input sequence into a single fixed context vector, we let the model learn how to generate a context vector for each output time step.

Implementation list

Name	Citation
Additive Attention	Bahdanau et al., 2015
Dot-Product Attention	Luong et al., 2015
Location-Aware (Location Sensitive) Attention	Chorowski et al., 2015
Scaled Dot-Product Attention	Vaswani et al., 2017
Multi-Head Attention	Vaswani et al., 2017
Relative Multi-Head Self Attention	ZihangDai et al., 2019

Troubleshoots and Contributing

If you have any questions, bug reports, and feature requests, please open an issue on Github.
or Contacts [email protected] please.

I appreciate any kind of feedback or contribution. Feel free to proceed with small issues like bug fixes, documentation improvement. For major contributions and new features, please discuss with the collaborators in corresponding issues.

Code Style

I follow PEP-8 for code style. Especially the style of docstrings is important to generate documentation.

Author

Soohwan Kim @sooftware
Contacts: [email protected]

conformer

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

k-startups

List of tech startups in South Korea. (Republic of Korea)

Korean-PLM

List of Korean pre-trained language models.

ksponspeech

Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.

pytorch-lr-scheduler

PyTorch implementation of some learning rate schedulers for deep learning researcher.

Speech-Recognition-Tutorial

한국어 음성인식 튜토리얼

nlp-tasks

Natural Language Processing Tasks and Examples.

speech-transformer

Transformer implementation speciaized in speech recognition tasks using Pytorch.

RNN-Transducer

PyTorch implementation of RNN-Transducer(RNN-T).

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

End-to-End-Speech-Recognition-Models

PyTorch implementation of automatic speech recognition models.

transformer

A PyTorch Implementation of "Attention Is All You Need"

luna-transformer

A PyTorch Implementation of the Luna: Linear Unified Nested Attention

jasper

PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)

Naver-AI-Hackathon-Speech

2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib

deepspeech2

PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)

seq2seq

PyTorch implementation of the RNN-based sequence-to-sequence architecture.

tacotron2

Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.

speech-paper-review

Review of papers I read

speech-recognition-papers

Awesome Automatic Speech Recognition (ASR) paper collection

Fairseq-Listen-Attend-Spell

A Fairseq implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.

char-rnnlm

Character-level Recurrent Neural Network Language Model (rnnlm) implement in Pytorch.

accelerate-asr

Modular and extensible speech recognition library leveraging accelerate and hydra.

sooftware

sooftware.io

My personal blog powered by react (gatsby)

Speech-Note

🎧 Speech study records repository

Audio-Signal-Processing

Audio Signal Preocessing: pcm2wav, wav2pcm, feature extraction, augment, delete silence etc

TIL

Today I Learned

generate-sec-dataset

Generate space error correction dataset

sooftware.github.io

KoSpeech-Flask

KoSpeech Flask Web Application