YerevaNN/Spoken-language-identification

Stars
233
Rank 172,230 (Top 4 %)
Language
Python
License
MIT License
Created about 9 years ago
Updated almost 7 years ago

YerevaNN/Spoken-language-identification

YerevaNN

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Spoken language identification with deep learning

Spoken language identification with deep learning

Read more in the following blog posts:

About TopCoder contest and our CNN-based solution implemented in Caffe (October 2015)
About combining CNN and RNN using Theano/Lasagne (June 2016)

Theano/Lasagne models are here. The basic steps to run them are:

Download the dataset from here or use your own dataset.
Create spectrograms for recording using create_spectrograms.py or augment_data.py. The latter will also augment the data by randomly perturbing the spectrograms and cropping a random interval of length 9s from the recording.
Create listfiles for training set and validation set, where each row of the a listfile describes one example and has 2 values seperated by a comma. The first one is the name of the example, the second one is the label (counting starts from 0). A typical listfile will look like this.
Change the png_folder and listfile paths in theano/main.py.
Run theano/main.py.

mimic3-benchmarks

Python suite to construct benchmark machine learning datasets from the MIMIC-III 💊 clinical database.

Dynamic-memory-networks-in-Theano

Implementation of Dynamic memory networks by Kumar et al. http://arxiv.org/abs/1506.07285

A-Guide-to-Deep-Learning

📚 A detailed guide to deep learning: http://yerevann.com/a-guide-to-deep-learning/

R-NET-in-Keras

Open R-NET (hy` առնետ 🐁) implementation and detailed analysis: https://git.io/vd8dx

translit-rnn

Automatic transliteration with LSTM

WARP

Code for ACL'2021 paper WARP 🌀 Word-level Adversarial ReProgramming. Outperforming `GPT-3` on SuperGLUE Few-Shot text classification. https://aclanthology.org/2021.acl-long.381/

DIIN-in-Keras

Reproducing Densely Interactive Inference Network in Keras

neural-colorizer

Convolutional autoencoder to colorize greyscale images

BARTSmiles

BARTSmiles, generative masked language model for molecular representations

ChemLactica

Fine-tuning Galactica and Gemma to operate on SMILES. Integrates into a molecular optimization algorithm.

Jupyter Notebook

BioRelEx

🧬 BioRelEx: Biological Relation Extraction Benchmark @ ACL BioNLP Workshop 2019

dmn-ui

UI for Dynamic Memory Networks

yerevann.github.io

SciERC

A fork of https://bitbucket.org/luanyi/scierc/src

PARASITE

🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as a part of the `en-ru` winner submission of WMT20 Biomedical Translation Task.

Relation-extraction-pipeline

Pipelines that combine different modules to perform relation extraction

RaSoR-in-Tensorflow

The implementation of one of the SQuAD solutions

armtreebank

Armenian Treebank http://armtreebank.yerevann.com/

word2vec-armenian-wiki

Testing word2vec on Armenian Wikipedia

Caffe-python-tools

Some tools written in Python to work with Caffe

SSL-playground

zsee

Zero Shot Event Extraction - Making pretrained sentence encoders more multilingual and language-agnostic. Works best (at the moment) with YerevaNN's internal version of allennlp.

Molecular_Generation_with_GDB13

Jupyter Notebook

Kaggle-diabetic-retinopathy-detection

Scripts used in Kaggle Diabetic retionpathy detection contest by YerevaNN team

NLOS-Localization-WAIR-D

pmi

Fast pointwise mutual information implementation in C++

RelationClassification

dmn-docker

Dockerfile for starting DMN with UI

hyper-language-identification

amr_seq2seq

dom-gen-failure-modes

char-rnn-constitution

NN-in-Armenian

Presentation and other stuff on Neural networks in Armenian

JointUD

🚬 JointUD - Universal Dependencies | Part-of-Speech tagging, Morphological parsing and Lemmatization

BioER

Biological entity recognition

Jupyter Notebook

yarx

YARX - Yet Another Relation eXtraction framework, based on SciIE architecture and AllenNLP framework

docker-cudnn-theano

Docker image for Theano with Ubuntu 16.04 + CUDA 8.0 + cuDNN 7