JusperLee/Deep-Clustering-for-Speech-Separation

Stars
121
Rank 293,924 (Top 6 %)
Language
Python
Created almost 5 years ago
Updated over 4 years ago

JusperLee/Deep-Clustering-for-Speech-Separation

JusperLee

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Pytorch implements Deep Clustering: Discriminative Embeddings For Segmentation And Separation

Deep Clustering for Speech Separation

Deep clustering in the field of speech separation implemented by pytorch

Demo Pages: Results of pure speech separation model

Hershey J R, Chen Z, Le Roux J, et al. Deep clustering: Discriminative embeddings for segmentation and separation[C]//2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2016: 31-35.

Requirement

Pytorch 1.3.0
librosa 0.7.1
PyYAML 5.1.2

Code writing log

2019-12-27 Friday. It is currently being refined and is not yet complete.

2020-01-02 Thursday. The training code is currently complete and the code bug is being tested.

Training steps

First, you can use the create_scp script to generate training and test data scp files.

python create_scp.py

Then, in order to reduce the mismatch of training and test environments. Therefore, you need to run the util script to generate a feature normalization file (CMVN).

python ./utils/util.py

Finally, use the following command to train the network.

python train.py -opt ./option/train.yml

Inference steps

Use the following command to start testing the model

python test.py -scp 1.scp -opt ./option/train.yml -save_file ./result

You can use the this code to calculate the SNR scores.

Thanks

Pytorch Template

Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

Conv-TasNet

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement

Dual-Path-RNN-Pytorch

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch

TDANet

An efficient speech separation method

Looking-to-Listen-at-the-Cocktail-Party

Executable code based on Google articles

AFRCNN-For-Speech-Separation

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network

LibriSpace

SPMamba

IIANet

This is the demo of our paper "IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation".

Calculate-SNR-SDR

Script to calculate SNR and SDR using python

LRS3-For-Speech-Separation

Multi-modal speech separation task data generation script on LRS3 data set.

CTCNet

An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits

UtterancePIT-Speech-Separation

According to funcwj's uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.

AV-ConvTasNet

Unofficial Time Domain Audio Visual Speech Separation Implementation

Deep-Encoder-Decoder-Conv-TasNet

A PyTorch implementation of " AN EMPIRICAL STUDY OF CONV-TASNET "

DANet-For-Speech-Separation

Pytorch implement of DANet For Speech Separation

S4M

Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models

Look2hear

A toolkit for researchers in the multimodal sound separation.

speechbrain-docs-zh-cn

SpeechBrain中文文档

Arxiv-New-Paper-Server

Arxiv automatically obtains the latest article service.

My-Script-For-Audio-Process

Some convenient scripts for your own use

Jupyter Notebook

ExamOnline

This is a complete online exam system

Apollo

Music repair method to convert lossy MP3 compressed music to lossless music.

WeChatApp

Complete code of WeChat Mini Program

player

Android Homework(3)

GrabCut

Grass

ELF-SR

Time

My Android Project

Accelerator

Openmp Accelerator

Deep-Learning

Learn to deep learning the code of your own records.

JusperLee

jusperlee.github.io

Souhu-Competition-Dazuoye

TFACM

BigData-Homework-Yanwaizhiyi

RTFS-Net

Deep-learning-course

Store some necessary files

audio-paper-daily