• Stars
    star
    264
  • Rank 155,103 (Top 4 %)
  • Language
    Python
  • License
    MIT License
  • Created over 7 years ago
  • Updated 4 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Collection of EM algorithms for blind source separation of audio signals

Blind Source Separation (BSS) algorithms

Build Status Azure DevOps tests Azure DevOps coverage MIT License

This repository covers EM algorithms to separate speech sources in multi-channel recordings.

In particular, the repository contains methods to integrate Deep Clustering (a neural network-based source separation algorithm) with a probabilistic spatial mixture model as proposed in the Interspeech paper "Tight integration of spatial and spectral features for BSS with Deep Clustering embeddings" presented at Interspeech 2017 in Stockholm.

@InProceedings{Drude2017DeepClusteringIntegration,
  Title                    = {Tight integration of spatial and spectral features for {BSS} with Deep Clustering embeddings},
  Author                   = {Drude, Lukas and and Haeb-Umbach, Reinhold},
  Booktitle                = {INTERSPEECH 2017, Stockholm, Sweden},
  Year                     = {2017},
  Month                    = {Aug}
}

Installation

Install it directly from source

git clone https://github.com/fgnt/pb_bss.git
cd pb_bss
pip install --editable .

We expect that numpy, scipy and cython are installed (e.g. conda install numpy scipy cython or pip install numpy scipy cython).

The default option is to install only the necessary dependencies. When you want to run the tests or execute the notebooks, use the one of the following commands for the installation:

pip install --editable .[all]  # Without a whitespace between `.` and `[all]`
pip install git+https://github.com/fgnt/pb_bss.git#egg=pb_bss[all]

More Repositories

1

nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation
Python
473
star
2

nn-gev

Neural network supported GEV beamformer
Python
191
star
3

pb_chime5

Speech enhancement system for the CHiME-5 dinner party scenario
Python
108
star
4

sms_wsj

SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
Python
101
star
5

meeteval

MeetEval - A meeting transcription evaluation toolkit
Python
72
star
6

padertorch

A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an emphasis on speech processing.
Python
71
star
7

pb_sed

Paderborn Sound Event Detection
Python
68
star
8

ci_sdr

Python
51
star
9

mms_msg

Multipurpose Multi Speaker Mixture Signal Generator
Python
42
star
10

paderbox

Paderbox: A collection of utilities for audio / speech processing
Python
37
star
11

graph_pit

Python
32
star
12

sed_scores_eval

Python
26
star
13

lazy_dataset

lazy_dataset: Process large datasets as if it was an iterable.
Python
17
star
14

LatticeWordSegmentation

Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model
C++
17
star
15

paderwasn

Paderwasn is a collection of methods for acoustic signal processing in wireless acoustic sensor networks (WASNs).
Python
13
star
16

nhpylm

Python bindings for a c++ based implementation of the Nested Hierarchical Pitman-Yor Language model
C++
13
star
17

sins

Python
8
star
18

python_crashkurs

Jupyter Notebook
7
star
19

oaf

Jupyter notebooks for the lecture "Optimal and adaptive filters"
Jupyter Notebook
7
star
20

mnist

Makefile
6
star
21

dlp_mpi

Python
5
star
22

nachrichtentechnik

Jupyter noteboooks for the lecture "Nachrichtentechnik" (communications engineering) with explanations in german.
Jupyter Notebook
4
star
23

libriwasn

Tools and scripts for the LibriWASN data set from zenodo
Python
3
star
24

ham_radio

Python
3
star
25

speaker_reassignment

Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
Python
3
star
26

upb_audio_tagging_2019

UPB system for the Kaggle competition "Freesound Audio Tagging 2019"
Python
2
star
27

asnsig

ASNSIG – A Signal Generator for Ad-Hoc Acoustic Sensor Networks in Smart Home Environments
Python
2
star
28

2019_ad_xidian

HTML
1
star