Norbert

Norbert is an implementation of multichannel Wiener filter, that is a very popular way of filtering multichannel audio for several applications, notably speech enhancement and source separation.

This filtering method assumes you have some way of estimating power or magnitude spectrograms for all the audio sources (non-negative) composing a mixture. If you only have a model for some target sources, and not for the rest, you may use norbert.residual_model to let Norbert create a residual model for you.

Given all source spectrograms and the mixture Time-Frequency representation, this repository can build and apply the filter that is appropriate for separation, by optimally exploiting multichannel information (like in stereo signals). This is done in an iterative procedure called Expectation Maximization, where filtering and re-estimation of the parameters are iterated.

From a beginner's perspective, all you need to do is often to call norbert.wiener with the mix and your spectrogram estimates. This should handle the rest.

From a more expert perspective, you will find the different ingredients from the EM algorithm as functions in the module as described in the API documentation

Installation

pip install norbert

Usage

Asssuming a complex spectrogram X, and a (magnitude) estimate of a target V to be extracted from the spectrogram, performing the multichannel wiener filter is as simple as this:

X = stft(audio)
V = model(X)
Y = norbert.wiener(V, X)
estimate = istft(Y)

How to contribute

norbert is a community focused project, we therefore encourage the community to submit bug-fixes and requests for technical support through github issues. For more details of how to contribute, please follow our CONTRIBUTING.md.

Authors

Antoine Liutkus, Fabian-Robert Stöter

Citation

If you want to cite the Norbert software package, please use the DOI from Zenodo:

License

MIT

sigsep/norbert

sigsep

Reviews

Repository Details