yjlolo/vae-audio

Stars: 106
Rank: 325,871 (Top 7%)
Language: Python
License: MIT License
Created over 5 years ago
Updated over 4 years ago

Repository Details

Variational auto-encoders for audio

UPDATE (20.5.20): I decided to isolate the code for reproducing the paper "Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders" (up from here) from this repo.

vae-audio

For variational auto-encoders (VAEs) and audio/music lovers, based on PyTorch.

Overview

The repo is under construction.

The project is built to facilitate research on using VAEs to model audio. It provides:

vanilla VAE
Gaussian mixture VAE
vector-quantized VAE
customizable model options
audio feature extraction
model testing and latent space visualization
end-to-end audio feature extraction and model training
higher-level wrappers for easier use
easier installation
documentation
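
As a rough illustration of the objective behind the vanilla VAE item above, the sketch below computes the reparameterized sample and the two terms of the negative ELBO for a diagonal Gaussian posterior. It is not taken from this repo: the function names are hypothetical, a unit-variance Gaussian decoder is assumed, and plain Python lists stand in for the PyTorch tensors the project would actually use.

```python
import math
import random


def reparameterize(mu, logvar, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, 1), one draw per dimension."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, logvar)]


def kl_to_standard_normal(mu, logvar):
    """KL(N(mu, diag(exp(logvar))) || N(0, I)), summed over dimensions."""
    return -0.5 * sum(1.0 + lv - m * m - math.exp(lv) for m, lv in zip(mu, logvar))


def squared_error(x, x_hat):
    """Reconstruction term for an assumed unit-variance Gaussian decoder (up to a constant)."""
    return 0.5 * sum((a - b) ** 2 for a, b in zip(x, x_hat))


def negative_elbo(x, x_hat, mu, logvar):
    """Loss to minimize: reconstruction error plus KL regularizer."""
    return squared_error(x, x_hat) + kl_to_standard_normal(mu, logvar)


rng = random.Random(0)  # fixed seed so the sample is repeatable
z = reparameterize([0.0, 1.0], [0.0, -2.0], rng)
kl_zero = kl_to_standard_normal([0.0, 0.0], [0.0, 0.0])     # posterior equals the prior
kl_shifted = kl_to_standard_normal([1.0, 0.0], [0.0, 0.0])  # unit mean shift in one dimension
```

The KL term vanishes when the posterior matches the standard normal prior and grows as the mean or variance moves away from it, which is what regularizes the latent space.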

The project structure is based on PyTorch Template.

Requirements

torch 1.1.0
librosa 0.6.3

Usage

Audio Feature Extraction

Define customized Dataset classes in dataset/datasets.py
Run python dataset/audio_transform.py -c your_config_of_audio_transform.json to compute audio features (e.g., spectrograms)
Define customized DataLoader classes in data_loader/data_loaders.py
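
To make "audio features (e.g., spectrograms)" concrete, here is a minimal sketch of a magnitude spectrogram computed with a Hann-windowed short-time Fourier transform. It is not the code in dataset/audio_transform.py: the function names, frame size, and hop size are assumptions chosen for illustration, and the standard library is used only so the example runs anywhere. In practice the repo's listed dependency, librosa, would likely be the tool for this step.

```python
import cmath
import math


def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]


def magnitude_spectrogram(signal, n_fft=64, hop=32):
    """Return a list of frames, each holding n_fft // 2 + 1 magnitudes."""
    window = hann(n_fft)
    frames = []
    for start in range(0, len(signal) - n_fft + 1, hop):
        # Window the current frame before transforming it.
        chunk = [signal[start + i] * window[i] for i in range(n_fft)]
        spectrum = []
        for k in range(n_fft // 2 + 1):
            # Naive DFT for bin k (slow, but easy to follow).
            acc = sum(
                chunk[i] * cmath.exp(-2j * math.pi * k * i / n_fft)
                for i in range(n_fft)
            )
            spectrum.append(abs(acc))
        frames.append(spectrum)
    return frames


# Synthetic tone with 8 cycles per 64-sample frame, so energy should peak in bin 8.
tone = [math.sin(2 * math.pi * 8 * i / 64) for i in range(256)]
spec = magnitude_spectrogram(tone)
peak_bins = [frame.index(max(frame)) for frame in spec]
```

Each frame of `spec` is one column of the spectrogram; a customized Dataset class would typically return such frames (or their log-scaled version) as model input.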

Model Training

Run python train.py -c your_config_of_model_train.json
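
The training command takes a JSON config file. The sketch below writes and reloads a config of that kind; it is loosely modeled on the layout common in PyTorch Template-style projects, but every key, class name, and value is a hypothetical placeholder, not the schema train.py in this repo actually reads.

```python
import json
import os
import tempfile

# Hypothetical config: all keys and values are illustrative assumptions.
config = {
    "name": "vanilla_vae_example",
    "arch": {"type": "VanillaVAE", "args": {"latent_dim": 16}},
    "data_loader": {"type": "SpecDataLoader", "args": {"batch_size": 32, "shuffle": True}},
    "optimizer": {"type": "Adam", "args": {"lr": 0.001}},
    "trainer": {"epochs": 100, "save_dir": "saved/"},
}


def write_config(cfg, path):
    """Serialize the config dictionary to a JSON file."""
    with open(path, "w") as f:
        json.dump(cfg, f, indent=2)


def load_config(path):
    """Read a JSON config file back into a dictionary."""
    with open(path) as f:
        return json.load(f)


# Write to a temporary directory so the example leaves the working tree untouched.
config_path = os.path.join(tempfile.mkdtemp(), "your_config_of_model_train.json")
write_config(config, config_path)
loaded = load_config(config_path)
```

A file produced this way would then be passed to the training script with the -c flag, as in the command above.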

To Be Continued
