DEEP LEARNING FOR MUSIC GENERATION

This repository is maintained by Carlos Hernández-Oliván([email protected]) and it presents the State of the Art of Music Generation. Most of these references (previous to 2022) are included in the review paper "Music Composition with Deep Learning: A Review". The authors of the paper want to thank Jürgen Schmidhuber for his suggestions.

Make a pull request if you want to contribute to this references list.

You can download a PDF version of this repo here: README.pdf

All the images belong to their corresponding authors.

Algorithmic Composition
- 1992
- Books
Neural Network Architectures
Deep Learning Models for Symbolic Music Generation
- 2023
- 2022
- 2021
- 2020
- 2019
- 2018
- 2017
- 2016
- 2015
- 2002
- 1990s
- Books and Reviews
  - Books
  - Reviews
Deep Learning Models for Audio Music Generation
- 2023
- 2022
- 2021
- 2020
- 2017
Datasets
Journals and Conferences
Authors
Research Groups and Labs
Apps for Music Generation with AI
Other Resources

2. Algorithmic Composition

1992

HARMONET

Hild, H., Feulner, J., & Menzel, W. (1992). HARMONET: A neural net for harmonizing chorales in the style of JS Bach. In Advances in neural information processing systems (pp. 267-274). Paper

Books

Westergaard, P. (1959). Experimental Music. Composition with an Electronic Computer.
Todd, P. M. (1989). A connectionist approach to algorithmic composition. Computer Music Journal, 13(4), 27-43.
Cope, D. (2000). The algorithmic composer (Vol. 16). AR Editions, Inc..
Nierhaus, G. (2009). Algorithmic composition: paradigms of automated music generation. Springer Science & Business Media.
Müller, M. (2015). Fundamentals of music processing: Audio, analysis, algorithms, applications. Springer.
McLean, A., & Dean, R. T. (Eds.). (2018). The Oxford handbook of algorithmic music. Oxford University Press.

2. Neural Network Architectures

NN Architecture	Year	Authors	Link to original paper	Slides
Long Short-Term Memory (LSTM)	1997	Sepp Hochreiter, Jürgen Schmidhuber	http://www.bioinf.jku.at/publications/older/2604.pdf	LSTM.pdf
Convolutional Neural Network (CNN)	1998	Yann LeCun, Léon Bottou, YoshuaBengio, Patrick Haffner	http://vision.stanford.edu/cs598_spring07/papers/Lecun98.pdf
Variational Auto Encoder (VAE)	2013	Diederik P. Kingma, Max Welling	https://arxiv.org/pdf/1312.6114.pdf
Generative Adversarial Networks (GAN)	2014	Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio	https://arxiv.org/pdf/1406.2661.pdf
Transformer	2017	Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, Illia Polosukhin	https://arxiv.org/pdf/1706.03762.pdf
Diffusion Models	2015	Jascha Sohl-Dickstein, Eric A. Weiss, Niru Maheswaranathan, Surya Ganguli	https://arxiv.org/abs/1503.03585

3. Deep Learning Models for Music Generation

2023

RL-Chord

Ji, S., Yang, X., Luo, J., & Li, J. (2023). RL-Chord: CLSTM-Based Melody Harmonization Using Deep Reinforcement Learning. IEEE Transactions on Neural Networks and Learning Systems.

carlosholivan/DeepLearningMusicGeneration

carlosholivan

Reviews

Repository Details

DEEP LEARNING FOR MUSIC GENERATION

Table of Contents

2. Algorithmic Composition

1992

HARMONET

Books

2. Neural Network Architectures

3. Deep Learning Models for Music Generation

2023

RL-Chord

FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control

2022

Museformer

Bar Transformer

Symphony Generation with Permutation Invariant Language Model

Theme Transfomer

2021

Compound Word Transformer

Melody Generation from Lyrics

Music Generation with Diffusion Models

2020

Pop Musc Transfomer

Controllable Polyphonic Music Generation

MMM: Multitrack Music Generation

Transformer-XL

Transformer VAE

2019

TonicNet

LakhNES

R-Transformer

Maia Music Generator

Coconet: Counterpoint by Convolution

2018

Music Transformer - Google Magenta

Imposing Higher-level Structure in Polyphonic Music

MusicVAE - Google Magenta

2017

MorpheuS

Polyphonic GAN

BachBot - Microsoft

MuseGAN

Composing Music with LSTM

ORGAN

MidiNet

2016

DeepBach

Fine-Tuning with RL

C-RNN-GAN

SeqGAN

2002

Temporal Structure in Music

1980s - 1990s

Books and Reviews

Books

Reviews

4. Audio Generation

2023

Vall-E X

ERNIE Music

Multi-Source Diffusion Models

SingSong

AudioLDM

Mousai

Make-An-Audio

Noise2Music

Msanii

MusicLM

2022

Musika

AudioLM

2021

RAVE

2020

Jukebox - OpenAI

2017

MuseNet - OpenAI