  • Stars: 197
  • Rank: 197,722 (Top 4 %)
  • Language: Python
  • License: MIT License
  • Created almost 2 years ago
  • Updated over 1 year ago

Repository Details

Riffusion extension for AUTOMATIC1111's SD Web UI

Screenshot

Installation

  • Make sure ffmpeg is installed and the folder with the binaries is in your PATH
  • Clone this repo into your /extensions folder, or install it with the Install from URL option on the Extensions tab of the UI
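
As a quick way to confirm the first requirement, the sketch below checks whether an ffmpeg executable can be found on PATH. It is not part of the extension; the function names find_executable and describe_ffmpeg are hypothetical and only use Python's standard library.

```python
import shutil


def find_executable(name="ffmpeg"):
    """Return the full path of `name` if it is found on PATH, else None."""
    return shutil.which(name)


def describe_ffmpeg(name="ffmpeg"):
    """Build a short human-readable status message for the lookup."""
    path = find_executable(name)
    if path is None:
        return f"{name} not found on PATH; add the folder with the binaries to PATH"
    return f"{name} found at {path}"


if __name__ == "__main__":
    print(describe_ffmpeg())
```

If the message says ffmpeg was not found, add the folder containing the ffmpeg binaries to PATH and restart the Web UI so it picks up the change.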

Usage

Select the Riffusion Audio Generator script before generating, and use the Riffusion model.

You can also convert a whole folder of images to audio in the Riffusion tab.

Prompt Travelling

If you want to prompt travel in the latent space as described by the authors, install this extension:

https://github.com/Kahsolt/stable-diffusion-webui-prompt-travel

The prompt travel extension writes the results of each run to the <SD>/outputs/(txt|img)2img-images/prompt_travel/ directory. Point the convert folder to audio functionality in the Riffusion tab at that folder to generate a single stitched-together audio file alongside the individual ones.
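
To find the folders to feed into the Riffusion tab, a minimal sketch like the one below expands the path pattern above into its two concrete locations and lists the images found there. It assumes sd_root is the Web UI install directory (the <SD> placeholder) and that the frames are PNG or JPEG files; the helper names prompt_travel_dirs and list_travel_images are hypothetical and not part of either extension.

```python
from pathlib import Path


def prompt_travel_dirs(sd_root):
    """Expand <SD>/outputs/(txt|img)2img-images/prompt_travel/ into both paths."""
    root = Path(sd_root)
    return [
        root / "outputs" / f"{mode}2img-images" / "prompt_travel"
        for mode in ("txt", "img")
    ]


def list_travel_images(sd_root, suffixes=(".png", ".jpg", ".jpeg")):
    """Map each existing prompt_travel folder to its image files, sorted by path.

    The image suffixes are an assumption; folders that do not exist are skipped.
    """
    found = {}
    for folder in prompt_travel_dirs(sd_root):
        if folder.is_dir():
            found[folder] = sorted(
                p for p in folder.rglob("*") if p.suffix.lower() in suffixes
            )
    return found


if __name__ == "__main__":
    # "." is a placeholder; replace it with your own Web UI install directory.
    for folder, images in list_travel_images(".").items():
        print(f"{folder}: {len(images)} image(s)")
```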

Here is a sample made by travelling in img2img mode from the prompt "jamaican rap" to the prompt "deep house, techno" with denoise 0.5 for 14 steps, using the og_beat.png provided by the original authors as the base image:

Audio Sample (Jamaican Rap to Deep House, Techno)

Acknowledgements

Credit to the original Riffusion authors, Seth Forsgren and Hayk Martiros:

https://riffusion.com/about