Stable Diffusion Tools for Unreal Engine
Description
This plugin feeds your editor viewport through the Diffusers library to create AI synthesized imagery.
Join us on Discord.
Requirements
- Unreal Engine 5.1
- CUDA-compatible GPU (Nvidia)
- Minimum 6GB VRAM to run Unreal and half-precision floating point (fp16) Stable Diffusion simultaneously
- Git
Installation
Clone this repo and copy the StableDiffusionTools folder to YourProjectFolder/Plugins/, or download the latest release from the releases page.
Usage
Enabling the plugin
After installing the plugin, activate it through Windows->Plugins. To open the main tool window for the plugin, go to Windows->StableDiffusionTools.
Dependencies
The first time you use the plugin, you will need to install the required Python dependencies using the Update/Install dependencies button in the Dependencies section of the main plugin window, or through the Windows->Stable Diffusion Dependencies Installer window. You will need to restart the Unreal Editor the first time you install dependencies.
Generator backends
This plugin provides three generator backends out of the box: local generation using the Diffusers library, remote generation using the Stability.AI SDK and Dream Studio, and remote generation using Stable Horde. Choose a backend in Project Settings->Stable Diffusion Tools->Generator Type and set a token in the Generator Tokens map for your chosen generator.
Models
To generate images using this plugin, you will first need a model. A few model presets are provided with the plugin, and you can create your own by creating a StableDiffusionModelAsset, though it is recommended that you duplicate one of the provided models as a starting point.
To download a model from huggingface.co, you will need an account and an access token (https://huggingface.co/settings/tokens) with read permissions.
You can use any Diffusers-based model from the diffusers category on huggingface.co. To choose a model, enter it in the format Username/Modelname. To use any of the default models (RunwayML Stable Diffusion 1.5, Runway inpainting, or CompVis Stable Diffusion 1.4), you will need to visit the respective model card page for each of these models and accept its usage agreement before downloading the weights, otherwise you will receive an HTTP 401 error.
The revision property allows you to pick a specific branch of the model to download. I recommend the fp16 branch (if the model provides it) if you have a limited amount of VRAM (under 6GB). You can also choose the level of precision you want the model to use, where 16-bit floating point is the default.
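For reference, loading a model this way maps onto a Diffusers call roughly like the sketch below. This is an illustration rather than the plugin's actual code, and the model ID and token are placeholders for whatever you enter in the plugin UI (older Diffusers releases use use_auth_token, newer ones use token).

```python
import torch
from diffusers import StableDiffusionPipeline

# Load a model from huggingface.co, picking the fp16 branch and 16-bit weights
# to keep VRAM usage low. Replace the model ID and token with your own values.
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",  # Username/Modelname from the model asset
    revision="fp16",                   # branch chosen via the revision property
    torch_dtype=torch.float16,         # 16-bit floating point precision (the default)
    use_auth_token="hf_your_token",    # huggingface.co token with read permissions
)
pipe = pipe.to("cuda")
```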
Seamless textures
In the model section of the plugin window, if you set the Convolution padding model option to circular, then generated images will have borders that wrap around to the opposite side of the image. When combined with prompts such as a tiling texture of INSERT_SUBJECT_HERE, this will generate textures that are appropriate for mapping onto flat surfaces or landscapes.
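Under the hood, circular padding amounts to switching the padding mode of the pipeline's convolution layers. Here is a minimal sketch of the idea, assuming pipe is a loaded Diffusers pipeline like the one above; it approximates what the plugin option does rather than reproducing the plugin's own implementation.

```python
import torch

def make_seamless(pipe):
    """Switch every Conv2d in the UNet and VAE to circular padding so that
    generated images wrap around their borders and tile seamlessly."""
    for model in (pipe.unet, pipe.vae):
        for module in model.modules():
            if isinstance(module, torch.nn.Conv2d):
                module.padding_mode = "circular"

make_seamless(pipe)
image = pipe("a tiling texture of mossy cobblestones").images[0]
```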
Prompt based image generation
Use the generation section to tweak your prompt, iteration count, viewport influence strength and the seed. If you are new to prompt authoring then I recommend reading this guide.
After you click the Generate image button, the in-progress image will be displayed in the plugin window and generation progress will be logged in the editor's Output Log panel.
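As a rough mental model (not the plugin's actual code), the prompt, iteration count, viewport influence strength, and seed map onto Diffusers img2img parameters along these lines; viewport_capture stands in for the captured editor viewport.

```python
import torch
from diffusers import StableDiffusionImg2ImgPipeline

pipe = StableDiffusionImg2ImgPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",
    torch_dtype=torch.float16,
).to("cuda")

# viewport_capture is a PIL image of the editor viewport (placeholder here)
result = pipe(
    prompt="a watercolour painting of a medieval village",
    image=viewport_capture,
    num_inference_steps=30,   # iteration count
    strength=0.6,             # in Diffusers, higher values deviate further from the input image
    generator=torch.Generator("cuda").manual_seed(1234),  # seed
).images[0]
```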
Layers
Some model pipelines use different types of images as inputs to help guide the image generation process. To achieve this, the plugin uses LayerProcessors to capture the required information from Unreal's viewport or a SceneCapture2D actor and feed the layers into the pipeline. For example, any model based upon the StableDiffusionImg2Img pipeline will use a single FinalColourLayerProcessor that captures the viewport as an input image. Something more complex like the StableDiffusionControlNetPipeline may use multiple layer processors, such as a DepthLayerProcessor combined with a NormalLayerProcessor. Take a look at the provided depth and normal ControlNet model preset to see how it's put together.
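On the Diffusers side, combining depth and normal conditioning looks roughly like the following sketch. The ControlNet checkpoint IDs are examples rather than necessarily the ones used by the plugin's preset, and depth_image/normal_image stand in for the captures produced by the layer processors.

```python
import torch
from diffusers import StableDiffusionControlNetPipeline, ControlNetModel

# Example ControlNet checkpoints for depth and normal conditioning
controlnets = [
    ControlNetModel.from_pretrained("lllyasviel/sd-controlnet-depth", torch_dtype=torch.float16),
    ControlNetModel.from_pretrained("lllyasviel/sd-controlnet-normal", torch_dtype=torch.float16),
]

pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",
    controlnet=controlnets,
    torch_dtype=torch.float16,
).to("cuda")

# depth_image and normal_image are the conditioning captures (placeholders here)
result = pipe(
    prompt="a cosy log cabin interior",
    image=[depth_image, normal_image],
).images[0]
```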
Some layer processors are configurable in the plugin UI. If you navigate to the Layers section, each layer may expose configurable properties that affect the layer before it is passed to the pipeline. You can also preview a layer by clicking on the closed-eye icon.
Saving results
The Image outputs section contains options for saving a generated image to either a texture or an external file, as well as upsampling options. Enter the destination paths and the name of the image, then click the Save button for the asset type of your choice, or upsample the output 4x first using Real-ESRGAN.
Sequencer and exporting animations
The plugin adds two new track types to the Sequencer, which can be added using the +Track button: a Stable Diffusion Options track and Stable Diffusion Prompt tracks.
To create an animation, create a single Options track with a section that spans the length of your animation. You can right-click the section and go to its properties to modify parameters that stay consistent over the course of the animation, such as the model options and output frame size.
The Options track has three animatable channels: Iterations, Seed, and Strength. All of these can be keyframed across the length of your animation.
To add prompts, you can create multiple prompt tracks containing multiple prompt sections. To set the prompt text, right-click on a prompt section, go to properties, and set the prompt text as well as the sentiment. A positive sentiment will tell Stable Diffusion that this prompt is something we would like to see in our generated output, whilst negative sentiments will instruct it to exclude unwanted prompts from the output.
Prompt tracks have a single animatable parameter, Weight. Increasing or decreasing this value will increase or decrease the influence of the prompt in your animation.
To render your animation, click the Render movie button to open the movie pipeline panel for your sequence. Add a SDMoviePipelineConfig config asset to instruct the pipeline to export a Stable Diffusion image sequence. By default, rendered frames will show up in YourProjectFolder/Saved/MovieRenders.
Note: Currently the Stable Diffusion Movie Pipeline will only export floating point images, so I recommend sticking to OpenEXR as the output file type.
Click Render (local) to render your Stable Diffusion animation. If you haven't already initialised a model, the pipeline will use the model options set in your Options track to initialise one at the start of the render, which may cause a white screen.
Inpainting
Inpainting lets you fill only a masked portion of your input image whilst keeping areas outside the mask consistent. To use it, load the Runway1-5_Inpaint model preset, or choose any other model and make sure that Enable inpainting is set in the model options. To create a mask, add actors to an actor layer and then set the Inpaint layer field in the UI to the actor layer you want to use as a mask for inpainting. If using the Sequencer, you will need to set the inpaint layer in the properties of an Options track, enable Render CustomDepth Pass, and set CustomDepth Stencil Value to 1 for each actor on the layer.
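On the Diffusers side, the masked generation step corresponds roughly to the call below; init_image and mask_image stand in for the viewport capture and the stencil mask produced by the inpaint layer, and the model ID shown is Runway's inpainting checkpoint on huggingface.co.

```python
import torch
from diffusers import StableDiffusionInpaintPipeline

pipe = StableDiffusionInpaintPipeline.from_pretrained(
    "runwayml/stable-diffusion-inpainting",
    torch_dtype=torch.float16,
).to("cuda")

# init_image: the captured viewport; mask_image: white where the masked
# actors are, black elsewhere (both are placeholders here)
result = pipe(
    prompt="an ornate marble statue",
    image=init_image,
    mask_image=mask_image,
).images[0]
```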
IMPORTANT: When generating images from the UI panel, the input image will be too dark. This is due to the plugin now using a SceneCapture2D actor to capture the viewport to enable inpainting stencil support.
To fix this, change the following options on a global post-processing volume:
- Metering mode = Manual
- Exposure Compensation = 15
FAQ
- When I open the plugin, I can't initialize a model.
  - Have you installed the Python dependencies? Go to Windows->Stable Diffusion Tools Dependencies Installer and make sure you've installed everything and restarted the editor at least once. Also make sure that you have Git installed.
  - Double check that you have entered a token for huggingface.co in the plugin settings.
Known issues
- Texture memory usage is high and needs to be fixed.
- Panning the generated image only works when grabbing it from the upper right quarter.
- You will have to restart the editor after installing dependencies.
Roadmap
- Serialisable fields for the plugin UI.
- Re-use the last generated image in the viewport for img2img and instructpix2pix.
- Previz support for the sequencer using frame holds.
- Generate multiple image iterations in the viewport.