• Stars
    star
    129
  • Rank 279,262 (Top 6 %)
  • Language
    Go
  • License
    MIT License
  • Created almost 2 years ago
  • Updated 10 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A Discord bot, written in Go, that interfaces with the Automatic 1111's API interface.

Stable Diffusion Discord Bot

This is a Discord bot that interfaces with the Automatic1111 API, from this project: https://github.com/AUTOMATIC1111/stable-diffusion-webui

Video showing off the current features: https://www.youtube.com/watch?v=of5MBh3ueMk

Installation

  1. Download the appropriate version for your system from the releases page: https://github.com/AndBobsYourUncle/stable-diffusion-discord-bot/releases
    1. Windows users will need to use the windows-amd64 version
    2. Intel Macs will need to use the darwin-amd64 version
    3. M1 Macs will need to use the darwin-arm64 version
    4. Devices like a Raspberry Pi will need to use the linux-arm64 version
    5. Most other Linux devices will need to use the linux-amd64 version
  2. Extract the archive folder to a location of your choice

Building (optional, only if you want to build from source)

  1. Clone this repository
  2. Install Go
    • This varies with your operating system, but the easiest way is to use the official installer: https://golang.org/dl/
  3. Build the bot with go build

Usage

  1. Create a Discord bot and get the token
  2. Add the Discord bot to your Discord server. It needs permissions to post messages, use slash commands, mentioning anyone, and uploading files.
  3. Ensure that the Automatic 1111 webui is running with --api (and also --listen if it is running on a different computer than the bot).
  4. Run the bot with ./stable_diffusion_bot -token <token> -guild <guild ID> -host <webui host, e.g. http://127.0.0.1:7860>
    • It's important that the -host parameter matches the IP address where the A1111 is running. If the bot is on the same computer, 127.0.0.1 will work.
    • There needs to be no trailing slash after the port number (which is 7860 in this example). So, instead of http://127.0.0.1:7860/, it should be http://127.0.0.1:7860.
  5. The first run will generate a new SQLite DB file in the current working directory.

The -imagine <new command name> flag can be used to have the bot use a different command when running, so that it doesn't collide with a Midjourney bot running on the same Discord server.

Commands

/imagine_settings

Responds with a message that has buttons to allow updating of the default settings for the /imagine command.

By default, the size is 512x512. However, if you are running the Stable Diffusion 2.0 768 model, you might want to change this to 768x768.

Choosing an option will cause the bot to update the setting, and edit the message in place, allowing further edits.

Screenshot 2023-01-06 at 10 41 36 AM

/imagine

Creates an image from a text prompt. (e.g. /imagine cute kitten riding a skateboard)

Available options:

  • Aspect Ratio
    • --ar <width>:<height> (e.g. /imagine cute kitten riding a skateboard --ar 16:9)
    • Uses the default width or height, and calculates the final value for the other based on the aspect ratio. It then rounds that value up to the nearest multiple of 8, to match the expectations of the underlying neural model and SD API.
    • Under the hood, it will use the "Hires fix" option in the API, which will generate an image with the bot's default width/height, and then resize it to the desired aspect ratio.

How it Works

The bot implements a FIFO queue (first in, first out). When a user issues the /imagine command (or uses an interaction button), they are added to the end of the queue.

The bot then checks the queue every second. If the queue is not empty, and there is nothing currently being processed, it will send the top interaction to the Automatic1111 WebUI API, and then remove it from the queue.

After the Automatic1111 has finished processing the interaction, the bot will then update the reply message with the finished result.

Buttons are added to the Discord response message for interactions like re-roll, variations, and up-scaling.

All image generations are saved into a local SQLite database, so that the parameters of the image can be retrieved later for variations or up-scaling.

Screenshot 2022-12-22 at 4 25 03 PM

Screenshot 2022-12-22 at 4 25 18 PM

Options like aspect ratio are extracted and sanitized from the text prompt, and then the resulting options are stored in the database record for the image generation (for further variations or upscaling):

Screenshot 2022-12-28 at 4 30 43 PM

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

There are lots more features that could be added to this bot, such as:

  • Moving defaults to the database
  • Per-user defaults/settings, as well as enforcing limits on a user's usage of the bot
  • Ability to easily re-roll an image
  • Generating multiple images at once
  • Ability to upscale the resulting images
  • Ability to generate variations on a grid image
  • Ability to tweak more settings when issuing the /imagine command (like aspect ratio)
  • Image to image processing

I'll probably be adding a few of these over time, but any contributions are also welcome.

Why Go?

I like Go a lot better than Python, and for me it's a lot easier to maintain dependencies with Go modules versus running a bunch of different Anaconda environments.

It's also able to be cross-compiled to a wide range of platforms, which is nice.