StableSwarmUI

StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.

MIT License

Stars
2.6K

StableSwarmUI - 0.6.2-Beta Latest Release

Published by mcmonkey4eva 5 months ago

Well! Hi there, it's been a minute. Welcome to 0.6.2 Beta -- this one got a rather long time before I finalized the release, as I wanted the "Simple" tab ready to go first. So as a side effect... this update has more new features than any other release thus far, and more bugfixes/QoL improvements/etc than I can even be bothered to try to count.

Major Updates

Minor Updates

  • made a "Sampling" group to hold the Sampler & Scheduler, finally
  • added hacky placeholder "ControlNet Two" and "Three" groups for if multiple controlnets are needed
  • a variety of minor visual aesthetic tune-ups while working on the "Modern" themes
  • the negative prompt box is now by default in the center under the regular prompt box
  • added an 'auto swap to images' checkbox (gear icon at top-right, where the 'auto swap to previews' checkbox is)
  • added support for CosXL
  • added support for CosXL Edit (ip2p) - just use the model as normal, and give an Init Image. then write a prompt like "give him a mustache" to instruct the edit model on what to change
  • wrote a doc guide about how to make extensions for swarm: https://github.com/Stability-AI/StableSwarmUI/blob/master/docs/Making%20Extensions.md
  • added "always refresh on load" setting, for performance benefit reasons. Disable it if your swarm instance is lagging from loading too many models or much data out of comfy or similar.
  • added "backend data cache" setting, also for performance. Enable it if your comfy instance takes a long time to load the UI, or you get timeouts from object_info loads.
  • when you edit model paths in server settings, it now (finally!) reloads the model handler immediately to apply your changes without restarting the program
  • cleaned up model name displays so it's not spamming ".safetensors" on the end of every single one
  • added automatic log save-to-file as an option under Server Configuration
  • pushed more work towards making a functional multi-user support system
  • added a ModelSpecificEnhancements under Swarm Internal that you can uncheck to disable SDXL rescond smartness and possibly other enhancements like it in the future
  • added advanced selection for Mask Behavior inside init image, defaulting to Differential now with the ability to swap back to Simple Latent. This makes masked image gen (including eg <segment: usage) a bit cleaner/better
  • added align_your_steps scheduler, optimal for 10 steps but can do less, useful especially for coherent SVD outputs without taking a year to process
  • added Mask Shrink Grow parameter, for when using image editor / masks, to crop the image to just the mask, then grow the mask by a few pixels, and generate based on that. This is useful for refining small details.
  • model descriptions can now use (heavily restricted whitelist-only) html tags to add a bit of formatting
  • added a check-for-updates when swarm is starting. No telemetry, no downloading anything, just grabs a json off github to compare version number with. If you don't want this, you can disable it in server settings.
  • your selected tab is now in the URL, so you can bookmark specific tabs to get back to them with fewer clicks
  • number inputs now let you scrollwheel to quickchange (firefox-like behavior on all browsers), or click-and-drag
  • the image full view (when you doubleclick an image) can now be zoomed with mousewheel and panned with click-n-drag, to let you investigate details easily
  • published initial API docs https://github.com/Stability-AI/StableSwarmUI/blob/master/docs/API.md (note some routes don't have descriptions written yet, and OpenAPI stuff is planned)
  • as usual, many small improvements, bug fixes, better error checking, etc.
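
The Mask Shrink Grow idea above (crop to the mask, then grow by a few pixels) can be pictured as a bounding-box calculation. This is a minimal sketch under my own assumptions, not Swarm's actual implementation: the mask is a plain list of rows of 0/1 values, and `mask_bounds` and `grow_box` are hypothetical helper names.

```python
from typing import List, Optional, Tuple

# (left, top, right, bottom), with right/bottom exclusive
Box = Tuple[int, int, int, int]


def mask_bounds(mask: List[List[int]]) -> Optional[Box]:
    """Return the tight bounding box of all non-zero mask pixels, or None if the mask is empty."""
    rows = [y for y, row in enumerate(mask) if any(row)]
    if not rows:
        return None
    cols = [x for row in mask for x, value in enumerate(row) if value]
    return (min(cols), rows[0], max(cols) + 1, rows[-1] + 1)


def grow_box(box: Box, grow: int, width: int, height: int) -> Box:
    """Expand the box by `grow` pixels on every side, clamped to the image edges."""
    left, top, right, bottom = box
    return (
        max(0, left - grow),
        max(0, top - grow),
        min(width, right + grow),
        min(height, bottom + grow),
    )
```

Generation would then run only on the grown region, which is why the option helps when refining small details: the model spends its full resolution on a small patch instead of the whole image.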

StableSwarmUI - 0.6.1-Beta

Published by mcmonkey4eva 7 months ago

Beta!

Swarm is now in Beta status! After months of "Alpha" status, I'm ready to say Swarm is one of the best Stable Diffusion UIs out there, and for many use cases simply the best.

  • Brand new to Stable Diffusion? Swarm is crazy easy to install and set up and will literally guide you through how to get started in-UI.
  • Inexperienced? There's a ? on every param to tell you how each feature works, and most features you might care about are available out of the box. There's tab completions for prompt syntax. Everything is ready to help you.
  • Pro and want a friendly UI? Swarm's main Generate tab is one of the friendliest out there.
  • Pro and want to maximize your comfy noodle hackery? Swarm wraps comfy in a tab and provides power features such as a workflow browser, the ability to generate reference workflows for you, the ability to reformat your favorite workflow into a friendly interface, large scale gridding, multi-GPU support, ...
  • Large scale commercial user? Swarm provides a great frontend for you to start building with and debug with, and a powerful backend + splitting engine + etc., and a friendly license.

Major Updates

Minor Updates

  • video by default now has smarter parameters that result in better quality videos with SVD
  • added Refiner Steps param
  • refiner save-before-refining is now a general "Save Intermediate Images" that can also save before segmentation
  • added automatic recognition for "Matrix" app formatted model metadata files
  • gridgenerator now has convenient autoreplacables in web-page output folder name
  • experimental AMD GPU auto-support in installer. Doesn't work great.
  • added "Video Boomerang" advanced option checkbox
  • added video resolution parameter
  • added native support for frame interpolation from https://github.com/Fannovel16/ComfyUI-Frame-Interpolation in video params
  • added user setting to exclude params from the "Reuse Parameters" button
  • added swarm internal 'image format' parameter override for convenience especially with api usage
  • added a close button to the image editor (most requested feature of 2024 lol)
  • the image editor has a working selector tool now, which also can be used to automatically create a regional prompt with a simple button press
  • added support for GLIGEN - only for SDv1 models, under Advanced parameters -> Regional Prompting, you can select a model as long as you have one installed per https://comfyanonymous.github.io/ComfyUI_examples/gligen/
  • added image history sort-by (Name vs Date and a Reverse option)
  • new parameter: ShiftedLatentAverageInit under advanced sampling - empty latents are normally initialized as zeroes, but that's not the correct average value for VAEs. Check this box to use actual channel-average values (based on FFHQ dataset as calculated by Birchlabs https://huggingface.co/datasets/Birchlabs/sdxl-latents-ffhq)
  • added <break> syntax to separate conditioning manually to multiple parts
  • improved the install and launch scripts for all platforms (no longer rebuilds the C# code every launch, now only when you update or launch with the dev mode powershell script)
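
To illustrate what ShiftedLatentAverageInit changes, here is a rough sketch of the difference between a zero-filled empty latent and one filled with per-channel averages. It uses plain nested lists rather than tensors, the function names are hypothetical, and the numbers in `EXAMPLE_MEANS` are placeholders for illustration only, not the real FFHQ-derived averages.

```python
from typing import List, Sequence

# Indexed as [channel][y][x]
Latent = List[List[List[float]]]


def empty_latent(channels: int, height: int, width: int) -> Latent:
    """The usual starting point: every value in every channel is zero."""
    return [[[0.0] * width for _ in range(height)] for _ in range(channels)]


def shifted_latent(channel_means: Sequence[float], height: int, width: int) -> Latent:
    """Fill each channel with its own average value instead of zero."""
    return [[[mean] * width for _ in range(height)] for mean in channel_means]


# Placeholder values, NOT the real dataset averages.
EXAMPLE_MEANS = [0.5, -0.25, 0.125, 0.0]
```

The only difference is the constant each channel starts from; the shape of the latent and everything downstream stays the same.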

--

ps yes of course when SD3 comes out Swarm will support it right away.

StableSwarmUI - 0.6.0-Alpha

Published by mcmonkey4eva 9 months ago

Plans

Swarm is gearing up for a transition out of Alpha status into Beta. Final work to do before it's ready for "Beta" includes documentation, translations, and finishing up the initial image editor tool. The Beta will most likely also be relicensed as the fully FOSS MIT License.

Swarm is already at a point where it's one of the best possible UIs you can have for Stable Diffusion, and there's very few reasons to not use Swarm. Reaching Beta will mark the point where the reasons to use anything other than Swarm are almost nil. I'm incredibly proud of the progress this project has made over the past months in Alpha.

Major Updates

  • A lot of Grid Generator enhancements - https://github.com/Stability-AI/StableSwarmUI/discussions/11#discussioncomment-7862586
    • The GridGenerator tool can now generate "Just Images" or an "Image Grid" as an alternative to the webpage viewer thingy
    • The GridGenerator tool can now save/load configuration state
  • Added Self-Attention-Guidance parameter options, under Advanced Sampling
  • The 'Star' button on images now actually does something (saves a copy to the "Starred" folder). There's a usersetting for whether it'll use subfolders or not. Also highlights the starred images in the UI.
  • DotNET 8 was released recently, and swarm will update to it in the near future. I recommend you install DotNET 8 in advance to make sure the update is painless when it comes https://dotnet.microsoft.com/en-us/download/dotnet/8.0
  • made the installer a lot nicer (both the install UI experience, and a proper Linux install script) - https://github.com/Stability-AI/StableSwarmUI/discussions/11#discussioncomment-7936481
  • new parameter under Init Image group: Unsampler Prompt, use to unsample your input image before resampling. This replaces the random noise with noise based on the input image + a prompt, allowing for more tightly controlled image editing
  • added a parameter filter at the top to more easily find obscure params

Minor Updates

  • added a user setting to allow creating grids of images when you generate a bunch at once (eg a 2x2 grid when you generate 4 at once) (default disabled)
  • added a user setting for image history length (default max 1000 images)
  • image history now caches small preview thumbnails by default (you can disable this in user settings)
  • added an advanced Swarm Internal param, No Seed Increment, for if (for example) you want to muck with wildcards and not have the seed change, or whatever
  • Grid Generator now has on-page docs and a link to the readme about it
  • Grid Generator now highlights invalid values
  • Grid Generator now supports skipping parameters with a SKIP: prefix
  • Image Edit Batcher now has image resolution controls
  • dropdowns now have fancy search text input
  • backends list now has "Restart" and "View Logs" buttons on each backend for convenience
  • Added No Previews advanced param under Swarm Internal. Implicitly enabled by default for grid generations. May make some generations slightly faster.
  • Added Refiner Save Before Refine advanced param (under Refiner)
  • video generation now has an option for different preview types, including live previews of the animated video while it generates
  • there's now memory freeing controls - API route FreeBackendMemory, buttons on Server Info tab, and server settings for autoclear after specified time
  • there's now a resource usage monitor on the server info tab
  • as always, a bunch of other fixes n small things
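
For the batch-grid user setting above (eg a 2x2 grid when you generate 4 at once), one plausible way to choose the layout is a near-square grid. This is only a sketch of that idea; `grid_shape` is a hypothetical name and Swarm may pick its layout differently.

```python
import math
from typing import Tuple


def grid_shape(image_count: int) -> Tuple[int, int]:
    """Pick (columns, rows) for a near-square grid that fits every image."""
    if image_count < 1:
        raise ValueError("need at least one image")
    # Integer-only ceil(sqrt(n)), which avoids floating point rounding.
    columns = math.isqrt(image_count - 1) + 1
    rows = math.ceil(image_count / columns)
    return columns, rows
```

With this rule, 4 images give a 2x2 grid as in the example, and counts that are not perfect squares leave the last row partly empty.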

StableSwarmUI - 0.5.9-Alpha

Published by mcmonkey4eva 10 months ago

Major Updates

Minor Updates

  • added restart all backends quick button
  • added support for Cubiq's IPAdapter repo
  • new parameter: RemoveBackground under advanced sampling, to just quickly remove the background from your images
  • new prompt tag option: <clear:x> works like the <segment: node, but removes the content from the image - just like RemoveBackground but targetable with ClipSeg.
  • new parameters: ControlNet Start and End, to control which steps a controlnet applies to (only visible if Advanced params is checked)
  • TI Embeddings now have a metadata key that lets you mark them as Negative, so that when you click them they will be automatically added to the negative prompt rather than the positive
  • added a user-setting to select the default VAE for SDXL and for SDv1 separately. eg for SDXL you might want the fp16-fix model.
  • added Refiner HyperTile param (a technique that makes sampling a bit faster on large images)
  • When you right click Generate and click Generate Previews, it will now try to use a preset named Preview, and can even help you generate one if you have an sdxl_lcm_lora available already
  • the "Generate" button in the center area now has a more-obvious clickable menu button for additional options
  • there's now a "Show Prompt Tokenization" in that generate-options button to quickly view the clip tokenization details for it (ie via the utilities tab)
  • oh and as usual a bunch of random small fixes and wotnot

StableSwarmUI - 0.5.8-Alpha

Published by mcmonkey4eva 11 months ago

Major Updates

  • added support for animated GIF outputs, from eg AnimateDiff or ComfyVideoHelperSuite nodes. Swarm now recognizes and handles these properly throughout, including the grid generator and everything.
    • Video files (mp4/webm) are now partially supported as well.
  • Added Docker support, see readme for info
  • Added a bulk pickle-to-safetensors conversion utility, see here for detail
  • added support for remote Swarm backends having their own model lists recognized by the local Swarm instance, including automatically filtering requests to the correct backend that has the desired models, and support for model/lora/embedding/controlnet all being recognized
  • you can now drag an image into the image area and it will be loaded in and metadata scanned - this includes images created by swarm, or even images created by other tools (eg auto webui or fooocus)
  • ComfyUI Workflow Editor tab can now use multiple backends simultaneously! At the top left under MultiGPU selector choose All, then create a workflow where the output nodes (PreviewImage/SaveImage) have different colors (use 1 unique color per backend). Each output's sub-graph will be split to a different comfy backend - see video here https://github.com/Stability-AI/StableSwarmUI/discussions/11#discussioncomment-7555872
  • added wildcards system - for details see https://github.com/Stability-AI/StableSwarmUI/discussions/11#discussioncomment-7568190

Minor Updates

  • when using a custom Comfy workflow in the main generate tab, you can add a new SwarmLoraLoader node to let the main UI lora selector work, see here for details
  • added VAE Tile Size param under Advanced Sampling for if you need VAE tiling
  • added a mass model metadata reset utility
  • added automatic recognition of Embed model architectures
  • grid gen axis tab completion now offers an 'Add All' to quick-add a range of matches
  • grid gen now automatically reorders axes internally to maximize speed (eg changing model lastmost since model changes are slow) (and has a checkbox to turn this off if you don't want it to)
  • added used_embeddings to metadata when you use TI embeddings
  • added a button to manually clear the batch view (under the gear icon, at the top-left of the batch view area). Also the autoclear toggle now defaults to off.
  • there's now a button in the Models listing (under the hamburger menu) to Set As Refiner quickly
  • there's now a user setting to change the hint popovers (the ? clickies) between buttons, hoverables, or none (hidden)
  • image metadata is now saved in a format that's recognized by other tools (PNG Text Values for png, or UserComment EXIF for jpg)
  • added CLIP Stop At Layer under Advanced Sampling for if you want that for old SDv1 NAI style models
  • added Personal Note parameter, under advanced->swarm internal, if you want to handwrite a note into the metadata
  • added Open In Folder button on the hamburger menu for images
  • as always, a lot of assorted super minor things not worth listing
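
The axis reordering mentioned above (changing the model least often, since model changes are slow) can be sketched as sorting axes by how expensive they are to change and looping over the expensive ones outermost. The cost table and function names below are assumptions made up for illustration; the real grid generator's logic is not shown in these notes.

```python
from itertools import product
from typing import Dict, Iterator, List

# Hypothetical relative costs of changing each parameter between generations.
CHANGE_COST = {"model": 100, "steps": 1, "cfg_scale": 1}


def order_axes(axes: Dict[str, List[str]]) -> List[str]:
    """Put the most expensive-to-change axis first, so it becomes the outermost loop."""
    return sorted(axes, key=lambda name: CHANGE_COST.get(name, 1), reverse=True)


def iterate_grid(axes: Dict[str, List[str]]) -> Iterator[Dict[str, str]]:
    """Yield every combination; the first ordered axis changes least often."""
    names = order_axes(axes)
    for combo in product(*(axes[name] for name in names)):
        yield dict(zip(names, combo))


def count_changes(jobs: List[Dict[str, str]], name: str) -> int:
    """Count how many times `name` switches value between consecutive jobs."""
    return sum(1 for a, b in zip(jobs, jobs[1:]) if a[name] != b[name])
```

For a grid of 2 models by 3 step counts, this ordering switches model only once across the 6 jobs, where looping over models innermost would switch on almost every job.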

StableSwarmUI - 0.5.7-Alpha

Published by mcmonkey4eva almost 1 year ago

Major Updates

  • added segmentation-detailing - see https://github.com/Stability-AI/StableSwarmUI/discussions/11#discussioncomment-7236821 for details, but in short: you can do <segment:face> a beautiful face on the end of a prompt to trigger automatic inpainting of segments matched by text
  • ReVision can now use IP-Adapter if you have that node in your comfy
  • added "Seamless Tileable" checkbox under "advanced sampling" so you can make textures n stuff
  • clicking embeddings now actually works and puts it in the prompt and all

Minor Updates

  • large amounts of internal stuff n wotnot as always, including some API improvements and load time speedups
  • added MaskImage parameter - works as a jank way to inpaint for now, or as API inpainting, will be used to build UI around later
  • added new <object:...> syntax to complement <region:...> except object emits additional generation calls in the middle to more aggressively form the wanted image
  • image metadata now shifts to the side when doing so allows the image to be bigger
  • auto-clear-batch is now under a lil Gear icon
  • there's now a toggler to toggle whether to instant-swap to previews or not
  • added advanced parameters to modify sigma min/max and rho
  • added FreeU Apply To param to control whether freeU applies to base, refiner, or both
  • added ReVision Zero Prompt param to force ReVision to zero the prompts given to the base (and leave unzeroed for refiner)

StableSwarmUI - 0.5.6-Alpha

Published by mcmonkey4eva about 1 year ago

Major Updates

  • added prompt syntax - <embedding:filename>, <lora:filename:1>, <random:a, b, c>, <preset:presetname> - see https://github.com/Stability-AI/StableSwarmUI/discussions/11#discussioncomment-7057501 for detail
  • Variation Seed is now available for comfyui backend
  • added initial experimental regional prompting support (ComfyUI only), works with an initial dirty/simple syntax in the prompt like a photo of a catdog <region:0,0,1,0.5> a photo of a cat <region:0,0.5,1,0.5> a photo of a dog (proper UI for it and more advanced tooling TBA)
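
As a rough illustration of the tag syntax listed above, the sketch below pulls `<name:argument>` tags out of a prompt with a regular expression. It is not Swarm's parser. The helper names are hypothetical, and two details are my own assumptions: that a lora weight defaults to 1 when omitted, and that the four region numbers read as x, y, width, height (which fits the two half-height regions in the example).

```python
import re
from typing import List, Tuple

# Matches tags of the form <name:argument>, with no nested angle brackets.
TAG_PATTERN = re.compile(r"<(\w+):([^<>]*)>")


def find_tags(prompt: str) -> List[Tuple[str, str]]:
    """Return (tag_name, raw_argument_text) pairs in the order they appear."""
    return [(m.group(1), m.group(2)) for m in TAG_PATTERN.finditer(prompt)]


def parse_lora(argument: str) -> Tuple[str, float]:
    """Split 'filename:weight' into its parts (assumed default weight: 1)."""
    name, _, weight = argument.rpartition(":")
    if not name:  # no colon present, so the whole argument is the filename
        return argument, 1.0
    return name, float(weight)


def parse_region(argument: str) -> Tuple[float, float, float, float]:
    """Read the four comma-separated numbers of a region tag (assumed x, y, width, height)."""
    x, y, width, height = (float(part) for part in argument.split(","))
    return x, y, width, height
```

Run on the catdog example, `find_tags` returns the two region tags in order, and `parse_region` turns each argument into four floats.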

Minor Updates

  • the usual pile of bugfixes and minor QOL improvements and etc
  • made bulk generations fasterer through rapid prequeueing
  • adjusted default image save path and reversed the image history sort, so that together your most recent images end up at the top (the settings change won't self-update a pre-existing server settings file - 'reset to default' UI is TBA, but for now you can check The Settings source to reference default values)
  • there's now a button to preview ControlNet preprocessor outputs, in the controlnet param group
  • you can now use Control+Up/Down arrow when you've selected prompt text to adjust prompt weighting
  • added DynamicThresholding support for self-start ComfyUI, or any ComfyUI-API-By-URL that has the DynThresh node
  • Added custom comfy nodes (automatically enabled for self-start backends, you'll have to transfer manually for API URL backends - see ComfyUI Docs to do that) that make image loading & saving much cleaner and faster
  • Added "FreeU" support (https://arxiv.org/abs/2309.11497, Comfy backend only) as an option under "Advanced"
  • The Comfy Workflow Editor tab now has a QuickLoad dropdown so you can more quickly load workflows you use often
  • Refiner now has StepSwapNoisy method if you want to use the official-sample-workflow's way of handling noise, which is a bit better for standard refiner (SDXL Base + SDXL Refiner, No upscale) but much worse for anything else.
  • The default Refiner method has been changed to PostApply, as that always works, whereas StepSwap only works for some cases. (You can always change your personal defaults at will by just creating a preset and naming it Default!)
  • added the rescond tweaks used by Fooocus to get slightly better image quality
  • Previews of large-batches can now load concurrently on a comfyui backend, if you use the advanced "batch size" param. This might negatively impact performance when using TAESD.
  • Added a checkbox to use AITemplate in comfy if you have that extension - gens 2x faster, but has compatibility issues

StableSwarmUI - 0.5.5-Alpha

Published by mcmonkey4eva about 1 year ago

Major Updates

  • added an "Image Edit Batch" tool next to the grid gen, for if you want to batch edits to a folderload of images
  • added a "Utilities" tab at the top that has a CLIP Tokenizer for if you want to see what CLIP's doing with your text
  • Swarm can use remote instances of Swarm as its backend (ie it's now easier to set up Swarm across LAN/network)
  • LoRA/etc. browsers now recognize if they're the correct model type (SDv1 vs SDXL) - if they don't match, they fade out and get sorted to the bottom, so SDv1 loras aren't in the way while you're generating SDXL images (but they're still there with a fade and a message so you don't get confused where they went)
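
The LoRA browser behavior above (mismatched model types fade out and sort to the bottom) amounts to a stable sort on a compatibility flag. A minimal sketch, assuming a hypothetical `ModelCard` record with an `architecture` field; the real browser's data model is not described here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ModelCard:
    name: str
    architecture: str  # e.g. "SDv1" or "SDXL"


def is_faded(card: ModelCard, current_architecture: str) -> bool:
    """Mismatched cards stay visible but are drawn faded."""
    return card.architecture != current_architecture


def sort_for_display(cards: List[ModelCard], current_architecture: str) -> List[ModelCard]:
    """Matching cards first, mismatched ones last; original order kept within each group."""
    # False sorts before True, and sorted() is stable, so relative order is preserved.
    return sorted(cards, key=lambda card: is_faded(card, current_architecture))
```

Because the sort is stable, whatever ordering the browser already had (name, date, and so on) is preserved inside both the matching and the faded group.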

Minor Updates

  • Colab banned webuis :(
  • a bunch of small bug fixes and QOL as always. Like a lot. I can't be bothered listing them all, but you can look at the git commits to see em if you want. So many.
  • image metadata now holds the Swarm version and current date (alongside params used and gen time)
  • when generating images with a comfy backend there are now lil progress bars on the side. A green one shows the main generation (KSampler) progress, and a subtler purple one shows the overall comfy progress (ie how far along the node graph it is)
  • you can now customize the internal batch size if you want, as a parameter under advanced - see notes on https://github.com/Stability-AI/StableSwarmUI/issues/84 about that. Most users prefer the "Images" count at the top of core params
  • added a toggle for auto-clearing batches
  • custom comfy workflows now show Primitive inputs in the main param section, and auto-generated other options under "Advanced", to be less cluttered of an interface
  • added an option for comfy api by url to go idle, meaning its connection can go on and off automatically based on whether the remote server is available. Very useful for external servers that you might only sometimes have running
  • there's now a CLIP Token counter on prompt boxes so you can see when you're hitting the 75 token barrier
  • added a lil guide doc about how to use swarm over LAN or internet https://github.com/Stability-AI/StableSwarmUI/blob/master/docs/Advanced%20Usage.md#accessing-stableswarmui-from-other-devices
  • improved rendering of model cards
  • made it way more obvious how to close the image fullscreen popup view

StableSwarmUI - 0.5.4-Alpha

Published by mcmonkey4eva about 1 year ago

0.5.4 Alpha

I haven't written release notes thus far because there's just so much but I'm going to try for this one.

The major updates here and other features are announced as they happen in the Feature Announcement Thread, I highly recommend following that thread!

Major Updates

This is the first release containing the in-progress UI rework. (There's more to come on that still). Things look differenty now! Hopefully you like it better!

This also contains ReVision support. When using SDXL, drag images into the prompt box to have them parsed as part of the prompt. It's pretty neat!

Minor Updates

This release contains a significant number of minor UI and QOL improvements, and a few fun new features:

  • You can now set custom titles on backends
  • You can now toggle backends on/off (suggested by AIMusicExperiment)
  • presets work a bit more cleanly on the inside now, and can do fancier things like have loras in them
  • you can now edit all your parameter settings (including you can make the old prompt param visible again this way if you want)
  • errors are now tracked better all over the place
  • fixes to edge-cases where comfy did not launch properly for a couple users. I don't want to talk about how annoying this bug was to track down. It was an adventure. I cried a few times. But we did it. Thank you WyzBorrero for helping me test way more than any sane person should.
  • the image batch view now puts new images at the top, not the bottom, so you always got the latest visible
  • added "alt resolution multiplier" to make it easy to do things like grids of different sized images or whatever if you want to be weird with res
  • the Grid Generator now has tab completion suggestions
  • a whole lot of improvements to the browser UIs (images/presets/models/etc), including you can select new formats like "Small Cards" or "Giant Thumbnails" or etc now
  • made the colab more reliable. Thank you dotcsv, rocketpal, nyaneenee, and yoinked for reports and testing related to colab issues.
  • model folders now self-generate as a bunch of people were confused where to put loras and all
  • you can now use arrow keys to quickly jump between different images in the current generation batch, including both on the generate page and when you click the image to fullscreen view it (suggested by pookyjuice)
  • improved persistence (eg of loras) across page reloads
  • added a replacement for the icon, it's now an SDXL-generated purple robot bee. It's kinda cute. I need to ask an actual graphic designer to replace it properly lol
  • You can now right-click generate to "Generate Forever", and right-click interrupt to "Interrupt All Sessions" (ie kill old queues that are left from tabs you already closed)
  • changed timeout tracking to allow you to queue a thing and let it complete days later if you really want giant terrifying queues to run.
  • changed default param limits to let you put them ridiculously high if you want. Queue ten thousand images in one go if you want, who am I to stop you?
  • custom comfy workflows were made extra smooth

That. is not. A complete list. That's just the highlights. There's a lot more. Do you see why I normally don't write release notes? There's a lot.

StableSwarmUI - 0.5.3-Alpha

Published by mcmonkey4eva about 1 year ago

StableSwarmUI - 0.5.2-Alpha

Published by mcmonkey4eva about 1 year ago

StableSwarmUI -

Published by mcmonkey4eva about 1 year ago

StableSwarmUI - 0.5-Alpha

Published by mcmonkey4eva about 1 year ago

Initial public Alpha of StableSwarmUI.