No longer supported
This model is legacy (released November 2022) and is no longer actively maintained or recommended by Stability AI. Consider using their current flagship model instead.
by Stability AI· Released November 2022· Cutoff 2022
Stable Diffusion 2.0 is a text-to-image diffusion model that generates high-resolution images from textual descriptions. It introduces a new text encoder (OpenCLIP) and supports image-to-image, inpainting, and depth-guided generation. This model marked a significant improvement over the original Stable Diffusion in terms of image quality and compositional understanding.
Input cost
Free (open source)
Output cost
Free (open source)
Context window
—
Max output
—
Modalities
Parameters
1.4B (UNet) + 1.2B (VAE) + 354M (text encoder)
License
CreativeML Open RAIL-M
High-quality image generation from text prompts with improved compositional accuracy.