No longer supported

This model is legacy (released December 2022) and is no longer actively maintained or recommended by Stability AI. Consider using their current flagship model instead.

legacy · image · Open Source

Stable Diffusion 2.1

by Stability AI · Released December 2022

Stable Diffusion 2.1 is a text-to-image latent diffusion model that generates images from natural language prompts. It is fine-tuned from Stable Diffusion 2.0 with a less restrictive NSFW filter applied to the training data, which improves aesthetic quality. The model belongs to the Stable Diffusion 2.x series, which replaced the text encoder of SD 1.x with OpenCLIP and added native 768x768 generation.
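As a rough illustration of prompt-to-image generation at the native 768x768 resolution, here is a minimal sketch using the Hugging Face diffusers library. The repository id `stabilityai/stable-diffusion-2-1`, the `GenerationSettings` helper, and the default step count and guidance scale are assumptions made for this example, not values taken from this page. Check that the weights are still published under that id before relying on it.

```python
from dataclasses import dataclass

# Assumed Hugging Face repository id; verify it is still available.
MODEL_ID = "stabilityai/stable-diffusion-2-1"


@dataclass(frozen=True)
class GenerationSettings:
    """Hypothetical container for the arguments passed to the pipeline call."""

    prompt: str
    height: int = 768                # native resolution of the 2.1 checkpoint
    width: int = 768
    num_inference_steps: int = 25    # illustrative value, not a recommendation
    guidance_scale: float = 7.5      # illustrative value, not a recommendation

    def __post_init__(self):
        # The autoencoder downsamples by 8, so pixel sizes must be multiples of 8.
        for name in ("height", "width"):
            value = getattr(self, name)
            if value <= 0 or value % 8 != 0:
                raise ValueError(f"{name} must be a positive multiple of 8, got {value}")


def generate(settings: GenerationSettings, device: str = "cuda"):
    """Load the pipeline and render one image.

    Requires torch, diffusers, and a download of the model weights.
    """
    # Imported lazily so the settings helper works without the heavy dependencies.
    import torch
    from diffusers import StableDiffusionPipeline

    # Half precision roughly halves weight memory on GPU; use float32 elsewhere.
    dtype = torch.float16 if device == "cuda" else torch.float32
    pipe = StableDiffusionPipeline.from_pretrained(MODEL_ID, torch_dtype=dtype)
    pipe = pipe.to(device)
    result = pipe(
        settings.prompt,
        height=settings.height,
        width=settings.width,
        num_inference_steps=settings.num_inference_steps,
        guidance_scale=settings.guidance_scale,
    )
    return result.images[0]  # a PIL image
```

With the dependencies installed, `generate(GenerationSettings(prompt="a lighthouse at dusk")).save("lighthouse.png")` would write a single image to disk. The settings are validated up front so that an invalid size fails before any weights are downloaded.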

Official Site · API Docs · 🤗 Hugging Face · 📄 Research Paper

Input cost

Free (open source)

Output cost

Free (open source)

Context window

—

Max output

—

Modalities

image

Parameters

~865M (UNet) + ~354M (OpenCLIP ViT-H text encoder)

License

CreativeML Open RAIL++-M

Capabilities

Text-to-Image
Image-to-Image
Inpainting
Outpainting
Upscaling

Best For

Generating images from text prompts at up to 768x768, with better aesthetics than SD 2.0 and less restrictive filtering of the training data.

Strengths

Improved image quality over SD 2.0
Supports 768x768 resolution natively
Open source and freely available
Active community and ecosystem

Limitations

Less creative and diverse outputs compared to SD 1.x
Requires more VRAM for 768x768 generation
Not as widely adopted as SD 1.5
Some users report worse performance on certain prompts
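The extra VRAM at 768x768 follows from the size of the latent tensor the UNet works on. The back-of-the-envelope sketch below assumes the standard Stable Diffusion autoencoder (8x downsampling per side, 4 latent channels) and treats naive self-attention cost as quadratic in the number of latent positions. The function names are hypothetical, and the result is a scaling estimate rather than a measured memory figure. Memory-efficient attention implementations avoid the quadratic term.

```python
VAE_DOWNSAMPLE = 8   # each image side shrinks by 8x in latent space
LATENT_CHANNELS = 4  # channels of the latent tensor


def latent_shape(height: int, width: int) -> tuple:
    """Return (channels, latent_height, latent_width) for a given image size."""
    if height % VAE_DOWNSAMPLE or width % VAE_DOWNSAMPLE:
        raise ValueError("image sides must be multiples of 8")
    return (LATENT_CHANNELS, height // VAE_DOWNSAMPLE, width // VAE_DOWNSAMPLE)


def latent_positions(height: int, width: int) -> int:
    """Number of spatial positions the UNet attends over at full latent resolution."""
    _, h, w = latent_shape(height, width)
    return h * w


def relative_cost(height: int, width: int, base: int = 512) -> tuple:
    """Return (linear_ratio, quadratic_ratio) relative to a base x base image.

    linear_ratio scales activations; quadratic_ratio approximates the size of a
    naively materialized self-attention matrix.
    """
    ratio = latent_positions(height, width) / latent_positions(base, base)
    return ratio, ratio ** 2


if __name__ == "__main__":
    print(latent_shape(512, 512))   # (4, 64, 64)
    print(latent_shape(768, 768))   # (4, 96, 96)
    print(relative_cost(768, 768))  # (2.25, 5.0625)
```

Under these assumptions, moving from 512x512 to 768x768 multiplies the latent positions by 2.25 and the naive attention matrix by about 5, which is why the larger resolution is noticeably harder to fit on small GPUs.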

Use Cases

Artistic image creation
Concept art and design prototyping
Photo editing and manipulation
Game asset generation
Educational content creation
Marketing visuals
Personal projects and experimentation

Improvements Over Previous Model

Improved image aesthetics and composition over SD 2.0
Less restrictive NSFW filtering of the training data than SD 2.0
Better handling of text prompts with fewer artifacts
Enhanced color and lighting quality
