by Google· Released May 2022
Imagen is Google's text-to-image diffusion model that generates high-fidelity images from natural language descriptions. It leverages a large language model (T5-XXL) for text understanding and a diffusion model for image generation, producing photorealistic outputs. Imagen is known for its exceptional image quality and deep language comprehension.
Input cost
—
Output cost
—
Context window
—
Max output
—
Modalities
License
proprietary
Generating high-quality, photorealistic images from detailed text prompts.