Stability.ai

Stable Diffusion 3 Large

Released Apr 2024

AA Text→Image rank: #83
Parameters: 8B

Stable Diffusion 3 Large is a text-to-image model and the largest variant of the Stable Diffusion 3 family. It is built on a Multimodal Diffusion Transformer (MMDiT) architecture, which uses separate sets of weights for the image and text modalities while letting the two token streams attend to each other. Because information flows in both directions between the image and text representations during generation, the model interprets complex, multi-subject prompts more reliably and shows better spatial reasoning than previous generations.

The model has approximately 8 billion parameters and uses three text encoders: CLIP-L/14, OpenCLIP-ViT/G, and T5-v1.1-XXL. This configuration is intended to improve prompt adherence and the rendering of legible typography within images. The model is trained with a rectified flow formulation, which connects noise and data along straight-line paths. Straighter paths can be followed with fewer sampling steps, which makes sampling more efficient and improves generative quality.
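The rectified flow idea can be illustrated with a small, dependency-free sketch. It is not Stability AI's training or sampling code. It assumes the common convention in which t = 0 is the data sample and t = 1 is pure noise, and it replaces the trained network with a hypothetical oracle velocity for a single known data point, purely to show why a straight path can be integrated in a handful of Euler steps. All function names are illustrative.

```python
import random


def interpolate(x0, noise, t):
    """Point on the straight line between data x0 (t=0) and noise (t=1)."""
    return [(1.0 - t) * a + t * e for a, e in zip(x0, noise)]


def target_velocity(x0, noise):
    """Regression target for the network: the constant slope of that line."""
    return [e - a for a, e in zip(x0, noise)]


def euler_sample(velocity_fn, noise, num_steps):
    """Integrate dz/dt = v(z, t) from t=1 (noise) down to t=0 (data)."""
    z = list(noise)
    dt = 1.0 / num_steps
    for i in range(num_steps):
        t = 1.0 - i * dt
        v = velocity_fn(z, t)
        z = [zi - dt * vi for zi, vi in zip(z, v)]
    return z


# Toy demonstration with one known "image" of three values (assumed data).
x0 = [0.5, -1.0, 2.0]
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in x0]


def oracle_velocity(z, t):
    # Stand-in for a trained model: on the line, noise - x0 == (z - x0) / t.
    return [(zi - a) / t for zi, a in zip(z, x0)]


recovered = euler_sample(oracle_velocity, noise, num_steps=4)
```

With a perfectly straight path, four Euler steps already land on the data point. A trained model only approximates this velocity field, so practical samplers use more steps.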

Architecture and Capabilities

The core of the model consists of joint attention blocks: image and text tokens are projected with their own modality-specific weights and then combined into a single sequence for the attention operation, so every token can attend to tokens from both modalities. This lets the model map semantic concepts to visual layouts, making it particularly capable of following intricate instructions and placing objects accurately within a scene. It supports high-resolution generation and is designed to handle a range of aspect ratios while maintaining structural integrity.
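The joint attention step can be sketched in a few lines of plain Python. This is a simplified, single-head illustration rather than the model's actual implementation. It leaves out multi-head splitting, normalization, timestep conditioning, and the per-modality MLPs, and the token counts, dimensions, and names (`joint_attention`, `text_w`, `image_w`) are assumptions chosen for readability.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def softmax(row):
    """Numerically stable softmax over one row of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def random_matrix(rng, rows, cols):
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]


def joint_attention(text_tokens, image_tokens, text_w, image_w):
    """Single-head attention over the concatenated text + image sequence."""
    # Each modality is projected with its own weights...
    q = matmul(text_tokens, text_w["q"]) + matmul(image_tokens, image_w["q"])
    k = matmul(text_tokens, text_w["k"]) + matmul(image_tokens, image_w["k"])
    v = matmul(text_tokens, text_w["v"]) + matmul(image_tokens, image_w["v"])
    # ...and list concatenation above joins them along the sequence axis,
    # so every query sees keys from both text and image tokens.
    scale = 1.0 / math.sqrt(len(q[0]))
    k_transposed = [list(col) for col in zip(*k)]
    scores = matmul(q, k_transposed)
    weights = [softmax([s * scale for s in row]) for row in scores]
    out = matmul(weights, v)
    n_text = len(text_tokens)
    # Split the result back into the two streams.
    return out[:n_text], out[n_text:], weights


rng = random.Random(0)
dim = 4
text_tokens = random_matrix(rng, 3, dim)   # 3 hypothetical text tokens
image_tokens = random_matrix(rng, 5, dim)  # 5 hypothetical image patches
text_w = {name: random_matrix(rng, dim, dim) for name in ("q", "k", "v")}
image_w = {name: random_matrix(rng, dim, dim) for name in ("q", "k", "v")}

text_out, image_out, weights = joint_attention(
    text_tokens, image_tokens, text_w, image_w
)
```

Each row of `weights` spans all eight tokens, so an image patch draws on the prompt tokens and a prompt token draws on the image patches within the same attention operation.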

For prompting, the model is optimized for natural language descriptions. The T5 encoder allows it to understand longer, more descriptive prompts than previous iterations could. Detailed, context-rich sentences are generally recommended for best results, although the model can still interpret the tag-based prompts used with earlier versions of Stable Diffusion.

Rankings & Comparison