NVIDIA · Image Generation
Cosmos 3 Super — Text to Image
Generate high-resolution images from text prompts. Supports aspect ratios 1:1, 16:9, 9:16, 4:3, and 3:4.
- 64B parameters
- MoT architecture
- #1 open on Text-to-Image eval
- JPEG / PNG output
Model Index
cosmos3ai is powered by NVIDIA Cosmos 3 Super — a 64B Mixture-of-Transformers model trained on 20 trillion multimodal tokens. Below are the capabilities currently available on the platform.
NVIDIA · Image Generation
Generate high-resolution images from text prompts. Supports aspect ratios 1:1, 16:9, 9:16, 4:3, and 3:4.
NVIDIA · Video Generation
Animate a still image into a short cinematic video clip. Supports 3, 5, and 8 second output durations.
All results are from official NVIDIA launch materials published on 2026-05-31. Source: NVIDIA Developer Blog.