Text

Frontier Reasoning Models

Large, general-purpose models optimized for multi-step reasoning, coding, and analysis.

Text · Fast

Lightweight Chat Models

Smaller, low-latency models tuned for conversational responsiveness.

Image Gen

Diffusion Image Models

Text-to-image systems producing photorealistic and stylized visuals from prompts.

Image Edit

Instruction-Based Image Editors

Models that modify existing images based on natural-language instructions.

Speech

Text-to-Speech Models

Neural voice synthesis systems producing natural, expressive audio.

Audio Gen

Generative Music Models

Systems that compose original music from text or melodic prompts.

Open Weight

Open-Source Language Families

Publicly released weights enabling local hosting and fine-tuning.

Open Weight

Community Fine-Tunes

Specialized variants adapted by the community for domain-specific tasks.

Multimodal

Vision-Language Models

Models that jointly reason over images and text within a single context.