Mistral Saba

Provider: Mistral
License: Proprietary
Release Date: Feb 17, 2025
Context Window: 32,000 tokens

Pricing (per 1M tokens)

Input: $0.20
Output: $0.60
Blended (3:1 input:output): $0.30
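The blended figure is a 3:1 weighted average of the input and output rates. A minimal sketch of that arithmetic in Python, assuming the per-million-token prices listed above:

```python
# Blended price per 1M tokens, assuming a 3:1 input:output token ratio
# and the listed rates ($0.20 input, $0.60 output per 1M tokens).

def blended_price(input_price: float, output_price: float,
                  input_weight: int = 3, output_weight: int = 1) -> float:
    """Weighted-average price per 1M tokens."""
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight

print(blended_price(0.20, 0.60))  # -> 0.3
```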

Capabilities

Speed: 86 tokens/s

Latency

Time to First Token (TTFT): 0.29 s
Time to generate a 500-token response: 6.12 s
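The 500-token figure is consistent with TTFT plus generation time at the listed speed. A quick check, assuming the reported 0.29 s TTFT and 86 tokens/s throughput:

```python
# Rough end-to-end latency estimate: TTFT + (tokens to generate / output speed).
# Uses the figures reported above (0.29 s TTFT, 86 tokens/s).

def response_latency(ttft_s: float, tokens: int, tokens_per_s: float) -> float:
    return ttft_s + tokens / tokens_per_s

print(round(response_latency(0.29, 500, 86), 2))  # ~6.1 s, in line with the 6.12 s measurement
```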

Benchmarks

Intelligence: ●●○○○
Math: ●●○○○

MMLU Pro: 61.1%
GPQA: 42.4%
HLE: 4.1%
SciCode: 24.1%
AIME: 13.0%
MATH 500: 67.7%
HumanEval: 85.4%

Mistral Saba is a 24-billion-parameter language model tuned for Arabic and several languages of Indian origin, especially Tamil. It delivers higher accuracy on those languages than much larger general-purpose models, streams roughly 150 tokens/s, and runs on a single GPU. It is available both as a cloud API and for fully local, on-prem deployment.
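For the cloud-API route, here is a minimal sketch of a chat request against Mistral's hosted chat completions endpoint. The model identifier "mistral-saba-latest" and the prompt are assumptions for illustration; check Mistral's current API reference for the exact model name before relying on it.

```python
import os
import requests

# Minimal chat completion request to Mistral's hosted API.
# Assumption: "mistral-saba-latest" is the served identifier for Mistral Saba;
# verify the model name against the current API docs.
API_URL = "https://api.mistral.ai/v1/chat/completions"

resp = requests.post(
    API_URL,
    headers={
        "Authorization": f"Bearer {os.environ['MISTRAL_API_KEY']}",
        "Content-Type": "application/json",
    },
    json={
        "model": "mistral-saba-latest",  # assumed identifier for Mistral Saba
        "messages": [
            {"role": "user", "content": "Summarise this paragraph in Arabic: ..."}
        ],
    },
    timeout=30,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```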

Developers get a fast, low-cost way to build chatbots, content generators, or domain-specific tools that sound native to Middle Eastern and South Asian users. The model's flexible deployment options, fine-tuning support, and regional nuance mean you can ship compliant, high-quality features without juggling oversized models or taking on external data-privacy risks.