
Sarvam AI has set a new benchmark for sovereign AI with the launch of Sarvam Vision, a 3B-parameter...
The AMW Read
The launch of a localized vision-language model that outperforms a frontier lab (Gemini) validates the 'Sovereign AI' pattern and the efficacy of specialized, smaller-parameter models for regional dominance.
Sarvam AI has set a new benchmark for sovereign AI with the launch of Sarvam Vision, a 3B-parameter vision-language model that achieved 84.3% accuracy on olmOCR-Bench, outperforming Gemini 3 Pro. By delivering 93.28% accuracy on OmniDocBench v1.5 across all 22 scheduled Indian languages, the Bengaluru startup is proving that localized, smaller models can surpass global frontier labs in specialized tasks. This shift highlights a critical move toward cost-efficient, domain-specific AI infrastructure that reduces reliance on general-purpose Western models. The inclusion of Bulbul V3 for high-fidelity speech synthesis further solidifies a full-stack Indian ecosystem ready for mass-market adoption. 🇮🇳


