Introvert! • 7d
India's beats global giants on OCR and speech benchmarks 👀 Sarvam Vision, an OCR and document understanding model from Sarvam AI, scored 84.3% accuracy on the olmOCR Bench, outperforming models from Google Gemini, Claude, and ChatGPT—especially on Indian scripts and multilingual documents. On OmniDocBench v1.5, Sarvam Vision reached 93.28% overall accuracy, showing strong performance on complex layouts, tables, mixed scripts, and mathematical content. Alongside this, Sarvam released Bulbul V3, a new text-to-speech model for Indian languages. In an independent blind listening study with 20,000+ votes, Bulbul V3 showed high listener preference, low pronunciation errors, and strong results on code mixed and number heavy text, particularly for 8 kHz telephony audio.

I'm just a normal gu... • 9m
GenAI startup Sarvam AI, recently selected by the Centre to build India’s first homegrown LLM, has unveiled a new speech AI model that supports 11 Indian languages, including Punjabi, Marathi, Odiya, Tamil and Bangla, among others. ‘Meet the all-new
See More
Founding Software En... • 1y
Excited to share a preview of the AI Prescreening Assistant I’ve been developing! This tool prescreens candidates via calls and has incredible potential in Customer Support, Sales, and Marketing. Demo Video: https://youtu.be/0sWprEl4KnE?si=M1RDm28x
See MoreGen AI, Cybersecurit... • 1y
The AI Legends #75 Days Day 27: Ray Kurzweil Kurzweil has become one of the most influential figures in the field of artificial intelligence (AI) and futurism. History: Ray Kurzweil was born on 1948, in New York. Kurzweil attended MIT, where he stud
See More
Hey I am on Medial • 6m
Hey Ai Experts here. I'm currently building an app on Lovable.dev (Vibecode) and would like to integrate an AI-based OCR feature to extract text from uploaded images (like bills or receipts). Could you please confirm if OCR integration is possible w
See MoreDownload the medial app to read full posts, comements and news.