| Event/Highlight | BharatGen: India's first sovereign, multilingual, and multimodal large language model (LLM) |
| Location | Highlighted at IIT Bombay during a visit by the Union Minister of Science and Technology |
| Developers | Led by IIT Bombay in collaboration with IIT Madras, IIT Kanpur, IIT Hyderabad, IIT Mandi, IIT Kharagpur, IIIT Hyderabad, IIIT Delhi, and IIM Indore |
| Funding | ₹235 crore under NM-ICPS and ₹1,058 crore under the India AI Mission, totaling ₹1,293 crore |
| Objective | Advance India's digital sovereignty by leveraging Indian languages, cultural contexts, and national priorities |
| Core AI Models | Param-1 (Text/LLM), Shrutam (Speech/ASR), Sooktam (Speech/TTS), Patram (Document Vision) |
| Param-1 Details | Text model trained on 7.5 trillion tokens, one-third of which is Indian-language data |
| Shrutam Details | Automatic Speech Recognition (ASR) model built to handle India's linguistic diversity and dialects |
| Sooktam Details | Text-to-Speech (TTS) model for speech synthesis in 9 Indic languages |
| Patram Details | Document-vision model for interpreting India-specific document formats such as identity records and legal documents |
| Bharat Data Sagar | Sovereign data initiative for India-centric data curation, ensuring data sovereignty, accuracy, and compliance with national regulations |
| Languages Supported | Over 22 Indian languages across text, speech, and document-vision modalities |