RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization Paper • 2401.14280 • Published Jan 25, 2024 • 1
APEX: Large-scale Multi-task Aesthetic-Informed Popularity Prediction for AI-Generated Music Paper • 2605.03395 • Published 3 days ago • 2
RomanLens: The Role Of Latent Romanization In Multilinguality In LLMs Paper • 2502.07424 • Published Jun 9, 2025
IndicLLMSuite Collection Largest collection of pretraining and instruction fine-tuning datasets for 22 Indic languages. • 4 items • Updated Nov 5, 2024 • 18
Airavata Evaluation Suite Collection A collection of benchmarks used to evaluate Airavata, a Hindi instruction-tuned model built on top of Sarvam's OpenHathi base model. • 20 items • Updated Mar 2 • 9