Follow
Sunayana Sitaram
Sunayana Sitaram
Microsoft Research India
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
Mega: Multilingual evaluation of generative ai
K Ahuja, H Diddee, R Hada, M Ochieng, K Ramesh, P Jain, A Nambi, ...
arXiv preprint arXiv:2303.12528, 2023
2122023
Language modeling for code-mixing: The role of linguistic theory based synthetic data
A Pratapa, G Bhat, M Choudhury, S Sitaram, S Dandapat, K Bali
Proceedings of the 56th Annual Meeting of the Association for Computational …, 2018
1692018
A survey of code-switched speech and language processing
S Sitaram, KR Chandu, SK Rallabandi, AW Black
arXiv preprint arXiv:1904.00784, 2019
1412019
GLUECoS: An evaluation benchmark for code-switched NLP
S Khanuja, S Dandapat, A Srinivasan, S Sitaram, M Choudhury
arXiv preprint arXiv:2004.12376, 2020
1322020
Multilingual and code-switching ASR challenges for low resource Indian languages
A Diwan, R Vaideeswaran, S Shah, A Singh, S Raghavan, S Khare, ...
arXiv preprint arXiv:2104.00235, 2021
922021
Interspeech 2018 Low Resource Automatic Speech Recognition Challenge for Indian Languages
BML Srivastava, S Sitaram, RK Mehta, KD Mohan, P Matani, S Satpal, ...
Proc. The 6th Intl. Workshop on Spoken Language Technologies for Under …, 2018
782018
A survey of code-switching: Linguistic and social perspectives for language technologies
AS Doğruöz, S Sitaram, BE Bullock, AJ Toribio
arXiv preprint arXiv:2301.01967, 2023
742023
Word embeddings for code-mixed language processing
A Pratapa, M Choudhury, S Sitaram
Proceedings of the 2018 conference on empirical methods in natural language …, 2018
722018
Polyglot neural language models: A case study in cross-lingual phonetic representation learning
Y Tsvetkov, S Sitaram, M Faruqui, G Lample, P Littell, D Mortensen, ...
arXiv preprint arXiv:1605.03832, 2016
722016
Speech synthesis of code-mixed text
S Sitaram, AW Black
Proceedings of the Tenth International Conference on Language Resources and …, 2016
482016
A survey of multilingual models for automatic speech recognition
H Yadav, S Sitaram
arXiv preprint arXiv:2202.12576, 2022
472022
Crowdsourcing speech data for low-resource languages from low-income workers
B Abraham, D Goel, D Siddarth, K Bali, M Chopra, M Choudhury, P Joshi, ...
Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020
462020
Unsung challenges of building and deploying language technologies for low resource language communities
P Joshi, C Barnes, S Santy, S Khanuja, S Shah, A Srinivasan, ...
arXiv preprint arXiv:1912.03457, 2019
462019
Culturellm: Incorporating cultural differences into large language models
C Li, M Chen, J Wang, S Sitaram, X Xie
arXiv preprint arXiv:2402.10946, 2024
442024
Curriculum design for code-switching: Experiments with language identification and language modeling with deep neural networks
M Choudhury, K Bali, S Sitaram, A Baheti
Proceedings of the 14th International Conference on Natural Language …, 2017
442017
Are large language model-based evaluators the solution to scaling up multilingual evaluation?
R Hada, V Gumma, A de Wynter, H Diddee, M Ahmed, M Choudhury, ...
arXiv preprint arXiv:2309.07462, 2023
422023
GCM: A toolkit for generating synthetic code-mixed text
MSZ Rizvi, A Srinivasan, T Ganu, M Choudhury, S Sitaram
Proceedings of the 16th Conference of the European Chapter of the …, 2021
422021
Fairness in language models beyond English: Gaps and challenges
K Ramesh, S Sitaram, M Choudhury
arXiv preprint arXiv:2302.12578, 2023
412023
A new dataset for natural language inference from code-mixed conversations
S Khanuja, S Dandapat, S Sitaram, M Choudhury
arXiv preprint arXiv:2004.05051, 2020
402020
Experiments with Cross-lingual Systems for Synthesis of Code-Mixed Text.
S Sitaram, SK Rallabandi, S Rijhwani, AW Black
SSW, 76-81, 2016
402016
The system can't perform the operation now. Try again later.
Articles 1–20