Sunayana Sitaram

Cited by

	All	Since 2020
Citations	2209	1914
h-index	26	23
i10-index	46	40

740

370

185

555

201320142015201620172018201920202021202220232024202516 14 16 28 31 68 107 173 222 321 382 739 74

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Monojit ChoudhuryProfessor of Natural Language Processing, MBZUAIVerified email at mbzuai.ac.ae
KALIKA BALIResearcher, Microsoft Research Labs IndiaVerified email at microsoft.com
Sandipan DandapatMicrosoft, IndiaVerified email at microsoft.com
Alan W BlackProfessor, Language Technologies Institute, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Kabir AhujaUniversity of WashingtonVerified email at cs.washington.edu
Sanket ShahMicrosoft ResearchVerified email at microsoft.com
Tanuja GanuMicrosoft ResearchVerified email at microsoft.com
Basil AbrahamVerified email at ee.iitm.ac.in
Prachi JainMicrosoft ResearchVerified email at microsoft.com
Vivek SeshadriStudent, CS, CMUVerified email at cs.cmu.edu
Varun GummaSCAI Center Fellow @ Microsoft Research IndiaVerified email at microsoft.com
Rishav HadaMicrosoft ResearchVerified email at microsoft.com
Simran KhanujaCarnegie Mellon UniversityVerified email at andrew.cmu.edu
SaiKrishna RallabandiSenior Data Scientist, Fidelity InvestmentsVerified email at andrew.cmu.edu
Akshay NambiPrincipal Researcher, Microsoft ResearchVerified email at microsoft.com
Mohamed Ahmed / Maxamed AxmedMicrosoftVerified email at microsoft.com
Krithika RameshJohns Hopkins UniversityVerified email at jh.edu
Millicent OchiengMicrosoftVerified email at microsoft.com
hemant yadavPhD scholar at IIIT Delhi.Verified email at iiitd.ac.in
Brij Mohan Lal SrivastavaNijtaVerified email at nijta.com

Sunayana Sitaram

Microsoft Research India

Verified email at microsoft.com - Homepage

Multilingual NLP evaluation LLMs and culture multilingualism LLMs


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Mega: Multilingual evaluation of generative ai K Ahuja, H Diddee, R Hada, M Ochieng, K Ramesh, P Jain, A Nambi, ... arXiv preprint arXiv:2303.12528, 2023	212	2023
Language modeling for code-mixing: The role of linguistic theory based synthetic data A Pratapa, G Bhat, M Choudhury, S Sitaram, S Dandapat, K Bali Proceedings of the 56th Annual Meeting of the Association for Computational …, 2018	169	2018
A survey of code-switched speech and language processing S Sitaram, KR Chandu, SK Rallabandi, AW Black arXiv preprint arXiv:1904.00784, 2019	141	2019
GLUECoS: An evaluation benchmark for code-switched NLP S Khanuja, S Dandapat, A Srinivasan, S Sitaram, M Choudhury arXiv preprint arXiv:2004.12376, 2020	132	2020
Multilingual and code-switching ASR challenges for low resource Indian languages A Diwan, R Vaideeswaran, S Shah, A Singh, S Raghavan, S Khare, ... arXiv preprint arXiv:2104.00235, 2021	92	2021
Interspeech 2018 Low Resource Automatic Speech Recognition Challenge for Indian Languages BML Srivastava, S Sitaram, RK Mehta, KD Mohan, P Matani, S Satpal, ... Proc. The 6th Intl. Workshop on Spoken Language Technologies for Under …, 2018	78	2018
A survey of code-switching: Linguistic and social perspectives for language technologies AS Doğruöz, S Sitaram, BE Bullock, AJ Toribio arXiv preprint arXiv:2301.01967, 2023	74	2023
Word embeddings for code-mixed language processing A Pratapa, M Choudhury, S Sitaram Proceedings of the 2018 conference on empirical methods in natural language …, 2018	72	2018
Polyglot neural language models: A case study in cross-lingual phonetic representation learning Y Tsvetkov, S Sitaram, M Faruqui, G Lample, P Littell, D Mortensen, ... arXiv preprint arXiv:1605.03832, 2016	72	2016
Speech synthesis of code-mixed text S Sitaram, AW Black Proceedings of the Tenth International Conference on Language Resources and …, 2016	48	2016
A survey of multilingual models for automatic speech recognition H Yadav, S Sitaram arXiv preprint arXiv:2202.12576, 2022	47	2022
Crowdsourcing speech data for low-resource languages from low-income workers B Abraham, D Goel, D Siddarth, K Bali, M Chopra, M Choudhury, P Joshi, ... Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020	46	2020
Unsung challenges of building and deploying language technologies for low resource language communities P Joshi, C Barnes, S Santy, S Khanuja, S Shah, A Srinivasan, ... arXiv preprint arXiv:1912.03457, 2019	46	2019
Culturellm: Incorporating cultural differences into large language models C Li, M Chen, J Wang, S Sitaram, X Xie arXiv preprint arXiv:2402.10946, 2024	44	2024
Curriculum design for code-switching: Experiments with language identification and language modeling with deep neural networks M Choudhury, K Bali, S Sitaram, A Baheti Proceedings of the 14th International Conference on Natural Language …, 2017	44	2017
Are large language model-based evaluators the solution to scaling up multilingual evaluation? R Hada, V Gumma, A de Wynter, H Diddee, M Ahmed, M Choudhury, ... arXiv preprint arXiv:2309.07462, 2023	42	2023
GCM: A toolkit for generating synthetic code-mixed text MSZ Rizvi, A Srinivasan, T Ganu, M Choudhury, S Sitaram Proceedings of the 16th Conference of the European Chapter of the …, 2021	42	2021
Fairness in language models beyond English: Gaps and challenges K Ramesh, S Sitaram, M Choudhury arXiv preprint arXiv:2302.12578, 2023	41	2023
A new dataset for natural language inference from code-mixed conversations S Khanuja, S Dandapat, S Sitaram, M Choudhury arXiv preprint arXiv:2004.05051, 2020	40	2020
Experiments with Cross-lingual Systems for Synthesis of Code-Mixed Text. S Sitaram, SK Rallabandi, S Rijhwani, AW Black SSW, 76-81, 2016	40	2016

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors