Wikimatrix: Mining 135m parallel sentences in 1620 language pairs from wikipedia H Schwenk, V Chaudhary, S Sun, H Gong, F Guzmán Proceedings of the 16th Conference of the European Chapter of the …, 2021 | 369 | 2021 |
Unsupervised quality estimation for neural machine translation M Fomicheva, S Sun, L Yankovskaya, F Blain, F Guzmán, M Fishel, ... Transactions of the Association for Computational Linguistics 8, 539-555, 2020 | 172 | 2020 |
Cross-lingual learning-to-rank with shared representations S Sasaki, S Sun, S Schamoni, K Duh, K Inui Proceedings of the 2018 Conference of the North American Chapter of the …, 2018 | 65 | 2018 |
MLQE-PE: A multilingual quality estimation and post-editing dataset M Fomicheva, S Sun, E Fonseca, C Zerva, F Blain, V Chaudhary, ... arXiv preprint arXiv:2010.04480, 2020 | 63 | 2020 |
CLIRMatrix: A Massively Large Collection of Bilingual and Multilingual Datasets for Cross-Lingual Information Retrieval S Sun, K Duh Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 56 | 2020 |
Are we estimating or guesstimating translation quality? S Sun, F Guzmán, L Specia Proceedings of the 58th annual meeting of the association for computational …, 2020 | 29 | 2020 |
BERGAMOT-LATTE submissions for the WMT20 quality estimation shared task M Fomicheva, S Sun, L Yankovskaya, F Blain, V Chaudhary, M Fishel, ... Association for Computational Linguistics, 2020 | 27 | 2020 |
Collecting verified COVID-19 question answer pairs A Poliak, M Fleming, C Costello, K Murray, M Yarmohammadi, S Pandya, ... Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020, 2020 | 20 | 2020 |
Audiobench: A universal benchmark for audio large language models B Wang, X Zou, G Lin, S Sun, Z Liu, W Zhang, Z Liu, AT Aw, NF Chen arXiv preprint arXiv:2406.16020, 2024 | 18 | 2024 |
AfriCLIRMatrix: Enabling cross-lingual information retrieval for african languages O Ogundepo, X Zhang, S Sun, K Duh, J Lin Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 18 | 2022 |
Battle of the Large Language Models: Dolly vs LLaMA vs Vicuna vs Guanaco vs Bard vs ChatGPT--A Text-to-SQL Parsing Comparison S Sun, Y Zhang, J Yan, Y Gao, D Ong, B Chen, J Su arXiv preprint arXiv:2310.10190, 2023 | 12 | 2023 |
Modeling document interactions for learning to rank with regularized self-attention S Sun, K Duh arXiv preprint arXiv:2005.03932, 2020 | 9 | 2020 |
Clireval: Evaluating machine translation as a cross-lingual information retrieval task S Sun, S Sia, K Duh Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020 | 7 | 2020 |
An exploratory study on multilingual quality estimation S Sun, M Fomicheva, F Blain, V Chaudhary, A El-Kishky, A Renduchintala, ... Proceedings of the 1st Conference of the Asia-Pacific Chapter of the …, 2020 | 5 | 2020 |
An analysis of bert faq retrieval models for covid-19 infobot S Sun, J Sedoc | 5 | 2020 |
Wikimatrix: Mining 135m parallel sentences in 1620 language pairs from wikipedia. arXiv cs H Schwenk, V Chaudhary, S Sun, H Gong, F Guzmán CL, 1907 | 5 | 1907 |
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages H Lovenia, R Mahendra, SM Akbar, LJV Miranda, J Santoso, E Aco, ... arXiv preprint arXiv:2406.10118, 2024 | 3 | 2024 |
Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications S Sun, A El-Kishky, V Chaudhary, J Cross, F Guzmán, L Specia Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 3 | 2021 |
Interface display method and apparatus, device, and storage medium S Sun US Patent App. 18/197,207, 2023 | 1 | 2023 |
An Exploratory Study on Model Compression for Text-to-SQL S Sun, Y Gao, Y Zhang, J Su, B Chen, Y Lin, S Sun Findings of the Association for Computational Linguistics: ACL 2023, 11647-11654, 2023 | 1 | 2023 |