Multitask prompted training enables zero-shot task generalization V Sanh, A Webson, C Raffel, SH Bach, L Sutawika, Z Alyafeai, A Chaffin, ... arXiv preprint arXiv:2110.08207, 2021 | 1809 | 2021 |
Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | 1746 | 2023 |
Pythia: A suite for analyzing large language models across training and scaling S Biderman, H Schoelkopf, QG Anthony, H Bradley, K O’Brien, E Hallahan, ... International Conference on Machine Learning, 2397-2430, 2023 | 973 | 2023 |
Crosslingual generalization through multitask finetuning N Muennighoff, T Wang, L Sutawika, A Roberts, S Biderman, TL Scao, ... arXiv preprint arXiv:2211.01786, 2022 | 682 | 2022 |
Emergent and predictable memorization in large language models S Biderman, U Prashanth, L Sutawika, H Schoelkopf, Q Anthony, ... Advances in Neural Information Processing Systems 36, 2024 | 148 | 2024 |
What language model to train if you have one million gpu hours? TL Scao, T Wang, D Hesslow, L Saulnier, S Bekman, MS Bari, ... arXiv preprint arXiv:2210.15424, 2022 | 114 | 2022 |
A framework for few-shot language model evaluation, 12 2023 L Gao, J Tow, B Abbasi, S Biderman, S Black, A DiPofi, C Foster, ... URL https://zenodo. org/records/10256836 7, 0 | 91 | |
Bloom+ 1: Adding language support to bloom for zero-shot prompting ZX Yong, H Schoelkopf, N Muennighoff, AF Aji, DI Adelani, K Almubarak, ... arXiv preprint arXiv:2212.09535, 2022 | 56 | 2022 |
Prompting multilingual large language models to generate code-mixed texts: The case of south east asian languages ZX Yong, R Zhang, JZ Forde, S Wang, A Subramonian, H Lovenia, ... arXiv preprint arXiv:2303.13592, 2023 | 34 | 2023 |
Lessons from the trenches on reproducible evaluation of language models S Biderman, H Schoelkopf, L Sutawika, L Gao, J Tow, B Abbasi, AF Aji, ... arXiv preprint arXiv:2405.14782, 2024 | 25 | 2024 |
Towards better structured and less noisy Web data: Oscar with Register annotations V Laippala, A Salmela, S Rönnqvist, AF Aji, LH Chang, A Dhifallah, ... Proceedings of the eighth workshop on noisy user-generated text (w-nut 2022 …, 2022 | 12 | 2022 |
An initial exploration of the suitability of long-short-term-memory networks for multiple site fatigue damage prediction on aircraft lap joints MI Mas, MI Fanany, T Devin, LA Sutawika 2017 International Conference on Advanced Computer Science and Information …, 2017 | 9 | 2017 |
Restricted Boltzmann machines for unsupervised feature selection with partial least square feature extractor for microarray datasets LA Sutawika, I Wasito 2017 International Conference on Advanced Computer Science and Information …, 2017 | 6 | 2017 |
Pangea: A fully open multilingual multimodal llm for 39 languages X Yue, Y Song, A Asai, S Kim, JD Nyandwi, S Khanuja, A Kantharuban, ... arXiv preprint arXiv:2410.16153, 2024 | 4 | 2024 |
Data Processing Matters: SRPH-Konvergen AI's Machine Translation System for WMT'21 L Sutawika, JCB Cruz arXiv preprint arXiv:2111.10513, 2021 | 3 | 2021 |
Samsung research philippines-datasaur ai’s submission for the wmt22 large scale multilingual translation task JCB Cruz, L Sutawika Proceedings of the Seventh Conference on Machine Translation (WMT), 1034-1038, 2022 | 2 | 2022 |
Constant-amplitude fatigue crack growth sequence regression on an aircraft lap joint using a 1-D convolutional network MI Mas, MI Fanany, T Devin, LA Sutawika 2017 1st International Conference on Informatics and Computational Sciences …, 2017 | 2 | 2017 |
Re-Evaluating Evaluation for Multilingual Summarization J Forde, R Zhang, L Sutawika, A Aji, S Cahyawijaya, GI Winata, M Wu, ... Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024 | | 2024 |
Current Status of NLP in South East Asia with Insights from Multilingualism and Language Diversity AF Aji, JZ Forde, AM Loo, L Sutawika, S Wang, GI Winata, ZX Yong, ... Proceedings of the 13th International Joint Conference on Natural Language …, 2023 | | 2023 |
Utilizing Weak Supervision To Generate Indonesian Conservation Dataset M Fransiska, D Pitaloka, S Putra, L Sutawika arXiv preprint arXiv:2310.11258, 2023 | | 2023 |