Følg
Lintang Sutawika
Lintang Sutawika
Language Technology Institute at Carnegie Mellon Institute
Verificeret mail på sutawika.com
Titel
Citeret af
Citeret af
År
Multitask prompted training enables zero-shot task generalization
V Sanh, A Webson, C Raffel, SH Bach, L Sutawika, Z Alyafeai, A Chaffin, ...
arXiv preprint arXiv:2110.08207, 2021
17962021
Bloom: A 176b-parameter open-access multilingual language model
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
17652023
Pythia: A suite for analyzing large language models across training and scaling
S Biderman, H Schoelkopf, QG Anthony, H Bradley, K O’Brien, E Hallahan, ...
International Conference on Machine Learning, 2397-2430, 2023
10382023
Crosslingual generalization through multitask finetuning
N Muennighoff, T Wang, L Sutawika, A Roberts, S Biderman, TL Scao, ...
arXiv preprint arXiv:2211.01786, 2022
7072022
Emergent and predictable memorization in large language models
S Biderman, U Prashanth, L Sutawika, H Schoelkopf, Q Anthony, ...
Advances in Neural Information Processing Systems 36, 28072-28090, 2023
1602023
What language model to train if you have one million GPU hours?
TL Scao, T Wang, D Hesslow, L Saulnier, S Bekman, MS Bari, ...
arXiv preprint arXiv:2210.15424, 2022
1192022
A framework for few-shot language model evaluation, 12 2023
L Gao, J Tow, B Abbasi, S Biderman, S Black, A DiPofi, C Foster, ...
URL https://zenodo. org/records/10256836 7, 2023
942023
BLOOM+ 1: Adding language support to BLOOM for zero-shot prompting
ZX Yong, H Schoelkopf, N Muennighoff, AF Aji, DI Adelani, K Almubarak, ...
arXiv preprint arXiv:2212.09535, 2022
632022
Prompting multilingual large language models to generate code-mixed texts: The case of south East Asian languages
ZX Yong, R Zhang, JZ Forde, S Wang, A Subramonian, H Lovenia, ...
arXiv preprint arXiv:2303.13592, 2023
362023
Lessons from the trenches on reproducible evaluation of language models
S Biderman, H Schoelkopf, L Sutawika, L Gao, J Tow, B Abbasi, AF Aji, ...
arXiv preprint arXiv:2405.14782, 2024
322024
Towards better structured and less noisy web data: Oscar with register annotations
V Laippala, A Salmela, S Rönnqvist, AF Aji, LH Chang, A Dhifallah, ...
Proceedings of the eighth workshop on noisy user-generated text (w-nut 2022 …, 2022
122022
An initial exploration of the suitability of long-short-term-memory networks for multiple site fatigue damage prediction on aircraft lap joints
MI Mas, MI Fanany, T Devin, LA Sutawika
2017 International Conference on Advanced Computer Science and Information …, 2017
92017
Restricted Boltzmann machines for unsupervised feature selection with partial least square feature extractor for microarray datasets
LA Sutawika, I Wasito
2017 International Conference on Advanced Computer Science and Information …, 2017
62017
Pangea: A fully open multilingual multimodal llm for 39 languages
X Yue, Y Song, A Asai, S Kim, JD Nyandwi, S Khanuja, A Kantharuban, ...
arXiv preprint arXiv:2410.16153, 2024
42024
Data Processing Matters: SRPH-Konvergen AI's Machine Translation System for WMT'21
L Sutawika, JCB Cruz
arXiv preprint arXiv:2111.10513, 2021
32021
Samsung research philippines-datasaur ai’s submission for the wmt22 large scale multilingual translation task
JCB Cruz, L Sutawika
Proceedings of the Seventh Conference on Machine Translation (WMT), 1034-1038, 2022
22022
Constant-amplitude fatigue crack growth sequence regression on an aircraft lap joint using a 1-D convolutional network
MI Mas, MI Fanany, T Devin, LA Sutawika
2017 1st International Conference on Informatics and Computational Sciences …, 2017
22017
Re-Evaluating Evaluation for Multilingual Summarization
J Forde, R Zhang, L Sutawika, A Aji, S Cahyawijaya, GI Winata, M Wu, ...
Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024
2024
Current Status of NLP in South East Asia with Insights from Multilingualism and Language Diversity
AF Aji, JZ Forde, AM Loo, L Sutawika, S Wang, GI Winata, ZX Yong, ...
Proceedings of the 13th International Joint Conference on Natural Language …, 2023
2023
Utilizing Weak Supervision To Generate Indonesian Conservation Dataset
M Fransiska, D Pitaloka, S Putra, L Sutawika
arXiv preprint arXiv:2310.11258, 2023
2023
Systemet kan ikke foretage handlingen nu. Prøv igen senere.
Artikler 1–20