Folgen
Lintang Sutawika
Lintang Sutawika
Language Technology Institute at Carnegie Mellon Institute
Bestätigte E-Mail-Adresse bei sutawika.com
Titel
Zitiert von
Zitiert von
Jahr
Multitask prompted training enables zero-shot task generalization
V Sanh, A Webson, C Raffel, SH Bach, L Sutawika, Z Alyafeai, A Chaffin, ...
arXiv preprint arXiv:2110.08207, 2021
18092021
Bloom: A 176b-parameter open-access multilingual language model
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
17462023
Pythia: A suite for analyzing large language models across training and scaling
S Biderman, H Schoelkopf, QG Anthony, H Bradley, K O’Brien, E Hallahan, ...
International Conference on Machine Learning, 2397-2430, 2023
9732023
Crosslingual generalization through multitask finetuning
N Muennighoff, T Wang, L Sutawika, A Roberts, S Biderman, TL Scao, ...
arXiv preprint arXiv:2211.01786, 2022
6822022
Emergent and predictable memorization in large language models
S Biderman, U Prashanth, L Sutawika, H Schoelkopf, Q Anthony, ...
Advances in Neural Information Processing Systems 36, 2024
1482024
What language model to train if you have one million gpu hours?
TL Scao, T Wang, D Hesslow, L Saulnier, S Bekman, MS Bari, ...
arXiv preprint arXiv:2210.15424, 2022
1142022
A framework for few-shot language model evaluation, 12 2023
L Gao, J Tow, B Abbasi, S Biderman, S Black, A DiPofi, C Foster, ...
URL https://zenodo. org/records/10256836 7, 0
91
Bloom+ 1: Adding language support to bloom for zero-shot prompting
ZX Yong, H Schoelkopf, N Muennighoff, AF Aji, DI Adelani, K Almubarak, ...
arXiv preprint arXiv:2212.09535, 2022
562022
Prompting multilingual large language models to generate code-mixed texts: The case of south east asian languages
ZX Yong, R Zhang, JZ Forde, S Wang, A Subramonian, H Lovenia, ...
arXiv preprint arXiv:2303.13592, 2023
342023
Lessons from the trenches on reproducible evaluation of language models
S Biderman, H Schoelkopf, L Sutawika, L Gao, J Tow, B Abbasi, AF Aji, ...
arXiv preprint arXiv:2405.14782, 2024
252024
Towards better structured and less noisy Web data: Oscar with Register annotations
V Laippala, A Salmela, S Rönnqvist, AF Aji, LH Chang, A Dhifallah, ...
Proceedings of the eighth workshop on noisy user-generated text (w-nut 2022 …, 2022
122022
An initial exploration of the suitability of long-short-term-memory networks for multiple site fatigue damage prediction on aircraft lap joints
MI Mas, MI Fanany, T Devin, LA Sutawika
2017 International Conference on Advanced Computer Science and Information …, 2017
92017
Restricted Boltzmann machines for unsupervised feature selection with partial least square feature extractor for microarray datasets
LA Sutawika, I Wasito
2017 International Conference on Advanced Computer Science and Information …, 2017
62017
Pangea: A fully open multilingual multimodal llm for 39 languages
X Yue, Y Song, A Asai, S Kim, JD Nyandwi, S Khanuja, A Kantharuban, ...
arXiv preprint arXiv:2410.16153, 2024
42024
Data Processing Matters: SRPH-Konvergen AI's Machine Translation System for WMT'21
L Sutawika, JCB Cruz
arXiv preprint arXiv:2111.10513, 2021
32021
Samsung research philippines-datasaur ai’s submission for the wmt22 large scale multilingual translation task
JCB Cruz, L Sutawika
Proceedings of the Seventh Conference on Machine Translation (WMT), 1034-1038, 2022
22022
Constant-amplitude fatigue crack growth sequence regression on an aircraft lap joint using a 1-D convolutional network
MI Mas, MI Fanany, T Devin, LA Sutawika
2017 1st International Conference on Informatics and Computational Sciences …, 2017
22017
Re-Evaluating Evaluation for Multilingual Summarization
J Forde, R Zhang, L Sutawika, A Aji, S Cahyawijaya, GI Winata, M Wu, ...
Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024
2024
Current Status of NLP in South East Asia with Insights from Multilingualism and Language Diversity
AF Aji, JZ Forde, AM Loo, L Sutawika, S Wang, GI Winata, ZX Yong, ...
Proceedings of the 13th International Joint Conference on Natural Language …, 2023
2023
Utilizing Weak Supervision To Generate Indonesian Conservation Dataset
M Fransiska, D Pitaloka, S Putra, L Sutawika
arXiv preprint arXiv:2310.11258, 2023
2023
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20