UDapter: Language Adaptation for Truly Universal Dependency Parsing A Üstün, A Bisazza, G Bouma, G van Noord Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 122 | 2020 |
Aya model: An instruction finetuned open-access multilingual language model A Üstün, V Aryabumi, ZX Yong, WY Ko, D D'souza, G Onilude, N Bhandari, ... arXiv preprint arXiv:2402.07827, 2024 | 117 | 2024 |
Massive choice, ample tasks (MaChAmp): A toolkit for multi-task learning in NLP R Van Der Goot, A Üstün, A Ramponi, I Sharaf, B Plank arXiv preprint arXiv:2005.14672, 2020 | 101 | 2020 |
Pushing mixture of experts to the limit: Extremely parameter efficient MoE for instruction tuning T Zadouri, A Üstün, A Ahmadian, B Ermiş, A Locatelli, S Hooker arXiv preprint arXiv:2309.05444, 2023 | 79 | 2023 |
Back to basics: Revisiting REINFORCE style optimization for learning from human feedback in LLMs A Ahmadian, C Cremer, M Gallé, M Fadaee, J Kreutzer, O Pietquin, ... arXiv preprint arXiv:2402.14740, 2024 | 76 | 2024 |
When less is more: Investigating data pruning for pretraining LLMs at scale M Marion, A Üstün, L Pozzobon, A Wang, M Fadaee, S Hooker arXiv preprint arXiv:2309.04564, 2023 | 76 | 2023 |
Aya dataset: An open-access collection for multilingual instruction tuning S Singh, F Vargus, D D'souza, BF Karlsson, A Mahendiran, WY Ko, ... arXiv preprint arXiv:2402.06619, 2024 | 63 | 2024 |
Aya 23: Open weight releases to further multilingual progress V Aryabumi, J Dang, D Talupuru, S Dash, D Cairuz, H Lin, B Venkitesh, ... arXiv preprint arXiv:2405.15032, 2024 | 59 | 2024 |
Multilingual unsupervised neural machine translation with denoising adapters A Üstün, A Berard, L Besacier, M Gallé arXiv preprint arXiv:2110.10472, 2021 | 46 | 2021 |
Characters or morphemes: How to represent words? A Üstün, M Kurfalı, B Can Association for Computational Linguistics, 2018 | 45 | 2018 |
From masked language modeling to translation: Non-English auxiliary tasks improve zero-shot spoken language … R Van Der Goot, I Sharaf, A Imankulova, A Üstün, M Stepanović, ... Proceedings of the 2021 Conference of the North American Chapter of the …, 2021 | 42 | 2021 |
Automatic judgement forecasting for pending applications of the European Court of Human Rights M Medvedeva, A Üstün, X Xu, M Vols, M Wieling Proceedings of the Fifth Workshop on Automated Semantic Analysis of …, 2021 | 32 | 2021 |
Intriguing properties of quantization at scale A Ahmadian, S Dash, H Chen, B Venkitesh, ZS Gou, P Blunsom, A Üstün, ... Advances in Neural Information Processing Systems 36, 34278-34294, 2023 | 31 | 2023 |
Hyper-X: A unified hypernetwork for multi-task multilingual transfer A Üstün, A Bisazza, G Bouma, G van Noord, S Ruder arXiv preprint arXiv:2205.12148, 2022 | 30 | 2022 |
Unsupervised morphological segmentation using neural word embeddings A Üstün, B Can Statistical Language and Speech Processing: 4th International Conference …, 2016 | 20 | 2016 |
From masked language modeling to translation: Non-English auxiliary tasks improve zero-shot spoken language understanding R Van Der Goot, I Sharaf, A Imankulova, A Üstün, M Stepanović, ... arXiv preprint arXiv:2105.07316, 2021 | 14 | 2021 |
UDapter: Typology-based language adapters for multilingual dependency parsing and sequence labeling A Üstün, A Bisazza, G Bouma, G van Noord Computational Linguistics 48 (3), 555-592, 2022 | 12 | 2022 |
Turkish POS tagging by reducing sparsity with morpheme tags in small datasets B Can, A Üstün, M Kurfalı Computational Linguistics and Intelligent Text Processing: 17th …, 2018 | 11 | 2018 |
On the difficulty of translating free-order case-marking languages A Bisazza, A Üstün, S Sportel Transactions of the Association for Computational Linguistics 9, 1233-1248, 2021 | 10 | 2021 |
When does Parameter-Efficient Transfer Learning Work for Machine Translation? A Üstün, AC Stickland arXiv preprint arXiv:2205.11277, 2022 | 8 | 2022 |