OpusFilter: A configurable parallel corpus filtering toolbox M Aulamo, S Virpioja, J Tiedemann 2020 Annual Conference of the Association for Computational Linguistics, 150-156, 2020 | 41 | 2020 |
OpusTools and parallel corpus diagnostics M Aulamo, U Sulubacak, S Virpioja, J Tiedemann Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020 | 35 | 2020 |
The OPUS resource repository: An open package for creating parallel corpora and machine translation services M Aulamo, J Tiedemann Proceedings of the 22nd Nordic Conference on Computational Linguistics, 389-394, 2019 | 19 | 2019 |
Democratizing neural machine translation with OPUS-MT J Tiedemann, M Aulamo, D Bakshandaeva, M Boggia, SA Grönroos, ... Language Resources and Evaluation 58 (2), 713-755, 2024 | 16 | 2024 |
A new massive multilingual dataset for high-performance language technologies O De Gibert, G Nail, N Arefyev, M Bañón, J Van Der Linde, S Ji, ... arXiv preprint arXiv:2403.14009, 2024 | 15 | 2024 |
Paraphrase detection on noisy subtitles in six languages E Sjöblom, M Creutz, M Aulamo arXiv preprint arXiv:1809.07978, 2018 | 12 | 2018 |
HPLT: High performance language technologies M Aulamo, N Bogoychev, S Ji, G Nail, G Ramírez‐Sánchez, J Tiedemann, ... Proceedings of the 24th Annual Conference of the European Association for …, 2023 | 11 | 2023 |
The fiskmö project: Resources and tools for finnish-swedish machine translation and cross-linguistic research J Tiedemann, T Nieminen, M Aulamo, J Kanerva, A Leino, F Ginter, ... Language Resources and Evaluation Conference, 3808-3815, 2020 | 11 | 2020 |
Boosting neural machine translation from Finnish to Northern Sámi with rule-based backtranslation M Aulamo, S Virpioja, Y Scherrer, J Tiedemann Proceedings of the 23rd Nordic Conference on Computational Linguistics …, 2021 | 9 | 2021 |
Democratizing machine translation with opus-mt J Tiedemann, M Aulamo, D Bakshandaeva, M Boggia, SA Grönroos, ... arXiv preprint arXiv:2212.01936, 2022 | 8 | 2022 |
Four approaches to low-resource multilingual NMT: The Helsinki submission to the AmericasNLP 2023 shared task O De Gibert, R Vázquez, M Aulamo, Y Scherrer, S Virpioja, J Tiedemann Proceedings of the Workshop on Natural Language Processing for Indigenous …, 2023 | 7 | 2023 |
Opuscleaner and opustrainer, open source toolkits for training machine translation and large language models N Bogoychev, J van der Linde, G Nail, B Haddow, J Zaragoza-Bernabeu, ... arXiv preprint arXiv:2311.14838, 2023 | 5 | 2023 |
Annotation of subtitle paraphrases using a new web tool MJ Aulamo, MJP Creutz, EI Sjöblom Digital Humanities in the Nordic Countries, 33-48, 2019 | 4 | 2019 |
Hybrid distillation from RBMT and NMT: Helsinki-NLP’s submission to the Shared Task on Translation into Low-Resource Languages of Spain O De Gibert, M Aulamo, Y Scherrer, J Tiedemann Proceedings of the Ninth Conference on Machine Translation, 908-917, 2024 | 2 | 2024 |
Open Translation Models, Tools and Services J Tiedemann, M Aulamo, S Hardwick, T Nieminen European Language Grid: A Language Technology Platform for Multilingual …, 2022 | 2 | 2022 |
HPLT's first release of data and models N Arefyev, M Aulamo, P Chen, ODG Bonet, B Haddow, J Helcl, B Malik, ... The 25th Annual Conference of The European Association for Machine …, 2024 | 1 | 2024 |
The University of Helsinki submission to the IWSLT2020 Offline Speech Translation Task R Vázquez, M Aulamo, U Sulubacak, J Tiedemann International Conference on Spoken Language Translation, 95-102, 2020 | 1 | 2020 |
Using POS n-grams to detect grammatical errors in Finnish text M Aulamo Magistritöö. Helsingi Ülikool, 2019 | 1 | 2019 |
Hybrid distillation from RBMT and NMT: Helsinki-NLP’s submission to the Shared Task on Translation into Low-Resource Languages of Spain O De Gibert Bonet, M Aulamo, Y Scherrer, J Tiedemann | | 2024 |
Hybrid distillation from RBMT and NMT: Helsinki-NLP’s submission to the Shared Task on Translation into Low-Resource Languages of Spain ODG Bonet, M Aulamo, Y Scherrer, J Tiedemann Hybrid distillation from RBMT and NMT: Helsinki-NLP’s submission to the …, 2024 | | 2024 |