Aku Rouhe

Geciteerd door

	Alles	Sinds 2020
Citaties	1308	1295
h-index	13	12
i10-index	14	14

480

240

120

360

2017201820192020202120222023202420254 5 3 9 66 273 430 469 42

Openbare toegang

Alles bekijken

15 artikelen

0 artikelen

beschikbaar

niet beschikbaar

Op basis van financieringsmachtigingen

Volgen

Aku Rouhe

PhD Student, Aalto University

Geverifieerd e-mailadres voor aalto.fi

speech recognition


Titel Sorteren op citaties Sorteren op jaar Sorteren op titel	Geciteerd door Geciteerd door	Jaar
SpeechBrain: A general-purpose speech toolkit M Ravanelli, T Parcollet, P Plantinga, A Rouhe, S Cornell, L Lugosch, ... arXiv preprint arXiv:2106.04624, 2021	793	2021
Samuele Cornell M Ravanelli, T Parcollet, P Plantinga, A Rouhe Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan …, 2021	135	2021
Multimodal machine translation through visuals and speech U Sulubacak, O Caglayan, SA Grönroos, A Rouhe, D Elliott, L Specia, ... Machine Translation 34, 97-147, 2020	88	2020
Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, et al. 2021. Speechbrain: A general-purpose speech toolkit M Ravanelli, T Parcollet, P Plantinga, A Rouhe arXiv preprint arXiv:2106.04624, 1-34, 2021	31	2021
Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks A Moisio, D Porjazovski, A Rouhe, Y Getman, A Virkkunen, R AlGhezi, ... Language Resources and Evaluation 57 (3), 1295-1327, 2023	26	2023
SpeechBrain: A general-purpose speech toolkit. arXiv 2021 M Ravanelli, T Parcollet, P Plantinga, A Rouhe, S Cornell, L Lugosch, ... arXiv preprint arXiv:2106.04624, 0	24
SpeechBrain: a general-purpose speech toolkit. arXiv M Ravanelli, T Parcollet, P Plantinga, A Rouhe, S Cornell, L Lugosch, ... arXiv preprint arXiv:2106.04624 10, 2021	22	2021
Speechbrain M Ravanelli, T Parcollet, A Rouhe, P Plantinga, E Rastorgueva, ... GitHub repository, 2021	21	2021
SpeechBrain: a general-purpose speech toolkit (2021) M Ravanelli, T Parcollet, P Plantinga, A Rouhe, S Cornell, L Lugosch, ... arXiv preprint arXiv:2106.04624, 2022	19	2022
Self-supervised end-to-end ASR for low resource L2 Swedish R Al-Ghezi, Y Getman, A Rouhe, R Hildén, M Kurimo Annual Conference of the International Speech Communication Association …, 2021	19	2021
Finnish ASR with deep transformer models A Jain, A Rouhe, SA Grönroos, M Kurimo Interspeech 2020, 3630-3634, 2020	18	2020
Finnish parliament ASR corpus: Analysis, benchmarks and statistics A Virkkunen, A Rouhe, N Phan, M Kurimo Language Resources and Evaluation 57 (4), 1645-1670, 2023	17	2023
Digitala: An augmented test and review process prototype for high-stakes spoken foreign language examination R Karhila, A Rouhe, P Smit, A Mansikkaniemi, H Kallio, E Lindroos, ... Interspeech, 784-785, 2016	17	2016
Open-source conversational ai with speechbrain 1.0 M Ravanelli, T Parcollet, A Moumen, S de Langen, C Subakan, ... Journal of Machine Learning Research 25 (333), 1-11, 2024	11	2024
Low resource comparison of attention-based and hybrid ASR exploiting wav2vec 2.0 A Rouhe, A Virkkunen, J Leinonen, M Kurimo Interspeech, 3543-3547, 2022	8	2022
Samuele Cornell, Sung-Lin Yeh, Hwidong Na, Yan Gao, Szu-Wei Fu, Cem Subakan, Renato De Mori, and Yoshua Bengio. Speechbrain M Ravanelli, T Parcollet, A Rouhe, P Plantinga, E Rastorgueva, ...	8	2021
Speaker-aware training of attention-based end-to-end speech recognition using neural speaker embeddings A Rouhe, T Kaseva, M Kurimo ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	7	2020
An equal data setting for attention-based encoder-decoder and HMM/DNN models: A case study in Finnish ASR A Rouhe, A Van Camp, M Singh, H Van Hamme, M Kurimo Speech and Computer: 23rd International Conference, SPECOM 2021, St …, 2021	6	2021
Spherediar: An effective speaker diarization system for meeting data T Kaseva, A Rouhe, M Kurimo 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	6	2019
Principled comparisons for end-to-end speech recognition: Attention vs hybrid at the 1000-hour scale A Rouhe, T Grósz, M Kurimo IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 623-638, 2023	5	2023

Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.

Artikelen 1–20

Citaties per jaar

Dubbele citaties

Samengevoegde citaties

Medeauteurs toevoegenMedeauteurs

Volgen

Geciteerd door