Volgen
Aku Rouhe
Aku Rouhe
PhD Student, Aalto University
Geverifieerd e-mailadres voor aalto.fi
Titel
Geciteerd door
Geciteerd door
Jaar
SpeechBrain: A general-purpose speech toolkit
M Ravanelli, T Parcollet, P Plantinga, A Rouhe, S Cornell, L Lugosch, ...
arXiv preprint arXiv:2106.04624, 2021
7932021
Samuele Cornell
M Ravanelli, T Parcollet, P Plantinga, A Rouhe
Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan …, 2021
1352021
Multimodal machine translation through visuals and speech
U Sulubacak, O Caglayan, SA Grönroos, A Rouhe, D Elliott, L Specia, ...
Machine Translation 34, 97-147, 2020
882020
Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, et al. 2021. Speechbrain: A general-purpose speech toolkit
M Ravanelli, T Parcollet, P Plantinga, A Rouhe
arXiv preprint arXiv:2106.04624, 1-34, 2021
312021
Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks
A Moisio, D Porjazovski, A Rouhe, Y Getman, A Virkkunen, R AlGhezi, ...
Language Resources and Evaluation 57 (3), 1295-1327, 2023
262023
SpeechBrain: A general-purpose speech toolkit. arXiv 2021
M Ravanelli, T Parcollet, P Plantinga, A Rouhe, S Cornell, L Lugosch, ...
arXiv preprint arXiv:2106.04624, 0
24
SpeechBrain: a general-purpose speech toolkit. arXiv
M Ravanelli, T Parcollet, P Plantinga, A Rouhe, S Cornell, L Lugosch, ...
arXiv preprint arXiv:2106.04624 10, 2021
222021
Speechbrain
M Ravanelli, T Parcollet, A Rouhe, P Plantinga, E Rastorgueva, ...
GitHub repository, 2021
212021
SpeechBrain: a general-purpose speech toolkit (2021)
M Ravanelli, T Parcollet, P Plantinga, A Rouhe, S Cornell, L Lugosch, ...
arXiv preprint arXiv:2106.04624, 2022
192022
Self-supervised end-to-end ASR for low resource L2 Swedish
R Al-Ghezi, Y Getman, A Rouhe, R Hildén, M Kurimo
Annual Conference of the International Speech Communication Association …, 2021
192021
Finnish ASR with deep transformer models
A Jain, A Rouhe, SA Grönroos, M Kurimo
Interspeech 2020, 3630-3634, 2020
182020
Finnish parliament ASR corpus: Analysis, benchmarks and statistics
A Virkkunen, A Rouhe, N Phan, M Kurimo
Language Resources and Evaluation 57 (4), 1645-1670, 2023
172023
Digitala: An augmented test and review process prototype for high-stakes spoken foreign language examination
R Karhila, A Rouhe, P Smit, A Mansikkaniemi, H Kallio, E Lindroos, ...
Interspeech, 784-785, 2016
172016
Open-source conversational ai with speechbrain 1.0
M Ravanelli, T Parcollet, A Moumen, S de Langen, C Subakan, ...
Journal of Machine Learning Research 25 (333), 1-11, 2024
112024
Low resource comparison of attention-based and hybrid ASR exploiting wav2vec 2.0
A Rouhe, A Virkkunen, J Leinonen, M Kurimo
Interspeech, 3543-3547, 2022
82022
Samuele Cornell, Sung-Lin Yeh, Hwidong Na, Yan Gao, Szu-Wei Fu, Cem Subakan, Renato De Mori, and Yoshua Bengio. Speechbrain
M Ravanelli, T Parcollet, A Rouhe, P Plantinga, E Rastorgueva, ...
82021
Speaker-aware training of attention-based end-to-end speech recognition using neural speaker embeddings
A Rouhe, T Kaseva, M Kurimo
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
72020
An equal data setting for attention-based encoder-decoder and HMM/DNN models: A case study in Finnish ASR
A Rouhe, A Van Camp, M Singh, H Van Hamme, M Kurimo
Speech and Computer: 23rd International Conference, SPECOM 2021, St …, 2021
62021
Spherediar: An effective speaker diarization system for meeting data
T Kaseva, A Rouhe, M Kurimo
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
62019
Principled comparisons for end-to-end speech recognition: Attention vs hybrid at the 1000-hour scale
A Rouhe, T Grósz, M Kurimo
IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 623-638, 2023
52023
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20