Low-resource expressive text-to-speech using data augmentation G Huybrechts, T Merritt, G Comini, B Perz, R Shah, J Lorenzo-Trueba ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 73 | 2021 |
Traditional machine learning for pitch detection T Drugman, G Huybrechts, V Klimkov, A Moinet IEEE Signal Processing Letters 25 (11), 1745-1749, 2018 | 42 | 2018 |
Voice conversion for whispered speech synthesis M Cotescu, T Drugman, G Huybrechts, J Lorenzo-Trueba, A Moinet IEEE Signal Processing Letters 27, 186-190, 2019 | 36 | 2019 |
Non-autoregressive TTS with explicit duration modelling for low-resource highly expressive speech R Shah, K Pokora, A Ezzerg, V Klimkov, G Huybrechts, B Putrycz, ... arXiv preprint arXiv:2106.12896, 2021 | 31 | 2021 |
Cross-speaker style transfer for text-to-speech using data augmentation MS Ribeiro, J Roth, G Comini, G Huybrechts, A Gabryś, J Lorenzo-Trueba ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 27 | 2022 |
Voice filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module A Gabryś, G Huybrechts, MS Ribeiro, CM Chien, J Roth, G Comini, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 25 | 2022 |
Emocat: Language-agnostic emotional voice conversion B Schnell, G Huybrechts, B Perz, T Drugman, J Lorenzo-Trueba arXiv preprint arXiv:2101.05695, 2021 | 15 | 2021 |
Varying speaking styles with neural textto-speech T Wood, T Merritt Alexa Blogs, Nov 19, 2018 | 12 | 2018 |
Learning to rank with deep neural networks G Huybrechts, P Dupont Master’s thesis, Ecole polytechnique de Louvain (EPL), 2016 | 7* | 2016 |
Dynamic chunk convolution for unified streaming and non-streaming conformer asr X Li, G Huybrechts, S Ronanki, J Farris, S Bodapati ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 6 | 2023 |
Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation G Comini, G Huybrechts, MS Ribeiro, A Gabrys, J Lorenzo-Trueba arXiv preprint arXiv:2207.14607, 2022 | 5 | 2022 |
SpeechGuard: Exploring the adversarial robustness of multimodal large language models R Peri, SM Jayanthi, S Ronanki, A Bhatia, K Mundnich, S Dingliwal, ... arXiv preprint arXiv:2405.08317, 2024 | 2 | 2024 |
Voice adaptation using synthetic speech processing AM Gabrys, JL Trueba, GS Huybrechts US Patent 11,915,683, 2024 | 2 | 2024 |
DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer G Huybrechts, S Ronanki, X Li, H Nosrati, S Bodapati, K Kirchhoff arXiv preprint arXiv:2306.08175, 2023 | 1 | 2023 |
Zero-resource speech translation and recognition with LLMs K Mundnich, X Niu, P Mathur, S Ronanki, B Houston, VR Elluru, N Das, ... arXiv preprint arXiv:2412.18566, 2024 | | 2024 |
Adaptive Video Understanding Agent: Enhancing efficiency with dynamic frame sampling and feedback-driven reasoning S Jeoung, G Huybrechts, B Ganesh, A Galstyan, S Bodapati arXiv preprint arXiv:2410.20252, 2024 | | 2024 |
Revisiting convolution-free Transformer for speech recognition Z Hou, G Huybrechts, A Bhatia, D Garcia-Romero, K Han, K Kirchhoff | | 2024 |