A Wavenet for speech denoising D Rethage, J Pons, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018 | 562 | 2018 |
Fsd50k: an open dataset of human-labeled sound events E Fonseca, X Favory, J Pons, F Font, X Serra IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 829-852, 2021 | 526 | 2021 |
Freesound Datasets: a platform for the creation of open audio datasets E Fonseca, J Pons, X Favory, F Font, D Bogdanov, A Ferraro, S Oramas, ... International Society for Music Information Retrieval Conference (ISMIR), 2017 | 285 | 2017 |
End-to-end learning for music audio tagging at scale J Pons, O Nieto, M Prockup, E Schmidt, A Ehmann, X Serra International Society for Music Information Retrieval Conference (ISMIR), 2018 | 256 | 2018 |
General-purpose tagging of freesound audio with audioset labels: Task description, dataset, and baseline E Fonseca, M Plakal, F Font, DPW Ellis, X Favory, J Pons, X Serra DCASE Workshop, 2018 | 211 | 2018 |
Experimenting with musically motivated convolutional neural networks J Pons, T Lidy, X Serra International Workshop on Content-Based Multimedia Indexing (CBMI), 1-6, 2016 | 202 | 2016 |
Timbre analysis of music audio signals with convolutional neural networks J Pons, O Slizovskaia, E Gómez Gutiérrez, X Serra European Signal Processing Conference (EUSIPCO), 2813-7, 2017 | 172 | 2017 |
Randomly weighted CNNs for (music) audio classification J Pons, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019 | 127 | 2019 |
MusiCNN: pre-trained convolutional neural networks for music audio tagging J Pons, X Serra Late breaking/demo session of the International Society for Music …, 2019 | 122 | 2019 |
End-to-end music source separation: Is it possible in the waveform domain? F Lluís, J Pons, X Serra arXiv preprint arXiv:1810.12187, 2018 | 96 | 2018 |
Universal speech enhancement with score-based diffusion J Serrà, S Pascual, J Pons, RO Araz, D Scaini arXiv preprint arXiv:2206.03065, 2022 | 95 | 2022 |
Fast timing-conditioned latent audio diffusion Z Evans, CJ Carr, J Taylor, SH Hawley, J Pons Forty-first International Conference on Machine Learning, 2024 | 87 | 2024 |
Designing efficient architectures for modeling temporal features with convolutional neural networks J Pons, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017 | 84 | 2017 |
Upsampling artifacts in neural audio synthesis J Pons, S Pascual, G Cengarle, J Serrà ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 80 | 2021 |
Training neural audio classifiers with few data J Pons, J Serrà, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019 | 79 | 2019 |
Remixing music using source separation algorithms to improve the musical experience of cochlear implant users J Pons, J Janer, T Rode, W Nogueira The Journal of the Acoustical Society of America 140 (6), 4338-4349, 2016 | 73 | 2016 |
Automatic multitrack mixing with a differentiable mixing console of neural audio effects CJ Steinmetz, J Pons, S Pascual, J Serrà ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 65 | 2021 |
An empirical study of Conv-TasNet B Kadioglu, M Horgan, X Liu, J Pons, D Darcy, V Kumar International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020 | 60* | 2020 |
SESQA: semi-supervised learning for speech quality assessment J Serrà, J Pons, S Pascual ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 56 | 2021 |
On automatic drum transcription using non-negative matrix deconvolution and itakura saito divergence A Roebel, J Pons, M Liuni, M Lagrange International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015 | 43 | 2015 |