Panns: Large-scale pretrained audio neural networks for audio pattern recognition Q Kong, Y Cao, T Iqbal, Y Wang, W Wang, MD Plumbley IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2880-2894, 2020 | 1282 | 2020 |
Large-scale weakly supervised audio classification using gated convolutional neural network Y Xu, Q Kong, W Wang, MD Plumbley 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 265 | 2018 |
Wavcaps: A chatgpt-assisted weakly-labelled audio captioning dataset for audio-language multimodal research X Mei, C Meng, H Liu, Q Kong, T Ko, C Zhao, MD Plumbley, Y Zou, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 156 | 2024 |
Sound event detection of weakly labelled data with cnn-transformer and automatic threshold optimization Q Kong, Y Xu, W Wang, MD Plumbley IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2450-2460, 2020 | 149 | 2020 |
Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy Y Cao, Q Kong, T Iqbal, F An, W Wang, MD Plumbley Workshop on Detection and Classification of Acoustic Scenes and Events, 30-34, 2019 | 138 | 2019 |
High-resolution piano transcription with pedals by regressing onset and offset times Q Kong, B Li, X Song, Y Wan, Y Wang IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3707-3717, 2021 | 132 | 2021 |
Sound event detection and time–frequency segmentation from weakly labelled data Q Kong, Y Xu, I Sobieraj, W Wang, MD Plumbley IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (4), 777-787, 2019 | 132 | 2019 |
Audioldm 2: Learning holistic audio generation with self-supervised pretraining H Liu, Y Yuan, X Liu, X Mei, Q Kong, Q Tian, Y Wang, W Wang, Y Wang, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 128 | 2024 |
Audio set classification with attention model: A probabilistic perspective Q Kong, Y Xu, W Wang, MD Plumbley 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 124 | 2018 |
Convolutional gated recurrent neural network incorporating spatial features for audio tagging Y Xu, Q Kong, Q Huang, W Wang, MD Plumbley 2017 International Joint Conference on Neural Networks (IJCNN), 3461-3466, 2017 | 121 | 2017 |
Giantmidi-piano: A large-scale midi dataset for classical piano music Q Kong, B Li, J Chen, Y Wang arXiv preprint arXiv:2010.07061, 2020 | 116 | 2020 |
Decoupling magnitude and phase estimation with deep resunet for music source separation Q Kong, Y Cao, H Liu, K Choi, Y Wang arXiv preprint arXiv:2109.05418, 2021 | 113 | 2021 |
Weakly labelled audioset tagging with attention neural networks Q Kong, C Yu, Y Xu, T Iqbal, W Wang, MD Plumbley IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (11 …, 2019 | 100 | 2019 |
Cross-task learning for audio tagging, sound event detection and spatial localization: DCASE 2019 baseline systems Q Kong, Y Cao, T Iqbal, Y Xu, W Wang, MD Plumbley arXiv preprint arXiv:1904.03476, 2019 | 99 | 2019 |
Deep Neural Network Baseline for DCASE Challenge 2016. Q Kong, I Sobieraj, W Wang, MD Plumbley DCASE, 50-54, 2016 | 87 | 2016 |
Multi-level attention model for weakly supervised audio classification C Yu, KS Barsim, Q Kong, B Yang Workshop on the Detection and Classification of Acoustic Scenes and Events …, 2018 | 84 | 2018 |
Attention-based convolutional neural networks for acoustic scene classification Z Ren, Q Kong, K Qian, MD Plumbley, B Schuller Workshop on the Detection and Classification of Acoustic Scenes and Events …, 2018 | 78 | 2018 |
Audio for audio is better? An investigation on transfer learning models for heart sound classification T Koike, K Qian, Q Kong, MD Plumbley, BW Schuller, Y Yamamoto 2020 42nd Annual International Conference of the IEEE Engineering in …, 2020 | 76 | 2020 |
An improved event-independent network for polyphonic sound event localization and detection Y Cao, T Iqbal, Q Kong, F An, W Wang, MD Plumbley ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 74 | 2021 |
Attention-based atrous convolutional neural networks: Visualisation and understanding perspectives of acoustic scenes Z Ren, Q Kong, J Han, MD Plumbley, BW Schuller ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 74 | 2019 |