[HTML][HTML] Improving learning-based birdsong classification by utilizing combined audio augmentation strategies

AS Kumar, T Schlosser, S Kahl, D Kowerko - Ecological Informatics, 2024 - Elsevier
In ecology, changes in environmental conditions are often closely linked to shifts in species
diversity. This relationship can be investigated by analyzing avian vocalizations, which are …

Cross-corpora spoken language identification with domain diversification and generalization

S Dey, M Sahidullah, G Saha - Computer Speech & Language, 2023 - Elsevier
This work addresses the cross-corpora generalization issue for the low-resourced spoken
language identification (LID) problem. We have conducted the experiments in the context of …

Automated data augmentation for audio classification

Y Sun, K Xu, C Liu, Y Dou, H Wang… - … /ACM Transactions on …, 2024 - ieeexplore.ieee.org
Audio classification is a challenging task that requires categorizing audio data based on its
content or characteristics. Existing approaches for audio classification rely either on …

Automatic audio augmentation for requests sub-challenge

Y Sun, K Xu, C Liu, Y Dou, K Qian - Proceedings of the 31st ACM …, 2023 - dl.acm.org
This paper presents our solution for the Requests Sub-challenge of the ACM Multimedia
2023 Computational Paralinguistics Challenge. Drawing upon the framework of self …

Italian speech emotion recognition

I Mantegazza, S Ntalampiras - 2023 24th International …, 2023 - ieeexplore.ieee.org
Affective computing is gaining increased interest by the scientific community in the last
decades with the acoustic modality playing a central role. This paper presents an extensive …

Strumming in the Metaverse: A Deep-Learning-Enabled Virtual Air Guitar System in VR with Enhanced Chord Recognition and Simulated Pedal Effects

YZ Hsieh, JJ Lin, MC Su, WJ Lin - IEEE Transactions on …, 2025 - ieeexplore.ieee.org
Virtual reality (VR) is increasingly capable and inexpensive, and VR devices have become
indispensable in many domains, such as gaming, videoconferencing, education, and …

[PDF][PDF] Noisy Web Supervision for Audio Classification

T Iqbal - 2022 - openresearch.surrey.ac.uk
Audio classification and other fields of pattern recognition have developed at an astounding
pace due to advances in machine learning. The availability of training data, especially …