A survey of audio classification using deep learning
Deep learning can be used for audio signal classification in a variety of ways. It can be used
to detect and classify various types of audio signals such as speech, music, and …
to detect and classify various types of audio signals such as speech, music, and …
A hybrid feature-extracted deep CNN with reduced parameters substitutes an End-to-End CNN for the recognition of spoken Bengali digits
Speech Recognition (SR) is an emerging field in the native language nowadays.
Recognizing isolated words in the local language helps people use smartphones and …
Recognizing isolated words in the local language helps people use smartphones and …
A variational Bayesian approach to learning latent variables for acoustic knowledge transfer
We propose a variational Bayesian (VB) approach to learning distributions of latent
variables in deep neural network (DNN) models for cross-domain knowledge transfer, to …
variables in deep neural network (DNN) models for cross-domain knowledge transfer, to …
Speecheq: Speech emotion recognition based on multi-scale unified datasets and multitask learning
Z Kang, J Peng, J Wang, J **ao - arxiv preprint arxiv:2206.13101, 2022 - arxiv.org
Speech emotion recognition (SER) has many challenges, but one of the main challenges is
that each framework does not have a unified standard. In this paper, we propose SpeechEQ …
that each framework does not have a unified standard. In this paper, we propose SpeechEQ …
Boosting StarGANs for voice conversion with contrastive discriminator
Nonparallel multi-domain voice conversion methods such as the StarGAN-VCs have been
widely applied in many scenarios. However, the training of these models usually poses a …
widely applied in many scenarios. However, the training of these models usually poses a …
Information Bottleneck-Based Domain Adaptation for Hybrid Deep Learning in Scalable Network Slicing
Network slicing enables operators to efficiently support diverse applications on a shared
infrastructure. However, the evolving complexity of networks, compounded by inter-cell …
infrastructure. However, the evolving complexity of networks, compounded by inter-cell …
Instance-level loss based multiple-instance learning for acoustic scene classification
WG Choi, JH Chang, JM Yang, HG Moon - 2022 - osf.io
In acoustic scene classification (ASC) task, an acoustic scene consists of diverse attributes
and is inferred by identifying combinations of some distinct attributes among them. This …
and is inferred by identifying combinations of some distinct attributes among them. This …
Variational Bayesian Adaptive Learning of Deep Latent Variables for Acoustic Knowledge Transfer
In this work, we propose a novel variational Bayesian adaptive learning approach for cross-
domain knowledge transfer to address acoustic mismatches between training and testing …
domain knowledge transfer to address acoustic mismatches between training and testing …
Augmentation-induced consistency regularization for classification
Deep neural networks have become popular in many supervised learning tasks, but they
may suffer from overfitting when the training dataset is limited. To mitigate this, many …
may suffer from overfitting when the training dataset is limited. To mitigate this, many …
Uncertainty Calibration for Deep Audio Classifiers
Although deep Neural Networks (DNNs) have achieved tremendous success in audio
classification tasks, their uncertainty calibration are still under-explored. A well-calibrated …
classification tasks, their uncertainty calibration are still under-explored. A well-calibrated …