Time-frequency visual representation and texture features for audio applications: a comprehensive review, recent trends, and challenges
YD Mistry, GK Birajdar, AM Khodke - Multimedia Tools and Applications, 2023 - Springer
The conventional audio feature extraction methods employed in the audio analysis are
categorized into time-domain and frequency-domain. Recently, a new audio feature …
categorized into time-domain and frequency-domain. Recently, a new audio feature …
The sustainable development of intangible cultural heritage with AI: Cantonese opera singing genre classification based on CoGCNet model in China
Q Chen, W Zhao, Q Wang, Y Zhao - Sustainability, 2022 - mdpi.com
Chinese Cantonese opera, a UNESCO Intangible Cultural Heritage (ICH) of Humanity, has
faced a series of development problems due to diversified entertainment and emerging …
faced a series of development problems due to diversified entertainment and emerging …
Speech/music classification using visual and spectral chromagram features
Automatic speech/music classification is an important tool in multimedia content analysis
and retrieval which efficiently categorizes input audio and store it into relevant classes. This …
and retrieval which efficiently categorizes input audio and store it into relevant classes. This …
Stacked auto-encoders based visual features for speech/music classification
With the rapid rise of online available content, multimedia signal processing has become an
important area of research. The output of the speech/music classifier (SMC) is further used …
important area of research. The output of the speech/music classifier (SMC) is further used …
Indian language identification using time-frequency image textural descriptors and GWO-based feature selection
AA Chowdhury, VS Borkar… - Journal of Experimental & …, 2020 - Taylor & Francis
An ability to categorise and recognise a spoken language is an essential task in a multi-
lingual society like India. Language identification (LID) is the process of identifying the …
lingual society like India. Language identification (LID) is the process of identifying the …
Automatic tuning of radio stations based on listener's preference using Software Defined Radio and MATLAB
This work introduces a real-time system to automate the selection of radio stations based on
the listener's preference (either speech/music) by analyzing the incoming audio signals …
the listener's preference (either speech/music) by analyzing the incoming audio signals …
Enhanced audio classification leveraging pre-trained deep visual models
The differentiation between speech and music poses a prevalent issue in audio analytic,
specifically in dividing audio streams into segments and accurately labeling them as either …
specifically in dividing audio streams into segments and accurately labeling them as either …
Empirical mode decomposition based statistical features for discrimination of speech and low frequency music signal
This work aims to investigate the significance of different Empirical Mode Decomposition
(EMD) based statistical features for discrimination of speech and low frequency music signal …
(EMD) based statistical features for discrimination of speech and low frequency music signal …
[Retracted] Music Classification and Detection of Location Factors of Feature Words in Complex Noise Environment
Y Xu, Q Li - Complexity, 2021 - Wiley Online Library
In order to solve the problem of the influence of feature word position in lyrics on music
emotion classification, this paper designs a music classification and detection model in …
emotion classification, this paper designs a music classification and detection model in …
Spectrogram image textural descriptors for lung sound classification
Respiratory sound samples unveil key information about the patient's lung condition.
According to the World Health Organization, respiratory diseases are one of the major …
According to the World Health Organization, respiratory diseases are one of the major …