Time-frequency visual representation and texture features for audio applications: a comprehensive review, recent trends, and challenges

YD Mistry, GK Birajdar, AM Khodke - Multimedia Tools and Applications, 2023 - Springer
The conventional audio feature extraction methods employed in the audio analysis are
categorized into time-domain and frequency-domain. Recently, a new audio feature …

The sustainable development of intangible cultural heritage with AI: Cantonese opera singing genre classification based on CoGCNet model in China

Q Chen, W Zhao, Q Wang, Y Zhao - Sustainability, 2022 - mdpi.com
Chinese Cantonese opera, a UNESCO Intangible Cultural Heritage (ICH) of Humanity, has
faced a series of development problems due to diversified entertainment and emerging …

Speech/music classification using visual and spectral chromagram features

GK Birajdar, MD Patil - Journal of Ambient Intelligence and Humanized …, 2020 - Springer
Automatic speech/music classification is an important tool in multimedia content analysis
and retrieval which efficiently categorizes input audio and stores it into relevant classes. This …

Stacked auto-encoders based visual features for speech/music classification

A Kumar, SS Solanki, M Chandra - Expert Systems with Applications, 2022 - Elsevier
With the rapid rise of online available content, multimedia signal processing has become an
important area of research. The output of the speech/music classifier (SMC) is further used …

Indian language identification using time-frequency image textural descriptors and GWO-based feature selection

AA Chowdhury, VS Borkar… - Journal of Experimental & …, 2020 - Taylor & Francis
An ability to categorise and recognise a spoken language is an essential task in a multi-
lingual society like India. Language identification (LID) is the process of identifying the …

Automatic tuning of radio stations based on listener's preference using Software Defined Radio and MATLAB

A Kumar, B Karan, SS Solanki, M Chandra… - … Applications of Artificial …, 2024 - Elsevier
This work introduces a real-time system to automate the selection of radio stations based on
the listener's preference (either speech/music) by analyzing the incoming audio signals …

Enhanced audio classification leveraging pre-trained deep visual models

A Kumar, R Kumar, M Chandra - Engineering Applications of Artificial …, 2025 - Elsevier
The differentiation between speech and music poses a prevalent issue in audio analytics,
specifically in dividing audio streams into segments and accurately labeling them as either …

Empirical mode decomposition based statistical features for discrimination of speech and low frequency music signal

A Kumar, M Chandra - Multimedia Tools and Applications, 2023 - Springer
This work aims to investigate the significance of different Empirical Mode Decomposition
(EMD) based statistical features for discrimination of speech and low frequency music signal …

[Retracted] Music Classification and Detection of Location Factors of Feature Words in Complex Noise Environment

Y Xu, Q Li - Complexity, 2021 - Wiley Online Library
In order to solve the problem of the influence of feature word position in lyrics on music
emotion classification, this paper designs a music classification and detection model in …

Spectrogram image textural descriptors for lung sound classification

B Kaushal, S Raveendran, MD Patil… - Machine learning and …, 2022 - taylorfrancis.com
Respiratory sound samples unveil key information about the patient's lung condition.
According to the World Health Organization, respiratory diseases are one of the major …