Speech/music classification using visual and spectral chromagram features
Automatic speech/music classification is an important tool in multimedia content analysis
and retrieval which efficiently categorizes input audio and store it into relevant classes. This …
and retrieval which efficiently categorizes input audio and store it into relevant classes. This …
Speech and music classification using spectrogram based statistical descriptors and extreme learning machine
GK Birajdar, MD Patil - Multimedia tools and applications, 2019 - Springer
This article proposes a novel feature extraction approach for speech/music classification
based on generalized Gaussian distribution descriptors extracted from IIR-CQT spectrogram …
based on generalized Gaussian distribution descriptors extracted from IIR-CQT spectrogram …
Enhanced audio classification leveraging pre-trained deep visual models
The differentiation between speech and music poses a prevalent issue in audio analytic,
specifically in dividing audio streams into segments and accurately labeling them as either …
specifically in dividing audio streams into segments and accurately labeling them as either …
Hilbert Spectrum based features for speech/music classification
Abstract Automatic Speech/Music classification uses different signal processing techniques
to categorize multimedia content into different classes. The proposed work explores Hilbert …
to categorize multimedia content into different classes. The proposed work explores Hilbert …
Speech/music discrimination for analysis of radio stations
S Kacprzak, B Chwiećko… - … International conference on …, 2017 - ieeexplore.ieee.org
A computationally efficient feature, called Minimum Energy Density (MED) was applied to
discriminate audio signals between speech and music in the radio stations programs. The …
discriminate audio signals between speech and music in the radio stations programs. The …
Monitoring of audio visual quality by key indicators: Detection of selected audio and audiovisual artefacts
IB Fernández, M Leszczuk - Multimedia Tools and Applications, 2018 - Springer
Over 10 billion hours of video are watched online every month. Together with high definition
television broadcasting and the rise in high quality video on demand, this makes quality …
television broadcasting and the rise in high quality video on demand, this makes quality …
Speech/Music Discrimination with High Accuracy Based on Deep Belief Network
W TIAN - Journal of Jishou University (Natural Sciences Edition), 2017 - zkxb.jsu.edu.cn
Application of deep belief network in speech/music discrimination is studied. According to
the characteristics that medium-high frequency energy of speech is lower than that of music …
the characteristics that medium-high frequency energy of speech is lower than that of music …
[PDF][PDF] Speech/music classification using visual and spectral chromagram
GK Birajdar, MD Patil - 2019 - academia.edu
Automatic speech/music classification is an important tool in multimedia content analysis
and retrieval which efficiently categorizes input audio and store it into relevant classes. This …
and retrieval which efficiently categorizes input audio and store it into relevant classes. This …
Detection of Lip Synchronization Artifacts
IB Fernández, M Leszczuk - … , MCSS 2015, Kraków, Poland, November 24 …, 2015 - Springer
Over 10 billion hours of video are watched each month on the Internet, what, together with
high definition television broadcasting and the rise in high quality video on demand makes …
high definition television broadcasting and the rise in high quality video on demand makes …
[PDF][PDF] 语音/音乐的深度置信网络高准确度识别方法
田旺兰 - 吉首大学学报 (自然科学版), 2017 - zkxb.jsu.edu.cn
语音/音乐的深度置信网络高准确度识别方法 Page 1 第38卷第1期 吉首大学学报(自然科学版)
Vol.38 No.1 2017年1月 JournalofJishouUniversity(NaturalScienceEdition) Jan.2017 文章 …
Vol.38 No.1 2017年1月 JournalofJishouUniversity(NaturalScienceEdition) Jan.2017 文章 …