[PDF][PDF] Generative Adversarial Network Based Acoustic Scene Training Set Augmentation and Selection Using SVM Hyper-Plane.

S Mun, S Park, DK Han, H Ko - DCASE, 2017 - dcase.community
Although it is typically expected that using a large amount of labeled training data would
lead to improve performance in deep learning, it is generally difficult to obtain such …

[PDF][PDF] CP-JKU submissions for DCASE-2016: a hybrid approach using binaural i-vectors and deep convolutional neural networks

H Eghbal-Zadeh, B Lehner, M Dorfer… - IEEE AASP Challenge on …, 2016 - cp.jku.at
This report describes the 4 submissions for Task 1 (Audio scene classification) of the
DCASE-2016 challenge of the CP-JKU team. We propose 4 different approaches for Audio …

Automatic speaker age and gender recognition using acoustic and prosodic level information fusion

M Li, KJ Han, S Narayanan - Computer Speech & Language, 2013 - Elsevier
The paper presents a novel automatic speaker age and gender identification approach
which combines seven different methods at both acoustic and prosodic levels to improve the …

Voice pathology detection on the Saarbrücken voice database with calibration and fusion of scores using multifocal toolkit

D Martínez, E Lleida, A Ortega, A Miguel… - Advances in Speech and …, 2012 - Springer
The paper presents a set of experiments on pathological voice detection over the
Saarbrücken Voice Database (SVD) by using the MultiFocal toolkit for a discriminative …

Convolutional neural networks and x-vector embedding for DCASE2018 acoustic scene classification challenge

H Zeinali, L Burget, J Cernocky - arxiv preprint arxiv:1810.04273, 2018 - arxiv.org
In this paper, the Brno University of Technology (BUT) team submissions for Task 1
(Acoustic Scene Classification, ASC) of the DCASE-2018 challenge are described. Also, the …

Detection of coronary artery atherosclerotic disease using novel features from synchrosqueezing transform of phonocardiogram

A Pathak, P Samanta, K Mandana, G Saha - Biomedical Signal Processing …, 2020 - Elsevier
Objective Atherosclerotic coronary artery disease (CAD) detection through a simple, non-
invasive approach will be useful in point-of-care diagnosis. Though numerous studies have …

Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification

M Li, S Narayanan - Computer Speech & Language, 2014 - Elsevier
This paper presents a simplified and supervised i-vector modeling approach with
applications to robust and efficient language identification and speaker verification. First, by …

Deep joint learning for language recognition

L Li, Z Li, Y Liu, Q Hong - Neural Networks, 2021 - Elsevier
Deep learning methods for language recognition have achieved promising performance.
However, most of the studies focus on frameworks for single types of acoustic features and …

[PDF][PDF] Wavelet Transform Based Mel-scaled Features for Acoustic Scene Classification.

S Waldekar, G Saha - INTERSPEECH, 2018 - isca-archive.org
Acoustic scene classification (ASC) is an audio signal processing task where mel-scaled
spectral features are widely used by researchers. These features, considered de facto …

[PDF][PDF] Frequency Domain Linear Prediction Features for Replay Spoofing Attack Detection.

B Wickramasinghe, S Irtza, E Ambikairajah, J Epps - Interspeech, 2018 - researchgate.net
Automatic speaker verification (ASV) systems are vulnerable to various types of spoofing
attacks such as speech synthesis, voice conversion and replay attacks. Recent research has …