- Academic Search

Generative Adversarial Network Based Acoustic Scene Training Set Augmentation and Selection Using SVM Hyper-Plane.

S Mun, S Park, DK Han, H Ko - DCASE, 2017 - dcase.community

Although it is typically expected that using a large amount of labeled training data would
lead to improve performance in deep learning, it is generally difficult to obtain such …

Cited by 198; 3 versions

CP-JKU submissions for DCASE-2016: a hybrid approach using binaural i-vectors and deep convolutional neural networks

H Eghbal-Zadeh, B Lehner, M Dorfer… - IEEE AASP Challenge on …, 2016 - cp.jku.at

This report describes the 4 submissions for Task 1 (Audio scene classification) of the
DCASE-2016 challenge of the CP-JKU team. We propose 4 different approaches for Audio …

Cited by 203; 3 versions

Automatic speaker age and gender recognition using acoustic and prosodic level information fusion

M Li, KJ Han, S Narayanan - Computer Speech & Language, 2013 - Elsevier

The paper presents a novel automatic speaker age and gender identification approach
which combines seven different methods at both acoustic and prosodic levels to improve the …

Cited by 241; 9 versions

Voice pathology detection on the Saarbrücken voice database with calibration and fusion of scores using multifocal toolkit

D Martínez, E Lleida, A Ortega, A Miguel… - Advances in Speech and …, 2012 - Springer

The paper presents a set of experiments on pathological voice detection over the
Saarbrücken Voice Database (SVD) by using the MultiFocal toolkit for a discriminative …

Cited by 131; 5 versions

Convolutional neural networks and x-vector embedding for DCASE2018 acoustic scene classification challenge

H Zeinali, L Burget, J Cernocky - arXiv preprint arXiv:1810.04273, 2018 - arxiv.org

In this paper, the Brno University of Technology (BUT) team submissions for Task 1
(Acoustic Scene Classification, ASC) of the DCASE-2018 challenge are described. Also, the …

Cited by 77; 3 versions

Detection of coronary artery atherosclerotic disease using novel features from synchrosqueezing transform of phonocardiogram

A Pathak, P Samanta, K Mandana, G Saha - Biomedical Signal Processing …, 2020 - Elsevier

Objective: Atherosclerotic coronary artery disease (CAD) detection through a simple,
non-invasive approach will be useful in point-of-care diagnosis. Though numerous studies have …

Cited by 31

Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification

M Li, S Narayanan - Computer Speech & Language, 2014 - Elsevier

This paper presents a simplified and supervised i-vector modeling approach with
applications to robust and efficient language identification and speaker verification. First, by …

Cited by 65; 7 versions

Deep joint learning for language recognition

L Li, Z Li, Y Liu, Q Hong - Neural Networks, 2021 - Elsevier

Deep learning methods for language recognition have achieved promising performance.
However, most of the studies focus on frameworks for single types of acoustic features and …

Cited by 19; 4 versions

Wavelet Transform Based Mel-scaled Features for Acoustic Scene Classification.

S Waldekar, G Saha - INTERSPEECH, 2018 - isca-archive.org

Acoustic scene classification (ASC) is an audio signal processing task where mel-scaled
spectral features are widely used by researchers. These features, considered de facto …

Cited by 32; 6 versions

Frequency Domain Linear Prediction Features for Replay Spoofing Attack Detection.

B Wickramasinghe, S Irtza, E Ambikairajah, J Epps - Interspeech, 2018 - researchgate.net

Automatic speaker verification (ASV) systems are vulnerable to various types of spoofing
attacks such as speech synthesis, voice conversion and replay attacks. Recent research has …

Cited by 31; 4 versions

Focal multi-class: Toolkit for evaluation, fusion and calibration of multi-class recognition...