Noise robust automatic speaker verification systems: review and analysis

S Joshi, M Dua - Telecommunication Systems, 2024 - Springer
Like any other biometric systems, Automatic Speaker Verification (ASV) systems are also
vulnerable to the spoofing attacks. Hence, it is important to develop the countermeasures in …

Advances in phase-aware signal processing in speech communication

P Mowlaee, R Saeidi, Y Stylianou - Speech communication, 2016 - Elsevier
During the past three decades, the issue of processing spectral phase has been largely
neglected in speech applications. There is no doubt that the interest of speech processing …

[CARTE][B] Single channel phase-aware signal processing in speech communication: theory and practice

P Mowlaee, J Kulmer, J Stahl, F Mayer - 2016 - books.google.com
An overview on the challenging new topic of phase-aware signal processing Speech
communication technology is a key factor in human-machine interaction, digital hearing …

On learning interpretable CNNs with parametric modulated kernel-based filters

E Loweimi, P Bell, S Renals - Interspeech 2019, 2019 - research.ed.ac.uk
We investigate the problem of direct waveform modelling using parametric kernel-based
filters in a convolutional neural network (CNN) framework, building on SincNet, a CNN …

[HTML][HTML] Gaussian-filtered high-frequency-feature trained optimized bilstm network for spoofed-speech classification

H Mewada, JF Al-Asad, FA Almalki, AH Khan… - Sensors, 2023 - mdpi.com
Voice-controlled devices are in demand due to their hands-free controls. However, using
voice-controlled devices in sensitive scenarios like smartphone applications and financial …

Multi-encoder learning and stream fusion for transformer-based end-to-end automatic speech recognition

T Lohrenz, Z Li, T Fingscheidt - arxiv preprint arxiv:2104.00120, 2021 - arxiv.org
Stream fusion, also known as system combination, is a common technique in automatic
speech recognition for traditional hybrid hidden Markov model approaches, yet mostly …

[PDF][PDF] Interspeech 2014 special session: Phase importance in speech processing applications

P Mowlaee, R Saeidi, Y Stylanou - Proc. Interspeech, 2014 - Citeseer
In many speech processing applications, the spectral amplitude is the dominant information
while the use of phase spectrum is not so widely spread. In this paper, we present an …

Sparse modeling of magnitude and phase-derived spectra for playing technique classification

L Su, HM Lin, YH Yang - IEEE/ACM Transactions on Audio …, 2014 - ieeexplore.ieee.org
Computational modeling of musical timbre is important for a variety of music information
retrieval applications. While considerable progress has been made to recognize musical …

Dysarthric speech recognition, detection and classification using raw phase and magnitude spectra

Z Yue, E Loweimi, Z Cvetkovic - Proceedings of INTERSPEECH …, 2023 - kclpure.kcl.ac.uk
In this paper, we explore the effectiveness of deploying the raw phase and magnitude
spectra for dysarthric speech recognition, detection and classification. In particular, we …

Speech acoustic modelling from raw phase spectrum

E Loweimi, Z Cvetkovic, P Bell… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Magnitude spectrum-based features are the most widely employed front-ends for acoustic
modelling in automatic speech recognition (ASR) systems. In this paper, we investigate the …