Speech and speaker recognition using raw waveform modeling for adult and children's speech: A comprehensive review

K Radha, M Bansal, RB Pachori - Engineering Applications of Artificial …, 2024 - Elsevier
Conventionally, the extraction of hand-crafted acoustic features has been separated from the
task of establishing robust machine-learning models in speech processing. The manual …

Modeling Suprasegmental Information Using Finite Difference Network for End-to-End Speaker Verification

J Li, MW Mak, N Yan, L Wang - 2023 Asia Pacific Signal and …, 2023 - ieeexplore.ieee.org
In recent years, using raw waveforms as input to deep networks has been widely explored
for speaker verification systems that process speech signals at the segmental level. A critical …