Multi-stream acoustic modelling using raw real and imaginary parts of the Fourier transform

E Loweimi, Z Yue, P Bell, S Renals… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
In this paper, we investigate multi-stream acoustic modelling using the raw real and
imaginary parts of the Fourier transform of speech signals. Using the raw magnitude …

Wearable electrocardiogram quality assessment using wavelet scattering and LSTM

F Liu, S **a, S Wei, L Chen, Y Ren, X Ren, Z Xu… - Frontiers in …, 2022 - frontiersin.org
As the fast development of wearable devices and Internet of things technologies, real-time
monitoring of ECG signals is quite critical for cardiovascular diseases. However, dynamic …

[HTML][HTML] Noise-Robust Radar High-Resolution Range Profile Target Recognition Based on Residual Scattering Attention Network

P Huang, S Li, W Li, M Zheng, B Tian, S Xu - Electronics, 2024 - mdpi.com
In recent years, radar automatic target recognition (RATR) utilizing high-resolution range
profiles (HRRPs) has received significant attention. Approaches based on deep learning …

Wavelet scattering transform for improving generalization in low-resourced spoken language identification

S Dey, P Singh, G Saha - arxiv preprint arxiv:2310.00602, 2023 - arxiv.org
Commonly used features in spoken language identification (LID), such as mel-spectrogram
or MFCC, lose high-frequency information due to windowing. The loss further increases for …

Voiceprint Recognition under Cross-Scenario Conditions Using Perceptual Wavelet Packet Entropy-Guided Efficient-Channel-Attention–Res2Net–Time-Delay-Neural …

S Wang, H Zhang, X Zhang, Y Su, Z Wang - Mathematics, 2023 - mdpi.com
(1) Background: Voiceprint recognition technology uses individual vocal characteristics for
identity authentication and faces many challenges in cross-scenario applications. The sound …

[HTML][HTML] Mandarin Recognition Based on Self-Attention Mechanism with Deep Convolutional Neural Network (DCNN)-Gated Recurrent Unit (GRU)

X Chen, C Wang, C Hu, Q Wang - Big Data and Cognitive Computing, 2024 - mdpi.com
Speech recognition technology is an important branch in the field of artificial intelligence,
aiming to transform human speech into computer-readable text information. However …

Towards Robust Waveform-Based Acoustic Models

D Oglic, Z Cvetkovic, P Sollich… - … /ACM Transactions on …, 2022 - ieeexplore.ieee.org
We study the problem of learning robust acoustic models in adverse environments,
characterized by a significant mismatch between training and test conditions. This problem …

Learning waveform-based acoustic models using deep variational convolutional neural networks

D Oglic, Z Cvetkovic, P Sollich - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org
We investigate the potential of stochastic neural networks for learning effective waveform-
based acoustic models. The waveform-based setting, inherent to fully end-to-end speech …

Multiple Confidence Gates for Joint Training of SE and ASR

T Wang, W Zhu, Y Gao, J Feng, D Zhu, S Zhu… - National Conference on …, 2022 - Springer
Joint training of speech enhancement model (SE) and speech recognition model (ASR) is a
common solution for robust ASR in noisy environments. SE focuses on improving the …

Noise-Robust HRRP Target Recognition Based on Residual Scattering Network

P Huang, S Li, M Zheng, J **e… - 2024 9th International …, 2024 - ieeexplore.ieee.org
Extracting noise-robust features is a key issue for high-resolution range profile (HRRP)
target recognition. In order to enhance the recognition performance under low signal-to …