Multi-stream acoustic modelling using raw real and imaginary parts of the Fourier transform
In this paper, we investigate multi-stream acoustic modelling using the raw real and
imaginary parts of the Fourier transform of speech signals. Using the raw magnitude …
imaginary parts of the Fourier transform of speech signals. Using the raw magnitude …
Wearable electrocardiogram quality assessment using wavelet scattering and LSTM
F Liu, S **a, S Wei, L Chen, Y Ren, X Ren, Z Xu… - Frontiers in …, 2022 - frontiersin.org
As the fast development of wearable devices and Internet of things technologies, real-time
monitoring of ECG signals is quite critical for cardiovascular diseases. However, dynamic …
monitoring of ECG signals is quite critical for cardiovascular diseases. However, dynamic …
[HTML][HTML] Noise-Robust Radar High-Resolution Range Profile Target Recognition Based on Residual Scattering Attention Network
P Huang, S Li, W Li, M Zheng, B Tian, S Xu - Electronics, 2024 - mdpi.com
In recent years, radar automatic target recognition (RATR) utilizing high-resolution range
profiles (HRRPs) has received significant attention. Approaches based on deep learning …
profiles (HRRPs) has received significant attention. Approaches based on deep learning …
Wavelet scattering transform for improving generalization in low-resourced spoken language identification
Commonly used features in spoken language identification (LID), such as mel-spectrogram
or MFCC, lose high-frequency information due to windowing. The loss further increases for …
or MFCC, lose high-frequency information due to windowing. The loss further increases for …
Voiceprint Recognition under Cross-Scenario Conditions Using Perceptual Wavelet Packet Entropy-Guided Efficient-Channel-Attention–Res2Net–Time-Delay-Neural …
S Wang, H Zhang, X Zhang, Y Su, Z Wang - Mathematics, 2023 - mdpi.com
(1) Background: Voiceprint recognition technology uses individual vocal characteristics for
identity authentication and faces many challenges in cross-scenario applications. The sound …
identity authentication and faces many challenges in cross-scenario applications. The sound …
[HTML][HTML] Mandarin Recognition Based on Self-Attention Mechanism with Deep Convolutional Neural Network (DCNN)-Gated Recurrent Unit (GRU)
X Chen, C Wang, C Hu, Q Wang - Big Data and Cognitive Computing, 2024 - mdpi.com
Speech recognition technology is an important branch in the field of artificial intelligence,
aiming to transform human speech into computer-readable text information. However …
aiming to transform human speech into computer-readable text information. However …
Towards Robust Waveform-Based Acoustic Models
We study the problem of learning robust acoustic models in adverse environments,
characterized by a significant mismatch between training and test conditions. This problem …
characterized by a significant mismatch between training and test conditions. This problem …
Learning waveform-based acoustic models using deep variational convolutional neural networks
We investigate the potential of stochastic neural networks for learning effective waveform-
based acoustic models. The waveform-based setting, inherent to fully end-to-end speech …
based acoustic models. The waveform-based setting, inherent to fully end-to-end speech …
Multiple Confidence Gates for Joint Training of SE and ASR
T Wang, W Zhu, Y Gao, J Feng, D Zhu, S Zhu… - National Conference on …, 2022 - Springer
Joint training of speech enhancement model (SE) and speech recognition model (ASR) is a
common solution for robust ASR in noisy environments. SE focuses on improving the …
common solution for robust ASR in noisy environments. SE focuses on improving the …
Noise-Robust HRRP Target Recognition Based on Residual Scattering Network
P Huang, S Li, M Zheng, J **e… - 2024 9th International …, 2024 - ieeexplore.ieee.org
Extracting noise-robust features is a key issue for high-resolution range profile (HRRP)
target recognition. In order to enhance the recognition performance under low signal-to …
target recognition. In order to enhance the recognition performance under low signal-to …