Yang Ai
Title
Cited by
Year
Waveform modeling and generation using hierarchical recurrent neural networks for speech bandwidth extension
ZH Ling, Y Ai, Y Gu, LR Dai
IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (5), 883-894, 2018
Cited by: 89; Year: 2018
MP-SENet: A speech enhancement model with parallel denoising of magnitude and phase spectra
YX Lu, Y Ai, ZH Ling
arXiv preprint arXiv:2305.13686, 2023
Cited by: 61; Year: 2023
Singing voice synthesis using deep autoregressive neural networks for acoustic modeling
YH Yi, Y Ai, ZH Ling, LR Dai
arXiv preprint arXiv:1906.08977, 2019
Cited by: 42; Year: 2019
A neural vocoder with hierarchical generation of amplitude and phase spectra for statistical parametric speech synthesis
Y Ai, ZH Ling
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 839-851, 2020
Cited by: 40; Year: 2020
SampleRNN-based neural vocoder for statistical parametric speech synthesis
Y Ai, HC Wu, ZH Ling
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 33; Year: 2018
BDDR: An effective defense against textual backdoor attacks
K Shao, J Yang, Y Ai, H Liu, Y Zhang
Computers & Security 110, 102433, 2021
Cited by: 32; Year: 2021
Neural speech phase prediction based on parallel estimation architecture and anti-wrapping losses
Y Ai, ZH Ling
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 26; Year: 2023
APCodec: A neural audio codec with parallel amplitude and phase spectrum encoding and decoding
Y Ai, XH Jiang, YX Lu, HP Du, ZH Ling
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Cited by: 20; Year: 2024
APNet: An all-frame-level neural vocoder incorporating direct prediction of amplitude and phase spectra
Y Ai, ZH Ling
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 2145-2157, 2023
Cited by: 15; Year: 2023
DNN-based spectral enhancement for neural waveform generators with low-bit quantization
Y Ai, JX Zhang, L Chen, ZH Ling
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 13; Year: 2019
The USTC-NERCSLIP System for the Track 1.2 of Audio Deepfake Detection (ADD 2023) Challenge
H Wu, Z Li, L Xu, Z Zhang, W Zhao, B Gu, Y Ai, Y Lu, J Zhang, Z Ling, ...
DADA@ IJCAI, 119-124, 2023
Cited by: 10; Year: 2023
Knowledge-and-data-driven amplitude spectrum prediction for hierarchical neural vocoders
Y Ai, ZH Ling
arXiv preprint arXiv:2004.07832, 2020
Cited by: 9; Year: 2020
APNet2: high-quality and high-efficiency neural vocoder with direct prediction of amplitude and phase spectra
HP Du, YX Lu, Y Ai, ZH Ling
National Conference on Man-Machine Speech Communication, 66-80, 2023
Cited by: 8; Year: 2023
Denoising-and-dereverberation hierarchical neural vocoder for statistical parametric speech synthesis
Y Ai, ZH Ling, WL Wu, A Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 2036-2048, 2022
Cited by: 7; Year: 2022
Towards high-quality and efficient speech bandwidth extension with parallel amplitude and phase prediction
YX Lu, Y Ai, HP Du, ZH Ling
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Cited by: 6; Year: 2024
Explicit estimation of magnitude and phase spectra in parallel for high-quality speech enhancement
YX Lu, Y Ai, ZH Ling
arXiv preprint arXiv:2308.08926, 2023
Cited by: 6; Year: 2023
Face-driven zero-shot voice conversion with memory-based face-voice alignment
ZY Sheng, Y Ai, YN Chen, ZH Ling
Proceedings of the 31st ACM International Conference on Multimedia, 8443-8452, 2023
Cited by: 5; Year: 2023
Zero-shot personalized lip-to-speech synthesis with face image based voice control
ZY Sheng, Y Ai, ZH Ling
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 5; Year: 2023
Incorporating ultrasound tongue images for audio-visual speech enhancement through knowledge distillation
RC Zheng, Y Ai, ZH Ling
arXiv preprint arXiv:2305.14933, 2023
Cited by: 5; Year: 2023
Reverberation modeling for source-filter-based neural vocoder
Y Ai, X Wang, J Yamagishi, ZH Ling
arXiv preprint arXiv:2005.07379, 2020
Cited by: 5; Year: 2020