The codecfake dataset and countermeasures for the universally detection of deepfake audio Y Xie, Y Lu, R Fu, Z Wen, Z Wang, J Tao, X Qi, X Wang, Y Liu, H Cheng, ... IEEE Transactions on Audio, Speech and Language Processing, 2025 | 14 | 2025 |
Generalized Fake Audio Detection via Deep Stable Learning Z Wang, R Fu, Z Wen, Y Xie, Y Liu, X Wang, X Liu, Y Li, J Tao, Y Lu, X Qi, ... Interspeech.2024-1686, 2024 | 7 | 2024 |
Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection X Wang, R Fu, Z Wen, Z Wang, Y Xie, Y Liu, J Tao, X Liu, Y Li, X Qi, Y Lu, ... interspeech2024, 2024 | 6 | 2024 |
Mixture of experts fusion for fake audio detection using frozen wav2vec 2.0 Z Wang, R Fu, Z Wen, J Tao, X Wang, Y Xie, X Qi, S Shi, Y Lu, Y Liu, C Li, ... icassp2025, 2024 | 2 | 2024 |
MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation R Fu, S Shi, H Guo, T Wang, C Qiang, Z Wen, J Tao, X Qi, Y Lu, X Wang, ... arXiv preprint arXiv:2406.10591, 2024 | 2 | 2024 |
Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio Y Lu, Y Xie, R Fu, Z Wen, J Tao, Z Wang, X Qi, X Liu, Y Li, Y Liu, X Wang, ... arXiv preprint arXiv:2406.08112, 2024 | 2 | 2024 |
Generalized Source Tracing: Detecting Novel Audio Deepfake Algorithm with Real Emphasis and Fake Dispersion strategy Y Xie, R Fu, Z Wen, Z Wang, X Wang, H Cheng, L Ye, J Tao arXiv preprint arXiv:2406.03240, 2024 | 2 | 2024 |
Temporal Variability and Multi-Viewed Self-Supervised Representations to Tackle the ASVspoof5 Deepfake Challenge Y Xie, X Wang, Z Wang, R Fu, Z Wen, H Cheng, L Ye arXiv preprint arXiv:2408.06922, 2024 | 1 | 2024 |
The FruitShell French synthesis system at the Blizzard 2023 Challenge X Qi, X Wang, Z Wang, W Liu, M Ding, S Shi arXiv preprint arXiv:2309.00223, 2023 | 1 | 2023 |
EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech X Qi, R Fu, Z Wen, J Tao, S Shi, Y Lu, Z Wang, X Wang, Y Xie, Y Liu, G Li, ... 2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024 | | 2024 |
Does Current Deepfake Audio Detection Model Effectively Detect ALM-based Deepfake Audio? Y Xie, C Xiong, X Wang, Z Wang, Y Lu, X Qi, R Fu, Y Liu, Z Wen, J Tao, ... 2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024 | | 2024 |
A Noval Feature via Color Quantisation for Fake Audio Detection Z Wang, X Wang, Y Xie, R Fu, Z Wen, J Tao, Y Liu, G Li, X Qi, Y Lu, X Liu, ... 2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024 | | 2024 |
DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech X Qi, R Fu, Z Wen, T Wang, C Qiang, J Tao, C Li, Y Lu, S Shi, Z Wang, ... arXiv preprint arXiv:2409.11835, 2024 | | 2024 |
ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation R Fu, X Qi, Z Wen, J Tao, T Wang, C Qiang, Z Wang, Y Lu, X Wang, S Shi, ... arXiv preprint arXiv:2407.05421, 2024 | | 2024 |
A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge X Wang, Y Lu, X Qi, Z Wang, Y Xie, S Shi, R Fu arXiv preprint arXiv:2406.17801, 2024 | | 2024 |
PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation S Shi, R Fu, Z Wen, J Tao, T Wang, C Qiang, Y Lu, X Qi, X Liu, Y Liu, Y Li, ... arXiv preprint arXiv:2406.04683, 2024 | | 2024 |