Zhiyong Wang
School of Artificial Intelligence, University of Chinese Academy of Sciences
Verified email at mails.ucas.ac.cn
Title
Cited by
Year
The codecfake dataset and countermeasures for the universally detection of deepfake audio
Y Xie, Y Lu, R Fu, Z Wen, Z Wang, J Tao, X Qi, X Wang, Y Liu, H Cheng, ...
IEEE Transactions on Audio, Speech and Language Processing, 2025
Cited by: 14; Year: 2025
Generalized Fake Audio Detection via Deep Stable Learning
Z Wang, R Fu, Z Wen, Y Xie, Y Liu, X Wang, X Liu, Y Li, J Tao, Y Lu, X Qi, ...
Interspeech.2024-1686, 2024
Cited by: 7; Year: 2024
Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection
X Wang, R Fu, Z Wen, Z Wang, Y Xie, Y Liu, J Tao, X Liu, Y Li, X Qi, Y Lu, ...
Interspeech 2024, 2024
Cited by: 6; Year: 2024
Mixture of experts fusion for fake audio detection using frozen wav2vec 2.0
Z Wang, R Fu, Z Wen, J Tao, X Wang, Y Xie, X Qi, S Shi, Y Lu, Y Liu, C Li, ...
ICASSP 2025, 2024
Cited by: 2; Year: 2024
MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation
R Fu, S Shi, H Guo, T Wang, C Qiang, Z Wen, J Tao, X Qi, Y Lu, X Wang, ...
arXiv preprint arXiv:2406.10591, 2024
Cited by: 2; Year: 2024
Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio
Y Lu, Y Xie, R Fu, Z Wen, J Tao, Z Wang, X Qi, X Liu, Y Li, Y Liu, X Wang, ...
arXiv preprint arXiv:2406.08112, 2024
Cited by: 2; Year: 2024
Generalized Source Tracing: Detecting Novel Audio Deepfake Algorithm with Real Emphasis and Fake Dispersion strategy
Y Xie, R Fu, Z Wen, Z Wang, X Wang, H Cheng, L Ye, J Tao
arXiv preprint arXiv:2406.03240, 2024
Cited by: 2; Year: 2024
Temporal Variability and Multi-Viewed Self-Supervised Representations to Tackle the ASVspoof5 Deepfake Challenge
Y Xie, X Wang, Z Wang, R Fu, Z Wen, H Cheng, L Ye
arXiv preprint arXiv:2408.06922, 2024
Cited by: 1; Year: 2024
The FruitShell French synthesis system at the Blizzard 2023 Challenge
X Qi, X Wang, Z Wang, W Liu, M Ding, S Shi
arXiv preprint arXiv:2309.00223, 2023
Cited by: 1; Year: 2023
EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech
X Qi, R Fu, Z Wen, J Tao, S Shi, Y Lu, Z Wang, X Wang, Y Xie, Y Liu, G Li, ...
2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024
2024
Does Current Deepfake Audio Detection Model Effectively Detect ALM-based Deepfake Audio?
Y Xie, C Xiong, X Wang, Z Wang, Y Lu, X Qi, R Fu, Y Liu, Z Wen, J Tao, ...
2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024
2024
A Noval Feature via Color Quantisation for Fake Audio Detection
Z Wang, X Wang, Y Xie, R Fu, Z Wen, J Tao, Y Liu, G Li, X Qi, Y Lu, X Liu, ...
2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024
2024
DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
X Qi, R Fu, Z Wen, T Wang, C Qiang, J Tao, C Li, Y Lu, S Shi, Z Wang, ...
arXiv preprint arXiv:2409.11835, 2024
2024
ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation
R Fu, X Qi, Z Wen, J Tao, T Wang, C Qiang, Z Wang, Y Lu, X Wang, S Shi, ...
arXiv preprint arXiv:2407.05421, 2024
2024
A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge
X Wang, Y Lu, X Qi, Z Wang, Y Xie, S Shi, R Fu
arXiv preprint arXiv:2406.17801, 2024
2024
PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation
S Shi, R Fu, Z Wen, J Tao, T Wang, C Qiang, Y Lu, X Qi, X Liu, Y Liu, Y Li, ...
arXiv preprint arXiv:2406.04683, 2024
2024