Volgen
Ge Zhu
Ge Zhu
Adobe Research, Music AI
Geverifieerd e-mailadres voor adobe.com - Homepage
Titel
Geciteerd door
Geciteerd door
Jaar
UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021
X Chen, Y Zhang, G Zhu, Z Duan
arXiv preprint arXiv:2107.12018, 2021
562021
An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems
Y Zhang, G Zhu, F Jiang, Z Duan
Interspeech 2021, 2021
352021
Y-Vector: Multiscale Waveform Encoder for Speaker Embedding
G Zhu, F Jiang, Z Duan
Interspeech 2021, 2021
282021
A probabilistic fusion framework for spoofing aware speaker verification
Y Zhang, G Zhu, Z Duan
The Speaker and Language Recognition Workshop (Odyssey 2022), 2022
26*2022
Filler Word Detection and Classification: A Dataset and Benchmark
G Zhu, JP Caceres, J Salamon
Interspeech 2022, 2022
152022
Music Source Separation With Generative Flow
G Zhu, J Darefsky, F Jiang, A Selitskiy, Z Duan
IEEE Signal Processing Letters 29, 2288-2292, 2022
132022
Cacophony: An Improved Contrastive Audio-Text Model
G Zhu, J Darefsky, Z Duan
IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 4867 - 4879, 2024
112024
EDMSound: Spectrogram Based Diffusion Models for Efficient and High-Quality Audio Synthesis
G Zhu, Y Wen, MA Carbonneau, Z Duan
NeurIPS Workshop in Machine Learning for Audio, 2023, 2023
82023
MusicHiFi: Fast High-Fidelity Stereo Vocoding
G Zhu, JP Caceres, Z Duan, NJ Bryan
IEEE Signal Processing Letters 31, 2365 - 2369, 2024
42024
Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation
YA Li, X Jiang, J Darefsky, G Zhu, N Mesgarani
First Conference on Language Modeling (COLM), 2024
32024
Transcription free filler word detection with Neural semi-CRFs
G Zhu, Y Yan, JP Caceres, Z Duan
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
22023
Generalizing Voice Presentation Attack Detection to Unseen Synthetic Attacks and Channel Variation
Y Zhang, F Jiang, G Zhu, X Chen, Z Duan
Handbook of Biometric Anti-Spoofing: Presentation Attack Detection and …, 2023
22023
A study of the robustness of raw waveform based speaker embeddings under mismatched conditions
G Zhu, F Cwitkowitz, Z Duan
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
22022
Data clustering analysis of early reflections in small room
Z Zhang, G Zhu, Y Shen
The Journal of the Acoustical Society of America 144 (4), EL328-EL332, 2018
22018
Comments on “Bandpass Subwoofer Design”
H Dong, Y Shen, G Zhu
Journal of the Audio Engineering Society 66 (10), 756-758, 2018
22018
恒定束宽扬声器线阵列优化研究
朱舸, 沈勇, 夏洁, 冯雪磊
应用声学 36 (2), 95-104, 2017
12017
Presto! Distilling Steps and Layers for Accelerating Music Generation
Z Novack, G Zhu, J Casebeer, J McAuley, T Berg-Kirkpatrick, NJ Bryan
arXiv preprint arXiv:2410.05167, 2024
2024
Detecting and classifying filler words in audio using neural networks
J Salamon, JPC CHOMALI, G Zhu, NJ Bryan
US Patent App. 18/055,739, 2024
2024
Estimation of Magnitude Response of Reflecting Loudspeaker System in Listening Area Using Near-Box Measurement
G Zhu, Z Liu, Y Shen, Y Shen
Audio Engineering Society Convention 143, 2017
2017
低频用周期性结构管道的研究
薛政, 沈勇, 朱舸, 夏洁
应用声学 36 (3), 200-204, 2017
2017
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20