Quan Wang
Senior Staff Software Engineer @ Google DeepMind; Instructor @ Udemy; IEEE Senior Member
Verified email at google.com - Homepage
Title
Cited by
Year
Generalized end-to-end loss for speaker verification
L Wan, Q Wang, A Papir, IL Moreno
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 1134; Year: 2018
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
Cited by: 1116; Year: 2024
Transfer learning from speaker verification to multispeaker text-to-speech synthesis
Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ...
Advances in neural information processing systems, 4480-4490, 2018
Cited by: 1039; Year: 2018
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ...
Proc. Interspeech 2019, 2728-2732, 2019
Cited by: 453; Year: 2019
Speaker diarization with LSTM
Q Wang, C Downey, L Wan, PA Mansfield, IL Moreno
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 444; Year: 2018
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ...
Computer Speech & Language 64, 101114, 2020
Cited by: 435; Year: 2020
Fully supervised speaker diarization
A Zhang, Q Wang, Z Zhu, J Paisley, C Wang
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 282; Year: 2019
Kernel principal component analysis and its applications in face recognition and active shape models
Q Wang
arXiv preprint arXiv:1207.3538, 2012
Cited by: 279; Year: 2012
Attention-based models for text-dependent speaker verification
FAR Rahman Chowdhury, Q Wang, IL Moreno, L Wan
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 210; Year: 2018
Wavenet based low rate speech coding
WB Kleijn, FSC Lim, A Luebs, J Skoglund, F Stimberg, Q Wang, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 198; Year: 2018
Sample Efficient Adaptive Text-to-Speech
Y Chen, Y Assael, B Shillingford, D Budden, S Reed, H Zen, Q Wang, ...
International Conference on Learning Representations (ICLR), 2019
Cited by: 170; Year: 2019
VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Q Wang, IL Moreno, M Saglam, K Wilson, A Chiao, R Liu, Y He, W Li, ...
Proc. Interspeech 2020, 2677-2681, 2020
Cited by: 104; Year: 2020
Personal VAD: Speaker-Conditioned Voice Activity Detection
S Ding, Q Wang, S Chang, L Wan, IL Moreno
Proc. Odyssey 2020 The Speaker and Language Recognition Workshop, 433-439, 2020
Cited by: 101; Year: 2020
HMRF-EM-image: implementation of the hidden markov random field model and its expectation-maximization algorithm
Q Wang
arXiv preprint arXiv:1207.3510, 2012
Cited by: 83; Year: 2012
CVSS Corpus and Massively Multilingual Speech-to-Speech Translation
Y Jia, MT Ramanovich, Q Wang, H Zen
Conference on Language Resources and Evaluation (LREC), 2022
Cited by: 77; Year: 2022
Turn-to-diarize: Online speaker diarization constrained by transformer transducer speaker turn detection
W Xia, H Lu, Q Wang, A Tripathi, Y Huang, IL Moreno, H Sak
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 70; Year: 2022
Fully supervised speaker diarization
C Wang, A Zhang, Q Wang, Z Zhu
US Patent 11,031,017, 2021
Cited by: 67; Year: 2021
Speaker verification
IL Moreno, L Wan, Q Wang
US Patent App. 15/211,317, 2018
Cited by: 60; Year: 2018
GMM-Based Hidden Markov Random Field for Color Image and 3D Volume Segmentation
Q Wang
arXiv preprint arXiv:1212.4527, 2012
Cited by: 39; Year: 2012
The Active Geometric Shape Model: A New Robust Deformable Shape Model and its Applications
Q Wang, KL Boyer
Computer Vision and Image Understanding, 2012
Cited by: 36; Year: 2012
Articles 1–20