팔로우
Huaming Wang
Huaming Wang
Partner Group Engineering Manager, Microsoft
microsoft.com의 이메일 확인됨
제목
인용
인용
연도
Neural codec language models are zero-shot text to speech synthesizers
C Wang, S Chen, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ...
arXiv preprint arXiv:2301.02111, 2023
6372023
An introduction to computational networks and the computational network toolkit
D Yu, A Eversole, M Seltzer, K Yao, Z Huang, B Guenter, O Kuchaiev, ...
Microsoft Technical Report MSR-TR-2014–112, 2014
4762014
Clap learning audio concepts from natural language supervision
B Elizalde, S Deshmukh, M Al Ismail, H Wang
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
4482023
Speak foreign languages with your own voice: Cross-lingual neural codec language modeling
Z Zhang, L Zhou, C Wang, S Chen, Y Wu, S Liu, Z Chen, Y Liu, H Wang, ...
arXiv preprint arXiv:2303.03926, 2023
1612023
Pengi: An audio language model for audio tasks
S Deshmukh, B Elizalde, R Singh, H Wang
Advances in Neural Information Processing Systems 36, 18090-18108, 2023
1402023
Advances in online audio-visual meeting transcription
T Yoshioka, I Abramovski, C Aksoylar, Z Chen, M David, D Dimitriadis, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
902019
Multi-channel speech separation
Z Chen, J Li, X Xiao, T Yoshioka, H Wang, Z Wang, Y Gong
US Patent 10,839,822, 2020
872020
Personalized speech enhancement: New models and comprehensive evaluation
SE Eskimez, T Yoshioka, H Wang, X Wang, Z Chen, X Huang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
682022
Natural language supervision for general-purpose audio representations
B Elizalde, S Deshmukh, H Wang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
552024
Cracking the cocktail party problem by multi-beam deep attractor network
Z Chen, J Li, X Xiao, T Yoshioka, H Wang, Z Wang, Y Gong
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
552017
Audio retrieval with wavtext5k and clap training
S Deshmukh, B Elizalde, H Wang
arXiv preprint arXiv:2209.14275, 2022
512022
Fast real-time personalized speech enhancement: End-to-end enhancement network (E3Net) and knowledge distillation
M Thakker, SE Eskimez, T Yoshioka, H Wang
arXiv preprint arXiv:2204.00771, 2022
372022
One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement
H Taherian, SE Eskimez, T Yoshioka, H Wang, Z Chen, X Huang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
282022
Human listening and live captioning: Multi-task training for speech enhancement
SE Eskimez, X Wang, M Tang, H Yang, Z Zhu, Z Chen, H Wang, ...
arXiv preprint arXiv:2106.02896, 2021
262021
An introduction to computational networks and the computational network toolkit
Y Dong, E Adam, S Mike, Y Kaisheng, H Zhi-Heng, G Brian, K Oleksii, ...
Tech. Rep. MSR-TR-2014-112, 2014
242014
Notsofar-1 challenge: New datasets, baseline, and tasks for distant meeting transcription
A Vinnikov, A Ivry, A Hurvitz, I Abramovski, S Koubi, I Gurvich, S Peer, ...
arXiv preprint arXiv:2401.08887, 2024
202024
An overview of microsoft deep qa system on stanford webquestions benchmark
Z Wang, S Yan, H Wang, X Huang
2018-09-15]. https://www. microsoft. com/en-us/research/publication/an …, 2014
192014
Training audio captioning models without audio
S Deshmukh, B Elizalde, D Emmanouilidou, B Raj, R Singh, H Wang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
182024
Artificial intelligence system utilizing microphone array and fisheye camera
Z Wang, X Huang, L Qin, K Wu, H Wang
US Patent App. 15/885,518, 2019
182019
Online verification of custom wake word
K Shahid, K Kumar, T Yi, V Miljanic, H Wang, Y Gong, HA Khalil
US Patent 11,158,305, 2021
132021
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–20