Junyi Ao
Other names: 敖君逸
The Chinese University of Hong Kong, Shenzhen
Verified email at link.cuhk.edu.cn
Title, Cited by, Year
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
J Ao, R Wang, L Zhou, C Wang, S Ren, Y Wu, S Liu, T Ko, Q Li, Y Zhang, ...
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
Cited by 258, 2022
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
R Wang, Q Bai, J Ao, L Zhou, Z Xiong, Z Wei, Y Zhang, T Ko, H Li
INTERSPEECH 2022, 2022
Cited by 61, 2022
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training
Z Zhang, L Zhou, J Ao, S Liu, L Dai, J Li, F Wei
Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022
Cited by 56, 2022
Multi-View Self-Attention Based Transformer for Speaker Recognition
R Wang, J Ao, L Zhou, S Liu, Z Wei, T Ko, Q Li, Y Zhang
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 47, 2022
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data
J Ao, Z Zhang, L Zhou, S Liu, H Li, T Ko, L Dai, J Li, Y Qian, F Wei
INTERSPEECH 2022, 2022
Cited by 21, 2022
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
J Ao, Y Wang, X Tian, D Chen, J Zhang, L Lu, Y Wang, H Li, Z Wu
The Thirty-eight Conference on Neural Information Processing Systems …, 2024
Cited by 12, 2024
The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task
Z Zhang, J Ao, L Zhou, S Liu, F Wei, J Li
arXiv preprint arXiv:2206.05777, 2022
Cited by 10, 2022
token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired Speech and Text
X Yue, J Ao, X Gao, H Li
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 9, 2023
CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
C Meng, J Ao, T Ko, M Wang, H Li
INTERSPEECH 2023, 2022
Cited by 9, 2022
USED: Universal Speaker Extraction and Diarization
J Ao, MS Yıldırım, R Tao, M Ge, S Wang, Y Qian, H Li
arXiv preprint arXiv:2309.10674, 2023
Cited by 5, 2023
Self-Supervised Acoustic Word Embedding Learning via Correspondence Transformer Encoder
J Lin, X Yue, J Ao, H Li
INTERSPEECH 2023, 2023
Cited by 3, 2023
SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech
J Lin, M Ge, J Ao, L Deng, H Li
INTERSPEECH 2024, 2024
Cited by 2, 2024
Improving Attention-based End-to-end ASR by Incorporating an N-gram Neural Network
J Ao, T Ko
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
Cited by 1, 2021
Overview of the Amphion Toolkit (v0.2)
J Li, X Zhang, Y Wang, H He, C Wang, L Wang, H Liao, J Ao, Z Xie, ...
arXiv preprint arXiv:2501.15442, 2025
2025
Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks
D Ma, X Yue, J Ao, X Gao, H Li
IEEE Signal Processing Letters 31, 2055 - 2059, 2024
2024
The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge
M Ge, Y Peng, Y Jiang, J Lin, J Ao, MS Yildirim, S Wang, H Li, M Feng
arXiv preprint arXiv:2312.16002, 2023
2023
The YiTrans speech translation system for IWSLT 2022 offline shared task
Z Zhang, J Ao
Proceedings of the 19th International Conference on Spoken Language …, 2022
2022
Sounding the Alarm: Backdooring Acoustic Foundation Models for Physically Realizable Triggers
Z Yun, J Ao, T Ko, E Ronen, M Sharif
Articles 1–18