Прати
Aoxiong Yin
Aoxiong Yin
Верификована је имејл адреса на zju.edu.cn - Почетна страница
Наслов
Навело
Навело
Година
Gloss attention for gloss-free sign language translation
A Yin, T Zhong, L Tang, W Jin, T Jin, Z Zhao
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
552023
Mlslt: Towards multilingual sign language translation
A Yin, Z Zhao, W Jin, M Zhang, X Zeng, X He
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
472022
Simulslt: End-to-end simultaneous sign language translation
A Yin, Z Zhao, J Liu, W Jin, M Zhang, X Zeng, X He
Proceedings of the 29th ACM International Conference on Multimedia, 4118-4127, 2021
362021
Connecting multi-modal contrastive representations
Z Wang, Y Zhao, H Huang, J Liu, A Yin, L Tang, L Li, Y Wang, Z Zhang, ...
Advances in Neural Information Processing Systems 36, 22099-22114, 2023
342023
Mixspeech: Cross-modality self-learning with audio-visual stream mixup for visual speech translation and recognition
X Cheng, T Jin, R Huang, L Li, W Lin, Z Wang, Y Wang, H Liu, A Yin, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
242023
Distilling coarse-to-fine semantic matching knowledge for weakly supervised 3d visual grounding
Z Wang, H Huang, Y Zhao, L Li, X Cheng, Y Zhu, A Yin, Z Zhao
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
192023
3drp-net: 3d relative position-aware network for 3d visual grounding
Z Wang, H Huang, Y Zhao, L Li, X Cheng, Y Zhu, A Yin, Z Zhao
arXiv preprint arXiv:2307.13363, 2023
182023
Transface: Unit-based audio-visual speech synthesizer for talking head translation
X Cheng, R Huang, L Li, T Jin, Z Wang, A Yin, M Li, X Duan, Z Zhao
arXiv preprint arXiv:2312.15197, 2023
72023
Traineragent: Customizable and efficient model training through llm-powered multi-agent system
H Li, H Jiang, T Zhang, Z Yu, A Yin, H Cheng, S Fu, Y Zhang, W He
arXiv preprint arXiv:2311.06622, 2023
72023
Mlslt: Towards multilingual sign language translation. In 2022 IEEE
A Yin, Z Zhao, W Jin, M Zhang, X Zeng, X He
CVF Conference on Computer Vision and Pattern Recognition (CVPR), 5099-5109, 2022
52022
T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text
A Yin, H Li, K Shen, S Tang, Y Zhuang
arXiv preprint arXiv:2406.07119, 2024
12024
Language Model is a Branch Predictor for Simultaneous Machine Translation
A Yin, T Zhong, H Li, S Tang, Z Zhao
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
NaturalSigner: Diffusion Models are Natural Sign Language Generator
A Yin, J Xun, X Cheng, T Jin, S Zhang, Z Zhao, S Tang, F Wu
Систем тренутно не може да изврши ову радњу. Пробајте поново касније.
Чланци 1–13