Gloss attention for gloss-free sign language translation A Yin, T Zhong, L Tang, W Jin, T Jin, Z Zhao Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 55 | 2023 |
Mlslt: Towards multilingual sign language translation A Yin, Z Zhao, W Jin, M Zhang, X Zeng, X He Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 47 | 2022 |
Simulslt: End-to-end simultaneous sign language translation A Yin, Z Zhao, J Liu, W Jin, M Zhang, X Zeng, X He Proceedings of the 29th ACM International Conference on Multimedia, 4118-4127, 2021 | 36 | 2021 |
Connecting multi-modal contrastive representations Z Wang, Y Zhao, H Huang, J Liu, A Yin, L Tang, L Li, Y Wang, Z Zhang, ... Advances in Neural Information Processing Systems 36, 22099-22114, 2023 | 34 | 2023 |
Mixspeech: Cross-modality self-learning with audio-visual stream mixup for visual speech translation and recognition X Cheng, T Jin, R Huang, L Li, W Lin, Z Wang, Y Wang, H Liu, A Yin, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 24 | 2023 |
Distilling coarse-to-fine semantic matching knowledge for weakly supervised 3d visual grounding Z Wang, H Huang, Y Zhao, L Li, X Cheng, Y Zhu, A Yin, Z Zhao Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 19 | 2023 |
3drp-net: 3d relative position-aware network for 3d visual grounding Z Wang, H Huang, Y Zhao, L Li, X Cheng, Y Zhu, A Yin, Z Zhao arXiv preprint arXiv:2307.13363, 2023 | 18 | 2023 |
Transface: Unit-based audio-visual speech synthesizer for talking head translation X Cheng, R Huang, L Li, T Jin, Z Wang, A Yin, M Li, X Duan, Z Zhao arXiv preprint arXiv:2312.15197, 2023 | 7 | 2023 |
Traineragent: Customizable and efficient model training through llm-powered multi-agent system H Li, H Jiang, T Zhang, Z Yu, A Yin, H Cheng, S Fu, Y Zhang, W He arXiv preprint arXiv:2311.06622, 2023 | 7 | 2023 |
Mlslt: Towards multilingual sign language translation A Yin, Z Zhao, W Jin, M Zhang, X Zeng, X He 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 5099-5109, 2022 | 5 | 2022 |
T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text A Yin, H Li, K Shen, S Tang, Y Zhuang arXiv preprint arXiv:2406.07119, 2024 | 1 | 2024 |
Language Model is a Branch Predictor for Simultaneous Machine Translation A Yin, T Zhong, H Li, S Tang, Z Zhao ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
NaturalSigner: Diffusion Models are Natural Sign Language Generator A Yin, J Xun, X Cheng, T Jin, S Zhang, Z Zhao, S Tang, F Wu | | |