Follow
Wang Lin
Title
Cited by
Cited by
Year
Mixspeech: Cross-modality self-learning with audio-visual stream mixup for visual speech translation and recognition
X Cheng, T Jin, R Huang, L Li, W Lin, Z Wang, Y Wang, H Liu, A Yin, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
232023
Opensr: Open-modality speech recognition via maintaining multi-modality alignment
X Cheng, T Jin, L Li, W Lin, X Duan, Z Zhao
arXiv preprint arXiv:2306.06410, 2023
162023
TAVT: Towards Transferable Audio-Visual Text Generation
W Lin, T Jin, W Pan, L Li, X Cheng, Y Wang, Z Zhao
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023
122023
Multi-granularity relational attention network for audio-visual question answering
L Li, T Jin, W Lin, H Jiang, W Pan, J Wang, S Xiao, Y Xia, W Jiang, Z Zhao
IEEE Transactions on Circuits and Systems for Video Technology, 2023
122023
Exploring group video captioning with efficient relational approximation
W Lin, T Jin, Y Wang, W Pan, L Li, X Cheng, Z Zhao
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
102023
EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration
Y Wang, J Xun, M Hong, J Zhu, T Jin, W Lin, H Li, L Li, Y Xia, Z Zhao, ...
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and …, 2024
82024
Rethinking Missing Modality Learning from a Decoding Perspective
T Jin, X Cheng, L Li, W Lin, Y Wang, Z Zhao
Proceedings of the 31st ACM International Conference on Multimedia, 4431-4439, 2023
82023
Weakly-supervised spoken video grounding via semantic interaction learning
Y Wang, W Lin, S Zhang, T Jin, L Li, X Cheng, Z Zhao
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023
72023
Contrastive token-wise meta-learning for unseen performer visual temporal-aligned translation
L Li, T Jin, X Cheng, Y Wang, W Lin, R Huang, Z Zhao
Findings of the Association for Computational Linguistics: ACL 2023, 10993-11007, 2023
62023
FedPAM: Federated Personalized Augmentation Model for Text-to-Image Retrieval
Y Feng, F Ma, W Lin, C Yao, J Chen, Y Yang
Proceedings of the 2024 International Conference on Multimedia Retrieval …, 2024
42024
Non-confusing Generation of Customized Concepts in Diffusion Models
W Lin, J Chen, J Shi, Y Zhu, C Liang, J Miao, T Jin, Z Zhao, F Wu, S Yan, ...
arXiv preprint arXiv:2405.06914, 2024
42024
Semantic-conditioned dual adaptation for cross-domain query-based visual segmentation
Y Wang, T Jin, W Lin, X Cheng, L Li, Z Zhao
Findings of the Association for Computational Linguistics: ACL 2023, 9797-9815, 2023
42023
Low-rank Prompt Interaction for Continual Vision-Language Retrieval
W Yan, Y Wang, W Lin, Z Guo, Z Zhao, T Jin
Proceedings of the 32nd ACM International Conference on Multimedia, 8257-8266, 2024
32024
Rethinking the multimodal correlation of multimodal sequential learning via generalizable attentional results alignment
T Jin, W Lin, Y Wang, L Li, X Cheng, Z Zhao
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
22024
Autogeo: Automating geometric image dataset creation for enhanced geometry understanding
Z Huang, T Wu, W Lin, S Zhang, J Chen, F Wu
arXiv preprint arXiv:2409.09039, 2024
12024
Bridging the Gap for Test-Time Multimodal Sentiment Analysis
Z Guo, T Jin, W Xu, W Lin, Y Wu
arXiv preprint arXiv:2412.07121, 2024
2024
Semantic Alignment for Multimodal Large Language Models
T Wu, M Li, J Chen, W Ji, W Lin, J Gao, K Kuang, Z Zhao, F Wu
Proceedings of the 32nd ACM International Conference on Multimedia, 3489-3498, 2024
2024
Instruction Tuning-free Visual Token Complement for Multimodal LLMs
D Wang, J Cui, M Li, W Lin, B Chen, H Zhang
European Conference on Computer Vision, 446-462, 2024
2024
: Exploring Embodied Emotion Through A Large-Scale Egocentric Video Dataset
W Lin, Y Feng, WK Han, T Jin, Z Zhao, F Wu, C Yao, J Chen
The Thirty-eight Conference on Neural Information Processing Systems …, 0
Action Imitation in Common Action Space for Customized Action Image Synthesis
W Lin, J Chen, J Shi, Z Guo, Y Zhu, Z Wang, T Jin, Z Zhao, F Wu, ...
The Thirty-eighth Annual Conference on Neural Information Processing Systems, 0
The system can't perform the operation now. Try again later.
Articles 1–20