Xudong Lin
Title
Cited by
Year
Deep Adversarial Metric Learning
Y Duan, W Zheng, X Lin, J Lu, J Zhou
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Cited by: 271; Year: 2018
All in one: Exploring unified video-language pre-training
J Wang, Y Ge, R Yan, Y Ge, KQ Lin, S Tsutsui, X Lin, G Cai, J Wu, Y Shan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 230; Year: 2023
Dmc-net: Generating discriminative motion cues for fast compressed video action recognition
Z Shou, X Lin, Y Kalantidis, L Sevilla-Lara, M Rohrbach, SF Chang, Z Yan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 161; Year: 2019
Clip-event: Connecting text and images with event structures
M Li, R Xu, S Wang, L Zhou, X Lin, C Zhu, M Zeng, H Ji, SF Chang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 140; Year: 2022
Deep Variational Metric Learning
X Lin, Y Duan, Q Dong, J Lu, J Zhou
Proceedings of the European Conference on Computer Vision (ECCV), 689-704, 2018
Cited by: 132; Year: 2018
Language models with image descriptors are strong few-shot video-language learners
Z Wang, M Li, R Xu, L Zhou, J Lei, X Lin, S Wang, Z Yang, C Zhu, ...
Advances in Neural Information Processing Systems 35, 8483-8497, 2022
Cited by: 131; Year: 2022
BLINK: Multimodal Large Language Models Can See but Not Perceive
X Fu, Y Hu, B Li, Y Feng, H Wang, X Lin, D Roth, NA Smith, WC Ma, ...
arXiv preprint arXiv:2404.12390, 2024
Cited by: 101; Year: 2024
Object-aware Video-language Pre-training for Retrieval
J Wang, Y Ge, G Cai, R Yan, X Lin, Y Shan, X Qie, MZ Shou
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 88; Year: 2022
Learning To Recognize Procedural Activities with Distant Supervision
X Lin, F Petroni, G Bertasius, M Rohrbach, SF Chang, L Torresani
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 84; Year: 2022
VX2TEXT: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs
X Lin, G Bertasius, J Wang, SF Chang, D Parikh, L Torresani
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 76; Year: 2021
Context-Gated Convolution
X Lin, L Ma, W Liu, SF Chang
ECCV 2020, 2019
Cited by: 66; Year: 2019
RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System
H Wen, Y Lin, T Lai, X Pan, S Li, X Lin, B Zhou, M Li, H Wang, H Zhang, ...
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
Cited by: 63; Year: 2021
Supervised masked knowledge distillation for few-shot transformers
H Lin, G Han, J Ma, S Huang, X Lin, SF Chang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 45; Year: 2023
Resin-11: Schema-guided event prediction for 11 newsworthy scenarios
X Du, Z Zhang, S Li, P Yu, H Wang, T Lai, X Lin, Z Wang, I Liu, B Zhou, ...
Proceedings of the 2022 Conference of the North American Chapter of the …, 2022
Cited by: 39; Year: 2022
GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning
Y Duan, Z Wang, J Lu, X Lin, J Zhou
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Cited by: 37; Year: 2018
Exploring the reasoning abilities of multimodal large language models (mllms): A comprehensive survey on emerging trends in multimodal reasoning
Y Wang, W Chen, X Han, X Lin, H Zhao, Y Liu, B Zhai, J Yuan, Q You, ...
arXiv preprint arXiv:2401.06805, 2024
Cited by: 34; Year: 2024
Joint Multimedia Event Extraction from Video and Article
B Chen, X Lin, C Thomas, M Li, S Yoshida, L Chum, H Ji, SF Chang
arXiv preprint arXiv:2109.12776, 2021
Cited by: 31; Year: 2021
Towards fast adaptation of pretrained contrastive models for multi-channel video-language retrieval
X Lin, S Tiwari, S Huang, M Li, MZ Shou, H Ji, SF Chang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 29; Year: 2023
Learning to Decompose Visual Features with Latent Textual Prompts
F Wang, M Li, X Lin, H Lv, AG Schwing, H Ji
arXiv preprint arXiv:2210.04287, 2022
Cited by: 29; Year: 2022
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
R Gangi Reddy, X Rui, M Li, X Lin, H Wen, J Cho, L Huang, M Bansal, ...
arXiv e-prints, arXiv: 2112.10728, 2021
Cited by: 23*; Year: 2021