Kevin Qinghong Lin
Other names: Qinghong Lin
Verified email at u.nus.edu
Title
Cited by
Year
All in one: Exploring unified video-language pre-training
J Wang, Y Ge, R Yan, Y Ge, KQ Lin, S Tsutsui, X Lin, G Cai, J Wu, Y Shan, ...
CVPR 2023, 2023
Cited by: 231; Year: 2023
Egocentric video-language pretraining
KQ Lin, J Wang, M Soldan, M Wray, R Yan, EZ Xu, D Gao, RC Tu, W Zhao, ...
NeurIPS 2022 (Spotlight), 2022
Cited by: 207*; Year: 2022
UniVTG: Towards unified video-language temporal grounding
KQ Lin, P Zhang, J Chen, S Pramanick, D Gao, AJ Wang, R Yan, MZ Shou
ICCV 2023, 2023
Cited by: 120; Year: 2023
AssistGPT: A general multi-modal assistant that can plan, execute, inspect, and learn
D Gao, L Ji, L Zhou, KQ Lin, J Chen, Z Fan, MZ Shou
ACMMM 2024 HCMA workshop (Best Demo Paper), 2023
Cited by: 77; Year: 2023
Show-o: One single transformer to unify multimodal understanding and generation
J Xie, W Mao, Z Bai, DJ Zhang, W Wang, KQ Lin, Y Gu, Z Chen, Z Yang, ...
ICLR 2025, 2024
Cited by: 68; Year: 2024
EgoVLPv2: Egocentric video-language pre-training with fusion in the backbone
S Pramanick, Y Song, S Nag, KQ Lin, H Shah, MZ Shou, R Chellappa, ...
ICCV 2023, 2023
Cited by: 64; Year: 2023
Unsupervised cross-modal hashing with modality-interaction
RC Tu, J Jiang, Q Lin, C Cai, S Tian, H Wang, W Liu
TCSVT 2023, 2023
Cited by: 46; Year: 2023
Unsupervised cross-modal hashing via semantic text mining
RC Tu, XL Mao, Q Lin, W Ji, W Qin, W Wei, H Huang
TMM 2023, 2023
Cited by: 28; Year: 2023
Affordance grounding from demonstration video to target image
J Chen, D Gao, KQ Lin, MZ Shou
CVPR 2023, 2023
Cited by: 27; Year: 2023
Too large; data reduction for vision-language pre-training
AJ Wang, KQ Lin, DJ Zhang, SW Lei, MZ Shou
ICCV 2023, 2023
Cited by: 25; Year: 2023
VideoLLM-online: Online Video Large Language Model for Streaming Video
J Chen, Z Lv, S Wu, KQ Lin, C Song, D Gao, JW Liu, Z Gao, D Mao, ...
CVPR 2024, 2024
Cited by: 23; Year: 2024
Deep unsupervised hashing with latent semantic components
Q Lin, X Chen, Q Zhang, S Cai, W Zhao, H Wang
AAAI 2022 (Oral), 2022
Cited by: 22; Year: 2022
Unsupervised hashing with semantic concept mining
RC Tu, XL Mao, KQ Lin, C Cai, W Qin, W Wei, H Wang, H Huang
SIGMOD 2023, 2023
Cited by: 16; Year: 2023
Deep superpixel cut for unsupervised image segmentation
Q Lin, W Zhong, J Lu
ICPR 2021, 2021
Cited by: 15; Year: 2021
Deep self-adaptive hashing for image retrieval
Q Lin, X Chen, Q Zhang, S Tian, Y Chen
CIKM 2021 (Oral), 2021
Cited by: 14; Year: 2021
VisorGPT: Learning visual prior via generative pre-training
J Xie, K Ye, Y Li, Y Li, KQ Lin, Y Zheng, L Shen, MZ Shou
NeurIPS 2023, 2023
Cited by: 13*; Year: 2023
COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training
AJ Wang, L Li, KQ Lin, J Wang, K Lin, Z Yang, L Wang, MZ Shou
arXiv preprint arXiv:2401.00849, 2024
Cited by: 9; Year: 2024
DiffusionVMR: Diffusion Model for Joint Video Moment Retrieval and Highlight Detection
H Zhao, KQ Lin, R Yan, Z Li
TNNLS 2024, 2024
Cited by: 7*; Year: 2024
ShowUI: One Vision-Language-Action Model for GUI Visual Agent
KQ Lin, L Li, D Gao, Z Yang, S Wu, Z Bai, W Lei, L Wang, MZ Shou
NeurIPS 2024 OWA workshop (Outstanding Paper Award), 2024
Cited by: 5*; Year: 2024
GUI Action Narrator: Where and When Did That Action Take Place?
Q Wu, D Gao, KQ Lin, Z Wu, X Guo, P Li, W Zhang, H Wang, MZ Shou
arXiv preprint arXiv:2406.13719, 2024
Cited by: 3; Year: 2024