Peng Jin

Citado por

	Total	Desde 2020
Citas	1322	1322
Índice h	14	14
Índice i10	16	16

1000

500

250

750

20222023202420256 123 992 196

Acceso público

Ver todo

11 artículos

0 artículos

disponibles

no disponibles

Basado en requisitos de financiación

Coautores

Li Yuan, 袁粒Peking University, Shenzhen Graduate School, School of ECEDirección de correo verificada de pku.edu.cn
Jie ChenPeking UniversityDirección de correo verificada de pku.edu.cn
Li HaoPhd candidate of computer science, Peking UniversityDirección de correo verificada de pku.edu.cn
Jinfa HuangUniversity of Rochester, Peking UniversityDirección de correo verificada de ur.rochester.edu
Zesen ChengPeking UniversityDirección de correo verificada de stu.pku.edu.cn
Bin ZhuPeking UniversityDirección de correo verificada de stu.pku.edu.cn
Bin Lin, 林彬Master student, Peking UniversityDirección de correo verificada de stu.pku.edu.cn
Kehan LiPeking University Shenzhen Graduate SchoolDirección de correo verificada de stu.pku.edu.cn
Yatian PangNational University of SingaporeDirección de correo verificada de u.nus.edu
Guoli SongPeng Cheng LaboratoryDirección de correo verificada de pcl.ac.cn
Fenglin LiuUniversity of OxfordDirección de correo verificada de eng.ox.ac.uk
Shuicheng Yan, Fellow of AAAI, ACM,...National University of Singapore, Ex: Skywork AI, Sea AI Lab | Looking for labmatesDirección de correo verificada de nus.edu.sg
Runyi YuPhD student at HKUSTDirección de correo verificada de connect.ust.hk
Zhiyuan YanPhD student, Peking University; Tencent Youtu Lab; Previously at CUHK-SZDirección de correo verificada de stu.pku.edu.cn

Seguir

Peng Jin

PhD student, Peking University

Dirección de correo verificada de stu.pku.edu.cn - Página principal

Vision and Language Multimodal LLM Cross-modal Retrieval


Título Ordenar por citas Ordenar por año Ordenar por título	Citado por Citado por	Año
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection B Lin, Y Ye, B Zhu, J Cui, M Ning, P Jin, L Yuan EMNLP 2024, 2024	438	2024
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding P Jin, R Takanobu, W Zhang, X Cao, L Yuan CVPR 2024 Highlight, 13700-13710, 2024	173	2024
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models B Lin, Z Tang, Y Ye, J Cui, B Zhu, P Jin, J Zhang, M Ning, L Yuan arXiv preprint arXiv:2401.15947, 2024	170	2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models D Liu, R Zhang, L Qiu, S Huang, W Lin, S Zhao, S Geng, Z Lin, P Jin, ... ICML 2024, 2024	96*	2024
Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations P Jin, J Huang, F Liu, X Wu, S Ge, G Song, D Clifton, J Chen NeurIPS 2022 Spotlight 35, 30291-30306, 2022	69	2022
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning P Jin, J Huang, P Xiong, S Tian, C Liu, X Ji, L Yuan, J Chen CVPR 2023 Highlight, 2472-2482, 2023	66	2023
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model P Jin, H Li, Z Cheng, K Li, X Ji, C Liu, L Yuan, J Chen ICCV 2023, 2470-2481, 2023	60	2023
Weakly-Supervised 3D Spatial Reasoning for Text-based Visual Question Answering H Li, J Huang, P Jin, G Song, Q Wu, J Chen IEEE Transactions on Image Processing, 2023	40*	2023
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment P Jin, H Li, Z Cheng, J Huang, Z Wang, L Yuan, C Liu, J Chen IJCAI 2023, 938-946, 2023	35	2023
Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs P Jin, Y Wu, Y Fan, Z Sun, W Yang, L Yuan NeurIPS 2023, 2023	26	2023
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference Z Wan, Z Wu, C Liu, J Huang, Z Zhu, P Jin, L Wang, L Yuan EMNLP 2024 Findings, 2024	22	2024
Parallel Vertex Diffusion for Unified Visual Grounding Z Cheng, K Li, P Jin, X Ji, L Yuan, C Liu, J Chen AAAI 2024, 1326-1334, 2024	22	2024
Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting J Zhang, Z Tang, Y Pang, X Cheng, P Jin, Y Wei, W Yu, M Ning, L Yuan ECCV 2024, 2024	20	2024
TG-VQA: Ternary Game of Video Question Answering H Li, P Jin, Z Cheng, S Zhang, K Chen, Z Wang, C Liu, J Chen IJCAI 2023, 1044-1052, 2023	16	2023
LLaVA-o1: Let Vision Language Models Reason Step-by-Step G Xu, P Jin, L Hao, Y Song, L Sun, L Yuan arXiv preprint arXiv:2411.10440, 2024	14	2024
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter M Cao, H Tang, J Huang, P Jin, C Zhang, R Liu, L Chen, X Liang, L Yuan, ... ACL 2024 Findings, 2024	10	2024
FreestyleRet: Retrieving Images from Style-Diversified Queries H Li, C Jia, P Jin, Z Cheng, K Li, J Sui, C Liu, L Yuan ECCV 2024, 2024	9	2024
Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation K Li, Y Zhao, Z Wang, Z Cheng, P Jin, X Ji, L Yuan, C Liu, J Chen ICCV 2023, 666-676, 2023	8	2023
MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval H Tang, M Cao, J Huang, R Liu, P Jin, G Li, X Liang AAAI 2025, 2025	6	2025
LLMBind: A Unified Modality-Task Integration Framework B Zhu, P Jin, M Ning, B Lin, J Huang, Q Song, M Pan, L Yuan arXiv preprint arXiv:2402.14891, 2024	6	2024

El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.

Artículos 1–20

Citas por año

Citas duplicadas

Citas combinadas

Añadir coautoresCoautores

Seguir

Citado por

Coautores