Sicong Leng

Cited by

	All	Since 2020
Citations	539	539
h-index	7	7
i10-index	5	5

Citations per year: 2021: 13, 2022: 48, 2023: 67, 2024: 331, 2025: 79

Public access

3 articles available
0 articles not available

Based on funding mandates

Co-authors

Xin Li - Alibaba Group - Verified email at se.cuhk.edu.hk
Lidong Bing - Shanda Group, Alibaba DAMO, Tencent, CMU, CUHK - Verified email at alibaba-inc.com
Hang Zhang - Qwen Team; Zhejiang University; Sichuan University - Verified email at stu.scu.edu.cn
Wei Lu - Singapore University of Technology and Design - Verified email at sutd.edu.sg
Guoshun Nan - Professor of Beijing University of Posts and Telecommunications - Verified email at bupt.edu.cn
Shijian Lu - College of Computing and Data Science, NTU - Verified email at ntu.edu.sg
Hao Zhang - Alibaba DAMO Academy, Noah’s Ark Lab, A*STAR, NTU - Verified email at e.ntu.edu.sg
Chunyan Miao - Nanyang Technological University - Verified email at ntu.edu.sg
Rui Qiao - PhD Student, National University of Singapore - Verified email at u.nus.edu
Jun Liu - Professor, Lancaster University - Verified email at lancaster.ac.uk


Sicong Leng

Nanyang Technological University & Alibaba DAMO Academy

Verified email at e.ntu.edu.sg

Multi-modal Learning


Title	Cited by	Year
Interventional video grounding with dual contrastive learning. G Nan, R Qiao, Y Xiao, J Liu, S Leng, H Zhang, W Lu. CVPR 2021, 2021	174	2021
Mitigating object hallucinations in large vision-language models through visual contrastive decoding. S Leng, H Zhang, G Chen, X Li, S Lu, C Miao, L Bing. CVPR 2024, 2024	158	2024
VideoLLaMA 2: Advancing spatial-temporal modeling and audio understanding in Video-LLMs. Z Cheng, S Leng, H Zhang, Y Xin, X Li, G Chen, Y Zhu, W Zhang, Z Luo, ... arXiv preprint arXiv:2406.07476, 2024	155	2024
Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly. XT Hang Du, Sicheng Zhang, Binzhu Xie, Guoshun Nan, Jiayang Zhang, Junrui ... CVPR 2024, 2024	15*	2024
AGLA: Mitigating object hallucinations in large vision-language models with assembly of global and local attention. W An, F Tian, S Leng, J Nie, H Lin, QY Wang, G Dai, P Chen, S Lu. arXiv preprint arXiv:2406.12718, 2024	12	2024
Speaker-oriented latent structures for dialogue-based relation extraction. G Nan, G Luo, S Leng, Y Xiao, W Lu. arXiv preprint arXiv:2109.05182, 2021	9	2021
Tell2Design: A Dataset for Language-Guided Floor Plan Generation. S Leng, Y Zhou, MH Dupty, WS Lee, SC Joyce, W Lu. ACL 2023, Area Chair Award, 2023	7	2023
The curse of multi-modalities: Evaluating hallucinations of large multimodal models across language, visual, and audio. S Leng, Y Xing, Z Cheng, Y Zhou, H Zhang, X Li, D Zhao, S Lu, C Miao, ... arXiv preprint arXiv:2410.12787, 2024	5	2024
Constrained Layout Generation with Factor Graphs. MH Dupty, Y Dong, S Leng, G Fu, YL Goh, W Lu, WS Lee. CVPR 2024, 2024	4	2024
Refining Positive and Toxic Samples for Dual Safety Self-Alignment of LLMs with Minimal Human Interventions. J Xu, G Nan, S Guan, S Leng, Y Liu, Z Wang, Y Ma, Z Zhou, Y Hou, X Tao. arXiv preprint arXiv:2502.08657, 2025		2025
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding. B Zhang, K Li, Z Cheng, Z Hu, Y Yuan, G Chen, S Leng, Y Jiang, H Zhang, ... arXiv preprint arXiv:2501.13106, 2025		2025
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss. Z Cheng, H Zhang, K Li, S Leng, Z Hu, F Wu, D Zhao, X Li, L Bing. arXiv preprint arXiv:2410.17243, 2024		2024
BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays. Y Zhou, T Faith, Y Xu, S Leng, X Xu, Y Liu, RSM Goh. NeurIPS 2024, 2024		2024


Articles 1–13
