Følg
Sicong Leng
Sicong Leng
Nanyang Technological University & Alibaba DAMO Academy
Verificeret mail på e.ntu.edu.sg - Startside
Titel
Citeret af
Citeret af
År
Interventional video grounding with dual contrastive learning
G Nan, R Qiao, Y Xiao, J Liu, S Leng, H Zhang, W Lu
CVPR 2021, 2021
1742021
Mitigating object hallucinations in large vision-language models through visual contrastive decoding
S Leng, H Zhang, G Chen, X Li, S Lu, C Miao, L Bing
CVPR 2024, 2024
1582024
Videollama 2: Advancing spatial-temporal modeling and audio understanding in video-llms
Z Cheng, S Leng, H Zhang, Y Xin, X Li, G Chen, Y Zhu, W Zhang, Z Luo, ...
arXiv preprint arXiv:2406.07476, 2024
1552024
Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly
XT Hang Du , Sicheng Zhang , Binzhu Xie, Guoshun Nan, Jiayang Zhang, Junrui ...
CVPR 2024, 2024
15*2024
Agla: Mitigating object hallucinations in large vision-language models with assembly of global and local attention
W An, F Tian, S Leng, J Nie, H Lin, QY Wang, G Dai, P Chen, S Lu
arXiv preprint arXiv:2406.12718, 2024
122024
Speaker-oriented latent structures for dialogue-based relation extraction
G Nan, G Luo, S Leng, Y Xiao, W Lu
arXiv preprint arXiv:2109.05182, 2021
92021
Tell2Design: A Dataset for Language-Guided Floor Plan Generation
S Leng, Y Zhou, MH Dupty, WS Lee, SC Joyce, W Lu
ACL 2023, Area Chair Award, 2023
72023
The curse of multi-modalities: Evaluating hallucinations of large multimodal models across language, visual, and audio
S Leng, Y Xing, Z Cheng, Y Zhou, H Zhang, X Li, D Zhao, S Lu, C Miao, ...
arXiv preprint arXiv:2410.12787, 2024
52024
Constrained Layout Generation with Factor Graphs
MH Dupty, Y Dong, S Leng, G Fu, YL Goh, W Lu, WS Lee
CVPR 2024, 2024
42024
Refining Positive and Toxic Samples for Dual Safety Self-Alignment of LLMs with Minimal Human Interventions
J Xu, G Nan, S Guan, S Leng, Y Liu, Z Wang, Y Ma, Z Zhou, Y Hou, X Tao
arXiv preprint arXiv:2502.08657, 2025
2025
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding
B Zhang, K Li, Z Cheng, Z Hu, Y Yuan, G Chen, S Leng, Y Jiang, H Zhang, ...
arXiv preprint arXiv:2501.13106, 2025
2025
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss
Z Cheng, H Zhang, K Li, S Leng, Z Hu, F Wu, D Zhao, X Li, L Bing
arXiv preprint arXiv:2410.17243, 2024
2024
BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays
Y Zhou, T Faith, Y Xu, S Leng, X Xu, Y Liu, RSM Goh
NeurIPS 2024, 2024
2024
Systemet kan ikke foretage handlingen nu. Prøv igen senere.
Artikler 1–13