TimeChat: A time-sensitive multimodal large language model for long video understanding S Ren, L Yao, S Li, X Sun, L Hou Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 140 | 2024 |
M³IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning L Li, Y Yin, S Li, L Chen, P Wang, S Ren, M Li, Y Yang, J Xu, X Sun, ... arXiv preprint arXiv:2306.04387, 2023 | 108* | 2023 |
TempCompass: Do Video LLMs Really Understand Videos? Y Liu, S Li, Y Liu, Y Wang, S Ren, L Li, S Chen, X Sun, L Hou arXiv preprint arXiv:2403.00476, 2024 | 57 | 2024 |
FETV: A benchmark for fine-grained evaluation of open-domain text-to-video generation Y Liu, L Li, S Ren, R Gao, S Li, S Chen, X Sun, L Hou Advances in Neural Information Processing Systems 36, 62352-62387, 2023 | 56 | 2023 |
MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine R Zhang, X Wei, D Jiang, Z Guo, S Li, Y Zhang, C Tong, J Liu, A Zhou, ... arXiv e-prints, arXiv: 2407.08739, 2024 | 38 | 2024 |
TESTA: Temporal-spatial token aggregation for long-form video-language understanding S Ren, S Chen, S Li, X Sun, L Hou arXiv preprint arXiv:2310.19060, 2023 | 22 | 2023 |
CAPT: Contrastive Pre-Training for Learning Denoised Sequence Representations F Luo, P Yang, S Li, X Ren, X Sun arXiv preprint arXiv:2010.06351, 2020 | 20 | 2020 |
RECALL: A benchmark for LLMs robustness against external counterfactual knowledge Y Liu, L Huang, S Li, S Chen, H Zhou, F Meng, J Zhou, X Sun arXiv preprint arXiv:2311.08147, 2023 | 19 | 2023 |
VITATECS: A diagnostic dataset for temporal concept understanding of video-language models S Li, L Li, Y Liu, S Ren, Y Liu, R Gao, X Sun, L Hou European Conference on Computer Vision, 331-348, 2024 | 15 | 2024 |
DCA: Diversified co-attention towards informative live video commenting Z Zhang, Z Yin, S Ren, X Li, S Li Natural Language Processing and Chinese Computing: 9th CCF International …, 2020 | 14 | 2020 |
Multi-Granularity Contrasting for Cross-Lingual Pre-Training S Li, P Yang, F Luo, J Xie Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021 | 7 | 2021 |
Modal-adaptive Knowledge-enhanced Graph-based Financial Prediction from Monetary Policy Conference Calls with LLM K Ouyang, Y Liu, S Li, R Bao, K Harimoto, X Sun arXiv preprint arXiv:2403.16055, 2024 | 6 | 2024 |
No stock is an island: Learning internal and relational attributes of stocks with contrastive learning S Li, W Li, Z Zhang, R Bao, K Harimoto, X Sun Proceedings of the Fourth Workshop on Financial Technology and Natural …, 2022 | 6 | 2022 |
Rethinking denoised auto-encoding in language pre-training F Luo, P Yang, S Li, X Ren, X Sun, S Huang, F Huang Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 6 | 2021 |
Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT D Liu, S Li, Y Liu, Z Li, K Wang, X Li, Q Qin, Y Liu, Y Xin, Z Li, B Fu, C Si, ... arXiv preprint arXiv:2502.06782, 2025 | | 2025 |
Incremental Stock Volume Prediction with Gradient Distillation and Diversified Memory Selection S Li, Z Zhang, L Li, R Bao, K Harimoto, X Sun | | |