Lamm: Language-assisted multi-modal instruction-tuning dataset, framework, and benchmark Z Yin*, J Wang*, J Cao*, Z Shi*, D Liu, M Li, X Huang, Z Wang, L Sheng, ... Advances in Neural Information Processing Systems 36, 2024 | 148 | 2024 |
Danceformer: Music conditioned 3d dance generation with parametric motion transformer B Li, Y Zhao, Z Shi, L Sheng Proceedings of the AAAI Conference on Artificial Intelligence 36 (2), 1272-1279, 2022 | 122 | 2022 |
DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion Transformer B Li, Y Zhao, Z Shi, L Sheng arXiv preprint arXiv:2103.10206, 2021 | 25 | 2021 |
Openmmlab 3d human parametric model toolbox and benchmark M Contributors | 21 | 2021 |
Assessment of multimodal large language models in alignment with human values Z Shi*, Z Wang*, H Fan*, Z Zhang, L Li, Y Zhang, Z Yin, L Sheng, Y Qiao, ... arXiv preprint arXiv:2403.17830, 2024 | 16 | 2024 |
From gpt-4 to gemini and beyond: Assessing the landscape of mllms on generalizability, trustworthiness and causality through four modalities C Lu, C Qian, G Zheng, H Fan, H Gao, J Zhang, J Shao, J Deng, J Fu, ... arXiv preprint arXiv:2401.15071, 2024 | 14 | 2024 |
Chef: A comprehensive evaluation framework for standardized assessment of multimodal large language models Z Shi*, Z Wang*, H Fan*, Z Yin, L Sheng, Y Qiao, J Shao arXiv preprint arXiv:2311.02692, 2023 | 9 | 2023 |
Worldsimbench: Towards video generation models as world simulators Y Qin, Z Shi, J Yu, X Wang, E Zhou, L Li, Z Yin, X Liu, L Sheng, J Shao, ... arXiv preprint arXiv:2410.18072, 2024 | 5 | 2024 |
LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset Z Yin*, J Wang*, J Cao*, Z Shi*, D Liu, M Li, L Sheng, L Bai, X Huang, ... Framework, and Benchmark, 1-37, 2023 | 4 | 2023 |
RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents Z Chen*, Z Shi*, X Lu*, L He*, S Qian, HS Fang, Z Yin, W Ouyang, J Shao, ... arXiv preprint arXiv:2403.19622, 2024 | 3 | 2024 |
T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation L Li, Z Shi, X Hu, B Dong, Y Qin, X Liu, L Sheng, J Shao arXiv preprint arXiv:2501.12612, 2025 | | 2025 |
A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs L He, Z Chen, Z Shi, T Yu, J Shao, L Sheng arXiv preprint arXiv:2411.17265, 2024 | | 2024 |
Danceformer: Music conditioned 3d dance generation with parametric motion transformer B Li, Y Zhao, S Zhelun, L Sheng Proceedings of the AAAI Conference on Artificial Intelligence 36 (2), 1272-1279, 2022 | | 2022 |
Benchmarking Ethics in Text-to-Image Models: A Holistic Dataset and Evaluator for Fairness, Toxicity, and Privacy L Li, Z Shi, X Hu, B Dong, Y Qin, X Liu, L Sheng, J Shao | | |
RH20T-P: A Primitive-Level Robotic Manipulation Dataset Towards Composable Generalization Agents in Real-world Scenarios Z Chen, Z Shi, X Lu, L He, S Qian, Z Yin, W Ouyang, J Shao, Y Qiao, C Lu, ... NeurIPS 2024 Workshop on Open-World Agents, 0 | | |