Mmmu: A massive multi-discipline multimodal understanding and reasoning benchmark for expert agi X Yue, Y Ni, K Zhang, T Zheng, R Liu, G Zhang, S Stevens, D Jiang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 590 | 2024 |
Mmlu-pro: A more robust and challenging multi-task language understanding benchmark Y Wang, X Ma, G Zhang, Y Ni, A Chandra, S Guo, W Ren, A Arulraj, X He, ... NeurIPS 2024 Datasets and Benchmarks Track, 2024 | 156* | 2024 |
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation W Ren, H Yang, G Zhang, C Wei, X Du, W Huang, W Chen Transactions on Machine Learning Research, 2024 | 48 | 2024 |
AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks M Ku, C Wei, W Ren, H Yang, W Chen Transactions on Machine Learning Research, 2024 | 37* | 2024 |
Video Diffusion Models: A Survey A Melnik, M Ljubljanac, C Lu, Q Yan, W Ren, H Ritter Transactions on Machine Learning Research, 2024 | 6 | 2024 |
Hicu: Leveraging hierarchy for curriculum learning in automated icd coding W Ren, R Zeng, T Wu, T Zhu, RG Krishnan Machine Learning for Healthcare Conference, 198-223, 2022 | 5 | 2022 |
StructLM: Towards Building Generalist Models for Structured Knowledge Grounding A Zhuang, G Zhang, T Zheng, X Du, J Wang, W Ren, SW Huang, J Fu, ... COLM 2024, 2024 | 4 | 2024 |
Towards transformer-based automated icd coding: Challenges pitfalls and solutions W Ren, T Zhu, R Zeng, T Wu | 2 | 2021 |
VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation W Ren, H Yang, J Min, C Wei, W Chen arXiv preprint arXiv:2412.00927, 2024 | | 2024 |
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision C Wei, Z Xiong, W Ren, X Du, G Zhang, W Chen ICLR 2025, 2024 | | 2024 |