MP5: A multi-modal open-ended embodied system in Minecraft via active perception; Y Qin, E Zhou, Q Liu, Z Yin, L Sheng, R Zhang, Y Qiao, J Shao; 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR …, 2024 | 24* | 2024 |
MineDreamer: Learning to follow instructions via chain-of-imagination for simulated-world control; E Zhou, Y Qin, Z Yin, Y Huang, R Zhang, L Sheng, Y Qiao, J Shao; Advances in Neural Information Processing Systems 37 (NeurIPS 2024 …, 2024 | 20 | 2024 |
WorldSimBench: Towards video generation models as world simulators; Y Qin, Z Shi, J Yu, X Wang, E Zhou, L Li, Z Yin, X Liu, L Sheng, J Shao, ...; arXiv preprint arXiv:2410.18072, 2024 | 5 | 2024 |
AGFSync: Leveraging AI-generated feedback for preference optimization in text-to-image generation; J An, Y Zhu, Z Li, E Zhou, H Feng, X Huang, B Chen, Y Shi, C Pan; Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2025, 2024 | 1 | 2024 |
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection; E Zhou, Q Su, C Chi, Z Zhang, Z Wang, T Huang, L Sheng, H Wang; arXiv preprint arXiv:2412.04455, 2024 | | 2024 |