InternLM-XComposer2. 5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Y Zang, X Dong, P Zhang, Y Cao, Z Liu, S Ding… - arxiv preprint arxiv …, 2025 - arxiv.org
Despite the promising performance of Large Vision Language Models (LVLMs) in visual
understanding, they occasionally generate incorrect outputs. While reward models (RMs) …