InternLM-XComposer2. 5-Reward: A Simple Yet Effective Multi-Modal Reward Model
Despite the promising performance of Large Vision Language Models (LVLMs) in visual
understanding, they occasionally generate incorrect outputs. While reward models (RMs) …
understanding, they occasionally generate incorrect outputs. While reward models (RMs) …