๐Ÿ† Trustworthy Behavior. Leveraging the latest RLAIF-V me... | ๐Ÿ† Trustworthy Behavior. Leveraging the latest RLAIF-V me...
๐Ÿ† Trustworthy Behavior. Leveraging the latest RLAIF-V method (the newest technology in the RLHF-V [CVPR'24] series), MiniCPM-Llama3-V 2.5 exhibits more trustworthy behavior. It achieves 10.3% hallucination rate on Object HalBench, lower than GPT-4V-1106 (13.6%), achieving the best-level performance within the open-source community. Data released.