๐Ÿ’ช Strong OCR Capabilities. MiniCPM-Llama3-V 2.5 can proc... | ๐Ÿ’ช Strong OCR Capabilities. MiniCPM-Llama3-V 2.5 can proc...
๐Ÿ’ช Strong OCR Capabilities. MiniCPM-Llama3-V 2.5 can process images with any aspect ratio and up to 1.8 million pixels (e.g., 1344x1344), achieving an 700+ score on OCRBench, surpassing proprietary models such as GPT-4o, GPT-4V-0409, Qwen-VL-Max and Gemini Pro. Based on recent user feedback, MiniCPM-Llama3-V 2.5 has now enhanced full-text OCR extraction, table-to-markdown conversion, and other high-utility capabilities, and has further strengthened its instruction-following and complex reasoning abilities, enhancing multimodal interaction experiences.