A GPT-4V Level Multimodal LLM on Your Phone [2024.08.10]... | A GPT-4V Level Multimodal LLM on Your Phone [2024.08.10]...
A GPT-4V Level Multimodal LLM on Your Phone [2024.08.10] ๐Ÿš€๐Ÿš€๐Ÿš€ MiniCPM-Llama3-V 2.5 is now fully supported by official llama.cpp! GGUF models of various sizes are available here.
[2024.08.06] ๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅ We open-source MiniCPM-V 2.6, which outperforms GPT-4V on single image, multi-image and video understanding. It advances popular features of MiniCPM-Llama3-V 2.5, and can support real-time video understanding on iPad. Try it now!