MiniCPM 2.4.2
type: entity tags: [model, on-device, openbmb, quantization, multimodal] related: [[gemma4-ondevice]], [[ggml-llamacpp-hf]], [[on-device-inference-memory-pressure]] sources: - https://github.com/OpenBMB/MiniCPM/releases/tag/2.4.2 created: 2026-04-14
核心问题¶
The recent surge of Multimodal Large Language Models (MLLMs) has fundamentally reshaped the landscape of AI research and industry, shedding light on a promising path toward the next AI milestone. However, significant challenges remain preventing MLLMs from being practical in real-world applications. The most notable challenge comes from the huge cost of running an MLLM with a massive number of parameters and extensive computation. As a result, most MLLMs need to be deployed on high-performing cloud servers, which greatly limits their application scopes such as mobile, offline, energy-sensitive
论文信息¶
- 标题: MiniCPM-V: A GPT-4V Level MLLM on Your Phone
- 作者: Yuan Yao, Tianyu Yu, Ao Zhang
- 来源: arXiv
方法/架构¶
详细方法论待补充。参考原始论文获取完整技术细节。
为什么重要¶
作为手机端 AIOS 生态的一部分,MiniCPM 2.4.2 对推动端侧 AI 落地具有重要意义。
关联¶
- [[clawmobile-agentic]] — Agent 系统参考
- [[mnn-350]] — 推理引擎