FaceLiVTv2: 移动端轻量人脸识别
type: concept tags: [mobile, face-recognition, hybrid-architecture, cnn-transformer, edge, 其他] related: [[fastshade-mobile-denoising]], [[on-device-inference-memory-pressure]], [[coremltools-9]] sources: - http://arxiv.org/abs/2604.09127v1 created: 2026-04-14
核心问题¶
This paper introduces FaceLiVT, a lightweight yet powerful face recognition model that integrates a hybrid Convolution Neural Network (CNN)-Transformer architecture with an innovative and lightweight Multi-Head Linear Attention (MHLA) mechanism. By combining MHLA alongside a reparameterized token mixer, FaceLiVT effectively reduces computational complexity while preserving competitive accuracy. Extensive evaluations on challenging benchmarks; including LFW, CFP-FP, AgeDB-30, IJB-B, and IJB-C; highlight its superior performance compared to state-of-the-art lightweight models. MHLA notably impro
论文信息¶
- 标题: FaceLiVT: Face Recognition using Linear Vision Transformer with Structural Reparameterization For Mobile Device
- 作者: Novendra Setyawan, Chi-Chia Sun, Mao-Hsiu Hsu
- 来源: arXiv
方法/架构¶
详细方法论待补充。参考原始论文获取完整技术细节。
为什么重要¶
作为手机端 AIOS 生态的一部分,FaceLiVTv2:移动端高效人脸识别 对推动端侧 AI 落地具有重要意义。
关联¶
- [[clawmobile-agentic]] — Agent 系统架构
- [[kv-cache-quantization-ondevice]] — 内存优化