MemReader: From Passive to Active Extraction for Long-Term Agent Memory

Authors: Jingyi Kang, Chunyu Li, Ding Chen, Bo Tang, Feiyu Xiong. Published: 2026-04-09

Abstract

Long-term memory is fundamental for personalized and autonomous agents, yet populating it remains a bottleneck. Existing systems treat memory extraction as a one-shot, passive transcription from context to structured entries, which struggles with noisy dialogue, missing references, and cross-turn dependencies, leading to memory pollution, low-value writes, and inconsistency. In this paper, we introduce the MemReader family for active long-term memory extraction in agent systems: MemReader-0.6B, a compact and cost-efficient passive extractor distilled for accurate and schema-consistent structured outputs, and MemReader-4B, an active extractor optimized with Group Relative Policy Optimization (GRPO) to make memory writing decisions. Under a ReAct-style paradigm, MemReader-4B explicitly evaluates whether the current interaction contains worth-writing information before writing, reducing unnecessary memory writes while maintaining memory quality.

Key Contributions

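MemReader Family: the first end-to-end system to turn memory extraction from passive transcription into active decision-making
MemReader-0.6B: a lightweight passive extractor, distilled from a larger model, for accurate and schema-consistent structured output
MemReader-4B: an active extractor that uses GRPO to optimize memory-write decisions, explicitly evaluating whether the current interaction contains information worth writing
Active Memory Writing: under a ReAct-style paradigm, the model explicitly decides whether to write to memory, which reduces memory pollution
Reduced memory pollution: the active decision mechanism filters out low-value writes and preserves memory quality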
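The difference between passive transcription and an active write decision can be illustrated with a minimal sketch. Everything here is hypothetical and is not the paper's implementation: `MemoryEntry`, `MemoryStore`, and the keyword rule in `active_extract` are placeholders, and the keyword rule only stands in for the learned policy that MemReader-4B is described as using.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class MemoryEntry:
    """One structured memory record (hypothetical schema)."""
    key: str
    value: str


@dataclass
class MemoryStore:
    """In-memory store that counts how many writes it receives."""
    entries: List[MemoryEntry] = field(default_factory=list)
    writes: int = 0

    def write(self, entry: MemoryEntry) -> None:
        self.entries.append(entry)
        self.writes += 1


# Stand-in for an extractor model: returns an entry, or None to skip the write.
Extractor = Callable[[str], Optional[MemoryEntry]]


def passive_extract(turn: str) -> Optional[MemoryEntry]:
    """Passive baseline: transcribe every turn into an entry."""
    return MemoryEntry(key="note", value=turn)


def active_extract(turn: str) -> Optional[MemoryEntry]:
    """Active variant: decide first, then write.

    The keyword rule is a toy placeholder for a learned write policy.
    """
    markers = ("my name is", "i prefer", "i live in")
    lowered = turn.lower()
    if not any(marker in lowered for marker in markers):
        return None  # decision: nothing worth writing
    return MemoryEntry(key="user_fact", value=turn)


def run(turns: List[str], extractor: Extractor) -> MemoryStore:
    """Feed each dialogue turn to the extractor and apply its decision."""
    store = MemoryStore()
    for turn in turns:
        entry = extractor(turn)
        if entry is not None:
            store.write(entry)
    return store


turns = ["Hi there!", "I prefer vegetarian food.", "ok thanks", "My name is Ana."]
passive = run(turns, passive_extract)
active = run(turns, active_extract)
print(passive.writes, active.writes)
```

In this toy run the passive extractor writes all four turns, while the active one skips the greeting and the acknowledgement. A real active extractor would replace the keyword rule with a model call that reasons about the turn before committing a write.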

Why It Matters

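Existing agent memory systems treat memory extraction as one-shot passive transcription. That approach copes poorly with noisy dialogue, missing references, and cross-turn dependencies, so memory fills up with pollution and low-value writes. MemReader is the first to recast memory writing as an active decision: the model itself evaluates whether the current interaction is worth remembering. This is a fundamental contribution to long-term quality control in agent memory systems.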

Relevance to On-Device / Mobile

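Lightweight models (0.6B/4B): the compact models, including the distilled 0.6B extractor, are suited to on-device deployment
Active write decisions: fewer unnecessary API calls and storage writes, which lowers bandwidth and storage consumption
GRPO optimization: the write policy learned through reinforcement learning could run efficiently under on-device resource constraints
Memory quality control: a direct reference point for memory management in mobile personal assistants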
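GRPO, named above as the training method for the write policy, scores each sampled output relative to the other samples drawn for the same input, so it needs no separate value model. The sketch below shows only that group-relative normalization. The reward values are made up for illustration, the use of the population standard deviation is an assumption, and none of this reflects the reward design used for MemReader-4B.

```python
from statistics import mean, pstdev
from typing import List


def group_relative_advantages(rewards: List[float], eps: float = 1e-8) -> List[float]:
    """Normalize rewards within one sampled group: (r - mean) / (std + eps)."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption of this sketch
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four sampled write/skip decisions on the same turn.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Samples that score above the group mean get a positive advantage and are reinforced, and those below the mean are discouraged. When every sample in a group receives the same reward, all advantages are zero and that group contributes no learning signal.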