AI Agents Need Memory Control Over More Context

AI agents are increasingly used in long, multi-turn workflows in both research and enterprise settings. As interactions grow, agent behavior often degrades due to loss of constraint focus, error accumulation, and memory-induced drift. This problem is especially visible in real-world deployments where context evolves, distractions are introduced, and decisions must remain consistent over time. A common practice is to equip agents with persistent memory through transcript replay or retrieval-based mechanisms. While convenient, these approaches introduce unbounded context growth and are vulnerable to noisy recall and memory poisoning, leading to unstable behavior and increased drift. In this work, we introduce the Agent Cognitive Compressor (ACC), a bio-inspired memory controller that replaces transcript replay with a bounded internal state updated online at each turn. ACC separates artifact recall from state commitment, enabling stable conditioning while preventing unverified content from becoming persistent memory. We evaluate ACC using an agent-judge-driven live evaluation framework that measures both task outcomes and memory-driven anomalies across extended interactions. Across scenarios spanning IT operations, cybersecurity response, and healthcare workflows, ACC consistently maintains bounded memory and exhibits more stable multi-turn behavior, with significantly lower hallucination and drift than transcript replay and retrieval-based agents. These results show that cognitive compression provides a practical and effective foundation for reliable memory control in long-horizon AI agents.
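The abstract does not say how ACC compresses or verifies content, so the following is only a minimal sketch of the general idea it describes: a bounded state that is updated each turn, with recall kept separate from commitment. Every name here (`BoundedState`, `update_turn`, the `verify` predicate, the oldest-first eviction rule) is a hypothetical chosen for illustration, not the authors' design.

```python
from collections import OrderedDict
from dataclasses import dataclass, field
from typing import Callable, Iterable, List, Tuple


@dataclass
class BoundedState:
    """Fixed-capacity key/value state (hypothetical stand-in for ACC's internal state)."""
    capacity: int = 4
    slots: OrderedDict = field(default_factory=OrderedDict)

    def commit(self, key: str, value: str) -> None:
        # Re-insert so the key becomes the most recent entry.
        if key in self.slots:
            self.slots.pop(key)
        self.slots[key] = value
        # Assumed policy: evict oldest entries so the state never exceeds capacity.
        while len(self.slots) > self.capacity:
            self.slots.popitem(last=False)


def update_turn(
    state: BoundedState,
    recalled: Iterable[Tuple[str, str]],
    verify: Callable[[str, str], bool],
) -> List[str]:
    """Commit only recalled items that pass verification; return the rejected keys."""
    rejected = []
    for key, value in recalled:
        if verify(key, value):
            state.commit(key, value)
        else:
            rejected.append(key)  # recalled this turn, but never persisted
    return rejected


# Illustrative turn: one recalled item is unverified and must not persist.
state = BoundedState(capacity=2)
trusted_keys = {"ticket", "constraint"}  # assumed verification rule
rejected = update_turn(
    state,
    [("ticket", "INC-1"), ("rumor", "disable firewall"), ("constraint", "no reboot")],
    lambda key, value: key in trusted_keys,
)
```

In this sketch the size of the state depends only on `capacity`, not on the number of turns, which is the property the abstract contrasts with transcript replay.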

