This work addresses the challenge of personalized question answering in long-term human-machine interactions: when conversational history spans weeks or months and exceeds the context window, existing personalization mechanisms struggle to continuously absorb and leverage users' incremental concepts, aliases, and preferences. Current personalized multimodal models are predominantly static-concepts are fixed at initialization and cannot evolve during interactions. We propose M2A, an agentic dual-layer hybrid memory system that maintains personalized multimodal information through online updates. The system employs two collaborative agents: ChatAgent manages user interactions and autonomously decides when to query or update memory, while MemoryManager breaks down memory requests from ChatAgent into detailed operations on the dual-layer memory bank, which couples a RawMessageStore (immutable conversation log) with a SemanticMemoryStore (high-level observations), providing memories at different granularities. In addition, we develop a reusable data synthesis pipeline that injects concept-grounded sessions from Yo'LLaVA and MC-LLaVA into LoCoMo long conversations while preserving temporal coherence. Experiments show that M2A significantly outperforms baselines, demonstrating that transforming personalization from one-shot configuration to a co-evolving memory mechanism provides a viable path for high-quality individualized responses in long-term multimodal interactions. The code is available at https://github.com/Little-Fridge/M2A.
翻译:本研究致力于解决长期人机交互中的个性化问答挑战:当对话历史跨越数周甚至数月并超出上下文窗口时,现有个性化机制难以持续吸收和利用用户不断增长的概念、别名和偏好。当前的多模态个性化模型主要为静态模型——概念在初始化时固定,无法在交互过程中演化。我们提出M2A,一种具备双层混合记忆的智能体系统,通过在线更新维护个性化多模态信息。该系统采用两个协同工作的智能体:ChatAgent负责管理用户交互并自主决定何时查询或更新记忆;MemoryManager则将ChatAgent的记忆请求分解为对双层记忆库的详细操作。该记忆库耦合了RawMessageStore(不可变的对话日志)与SemanticMemoryStore(高层级观察),提供不同粒度的记忆。此外,我们开发了一个可复用的数据合成流程,将来自Yo'LLaVA和MC-LLaVA的概念锚定会话注入LoCoMo长对话中,同时保持时间连贯性。实验表明,M2A显著优于基线模型,证明将个性化从一次性配置转变为协同演化的记忆机制,为长期多模态交互中实现高质量个体化响应提供了可行路径。代码发布于https://github.com/Little-Fridge/M2A。