Model editing aims to correct inaccurate knowledge, update outdated information, and incorporate new data into Large Language Models (LLMs) without retraining. This task is particularly challenging in lifelong scenarios, where edits must be applied continuously to serve real-world applications. While some editors demonstrate strong robustness for lifelong editing in pure LLMs, they cannot be directly applied to Vision LLMs (VLLMs), which incorporate an additional vision modality. In this paper, we propose LiveEdit, a LIfelong Vision language modEl Editing framework, to bridge the gap between lifelong LLM editing and VLLMs. We first train an editing expert generator that independently produces a low-rank expert for each editing instance, with the goal of correcting the relevant responses of the VLLM. During inference on the post-edited model, a hard filtering mechanism exploits visual semantic knowledge to coarsely eliminate experts that are visually irrelevant to the input query. Finally, to integrate the remaining visually relevant experts, we introduce a soft routing mechanism based on textual semantic relevance that fuses multiple experts. For evaluation, we establish a benchmark for lifelong VLLM editing. Extensive experiments demonstrate that LiveEdit offers significant advantages in lifelong VLLM editing scenarios, and further experiments validate the rationality and effectiveness of each module design in LiveEdit.
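To make the pipeline concrete, the following is a minimal sketch of the generate–filter–route–fuse flow described above. All names (`make_expert`, `live_edit_forward`), the similarity measure (cosine), the threshold `tau`, and the softmax temperature `temp` are illustrative assumptions, not the authors' actual implementation.

```python
import numpy as np

rng = np.random.default_rng(0)
D, R = 8, 2  # hidden size and expert rank (illustrative; rank R << D)

def make_expert():
    # Hypothetical stand-in for the editing expert generator: each editing
    # instance gets its own low-rank adapter, parameterized as B @ A.
    A = rng.normal(scale=0.1, size=(R, D))
    B = rng.normal(scale=0.1, size=(D, R))
    return A, B

def cos(u, v):
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-9))

# Stored edits: (visual key, textual key, expert) per editing instance.
edits = [(rng.normal(size=D), rng.normal(size=D), make_expert())
         for _ in range(5)]

def live_edit_forward(h, v_query, t_query, tau=0.0, temp=0.1):
    # 1) Hard filtering: coarsely drop experts whose visual key is
    #    irrelevant to the query image (assumed cosine threshold tau).
    kept = [(t_key, exp) for v_key, t_key, exp in edits
            if cos(v_query, v_key) > tau]
    if not kept:
        return h  # no relevant edit: leave the hidden state untouched
    # 2) Soft routing: weight surviving experts by textual relevance.
    sims = np.array([cos(t_query, t_key) for t_key, _ in kept])
    w = np.exp(sims / temp)
    w /= w.sum()
    # 3) Multi-expert fusion: weighted sum of low-rank updates on h.
    delta = sum(wi * (B @ (A @ h)) for wi, (_, (A, B)) in zip(w, kept))
    return h + delta
```

When the visual filter rejects every stored expert, the hidden state passes through unchanged, so queries unrelated to any edit are unaffected by editing.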