Test-time interventions for language models can enhance factual accuracy, mitigate harmful outputs, and improve model efficiency without costly retraining. But despite a flood of new methods, different types of interventions are largely being developed in isolation. In practice, multiple interventions must often be applied sequentially to the same model, yet we lack standardized ways to study how interventions interact. We fill this gap by introducing composable interventions, a framework for studying the effects of applying multiple interventions to the same language model, featuring new metrics and a unified codebase. Using our framework, we conduct extensive experiments composing popular methods from three emerging intervention categories -- Knowledge Editing, Model Compression, and Machine Unlearning. Our results from 310 different compositions uncover meaningful interactions: compression hinders editing and unlearning, composing interventions hinges on their order of application, and popular general-purpose metrics are inadequate for assessing composability. Taken together, our findings showcase clear gaps in composability, suggesting a need for new multi-objective interventions. All of our code is public: https://github.com/hartvigsen-group/composable-interventions.