基于人类反馈数据的文本到图像扩散模型对齐自动过滤方法 (Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models)

Fine-tuning text-to-image diffusion models with human feedback is an effective method for aligning model behavior with human intentions. However, this alignment process often suffers from slow convergence due to the large size and noise present in human feedback datasets. In this work, we propose FiFA, a novel automated data filtering algorithm designed to enhance the fine-tuning of diffusion models using human feedback datasets with direct preference optimization (DPO). Specifically, our approach selects data by solving an optimization problem to maximize three components: preference margin, text quality, and text diversity. The concept of preference margin is used to identify samples that are highly informative in addressing the noisy nature of feedback dataset, which is calculated using a proxy reward model. Additionally, we incorporate text quality, assessed by large language models to prevent harmful contents, and consider text diversity through a k-nearest neighbor entropy estimator to improve generalization. Finally, we integrate all these components into an optimization process, with approximating the solution by assigning importance score to each data pair and selecting the most important ones. As a result, our method efficiently filters data automatically, without the need for manual intervention, and can be applied to any large-scale dataset. Experimental results show that FiFA significantly enhances training stability and achieves better performance, being preferred by humans 17% more, while using less than 0.5% of the full data and thus 1% of the GPU hours compared to utilizing full human feedback datasets.

翻译：利用人类反馈数据微调文本到图像扩散模型，是使模型行为与人类意图对齐的有效方法。然而，由于人类反馈数据集规模庞大且存在噪声，该对齐过程往往收敛缓慢。本文提出FiFA，一种新颖的自动数据过滤算法，旨在通过直接偏好优化（DPO）利用人类反馈数据集增强扩散模型的微调效果。具体而言，我们的方法通过求解一个优化问题来选择数据，以最大化三个组成部分：偏好边际、文本质量和文本多样性。偏好边际的概念用于识别在应对反馈数据集噪声特性方面信息量高的样本，其计算通过一个代理奖励模型实现。此外，我们引入由大型语言模型评估的文本质量以防止有害内容，并通过k近邻熵估计器考虑文本多样性以提升泛化能力。最后，我们将所有这些组成部分整合到一个优化过程中，通过为每个数据对分配重要性分数并选择最重要的数据对来近似求解。因此，我们的方法能够高效地自动过滤数据，无需人工干预，并可应用于任何大规模数据集。实验结果表明，FiFA显著提升了训练稳定性并获得了更优的性能，在人类偏好评估中胜出率高出17%，同时仅使用不到完整数据的0.5%，从而相比使用完整人类反馈数据集，仅需约1%的GPU计算时数。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

31+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日