Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community

from arxiv, 16 pages with 10 figures, accepted at ICLR 2024 as a spotlight, codes can be accessed at https://github.com/Picsart-AI-Research/Social-Reward

Social reward as a form of community recognition provides a strong source of motivation for users of online platforms to engage and contribute with content. The recent progress of text-conditioned image synthesis has ushered in a collaborative era where AI empowers users to craft original visual artworks seeking community validation. Nevertheless, assessing these models in the context of collective community preference introduces distinct challenges. Existing evaluation methods predominantly center on limited size user studies guided by image quality and prompt alignment. This work pioneers a paradigm shift, unveiling Social Reward - an innovative reward modeling framework that leverages implicit feedback from social network users engaged in creative editing of generated images. We embark on an extensive journey of dataset curation and refinement, drawing from Picsart: an online visual creation and editing platform, yielding a first million-user-scale dataset of implicit human preferences for user-generated visual art named Picsart Image-Social. Our analysis exposes the shortcomings of current metrics in modeling community creative preference of text-to-image models' outputs, compelling us to introduce a novel predictive model explicitly tailored to address these limitations. Rigorous quantitative experiments and user study show that our Social Reward model aligns better with social popularity than existing metrics. Furthermore, we utilize Social Reward to fine-tune text-to-image models, yielding images that are more favored by not only Social Reward, but also other established metrics. These findings highlight the relevance and effectiveness of Social Reward in assessing community appreciation for AI-generated artworks, establishing a closer alignment with users' creative goals: creating popular visual art. Codes can be accessed at https://github.com/Picsart-AI-Research/Social-Reward

翻译：社交奖励作为一种社区认可形式，为在线平台用户参与和贡献内容提供了强大的激励。近期文本条件图像合成领域的进展开启了协作时代，人工智能赋能用户创作需寻求社区验证的原创视觉艺术作品。然而，在集体社区偏好情境下评估此类模型带来了独特挑战。现有评估方法主要聚焦于有限规模用户研究，其导向标准为图像质量与提示对齐度。本研究开创范式转变，提出"社交奖励"——一种创新性奖励建模框架，利用参与生成图像创意编辑的社交网络用户的隐式反馈。我们基于在线视觉创作与编辑平台Picsart展开大规模数据集构建与精炼工作，首次产出百万用户量级的隐式人类偏好数据集（命名为Picsart Image-Social），用于评估用户生成视觉艺术。分析揭示了现有指标在建模文本到图像模型输出的社区创意偏好方面的缺陷，促使我们提出专为克服这些局限而设计的新型预测模型。严格的量化实验与用户研究表明，我们的社交奖励模型在社交流行度对齐方面优于现有指标。此外，我们利用社交奖励微调文本到图像模型，产出的图像不仅更受社交奖励模型青睐，也更符合其他成熟指标的评估标准。这些发现凸显了社交奖励在评估AI生成艺术社区认可度方面的相关性与有效性，使评估更贴近用户创作目标——创作流行视觉艺术。代码可通过https://github.com/Picsart-AI-Research/Social-Reward获取。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/