In product advertising applications, the automated inpainting of backgrounds utilizing AI techniques in product images has emerged as a significant task. However, the techniques still suffer from issues such as inappropriate background and inconsistent product in generated product images, and existing approaches for evaluating the quality of generated product images are mostly inconsistent with human feedback causing the evaluation for this task to depend on manual annotation. To relieve the issues above, this paper proposes Human Feedback and Product Consistency (HFPC), which can automatically assess the generated product images based on two modules. Firstly, to solve inappropriate backgrounds, human feedback on 44,000 automated inpainting product images is collected to train a reward model based on multi-modal features extracted from BLIP and comparative learning. Secondly, to filter generated product images containing inconsistent products, a fine-tuned segmentation model is employed to segment the product of the original and generated product images and then compare the differences between the above two. Extensive experiments have demonstrated that HFPC can effectively evaluate the quality of generated product images and significantly reduce the expense of manual annotation. Moreover, HFPC achieves state-of-the-art(96.4% in precision) in comparison to other open-source visual-quality-assessment models. Dataset and code are available at: https://github.com/created-Bi/background_inpainting_products_dataset
翻译:在商品广告应用中,利用人工智能技术对商品图像进行自动化背景修复已成为一项重要任务。然而,现有技术仍存在生成商品图像背景不当、产品不一致等问题,且当前评估生成商品图像质量的方法大多与人类反馈不一致,导致该任务的评估依赖人工标注。为缓解上述问题,本文提出基于人类反馈与产品一致性的评估框架,该框架通过两个模块自动评估生成的商品图像。首先,为解决背景不当问题,我们收集了44,000张自动化修复商品图像的人类反馈数据,基于BLIP提取的多模态特征并通过对比学习训练奖励模型。其次,为筛选包含不一致产品的生成图像,采用微调的分割模型对原始商品图像与生成图像中的产品进行分割,进而比较两者的差异。大量实验表明,HFPC能有效评估生成商品图像的质量,并显著降低人工标注成本。此外,与其他开源视觉质量评估模型相比,HFPC达到了最优性能(精确度96.4%)。数据集与代码已开源:https://github.com/created-Bi/background_inpainting_products_dataset