Recent progress in text-to-image models pretrained on large-scale datasets has made it possible to generate a wide variety of images given only a text prompt describing the desired content. Nevertheless, these models remain limited when the target images belong to a specific domain that is either hard to describe or unseen by the models. In this work, we propose DomainGallery, a few-shot domain-driven image generation method that finetunes pretrained Stable Diffusion on few-shot target datasets in an attribute-centric manner. Specifically, DomainGallery features prior attribute erasure, attribute disentanglement, regularization and enhancement. These techniques are tailored to few-shot domain-driven generation in order to solve key issues that previous works have failed to settle. Extensive experiments validate the superior performance of DomainGallery on a variety of domain-driven generation scenarios. Code is available at https://github.com/Ldhlwh/DomainGallery.