A-SDM: Accelerating Stable Diffusion through Model Assembly and Feature Inheritance Strategies

The Stable Diffusion Model (SDM) is a prevalent and effective model for text-to-image (T2I) and image-to-image (I2I) generation. Despite various attempts at sampler optimization, model distillation, and network quantification, these approaches typically maintain the original network architecture. The extensive parameter scale and substantial computational demands have limited research into adjusting the model architecture. This study focuses on reducing redundant computation in SDM and optimizes the model through both tuning and tuning-free methods. 1) For the tuning method, we design a model assembly strategy to reconstruct a lightweight model while preserving performance through distillation. Second, to mitigate performance loss due to pruning, we incorporate multi-expert conditional convolution (ME-CondConv) into compressed UNets to enhance network performance by increasing capacity without sacrificing speed. Third, we validate the effectiveness of the multi-UNet switching method for improving network speed. 2) For the tuning-free method, we propose a feature inheritance strategy to accelerate inference by skipping local computations at the block, layer, or unit level within the network structure. We also examine multiple sampling modes for feature inheritance at the time-step level. Experiments demonstrate that both the proposed tuning and the tuning-free methods can improve the speed and performance of the SDM. The lightweight model reconstructed by the model assembly strategy increases generation speed by $22.4%$, while the feature inheritance strategy enhances the SDM generation speed by $40.0%$.

翻译：稳定扩散模型（Stable Diffusion Model，SDM）是一种流行且有效的文本到图像（T2I）与图像到图像（I2I）生成模型。尽管已有多种采样器优化、模型蒸馏及网络量化方面的尝试，但这些方法通常保持原始网络架构不变。庞大的参数量级与巨大的计算需求限制了针对模型架构调整的研究。本研究聚焦于减少SDM中的冗余计算，并通过调优与非调优两种方法对模型进行优化。1）在调优方法方面，我们设计了一种模型组装策略，通过蒸馏在保持性能的同时重构轻量化模型。其次，为减轻剪枝带来的性能损失，我们将多专家条件卷积（ME-CondConv）引入压缩后的UNet中，通过在不牺牲速度的前提下增加网络容量来提升性能。第三，我们验证了多UNet切换方法对于提升网络速度的有效性。2）在非调优方法方面，我们提出了一种特征继承策略，通过在网络结构中的块级、层级或单元级跳过局部计算来加速推理。我们还研究了在时间步级别进行特征继承的多种采样模式。实验表明，所提出的调优与非调优方法均能提升SDM的速度与性能。通过模型组装策略重构的轻量化模型将生成速度提升了$22.4%$，而特征继承策略则将SDM的生成速度提升了$40.0%$。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日