元学习低秩适配组件：面向领域感知身份个性化的Meta-LoRA框架 (Meta-LoRA: Meta-Learning LoRA Components for Domain-Aware ID Personalization)

Recent advancements in text-to-image generative models, particularly latent diffusion models (LDMs), have demonstrated remarkable capabilities in synthesizing high-quality images from textual prompts. However, achieving identity personalization-ensuring that a model consistently generates subject-specific outputs from limited reference images-remains a fundamental challenge. To address this, we introduce Meta-Low-Rank Adaptation (Meta-LoRA), a novel framework that leverages meta-learning to encode domain-specific priors into LoRA-based identity personalization. Our method introduces a structured three-layer LoRA architecture that separates identity-agnostic knowledge from identity-specific adaptation. In the first stage, the LoRA Meta-Down layers are meta-trained across multiple subjects, learning a shared manifold that captures general identity-related features. In the second stage, only the LoRA-Mid and LoRA-Up layers are optimized to specialize on a given subject, significantly reducing adaptation time while improving identity fidelity. To evaluate our approach, we introduce Meta-PHD, a new benchmark dataset for identity personalization, and compare Meta-LoRA against state-of-the-art methods. Our results demonstrate that Meta-LoRA achieves superior identity retention, computational efficiency, and adaptability across diverse identity conditions. The code, model weights, and dataset will be released publicly upon acceptance.

翻译：近年来，文本到图像生成模型——尤其是潜在扩散模型（LDMs）——在根据文本提示合成高质量图像方面展现出卓越能力。然而，实现身份个性化（即确保模型能够基于有限参考图像持续生成特定主体的输出）仍然是一个根本性挑战。为此，我们提出元学习低秩适配（Meta-LoRA）框架，该框架利用元学习将领域特定先验知识编码至基于LoRA的身份个性化系统中。本方法构建了结构化的三层LoRA架构，将身份无关知识与身份特定适配进行分离。在第一阶段，LoRA元下层通过跨多个主体的元训练学习共享流形，以捕捉通用的身份相关特征。第二阶段仅优化LoRA中层与LoRA上层，使其专精于特定主体，在显著缩短适配时间的同时提升身份保真度。为评估本方法，我们构建了身份个性化新基准数据集Meta-PHD，并将Meta-LoRA与前沿方法进行对比。实验结果表明，Meta-LoRA在不同身份条件下均实现了更优的身份保持性、计算效率与适应能力。相关代码、模型权重及数据集将在论文录用后公开发布。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日