Sentinel-Guided Zero-Shot Learning: A Collaborative Paradigm without Real Data Exposure

With increasing concerns over data privacy and model copyrights, especially in the context of collaborations between AI service providers and data owners, an innovative SG-ZSL paradigm is proposed in this work. SG-ZSL is designed to foster efficient collaboration without the need to exchange models or sensitive data. It consists of a teacher model, a student model and a generator that links both model entities. The teacher model serves as a sentinel on behalf of the data owner, replacing real data, to guide the student model at the AI service provider's end during training. Considering the disparity of knowledge space between the teacher and student, we introduce two variants of the teacher model: the omniscient and the quasi-omniscient teachers. Under these teachers' guidance, the student model seeks to match the teacher model's performance and explores domains that the teacher has not covered. To trade off between privacy and performance, we further introduce two distinct security-level training protocols: white-box and black-box, enhancing the paradigm's adaptability. Despite the inherent challenges of real data absence in the SG-ZSL paradigm, it consistently outperforms in ZSL and GZSL tasks, notably in the white-box protocol. Our comprehensive evaluation further attests to its robustness and efficiency across various setups, including stringent black-box training protocol.

翻译：随着对数据隐私和模型版权日益增长的关注，特别是在AI服务提供商与数据所有者合作的背景下，本文提出了一种创新的SG-ZSL范式。SG-ZSL旨在实现高效协作，而无需交换模型或敏感数据。它由教师模型、学生模型以及连接这两个模型实体的生成器组成。教师模型代表数据所有者充当哨兵，替代真实数据，在训练过程中指导AI服务提供商端的学生模型。考虑到教师与学生之间的知识空间差异，我们引入了教师模型的两种变体：全知教师与准全知教师。在这些教师的指导下，学生模型力求匹配教师模型的性能，并探索教师尚未覆盖的领域。为了在隐私与性能之间取得平衡，我们进一步引入了两种不同安全级别的训练协议：白盒协议与黑盒协议，增强了范式的适应性。尽管SG-ZSL范式中存在真实数据缺失的内在挑战，它在ZSL和GZSL任务中始终表现出色，尤其是在白盒协议下。我们的全面评估进一步证明了其在各种设置（包括严格的黑盒训练协议）下的鲁棒性和效率。

相关内容

MoDELS

关注 46

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日