Which PPML Would a User Choose? A Structured Decision Support Framework for Developers to Rank PPML Techniques Based on User Acceptance Criteria

Using Privacy-Enhancing Technologies (PETs) for machine learning often influences the characteristics of a machine learning approach, e.g., the needed computational power, timing of the answers or how the data can be utilized. When designing a new service, the developer faces the problem that some decisions require a trade-off. For example, the use of a PET may cause a delay in the responses or adding noise to the data to improve the users' privacy might have a negative impact on the accuracy of the machine learning approach. As of now, there is no structured way how the users' perception of a machine learning based service can contribute to the selection of Privacy Preserving Machine Learning (PPML) methods. This is especially a challenge since one cannot assume that users have a deep technical understanding of these technologies. Therefore, they can only be asked about certain attributes that they can perceive when using the service and not directly which PPML they prefer. This study introduces a decision support framework with the aim of supporting the selection of PPML technologies based on user preferences. Based on prior work analysing User Acceptance Criteria (UAC), we translate these criteria into differentiating characteristics for various PPML techniques. As a final result, we achieve a technology ranking based on the User Acceptance Criteria while providing technology insights for the developers. We demonstrate its application using the use case of classifying privacy-relevant information. Our contribution consists of the decision support framework which consists of a process to connect PPML technologies with UAC, a process for evaluating the characteristics that separate PPML techniques, and a ranking method to evaluate the best PPML technique for the use case.

翻译：在机器学习中使用隐私增强技术（PETs）通常会改变机器学习方法的特性，例如所需的计算能力、响应时间或数据利用方式。在设计新服务时，开发者面临需要权衡取舍的决策问题。例如，采用PET可能导致响应延迟；为增强用户隐私而向数据添加噪声可能对机器学习方法的准确性产生负面影响。目前，尚缺乏结构化方法将用户对基于机器学习服务的感知纳入隐私保护机器学习方法的选择过程中。这一挑战尤为突出，因为不能假定用户对这些技术具备深入的技术理解。因此，只能询问用户在使用服务时可感知的特定属性，而非直接征询其对PPML技术的偏好。本研究提出一种决策支持框架，旨在基于用户偏好辅助PPML技术的选择。基于先前对用户接受标准的分析研究，我们将这些标准转化为区分不同PPML技术的特征指标。最终，我们实现了基于用户接受标准的技术排序，同时为开发者提供技术洞察。我们通过隐私相关信息分类的应用案例演示了该框架的实施。本研究的贡献在于构建了包含三个核心组件的决策支持框架：连接PPML技术与用户接受标准的流程、评估PPML技术区分特征的流程，以及针对具体用例评估最佳PPML技术的排序方法。

相关内容

Machine Learning

关注 2251

机器学习（Machine Learning）是一个研究计算学习方法的国际论坛。该杂志发表文章，报告广泛的学习方法应用于各种学习问题的实质性结果。该杂志的特色论文描述研究的问题和方法，应用研究和研究方法的问题。有关学习问题或方法的论文通过实证研究、理论分析或与心理现象的比较提供了坚实的支持。应用论文展示了如何应用学习方法来解决重要的应用问题。研究方法论文改进了机器学习的研究方法。所有的论文都以其他研究人员可以验证或复制的方式描述了支持证据。论文还详细说明了学习的组成部分，并讨论了关于知识表示和性能任务的假设。官网地址：http://dblp.uni-trier.de/db/journals/ml/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日