The Right to be Forgotten (RTBF) was first established as the result of the ruling of Google Spain SL, Google Inc. v AEPD, Mario Costeja Gonz\'alez, and was later included as the Right to Erasure under the General Data Protection Regulation (GDPR) of European Union to allow individuals the right to request personal data be deleted by organizations. Specifically for search engines, individuals can send requests to organizations to exclude their information from the query results. With the recent development of Large Language Models (LLMs) and their use in chatbots, LLM-enabled software systems have become popular. But they are not excluded from the RTBF. Compared with the indexing approach used by search engines, LLMs store, and process information in a completely different way. This poses new challenges for compliance with the RTBF. In this paper, we explore these challenges and provide our insights on how to implement technical solutions for the RTBF, including the use of machine unlearning, model editing, and prompting engineering.
翻译:被遗忘权最初确立于谷歌西班牙公司、谷歌公司诉西班牙数据保护局及马里奥·科斯特哈·冈萨雷斯案的裁决,后作为删除权被纳入欧盟《通用数据保护条例》,允许个人请求组织删除其个人数据。具体而言,个人可向搜索引擎提交请求,要求将其信息排除在查询结果之外。随着近期大语言模型及其在聊天机器人中的应用发展,基于LLM的软件系统已变得普及,但此类系统并未被排除在被遗忘权的适用范围之外。与搜索引擎采用的索引方法相比,LLM以截然不同的方式存储和处理信息,这为遵守被遗忘权带来了新的挑战。本文探究了这些挑战,并就如何实施被遗忘权的技术解决方案(包括采用机器遗忘、模型编辑和提示工程)提出见解。