Understanding Help Seeking for Digital Privacy, Safety, and Security

Kurt Thomas,Sai Teja Peddinti,Sarah Meiklejohn,Tara Matthews,Amelia Hassoun,Animesh Srivastava,Jessica McClearn,Patrick Gage Kelley,Sunny Consolvo,Nina Taft

The complexity of navigating digital privacy, safety, and security threats often falls directly on users. This leads to users seeking help from family and peers, platforms and advice guides, dedicated communities, and even large language models (LLMs). As a precursor to improving resources across this ecosystem, our community needs to understand what help seeking looks like in the wild. To that end, we blend qualitative coding with LLM fine-tuning to sift through over one billion Reddit posts from the last four years to identify where and for what users seek digital privacy, safety, or security help. We isolate three million relevant posts with 93% precision and recall and automatically annotate each with the topics discussed (e.g., security tools, privacy configurations, scams, account compromise, content moderation, and more). We use this dataset to understand the scope and scale of help seeking, the communities that provide help, and the types of help sought. Our work informs the development of better resources for users (e.g., user guides or LLM help-giving agents) while underscoring the inherent challenges of supporting users through complex combinations of threats, platforms, mitigations, context, and emotions.

翻译：应对数字隐私、安全与防护威胁的复杂性往往直接落在用户身上。这导致用户向家人与同伴、平台与指导手册、专业社区乃至大型语言模型（LLM）寻求帮助。作为改进这一生态系统中各类资源的先导步骤，我们学界需要理解真实场景中的求助行为究竟呈现何种形态。为此，我们融合质性编码与LLM微调技术，对过去四年间超过十亿条Reddit发帖进行筛选，以识别用户在何处、因何种问题寻求数字隐私、安全或防护方面的帮助。我们以93%的精确率与召回率提取出三百万条相关发帖，并通过自动标注为每条帖子标记所讨论的主题（例如：安全工具、隐私配置、网络诈骗、账户盗用、内容审核等）。利用该数据集，我们深入剖析了求助行为的范围与规模、提供帮助的社区类型以及所寻求帮助的具体类别。本研究不仅为开发更优质的用户资源（如使用指南或LLM助手机器人）提供了依据，同时揭示了在应对威胁、平台、缓解措施、情境与情感等多重因素交织的复杂局面时，为用户提供支持所面临的固有挑战。

相关内容

大语言模型

关注 66

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

综述：面向移动端大语言模型的隐私与安全

专知会员服务

19+阅读 · 2025年9月7日

探索大型语言模型在网络安全中的作用：一项系统综述

专知会员服务

22+阅读 · 2025年4月27日

【新书】大规模语言模型的隐私与安全，

专知会员服务

29+阅读 · 2024年12月4日

大型语言模型代理的安全与隐私综述

专知会员服务

30+阅读 · 2024年8月5日