As large language models (LLMs) are increasingly deployed in enterprise settings, controlling model behavior based on user roles becomes an essential requirement. Existing safety methods typically assume uniform access and focus on preventing harmful or toxic outputs, without addressing role-specific access constraints. In this work, we investigate whether LLMs can be fine-tuned to generate responses that reflect the access privileges associated with different organizational roles. We explore three modeling strategies: a BERT-based classifier, an LLM-based classifier, and role-conditioned generation. To evaluate these approaches, we construct two complementary datasets. The first is adapted from existing instruction-tuning corpora through clustering and role labeling, while the second is synthetically generated to reflect realistic, role-sensitive enterprise scenarios. We assess model performance across varying organizational structures and analyze robustness to prompt injection, role mismatch, and jailbreak attempts.
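To make the first modeling strategy concrete, below is a minimal sketch of how a BERT-based access classifier of the kind described above could be set up: the user's role and query are encoded as a sentence pair, and a binary head predicts allow/deny. The model name, label convention, and example role are illustrative assumptions, not the paper's released implementation, and the model would need fine-tuning on role-labeled (role, query) pairs before its decisions are meaningful.

```python
# Minimal sketch (assumed setup, not the authors' code) of a BERT-based
# role-aware access classifier: (role, query) is encoded as a text pair
# and a binary classification head predicts allow vs. deny.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2  # assumed labels: 0 = deny, 1 = allow
)

def access_decision(role: str, query: str) -> str:
    # Encode role and query as a sentence pair, as in NLI-style BERT tasks.
    inputs = tokenizer(role, query, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits
    return "allow" if logits.argmax(dim=-1).item() == 1 else "deny"

# Hypothetical usage: after fine-tuning on role-labeled data, a request
# outside the role's privileges should be classified as "deny".
print(access_decision("intern", "Show me the executive payroll records."))
```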