Disincentivizing Polarization in Social Networks

On social networks, algorithmic personalization drives users into filter bubbles where they rarely see content that deviates from their interests. We present a model for content curation and personalization that avoids filter bubbles, along with algorithmic guarantees and nearly matching lower bounds. In our model, the platform interacts with $n$ users over $T$ timesteps, choosing content for each user from $k$ categories. The platform receives stochastic rewards as in a multi-arm bandit. To avoid filter bubbles, we draw on the intuition that if some users are shown some category of content, then all users should see at least a small amount of that content. We first analyze a naive formalization of this intuition and show it has unintended consequences: it leads to ``tyranny of the majority'' with the burden of diversification borne disproportionately by those with minority interests. This leads us to our model which distributes this burden more equitably. We require that the probability any user is shown a particular type of content is at least $\gamma$ times the average probability all users are shown that type of content. Full personalization corresponds to $\gamma = 0$ and complete homogenization corresponds to $\gamma = 1$; hence, $\gamma$ encodes a hard cap on the level of personalization. We also analyze additional formulations where the platform can exceed its cap but pays a penalty proportional to its constraint violation. We provide algorithmic guarantees for optimizing recommendations subject to these constraints. These include nearly matching upper and lower bounds for the entire range of $\gamma \in [0,1]$ showing that the reward of a multi-agent variant of UCB is nearly optimal. Using real-world preference data, we empirically verify that under our model, users share the burden of diversification with only minor utility loss under our constraints.

翻译：在社交网络中，算法个性化使用户陷入信息茧房，鲜少接触到与其兴趣相悖的内容。我们提出了一种避免信息茧房的内容策展与个性化模型，同时给出了算法保证及近乎匹配的下界。在该模型中，平台与$n$名用户进行$T$轮交互，为每位用户从$k$个类别中选择内容。平台获得类似于多臂老虎机中的随机奖励。为避免信息茧房，我们基于如下直觉：若某些用户接触到某类内容，则所有用户都应至少看到该内容的少量片段。我们首先分析了该直觉的朴素形式化，并发现其存在意外后果：它导致"多数人暴政"，使得多样化的负担不成比例地由少数群体承担。这促使我们设计出更公平分配该负担的模型。我们要求任意用户看到特定类型内容的概率至少为$\gamma$倍的所有用户看到该类型内容的平均概率。完全个性化对应$\gamma = 0$，完全同质化对应$\gamma = 1$；因此$\gamma$编码了对个性化程度的硬性上限。我们还分析了其他形式化方案，其中平台可超过上限但需支付与违规比例成正比的罚金。我们为在这些约束下优化推荐提供了算法保证，包括针对$\gamma \in [0,1]$整个区间给出近乎匹配的上界与下界，证明多智能体UCB变体的奖励近乎最优。利用真实世界偏好数据，我们通过实验验证：在该模型下，用户分担多样化负担，且约束条件下效用损失极小。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日