SWaT: Statistical Modeling of Video Watch Time through User Behavior Analysis

The rising importance of (short) video recommendation, now a core product of mainstream social media platforms, has made estimating video watch time increasingly significant. Modeling video watch time is difficult, however, because user-video interaction is complex: users exhibit different behavior modes when watching recommended videos, and the probability of continuing to watch varies over the video progress bar. Despite the importance and the challenges, existing literature on modeling video watch time mostly focuses on relatively black-box, mechanical enhancements of the classical regression/classification losses, without factoring in user behavior in a principled manner. In this paper, we take, for the first time, a user-centric perspective on modeling video watch time, and from it we propose a white-box statistical framework that directly translates various assumptions about user behavior in watching (short) videos into statistical watch time models. These behavior assumptions are informed by our domain knowledge of users' behavior modes in video watching. We further employ bucketization to cope with users' non-stationary watching probability over the video progress bar. Bucketization additionally helps to respect the video-length constraint and facilitates practical compatibility between the continuous regression event of watch time and other binary classification events. We test our models extensively on two public datasets, a large-scale offline industrial dataset, and an online A/B test on a short video platform with hundreds of millions of daily active users. In all experiments, our models perform competitively against strong relevant baselines, demonstrating the efficacy of our user-centric perspective and proposed framework.
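To make the bucketization idea concrete, here is a minimal Python sketch. It is not the paper's model: the equal-width buckets, the per-bucket "continue" probabilities, and all function names are assumptions chosen for illustration. It shows how splitting the progress bar into buckets turns watch time into a sequence of binary continue/stop events, and why the resulting expected watch time can never exceed the video length.

```python
from typing import List


def bucket_edges(video_length: float, num_buckets: int) -> List[float]:
    """Split [0, video_length] into equal-width buckets and return the edges."""
    width = video_length / num_buckets
    return [i * width for i in range(num_buckets + 1)]


def watch_time_to_bucket(watch_time: float, video_length: float,
                         num_buckets: int) -> int:
    """Index of the bucket in which the user stopped watching.

    The watch time is clipped to [0, video_length] first (hypothetical
    preprocessing; the abstract does not describe this step).
    """
    clipped = min(max(watch_time, 0.0), video_length)
    width = video_length / num_buckets
    return min(int(clipped / width), num_buckets - 1)


def expected_watch_time(continue_probs: List[float],
                        video_length: float) -> float:
    """Expected watch time under a discrete survival-style assumption.

    continue_probs[k] is the assumed probability of watching through
    bucket k given that the user reached it. Each bucket contributes its
    width weighted by the probability of surviving up to and through it.
    """
    width = video_length / len(continue_probs)
    survival = 1.0
    total = 0.0
    for p in continue_probs:
        survival *= p
        total += survival * width
    return total
```

Because every survival term is at most 1, the sum is bounded by the number of buckets times the bucket width, which is the video length. Each per-bucket probability is also an ordinary binary classification target, which is one way to read the abstract's point about compatibility with other binary events.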

