Learning Group Activity Features Through Person Attribute Prediction

This paper proposes Group Activity Feature (GAF) learning in which features of multi-person activity are learned as a compact latent vector. Unlike prior work in which the manual annotation of group activities is required for supervised learning, our method learns the GAF through person attribute prediction without group activity annotations. By learning the whole network in an end-to-end manner so that the GAF is required for predicting the person attributes of people in a group, the GAF is trained as the features of multi-person activity. As a person attribute, we propose to use a person's action class and appearance features because the former is easy to annotate due to its simpleness, and the latter requires no manual annotation. In addition, we introduce a location-guided attribute prediction to disentangle the complex GAF for extracting the features of each target person properly. Various experimental results validate that our method outperforms SOTA methods quantitatively and qualitatively on two public datasets. Visualization of our GAF also demonstrates that our method learns the GAF representing fined-grained group activity classes. Code: https://github.com/chihina/GAFL-CVPR2024.

翻译：本文提出群体活动特征学习（Group Activity Feature, GAF），将多人活动特征学习为紧凑的潜在向量。不同于以往工作中需要手动标注群体活动进行监督学习的方法，本方法通过人物属性预测学习GAF，无需群体活动标注。通过端到端方式训练整个网络，使得GAF成为预测群体中人物属性的必要特征，从而将GAF训练为多人活动的特征。在人物属性方面，本文提出采用人物的动作类别与外观特征，前者因简单性而易于标注，后者则无需人工标注。此外，我们引入位置引导的属性预测机制，以解耦复杂的GAF，从而正确提取每个目标人物的特征。多项实验结果表明，本方法在两个公开数据集上的定量与定性性能均优于现有最先进方法。对GAF的可视化分析亦证明，本方法学习的GAF能够表征细粒度的群体活动类别。代码：https://github.com/chihina/GAFL-CVPR2024。

相关内容

GROUP

关注 1

Group一直是研究计算机支持的合作工作、人机交互、计算机支持的协作学习和社会技术研究的主要场所。该会议将社会科学、计算机科学、工程、设计、价值观以及其他与小组工作相关的多个不同主题的工作结合起来，并进行了广泛的概念化。官网链接：https://group.acm.org/conferences/group20/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日