Fine-grained Action Analysis: A Multi-modality and Multi-task Dataset of Figure Skating

The fine-grained action analysis of the existing action datasets is challenged by insufficient action categories, low fine granularities, limited modalities, and tasks. In this paper, we propose a Multi-modality and Multi-task dataset of Figure Skating (MMFS) which was collected from the World Figure Skating Championships. MMFS, which possesses action recognition and action quality assessment, captures RGB, skeleton, and is collected the score of actions from 11671 clips with 256 categories including spatial and temporal labels. The key contributions of our dataset fall into three aspects as follows. (1) Independently spatial and temporal categories are first proposed to further explore fine-grained action recognition and quality assessment. (2) MMFS first introduces the skeleton modality for complex fine-grained action quality assessment. (3) Our multi-modality and multi-task dataset encourage more action analysis models. To benchmark our dataset, we adopt RGB-based and skeleton-based baseline methods for action recognition and action quality assessment.

翻译：现有动作数据集的细粒度分析面临动作类别不足、细粒度低、模态有限以及任务类型单一等挑战。本文提出一个多模态多任务的花样滑冰数据集（MMFS），该数据集收集自世界花样滑冰锦标赛。MMFS支持动作识别与动作质量评估，包含RGB图像、骨骼数据，并收录了来自11671个视频片段中256个类别的动作得分，同时提供空间与时间标签。本数据集的主要贡献体现在以下三个方面：（1）首次独立提出空间与时间类别，以进一步探索细粒度动作识别与质量评估；（2）MMFS首次引入骨骼模态用于复杂细粒度动作质量评估；（3）我们的多模态多任务数据集可促进更多动作分析模型的发展。为建立基准评估，我们采用基于RGB和基于骨骼的基线方法进行动作识别与动作质量评估。

相关内容

运动行为分析

关注 959

计算机视觉中运动行为分析就是在不需要人为干预的情况下，综合利用计算机视觉、模式识别、图像处理、人工智能等诸多方面的知识和技术对摄像机拍录的图像序列进行自动分析，实现动态场景中的人体定位、跟踪和识别，并在此基础上分析和判断人的行为，其最终目标是通过对行为特征数据的分析来获取行为的语义描述与理解。运动人体行为分析在智能视频监控、高级人机交互、视频会议、基于行为的视频检索以及医疗诊断等方面有着广泛的应用前景和潜在的商业价值，是近年来计算机视觉领域最活跃的研究方向之一。它包含视频中运动人体的自动检测、行为特征提取以及行为理解和描述等，属于图像分析和理解的范畴。从技术角度讲，人体行为分析和识别的研究内容相当丰富，涉及到图像处理、计算机视觉、模式识别、人工智能、形态学等学科知识。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日