Learning Geometric Invariance for Gait Recognition - 专知 (Zhuanzhi) Papers

Invariant · Invariance · Recognition · Gait recognition · Geometric transformation

Learning Geometric Invariance for Gait Recognition


Zengbin Wang, Junjie Li, Saihui Hou, Xu Liu, Chunshui Cao, Yongzhen Huang, Muyi Sun, Siye Wang, Man Zhang

The goal of gait recognition is to extract identity-invariant features of an individual under various gait conditions, e.g., cross-view and cross-clothing. Most gait models strive to implicitly learn the common traits across different gait conditions in a data-driven manner to pull different gait conditions closer for recognition. However, relatively few studies have explicitly explored the inherent relations between different gait conditions. For this purpose, we attempt to establish connections among different gait conditions and propose a new perspective to achieve gait recognition: variations in different gait conditions can be approximately viewed as a combination of geometric transformations. In this case, all we need is to determine the types of geometric transformations and achieve geometric invariance, then identity invariance naturally follows. As an initial attempt, we explore three common geometric transformations (i.e., Reflect, Rotate, and Scale) and design a $\mathcal{R}$eflect-$\mathcal{R}$otate-$\mathcal{S}$cale invariance learning framework, named ${\mathcal{RRS}}$-Gait. Specifically, it first flexibly adjusts the convolution kernel based on the specific geometric transformations to achieve approximate feature equivariance. Then these three equivariant-aware features are respectively fed into a global pooling operation for final invariance-aware learning. Extensive experiments on four popular gait datasets (Gait3D, GREW, CCPG, SUSTech1K) show superior performance across various gait conditions.
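The two-step idea in the abstract (transform the convolution kernel to obtain equivariant features, then pool them to obtain invariance) can be illustrated with a small toy example. The sketch below is not the authors' RRS-Gait implementation: it is a minimal pure-Python illustration that assumes a single 2D kernel, uses only reflections and 90-degree rotations (the cases that are exact on a pixel grid), and leaves out scale, which can only be handled approximately. All function names (`kernel_bank`, `invariant_descriptor`, and so on) are hypothetical. A feature map f is equivariant when f(g·x) = g·f(x) for a transformation g; pooling over positions and over the transformed kernels then gives a value that no longer depends on g.

```python
def fliplr(a):
    """Reflect a 2D list left-to-right."""
    return [row[::-1] for row in a]


def rot90(a):
    """Rotate a 2D list by 90 degrees counter-clockwise."""
    return [list(row) for row in zip(*a)][::-1]


def correlate_valid(x, k):
    """Plain 'valid' 2D cross-correlation of image x with kernel k."""
    kh, kw = len(k), len(k[0])
    oh, ow = len(x) - kh + 1, len(x[0]) - kw + 1
    return [
        [
            sum(x[i + a][j + b] * k[a][b] for a in range(kh) for b in range(kw))
            for j in range(ow)
        ]
        for i in range(oh)
    ]


def kernel_bank(k):
    """Hypothetical kernel bank: the 8 reflected/rotated copies of k."""
    bank = []
    for base in (k, fliplr(k)):
        cur = base
        for _ in range(4):
            bank.append(cur)
            cur = rot90(cur)
    return bank


def invariant_descriptor(x, k):
    """Global max pooling over positions and over the kernel bank.

    Reflecting or rotating x only permutes and reflects/rotates the
    response maps (equivariance), so the pooled maximum is unchanged.
    """
    maps = [correlate_valid(x, kt) for kt in kernel_bank(k)]
    return max(max(max(row) for row in m) for m in maps)


# Toy binary "silhouette" and an arbitrary integer kernel (illustrative values).
x = [
    [0, 1, 1, 0],
    [0, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 1, 0, 1],
    [0, 1, 0, 0],
]
k = [
    [1, 0, -1],
    [2, 0, -1],
    [0, 1, 0],
]

d = invariant_descriptor(x, k)
assert d == invariant_descriptor(fliplr(x), k)  # reflect invariance
assert d == invariant_descriptor(rot90(x), k)   # rotate (90 degrees) invariance
```

Integer inputs are used so that the equality checks are exact. With floating-point features the comparison would need a tolerance, and for arbitrary rotation angles or scale changes the invariance would hold only approximately, which is consistent with the abstract's wording of "approximate feature equivariance".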

