SkelMamba：一种用于神经系统疾病高效骨架动作识别的状态空间模型 (SkelMamba: A State Space Model for Efficient Skeleton Action Recognition of Neurological Disorders)

We introduce a novel state-space model (SSM)-based framework for skeleton-based human action recognition, with an anatomically-guided architecture that improves state-of-the-art performance in both clinical diagnostics and general action recognition tasks. Our approach decomposes skeletal motion analysis into spatial, temporal, and spatio-temporal streams, using channel partitioning to capture distinct movement characteristics efficiently. By implementing a structured, multi-directional scanning strategy within SSMs, our model captures local joint interactions and global motion patterns across multiple anatomical body parts. This anatomically-aware decomposition enhances the ability to identify subtle motion patterns critical in medical diagnosis, such as gait anomalies associated with neurological conditions. On public action recognition benchmarks, i.e., NTU RGB+D, NTU RGB+D 120, and NW-UCLA, our model outperforms current state-of-the-art methods, achieving accuracy improvements up to $3.2\%$ with lower computational complexity than previous leading transformer-based models. We also introduce a novel medical dataset for motion-based patient neurological disorder analysis to validate our method's potential in automated disease diagnosis.

翻译：我们提出了一种基于状态空间模型（SSM）的新型框架，用于基于骨架的人体动作识别。该框架采用解剖学引导的架构，在临床诊断和通用动作识别任务中均提升了当前最先进的性能。我们的方法将骨骼运动分析分解为空间、时间和时空流，利用通道划分来高效捕捉不同的运动特征。通过在SSM中实施结构化的多方向扫描策略，我们的模型能够捕捉多个解剖身体部位的局部关节交互和全局运动模式。这种具有解剖学意识的分解增强了识别细微运动模式的能力，这些模式在医学诊断中至关重要，例如与神经系统疾病相关的步态异常。在公开的动作识别基准测试（即NTU RGB+D、NTU RGB+D 120和NW-UCLA）上，我们的模型优于当前最先进的方法，在比先前领先的基于Transformer的模型计算复杂度更低的情况下，准确率提升高达$3.2\%$。我们还引入了一个新颖的基于运动的患者神经系统疾病分析医学数据集，以验证我们的方法在自动化疾病诊断中的潜力。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

31+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日