Modeling spinal motion is fundamental to understanding human biomechanics, yet remains underexplored in computer vision due to the spine's complex multi-joint kinematics and the lack of large-scale 3D annotations. We present a biomechanics-aware keypoint simulation framework that augments existing human pose datasets with anatomically consistent 3D spinal keypoints derived from musculoskeletal modeling. Using this framework, we create the first open dataset, named SIMSPINE, which provides sparse vertebra-level 3D spinal annotations for natural full-body motions in indoor multi-camera capture without external restraints. With 2.14 million frames, this enables data-driven learning of vertebral kinematics from subtle posture variations and bridges the gap between musculoskeletal simulation and computer vision. In addition, we release pretrained baselines covering fine-tuned 2D detectors, monocular 3D pose lifting models, and multi-view reconstruction pipelines, establishing a unified benchmark for biomechanically valid spine motion estimation. Specifically, our 2D spine baselines improve the state-of-the-art from 0.63 to 0.80 AUC in controlled environments, and from 0.91 to 0.93 AP for in-the-wild spine tracking. Together, the simulation framework and SIMSPINE dataset advance research in vision-based biomechanics, motion analysis, and digital human modeling by enabling reproducible, anatomically grounded 3D spine estimation under natural conditions.
翻译:脊柱运动建模是理解人体生物力学的基石,然而由于脊柱复杂的多关节运动学特性及大规模三维标注数据的缺乏,该领域在计算机视觉中仍未得到充分探索。本文提出一种生物力学感知的关键点仿真框架,该框架通过从肌肉骨骼模型导出的解剖学一致的三维脊柱关键点,对现有人体姿态数据集进行增强。基于此框架,我们创建了首个开放数据集SIMSPINE,该数据集为室内多相机无约束条件下采集的自然全身运动提供了稀疏椎骨级三维脊柱标注。该数据集包含214万帧数据,能够从细微姿态变化中实现数据驱动的椎骨运动学学习,并弥合了肌肉骨骼仿真与计算机视觉之间的鸿沟。此外,我们发布了涵盖微调二维检测器、单目三维姿态提升模型及多视角重建流程的预训练基线模型,为生物力学有效的脊柱运动估计建立了统一基准。具体而言,我们的二维脊柱基线模型在受控环境中将最佳性能从0.63 AUC提升至0.80 AUC,在自然场景脊柱跟踪任务中将平均精度从0.91 AP提升至0.93 AP。该仿真框架与SIMSPINE数据集共同通过实现自然条件下可复现、解剖学基础的三维脊柱估计,推动了基于视觉的生物力学、运动分析与数字人体建模领域的研究进展。