Semantic-Aware Motion Encoding for Topology-Agnostic Character Animation

Generalizing motion representation across diverse characters remains challenging due to significant topological variations in skeletal structures across datasets and species, which hinder the development of scalable generative models. To bridge this gap, we propose a Semantic-Aware Topology-Agnostic framework that learns a unified latent manifold shared by disparate species. Unlike methods relying on fixed hierarchies or rigid padding strategies, our approach leverages a semantic modulation mechanism to align functional joint correspondences, thereby decoupling motion from topology. This design enables the construction of a continuous, generative-friendly motion space from large-scale, unaligned raw BVH data. Experiments on human and animal datasets demonstrate that our framework achieves high-fidelity reconstruction and supports downstream text-to-motion tasks. Notably, the model enables zero-shot cross-species retargeting without paired data. Code and demos are available at: https://github.com/zzysteve/SATA

翻译：通用化跨不同角色的运动表示仍然具有挑战性，原因在于数据集和物种之间骨骼结构的显著拓扑差异阻碍了可扩展生成模型的发展。为弥合这一差距，我们提出了一种语义感知的拓扑无关框架，该框架学习一个由不同物种共享的统一潜在流形。与依赖固定层级结构或刚性填充策略的方法不同，我们的方法利用语义调制机制来对齐功能性的关节对应关系，从而将运动与拓扑解耦。这种设计能够从大规模、未对齐的原始BVH数据中构建一个连续且适合生成的运动空间。在人类和动物数据集上的实验表明，我们的框架实现了高保真重构，并支持下游文本到运动的任务。值得注意的是，该模型实现了零样本跨物种重定向，无需成对数据。代码和演示可在以下网址获取：https://github.com/zzysteve/SATA

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【ICML 2026】MotiMotion：用视觉推理增强运动可控视频生成

专知会员服务

5+阅读 · 5月23日

【CVPR2026】CARE-Edit: 面向上下文相关图像编辑的条件感知专家路由机制

专知会员服务

6+阅读 · 3月10日

在无标注条件下适配视觉—语言模型：全面综述

专知会员服务

13+阅读 · 2025年8月9日

【CVPR2025】知识桥接器：走向无训练的缺失模态补全

专知会员服务

14+阅读 · 2025年2月28日