Force signals provide critical interaction cues for contact-rich robotic manipulation. However, existing methods mostly use force as an additional observation modality, without fully exploiting its role in modeling future interaction dynamics or guiding execution-time feedback correction. In this paper, we propose FAWAM, a force-aware world action model that incorporates force information at three levels: perception, prediction, and closed-loop execution. FAWAM first encodes historical 6-axis force/torque signals to modulate action generation, then jointly predicts future actions and end-effector wrenches to explicitly model contact evolution. It further introduces a residual correction module that uses the predicted wrench trajectory as an execution-time reference to refine actions online based on real-time force feedback. Real-world experiments across multiple contact-rich tasks show that FAWAM improves the average success rate by 36.25% over vision-only baselines and 21.25% over existing force-aware baselines, demonstrating the effectiveness of our force-aware framework for robust contact-rich manipulation.
翻译:力信号为富接触机器人操作提供了关键的交互信息。然而,现有方法大多将力仅作为附加观测模态,未能充分利用其在建模未来交互动力学或引导执行时反馈校正中的作用。本文提出了FAWAM,一种包含力信息的三层次(感知、预测和闭环执行)力感知世界动作模型。FAWAM首先对历史六轴力/力矩信号进行编码以调节动作生成,随后联合预测未来动作与末端执行器力螺旋,从而显式建模接触演化。进一步,它引入残差校正模块,将预测的力螺旋轨迹作为执行时参考,基于实时力反馈在线修正动作。跨多个富接触任务的实际实验表明,FAWAM相比纯视觉基线方法平均成功率提升36.25%,相比现有力感知基线方法提升21.25%,证明了所提力感知框架对于鲁棒富接触操作的有效性。