Long-horizon dexterous robot manipulation of deformable objects, such as banana peeling, is a problematic task because of the difficulties in object modeling and a lack of knowledge about stable and dexterous manipulation skills. This paper presents a goal-conditioned dual-action (GC-DA) deep imitation learning (DIL) approach that can learn dexterous manipulation skills using human demonstration data. Previous DIL methods map the current sensory input and reactive action, which often fails because of compounding errors in imitation learning caused by the recurrent computation of actions. The method predicts reactive action only when the precise manipulation of the target object is required (local action) and generates the entire trajectory when precise manipulation is not required (global action). This dual-action formulation effectively prevents compounding error in the imitation learning using the trajectory-based global action while responding to unexpected changes in the target object during the reactive local action. The proposed method was tested in a real dual-arm robot and successfully accomplished the banana-peeling task.
翻译:长时域灵巧机器人操作可变形物体(如剥香蕉)是一项具有挑战性的任务,原因在于物体建模的困难以及缺乏关于稳定灵巧操作技能的知识。本文提出了一种基于目标条件的双动作(GC-DA)深度模仿学习(DIL)方法,该方法能够利用人类演示数据学习灵巧操作技能。以往的DIL方法将当前感官输入与反应性动作直接映射,但由于动作的循环计算导致模仿学习中出现累积误差,常常失败。本方法仅在需要对目标物体进行精确操作时预测反应性动作(局部动作),并在不需要精确操作时生成完整轨迹(全局动作)。这种双动作公式通过基于轨迹的全局动作有效防止了模仿学习中的累积误差,同时在反应性局部动作期间对目标物体的意外变化做出响应。所提出的方法在真实双臂机器人上进行了测试,并成功完成了剥香蕉任务。