This report describes our first-place solution to the ARCTIC track of the 8th HANDS workshop challenge, held in conjunction with ECCV 2024. The challenge targets bimanual category-agnostic hand-object interaction reconstruction: recovering 3D reconstructions of both hands and the manipulated object from a monocular video, without relying on predefined templates. The task is particularly challenging because bimanual manipulation involves severe occlusion and dynamic contact between the hands and the object. We address occlusion with a mask loss and dynamic contact with a 3D contact loss. We also apply 3D Gaussian Splatting (3DGS) to this task. Our method achieves 38.69 on the main metric, CD$_h$, on the ARCTIC test set.
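To make the two loss terms concrete, the following is a minimal NumPy sketch of what a mask loss and a 3D contact loss could look like. It is an illustration under stated assumptions, not the formulation used in our method: the function names `mask_loss` and `contact_loss`, the L1 form of the mask term, the one-sided nearest-neighbor form of the contact term, and the `threshold` parameter are all hypothetical choices made for this example.

```python
import numpy as np


def mask_loss(pred_alpha: np.ndarray, gt_mask: np.ndarray) -> float:
    """Mean absolute error between a rendered opacity map and a binary mask.

    Assumption: an L1 penalty is used here purely for illustration.
    Both arrays have shape (H, W) with values in [0, 1].
    """
    return float(np.mean(np.abs(pred_alpha - gt_mask)))


def contact_loss(hand_pts: np.ndarray, obj_pts: np.ndarray,
                 threshold: float = 0.01) -> float:
    """Mean hand-to-object distance over hand points near the object.

    Assumption: only hand points whose nearest object point lies within
    `threshold` are treated as contact candidates and pulled closer.
    hand_pts has shape (N_h, 3); obj_pts has shape (N_o, 3).
    """
    # Pairwise Euclidean distances, shape (N_h, N_o).
    dists = np.linalg.norm(hand_pts[:, None, :] - obj_pts[None, :, :], axis=-1)
    # Distance from each hand point to its nearest object point.
    nearest = dists.min(axis=1)
    near = nearest < threshold
    if not near.any():
        return 0.0  # no contact candidates, so no penalty
    return float(nearest[near].mean())
```

In a sketch like this, the mask term keeps the rendered silhouette consistent with the segmentation even when parts are occluded, while the contact term pulls hand points that are already close to the object onto its surface. How the terms would be weighted against the reconstruction objective is left open here.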