Modeling Multiple Normal Action Representations for Error Detection in Procedural Tasks

Error detection in procedural activities is essential for consistent and correct outcomes in AR-assisted and robotic systems. Existing methods often focus on temporal ordering errors or rely on static prototypes to represent normal actions. However, these approaches typically overlook the common scenario where multiple, distinct actions are valid following a given sequence of executed actions. This leads to two issues: (1) the model cannot effectively detect errors using static prototypes when the inference environment or action execution distribution differs from training; and (2) the model may use the wrong prototypes to detect errors if the ongoing action label is not the same as the predicted one. To address these issues, we propose an Adaptive Multiple Normal Action Representation (AMNAR) framework. AMNAR predicts all valid next actions and reconstructs their corresponding normal action representations, which are compared against the ongoing action to detect errors. Extensive experiments demonstrate that AMNAR achieves state-of-the-art performance, highlighting both its effectiveness and the importance of modeling multiple valid next actions in error detection. The code is available at https://github.com/iSEE-Laboratory/AMNAR.

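The comparison step described in the abstract can be illustrated with a minimal sketch. This is not the AMNAR implementation: in the paper, the valid next actions and their normal representations come from learned modules, whereas here they are hand-written vectors. The function names, the use of cosine distance, the threshold value, and the action labels are all assumptions chosen for illustration. The sketch only shows why comparing against every valid next action matters: an ongoing action is flagged as an error only if it is far from all of the valid normal representations, not just from a single predicted one.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 1.0  # assumption: treat a zero vector as maximally dissimilar
    return 1.0 - dot / (norm_u * norm_v)


def detect_error(ongoing_feature, normal_representations, threshold=0.5):
    """Flag an error if the ongoing action is far from every valid normal action.

    normal_representations: dict mapping each valid next-action label
    (hypothetical labels here) to its normal representation vector.
    threshold: hypothetical cutoff on cosine distance, not a value from the paper.
    Returns (is_error, closest_label, closest_distance).
    """
    if not normal_representations:
        raise ValueError("need at least one valid next action")
    distances = {
        label: cosine_distance(ongoing_feature, rep)
        for label, rep in normal_representations.items()
    }
    closest = min(distances, key=distances.get)
    return distances[closest] > threshold, closest, distances[closest]


# Hypothetical example: two actions are both valid after the executed steps.
valid_next = {
    "stir": [1.0, 0.0, 0.0],
    "add_sugar": [0.0, 1.0, 0.0],
}

# Close to "add_sugar", so it is accepted even if "stir" was the top prediction.
print(detect_error([0.1, 0.9, 0.0], valid_next))

# Far from both valid actions, so it is flagged as an error.
print(detect_error([0.0, 0.0, 1.0], valid_next))
```

A detector that compared only against the single most likely next action would wrongly flag the first example whenever "stir" happened to be predicted, which is the second issue the abstract describes.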
