Defining and measuring trust in dynamic, multiagent teams is important in a range of contexts, particularly in defense and security domains. Team members should be trusted to work towards agreed goals and in accordance with shared values. In this paper, our concern is with the definition of goals and values such that it is possible to define 'trust' in a way that is interpretable, and hence usable, by both humans and robots. We argue that the outcome of team activity can be considered in terms of 'goal', 'individual/team values', and 'legal principles'. We question whether alignment is possible at the level of 'individual/team values', or only at the 'goal' and 'legal principles' levels. We argue for a set of metrics to define trust in human-robot teams that are interpretable by human or robot team members, and consider an experiment that could demonstrate the notion of 'satisficing trust' over the course of a simulated mission.
翻译:在动态多智能体团队中定义和衡量信任在诸多领域(尤其是国防与安全领域)具有重要意义。团队成员应被信任依照共同目标与共享价值观进行协作。本文关注目标与价值观的定义问题,旨在使"信任"能够以人类与机器均可解释、进而可用的方式加以定义。我们提出团队活动成果可从"目标"、"个体/团队价值观"及"法律原则"三个维度进行考量,并探讨对齐是否可能在"个体/团队价值观"层面实现,抑或仅能停留在"目标"与"法律原则"层面。我们主张建立一套可供人类或机器人团队成员解释的人机团队信任度量指标,并设计一项实验以验证在模拟任务过程中"满意型信任"这一概念的可行性。