We aim to understand how people assess human likeness in navigation produced by people and artificially intelligent (AI) agents in a video game. To this end, we propose a novel AI agent with the goal of generating more human-like behavior. We collect hundreds of crowd-sourced assessments comparing the human-likeness of navigation behavior generated by our agent and baseline AI agents with human-generated behavior. Our proposed agent passes a Turing Test, while the baseline agents do not. By passing a Turing Test, we mean that human judges could not quantitatively distinguish between videos of a person and an AI agent navigating. To understand what people believe constitutes human-like navigation, we extensively analyze the justifications of these assessments. This work provides insights into the characteristics that people consider human-like in the context of goal-directed video game navigation, which is a key step for further improving human interactions with AI agents.
翻译:我们旨在理解人们如何在电子游戏中评估由人类和人工智能(AI)智能体产生的导航行为的类人性。为此,我们提出一种新型AI智能体,其目标是生成更具类人特征的行为。我们收集了数百份众包评估,比较我们的智能体与基线AI智能体生成的导航行为与人类生成行为的类人程度。所提出的智能体通过了图灵测试,而基线智能体未能通过。通过图灵测试是指人类评判者无法在定量层面上区分人类导航与AI智能体导航的视频片段。为探究人们认为哪些特征构成类人导航,我们广泛分析了这些评估的论证依据。本研究揭示了在目标导向的电子游戏导航中,人们认为具有类人性的特征,这为进一步改善人类与AI智能体的交互提供了关键启示。