Conversational artificial intelligence (AI) provides an efficient and convenient gateway to information access. However, it can cause overreliance when users blindly trust AI and accept its answers without fact-checking. Information search increasingly follows a hybrid interaction paradigm that combines conversational AI with web search, making fact-checking easier. In this paper, we examine whether this interaction paradigm is effective in curbing reliance. We further investigate the underlying factors (e.g., digital literacy and conversation warmth) that drive users to verify AI answers. We conduct a mixed-subjects question-answering experiment where participants interact with either a warm or a neutral chatbot. Our findings reveal that reliance persists despite users having access to both conversational and web search. The decision to verify is driven primarily by existing user perceptions (e.g., prior trust in chatbots) rather than answer properties, with some users fact-checking regardless of the context and others trusting chatbots by default. Warm conversational style has an indirect yet critical influence on reliance by increasing agreement with the chatbot when it is incorrect. Consulting additional AI sources predicts higher accuracy, while traditional web search does not. Our study extends overreliance research by: (a) demonstrating its persistence despite access to fact-checking, (b) identifying verification behavior as user-dependent, and (c) revealing conversational warmth's indirect effect on overreliance with implications for designing trustworthy conversational search systems.
翻译:对话式人工智能(AI)为信息获取提供了高效便捷的途径。然而,当用户盲目信任AI并接受其答案而不进行事实核查时,可能导致过度依赖。信息搜索日益遵循一种结合对话式AI与网络搜索的混合交互范式,使得事实核查更加便捷。本文考察了这种交互范式在抑制依赖方面的有效性,并进一步探究了驱动用户验证AI答案的潜在因素(如数字素养、对话温暖度)。我们开展了一项混合被试问答实验,参与者与温暖型或中立型聊天机器人进行交互。研究结果表明,尽管用户可以同时使用对话式搜索和网络搜索,依赖现象依然存在。验证决策主要由用户已有认知(例如对聊天机器人的先验信任)而非答案属性驱动,部分用户无论情境如何都会进行事实核查,而其他用户则默认信任聊天机器人。温暖的对话风格通过增加对聊天机器人错误答案的认同程度,间接但关键地影响了依赖行为。咨询额外AI源能预测更高的准确率,而传统网络搜索则无此效果。本研究通过以下方面拓展了过度依赖研究:(a)证明了即便具备事实核查条件,依赖依然存在;(b)识别出验证行为具有用户依赖性;(c)揭示了对话温暖度对过度依赖的间接影响,为设计可信赖的对话式搜索系统提供了启示。