Cultural and linguistic factors significantly influence counseling, but Natural Language Processing research has not yet examined whether findings from conversational analyses of counseling conducted in English apply to other languages. This paper presents a first step in this direction. We introduce MIDAS (Motivational Interviewing Dataset in Spanish), a counseling dataset created from public video sources that contains expert annotations for counseling reflections and questions. Using this dataset, we explore language-based differences in counselor behavior between English and Spanish and develop classifiers in monolingual and multilingual settings, demonstrating the dataset's applications in counselor behavioral coding tasks.