Opportunities and Challenges in Neural Dialog Tutoring - 专知论文

会员服务 ·

0

格分析 · 自动评估 · 语言模型 · 用户研究 · 质量评估 ·

2023 年 3 月 27 日

Opportunities and Challenges in Neural Dialog Tutoring

翻译：神经对话辅导的机遇与挑战

Jakub Macina,Nico Daheim,Lingzhi Wang,Tanmay Sinha,Manu Kapur,Iryna Gurevych,Mrinmaya Sachan

from arxiv, EACL 2023 (main conference, camera-ready)

Designing dialog tutors has been challenging as it involves modeling the diverse and complex pedagogical strategies employed by human tutors. Although there have been significant recent advances in neural conversational systems using large language models (LLMs) and growth in available dialog corpora, dialog tutoring has largely remained unaffected by these advances. In this paper, we rigorously analyze various generative language models on two dialog tutoring datasets for language learning using automatic and human evaluations to understand the new opportunities brought by these advances as well as the challenges we must overcome to build models that would be usable in real educational settings. We find that although current approaches can model tutoring in constrained learning scenarios when the number of concepts to be taught and possible teacher strategies are small, they perform poorly in less constrained scenarios. Our human quality evaluation shows that both models and ground-truth annotations exhibit low performance in terms of equitable tutoring, which measures learning opportunities for students and how engaging the dialog is. To understand the behavior of our models in a real tutoring setting, we conduct a user study using expert annotators and find a significantly large number of model reasoning errors in 45% of conversations. Finally, we connect our findings to outline future work.

翻译：设计对话辅导系统极具挑战性，因为它需要模拟人类教师所采用多样且复杂的教学策略。尽管近年来基于大型语言模型（LLMs）的神经对话系统取得了重大进展，且可用对话语料库不断增长，但对话辅导领域在很大程度上仍未受这些进展的影响。本文在两个语言学习的对话辅导数据集上，利用自动评估与人工评估系统地对多种生成式语言模型进行严格分析，旨在理解这些进步带来的新机遇，以及我们在构建可用于真实教育场景的模型时必须克服的挑战。研究发现，虽然当前方法能在受限的学习场景（如待教授概念数量及教师策略可能范围较小）中有效建模辅导过程，但在非受限场景下表现欠佳。我们的质量评估显示，在衡量学生参与对话机会及对话吸引力的“公平辅导”维度上，模型与人工标注的真实答案均表现低下。为探究模型在真实辅导场景中的行为，我们通过专家标注者开展用户研究，发现45%的对话中存在大量模型推理错误。最后，我们整合研究成果并为未来工作指明方向。

0

相关内容

格分析

最新《可解释人工智能XAI：机会与挑战》25页pdf，Opportunities and Challenges in Explainable Artificial Intelligence (XAI): A Survey

最新《可解释人工智能XAI：机会与挑战》25页pdf，Opportunities and Challenges in Explainable Artificial Intelligence (XAI): A Survey

专知会员服务

182+阅读 · 2020年6月23日

【深度学习社区检测】Deep Learning for Community Detection: Progress, Challenges and Opportunities

【深度学习社区检测】Deep Learning for Community Detection: Progress, Challenges and Opportunities

专知会员服务

28+阅读 · 2020年6月13日

【综述】生成式对抗网络(GANs)最新2020综述:挑战、解决方案和未来方向，Generative Adversarial Networks (GANs): Challenges, Solutions, and Future Directions

【综述】生成式对抗网络(GANs)最新2020综述:挑战、解决方案和未来方向，Generative Adversarial Networks (GANs): Challenges, Solutions, and Future Directions

专知会员服务

63+阅读 · 2020年5月12日

对话管理的综述论文:最近的进展和挑战，A Survey on Dialog Management: Recent Advances and Challenges

对话管理的综述论文:最近的进展和挑战，A Survey on Dialog Management: Recent Advances and Challenges

专知会员服务

83+阅读 · 2020年5月10日

【清华大学】面向任务的对话系统的最新进展和挑战，Task-oriented Dialog System

【清华大学】面向任务的对话系统的最新进展和挑战，Task-oriented Dialog System

专知会员服务

84+阅读 · 2020年3月24日

【微软雷德蒙研究院】小样本自然语言生成，Few-shot Natural Language Generation for Task-Oriented Dialog

【微软雷德蒙研究院】小样本自然语言生成，Few-shot Natural Language Generation for Task-Oriented Dialog

专知会员服务

33+阅读 · 2020年2月29日

【北京智源大会2019】AV+AI -挑战和机遇（AV+AI - Challenges and Opportunities），伯克利DeepDrive副主任詹景尧

【北京智源大会2019】AV+AI -挑战和机遇（AV+AI - Challenges and Opportunities），伯克利DeepDrive副主任詹景尧

专知会员服务

14+阅读 · 2019年11月22日

【清华大学-微软研究院】构建智能开放域对话系统的挑战综述论文，31页pdf，Challenges in Building Intelligent Open-domain Dialog Systems

【清华大学-微软研究院】构建智能开放域对话系统的挑战综述论文，31页pdf，Challenges in Building Intelligent Open-domain Dialog Systems

专知会员服务

28+阅读 · 2019年10月23日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

不可错过！斯坦福《图学习》研讨会，Jure Leskovec主持，附slides！

不可错过！斯坦福《图学习》研讨会，Jure Leskovec主持，附slides！

图与推荐

0+阅读 · 2022年10月7日

大视觉模型方向，计算机视觉顶尖期刊 IJCV 特刊征稿

大视觉模型方向，计算机视觉顶尖期刊 IJCV 特刊征稿

机器之心

0+阅读 · 2022年9月19日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

微软等ACL2022《知识增强自然语言处理》教程，阐述最新前沿技术，附185页ppt

微软等ACL2022《知识增强自然语言处理》教程，阐述最新前沿技术，附185页ppt

专知

1+阅读 · 2022年5月24日

对话管理的综述论文:最近的进展和挑战，A Survey on Dialog Management

对话管理的综述论文:最近的进展和挑战，A Survey on Dialog Management

专知

12+阅读 · 2020年5月14日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

AINLP

10+阅读 · 2019年2月9日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

TIM-1-Fc介导辅助T淋巴细胞反应调控异位小肠移植免疫应答机制的研究

国家自然科学基金

0+阅读 · 2016年12月31日

NeuroD2调控人恶性神经胶质瘤细胞向神经元分化的作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

脂质代谢调控基因SRB1和ABCA1在前列腺癌恶性进展中的作用和机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Axl受体酪氨酸激酶在系统性红斑狼疮单核/巨噬细胞极化调控中机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

CD4+T细胞亚群失衡在高眼压视神经损伤中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

RI与Angiogenin相互作用调控PI3K/AKT/mTOR信号通路和ANG的核转位在膀胱癌发生发展中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

先天免疫RIG-I信号通路与动脉粥样硬化的研究

国家自然科学基金

1+阅读 · 2012年12月31日

MicroRNA-199b-HIF-1α-VEGF通路对肺移植慢性免疫排异反应的调控作用及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

hTERT/Tet-on/GAL基因修饰BMSCs对慢性神经病理痛的可控性镇痛作用及其机制

国家自然科学基金

0+阅读 · 2011年12月31日

单壁碳纳米管的免疫毒理和毒理基因组学分子机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

Knowledge Graphs: Opportunities and Challenges

Arxiv

182+阅读 · 2023年3月24日

A Comprehensive Survey of Few-shot Learning: Evolution, Applications, Challenges, and Opportunities

Arxiv

52+阅读 · 2022年5月13日

Federated Learning Challenges and Opportunities: An Outlook

Arxiv

15+阅读 · 2022年2月1日

Advances in Multi-turn Dialogue Comprehension: A Survey

Arxiv

23+阅读 · 2021年10月11日

AI in Finance: Challenges, Techniques and Opportunities

Arxiv

46+阅读 · 2021年7月20日

Imitation Learning: Progress, Taxonomies and Opportunities

Arxiv

12+阅读 · 2021年6月23日

Attention, please! A survey of Neural Attention Models in Deep Learning

Arxiv

59+阅读 · 2021年3月31日

Recent Advances and Challenges in Task-oriented Dialog System

Recent Advances and Challenges in Task-oriented Dialog System

Arxiv

19+阅读 · 2020年3月19日

Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI

Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI

Arxiv

78+阅读 · 2019年10月22日

Challenges in Building Intelligent Open-domain Dialog Systems

Arxiv

21+阅读 · 2019年5月13日

VIP会员

文章信息

相关主题

最新内容

无人机自主控制与人工智能：系统性综述

无人机自主控制与人工智能：系统性综述

专知会员服务

1+阅读 · 16分钟前

巡飞弹与反无人机系统——现代战场的两大支柱

巡飞弹与反无人机系统——现代战场的两大支柱

专知会员服务

1+阅读 · 47分钟前

《打造“黄金舰队”》57页报告

《打造“黄金舰队”》57页报告

专知会员服务

0+阅读 · 49分钟前

《北约数字教官网络发展路径》128页报告

《北约数字教官网络发展路径》128页报告

专知会员服务

1+阅读 · 今天6:33

ECCV 2026 | MIMFlow：MIM与归一化流统一图像生成

ECCV 2026 | MIMFlow：MIM与归一化流统一图像生成

专知会员服务

6+阅读 · 6月25日

超越自回归边界：扩散模型、世界模型与SSM如何重塑代码智能

超越自回归边界：扩散模型、世界模型与SSM如何重塑代码智能

专知会员服务

5+阅读 · 6月25日

重塑决策优势：美军作战艺术与多域作战中联盟联合全域指挥控制（CJADC2）体系的融合

重塑决策优势：美军作战艺术与多域作战中联盟联合全域指挥控制（CJADC2）体系的融合

专知会员服务

7+阅读 · 6月25日

网状网络及其在军事领域的运用

网状网络及其在军事领域的运用

专知会员服务

7+阅读 · 6月25日

《意识即战场——全球安全体系中认知战的演进：乌克兰构建认知作战体系的展望》

《意识即战场——全球安全体系中认知战的演进：乌克兰构建认知作战体系的展望》

专知会员服务

7+阅读 · 6月25日

无美国参与的欧洲战争方式（万字长文）

无美国参与的欧洲战争方式（万字长文）

专知会员服务

8+阅读 · 6月25日

重构“下一场战争”的制胜理论：超越兰彻斯特方程与现代系统

重构“下一场战争”的制胜理论：超越兰彻斯特方程与现代系统

专知会员服务

9+阅读 · 6月25日

《国防工业中基于模型定义的实施：产品定义数字化转型的战略路径》90页

《国防工业中基于模型定义的实施：产品定义数字化转型的战略路径》90页

专知会员服务

9+阅读 · 6月25日

《国防领域敏感性分析白皮书》

《国防领域敏感性分析白皮书》

专知会员服务

8+阅读 · 6月25日

综述 | 从问答到任务完成：Agent系统与Harness设计

综述 | 从问答到任务完成：Agent系统与Harness设计

专知会员服务

9+阅读 · 6月24日

Agentic RL：框架、实践与长程智能体训练

Agentic RL：框架、实践与长程智能体训练

专知会员服务

10+阅读 · 6月24日

相关VIP内容

最新《可解释人工智能XAI：机会与挑战》25页pdf，Opportunities and Challenges in Explainable Artificial Intelligence (XAI): A Survey

最新《可解释人工智能XAI：机会与挑战》25页pdf，Opportunities and Challenges in Explainable Artificial Intelligence (XAI): A Survey

专知会员服务

182+阅读 · 2020年6月23日

【深度学习社区检测】Deep Learning for Community Detection: Progress, Challenges and Opportunities

【深度学习社区检测】Deep Learning for Community Detection: Progress, Challenges and Opportunities

专知会员服务

28+阅读 · 2020年6月13日

【综述】生成式对抗网络(GANs)最新2020综述:挑战、解决方案和未来方向，Generative Adversarial Networks (GANs): Challenges, Solutions, and Future Directions

【综述】生成式对抗网络(GANs)最新2020综述:挑战、解决方案和未来方向，Generative Adversarial Networks (GANs): Challenges, Solutions, and Future Directions

专知会员服务

63+阅读 · 2020年5月12日

对话管理的综述论文:最近的进展和挑战，A Survey on Dialog Management: Recent Advances and Challenges

对话管理的综述论文:最近的进展和挑战，A Survey on Dialog Management: Recent Advances and Challenges

专知会员服务

83+阅读 · 2020年5月10日

【清华大学】面向任务的对话系统的最新进展和挑战，Task-oriented Dialog System

【清华大学】面向任务的对话系统的最新进展和挑战，Task-oriented Dialog System

专知会员服务

84+阅读 · 2020年3月24日

【微软雷德蒙研究院】小样本自然语言生成，Few-shot Natural Language Generation for Task-Oriented Dialog

【微软雷德蒙研究院】小样本自然语言生成，Few-shot Natural Language Generation for Task-Oriented Dialog

专知会员服务

33+阅读 · 2020年2月29日

【北京智源大会2019】AV+AI -挑战和机遇（AV+AI - Challenges and Opportunities），伯克利DeepDrive副主任詹景尧

【北京智源大会2019】AV+AI -挑战和机遇（AV+AI - Challenges and Opportunities），伯克利DeepDrive副主任詹景尧

专知会员服务

14+阅读 · 2019年11月22日

【清华大学-微软研究院】构建智能开放域对话系统的挑战综述论文，31页pdf，Challenges in Building Intelligent Open-domain Dialog Systems

【清华大学-微软研究院】构建智能开放域对话系统的挑战综述论文，31页pdf，Challenges in Building Intelligent Open-domain Dialog Systems

专知会员服务

28+阅读 · 2019年10月23日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

巡飞弹与反无人机系统——现代战场的两大支柱

《北约数字教官网络发展路径》128页报告

无人机自主控制与人工智能：系统性综述

《打造“黄金舰队”》57页报告

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

不可错过！斯坦福《图学习》研讨会，Jure Leskovec主持，附slides！

不可错过！斯坦福《图学习》研讨会，Jure Leskovec主持，附slides！

图与推荐

0+阅读 · 2022年10月7日

大视觉模型方向，计算机视觉顶尖期刊 IJCV 特刊征稿

大视觉模型方向，计算机视觉顶尖期刊 IJCV 特刊征稿

机器之心

0+阅读 · 2022年9月19日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

微软等ACL2022《知识增强自然语言处理》教程，阐述最新前沿技术，附185页ppt

微软等ACL2022《知识增强自然语言处理》教程，阐述最新前沿技术，附185页ppt

专知

1+阅读 · 2022年5月24日

对话管理的综述论文:最近的进展和挑战，A Survey on Dialog Management

对话管理的综述论文:最近的进展和挑战，A Survey on Dialog Management

专知

12+阅读 · 2020年5月14日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

AINLP

10+阅读 · 2019年2月9日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Knowledge Graphs: Opportunities and Challenges

Arxiv

182+阅读 · 2023年3月24日

A Comprehensive Survey of Few-shot Learning: Evolution, Applications, Challenges, and Opportunities

Arxiv

52+阅读 · 2022年5月13日

Federated Learning Challenges and Opportunities: An Outlook

Arxiv

15+阅读 · 2022年2月1日

Advances in Multi-turn Dialogue Comprehension: A Survey

Arxiv

23+阅读 · 2021年10月11日

AI in Finance: Challenges, Techniques and Opportunities

Arxiv

46+阅读 · 2021年7月20日

Imitation Learning: Progress, Taxonomies and Opportunities

Arxiv

12+阅读 · 2021年6月23日

Attention, please! A survey of Neural Attention Models in Deep Learning

Arxiv

59+阅读 · 2021年3月31日

Recent Advances and Challenges in Task-oriented Dialog System

Recent Advances and Challenges in Task-oriented Dialog System

Arxiv

19+阅读 · 2020年3月19日

Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI

Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI

Arxiv

78+阅读 · 2019年10月22日

Challenges in Building Intelligent Open-domain Dialog Systems

Arxiv

21+阅读 · 2019年5月13日

相关基金

TIM-1-Fc介导辅助T淋巴细胞反应调控异位小肠移植免疫应答机制的研究

国家自然科学基金

0+阅读 · 2016年12月31日

NeuroD2调控人恶性神经胶质瘤细胞向神经元分化的作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

脂质代谢调控基因SRB1和ABCA1在前列腺癌恶性进展中的作用和机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Axl受体酪氨酸激酶在系统性红斑狼疮单核/巨噬细胞极化调控中机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

CD4+T细胞亚群失衡在高眼压视神经损伤中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

RI与Angiogenin相互作用调控PI3K/AKT/mTOR信号通路和ANG的核转位在膀胱癌发生发展中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

先天免疫RIG-I信号通路与动脉粥样硬化的研究

国家自然科学基金

1+阅读 · 2012年12月31日

MicroRNA-199b-HIF-1α-VEGF通路对肺移植慢性免疫排异反应的调控作用及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

hTERT/Tet-on/GAL基因修饰BMSCs对慢性神经病理痛的可控性镇痛作用及其机制

国家自然科学基金

0+阅读 · 2011年12月31日

单壁碳纳米管的免疫毒理和毒理基因组学分子机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员