We introduce Edu-ConvoKit, an open-source library designed to handle pre-processing, annotation and analysis of conversation data in education. Resources for analyzing education conversation data are scarce, making the research challenging to perform and therefore hard to access. We address these challenges with Edu-ConvoKit. Edu-ConvoKit is open-source (https://github.com/stanfordnlp/edu-convokit ), pip-installable (https://pypi.org/project/edu-convokit/ ), with comprehensive documentation (https://edu-convokit.readthedocs.io/en/latest/ ). Our demo video is available at: https://youtu.be/zdcI839vAko?si=h9qlnl76ucSuXb8- . We include additional resources, such as Colab applications of Edu-ConvoKit to three diverse education datasets and a repository of Edu-ConvoKit related papers, that can be found in our GitHub repository.
翻译:我们介绍了Edu-ConvoKit,这是一个专为处理教育领域对话数据的预处理、标注与分析而设计的开源工具库。目前,用于分析教育对话数据的资源极为匮乏,这使得相关研究难以开展,进而阻碍了学术可及性。针对这些挑战,我们推出了Edu-ConvoKit。该库以开源形式发布(https://github.com/stanfordnlp/edu-convokit ),可通过pip安装(https://pypi.org/project/edu-convokit/ ),并配有详尽的文档(https://edu-convokit.readthedocs.io/en/latest/ )。演示视频参见:https://youtu.be/zdcI839vAko?si=h9qlnl76ucSuXb8- 。此外,我们在GitHub仓库中提供了附加资源,包括将Edu-ConvoKit应用于三个不同教育数据集的Colab示例,以及相关论文资料库。