VerAs: Verify then Assess STEM Lab Reports

With an increasing focus in STEM education on critical thinking skills, science writing plays an ever more important role in curricula that stress inquiry skills. A recently published dataset of two sets of college level lab reports from an inquiry-based physics curriculum relies on analytic assessment rubrics that utilize multiple dimensions, specifying subject matter knowledge and general components of good explanations. Each analytic dimension is assessed on a 6-point scale, to provide detailed feedback to students that can help them improve their science writing skills. Manual assessment can be slow, and difficult to calibrate for consistency across all students in large classes. While much work exists on automated assessment of open-ended questions in STEM subjects, there has been far less work on long-form writing such as lab reports. We present an end-to-end neural architecture that has separate verifier and assessment modules, inspired by approaches to Open Domain Question Answering (OpenQA). VerAs first verifies whether a report contains any content relevant to a given rubric dimension, and if so, assesses the relevant sentences. On the lab reports, VerAs outperforms multiple baselines based on OpenQA systems or Automated Essay Scoring (AES). VerAs also performs well on an analytic rubric for middle school physics essays.

翻译：随着STEM教育日益重视批判性思维技能，科学写作在强调探究技能的课程中发挥着愈发重要的作用。近期发布的一个数据集包含基于探究式物理课程的两组大学水平实验报告，该数据集采用多维度分析性评估量规，分别考察学科知识掌握程度及优秀解释的通用要素。每个分析维度按6分制评分，旨在为学生提供有助于提升科学写作技能的详细反馈。人工评估速度缓慢且难以校准大班全体学生成绩的一致性。尽管关于STEM学科开放式问题的自动评估已有大量研究，但针对实验报告等长篇写作的自动评估研究仍明显不足。受开放领域问答（OpenQA）方法启发，我们提出一种端到端神经网络架构，该架构包含独立的验证器与评估模块。VerAs首先验证报告是否包含与给定量规维度相关的内容，若包含则对相关句子进行评估。在实验报告数据集上，VerAs的表现优于基于OpenQA系统或自动作文评分（AES）的多个基线方法。此外，VerAs在中学物理论文的分析性量规评估中也展现出优异性能。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日