WikiHow is an open-domain repository of instructional articles for a variety of tasks, which can be revised by users. In this paper, we extract pairwise versions of an instruction before and after a revision was made. Starting from a noisy dataset of revision histories, we specifically extract and analyze edits that involve cases of vagueness in instructions. We further investigate the ability of a neural model to distinguish between two versions of an instruction in our data by adopting a pairwise ranking task from previous work and showing improvements over existing baselines.
翻译:WikiHow是一个面向开放领域的指令文章知识库,涵盖多种任务,用户可对其实施修订。本文提取了每次修订前后指令的成对版本。我们从包含噪声的修订历史数据集中,专门提取并分析了涉及指令模糊性的编辑案例。进一步地,我们采用先前工作中的成对排序任务,探究神经模型区分数据集中指令两个版本的能力,并展示出相较于现有基线模型的改进。