This paper addresses the problem of generating questions from a given context and an answer, focusing on questions that require multi-hop reasoning across an extended context. Previous studies have suggested that key phrase selection is essential for question generation (QG), yet it remains challenging to connect such disjointed phrases into meaningful questions, particularly in long contexts. To mitigate this issue, we propose MultiFactor, a novel QG framework based on multi-level content planning. Specifically, MultiFactor includes two components: FA-model, which simultaneously selects key phrases and generates full answers, and Q-model, which takes the generated full answer as an additional input to generate questions. Here, full answer generation is introduced to connect the short answer with the selected key phrases, thus forming an answer-aware summary to facilitate QG. Both FA-model and Q-model are formalized as simple yet effective Phrase-Enhanced Transformers, our joint model for phrase selection and text generation. Experimental results show that our method outperforms strong baselines on two popular QG datasets. Our code is available at https://github.com/zeaver/MultiFactor.
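The two-stage data flow described above can be sketched as follows. This is only an illustrative outline, not the released implementation: the names `QGExample`, `FAModel`, `QModel`, and `generate_question` are hypothetical, and the two models are represented as plain callables so that the sketch shows nothing more than how the full answer produced by FA-model is passed to Q-model as an additional input.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class QGExample:
    """One QG input: a (possibly long) context and a short answer."""
    context: str
    answer: str


# Hypothetical interfaces, assumed for illustration only.
# FA-model: (context, short answer) -> (selected key phrases, full answer)
FAModel = Callable[[str, str], Tuple[List[str], str]]
# Q-model: (context, short answer, full answer) -> question
QModel = Callable[[str, str, str], str]


def generate_question(
    example: QGExample, fa_model: FAModel, q_model: QModel
) -> Dict[str, object]:
    """Run the two stages in order and return all intermediate outputs."""
    # Stage 1: select key phrases and generate the full answer jointly.
    phrases, full_answer = fa_model(example.context, example.answer)
    # Stage 2: generate the question, conditioning on the full answer
    # in addition to the context and the short answer.
    question = q_model(example.context, example.answer, full_answer)
    return {"phrases": phrases, "full_answer": full_answer, "question": question}


if __name__ == "__main__":
    # Toy stand-ins that return fixed strings; real models would be
    # trained sequence-to-sequence Transformers.
    def toy_fa_model(context: str, answer: str) -> Tuple[List[str], str]:
        return ["Warsaw", "Marie Curie"], f"Marie Curie was born in the capital of {answer}."

    def toy_q_model(context: str, answer: str, full_answer: str) -> str:
        return "Marie Curie was born in the capital of which country?"

    example = QGExample(
        context="Marie Curie was born in Warsaw. Warsaw is the capital of Poland.",
        answer="Poland",
    )
    print(generate_question(example, toy_fa_model, toy_q_model))
```

Keeping the two stages as separate callables mirrors the description of FA-model and Q-model as distinct components; how the phrase selection is fused into each Transformer is not specified here and is left out of the sketch.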