The accessibility of legal information remains a constant challenge, particularly for laypersons seeking to understand and apply complex institutional texts. While the European Union provides open access to legislation, parliamentary responses, and regulatory documents, these resources can be challenging for laypeople to explore. In this paper, we introduce EUDial, a proactive multi-turn dialogue dataset constructed from 204 blogs curated by the Citizens' Enquiries Unit (AskEP) of the European Parliamentary Research Service. EUDial contains 880 dialogue turns (averaging 4.3 turns per dialogue), where each dialogue includes initial questions, structured answers, and follow-up questions. Beyond dataset construction, we propose the LexGuide framework that leverages retrieval-augmented generation with hierarchical topic organization to structure dialogue progression, ensuring both comprehensive coverage of legal aspects and coherence across conversational turns. The results demonstrate that proactive, structured navigation closes the gap between the availability of legal information and citizen comprehension, establishing EUDial and LexGuide as practical resources for advancing proactive legal dialogue systems.
翻译:法律信息的可及性始终是一个持续挑战,尤其对于试图理解和应用复杂机构文本的非专业人士而言。尽管欧盟提供了立法、议会答复和监管文件的公开访问渠道,但这些资源对于非专业人士而言可能难以探索。本文介绍了EUDial,这是一个基于欧洲议会研究服务局公民咨询处(AskEP)整理的204篇博客构建的主动多轮对话数据集。EUDial包含880个对话轮次(平均每个对话4.3轮),其中每个对话均包含初始问题、结构化答案和后续问题。除数据集构建外,我们提出了LexGuide框架,该框架利用检索增强生成与分层主题组织来结构化对话进程,确保法律层面的全面覆盖和跨对话轮次的连贯性。结果表明,主动的结构化导航弥合了法律信息可用性与公民理解之间的差距,确立了EUDial和LexGuide作为推进主动法律对话系统的实用资源。