City council minutes are typically lengthy and formal documents with a bureaucratic writing style. Although publicly available, their structure often makes it difficult for citizens or journalists to efficiently find information. In this demo, we present CitiLink, a platform designed to transform unstructured municipal meeting minutes into structured and searchable data, demonstrating how NLP and IR can enhance the accessibility and transparency of local government. The system employs LLMs to extract metadata, discussed subjects, and voting outcomes, which are then indexed in a database to support full-text search with BM25 ranking and faceted filtering through a user-friendly interface. The developed system was built over a collection of 120 minutes made available by six Portuguese municipalities. To assess its usability, CitiLink was tested through guided sessions with municipal personnel, providing insights into how real users interact with the system. In addition, we evaluated Gemini's performance in extracting relevant information from the minutes, highlighting its effectiveness in data extraction.
翻译:市议会会议记录通常是篇幅冗长、行文正式的公文,采用官僚主义写作风格。尽管这些记录公开可用,但其结构往往使公民或记者难以高效查找信息。在本演示中,我们介绍了CitiLink平台,该平台旨在将非结构化的市政会议记录转化为结构化、可搜索的数据,展示了自然语言处理与信息检索技术如何提升地方政府信息的可访问性与透明度。该系统利用大语言模型提取元数据、讨论议题及投票结果,随后将其索引至数据库,通过用户友好界面支持基于BM25排序的全文检索与分面过滤功能。该系统的开发基于葡萄牙六个市政当局提供的120份会议记录数据集。为评估其可用性,我们通过市政人员的引导式测试环节对CitiLink进行了实测,从而深入理解真实用户与系统的交互模式。此外,我们评估了Gemini模型从会议记录中提取相关信息的表现,凸显了其在数据提取任务中的有效性。