This paper introduces an innovative approach using Retrieval-Augmented Generation (RAG) pipelines with Large Language Models (LLMs) to enhance information retrieval and query response systems for university-related question answering. By systematically extracting data from the university official webpage and employing advanced prompt engineering techniques, we generate accurate, contextually relevant responses to user queries. We developed a comprehensive university benchmark, UniversityQuestionBench (UQB), to rigorously evaluate our system performance, based on common key metrics in the filed of RAG pipelines, assessing accuracy and reliability through various metrics and real-world scenarios. Our experimental results demonstrate significant improvements in the precision and relevance of generated responses, enhancing user experience and reducing the time required to obtain relevant answers. In summary, this paper presents a novel application of RAG pipelines and LLMs, supported by a meticulously prepared university benchmark, offering valuable insights into advanced AI techniques for academic data retrieval and setting the stage for future research in this domain.
翻译:本文提出了一种创新方法,利用检索增强生成(RAG)流程与大语言模型(LLMs)相结合,以提升高校相关问答场景中的信息检索与查询响应系统性能。通过系统性地从高校官方网站提取数据,并采用先进的提示工程技术,我们能够针对用户查询生成准确且上下文相关的回答。我们构建了一个全面的高校基准测试集UniversityQuestionBench(UQB),基于RAG流程领域的常用关键指标,通过多种度量标准和真实场景严格评估系统性能。实验结果表明,所生成回答的精确度和相关性得到显著提升,改善了用户体验并缩短了获取相关答案所需的时间。总之,本文提出了一种RAG流程与LLMs的新颖应用,并辅以精心构建的高校基准测试集,为学术数据检索的先进人工智能技术提供了有价值的见解,并为该领域的未来研究奠定了基础。