Experimenting with Large Language Models and vector embeddings in NASA SciX

Sergi Blanco-Cuaresma,Ioana Ciucă,Alberto Accomazzi,Michael J. Kurtz,Edwin A. Henneken,Kelly E. Lockhart,Felix Grezes,Thomas Allen,Golnaz Shapurian,Carolyn S. Grant,Donna M. Thompson,Timothy W. Hostetler,Matthew R. Templeton,Shinyi Chen,Jennifer Koch,Taylor Jacovich,Daniel Chivvis,Fernanda de Macedo Alves,Jean-Claude Paquin,Jennifer Bartlett,Mugdha Polimera,Stephanie Jarmak

from arxiv, To appear in the proceedings of the 33th annual international Astronomical Data Analysis Software & Systems (ADASS XXXIII)

Open-source Large Language Models enable projects such as NASA SciX (i.e., NASA ADS) to think out of the box and try alternative approaches for information retrieval and data augmentation, while respecting data copyright and users' privacy. However, when large language models are directly prompted with questions without any context, they are prone to hallucination. At NASA SciX we have developed an experiment where we created semantic vectors for our large collection of abstracts and full-text content, and we designed a prompt system to ask questions using contextual chunks from our system. Based on a non-systematic human evaluation, the experiment shows a lower degree of hallucination and better responses when using Retrieval Augmented Generation. Further exploration is required to design new features and data augmentation processes at NASA SciX that leverages this technology while respecting the high level of trust and quality that the project holds.

翻译：开源大型语言模型使得NASA SciX（即NASA ADS）等科研项目能够突破传统思维，尝试采用创新方法进行信息检索与数据增强，同时兼顾数据版权保护与用户隐私。然而，当大型语言模型在缺乏上下文语境的情况下直接响应提问时，极易产生"幻觉"现象（即生成不准确或虚构的内容）。为此，我们在NASA SciX中设计了一项实验：为海量科研摘要及全文内容构建语义向量，并开发了基于系统上下文片段进行提问的提示系统。基于非系统化的人工评估结果表明，采用检索增强生成（RAG）技术时，模型幻觉程度显著降低且响应质量更优。未来仍需进一步探索如何基于该技术，在坚持项目对可信度与内容质量的严格要求下，为NASA SciX设计新型功能及数据增强处理流程。

相关内容

大语言模型

关注 66

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【WSDM2020】超越统计关系：将知识关系整合到多标签音乐风格分类的风格关联中（附pdf）

专知会员服务

18+阅读 · 2019年11月23日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日