Improving Generalizability of Extracting Social Determinants of Health Using Large Language Models through Prompt-tuning

The progress in natural language processing (NLP) using large language models (LLMs) has greatly improved patient information extraction from clinical narratives. However, most methods based on the fine-tuning strategy have limited transfer learning ability for cross-domain applications. This study proposed a novel approach that employs a soft prompt-based learning architecture, which introduces trainable prompts to guide LLMs toward desired outputs. We examined two types of LLM architectures, including encoder-only GatorTron and decoder-only GatorTronGPT, and evaluated their performance for the extraction of social determinants of health (SDoH) using a cross-institution dataset from the 2022 n2c2 challenge and a cross-disease dataset from the University of Florida (UF) Health. The results show that decoder-only LLMs with prompt tuning achieved better performance in cross-domain applications. GatorTronGPT achieved the best F1 scores for both datasets, outperforming traditional fine-tuned GatorTron by 8.9% and 21.8% in a cross-institution setting, and 5.5% and 14.5% in a cross-disease setting.

翻译：自然语言处理（NLP）在大型语言模型（LLMs）应用方面的进展极大提升了从临床叙述中提取患者信息的能力。然而，基于微调策略的大多数方法在跨领域应用中泛化能力有限。本研究提出了一种新方法，采用基于软提示的学习架构，通过引入可训练提示来引导LLMs生成期望输出。我们研究了两种类型的LLM架构，包括仅编码器GatorTron和仅解码器GatorTronGPT，并使用2022年n2c2挑战赛的跨机构数据集和佛罗里达大学（UF）健康中心的跨疾病数据集，评估了它们在提取健康社会决定因素（SDoH）方面的性能。结果表明，采用提示调优的仅解码器LLM在跨领域应用中表现更佳。GatorTronGPT在两个数据集上均取得了最佳F1分数，在跨机构设置中分别比传统微调GatorTron高出8.9%和21.8%，在跨疾病设置中分别高出5.5%和14.5%。

相关内容

大语言模型

关注 66

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日