Demystifying Large Language Models for Medicine: A Primer

Qiao Jin, Nicholas Wan, Robert Leaman, Shubo Tian, Zhizheng Wang, Yifan Yang, Zifeng Wang, Guangzhi Xiong, Po-Ting Lai, Qingqing Zhu, Benjamin Hou, Maame Sarfo-Gyamfi, Gongbo Zhang, Aidan Gilson, Balu Bhasuran, Zhe He, Aidong Zhang, Jimeng Sun, Chunhua Weng, Ronald M. Summers, Qingyu Chen, Yifan Peng, Zhiyong Lu

From arXiv. Under review.

Large language models (LLMs) are a transformative class of AI tools that could change many aspects of healthcare: they generate human-like responses across diverse contexts and adapt to novel tasks by following human instructions. Their potential applications span a broad range of medical tasks, such as clinical documentation, matching patients to clinical trials, and answering medical questions. In this primer, we propose an actionable guideline, along with a set of best practices, to help healthcare professionals use LLMs more efficiently in their work. The approach consists of several main phases: formulating the task, choosing LLMs, prompt engineering, fine-tuning, and deployment. We start by discussing critical considerations in identifying healthcare tasks that align with the core capabilities of LLMs and in selecting models based on the chosen task and data, performance requirements, and model interface. We then review strategies, such as prompt engineering and fine-tuning, for adapting standard LLMs to specialized medical tasks. We also discuss deployment considerations, including regulatory compliance, ethical guidelines, and continuous monitoring for fairness and bias. By providing a structured, step-by-step methodology, this tutorial aims to equip healthcare professionals with the tools they need to integrate LLMs into clinical practice effectively, so that these powerful technologies are applied in a safe, reliable, and impactful manner.

