Large language models (LLMs) represent a transformative class of AI tools capable of revolutionizing various aspects of healthcare by generating human-like responses across diverse contexts and adapting to novel tasks following human instructions. Their potential application spans a broad range of medical tasks, such as clinical documentation, matching patients to clinical trials, and answering medical questions. In this primer paper, we propose an actionable guideline to help healthcare professionals more efficiently utilize LLMs in their work, along with a set of best practices. This approach consists of several main phases, including formulating the task, choosing LLMs, prompt engineering, fine-tuning, and deployment. We start with the discussion of critical considerations in identifying healthcare tasks that align with the core capabilities of LLMs and selecting models based on the selected task and data, performance requirements, and model interface. We then review the strategies, such as prompt engineering and fine-tuning, to adapt standard LLMs to specialized medical tasks. Deployment considerations, including regulatory compliance, ethical guidelines, and continuous monitoring for fairness and bias, are also discussed. By providing a structured step-by-step methodology, this tutorial aims to equip healthcare professionals with the tools necessary to effectively integrate LLMs into clinical practice, ensuring that these powerful technologies are applied in a safe, reliable, and impactful manner.
翻译:大型语言模型(LLMs)代表了一类变革性的人工智能工具,能够通过在不同场景下生成类人响应并适应人类指令下的新任务,从而革新医疗健康的多个方面。其潜在应用涵盖广泛的医疗任务,例如临床文档记录、患者与临床试验匹配以及医学问题解答。在本入门指南中,我们提出一套可操作的指导原则,帮助医疗专业人员更高效地在工作中运用LLMs,并附以一系列最佳实践。该方法包含若干主要阶段,包括任务定义、LLMs选择、提示工程、微调及部署。我们首先讨论识别与LLMs核心能力相匹配的医疗任务时的关键考量,以及根据选定任务与数据、性能需求和模型接口选择模型的标准。随后,我们综述了如提示工程和微调等策略,以使通用LLMs适配专业医疗任务。本文亦探讨了部署相关的考量,包括法规合规性、伦理准则以及对公平性与偏见的持续监测。通过提供结构化的分步方法,本教程旨在为医疗专业人员提供必要的工具,以将LLMs有效整合到临床实践中,确保这些强大技术能够以安全、可靠且具有影响力的方式得以应用。