The increasing availability of unstructured clinical narratives in electronic health records (EHRs) has created new opportunities for automated disease characterization, cohort identification, and clinical decision support. However, modeling long, domain-specific clinical text remains challenging due to limited labeled data, severe class imbalance, and the high computational cost of adapting large pretrained language models. This study presents a GPT-based architecture for clinical text classification that adapts a pretrained decoder-only Transformer using a selective fine-tuning strategy. Rather than updating all model parameters, the majority of the GPT-2 backbone is frozen, and training is restricted to the final Transformer block, the final layer normalization, and a lightweight classification head. This approach substantially reduces the number of trainable parameters while preserving the representational capacity required to model complex clinical language. The proposed method is evaluated on radiology reports from the MIMIC-IV-Note dataset using uncertainty-aware CheXpert-style labels derived directly from report text. Experiments cover multiple problem formulations, including multi-label classification of radiographic findings, binary per-label classification under different uncertainty assumptions, and aggregate disease outcome prediction. Across varying dataset sizes, the model exhibits stable convergence behavior and strong classification performance, particularly in settings dominated by non-mention and negated findings. Overall, the results indicate that selective fine-tuning of pretrained generative language models provides an efficient and effective pathway for clinical text classification, enabling scalable adaptation to real-world EHR data while significantly reducing computational complexity.
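The selective fine-tuning strategy described above can be sketched in a few lines. The following is a minimal illustration only, using the HuggingFace `transformers` API with a small randomly initialized GPT-2 configuration; the label count (14, matching the standard CheXpert finding set), the last-token pooling, and all hyperparameters are assumptions for demonstration, not details taken from the study.

```python
import torch
from transformers import GPT2Config, GPT2Model

# Small randomly initialized config for illustration; the study uses a
# pretrained GPT-2 backbone (GPT2Model.from_pretrained("gpt2")).
config = GPT2Config(n_layer=4, n_embd=64, n_head=2,
                    vocab_size=1000, n_positions=128)
backbone = GPT2Model(config)

# Freeze the entire backbone ...
for p in backbone.parameters():
    p.requires_grad = False
# ... then unfreeze only the final Transformer block and final layer norm.
for p in backbone.h[-1].parameters():
    p.requires_grad = True
for p in backbone.ln_f.parameters():
    p.requires_grad = True

# Lightweight classification head (trainable by default).
num_labels = 14  # assumed CheXpert-style finding count
head = torch.nn.Linear(config.n_embd, num_labels)

def classify(input_ids: torch.Tensor) -> torch.Tensor:
    """Return per-label logits for a batch of token-ID sequences."""
    hidden = backbone(input_ids).last_hidden_state   # (B, T, n_embd)
    pooled = hidden[:, -1, :]                        # last-token pooling
    return head(pooled)                              # (B, num_labels)
```

In this sketch only the final block, `ln_f`, and the head contribute trainable parameters, so an optimizer built from `filter(lambda p: p.requires_grad, ...)` updates a small fraction of the model; per-label sigmoid outputs over these logits would support the multi-label formulation mentioned above.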