Difficult-to-treat depression (DTD) has been proposed as a broader and more clinically comprehensive perspective on a person's depressive disorder where despite treatment, they continue to experience significant burden. We sought to develop a Large Language Model (LLM)-based tool capable of interrogating routinely-collected, narrative (free-text) electronic health record (EHR) data to locate published prognostic factors that capture the clinical syndrome of DTD. In this work, we use LLM-generated synthetic data (GPT3.5) and a Non-Maximum Suppression (NMS) algorithm to train a BERT-based span extraction model. The resulting model is then able to extract and label spans related to a variety of relevant positive and negative factors in real clinical data (i.e. spans of text that increase or decrease the likelihood of a patient matching the DTD syndrome). We show it is possible to obtain good overall performance (0.70 F1 across polarity) on real clinical data on a set of as many as 20 different factors, and high performance (0.85 F1 with 0.95 precision) on a subset of important DTD factors such as history of abuse, family history of affective disorder, illness severity and suicidality by training the model exclusively on synthetic data. Our results show promise for future healthcare applications especially in applications where traditionally, highly confidential medical data and human-expert annotation would normally be required.
翻译:难治性抑郁症(DTD)被提出作为一种更广泛且更具临床综合性的视角,用以描述尽管接受治疗,患者仍持续承受显著负担的抑郁障碍。我们旨在开发一种基于大语言模型(LLM)的工具,能够分析常规收集的非结构化(自由文本)电子健康记录(EHR)数据,以定位可捕捉DTD临床综合征的已知预后因素。在本研究中,我们利用LLM生成的合成数据(GPT3.5)和非极大值抑制(NMS)算法,训练了一个基于BERT的跨度提取模型。由此得到的模型能够从真实临床数据中提取并标注与多种相关正负因素相关的文本跨度(即增加或减少患者符合DTD综合征可能性的文本片段)。我们证明,仅通过合成数据训练模型,即可在多达20种不同因素的真实临床数据上获得良好的整体性能(跨极性的F1值为0.70),并在诸如虐待史、情感障碍家族史、疾病严重程度及自杀倾向等重要DTD因素的子集上实现高精度(F1值为0.85,精确度为0.95)。我们的结果表明,该方法在未来的医疗应用(尤其是传统上需要高度机密医疗数据和人类专家标注的场景)中具有广阔前景。