Clinical texts, as found in electronic medical records (EMRs), contain rich medical information and are essential for disease prediction, personalised information recommendation, clinical decision support, and medication pattern mining and measurement. Extracting relations between medication mentions and temporal information can further help clinicians better understand a patient's treatment history. To evaluate the performance of deep learning (DL) and large language models (LLMs) on medication extraction and temporal relation classification, we carry out an empirical investigation within the \textbf{MedTem} project using several advanced learning architectures: BiLSTM-CRF and CNN-BiLSTM for clinical-domain named entity recognition (NER), and BERT-CNN for temporal relation extraction (RE). We also explore different word embedding techniques. Furthermore, we design a set of post-processing rules to generate structured output on medications and their temporal relations. Our experiments show that CNN-BiLSTM slightly outperforms the BiLSTM-CRF model on the i2b2-2009 clinical NER task, yielding 75.67, 77.83, and 78.17 for precision, recall, and F1 using macro average. The BERT-CNN model also produces reasonable evaluation scores of 64.48, 67.17, and 65.03 for P/R/F1 using macro average on the temporal relation extraction test set from the i2b2-2012 challenge. Code and tools from MedTem will be hosted at \url{https://github.com/HECTA-UoM/MedTem}.
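Since the scores above are reported as macro averages, a minimal sketch of that averaging scheme may help. It assumes token-level labels and takes macro F1 to be the unweighted mean of per-class F1 scores; the evaluation used in MedTem may differ (for example, entity-level matching). The function name `macro_prf` and the toy labels `DRUG`, `DOSE`, and `O` are hypothetical and not taken from the project.

```python
from collections import Counter


def macro_prf(gold, pred):
    """Return macro-averaged (precision, recall, F1) over all labels seen.

    Assumption: each class contributes equally, regardless of its frequency.
    """
    labels = sorted(set(gold) | set(pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted label p, but it was wrong
            fn[g] += 1  # gold label g was missed

    precisions, recalls, f1s = [], [], []
    for lab in labels:
        p_den = tp[lab] + fp[lab]
        r_den = tp[lab] + fn[lab]
        p = tp[lab] / p_den if p_den else 0.0
        r = tp[lab] / r_den if r_den else 0.0
        f = 2 * p * r / (p + r) if (p + r) else 0.0
        precisions.append(p)
        recalls.append(r)
        f1s.append(f)

    n = len(labels)
    return sum(precisions) / n, sum(recalls) / n, sum(f1s) / n


# Hypothetical toy example: six tokens, three classes.
gold = ["DRUG", "DRUG", "DOSE", "O", "O", "O"]
pred = ["DRUG", "O", "DOSE", "O", "O", "DOSE"]
macro_p, macro_r, macro_f1 = macro_prf(gold, pred)
print(f"P={macro_p:.4f} R={macro_r:.4f} F1={macro_f1:.4f}")
```

Because every class carries equal weight, rare medication-related labels influence the macro score as much as the frequent `O` label, which is one common reason to report macro rather than micro averages on imbalanced clinical NER data.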