A clinical trial is a study that evaluates new biomedical interventions. To design new trials, researchers draw inspiration from those current and completed. In 2022, there were on average more than 100 clinical trials submitted to ClinicalTrials.gov every day, with each trial having a mean of approximately 1500 words [1]. This makes it nearly impossible to keep up to date. To mitigate this issue, we have created a batch clinical trial summarizer called CliniDigest using GPT-3.5. CliniDigest is, to our knowledge, the first tool able to provide real-time, truthful, and comprehensive summaries of clinical trials. CliniDigest can reduce up to 85 clinical trial descriptions (approximately 10,500 words) into a concise 200-word summary with references and limited hallucinations. We have tested CliniDigest on its ability to summarize 457 trials divided across 27 medical subdomains. For each field, CliniDigest generates summaries of $\mu=153,\ \sigma=69 $ words, each of which utilizes $\mu=54\%,\ \sigma=30\% $ of the sources. A more comprehensive evaluation is planned and outlined in this paper.
翻译:临床试验是评估新型生物医学干预措施的研究。为设计新试验,研究人员从当前及已完成的试验中汲取灵感。2022年,每天平均有超过100项临床试验提交至ClinicalTrials.gov,每项试验平均约含1500字[1]。这使得及时跟踪进展几乎不可能。为解决此问题,我们利用GPT-3.5创建了名为CliniDigest的批量临床试验摘要工具。据我们所知,CliniDigest是首个能够提供实时、真实且全面临床试验摘要的工具。CliniDigest可将多达85项临床试验描述(约10,500字)压缩为一份引用有限、幻觉现象较少且简洁的200字摘要。我们在27个医学子领域对457项试验的摘要能力进行了测试。针对每个领域,CliniDigest生成的摘要平均字数为μ=153、σ=69,每份摘要平均利用了μ=54%、σ=30%的源信息。本文规划并概述了更全面的评估方案。