Automated radiology report generation aims to produce reports that contain rich, fine-grained descriptions of radiology images. The task poses three challenges. First, unlike the natural images used in image captioning, medical images are very similar to one another and differ only subtly where disease occurs. Because these subtle differences matter greatly in a radiology report, the model must be encouraged to focus on the small regions where disease occurs. Second, visual and textual data biases are severe: normal cases make up the majority of the dataset, and sentences describing areas with pathological changes constitute only a small part of each report paragraph. Third, generating a medical image report is a long-text generation problem that demands medical expertise and experience, which makes the task harder still. To address these challenges, we propose a disease-oriented retrieval framework that uses similar reports as prior knowledge references, and we design a factual consistency captioning generator that produces more accurate and factually consistent disease descriptions. The framework finds the most similar reports for a given disease in the chest X-ray (CXR) database by retrieval with a disease-oriented mask consisting of position and morphological characteristics. By referencing both the retrieved disease-oriented report and the visual features, the factual consistency model generates a more accurate radiology report.
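
The retrieval step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how masks are compared, so the example below assumes binary masks scored by intersection-over-union (IoU); the function names `mask_iou` and `retrieve_reports`, the toy masks, and the report snippets are all hypothetical and are not taken from the authors' implementation.

```python
from typing import List, Sequence, Tuple

# Binary grid: 1 marks a pixel inside the disease region (assumed representation).
Mask = Sequence[Sequence[int]]


def mask_iou(a: Mask, b: Mask) -> float:
    """Intersection-over-union of two equally sized binary masks."""
    inter = union = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            inter += 1 if (pa and pb) else 0
            union += 1 if (pa or pb) else 0
    # Two empty masks share no disease region, so score them as 0.
    return inter / union if union else 0.0


def retrieve_reports(
    query: Mask, database: List[Tuple[Mask, str]], top_k: int = 1
) -> List[Tuple[float, str]]:
    """Return the top_k (score, report) pairs ranked by mask IoU with the query."""
    scored = [(mask_iou(query, mask), report) for mask, report in database]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Toy database: hypothetical masks and report snippets, for illustration only.
database = [
    ([[1, 1, 0], [1, 1, 0], [0, 0, 0]], "Opacity in the upper left region."),
    ([[0, 0, 0], [0, 1, 1], [0, 1, 1]], "Opacity in the lower right region."),
]
query = [[1, 1, 0], [1, 0, 0], [0, 0, 0]]
best = retrieve_reports(query, database, top_k=1)
```

In this toy setup the query overlaps only the first mask, so the first report is retrieved and could then be passed, together with the image's visual features, to a report generator. A learned mask embedding or a different similarity measure could replace IoU without changing the overall flow.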