Generative Active Learning with Variational Autoencoder for Radiology Data Generation in Veterinary Medicine

Recently, with increasing interest in pet healthcare, the demand for computer-aided diagnosis (CAD) systems in veterinary medicine has increased. The development of veterinary CAD has stagnated due to a lack of sufficient radiology data. To overcome the challenge, we propose a generative active learning framework based on a variational autoencoder. This approach aims to alleviate the scarcity of reliable data for CAD systems in veterinary medicine. This study utilizes datasets comprising cardiomegaly radiograph data. After removing annotations and standardizing images, we employed a framework for data augmentation, which consists of a data generation phase and a query phase for filtering the generated data. The experimental results revealed that as the data generated through this framework was added to the training data of the generative model, the frechet inception distance consistently decreased from 84.14 to 50.75 on the radiograph. Subsequently, when the generated data were incorporated into the training of the classification model, the false positive of the confusion matrix also improved from 0.16 to 0.66 on the radiograph. The proposed framework has the potential to address the challenges of data scarcity in medical CAD, contributing to its advancement.

翻译：近年来，随着宠物医疗关注度的提升，兽医学领域对计算机辅助诊断系统的需求日益增长。由于缺乏充足的放射学数据，兽医学计算机辅助诊断的发展停滞不前。为应对这一挑战，我们提出了一种基于变分自编码器的生成式主动学习框架。该方法旨在缓解兽医学计算机辅助诊断系统中可靠数据稀缺的问题。本研究使用了包含心脏肥大放射影像的数据集。在移除标注信息并标准化图像后，我们采用了一个数据增强框架，该框架包含数据生成阶段和用于筛选生成数据的查询阶段。实验结果表明，随着通过该框架生成的数据被加入生成模型的训练集，放射影像的Frechet初始距离从84.14持续下降至50.75。随后，当生成数据被纳入分类模型的训练后，混淆矩阵的假阳性率也由0.16改善至0.66。本框架有望解决医学计算机辅助诊断领域的数据稀缺难题，推动该领域的发展。

相关内容

CAD

关注 3

《计算机辅助设计》是一份领先的国际期刊，为学术界和工业界提供有关计算机应用于设计的研究和发展的重要论文。计算机辅助设计邀请论文报告新的研究以及新颖或特别重要的应用，在广泛的主题中，跨越所有阶段的设计过程，从概念创造到制造超越。官网地址：http://dblp.uni-trier.de/db/journals/cad/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日