WCEbleedGen: A wireless capsule endoscopy dataset and its benchmarking for automatic bleeding classification, detection, and segmentation

Computer-based analysis of Wireless Capsule Endoscopy (WCE) is crucial. However, a medically annotated WCE dataset for training and evaluation of automatic classification, detection, and segmentation of bleeding and non-bleeding frames is currently lacking. The present work focused on development of a medically annotated WCE dataset called WCEbleedGen for automatic classification, detection, and segmentation of bleeding and non-bleeding frames. It comprises 2,618 WCE bleeding and non-bleeding frames which were collected from various internet resources and existing WCE datasets. A comprehensive benchmarking and evaluation of the developed dataset was done using nine classification-based, three detection-based, and three segmentation-based deep learning models. The dataset is of high-quality, is class-balanced and contains single and multiple bleeding sites. Overall, our standard benchmark results show that Visual Geometric Group (VGG) 19, You Only Look Once version 8 nano (YOLOv8n), and Link network (Linknet) performed best in automatic classification, detection, and segmentation-based evaluations, respectively. Automatic bleeding diagnosis is crucial for WCE video interpretations. This diverse dataset will aid in developing of real-time, multi-task learning-based innovative solutions for automatic bleeding diagnosis in WCE. The dataset and code are publicly available at https://zenodo.org/records/10156571 and https://github.com/misahub2023/Benchmarking-Codes-of-the-WCEBleedGen-dataset.

翻译：无线胶囊内镜（WCE）的计算机辅助分析至关重要。然而，目前缺乏用于训练和评估出血与非出血帧自动分类、检测及分割的医学标注WCE数据集。本研究致力于构建一个名为WCEbleedGen的医学标注WCE数据集，用于出血与非出血帧的自动分类、检测与分割。该数据集包含2,618张WCE出血与非出血帧图像，采集自多种网络资源及现有WCE数据集。通过九种基于分类、三种基于检测及三种基于分割的深度学习模型，对构建的数据集进行了全面的基准测试与评估。本数据集质量高、类别均衡，并包含单发与多发出血病灶。总体而言，标准基准测试结果表明：视觉几何组（VGG）19、You Only Look Once version 8 nano（YOLOv8n）及Link网络（Linknet）分别在基于分类、检测与分割的自动评估中表现最佳。自动出血诊断对WCE视频判读至关重要。本多样化数据集将有助于开发基于实时多任务学习的创新解决方案，以实现WCE中的自动出血诊断。数据集与代码已公开于https://zenodo.org/records/10156571 与 https://github.com/misahub2023/Benchmarking-Codes-of-the-WCEBleedGen-dataset。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日