GIRAFE: Glottal Imaging Dataset for Advanced Segmentation, Analysis, and Facilitative Playbacks Evaluation

Progress on Facilitative Playbacks derived from high-speed videoendoscopic sequences of the vocal folds is hindered by a lack of publicly available datasets annotated with semantic segmentations of the glottal gap. This scarcity also limits the reproducibility and further exploration of existing research in the field. To address it, GIRAFE is a data repository designed to support the development of advanced techniques for the semantic segmentation, analysis, and fast evaluation of high-speed videoendoscopic sequences of the vocal folds. The repository includes 65 high-speed videoendoscopic recordings from a cohort of 50 patients (30 female, 20 male): 15 recordings from healthy controls, 26 from patients with diagnosed voice disorders, and 24 with an unknown health condition. All recordings were manually annotated by an expert, including masks for the semantic segmentation of the glottal gap. The repository is complemented by automatic segmentations of the glottal area produced with several state-of-the-art approaches. The dataset has already supported several studies, which demonstrates its usefulness for developing new glottal gap segmentation algorithms for high-speed videoendoscopic sequences and for improving or creating Facilitative Playbacks. Despite these and other advances in the field, accurate and fully automatic semantic segmentation of the glottal area remains an open challenge.
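
Because the repository pairs expert masks with automatic segmentations, a natural use is to score one against the other. The following is a minimal sketch of two standard overlap measures, the Dice coefficient and intersection over union (IoU). It assumes each mask has already been loaded as a nested list of 0/1 values for a single frame; the repository's file layout is not described here, and the names `dice`, `iou`, and `_count_overlap` are illustrative, not part of GIRAFE. The abstract does not state which metrics the authors used.

```python
from typing import Sequence, Tuple

# Assumption: a mask is a 2D grid of 0/1 values, 1 = glottal gap pixel.
Mask = Sequence[Sequence[int]]


def _count_overlap(pred: Mask, truth: Mask) -> Tuple[int, int, int]:
    """Return (intersection, predicted area, reference area) in pixels."""
    if len(pred) != len(truth) or any(
        len(p_row) != len(t_row) for p_row, t_row in zip(pred, truth)
    ):
        raise ValueError("masks must have the same shape")
    inter = pred_area = truth_area = 0
    for p_row, t_row in zip(pred, truth):
        for p, t in zip(p_row, t_row):
            p_on, t_on = bool(p), bool(t)
            inter += int(p_on and t_on)
            pred_area += int(p_on)
            truth_area += int(t_on)
    return inter, pred_area, truth_area


def dice(pred: Mask, truth: Mask) -> float:
    """Dice coefficient: 2|A∩B| / (|A| + |B|)."""
    inter, pred_area, truth_area = _count_overlap(pred, truth)
    total = pred_area + truth_area
    # Both masks empty (e.g. a fully closed glottis): treat as agreement.
    return 1.0 if total == 0 else 2.0 * inter / total


def iou(pred: Mask, truth: Mask) -> float:
    """Intersection over union: |A∩B| / |A∪B|."""
    inter, pred_area, truth_area = _count_overlap(pred, truth)
    union = pred_area + truth_area - inter
    return 1.0 if union == 0 else inter / union
```

Returning 1.0 when both masks are empty is a design choice for frames in which the glottis is closed; some evaluations exclude such frames instead.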
