The advances in the development of Facilitative Playbacks extracted from High-Speed videoendoscopic sequences of the vocal folds are hindered by a notable lack of publicly available datasets annotated with the semantic segmentations corresponding to the area of the glottal gap. This fact also limits the reproducibility and further exploration of existing research in this field. To address this gap, GIRAFE is a data repository designed to facilitate the development of advanced techniques for the semantic segmentation, analysis, and fast evaluation of High-Speed videoendoscopic sequences of the vocal folds. The repository includes 65 high-speed videoendoscopic recordings from a cohort of 50 patients (30 female, 20 male). The dataset comprises 15 recordings from healthy controls, 26 from patients with diagnosed voice disorders, and 24 with an unknown health condition. All of them were manually annotated by an expert, including the masks corresponding to the semantic segmentation of the glottal gap. The repository is also complemented with the automatic segmentation of the glottal area using different state-of-the-art approaches. This data set has already supported several studies, which demonstrates its usefulness for the development of new glottal gap segmentation algorithms from High-Speed-Videoendoscopic sequences to improve or create new Facilitative Playbacks. Despite these advances and others in the field, the broader challenge of performing an accurate and completely automatic semantic segmentation method of the glottal area remains open.
翻译:从声带高速视频内窥镜序列中提取辅助回放技术的发展,因缺乏公开可用的、标注有声门间隙区域语义分割的数据集而受到显著阻碍。这一事实也限制了该领域现有研究的可复现性与进一步探索。为填补这一空白,GIRAFE是一个旨在促进声带高速视频内窥镜序列语义分割、分析与快速评估先进技术开发的数据存储库。该库包含来自50名患者(30名女性,20名男性)的65段高速视频内窥镜记录。数据集包括15段健康对照者的记录、26段确诊嗓音障碍患者的记录以及24段健康状况未知的记录。所有记录均由专家手动标注,包含声门间隙语义分割对应的掩码。该存储库还补充了使用不同前沿方法实现的声门区域自动分割结果。该数据集已支持多项研究,证明了其在开发基于高速视频内窥镜序列的新型声门间隙分割算法以改进或创建新辅助回放技术方面的实用性。尽管该领域已取得这些进展及其他成果,实现精确且完全自动化的声门区域语义分割方法这一更广泛的挑战仍然有待解决。