SegRap2025: A Benchmark of Gross Tumor Volume and Lymph Node Clinical Target Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma

翻译：SegRap2025：用于鼻咽癌放疗计划的大体肿瘤体积与淋巴结临床靶区分割基准

Jia Fu,Litingyu Wang,He Li,Zihao Luo,Huamin Wang,Chenyuan Bian,Zijun Gao,Chunbin Gu,Xin Weng,Jianghao Wu,Yicheng Wu,Jin Ye,Linhao Li,Yiwen Ye,Yong Xia,Elias Tappeiner,Fei He,Abdul qayyum,Moona Mazher,Steven A Niederer,Junqiang Chen,Chuanyi Huang,Lisheng Wang,Zhaohu Xing,Hongqiu Wang,Lei Zhu,Shichuan Zhang,Shaoting Zhang,Wenjun Liao,Guotai Wang

Accurate delineation of Gross Tumor Volume (GTV), Lymph Node Clinical Target Volume (LN CTV), and Organ-at-Risk (OAR) from Computed Tomography (CT) scans is essential for precise radiotherapy planning in Nasopharyngeal Carcinoma (NPC). Building upon SegRap2023, which focused on OAR and GTV segmentation using single-center paired non-contrast CT (ncCT) and contrast-enhanced CT (ceCT) scans, the SegRap2025 challenge aims to enhance the generalizability and robustness of segmentation models across imaging centers and modalities. SegRap2025 comprises two tasks: Task01 addresses GTV segmentation using paired CT from the SegRap2023 dataset, with an additional external testing set to evaluate cross-center generalization, and Task02 focuses on LN CTV segmentation using multi-center training data and an unseen external testing set, where each case contains paired CT scans or a single modality, emphasizing both cross-center and cross-modality robustness. This paper presents the challenge setup and provides a comprehensive analysis of the solutions submitted by ten participating teams. For GTV segmentation task, the top-performing models achieved average Dice Similarity Coefficient (DSC) of 74.61% and 56.79% on the internal and external testing cohorts, respectively. For LN CTV segmentation task, the highest average DSC values reached 60.24%, 60.50%, and 57.23% on paired CT, ceCT-only, and ncCT-only subsets, respectively. SegRap2025 establishes a large-scale multi-center, multi-modality benchmark for evaluating the generalization and robustness in radiotherapy target segmentation, providing valuable insights toward clinically applicable automated radiotherapy planning systems. The benchmark is available at: https://hilab-git.github.io/SegRap2025_Challenge.

翻译：从计算机断层扫描（CT）图像中准确勾画大体肿瘤体积（GTV）、淋巴结临床靶区（LN CTV）以及危及器官（OAR），对于鼻咽癌（NPC）的精确放疗计划至关重要。SegRap2023挑战赛专注于使用单中心配对的平扫CT（ncCT）与增强CT（ceCT）进行OAR与GTV分割，而SegRap2025挑战赛在此基础上，旨在提升分割模型在不同影像中心与成像模态间的泛化性与鲁棒性。SegRap2025包含两项任务：Task01利用SegRap2023数据集中的配对CT进行GTV分割，并引入额外的外部测试集以评估跨中心泛化能力；Task02则专注于LN CTV分割，使用多中心训练数据和一个未见过的外部测试集，其中每个病例包含配对CT扫描或单一模态数据，重点考察跨中心与跨模态的鲁棒性。本文介绍了该挑战赛的设置，并对十个参赛团队提交的解决方案进行了全面分析。在GTV分割任务中，表现最佳的模型在内部与外部测试队列上分别达到了74.61%和56.79%的平均Dice相似系数（DSC）。在LN CTV分割任务中，最佳模型在配对CT、仅ceCT和仅ncCT子集上的最高平均DSC值分别达到了60.24%、60.50%和57.23%。SegRap2025建立了一个大规模、多中心、多模态的基准，用于评估放疗靶区分割的泛化性与鲁棒性，为开发临床可用的自动化放疗计划系统提供了宝贵洞见。该基准可通过以下网址获取：https://hilab-git.github.io/SegRap2025_Challenge。