Development of a national deep learning-based auto-segmentation model for the heart on clinical delineations from the DBCG RT nation cohort

Emma Riis Skarsø, Lasse Refsgaard, Abhilasha Saini, Ditte Sloth Møller, Ebbe Laugaard Lorenzen, Else Maae, Karen Andersen, Maja Vestmø Maraldo, Marie Louise Milo, Tine Bisballe Nyeng, Birgitte Vrou Offersen, Stine Sofia Korreman*

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

Abstract

BACKGROUND: This study aimed at investigating the feasibility of developing a deep learning-based auto-segmentation model for the heart trained on clinical delineations.

MATERIAL AND METHODS: This study included two different datasets. The first dataset contained clinical heart delineations from the DBCG RT Nation study (1,561 patients). The second dataset was smaller (114 patients), but with corrected heart delineations. Before training the model on the clinical delineations an outlier-detection was performed, to remove cases with gross deviations from the delineation guideline. No outlier detection was performed for the dataset with corrected heart delineations. Both models were trained with a 3D full resolution nnUNet. The models were evaluated with the dice similarity coefficient (DSC), 95% Hausdorff distance (HD95) and Mean Surface Distance (MSD). The difference between the models were tested with the Mann-Whitney U-test. The balance of dataset quantity versus quality was investigated, by stepwise reducing the cohort size for the model trained on clinical delineations.

RESULTS: During the outlier-detection 137 patients were excluded from the clinical cohort due to non-compliance with delineation guidelines. The model trained on the curated clinical cohort performed with a median DSC of 0.96 (IQR 0.94-0.96), median HD95 of 4.00 mm (IQR 3.00 mm-6.00 mm) and a median MSD of 1.49 mm (IQR 1.12 mm-2.02 mm). The model trained on the dedicated and corrected cohort performed with a median DSC of 0.95 (IQR 0.93-0.96), median HD95 of 5.65 mm (IQR 3.37 mm-8.62 mm) and median MSD of 1.63 mm (IQR 1.35 mm-2.11 mm). The difference between the two models were found non-significant for all metrics (p > 0.05). Reduction of cohort size showed no significant difference for all metrics (p > 0.05). However, with the smallest cohort size, a few outlier structures were found.

CONCLUSIONS: This study demonstrated a deep learning-based auto-segmentation model trained on curated clinical delineations which performs on par with a model trained on dedicated delineations, making it easier to develop multi-institutional auto-segmentation models.

Original languageEnglish
JournalActa Oncologica
Volume62
Issue number10
Pages (from-to)1201-1207
Number of pages7
ISSN0284-186X
DOIs
Publication statusPublished - Oct 2023

Keywords

  • Deep learning-based auto-segmentation
  • breast cancer
  • clinical delineations
  • radiotherapy
  • whole heart

Fingerprint

Dive into the research topics of 'Development of a national deep learning-based auto-segmentation model for the heart on clinical delineations from the DBCG RT nation cohort'. Together they form a unique fingerprint.

Cite this