TY - JOUR
T1 - Development and comprehensive evaluation of a national DBCG consensus-based auto-segmentation model for lymph node levels in breast cancer radiotherapy
AU - Buhl, Emma Skarsø
AU - Lorenzen, Ebbe Laugaard
AU - Refsgaard, Lasse
AU - Nielsen, Anders Winther Mølby
AU - Brixen, Annette Torbøl Lund
AU - Maae, Else
AU - Holm, Hanne Spangsberg
AU - Schøler, Joachim
AU - Thai, Linh My Hoang
AU - Matthiessen, Louise Wichmann
AU - Maraldo, Maja Vestmø
AU - Nielsen, Mathias Maximiliano
AU - Johansen, Marianne Besserman
AU - Milo, Marie Louise
AU - Mogensen, Marie Benzon
AU - Nielsen, Mette Holck
AU - Møller, Mette
AU - Sand, Maja
AU - Schultz, Peter
AU - Al-Rawi, Sami Aziz Jowad
AU - Esser-Naumann, Saskia
AU - Yammeni, Sophie
AU - Petersen, Stine Elleberg
AU - Offersen, Birgitte Vrou
AU - Korreman, Stine Sofia
N1 - Publisher Copyright:
© 2024 The Authors
PY - 2024/12
Y1 - 2024/12
N2 - Background and purpose: This study aimed at training and validating a multi-institutional deep learning (DL) auto segmentation model for nodal clinical target volume (CTVn) in high-risk breast cancer (BC) patients with both training and validation dataset created with multi-institutional participation, with the overall aim of national clinical implementation in Denmark. Materials and methods: A gold standard (GS) dataset and a high-quality training dataset were created by 21 BC delineation experts from all radiotherapy centres in Denmark. The delineations were created according to ESTRO consensus delineation guidelines. Four models were trained: One per laterality and extension of CTVn internal mammary nodes. The DL models were tested quantitatively in their own test-set and in relation to interobserver variation (IOV) in the GS dataset with geometrical metrics, such as the Dice Similarity Coefficient (DSC). A blinded qualitative evaluation was conducted with a national board, presented to both DL and manual delineations. Results: A median DSC > 0.7 was found for all, except the CTVn interpectoral node in one of the models. In the qualitative evaluation ‘no corrections needed’ were acquired for 297 (36 %) in the DL structures and 286 (34 %) for manual delineations. A higher rate of ‘major corrections’ and ‘easier to start from scratch’ was found in the manual delineations. The models performed within the IOV of an expert group, with two exceptions. Conclusion: DL models were developed on a national consensus cohort and performed on par with the IOV between BC experts and had a comparable or higher clinical acceptance than expert manual delineations.
AB - Background and purpose: This study aimed at training and validating a multi-institutional deep learning (DL) auto segmentation model for nodal clinical target volume (CTVn) in high-risk breast cancer (BC) patients with both training and validation dataset created with multi-institutional participation, with the overall aim of national clinical implementation in Denmark. Materials and methods: A gold standard (GS) dataset and a high-quality training dataset were created by 21 BC delineation experts from all radiotherapy centres in Denmark. The delineations were created according to ESTRO consensus delineation guidelines. Four models were trained: One per laterality and extension of CTVn internal mammary nodes. The DL models were tested quantitatively in their own test-set and in relation to interobserver variation (IOV) in the GS dataset with geometrical metrics, such as the Dice Similarity Coefficient (DSC). A blinded qualitative evaluation was conducted with a national board, presented to both DL and manual delineations. Results: A median DSC > 0.7 was found for all, except the CTVn interpectoral node in one of the models. In the qualitative evaluation ‘no corrections needed’ were acquired for 297 (36 %) in the DL structures and 286 (34 %) for manual delineations. A higher rate of ‘major corrections’ and ‘easier to start from scratch’ was found in the manual delineations. The models performed within the IOV of an expert group, with two exceptions. Conclusion: DL models were developed on a national consensus cohort and performed on par with the IOV between BC experts and had a comparable or higher clinical acceptance than expert manual delineations.
KW - Breast cancer
KW - Deep learning-based auto-segmentation
KW - National
KW - Quantitative and qualitative evaluation
KW - Radiotherapy
KW - Target delineation
UR - http://www.scopus.com/inward/record.url?scp=85205818085&partnerID=8YFLogxK
U2 - 10.1016/j.radonc.2024.110567
DO - 10.1016/j.radonc.2024.110567
M3 - Journal article
C2 - 39374675
AN - SCOPUS:85205818085
SN - 0167-8140
VL - 201
JO - Radiotherapy and Oncology
JF - Radiotherapy and Oncology
M1 - 110567
ER -