Skip to main navigation Skip to search Skip to main content

Larynx cancer survival model developed through open-source federated learning

  • The Christie NHS Foundation Trust
  • Ingham Institute for Applied Medical Research
  • University of New South Wales
  • Macarthur Cancer Therapy Centre
  • The University of Sydney
  • Aarhus University Hospital
  • Liverpool and Macarthur Cancer Therapy Centres

Research output: Contribution to journalJournal articleResearchpeer-review

124 Downloads (Pure)

Abstract

Introduction: Federated learning has the potential to perfrom analysis on decentralised data; however, there are some obstacles to survival analyses as there is a risk of data leakage. This study demonstrates how to perform a stratified Cox regression survival analysis specifically designed to avoid data leakage using federated learning on larynx cancer patients from centres in three different countries. Methods: Data were obtained from 1821 larynx cancer patients treated with radiotherapy in three centres. Tumour volume was available for all 786 of the included patients. Parameter selection among eleven clinical and radiotherapy parameters were performed using best subset selection and cross-validation through the federated learning system, AusCAT. After parameter selection, β regression coefficients were estimated using bootstrap. Calibration plots were generated at 2 and 5-years survival, and inner and outer risk groups’ Kaplan-Meier curves were compared to the Cox model prediction. Results: The best performing Cox model included log(GTV), performance status, age, smoking, haemoglobin and N-classification; however, the simplest model with similar statistical prediction power included log(GTV) and performance status only. The Harrell C-indices for the simplest model were for Odense, Christie and Liverpool 0.75[0.71–0.78], 0.65[0.59–0.71], and 0.69[0.59–0.77], respectively. The values are slightly higher for the full model with C-index 0.77[0.74–0.80], 0.67[0.62–0.73] and 0.71[0.61–0.80], respectively. Smoking during treatment has the same hazard as a ten-years older nonsmoking patient. Conclusion: Without any patient-specific data leaving the hospitals, a stratified Cox regression model based on data from centres in three countries was developed without data leakage risks. The overall survival model is primarily driven by tumour volume and performance status.

Original languageEnglish
JournalRadiotherapy & Oncology
Volume176
Pages (from-to)179-186
ISSN0167-8140
DOIs
Publication statusPublished - Nov 2022

Bibliographical note

Publisher Copyright:
© 2022 The Authors

Funding

Danish Cancer Society grant, University of Southern Denmark scholarship, Odense University Hospital scholarship and Danish Cancer Research Fund. CB acknowledges support from AgeCare (Academy of Geriatric Cancer Research), an international research collaboration based at Odense University Hospital, Denmark. MF acknowledges support from Cancer Institute NSW Early Career Fellowship 2019/ECF004.

Keywords

  • Cox survival model
  • Data leakage
  • Distributed learning
  • Federated learning
  • Larynx cancer
  • Stratified Cox model

Fingerprint

Dive into the research topics of 'Larynx cancer survival model developed through open-source federated learning'. Together they form a unique fingerprint.

Cite this