Abstract
Introduction: Federated learning has the potential to perfrom analysis on decentralised data; however, there are some obstacles to survival analyses as there is a risk of data leakage. This study demonstrates how to perform a stratified Cox regression survival analysis specifically designed to avoid data leakage using federated learning on larynx cancer patients from centres in three different countries. Methods: Data were obtained from 1821 larynx cancer patients treated with radiotherapy in three centres. Tumour volume was available for all 786 of the included patients. Parameter selection among eleven clinical and radiotherapy parameters were performed using best subset selection and cross-validation through the federated learning system, AusCAT. After parameter selection, β regression coefficients were estimated using bootstrap. Calibration plots were generated at 2 and 5-years survival, and inner and outer risk groups’ Kaplan-Meier curves were compared to the Cox model prediction. Results: The best performing Cox model included log(GTV), performance status, age, smoking, haemoglobin and N-classification; however, the simplest model with similar statistical prediction power included log(GTV) and performance status only. The Harrell C-indices for the simplest model were for Odense, Christie and Liverpool 0.75[0.71–0.78], 0.65[0.59–0.71], and 0.69[0.59–0.77], respectively. The values are slightly higher for the full model with C-index 0.77[0.74–0.80], 0.67[0.62–0.73] and 0.71[0.61–0.80], respectively. Smoking during treatment has the same hazard as a ten-years older nonsmoking patient. Conclusion: Without any patient-specific data leaving the hospitals, a stratified Cox regression model based on data from centres in three countries was developed without data leakage risks. The overall survival model is primarily driven by tumour volume and performance status.
| Original language | English |
|---|---|
| Journal | Radiotherapy & Oncology |
| Volume | 176 |
| Pages (from-to) | 179-186 |
| ISSN | 0167-8140 |
| DOIs | |
| Publication status | Published - Nov 2022 |
Bibliographical note
Publisher Copyright:© 2022 The Authors
Funding
Danish Cancer Society grant, University of Southern Denmark scholarship, Odense University Hospital scholarship and Danish Cancer Research Fund. CB acknowledges support from AgeCare (Academy of Geriatric Cancer Research), an international research collaboration based at Odense University Hospital, Denmark. MF acknowledges support from Cancer Institute NSW Early Career Fellowship 2019/ECF004.
Keywords
- Cox survival model
- Data leakage
- Distributed learning
- Federated learning
- Larynx cancer
- Stratified Cox model
Fingerprint
Dive into the research topics of 'Larynx cancer survival model developed through open-source federated learning'. Together they form a unique fingerprint.Related research output
- 1 Ph.D. thesis
-
Data mining in head and neck cancer
Hansen, C. R., 19. Aug 2022, Syddansk Universitet. Det Sundhedsvidenskabelige Fakultet. 181 p.Research output: Thesis › Ph.D. thesis
Open AccessFile350 Downloads (Pure)
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver