Multimodal survival analysis integrates diverse medical data, such as whole slide images and genomic profiles, to improve prognostic accuracy in healthcare. However, data privacy regulations prevent the centralisation of patient data across institutions, limiting the applicability of conventional approaches. Federated learning addresses this challenge by enabling collaborative model training across distributed data silos without sharing raw data. This thesis presents an investigation into the integration of federated learning with multimodal survival analysis. This work builds upon a state-of-the-art early fusion architecture, adapting the training process for horizontally federated environments and implementing three federated algorithms: FedAvg, FedProx, and SCAFFOLD. Results demonstrate that federated learning consistently outperforms isolated training across all datasets and approaches or exceeds centralised performance in balanced settings. Among federated algorithms, SCAFFOLD emerges as the most robust, achieving the narrowest confidence intervals and performances most closely aligned with the centralised ideal. This work establishes the first benchmarks for federated multimodal survival analysis, providing a foundation for future privacy-preserving medical machine learning research.
Multimodal Survival Analysis integra diversi tipi di dati medici, come immagini istopatologiche (whole slide images) e profili genomici, per migliorare l’accuratezza prognostica in ambito sanitario. Tuttavia, le normative sulla privacy impediscono la centralizzazione delle informazioni dei pazienti provenienti da diversi istituti, limitando l’applicabilità degli approcci convenzionali. Federated Learning affronta questa sfida consentendo l’addestramento collaborativo di modelli su dati distribuiti, senza la necessità di condividere i dati grezzi. Questa tesi presenta un’indagine sull’integrazione del Federated Learning con il Multimodal Survival Analysis. Questo studio si basa su un’architettura ’early fusion’ dello stato dell’arte, in cui il processo di addestramento è stato adattato per supportare ambienti federati orizzontali implementando tre algoritmi federati: FedAvg, FedProx e SCAFFOLD. I risultati dimostrano che l’apprendimento federato supera costantemente l’addestramento isolato su tutti i dataset e, in contesti bilanciati, eguaglia o supera le performance dell’approccio centralizzato. Tra gli algoritmi federati, SCAFFOLD emerge come il più robusto, raggiungendo intervalli di confidenza più stretti e prestazioni più allineate con l’ideale centralizzato. Questo lavoro stabilisce i primi benchmark per l’analisi della sopravvivenza multimodale in ambito federato, fornendo le basi per la futura ricerca nel machine learning medico che rispetti la privacy dei dati.
Exploring multimodality in federated survival analysis
PERINI, SOFIA
2024/2025
Abstract
Multimodal survival analysis integrates diverse medical data, such as whole slide images and genomic profiles, to improve prognostic accuracy in healthcare. However, data privacy regulations prevent the centralisation of patient data across institutions, limiting the applicability of conventional approaches. Federated learning addresses this challenge by enabling collaborative model training across distributed data silos without sharing raw data. This thesis presents an investigation into the integration of federated learning with multimodal survival analysis. This work builds upon a state-of-the-art early fusion architecture, adapting the training process for horizontally federated environments and implementing three federated algorithms: FedAvg, FedProx, and SCAFFOLD. Results demonstrate that federated learning consistently outperforms isolated training across all datasets and approaches or exceeds centralised performance in balanced settings. Among federated algorithms, SCAFFOLD emerges as the most robust, achieving the narrowest confidence intervals and performances most closely aligned with the centralised ideal. This work establishes the first benchmarks for federated multimodal survival analysis, providing a foundation for future privacy-preserving medical machine learning research.| File | Dimensione | Formato | |
|---|---|---|---|
|
2026_3_Perini_Executive-Summary.pdf
accessibile in internet per tutti
Dimensione
1.55 MB
Formato
Adobe PDF
|
1.55 MB | Adobe PDF | Visualizza/Apri |
|
2026_3_Perini_Thesis.pdf
accessibile in internet per tutti
Dimensione
3.05 MB
Formato
Adobe PDF
|
3.05 MB | Adobe PDF | Visualizza/Apri |
I documenti in POLITesi sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/10589/253585