Comparison of 2D and 3D convolutional neural networks for medical imaging semantic segmentation

Biblioteche e Archivi
POLITesi - Archivio digitale delle tesi di laurea e di dottorato

In the last decade Deep Learning has revolutionized Computer Vision with Convolutional Neural Networks (CNN), a class of algorithms that achieve state-of-the-art results in many tasks. In the medical field imaging techniques such as MRI and CT make available 3D images that need to be analysed to find therein elements of interest and semantic segmentation is a common image processing technique to address this kind of analysis. With this type of data we can choose to work either on slices or volumes which leads to the choice of 2D CNNs vs. 3D CNNs; yet no research has been done to compare the performance of the two approaches. This thesis investigates whether 3D CNNs are better than 2D CNNs at the job and, if so, under what conditions. To answer these questions we employ an architecture which is a slight modification of UNet and compare dice scores of the 2D network and its 3D counterpart on two different datasets.

Nell'ultima decade il Deep Learning ha rivoluzionato il campo della Computer Vision con le Reti Neurali Convoluzionali (CNN), una classe di algoritmi le cui prestazioni raggiungo lo stato dell'arte in molti ambiti. Nel campo medico tecniche di imaging come MRI e CT rendono disponibili immagini 3D che devono essere analizzate per trovare in esse elementi di interesse e la segmentazione semantica è una tecnica comune di image processing per affrontare questo tipo di analisi. Con questo tipo di dati si può scegliere se lavorare su fette o volumi, il che conduce alla scelta tra CNN 2D e CNN 3D, eppure nessuna ricerca è stata fatta per confrontare le prestazioni dei due approcci. Lo scopo di questa tesi è indagare se le CNN 3D sono meglio di quelle 2D e, se così fosse, in quali circostanze. Per rispondere a queste domande impieghiamo un'architettura che è una leggera modfica di UNet e confrontiamo i dice scores della rete 2D e della sua controparte 3D su due diversi dataset.

Comparison of 2D and 3D convolutional neural networks for medical imaging semantic segmentation

SARTORI, PIERANDREA

2020/2021

Abstract

In the last decade Deep Learning has revolutionized Computer Vision with Convolutional Neural Networks (CNN), a class of algorithms that achieve state-of-the-art results in many tasks. In the medical field imaging techniques such as MRI and CT make available 3D images that need to be analysed to find therein elements of interest and semantic segmentation is a common image processing technique to address this kind of analysis. With this type of data we can choose to work either on slices or volumes which leads to the choice of 2D CNNs vs. 3D CNNs; yet no research has been done to compare the performance of the two approaches. This thesis investigates whether 3D CNNs are better than 2D CNNs at the job and, if so, under what conditions. To answer these questions we employ an architecture which is a slight modification of UNet and compare dice scores of the 2D network and its 3D counterpart on two different datasets.

Scheda breve

Scheda completa

	Relatore
	
				LOIACONO, DANIELE
			
	Correlatore/i
	
				GIACOMELLO, EDOARDO
			
	Scuola / Dip.
	
				ING  - Scuola di Ingegneria Industriale e dell'Informazione
			
	Data
	
				21-dic-2021
			
	Anno accademico
	
				2020/2021
			
	Abstract in italiano
	
				Nell'ultima decade il Deep Learning ha rivoluzionato il campo della Computer Vision con le Reti Neurali Convoluzionali (CNN), una classe di algoritmi le cui prestazioni raggiungo lo stato dell'arte in molti ambiti. Nel campo medico tecniche di imaging come MRI e CT rendono disponibili immagini 3D che devono essere analizzate per trovare in esse elementi di interesse e la segmentazione semantica è una tecnica comune di image processing per affrontare questo tipo di analisi. Con questo tipo di dati si può scegliere se lavorare su fette o volumi, il che conduce alla scelta tra CNN 2D e CNN 3D, eppure nessuna ricerca è stata fatta per confrontare le prestazioni dei due approcci. Lo scopo di questa tesi è indagare se le CNN 3D sono meglio di quelle 2D e, se così fosse, in quali circostanze. Per rispondere a queste domande impieghiamo un'architettura che è una leggera modfica di UNet e confrontiamo i dice scores della rete 2D e della sua controparte 3D su due diversi dataset.
			
	Appare nelle tipologie:
	
				Tesi di laurea Magistrale

File allegati

File	Dimensione	Formato
tesi.pdf accessibile in internet per tutti Dimensione 4.71 MB Formato Adobe PDF Visualizza/Apri	4.71 MB	Adobe PDF	Visualizza/Apri

I documenti in POLITesi sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10589/183846