Classification of low and high-credibility information on Twitter based on time-stamped diffusion networks

Biblioteche e Archivi
POLITesi - Archivio digitale delle tesi di laurea e di dottorato

In modern societies, the spread of false news jeopardizes the right and need of citizens to receive correct and truthful information. In this regard, social networks are becoming more and more vectors designed to spread news similarly to television and newspapers: it is therefore essential to understand quickly and effectively whether a news is true (mainstream) or false ( disinformation) as soon as it starts circulating on these platforms. In this work we will deal with analyzing how news spreads on Twitter. In particular, our goal is to identify a set of possible features (i.e, quantitative measures) that allow us to discriminate between the way in which true and false news spread within the social network over time. Once these features have been chosen and computed, we will show and compare their evolution over time in the case of true and false news. We will then use these features to train different classifiers with the aim of observing how long after the beginning of the diffusion of a news it is possible to state with a certain level of certainty whether it is true or false.

Nella società moderna la diffusione di notizie false mette a rischio il diritto e la necessità dei cittadini di ricevere informazioni corrette e veritiere. A questo proposito i social network stanno diventando sempre di più dei veri e propri vettori di diffusione delle notizie al pari dei telegiornali e delle testate giornalistiche: risulta quindi fondamentale comprendere in maniera veloce ed efficace se una notizia risulta essere vera (mainstream) oppure falsa (disinformation) non appena inizia a circolare su queste piattaforme. In questo lavoro analizzeremo la diffusione di notizie su Twitter. In particolare il nostro obiettivo è quello di individuare un insieme di possibili features, cioè indicatori quantitativi, che ci consentano di effettuare una discriminazione tra il modo in cui le notizie vere e false si diffondono all'interno del social network con il passare del tempo. Una volta scelte e calcolate queste features, mostreremo e confronteremo la loro evoluzione nel tempo nel caso di notizie vere e nel caso di notizie false. Utilizzeremo poi tali features per addestrare differenti classificatori con lo scopo di osservare dopo quanto tempo dall'inizio della diffusione di una notizia è possibile affermare con un certo livello di certezza se essa è vera oppure è falsa.

Classification of low and high-credibility information on Twitter based on time-stamped diffusion networks

Bertolasi, William

2019/2020

Abstract

In modern societies, the spread of false news jeopardizes the right and need of citizens to receive correct and truthful information. In this regard, social networks are becoming more and more vectors designed to spread news similarly to television and newspapers: it is therefore essential to understand quickly and effectively whether a news is true (mainstream) or false ( disinformation) as soon as it starts circulating on these platforms. In this work we will deal with analyzing how news spreads on Twitter. In particular, our goal is to identify a set of possible features (i.e, quantitative measures) that allow us to discriminate between the way in which true and false news spread within the social network over time. Once these features have been chosen and computed, we will show and compare their evolution over time in the case of true and false news. We will then use these features to train different classifiers with the aim of observing how long after the beginning of the diffusion of a news it is possible to state with a certain level of certainty whether it is true or false.

Scheda breve

Scheda completa

	Relatore
	
				PICCARDI, CARLO
			
	Correlatore/i
	
				PIERRI, FRANCESCO
			
	Scuola / Dip.
	
				ING  - Scuola di Ingegneria Industriale e dell'Informazione
			
	Data
	
				28-apr-2021
			
	Anno accademico
	
				2019/2020
			
	Abstract in italiano
	
				Nella società moderna la diffusione di notizie false mette a rischio il diritto e la necessità dei cittadini di ricevere informazioni corrette e veritiere. A questo proposito i social network stanno diventando sempre di più dei veri e propri vettori di diffusione delle notizie al pari dei telegiornali e delle testate giornalistiche: risulta quindi fondamentale comprendere in maniera veloce ed efficace se una notizia risulta essere vera (mainstream) oppure falsa (disinformation) non appena inizia a circolare su queste piattaforme. In questo lavoro analizzeremo la diffusione di notizie su Twitter. In particolare il nostro obiettivo è quello di individuare un insieme di possibili features, cioè indicatori quantitativi, che ci consentano di effettuare una discriminazione tra il modo in cui le notizie vere e false si diffondono all'interno del social network con il passare del tempo. Una volta scelte e calcolate queste features, mostreremo e confronteremo la loro evoluzione nel tempo nel caso di notizie vere e nel caso di notizie false. Utilizzeremo poi tali features per addestrare differenti classificatori con lo scopo di osservare dopo quanto tempo dall'inizio della diffusione di una notizia è possibile affermare con un certo livello di certezza se essa è vera oppure è falsa.
			
	Appare nelle tipologie:
	
				Tesi di laurea Magistrale

File allegati

File	Dimensione	Formato
2021_04_Bertolasi.pdf accessibile in internet per tutti Descrizione: Testo della tesi Dimensione 4.76 MB Formato Adobe PDF Visualizza/Apri	4.76 MB	Adobe PDF	Visualizza/Apri

I documenti in POLITesi sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10589/173190