Building models of accidents and failures in energy systems by Natural Language Processing

During the lifetime of energy systems, accidents and failures threaten the system safety, availability and productivity. Learning from past accidents and failures, by identifying the factors that influence their occurrence and severity, constitutes one of the basis of effective decision-making in asset management. Textual reports of accidents and failures, compiled by system operators to describe the events, their causes, their consequences, their severities, and, in case of maintenance, the performed inspection, repair and replacement activities, are typically recorded and stored in repositories. Traditionally, this valuable source of information is not systematically exploited for asset management of energy systems. This is mainly due to: i) the unstructured nature of the textual data, ii) the inhomogeneity of the repositories (e.g. due to the subjectivity of the operators writing the reports), and iii) the large number of reports to consider. In this context, this PhD work develops a methodological framework for systematically exploiting the information contained in textual reports of accidents and failures in energy systems with the aim of supporting decision-making for asset management. We consider the following two fundamental tasks of asset management: 1) Quantitative Risk Assessment (QRA), which aims at estimating the level of risk and is typically performed during the design phase, and 2) maintenance, which is performed during operation and aims at reducing the number of failures and increasing the availability and productivity of the system. The developed framework consists of the three sequential steps of: i) transforming the textual reports into numerical vectors by means of Natural Language Processing (NLP) techniques, ii) identifying the factors influencing accident occurrence and severity by relying on machine learning and expert-based taxonomies, and iii) modelling the events by combining machine learning techniques of classification and clustering with logic and causal models of the accidents and failures. Specifically, NLP models, such as Term Frequency Inverse Document Frequency (TFIDF), topic modelling based on Hierarchical Dirichlet Processes (HDP), Word2Vec and Bidirectional Encoder Representation vi from Transformers (BERT), have been combined with machine learning techniques, such as Neural Networks (NNs), Random Forest (RF) classifier and Density-Based Spatial Clustering of Applications with Noise (DBSCAN) clustering. Then, Bayesian Networks (BNs) are used for system logic modelling in support to QRA and maintenance. The effectiveness of the proposed framework is showed considering three real repositories of textual data: a) road accident reports provided by US National Highway Traffic Safety Administration, b) accident reports of process safety events occurring in hydrocarbon production plants provided by an oil and gas company, and c) maintenance reports of freight transport trains provided by a railway company. The PhD work demonstrates the feasibility of systematically exploiting the information content of textual reports of accidents and failures in energy systems. Indeed, the proposed framework allows effectively supporting decision-making at the design phase by QRA and during operation by maintenance analysis.

Durante il ciclo di vita dei sistemi energetici, gli incidenti e i guasti minacciano la sicurezza, la disponibilità e la produttività del sistema. Imparare dagli incidenti e dai guasti passati, identificando i fattori che ne influenzano l'occorrenza e la gravità, costituisce una delle basi per un decision-making efficace nella gestione degli asset. I rapporti testuali degli incidenti e dei guasti, compilati dagli operatori di sistema per descrivere gli eventi, le loro cause, le loro conseguenze, le loro gravità e, in caso di manutenzione, le attività di ispezione, riparazione e sostituzione svolte, vengono tipicamente registrati e archiviati in depositi dati. Tradizionalmente, questa preziosa fonte di informazioni non viene sfruttata sistematicamente per la gestione degli asset dei sistemi energetici. Questo è principalmente dovuto a: i) la natura non strutturata dei dati testuali, ii) l'eterogeneità dei depositi dati (ad esempio a causa della soggettività degli operatori che compilano i rapporti), e iii) il gran numero di rapporti da considerare. In questo contesto, questo lavoro di dottorato sviluppa un metodo per sfruttare sistematicamente le informazioni contenute nei rapporti testuali di incidenti e guasti nei sistemi energetici al fine di sostenere la presa di decisioni per la gestione degli asset. Sono considerate le seguenti due attività fondamentali della gestione degli asset: 1) Valutazione Quantitativa del Rischio (QRA), che mira a stimare il livello di rischio e viene tipicamente eseguita durante la fase di progettazione, e 2) manutenzione, che viene eseguita durante l'operazione e mira a ridurre il numero di guasti e ad aumentare la disponibilità e la produttività del sistema. Il metodo sviluppato consiste in tre fasi sequenziali: i) la trasformazione dei resoconti testuali in vettori numerici mediante tecniche di elaborazione del linguaggio naturale (NLP), ii) l'identificazione dei fattori che influenzano l'occorrenza e la gravità degli incidenti, basandosi su tassonomie basate su machine learning ed esperti, e iii) la modellazione degli eventi combinando tecniche di machine learning di classificazione e clustering con modelli logici e causali degli incidenti e dei guasti. In particolare, tecniche di NLP come Term Frequency Inverse Document Frequency (TFIDF), modellizzazione dei topic basata su Hierarchical Dirichlet Processes (HDP), ix Word2Vec e Bidirectional Encoder Representation from Transformers (BERT), sono stati combinati con tecniche di machine learning come le reti neurali (NN), il classificatore Random Forest (RF) e il clustering Density-Based Spatial Clustering of Applications with Noise (DBSCAN). Successivamente, le reti bayesiane (BN) vengono utilizzate per la modellazione logica del sistema a supporto di QRA e manutenzione. L'efficacia del metodo proposto è dimostrata considerando tre depositi reali di dati testuali: a) rapporti sugli incidenti stradali forniti dall'Amministrazione Nazionale per la Sicurezza del Traffico Stradale degli Stati Uniti, b) rapporti sugli incidenti di sicurezza dei processi verificatisi nelle piante di produzione di idrocarburi forniti da un'azienda petrolifera e del gas, e c) rapporti sulla manutenzione dei treni merci forniti da una compagnia ferroviaria. Il lavoro di dottorato dimostra la fattibilità dello sfruttamento sistematico del contenuto informativo dei rapporti testuali sugli incidenti e i guasti nei sistemi energetici. Infatti, il metodo consente di sostenere efficacemente il decision-making nella fase di progettazione tramite la QRA e durante l'operazione attraverso l'analisi della manutenzione.