Inference without tracking

We consider the problem of inferring the diffusivities of a set of 2-dimensional Brownian random walkers, using discrete-time snapshots of their positions. Even though this is a rather general mathematical problem, the aim of this work is to find solutions applicable to random walks of biomolecules. In particular our aim is to tackle 3 levels of cumulative difficulty: 1. The random walkers are not labeled, which makes the particle matching between two consecutive snapshots uncertain, especially if the particles are very dense, very diffusive or if the snapshots are a long time away from each other. 2. Point 1 and the random walkers can disappear and appear randomly: they are blinking. This makes the matching between consecutive snapshots even more uncertain. 3. Points 1 and 2 and a diffusivity that depends on the location: We need to infer maps of diffusivities. We describe different solution methods to those three difficulties and evaluate their performance on simulations. One such method consists in using the Most Probable Assignment (MPA) to match particles. MPA leads to severe biases when the density is high and to a high variance when the blinking rate is high. Another algorithm we study is Belief Propagation (BP) that matches particles in a probabilistic manner and that performs much better than MPA in terms of bias and variance. We study three different methods of tackling blinking: A soft constraint in the matching cost function analogous to a chemical potential in statistical mechanics and two fully Bayesian formulations. Simulation studies show that the fully Bayesian methods work very well up to rather high densities. We expose how we could adapt our algorithms to inferring maps of diffusivities and test them on simulations. Finally we apply the studied methods to inferring diffusivity maps of the Rac1 protein, in order to show the applicability of our methods to real data.

Consideriamo il problema di inferire le diffusività di un insieme di passeggiate aleatorie browniane 2-dimensionali, usando le loro posizioni a intervalli discreti. Benché questo sia un problema matematico abbastanza generale, lo scopo di questo lavoro è di trovare soluzioni applicabili alle passeggiate aleatorie di biomolecole. In particolare studiamo 3 livelli di difficoltà cumulativa: 1. Le particole sono indistinguibile, ciò che rende difficile l'identificazione delle particole tra due immagini successivi. Questa difficoltà aumenta quando le particole sono molto densi, quando la diffusività è molto alta oppure quando l'intervallo di tempo tra due immagine successivi è molto grande. 2. Punto 1 e le particole possono apparire e sparire in modo casuale: lampeggiano (blinking). Questo rende ancora più difficile l'identificazione delle particole tra due immagini successivi. 3. Punto 1 e 2 e la diffusività varia spazialmente. Dobbiamo inferire mappe di diffusività. Descriviamo metodi di soluzione a queste tre difficoltà e valutiamo la loro accuratezza con simulazioni. Uno di questi metodi (MPA) consiste nella scelta dell'identificazione la più verosimile. MPA conduce a errori sistematici nell'estimazione della diffusività quando la densità è alta, e ad una varianza alta quando il tasso di blinking è alto. Un altro algoritmo che studiamo è il Belief Propagation (BP) che identifica le particole in modo probabilistico. In termini di accuratezza e varianza dell'estimazione, BP è molto migliore che MPA. Studiamo tre metodi differenti per affrontare il blinking: Una costrizione molla nella funzione costo del identificazione, analoga a un potenziale chimico in meccanica statistica e due formulazioni completamente Bayesiani. Le simulazione mostrano che i metodi Bayesiani funzionano molto bene fino a densità abbastanza elevate. Mostriamo come possiamo adattare i nostri algoritmi al inferenza di mappe di diffusività e le proviamo su simulazioni. Finalmente applichiamo i nostri metodi al inferenza di una mappa di diffusività della proteina Rac1, in modo da mostrare l'applicabilità dei nostri metodi a dati reali.