Non-planar object detection and identification by features matching and triangulation growth

Object detection and identification is surely a fundamental topic in the computer vision field; it plays a crucial role in many applications such as object tracking, industrial robots control, image retrieval, etc. We propose a feature-based approach for detecting and identifying distorted occurrences of a given template in a scene image by incremental grouping of feature matches between the image and the template. For this purpose, we consider the Delaunay triangulation of template features as an useful tool through which to be guided in this iterative approach. The triangulation is treated as a graph and, starting from a single triangle, neighboring nodes are considered and the corresponding features are identified; then matches related to them are evaluated to determine if they are worthy to be grouped. This evaluation is based on local consistency criteria derived from geometric and photometric properties of local features. Our solution allows the identification of the object in situations where geometric hypothesis does not hold, unlike the homography-based methods that are used when the template is planar and its occurrences in the scene are not distorted. We show that our approach performs just as well or better than application of homography-based RANSAC in scenarios in which distortion is nearly absent, while when the deformation becomes relevant our method shows better description performance.

La localizzazione e l'identificazione degli oggetti sono sicuramente argomenti fondamentali nell'ambito della visione artificiale; essi svolgono un ruolo cruciale in molte applicazioni come il tracciamento degli oggetti, controllo di robot industriali, recupero di immagini, ecc. Noi proponiamo un approccio basato sulle features per rilevare ed identificare occorrenze distorte di un dato template in un'immagine di uno scenario tramite il raggruppamento incrementale delle corrispondenze delle features tra l'immagine ed il template. Per questo scopo, consideriamo la triangolazione di Delaunay delle features del template come strumento utile attraverso cui essere guidati in questo approccio iterativo. La triangolazione è trattata come un grafo e, partendo da un singolo triangolo, i nodi vicini sono presi in considerazione e le corrispondenti features sono identificate; quindi vengono valutate le corrispondenze ad esse correlate per determinare se sono degne di essere raggruppate. Questa valutazione è basata su criteri di consistenza locali derivati da proprietà geometriche e fotometriche delle feature locali. La nostra soluzione consente l'identificazione dell'oggetto in situazioni dove ipotesi geometriche non sussistono, a differenza dei metodi basati sull'omografia che vengono utilizzati quando il template è planare e le sue occorrenze nella scena non sono distorte. Dimostriamo che il nostro approccio funziona altrettanto bene o meglio dell'applicazione di RANSAC basato sull'omografia in scenari in cui la distorsione è quasi assente, mentre quando la deformazione diventa rilevante il nostro metodo mostra una migliore capacità adattiva.