Flexible model-based detection and localization of a 3D rotational symmetric object

Recently, the model-based detection and localization become a popular topic as their application for selecting an object has an important role in the industrial robotic vision field. However, it is a challenging task to make the robot can recognize the pose of the object and pick the best selection object because of its 3D rotational symmetric shape. Therefore, this thesis presents an Outer Line Mode algorithm for the template model generation. The proposed algorithm is used to produce an outer line (border) of a 3D flexible template model by scanning the evaluated line both in vertical and horizontal direction and select the minimum and the maximum point in each evaluated line. A Directional Chamfer Matching (DCM) algorithm computes an integral distance transform of the captured image to find the lowest cost of the distance and orientation mismatch between the chamfer and the captured image with the aim to recognize the object pose. In order to optimize the recognition of the object pose, the least squares projection error is minimized using a Levenberg-Marquardt algorithm together with creating a threshold based on the median value of DCM cost to encounter the missing edge case. Finally, a Gradient and Orientation Score algorithm will determine the best match selection object to be manipulated based on a computation of the local gradient and orientation of the image. The result shows that the average error between the real object pose and the pose found by the computational algorithms with respect to the world frame is 0.46 mm for the translation and 2.14° for the rotation, respectively. While, the highest score of the best match selection between the template model and the captured image reaches 0.67 out of 1. Overall, the proposed algorithm enables a flexible template model to detect and localize any pose of the rotational symmetric object.

Recentemente, il rilevamento e la localizzazione basati su modelli sono diventati un argomento molto diffuso, in quanto la loro applicazione per la selezione di un oggetto ha assunto un ruolo importante nel campo della visione della robotica industriale. Tuttavia, rendere il robot in grado di riconoscere la posa dell'oggetto e di scegliere l'oggetto migliore da selezionare è un compito impegnativo a causa della forma simmetrica rotazionale 3D dell’oggetto. Pertanto, in questa tesi verrá presentato un algoritmo del tipo ‘Modalitá Linea Esterna’ per la generazione del modello. L'algoritmo proposto viene utilizzato per produrre il contorno di un modello flessibile 3D mediante la scansione verticale e orizzontale della linea considerata e permette di selezionare il punto minimo e massimo in ciascuna linea. Un algoritmo Directional Chamfer Matching (DCM) calcola una trasformata integral distance dell’immagine catturata per trovare il costo minimo della distanza e del disallineamento dell'orientamento tra il modello dell’immagine e l’immagine catturata con lo scopo di riconoscere la posa dell'oggetto. Per ottimizzare il riconoscimento della posa dell'oggetto, l'errore di proiezione dei minimi quadrati viene minimizzato utilizzando un algoritmo Levenberg-Marquardt insieme alla creazione di una soglia basata sul valore mediano del costo DCM per trovare il segmento mancante del modello. Infine, un algoritmo Gradient and Orientation Score, basato sul calcolo del gradiente locale e dell’orientamento dell’immagine, determinerà l'oggetto migliore da selezionare. Il risultato mostra che l'errore medio tra la posa reale dell’oggetto e la posa ottenuta dagli algoritmi computazionali rispetto alla tema mondo è, rispettivamente, di 0,46 mm per la traslazione e di 2,14° per la rotazione. Invece, il più alto punteggio relativo al miglior match tra il modello dell’immagine e l’immagine catturata é di 0.67 su 1. Nel complesso, l'algoritmo proposto consente a un modello flessibile di rilevare e localizzare qualsiasi posizione dell'oggetto simmetrico rotazionale.