A GPU parallel approach improving the density of patch based multi-view stereo reconstruction

Haines, Benjamin A. (2016) A GPU parallel approach improving the density of patch based multi-view stereo reconstruction. PhD thesis, University of Nottingham.

[img] PDF (Thesis - as examined) - Repository staff only - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (78MB)


Multi-view stereo is the process of recreating three-dimensional data from a set of two or more images of a scene. The ability to acquire 3D data from 2D images is a core concept in computer vision with wide-ranging applications throughout areas such as 3D printing, robotics, recognition, navigation and a vast number of other fields. While 3D reconstruction has been increasingly well studied over the past decades, it is only with the recent evolution of CPU and GPU technologies that practical implementations, able to accurately, robustly and efficiently capture 3D data of photographed objects have begun to emerge. Whilst current research has been shown to perform well under specific circumstances and for a subset of objects, there are still many practical and implementary issues that remain an open problem for these techniques. Most notably, the ability to robustly reconstruct objects from sparse image sets or objects with low texture.

Alongside a review of algorithms within the multi-view field, the work proposed in this thesis outlines a massively parallel patch based multi-view stereo pipeline for static scene recovery. By utilising advances in GPU technology, a particle swarm algorithm implemented on the GPU forms the basis for improving the density of patch-based methods. The novelty of such an approach removes the reliance on feature matching and gradient descent to better account for the optimisation of patches within textureless regions, for which current methods struggle. An enhancement to the photo-consistency matching metric, which is used to evaluate the optimisation of each patch, is then defined. Specifically targeting the shortcomings of the photo-consistency metric when used inside a particle swarm optimisation, increasing its effectiveness over textureless areas. Finally, a multi-resolution reconstruction system based on a wavelet framework is presented to further improve upon the robustness of reconstruction over low textured regions.

Item Type: Thesis (University of Nottingham only) (PhD)
Supervisors: Bai, Li
Subjects: Q Science > QA Mathematics > QA 75 Electronic computers. Computer science
T Technology > TA Engineering (General). Civil engineering (General) > TA1501 Applied optics. Phonics
Faculties/Schools: UK Campuses > Faculty of Science > School of Computer Science
Item ID: 32746
Depositing User: Haines, Benjamin
Date Deposited: 20 Jul 2016 13:52
Last Modified: 17 Oct 2017 09:42
URI: https://eprints.nottingham.ac.uk/id/eprint/32746

Actions (Archive Staff Only)

Edit View Edit View