Numerical Methods for Geometric Vision: From Minimal to Large Scale Problems

University dissertation from Centre for Mathematical Sciences, Lund University Faculty of Engineering

Abstract: This thesis presents a number of results and algorithms for the numerical solution of problems in geometric computer vision. In geometric computer vision one tries to extract geometric information about the world and the observer from a sequence of images. Estimation of scene structure and camera motion using only image data has been one of the central themes of research in photogrammetry, geodesy and computer vision. It has important applications for robotics, autonomous vehicles, cartography, architecture, the movie industry, photography etc. Images inherently provide ambiguous and uncertain data about the world. Hence, geometric computer vision turns out to be as much about statistics as about geometry. Basically we consider two types of problems: Minimal problems where the number of constraints exactly matches the number of unknowns and large scale problems which need to be addressed using efficient optimization algorithms. Solvers for minimal problems are used heavily during preprocessing to eliminate outliers in uncertain data. Such problems are usually solved by finding the zeros of a system of polynomial equations. The numerical solution of general systems of polynomial equations is largely an open problem. In this thesis we present several new results on techniques for solving systems of polynomial equations in computer vision. gb techniques for equation solving have been applied successfully to several geometric computer vision problems. However, in many cases these methods are plagued by numerical problems. In this work we show that an extension of the current state of the art is possible by relaxing many of the restrictions imposed by requiring properly defined gbs and monomial orderings. This unlocks substantial freedom which can be used to derive new, numerically more stable algorithms. These new techniques are studied on some of the latest reported uses of gb methods in computer vision and we demonstrate dramatically improved numerical stability. Finally, the new techniques are applied to a set of previously unsolved computer vision problems and we show that efficient and stable algorithms for numerical solution of these problems can now be given. Bundle adjustment is a key component of almost any feature based 3D reconstruction system, used to compute accurate estimates of calibration parameters and structure and motion configurations. These problems tend to be very large, often involving thousands of variables. Thus, efficient optimization methods are crucial. The traditional Levenberg-Marquardt algorithm with a direct sparse solver can be efficiently adapted to the special structure of the problem and works well for small to medium size setups. However, for larger scale configurations the cubic computational complexity makes this approach prohibitively expensive. An alternative to this approach is to apply the conjugate gradients algorithm in the inner loop. This is appealing since the main computational step of the CG algorithm involves only a simple matrix-vector multiplication with the Jacobian. In this work we improve on the latest published approaches to bundle adjustment with conjugate gradients by making full use of the least squares nature of the problem and adressing the difficult question of preconditioning.

  This dissertation MIGHT be available in PDF-format. Check this page to see if it is available for download.