Department of Geometry and Topology, Universidad Complutense de Madrid
Image Processing Group (GTI), Universidad Politécnica de Madrid

3D Reconstruction with the Minimum Number of
Square-pixel Uncalibrated Cameras

José I. Ronda · Antonio Valdés · Guillermo Gallego

Abstract. We address the problem of the Euclidean upgrading of a projective calibration of a minimal set of cameras with known pixel shape and otherwise arbitrarily varying intrinsic and extrinsic parameters. To this end, we introduce as our basic geometric tool the six-line conic variety (SLCV), defined as the set of planes that intersect six given lines of 3D space in points lying on a conic. We show that the set of solutions of the Euclidean upgrading problem for three cameras with known pixel shape can be parameterized by means of an easily computable one-to-two mapping. As a consequence, we propose an algorithm that reduces the number of cameras required for Euclidean upgrading, with the pixel shape as the only constraint, to the theoretical minimum of 5. We provide experiments with real images showing the good performance of the technique.

Summary

Given a set of images of a static scene, we address the problem of recovering the camera parameters solely from image measurements (without any knowledge about the scene). In computer vision, this problem is known as the autocalibration or self-calibration of the cameras. More specifically, this problem arises when a projective reconstruction of the scene is available (i.e., a reconstruction that differs from the true acquired scene by a projective transformation) and we wish to obtain a Euclidean reconstruction (i.e., one that differs from the true scene by a similarity transformation, which consists of a rigid body motion and a scaling). Geometrically, it is well known that solving the problem is equivalent to estimating the absolute conic, which lies in the plane at infinity.

To solve the autocalibration problem, some data about the cameras must be available. We address the problem of autocalibration in its least restrictive practical setting: cameras with arbitrarily varying parameters with the exception of the pixel shape, which is assumed to be known. This is equivalent to having cameras with square pixels, since a known pixel shape can be normalized by an affine change of image coordinates. In the past, algorithms based on this restriction have been proposed that result in a set of linear equations, but with the drawback of requiring 10 or more cameras. These algorithms are inspired by the geometric observation that, through the optical center of each square-pixel camera, two lines can be identified in the projective reconstruction that must intersect the absolute conic: the isotropic lines of the camera, i.e., the back-projections of the circular points of its image plane. The absolute quadratic complex (AQC) encodes the set of lines intersecting this conic.

Illustration of the incidence relations between the isotropic lines
of three cameras, the plane at infinity and the absolute conic.
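
The incidence between an isotropic line and the absolute conic can be checked numerically. The following Python sketch is only an illustration under stated assumptions: it works in a Euclidean frame, where the absolute conic consists of the points at infinity (d, 0) with d^T d = 0, and it models a camera as P = K [R | -R C]. The function names and the numeric values of K and R are hypothetical and are not taken from the paper. The code back-projects the image circular point (1, i, 0) and evaluates d^T d for a square-pixel camera and for a camera with a different aspect ratio.

```python
import numpy as np

def rotation(axis, angle):
    """Rotation matrix about the given axis (Rodrigues' formula)."""
    axis = np.asarray(axis, dtype=float)
    axis = axis / np.linalg.norm(axis)
    kx = np.array([[0.0, -axis[2], axis[1]],
                   [axis[2], 0.0, -axis[0]],
                   [-axis[1], axis[0], 0.0]])
    return np.eye(3) + np.sin(angle) * kx + (1.0 - np.cos(angle)) * kx @ kx

def isotropic_residual(K, R):
    """|d^T d| for the unit-norm direction d of the ray through (1, i, 0)."""
    x = np.array([1.0, 1.0j, 0.0])       # circular point of the image plane
    d = R.T @ np.linalg.solve(K, x)      # direction of the back-projected ray
    d = d / np.linalg.norm(d)            # remove the arbitrary scale
    return abs(d @ d)                    # bilinear form, no complex conjugation

R = rotation([1.0, 2.0, 3.0], 0.7)       # arbitrary camera orientation
K_square = np.array([[800.0, 0.0, 640.0],
                     [0.0, 800.0, 480.0],
                     [0.0, 0.0, 1.0]])   # zero skew, unit aspect ratio
K_other = np.array([[800.0, 0.0, 640.0],
                    [0.0, 1000.0, 480.0],
                    [0.0, 0.0, 1.0]])    # non-square pixels

res_square = isotropic_residual(K_square, R)  # vanishes up to rounding
res_other = isotropic_residual(K_other, R)    # clearly nonzero
print(res_square, res_other)
```

The residual does not depend on the principal point or on the orientation R, which is consistent with the pixel shape being the only constraint used.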

However, an informal parameter count reveals that far fewer cameras are theoretically sufficient. The autocalibration problem has 8 unknowns, which correspond to the degrees of freedom (dof) needed to determine the plane at infinity (3 dof) and the absolute conic within it (5 dof). Knowing camera skew and aspect ratio amounts to two equations per camera and thus at least 4 cameras should be given in order to solve the problem. Given the non-linear nature of these equations, multiple solutions can be expected and so 5 cameras should be the minimum required to obtain, generically, a unique solution.

The main aim of this work is to obtain a Euclidean reconstruction from the minimum number of cameras using exclusively the pixel shape restriction. The geometric object employed for this purpose is the variety of conics intersecting six given spatial lines simultaneously, which will be termed the six-line conic variety (SLCV). In this paper we are interested in the SLCV given by the absolute conic at infinity.

The SLCV for six lines in generic position can be identified with a surface of degree 8 in P^3* (i.e., the projective space of the planes of 3D space). We prove that this degree reduces to 5 in the case of the three pairs of isotropic lines of three square-pixel cameras. We show that the fifth-degree SLCV has three singularities of multiplicity three, given by the three principal planes of the cameras. This result is used to generate a two-dimensional parameterization of the candidate planes at infinity compatible with three square-pixel cameras. This parameterization, together with the additional data given by two or more further square-pixel cameras, makes it possible to identify the true plane at infinity through a two-dimensional optimization process. However, the technique could also use other additional data, such as scene constraints (e.g., vanishing points detected in the images).
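
The defining condition of the SLCV (a plane meets the six lines in six points of a conic) can be tested numerically. The Python sketch below is a toy illustration and not the authors' algorithm: the helper names `meet` and `conic_residual` are hypothetical, the six lines are real lines built by hand so that the answer is known in advance, and they are neither in generic position nor the isotropic lines of actual cameras. Each line is given by two homogeneous points; the plane is a 4-vector. Six coplanar points lie on a conic exactly when the 6x6 matrix of their quadratic monomials is singular, so the smallest singular value is used as a residual.

```python
import numpy as np

def meet(plane, a, b):
    """Intersection of a plane with the line through homogeneous points a, b."""
    return (plane @ b) * a - (plane @ a) * b

def conic_residual(plane, lines):
    """Smallest singular value of the conic design matrix of the six
    intersection points; it vanishes when the plane belongs to the SLCV."""
    # Rows 1..3 of V^T span the orthogonal complement of the plane vector,
    # i.e. they give homogeneous 2D coordinates for points of the plane.
    basis = np.linalg.svd(plane.reshape(1, 4))[2][1:]
    rows = []
    for a, b in lines:
        x, y, z = basis @ meet(plane, a, b)
        row = np.array([x * x, x * y, y * y, x * z, y * z, z * z])
        rows.append(row / np.linalg.norm(row))
    return np.linalg.svd(np.array(rows), compute_uv=False)[-1]

# Toy configuration: every line passes through a point of the unit circle
# of the plane z = 0, so that plane belongs to the SLCV by construction.
lines = []
for k, t in enumerate(np.linspace(0.2, 5.5, 6)):
    a = np.array([np.cos(t), np.sin(t), 0.0, 1.0])
    sign = -1.0 if k == 5 else 1.0       # tilt the last line through the axis
    b = np.array([sign * np.cos(t), sign * np.sin(t), 1.0, 1.0])
    lines.append((a, b))

plane_in = np.array([0.0, 0.0, 1.0, 0.0])     # z = 0
plane_out = np.array([0.0, 0.0, 1.0, -0.5])   # z = 0.5

residual_in = conic_residual(plane_in, lines)
residual_out = conic_residual(plane_out, lines)
print(residual_in, residual_out)
```

In the plane z = 0.5, five of the intersection points lie on a circle and the sixth is its center, so no conic passes through all six and the residual stays away from zero.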

Experiments with real images for the autocalibration of scenes with 5 and more cameras with square pixels and otherwise varying parameters are provided, showing the good performance of the proposed technique compared to other autocalibration methods. In the absence of knowledge about the principal point of the cameras, the SLCV algorithm turns out to be the only feasible approach to solve the autocalibration problem in the minimal case of 5 cameras up to the case of 9 cameras. For 10 or more cameras, the results are similar to those of the AQC algorithm.

Experiment with LED bar dataset

The SLCV autocalibration method has been tested on a set of 5 synchronized square-pixel video cameras with a resolution of 1280 x 920 pixels. A rigid bar with three light-emitting diodes (LEDs) was employed in the tests. It provides ground truth to compare the results of the SLCV autocalibration method.

Sample acquired images from the view point of one of the cameras.
Triplets of aligned LEDs in a rigid bar. The triplet structure is used for comparison purposes.

Sampled points of the Six-Line Conic Variety (SLCV).
The dots represent planes in dual space P^3*.

The search for the plane at infinity is carried out by sampling the space of candidate planes and evaluating a cost function. Normally, a candidate plane at infinity is parameterized by 3 degrees of freedom, but the SLCV of three square-pixel cameras provides a two-dimensional parameterization of the candidate planes at infinity. This reduction of the dimensionality of the search space makes the optimization of the cost function more tractable. Here is what the cost function looks like for the LED bar dataset. Observe that the cost function can vary wildly, so a global minimum is not easy to find unless a dense sampling of the parameter space (i.e., the complex plane) is carried out.

Plots of the sampled cost function.
Left: values at the unit disc of the complex plane, |z|≤1. Right:
values at the complement of the unit disc, at positions 1/conj(z).
The white cross in the left plot marks the location of the minimum.
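
The two-panel layout of the plots suggests a simple way to sample the whole complex plane densely: take a grid on the unit disc and reuse it for the complement through the map z -> 1/conj(z). The Python sketch below illustrates only this sampling scheme. The function names are hypothetical, and `toy_cost` is a placeholder with a single minimum; it is not the cost function of the paper, which has many local minima.

```python
import numpy as np

def sample_complex_plane(n=201):
    """Grid samples of the unit disc plus their images under z -> 1/conj(z)."""
    u = np.linspace(-1.0, 1.0, n)
    grid = (u[None, :] + 1j * u[:, None]).ravel()
    inside = grid[np.abs(grid) <= 1.0]
    # z = 0 corresponds to the point at infinity, so it is skipped here.
    outside = 1.0 / np.conj(inside[inside != 0])
    return np.concatenate([inside, outside])

def toy_cost(z, z0=2.0 + 1.0j):
    """Placeholder cost whose only minimum, z0, lies outside the unit disc."""
    return np.abs(z - z0) ** 2

samples = sample_complex_plane()
best = samples[np.argmin(toy_cost(samples))]
print(best)
```

Because the search is exhaustive over the samples, it does not get trapped in local minima; the price is that the accuracy outside the disc degrades as |z| grows, since the inverted grid becomes coarser there.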

Reconstruction of an indoor scene

Experiments with 5 to 10 images of an indoor scene containing three checkerboards were also carried out. The checkerboards provide a means to compare the intrinsic parameters of the cameras with those resulting from the autocalibration method. The images have a resolution of 1280 x 960 pixels and were acquired with a SONY DSC F-828 camera, varying the focal length between two values: 50 and 100 mm (35 mm equivalent).

Sample input images (4 out of 10):

Equivalent focal length is 50 mm.

Equivalent focal length is 100 mm.

Three-dimensional Euclidean reconstructions:

Experiment with 10 images of the Checkerboard dataset: reconstructed 3D scene.

We provide VRML files to navigate through the reconstructed 3D scene. To view the reconstructions, a VRML plug-in is required. Since it can take some time to download the images with the VRML viewer and some browsers might crash in the process, we recommend downloading the scenes and viewing them locally.

Here is a VRML version of the previous reconstruction.

Downloadable RAR file with VRML scene and images.
Ply file (to visualize with Meshlab)

Again, observe that the cost function can have wild variations with many local minima. Therefore, a global minimum is not easy to find unless a dense sampling of the parameter space (i.e. the complex plane) is carried out.

The SLCV method is able to autocalibrate 5 or more cameras with varying parameters solely based on the known pixel shape constraint (e.g., without any knowledge about the principal points of the cameras).

Experiment with 5 images of the Checkerboard dataset: reconstructed 3D scene.
Here is a VRML version of it.
Downloadable RAR file with VRML scene and images.
Ply file (to visualize with Meshlab)

Reconstruction of an outdoor scene

Experiments with real images of an outdoor scene were also conducted. Some images of the Plaza de la Villa in Madrid, with a resolution of 1280 x 960 pixels, were acquired with an OLYMPUS E-620 camera, varying the focal length between 50 and 70 mm (35 mm equivalent).

Sample input images (4 out of 16):

Equivalent focal length is 50 mm.

Equivalent focal length is 70 mm.

Three-dimensional Euclidean reconstructions:

Experiment with 16 images of the Plaza de la Villa dataset: reconstructed 3D scene.
Here is a VRML version of it.
Downloadable RAR file with VRML scene and images.
Ply file (to visualize with Meshlab)

Experiment with 5 images of the Plaza de la Villa dataset: reconstructed 3D scene.
Here is a VRML version of it.
Downloadable RAR file with VRML scene and images.
Ply file (to visualize with Meshlab)

Coefficients of the polynomial H₀(λ, μ) = A₀λ² + B₀λμ + C₀μ²

Contact

José I. Ronda	e-mail: jir at gti.ssr.upm.es
Antonio Valdés	e-mail: Antonio_Valdes at mat.ucm.es
Guillermo Gallego	e-mail: ggb at gti.ssr.upm.es
