Image Mosaicing

Image alignment for image mosaicing, with a focus on synchronization algorithms

Image mosaicing is an effective means of constructing a single seamless image by aligning multiple partially overlapping images. In Computer Vision (CV), many applications, such as super-resolution imaging and medical imaging, require image mosaicing. The technique is also used in panoramic stitching, where it allows wide-angle images to be created without fish-eye lenses, overcoming the difficulty of taking a single photo with a very large field of view (FOV). Image stitching algorithms have likewise been used for decades to create the high-resolution photo-mosaics behind digital maps and satellite photos. Ideally, the resulting stitched image should look as natural as a real photo covering the entire scene.

However, while creating a mosaic from only two overlapping images is a relatively easy task and standard techniques can provide very good results, aligning multiple images is much more difficult, particularly if some input images do not overlap.

More precisely, image mosaicing can be regarded as a special case of scene reconstruction in which the images are related by planar homographies only. This is a reasonable assumption if the images exhibit no parallax effects, i.e., when the scene is approximately planar or the camera purely rotates about its optical centre.
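To make the pure-rotation case concrete, here is a minimal sketch of the homography induced by a rotating camera, H = K R K^-1, and of how it maps pixel coordinates. The intrinsics K, the rotation angle, and both function names are assumptions chosen for illustration; they do not come from the project itself.

```python
import numpy as np

def rotation_homography(K, R):
    """Homography induced by a pure camera rotation R with intrinsics K."""
    return K @ R @ np.linalg.inv(K)

def apply_homography(H, pts):
    """Map an (N, 2) array of pixel coordinates through the 3x3 matrix H."""
    pts_h = np.hstack([pts, np.ones((len(pts), 1))])  # homogeneous coordinates
    mapped = pts_h @ H.T
    return mapped[:, :2] / mapped[:, 2:3]  # back to pixel coordinates

# Hypothetical intrinsics: focal length 800 px, principal point (320, 240)
K = np.array([[800.0, 0.0, 320.0],
              [0.0, 800.0, 240.0],
              [0.0, 0.0, 1.0]])

# Hypothetical motion: 10 degrees about the vertical (y) axis
theta = np.deg2rad(10.0)
R = np.array([[np.cos(theta), 0.0, np.sin(theta)],
              [0.0, 1.0, 0.0],
              [-np.sin(theta), 0.0, np.cos(theta)]])

H = rotation_homography(K, R)
centre = apply_homography(H, np.array([[320.0, 240.0]]))
```

Because det(R) = 1, this H has unit determinant, which is the normalisation used later when homographies are treated as elements of SL(3).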

In general, the procedure can be divided into an image alignment step and an image compositing step. The goal of image alignment is to bring the images into a common coordinate system (most of our work is related to this step). The goal of image compositing is to overlay the aligned images on a larger canvas by merging the pixel values of the overlapping portions and retaining the pixels where no overlap occurs. Compositing is usually performed in two steps: colour correction and blending. Colour correction is needed since neighbouring images can present different colours due to factors such as the exposure level and differences in the lighting conditions.
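The compositing rule described above (merge where images overlap, keep pixels where they do not) can be sketched as follows. This is a simplified illustration that assumes the images have already been warped onto a common canvas and come with binary validity masks; it uses plain averaging and is not the colour correction or blending scheme of any specific pipeline. The function name `composite` is hypothetical.

```python
import numpy as np

def composite(images, masks):
    """Average pixels where aligned images overlap; keep them where only one image is valid.

    images: list of (H, W, C) float arrays already warped onto the same canvas.
    masks:  list of (H, W) arrays, non-zero where the matching image has valid pixels.
    """
    acc = np.zeros(images[0].shape, dtype=float)
    weight = np.zeros(images[0].shape[:2], dtype=float)
    for img, mask in zip(images, masks):
        m = (np.asarray(mask) > 0).astype(float)
        acc += img * m[..., None]   # accumulate valid pixels only
        weight += m                 # count how many images cover each pixel
    out = np.zeros_like(acc)
    covered = weight > 0
    out[covered] = acc[covered] / weight[covered][:, None]
    return out

# Tiny example: two 1x3 single-channel images overlapping in the middle pixel
img_a = np.array([[[10.0], [10.0], [0.0]]])
img_b = np.array([[[0.0], [30.0], [30.0]]])
mask_a = np.array([[1, 1, 0]])
mask_b = np.array([[0, 1, 1]])
mosaic = composite([img_a, img_b], [mask_a, mask_b])
```

Real pipelines typically replace the uniform average with distance-based or multi-band weights to hide seams.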

The result of image mosaicing should not be confused with an orthophoto. The former shows a wide area in a single image under perspective projection, whereas the latter relies on orthographic projection. For this reason, image mosaicing does not need any prior Structure-from-Motion or dense matching phase, both of which are required to generate orthophotos.

Several methods for automatic image mosaicing are present in the literature, either presenting a complete pipeline for generating the final mosaic or focusing on one of the steps described above.

The main focus of this post is image alignment, i.e., bringing the images into a common coordinate system using the computed geometric transformations. Existing algorithms for this task are broadly categorised by the information they extract from the images.

Direct methods exploit the entire image data, thus providing very accurate registration, but they require an initialization close to the solution. They measure similarity either from the image intensity values or from the quantity of information (mutual information) shared between two images.

In contrast, feature-based algorithms compute the transformations from a sparse set of low-level features and can be computationally less expensive. Commonly used low-level features (e.g., edges, corners, pixels, colours, histograms) can be extracted with a variety of approaches. In particular, Brown et al. showed that formulating stitching as a multi-image matching problem, and using invariant local features to find matches between the images, yields a method that is insensitive to the ordering, orientation, scale and illumination of the input images.
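As an illustration of the feature-based route, the sketch below estimates a pairwise homography from matched point coordinates with the plain Direct Linear Transform (DLT). It assumes the matches are already available and free of outliers; practical systems usually add coordinate normalisation and a robust estimator such as RANSAC. The function name and the sample points are hypothetical.

```python
import numpy as np

def estimate_homography_dlt(src, dst):
    """Estimate H such that dst ~ H @ src from N >= 4 point matches (plain DLT).

    Assumes the true H[2, 2] is non-zero so that H can be scaled to H[2, 2] = 1.
    """
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        # Two linear constraints on the 9 entries of H per correspondence
        rows.append([-x, -y, -1, 0, 0, 0, u * x, u * y, u])
        rows.append([0, 0, 0, -x, -y, -1, v * x, v * y, v])
    A = np.asarray(rows, dtype=float)
    _, _, Vt = np.linalg.svd(A)
    H = Vt[-1].reshape(3, 3)  # right singular vector of the smallest singular value
    return H / H[2, 2]

# Synthetic check with a made-up ground-truth homography
H_true = np.array([[1.2, 0.1, 0.3],
                   [-0.2, 0.9, 0.1],
                   [0.05, 0.02, 1.0]])
src = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0], [0.5, 0.25]])
proj = np.hstack([src, np.ones((len(src), 1))]) @ H_true.T
dst = proj[:, :2] / proj[:, 2:3]
H_est = estimate_homography_dlt(src, dst)
```

With exact correspondences the estimate matches the ground truth up to numerical precision; with noisy matches it is only a least-squares approximation.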

Image alignment and colour correction can both be solved using graph synchronization techniques. The synchronization problem can be defined as follows:

Given a graph where nodes are characterized by an unknown state, and edges measure the ratio (or difference) between the states of the connected nodes, try to infer the unknown states from the pairwise measures.

More precisely, states are represented by the elements of a specific group (this is why the problem is referred to as group synchronization). Recently, the synchronization problem has been extensively investigated in the Computer Vision community. Schroeder et al. proposed four closed-form solutions to the synchronization problem for the specific task of image alignment. In this scenario, the global homographies represent the unknown states of a graph whose edges are pairwise homographies. For this reason, states belong to the SL(3) group (Special Linear group, i.e., the set of 3×3 matrices with unit determinant).
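The following is a minimal sketch of one closed-form, null-space (spectral-style) approach to synchronization over SL(3) on a simple graph. It is meant only to illustrate the problem statement and is not claimed to be any of the solutions of Schroeder et al. or the MULTISYNC algorithm. It assumes each measure satisfies Z_ij ≈ X_i X_j^-1, that every unordered pair appears at most once in `edges`, and that the graph is connected; the function names are hypothetical.

```python
import numpy as np

def unit_det(H):
    """Scale a 3x3 matrix so that its determinant is +1 (a representative in SL(3))."""
    return H / np.cbrt(np.linalg.det(H))

def synchronize(n, edges):
    """Recover global states X_i from pairwise measures Z_ij ~ X_i @ inv(X_j).

    edges: dict mapping (i, j) to a 3x3 matrix; the graph must be connected.
    Returns a list of n matrices, with the gauge fixed so that X_0 = I.
    """
    Z = np.zeros((3 * n, 3 * n))
    deg = np.ones(n)  # each node counts its implicit self-loop Z_ii = I
    for i in range(n):
        Z[3*i:3*i+3, 3*i:3*i+3] = np.eye(3)
    for (i, j), H in edges.items():
        H = unit_det(np.asarray(H, dtype=float))
        Z[3*i:3*i+3, 3*j:3*j+3] = H
        Z[3*j:3*j+3, 3*i:3*i+3] = np.linalg.inv(H)  # reverse edge carries the inverse
        deg[i] += 1
        deg[j] += 1
    # Consistent states satisfy Z @ X = (D kron I) @ X, so the stacked X spans
    # the null space of M; with noise we take the 3 smallest singular vectors.
    M = Z - np.kron(np.diag(deg), np.eye(3))
    _, _, Vt = np.linalg.svd(M)
    U = Vt[-3:].T  # (3n, 3) basis, equal to the stacked states up to a 3x3 factor
    U0_inv = np.linalg.inv(U[:3])
    return [unit_det(U[3*i:3*i+3] @ U0_inv) for i in range(n)]

# Synthetic example: 4 nodes on a cycle, noiseless measures
rng = np.random.default_rng(0)
X_true = [np.eye(3)] + [unit_det(np.eye(3) + 0.1 * rng.standard_normal((3, 3)))
                        for _ in range(3)]
edges = {(i, j): X_true[i] @ np.linalg.inv(X_true[j])
         for i, j in [(0, 1), (1, 2), (2, 3), (0, 3)]}
X_est = synchronize(4, edges)
```

The states are only defined up to a global transformation, which is why the sketch pins the first node to the identity before comparing against ground truth.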

Arrigoni et al. review several synchronization methods for groups that admit a matrix representation, which allows closed-form solutions.

Dal Cin et al. proposed an algorithm (MULTISYNC) for solving the synchronization problem on multi-graphs (graphs with multiple edges connecting the same pair of nodes), based on an expansion algorithm coupled with a constrained spectral solution to deal with replicated nodes. Our work moves in the direction of applying multi-graph synchronization to the image alignment scenario.

The basic solution to multi-graph synchronization is edge averaging, i.e., converting a multi-graph into a simple graph by averaging the measurements of the edges that share the same source and destination nodes. However, edge averaging is not well defined for all groups. For instance, while it is possible to average rotations, there is no theoretically grounded averaging for homographies. Thus, we study the results provided by edge averaging in the case of homographies and compare them with multi-graph synchronization. The same multi-graph framework can be applied to partition classical synchronization tasks, achieving a good trade-off between accuracy and complexity. The whole procedure can be seen as composed of three main steps:

Graph Building: in this phase, the graph of pairwise homographies, each able to align one image to another, is built.
Image Alignment: in this phase, multi-graph synchronization is applied to the previously constructed graph. Thanks to the synchronization algorithm, the unknown states representing the global homographies are inferred. Unlike pairwise homographies, global homographies align the images to the common coordinate system.
Image Stitching: in this phase, the stitched image is created. Each image is transformed into the reference frame using the estimated global homographies and fused with the others.
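For the graph-building step, the edge-averaging baseline discussed earlier can be sketched as below. This is a deliberately naive illustration: it normalises each measurement to unit determinant, takes the entry-wise mean, and re-normalises. As noted above, such an average has no theoretical grounding for homographies, so it should be read as an assumption-laden baseline, not as the procedure used by MULTISYNC. The names `unit_det` and `average_edges` are hypothetical.

```python
import numpy as np
from collections import defaultdict

def unit_det(H):
    """Scale a 3x3 matrix so that its determinant is +1."""
    return H / np.cbrt(np.linalg.det(H))

def average_edges(multi_edges):
    """Collapse a multi-graph into a simple graph by naive entry-wise averaging.

    multi_edges: iterable of (i, j, H) triples; several triples may share (i, j).
    Returns a dict mapping (i, j) to a single unit-determinant 3x3 matrix.
    """
    groups = defaultdict(list)
    for i, j, H in multi_edges:
        groups[(i, j)].append(unit_det(np.asarray(H, dtype=float)))
    # The mean of unit-determinant matrices need not have unit determinant,
    # so the result is projected back by rescaling.
    return {edge: unit_det(np.mean(Hs, axis=0)) for edge, Hs in groups.items()}

# Two measurements on edge (0, 1) and one on edge (1, 2)
H_a = np.diag([2.0, 1.0, 1.0])
simple_graph = average_edges([(0, 1, H_a), (0, 1, H_a), (1, 2, np.eye(3))])
```

The resulting dictionary has one matrix per node pair and can be passed to any simple-graph synchronization routine, which makes it a convenient reference point when comparing against a true multi-graph solver.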

I applied MULTISYNC to solve partitioned synchronization problems, estimating global homographies for image mosaicing. You can find more details about this project at this link.