| |
||||
|
|
Research Projects: |
1. Active Vision System Calibration project We built this binocular head to investigate some active vision problems, e.g., 3D model/environment reconstruction and 3D tracking. It includes twoprismatic joints (X-Y table) and six revolute joints (pan, tilt, left verge, left focus, right verge, and right focus). We developed a four stage (motorized lens camera calibration, kinematics calibration, head/eye calibration, and global kinematic refinement) process to calibrate this binocular head and achieve accuracy of one pixel prediction error and 0.2 pixel epipolar error. Our YG2 binocular vision system includes HelpMate BiSight platform and two tailor-made motorized lens. It uses 10 servo motors as axes of pan, tilt, and left/right verge, focus, zoom, and aperture. We are now calibrating this binocular head. III. Demo:
Current research emphases :
4.Image-Based Rendering In our laboratory, we have developed a useful technique, the Disparity Morphing, for image-based rendering systems. We have successfully applied the disparity morphing technique to several image-based rendering systems, such as the Stereo Panoramic Imaging SYstem (SPISY) and the Object Movie System, to enhance the realistic and interaction capability of those systems. Projects under the image-based rendering research topic:
5.Motion Estimation and Visual Tracking A 3D feature-based tracker for tracking multiple moving objects in the IIS-head was developed. We also successfully developed a free-hand pointer based on hand tracking and a human head tracker based on automatic detection and tracking of human heads. Recently, we develop a model-based tracker based on robust estimation technique, and then apply it in the project of augmented reality.
MPEG-4 also provides a large amount of research topics about the audiovisual processing and communication. The most interesting MPEG-4 related research topics for our laboratory include: How to code video data efficiently; How to segmentation video objects; How to identify background objects during video conferencing for sprite coding; How to generate animation mesh for foreground objects during video conferencing; How to generate 3D objects from stereo images; And how to generate face object information combined with head-tracking system . Current Research Projects under the MPEG 4 Coding research topic include:
(1) Face Recognition In this project, a coarse-to-fine, LDA-based face recognition system is proposed. Through careful implementation, we found that the database adopted by two state-of-art face recognition systems were incorrect because they mistakenly use some none-face portions for face recognition. Hence, a face-only database is used in the proposed system. Since the facial organs on a human face only differ slightly from person to person, the decision-boundary determination process is tougher in this system than it is in the conventional approached. Therefore, in order to avoid the above mentioned ambiguity problem, we propose to retrieve a closest subset of database samples instead of retrieving a single sample. The proposed face recognition system has several advantages. First, the system is able to deal with a very large database and can thus provide a basis for efficient search. Second, due to its design nature, the system can handle the defocus and noise problems. Third, the system is faster than the autocorrelation plus LDA approach and the PCA plus LDA approach, which are believed to be two statistics-based, state-of-art face recognition systems. (2) Face Detection A useful geometrical face model and an efficient facial feature detection approach are proposed. Based on the fact that human faces are constructed in the same geometrical configuration, the proposed approach can accurately detect facial features, especially the eyes, even when the images are complex backgrounds. The average computation time for one image of size 512x340 is less than 5 sec. By a SUN-Sparc 20 workstation. Experimental results demonstrate that the proposed approach can efficiently detect human facial features and satisfactorily deal with the problems caused by bad lighting conditions, skew face orientation, and even facial expression. (3) Signature Verification System In this project, a wavelet-based off-line signature verification system is proposed. The proposed system can automatically identify useful and common features which consistently exist within different signatures of the same person and, based on these features, verify whether a signature is a forgery or not. The system starts with a closed-contour tracing algorithm. The curvature data of the traced closed contour are decomposed into multiresolutional signals using wavelet transforms. Then the zero-crossing corresponding to the curvature data are extracted as features for matching. Moreover, a statistical measurement is devised to decide systematically which closed contours and their associated frequency data of a writer are most stable and discriminating. Based on these data, the optimal threshold value which controls the accuracy of the feature extraction process is calculated. The proposed approach can be applied to both on-line and off-line signature verification systems. Experimental results show that the average success rates for English signatures and Chinese signatures are 91.71% and 93%, respectively.
8. Shape Similarity
9. Sparse Representations for Image Decompositions 10.Wavelet-based Image Analysis This project proposes a new wavelet-based approach to solving the edge detection problem. The proposed scheme adopts Canny`s three criteria as a guide to derive a wavelet-style edge filter such that the edge points of an image can be detected directly and accurately at different scales. Since Canny`s criteria are suitable for those edge detectors that detect local extremes, the desired wavelet is, therefore, chosen to be anti-symmetric. In order to obtain sufficient information for reconstructing and analyzing the original image, the dual of the desired wavelet is also required. Basically, the pair of wavelets is represented as a linear combination of translations of a scaling function. By introducing a constrained optimization process, the set of expansion coefficients of the desired wavelet and its dual as well can be determined. On order to implement the desired edge detector, a continuous wavelet has to be converted into the discrete form. For this purpose, the format of the discrete wavelet transform has to be developed. Since the proposed edge filter is wavelet-based, the inherent multiresolution nature of the wavelet transform provides more flexibility on the analysis of images. Also, since an optimization process is introduced in the filter derivation process, the performance of the proposed filter is better than that of Mallat-Zhong edge detector.
(2) Wavelet-based Shape from Shading This project proposes a wavelet-based approach for solving the shape from shading (SFS) problem. The proposed method takes advantage of the nature of wavelet theory, which can be applied to efficiently and accurately represent ¡§thing¡¨,to develop a faster algorithm for reconstructing better surfaces. To derive the algorithm, the formulation of Horn and Brooks ((Eds.) Shape From Shading, MIT Press, Cambridge, MA, 1989), which combines several constraints into an objective function, is adopted. In order to improve the robustness of the algorithm, two new constraints are introduced into the objective function to strengthen the relation between an estimated surface and its counterpart in the original image. Thus, solving the SFS problem becomes a constrained optimization process.
(3) Wavelet-based Image Registration In this project, we propose a new edge-based approach to efficiently deal with the image registration problem. The proposed method applies the wavelet transform technique to extract feature points from a partially overlapping image pair. By defining a similarity measure metric, the two sets of feature points can be compared, and the correspondences between the feature points can be established. Once the set of correctly matched feature point pairs between two images are found, the registration parameters can be derived accordingly. The proposed method can tolerate approximately 10% scaling variation and does not have to restrict the position and orientation of the input images. |
| [Home] [People] [Projects ] [Publications] [Seminars] [Demos] [Download] [Other Links] |
|