基于几何不变量的视觉计算

资源描述

《基于几何不变量的视觉计算》由会员分享，可在线阅读，更多相关《基于几何不变量的视觉计算（87页珍藏版）》请在金锄头文库上搜索。

1、上海交通大学硕士学位论文基于几何不变量的视觉计算姓名：刘博文申请学位级别：硕士专业：图像处理指导教师：余松煜;杨小康20071201摘要第 I 页基于几何不变量的视觉计算基于几何不变量的视觉计算摘摘要要随着计算机智能化的发展越来越迅速，计算机也开始像人一样有了可以观察的眼睛，它们将摄像机捕捉到的画面读入到内存中，然后对画面进行各种各样的处理，比如图像中的特定目标检测，特定目标识别和特定目标跟踪等等，这就形成了这些年来最热门的研究领域之一计算机视觉。而在计算机视觉研究领域中，图像匹配、目标识别受到广泛关注。比如在一些主动视觉系统中，通常需要多个代理对同一场景中的感兴趣目标一

2、起进行处理，以提高系统智能分析感兴趣目标的能力。这种情况下多视图图像匹配则是系统中的核心算法。但主动视觉系统的视场角一般较大，通常在宽基线成像模型下采集到多个视图，视间显著的仿射畸变增加了图像匹配问题的复杂性。同时，主动视觉系统以最佳视角观察目标并保持目标在图像平面的中心位置基本不变，因此摄像头需做实时的姿态调整，由此导致的视间几何关系变化进一步加深了匹配问题的解决难度。针对这样一种系统需要，本文引出了几何不变量的概念。几何不变量描述的是图像中基于像素的本质特征，它能够在剧烈的图像变换条件下保持自身的不变性，并且它要有容易被检测到的特征载体，这两个条件是图像匹配算法能够利用

3、几何不变量进行成功匹配的保证。本文正是在几何不变量的基础上进行了宽基线图像匹配算法的研究以及感兴趣目标对应技术的研究。首先，本文对几何不变量应用的必要性及已有的几何不变量方法进行了全面的综述，指出应用在特定的宽基线条件下的图像匹配需要怎样的几何不变量。其次，本文对基于几何不变量的宽基线图像匹配的整个算法流程进行详细分析与讨论，研究了从特征提取，特征描述，特征匹配到生成多视几何关系每个部分的算法选择及适用性，并在已有的最佳特征提取算法 MSER摘要第 II 页及最佳特征描述算法SIFT基础上创新性地提出了将这两种算法结合的方法以形成更鲁棒高效的图像匹配结果。实验证明，这种

4、结合与叠加的方法可以获得更准确的匹配点对以及更准确的视间几何关系。最后，本文在几何不变量的基础上提出了一种新的解决感兴趣目标对应问题的方法多外极线约束法。它弥补了以往基础矩阵法与单应矩阵法的缺陷，能够更适用于实时视觉系统，经过实验证明，它能够生成十分准确的一对一对应的感兴趣目标，并且不需要借助任何附加的对图像内容进行搜索的过程，鲁棒高效。基于几何不变量的视觉计算应用范围非常广，本文也提出了下一步的研究方向。关键词：关键词：几何不变量，宽基线，图像匹配，感兴趣目标对应技术 ABSTRACT 第 III 页 MULTI-VIEW COMPUTATION BASED ON GEOM

5、ETRIC INVARIANTS ABSTRACT With the rapid development of computer intelligence, computers are able to see things just like human being. They input the images captured by cameras and then perform diverse processing of the images; this is how computer vision, one of the most popular research domains re

6、cently, comes to existence. In compute vision research area, image matching, object recognition, etc. gain much concentration. For instance, in active vision system, intelligent analysis of objects-of-interest (OOI) in 3D scene generally needs multi-agent cooperation, which depends heavily on the ac

7、curacy of multi-view geometry computation and the robust correspondence of OOI. Thus, multi-view image matching algorithm becomes the vital part of the whole system. In most cases, active vision system captures multiple views under wide baseline stereo model, which results in distinct affine distort

8、ions and complex description of correspondence problem. Furthermore, the difficulty of depicting correspondence problem in active vision is aggravated when the cameras perform pose adjustment for active OOI tracking with the best viewpoint. It is known the geometry constraint between views is import

9、ant to make correspondence problem well-posed. However, the multi-view geometry is time-variant in active vision because of the cameras pose adjustment. In order to meet the needs of active vision system, the paper refers to geometric invariants, which describe the essential features based on pixels

10、 of images. Geometric invariants stay unchanged after diverse image transformations such as affine transformation, and they also have themselves carried by detectable features. Both of these two conditions ensure that image ABSTRACT 第 IV 页 matching algorithm can perform well using geometric invarian

11、ts. The paper gives a thorough and deep research on both wide-baseline image matching and correspondence of OOI. Firstly, the paper presents a comprehensive review on necessity of geometric invariants application and existing methods and then gives a description on suitable geometric invariants for

12、wide baseline image matching. Secondly, the paper gives a detailed discussion on the whole process of wide-baseline image matching algorithm based on geometric invariants, including feature detection, feature description, feature matching and multi-view geometry generation. The paper also proposes a

13、n innovative method of image matching by combining one of the best feature detectors MSER and one of the best feature descriptors SIFT. The experiments demonstrate that our method can obtain more robust matching results between wide baseline images. Finally, the most striking creation of this paper

14、is to propose a new solution to OOI correspondence problem between wide baseline images, named multi-epipolar-line constraint. It defeats existing methods by generating the more accurate one-to-one OOI correspondence and thus can be more suitable for real-time system. The experimental results show t

15、hat our new method is very portable, robust and efficient. The paper also gives a prospect of the research related on multi-view computation based on geometric invariants. Keywords: geometric invariants, wide baseline, image matching, correspondence of object-of-interest 攻读硕士学位期间已发表的论文第 81 页上海交通大学

16、上海交通大学学位论文原创性声明学位论文原创性声明本人郑重声明：所呈交的学位论文，是本人在导师的指导下，独立进行研究工作所取得的成果。除文中已经注明引用的内容外，本论文不包含任何其他个人或集体已经发表或撰写过的作品成果。对本文的研究做出重要贡献的个人和集体，均已在文中以明确方式标明。本人完全意识到本声明的法律结果由本人承担。学位论文作者签名：刘博文日期：2008 年 1 月 21 日攻读硕士学位期间已发表的论文第 82 页上海交通大学上海交通大学学位论文版权使用授权书学位论文版权使用授权书本学位论文作者完全了解学校有关保留、使用学位论文的规定，同意学校保留并向国家有关部门或机构送交论文的复印件和电子版，允许论文被查阅和借阅。本人授权上海交通大学可以将本学位论文的全部或部分内容编入有关数据库进行检索，可以采用影印、缩印或扫描等复制手段保存和汇编本学位论文。保密保密，在年解密后适用本授权书。本学位论文属于不保密不保密。（请在以上方框内打“

展开阅读全文