支持语义的分布式视频检索系统的设计与实现

资源描述

《支持语义的分布式视频检索系统的设计与实现》由会员分享，可在线阅读，更多相关《支持语义的分布式视频检索系统的设计与实现（70页珍藏版）》请在金锄头文库上搜索。

1、中国科学技术大学硕士学位论文支持语义的分布式视频检索系统的设计与实现姓名：李少波申请学位级别：硕士专业：控制理论与控制工程指导教师：郑烇 2011-05-10 1 摘摘要要随着计算机处理能力、数字化设备、多媒体、数据库等技术的不断发展，以及互联网基础设施的改善，大量视频信息迅速产生。因此，如何进行分类、组织、索引以实现海量视频信息的快速检索，就成为人们迫切需要研究解决的问题。目前的视频检索技术主要分为三类：一是基于视频的标题、导演等文本标注和描述的元信息的视频检索技术；二是基于内容和图像的视频检索技术；三是基于语义的视频检索技术。但是，均未能跨越语义鸿沟并

2、提取有效的视频语义特征。因此，在视频语义内容的机器理解尚未突破的情况下，在商业上最实用的视频索引仍然是在人工语义标注的基础上建立文本索引。随着 MPEG-7 的提出，使得可以使用结构化的文本形式来描述视频的元信息、低级特征和高级语义特征。为了实现支持语义的海量视频检索，本文设计和实现了一个基于 MPEG-7 的、支持语义的分布式视频检索系统： “语寻” ，其中的主要研究内容如下： 1) 研制了一个改进的视频语义处理工具。该工具可以对视频进行自动的低级特征抽取、自动与半自动的文本标注以及人工的语义图标注，生成包含语义串信息的 MPEG-7 描述文件。 2) 提出了语义图查询的检索

3、机制。利用人工标注的语义图来描述视频的内容，并将语义图串化生成的 0 级、1 级、2 级的关键字形式的语义串集建立倒排索引；查询时，利用语义图来描述用户的查询意图，通过 0 级、1 级、2 级的多层级的关键字形式的语义串查询匹配来实现语义图查询。 3) 提出了中文自然语言查询的检索机制。对文本标注信息进行中文自然语言的浅层语义分析和处理，生成语义图，并建立语义图的倒排索引；查询时，利用中文自然语言来描述用户的查询意图，通过中文自然语言的浅层语义分析生成的语义图的查询匹配来实现中文自然语言查询。 4) 提出了语义扩展查询的检索机制。查询时，通过对用户查询意图生成的关键字集进行同义

4、词扩展和上位扩展来实现语义扩展查询。 5) 提出了 Lucene 的分布式改造方法。修改评分公式并建立分布式索引。目前， “语寻”系统已开发完毕，能够提供丰富的 Web 语义查询接口，包括关键字语义扩展查询，语义图查询以及中文自然语句查询；用户能够迅速地检索到语义相似的视频和片段，并且可以浏览点播；整个系统采用分布式架构，具备良好的可扩展性，并能够支持海量视频信息的索引和检索以及海量视频存储。关键词：关键词：MPEG-7；语义；视频检索；分布式；Lucene 3 ABSTRACT Along with the development of technologies such as c

5、omputer processing performance, digital devices, multimedia and database, and the improvement of internet infrastructure, mass of video information are produced rapidly. Therefore, facing ocean volume of video information, how to classify and organize these contents effectively to realize quick sear

6、ching has become an urgent problem. For the moment, video retrieval technologies could be divided in three categories: the first is based on meta information of video depictions; the second relies on content and images analyzing called low-level feature retrieval; the third on the basis of semantic

7、analyzing. Nevertheless, none of these methods could step over semantic gaps and extract effective video semantic features. In the circumstance of no evident breakthrough machine learning, the most practical video indexes are also based on manual annotation from the commercial aspect. As the creatio

8、n of video description interfaceMPEG-7, depicting video meta information, low feature and advanced semantic feature with a structured form has become possible. In order to realize the mass video search supporting semantic, this dissertation designs and realizes an MPEG-7 based and distributed video

9、retrieval systemXunet. Main research contents are as following: 1) The system develop an improved video semantic processing tool which could extract videos low-level feature automatically, complete automatic or semi-automatic text annotation and manual semantic graph annotation, then generate MPEG-7

10、 description files containing semantic strings. 2) We propose a semantic graph retrieval mechanism. It utilizes semantic graph generated from manual annotation to describe video contents. Subsequently, semantic graph could be serialized to level-0, level-1 and level-2 semantic strings in the form of

11、 keywords which built as inverted indexes. On the other hand, we use semantic graph to express users query intentions and multi-level semantic strings in forms of keywords as level-0, level-1, level-2 are matched with query semantic strings to realize semantic search. 3) We propose a Chinese natural

12、 language retrieval mechanism. It processes and analyzes text annotation information through Chinese natural language parsing to generate semantic graphs and establish inverted indexes with semantic graph. From another intention, users query intention could be represented by Chinese natural 4 langua

13、ge and be proceeded and transformed to semantic graphs. Thus query intention could be matched with indexed information through natural language parsing. 4) We propose a semantic expansion mechanism. During the query procedure, both synonyms and hypernyms expansions are adopted when parsing query key

14、words set. 5) We reconstruct Lucenes rating formula to adapt distributed indexes establishment. At present, Xunet has been developed, the system provides various web query interfaces, including keywords semantic expansion query, semantic graph query and natural language query. Users could find out s

15、emantic similar videos and segments promptly. Results browsing and video on demand could be served as well. Owing to adopting a distributed architecture, the system is highly scalable and supports mass video information index and retrieval. Key Words: MPEG-7, Semantic, Video retrieval, Distributed s

16、ystem, Lucene 中国科学技术大学学位论文原创性声明本人声明所呈交的学位论文,是本人在导师指导下进行研究工作所取得的成果。除已特别加以标注和致谢的地方外，论文中不包含任何他人已经发表或撰写过的研究成果。与我一同工作的同志对本研究所做的贡献均已在论文中作了明确的说明。作者签名：_ 签字日期：_ 中国科学技术大学学位论文授权使用声明作为申请学位的条件之一，学位论文著作权拥有者授权中国科学技术大学拥有学位论文的部分使用权，即：学校有权按有关规定向国家有关部门或机构送交论文的复印件和电子版，允许论文被查阅和借阅，可以将学位论文编入有关数据库进行检索，可以采用影印、缩印或扫描等复制手段保存、汇编学位论文。本人提交的电子文档的内容和纸质论文的内容相一致。保密的学位论文在解密后也遵守此规定。公开保密（_年

展开阅读全文