自动图像标注论文：基于机器学习算法的自动图像标注

资源描述

《自动图像标注论文：基于机器学习算法的自动图像标注》由会员分享，可在线阅读，更多相关《自动图像标注论文：基于机器学习算法的自动图像标注（6页珍藏版）》请在金锄头文库上搜索。

1、自动图像标注论文：基于机器学习算法的自动图像标注【中文摘要】 ”语义清晰”是大规模数字图像管理的重要前提,现有的基于底层特征的图像内容和高级人为理解的图像语义之间存在巨大的鸿沟,因此通过计算机自动获取图像语义内容的研究具有十分重要的意义。自动图像标注的实质是通过对图像的底层视觉特征的处理和分析来获取高层语义关键词,用这组语义关键词表示图像的含义。基于分类的自动图像标注方法是当前图像标注领域中使用最广泛的方法之一。本文的研究目标是结合当前标注模型的特点应用机器学习算法对图像进行标注,与前期基于分类模型的自动图像标注经典算法相比,本文采用的决策树改进算法在分类精度和时间上有所改善,并且该系统可以

2、利用人能理解的规则模型来标注图像。为了获取标注规则,本文将采集到的图像数据库预定义一组需要的关键词(或语义概念)。利用图像分割技术将数据库中的图像分割成许多不同的区域,每个区域大致对应于一个语义对象。然后对图像分割后所得到的各个区域提取出底层视觉特征,包括颜色、纹理和形状特征等。提取出区域的特征属性后,手动将有意义的区域归并为几个类,这几个类均为预定义的语义概念。特征属性数据可以作为后续机器学习的训练数据。然后该系统可以通过机器学习方法从这些特征数据中学习到语义概念,利用预定义关键词来标注各个区域,最后图像就可以被这些关键词标注出来。本文主要关注的机器学习算法为改进后的 NewNBtree 算

3、法、SimpleC4.5 算法和 FastRandomForest 算法,通过训练可以得到相应的标注模型,最终实现自动图像标注。在自动语义标注阶段,本文利用图像信息熵的概念对噪声区域进行剔除,更有效地提高了标注系统的准确度。本文通过标准 Corel 图像库和基于Corel 图像库的不同 10 组训练集对采用的算法进行实验分析,验证了改进算法和标注系统的有效性和鲁棒性。实验结果表明本文所采用的机器学习算法比传统决策树算法更能有效地分类图像数据,并能够应用到较大规模图像集中实现图像的自动标注。【英文摘要】 ”Semantic Clarity” is an important prerequisi

4、te of a large-scale digital image management, it exists a big gap between the underlying features of the image and advanced semantics of the image understood by human. Therefore, automatic acquisition of the semantic content of the image through computer information technology is very important theo

5、retical and practical significance. The substance of automatic image annotation is to obtain high-level semantic keywords through processing and analyzing the underlying visual information features of image. We use this set of top semantic keywords to represent the image features in the same way whi

6、ch image can be retrieved as current text search. Automatic image annotation based on classification is one of the most widely used methods in the current image annotation fields.The research goal is to combine the characteristics of the current annotation model, and use machine learning classificat

7、ion algorithm to annotate the image. Compared with the previous classification based on the classic model of automatic image annotation algorithm, the proposed decision tree algorithm classification has a high improvement in accuracy, and the system can use rules to mark the image that can be unders

8、tood. In order to obtain the labeling rules, we must first carry out the training process of the whole system. After each image on the training set are segmented, we have all regions of a certain similarity, then extract the visual features of each region, finally train on the segmented regions usin

9、g machine learning algorithm. In this paper, the main concern is the improved NewNBtree algorithm based on the classical algorithm, SimpleC4.5 algorithm and FastRandomForest algorithm training. The appropriate decision rules can be obtained through the training, and ultimately automatic semantic ann

10、otation can be realized. In the stage of the automatic semantic annotation, we use the concept of information entropy of image to exclude the noisy region, which in turn more effectively can improve the annotation system in accuracy.In this paper, experiments are performed to verify the effectivenes

11、s and robustness of the algorithms and system with a standard Corel image library. It includes 10 different data sets based on Corel image database. The experimental results shows that the proposed algorithm is better than the traditional decision tree learning algorithm for classification of image

12、data and is effectively applied to large-scale training image sets. At last, automatic image annotation system can be implemented based on the machine learning algorithms.【关键词】自动图像标注机器学习决策树集成分类算法【英文关键词】Automatic image annotation Machine learning Decision tree Ensemble learning【目录】基于机器学习算法的自动图像标注

13、摘要 6-7 Abstract 7 目录 8-10 第 1 章绪论 10-16 1.1 研究背景与研究意义 10-11 1.2 国内外研究现状 11-13 1.2.1 基于分类的自动图像标注模型 12 1.2.2 基于概率的自动图像标注模型 12-13 1.2.3 其他方法 13 1.3 图像标注系统关键问题及研究任务 13-15 1.3.1 自动标注系统的框架 13-14 1.3.2 关键问题 14 1.3.3 研究任务 14-15 1.4 本文的结构安排 15-16 第 2 章基于单棵决策树的自动图像标注 16-28 2.1 NewNBtree 算法 16-18 2.1.1 算法思想

14、16-17 2.1.2 算法流程 17-18 2.1.3 算法实现 18 2.2 SimpleC4.5 算法 18-22 2.2.1 算法思想 19-21 2.2.2 算法流程 21-22 2.2.3 算法实现 22 2.3 自动图像标注方法 22-27 2.3.1 自动图像标注流程 22-26 2.3.2 自动图像标注算法描述 26-27 2.4 本章小结 27-28 第 3 章基于集成分类器的自动图像标注 28-36 3.1 集成分类器 28-33 3.1.1 集成学习算法 28-30 3.1.2 快速随机森林算法 30-33 3.2 基于快速随机森林算法的自动图像标注方法 33-35 3.2.1 基于快速随机森林的自动图像标注流程 33-34 3.2.2 基于快速随机森林的图像自动标注算法描述 34-

展开阅读全文

自动图像标注论文：基于机器学习算法的自动图像标注

最新文档