粒子群优化模糊神经网络用于语音识别的研究

资源描述

《粒子群优化模糊神经网络用于语音识别的研究》由会员分享，可在线阅读，更多相关《粒子群优化模糊神经网络用于语音识别的研究（73页珍藏版）》请在金锄头文库上搜索。

1、太原理工大学硕士学位论文粒子群优化模糊神经网络用于语音识别的研究姓名：孙慧申请学位级别：硕士专业：信号与信息处理指导教师：张雪英 20100601 太原理工大学硕士研究生学位论文 I 粒子群优化模糊神经网络用于语音识别的研究摘要语音识别技术是近年来高速发展的一项人工智能技术，具有广泛的应用前景，在互联网、通信、军事、国防、人机交互等方面都有重要的应用价值，长期以来一直是人们研究的热点。语音信号的非线性和语义的模糊性是语音识别研究的关键所在。因此，结合了人工神经网络的非线性、自学习性等特点和模糊系统的模糊推理和模糊划分等优点的模糊神经网络（Fuzzy Neural

2、Network, FNN），很快被应用到语音识别中。同时，粒子群优化算法（Particle Swarm Optimization, PSO）是一种性能良好的全局优化算法，也是一种很有潜力的神经网络训练算法。本文对粒子群优化模糊神经网络在语音识别中的应用进行了深入的研究，取得了以下的成果： 1、针对传统 BP 算法训练模糊神经网络依赖于初始值、训练时间较长、易陷入局部极值的问题，本文提出了粒子群算法优化模糊神经网络。同时为平衡粒子群算法的全局探索和局部改良能力，算法引入了自适应动态改变的惯性因子。考虑到语音识别过程中语音特征参数的维数都比较大，模糊神经网络的结构较复杂

3、，不宜将全部参数都采用粒子群算法优化，文中利用粒子群算法对模糊神经网络模糊层的中心进行聚类，宽度和权值则分别采用距离测度和伪逆法获得，将训练好的网络应用于语音识别。实验表明，该算法有很好的识别性能，在识别率和收敛速度上都优于 BP 算法训练的网络。太原理工大学硕士研究生学位论文 II 2、针对基本粒子群算法易陷入局部极值的问题，本文引入了量子粒子群算法（Quantum-behaved Particle Swarm Optimization, QPSO）。并用四个典型的测试函数对两种优化算法的性能进行了测试，结果表明量子粒子群算法比基本粒子群算法更易找到全局最优值，因此本文提

4、出用量子粒子群算法优化模糊神经网络。 3、针对当前语音识别系统对环境依赖性强、实现过程复杂的问题，本文提出了一种改进的语音识别系统，结合两种信噪比环境下的语音对系统进行训练，训练的模板库可以适用于任何信噪比下语音的识别，大大提高了识别系统对环境的适应性，简化了语音识别的过程。本文将量子粒子群优化的模糊神经网络应用于改进的语音识别系统，并与在相同条件下的原系统实验结果进行对比，表明改进的系统具有良好的识别性能。关键词：粒子群算法，量子粒子群，模糊神经网络，语音识别太原理工大学硕士研究生学位论文 III STUDY OF SPEECH RECOGNITION BASED ON FU

5、ZZY NEURAL NETWORK OPTIMIZED BY PSO ABSTRACT In recent years, speech recognition is an artificial intelligence technology that is developping rapidly, which has broad application prospects. It is a hot research spot which has been widely applied in the Internet, communications, military, national de

6、fense and human-computer interaction. It is the key that nonlinear of the speech signal and semantic ambiguity in the speech recognition research. Therefore, fuzzy neural network(FNN) combined nonlinear and self-learning of artificial neural network with fuzzy inference and fuzzy partition of fuzzy

7、system is applied in the speech recognition quickly. Particle swarm optimization(PSO) algorithm is a global optimization algorithm that has good performance, while it is a promising neural network training algorithm. In this paper, the in-depth research is done to the speech recognition based on fuz

8、zy neural network optimized by PSO. The main achievements are as follows: 1、Fuzzy neural network trained by particle swarm optimization algorithm is proposed in order to overcome shortage of traditional BP algorithm which replies on initial conditions, has the longer training time, and is easy to be

9、太原理工大学硕士研究生学位论文 IV trapped into the local extremum. The paper introduces the inertia factor changed adaptively and dynamically in order to balance the glabal and local search ability. Taking into account the dimension of the speech feature in speech recognition is large, and the structure of fuzzy

10、neural network is complicated, it is inappropriate to optimize all parameters by PSO. Therefore, PSO algorithm is used to cluster the center of fuzzy layer, while the width and the weight is respectively got by the distance measure algorithm and the pseudo inverse algorithm. The trained network is u

11、se to the speech recognition. The experiment results show that the proposed algorithm has good performance, which is better than the network trained by BP algorithm in the recognition rate and the convergence speed. 2、Point at the problem of premature convergence of the basic PSO algorithm, it intro

12、duces quantum-behaved particle swarm optimization (QPSO). And two optimization algorithms is tested using four typical test functions. The results show that QPSO is easier to find the global optimal value than the basic PSO. Therefore, fuzzy neural network optimized by QPSO is proposed. 3、Aimed to t

13、he shortage of the old system that has strong dependence on the environment and has complicated operation, an improved speech recognition system is proposed. It is trained to be a template library by speech feature in two SNRs environments combined low SNR with noiseless. The trained network is appr

14、opriate to recognize in any SNR environment, which improves the adaptability to environment, and simplifies the process of speech 太原理工大学硕士研究生学位论文 V recognition. Finally, fuzzy neural network optimized by QPSO is used to the improved speech recognition. The experiment shows that the improved system h

15、as better optimization performance, compared with the result of the old system which does the experiment in the same condition. KEY WORDS: particle swarm optimization algorithm, quantum-behaved particle swarm optimization, fuzzy neural network, speech recognition 太原理工大学硕士研究生学位论文 1 第一章绪论 1.1 课题研究的背

16、景及意义随着信息技术的不断发展，人们对信息的数量与质量提出了比较高的要求，同时如何将信息以最快速、最方便、最自然的形式输入到计算机及其它相关信息处理设备中，即建立良好的人机交互界面成为当前计算机应用技术发展的研究热点之一。语音是人类进行信息交流的主要媒介迅速、有效、自然，所以信息以语音的形式与计算机进行交流也就成为一种趋势，由此语音识别技术应运而生，它使计算机不但能够接受键盘和鼠标的输入，而且能够听懂人的话，按人的意志进行操作，把人类从繁重危险的劳动中解放出来，大大缓解信息交互的瓶颈。近 20 年来，语音识别技术取得显著进步，开始从实验室走向市场，被认为是 2000-2010 年间信息技术领域重要的十大科技发展技术之一。语音识别最基本的定义是“让计算机听懂人的话” 。它是以语音为研究对象，内容涉及生理学、心理学、语言学、语音声学、信号处理、通信和信息理论、计算机科学、模式识别等学科的原理和方法，具有高度的学科交叉性质1。它是模式识别的一个分支，是多媒体音频技术的一个重要研究方向，是发展人机有声通信和新一代智能计算机的重要组成

展开阅读全文