强背景噪声环境下语音增强算法的研究及应用毕业论文

资源描述

《强背景噪声环境下语音增强算法的研究及应用毕业论文》由会员分享，可在线阅读，更多相关《强背景噪声环境下语音增强算法的研究及应用毕业论文（107页珍藏版）》请在金锄头文库上搜索。

1、密级：内部 2 年强背景噪声环境下语音增强算法的研究及应用Research and Application of Speech Enhancement in Strong Noise Environment(申请清华大学工学硕士学位论文)培养单位：学科：研究生：指导教师:关于学位论文使用授权的说明本人完全了解清华大学有关保留、使用学位论文的规定，即：清华大学拥有在著作权法规定范围内学位论文的使用权，其中包括：（1）已获学位的研究生必须按学校规定提交学位论文，学校可以采用影印、缩印或其他复制手段保存研究生上交的学位论文；（2）为教学和科研目的，学校可以将公开的学位论文作为资料在

2、图书馆、资料室等场所供校内师生阅读，或在校园网上供校内师生浏览部分内容；（3）根据中华人民共和国学位条例暂行实施办法，向国家图书馆报送可以公开的学位论文。本人保证遵守上述规定。（保密的论文在解密后遵守此规定）作者签名：导师签名：日期：日期：摘要摘要人们在语音通信过程中不可避免地会受到各种噪声的干扰，影响了通信质量。同时，噪声的存在也使低速率语音编码以及语音识别等语音处理系统的性能下降。语音增强作为一种预处理手段，不失为解决噪声污染的一种有效途径。在各类语音增强技术中，基于短时谱幅度估计的语音增强算法处于主流位置，得到广泛应用。本文针对短时谱幅度估计在低输入信噪比以及非平稳噪声环

3、境下，性能不够理想这一不足加以研究改进，完成稳健的语音增强算法设计和实时实现。根据对短时谱幅度估计算法进行分析，指出了影响其性能的关键技术：噪声统计特性的估计、语音激活检测算法、先验信噪比估计、增益函数的修正等，为算法改进指明了研究方向。首先，提出了一种快速估计噪声统计特性的算法，减小了噪声估计的跟踪延时，在一定程度上改善了噪声过估计的现象，因而能够在低输入信噪比以及非平稳噪声干扰环境下快速准确地跟踪噪声特性。其次，研究了一种基于高斯模型和一致最大势检验的语音激活检测算法，由于检测门限与噪声估计相适应，因而提高了检测性能，尤其在非平稳噪声干扰环境下得到了良好的检测效果。同时，利用语音激活检测

4、算法中傅里叶变换得到的频谱信息，方便地实现了双音多频信号的检测和生成。再次，对先验信噪比估计的反馈因子进行最优化调整，引入与长时统计信息相关的自适应下限，有效地协调了先验信噪比估计的稳定性和快速跟踪能力。同时，利用软判决信息，即有关语音存在的概率信息，修正增益函数表达式中的先验/后验信噪比，有效地消除了“音乐”噪声，使去噪后的语音更加平滑、自然、易于接受。最后，相关算法已用C语言和TMS320C55x DSP汇编语言在PC机和DSP硬件平台上实现。仿真实验和实时测试均表明改进的语音增强算法在不明显损伤语音可懂度的同时，可以有效地抑制背景噪声、提高信噪比，在低输入信噪比和非平稳噪声环境下性能提高

5、尤为显著。关键词：语音增强非平稳噪声短时谱幅度估计噪声估计语音激活检测先验信噪比IAbstract AbstractIn voice communications, speech signals can be contaminated by various noises inevitably, which affects the communication quality. Furthermore, noise interference degrades the performance of speech processing systems, such as low bit-rat

6、e vocoder and speech recognition. Speech enhancement is an effective preprocessing method to reduce the disturbance of noise.Among the speech enhancement techniques, the methods based on Short-Time Spectral Amplitude(STSA) estimation have been well investigated and brought into wide use. In this dis

7、sertation, a lot of research work has been done to improve the performance of STSA estimation against the environments of low Signal-to-Noise Ratio(SNR) input and non-stationary noise. The proposed robust approach of speech enhancement has been implemented on the TMS320C55x DSP platform. According t

8、o the analysis of the STSA-based speech enhancement algorithm, the key techniques are summarized, including the estimation of noise statistic characteristics, voice activity detection(VAD) algorithm, the estimation of a priori SNR, and the modification of gain function,etc.Firstly, a fast estimation

9、 method of noise statistic characteristics is proposed. The method decreases the adaptation time of tracking noise, avoids the over-estimation phenomenon to some extent, thus can track the noise characteristics with higher accuracy and less time in low SNR input and non-stationary noise environments

10、.Secondly, a voice activity detector based on Gaussian model and Uniformly Most Powerful(UMP) test is designed. Because of detection threshold related to the noise estimation, the algorithm provides higher detection perfomance, especially in non-stationary noise environments. Furthermore, utilizing

11、the spectral information of DFT in the VAD algorithm, the dual tone multiple frequency(DTMF) signal detector and generator are realized conveniently.Thirdly，to balance the stability and the tracking ability in the estimation of a priori SNR, the feedback factor of a priori SNR estimation is optimall

12、y adjusted and the estimation result is limited above an adaptive threshold related to long time statistic information. Moreover, the soft-decision modified gain function is obtained by introducing speech presence probability to a priori SNR and a priori SNR. These methods effectively eliminate the“

13、musical” noise and make the enhanced speech smooth, natural and acceptable.Finally, The algorithm mentioned above has been implemented in both C language and assembler language on the PC and TMS320C55 DSP hardware platform. Both simulation experiments and real-time tests show that the improved speec

14、h enhancement algorithm is effective in suppressing background noise and increasing the SNR without apparently impairing the intelligibility of speech .The perfomance is significantly enhanced in low SNR input and non-stationary noise environments. Keywords: speech enhancement non-stationary noise S

15、TSA noise estimation voice activity detection a priori SNR VI目录目录第1章引言11.1语音增强课题背景11.2带噪语音模型21.2.1 语音的主要特性21.2.2 噪声的主要特性31.2.3 人耳的感知特性41.2.4 背景噪声对于语音的影响41.3语音增强的发展历史51.4语音增强算法分类51.5论文研究工作71.6论文内容组织71.7本章小结7第2章基于短时谱幅度估计的语音增强算法概述82.1本章引论82.2语音增强算法概述82.2.1 自适应噪声对消法82.2.2 谐波增强法92.2.3 基于语音生成模型的语音增强算法102.2.4 基于短时谱幅度估计的语音增强算法102.2.5 其他几种形式的语音增强算法112.3基于短时谱幅度估计的语音增强算法112.3.1 谱减法的一般形式122.3.2 谱减法的改进形式142.3.3 维纳滤波法15

展开阅读全文

强背景噪声环境下语音增强算法的研究及应用毕业论文

最新文档