基于预测的无损音频编码

资源描述

《基于预测的无损音频编码》由会员分享，可在线阅读，更多相关《基于预测的无损音频编码（77页珍藏版）》请在金锄头文库上搜索。

1、上海交通大学硕士学位论文基于预测的无损音频编码姓名：郑珊瑚申请学位级别：硕士专业：信号与信息处理指导教师：胡剑凌20060101上海交通大学硕士学位论文I基于预测的无损音频编码摘要人耳对于声音的敏感度很高数字音频的有损压缩技术无法提供完美音质的音频因为有损压缩技术在获取较高的压缩比的同时也带来了解压后信号音质的降低另一方面由于无损压缩压缩比小往往无法适应传输的需要但近年来随着大容量存储设备的出现和网络传输带宽的增加为无损压缩提供了物理保证从而推动了其发展本文针对目前存在的这些无损音频算法的应用趋势提出了一种基于预测的无损音频压缩算法方案本文采用的基于预测的无损音频压缩编码器由缓冲器系数估计和

2、量化模块预测器熵编码模块和复用器等组成原始音频信号按帧处理输入数据首先存放于缓冲器内然后由系数估计和量化模块计算滤波器系数并对其量化接着预测器根据量化后的预测系数进行预测获得残差信号对残差信号进行熵编码获得最终的码字及其相关边信息这些信息与之前获得的预测器系数一起送入复用器再加上相应的头信息得到压缩的比特流解码器结构组成与编码器相似由解复用器熵解码器和合成器组成解复用器从比特流中分解出码字信息和预测器系数等熵解码器根据码字信息重构残差然后送入预测器中恢复出原始音频数据解码器比编码器简单其计算量主要取决于预测器的阶数和熵解码在本文整个编解码系统中由于无损音频压缩算法的非对称性编码端的计算要远远大

3、于解码端在编码端先是信号分帧处理为了适应暂态信号根据实际情况可将一帧再划分成若干个短块进行处理本文采用时域分块峰值变化率准则能够通过简单运算取得比较准确的判决结果在预测模块中采用经典的AR模型进行线性预测考虑到可以在预测时就利用声道间的相关性论文选用双声道联合差分编码同时本文还选用一种立体声互预测方法立体预测器不仅利用到该声道以前的样值还利用其他声上海交通大学硕士学位论文II道的样值即相当于每个声道的预测器包含两个预测器一个为基于自身声道样值预测的自预测器另一个为利用相邻声道样值预测的互预测器实验表明立体声互预测能够得到比较高的压缩率同时需要的计算量是巨大的所以综合考虑压缩率和计算量等因素来决

4、定所采用的预测方法在无损编码模块中由实验表明预测后信号接近拉普拉斯Laplacian分布本文中经过映射转化近似于单边几何分布后采用Golomb-Rice码对预测后信号进行熵编码获得码字及其相关边信息实现简单快速编码本文仿真实现了该编解码系统最后将该算法与目前常见的无损音频编码算法作比较并且给出了比较结果从比较中可以看出采用双声道联合差分编码和立体声互预测编码具有较好的性能在压缩比上双声道联合差分编码算法得到的结果可以与Monkey相比拟当采用立体声互预测时较多情况下会超过它尽管立体声互预测编码时编码模块计算量很大若进一步对算法进行优化提高其编码解码速度将其优势突出相信会得到广泛的应用关键词无损

5、音频编码线性预测编码熵编码Golomb-Rice 编码上海交通大学硕士学位论文IIILossless Audio Coding Based On PredictionAbstractWith the development of the internet transmission technology andlarge capacity storage device, audio lossless coding became possible,which may provide high quality audio for “golden ears”. Althoughlossless com

6、pression algorithms rarely obtain a high compression ratio,lossless audio coding of audio signals becomes an essential technologyfor digital music distribution over the Internet. This paper brings forwardan algorithm for audio lossless compression and realizes the codec systemfinally.The adopted alg

7、orithm contains several basic parts such as framing,linear prediction, entropy coding and multiplexing. In the encoderprediction residuals are calculated using linear prediction with adaptivepredictor coefficients. The coefficients are quantized prior to filtering andtransmitted as side information.

8、 The prediction residual is entropy coded.Then the multiplexing unit combines coded residual, code indices,predictor coefficients and other additional information to form thecompressed bit stream. The structure of the decoder is similar with theencoder, and it contains functions such as de-multiplex

9、ing unit, entropy上海交通大学硕士学位论文IVdecoder and predictor etc. The de-multiplexing unit splits the bit stream,and the entropy decoder reconstructs the residual and then adds it to thepredicted samples to produce the original signal. The decoder is simplethan the encoder, and the computation complexity of

10、 the decoder mainlydepends on the order of the predictor chosen by the encoder.In the prediction part the input audio data is firstly partitioned intoframes. A frame of audio samples can be further subdivided into sub-frames to allow faster adaptation for transient parts of audio signals,which is ca

11、lled block length switching in the paper. This paper uses time-domain sub- frame peak judging principal. It consumes little calculationand can get a good estimation for transient signals.In order to make use of the inter-channel correlations, the paper hasadopted stereo prediction as well as joint c

12、hannel coding. Thus inter-channel redundancy can be removed further. Results show that highercompression efficiency can be achieved at the expense of largecomputation complexity with stereo prediction.In the entropy coding part, experiments show that the distribution ofthe prediction residuals can b

13、e closely modeled by Laplacian distribution.So after flipping the negative side of the distribution and merge it withthe positive one, the paper chooses to apply Golomb-Rice codes andobtains simple and quick coding.After those work above, the paper realized the codec and perform上海交通大学硕士学位论文Vmany tes

14、ts. Results show that the compression rate can compete withMonkeys Audio algorithm by using joint channel coding technique andis even a little better than Monkeys Audio algorithm by using stereochannel coding method. Without doubt, lossless compression requiresfurther investigation and will obtain m

15、ore and more applications.Keywords: Audio Lossless Coding, Linear Predictive Coding, Entropycoding, Golomb-Rice code上海交通大学学位论文原创性声明本人郑重声明所呈交的学位论文是本人在导师的指导下独立进行研究工作所取得的成果除文中已经注明引用的内容外本论文不包含任何其他个人或集体已经发表或撰写过的作品成果对本文的研究做出重要贡献的个人和集体均已在文中以明确方式标明本人完全意识到本声明的法律结果由本人承担学位论文作者签名郑珊瑚日期2 0 0 6 年 1 月 1 7 日上海交通大学学位

16、论文版权使用授权书本学位论文作者完全了解学校有关保留使用学位论文的规定同意学校保留并向国家有关部门或机构送交论文的复印件和电子版允许论文被查阅和借阅本人授权上海交通大学可以将本学位论文的全部或部分内容编入有关数据库进行检索可以采用影印缩印或扫描等复制手段保存和汇编本学位论文保密在年解密后适用本授权书本学位论文属于不保密请在以上方框内打学位论文作者签名郑珊瑚指导教师签名胡剑凌日期2 0 0 6年 1 月 1 7 日日期2 0 0 6 年 1 月 1 7 日上海交通大学硕士学位论文1第 1章绪论人耳对于声音的敏感度很高数字音频的有损压缩技术无法提供完美音质的音频而无损压缩的压缩率小往往无法适应传输的需要但是近年来由于大容量存储设备的出现和网络宽带的提高为无损压缩编码提供了技术保证因而推动了其发展尤其在数字音频方面因为无损压缩算法很少有超过 3 的压缩率而有损压缩算法却能达到 12 或者更高的压缩比这些使得短期内无损压缩技术没有能够成为数字音频信号处理领域中的主流技术但它确是与有损压缩技术并驾齐驱的音频数据压缩技术由于有损压缩技术在获

展开阅读全文

基于预测的无损音频编码

最新文档