Analysis of Speech Signals Based on Auditory Model Inversion and Its Applications
Abstract
Drawing on findings from auditory physiology, this paper builds an auditory model that simulates the peripheral auditory system and part of the central nervous system. The model consists of a bank of band-pass filters representing the frequency analysis performed by the basilar membrane, a half-wave rectification stage modeling the inner hair cells, and a stage modeling the energy conversion of the nerve fibers. The model serves as a front end for extracting the correlogram (autocorrelation map) of a speech signal.
    The paper then describes how the original speech signal is recovered from its correlogram by inverting the auditory model. It explains how the neural firing-rate function is obtained by inverting the correlogram and how the information lost in half-wave rectification is restored by inverting that stage. A noise-suppression scheme based on the physiological (masking) properties of human hearing is also proposed and implemented, so that the speech analysis/resynthesis system can reconstruct not only clean signals but also speech signals in noisy environments.
    Experimental results show that speech resynthesized from the correlogram by auditory model inversion has good naturalness and good robustness to noise.
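
The analysis front end described in the abstract (band-pass filter bank, half-wave rectification, per-channel autocorrelation) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the Gaussian band shape, the relative bandwidth `rel_bw`, the centre frequencies, the frame length, and all function names are assumptions chosen for illustration, and the nerve-fiber energy-conversion stage is omitted because the abstract gives no detail on it.

```python
import numpy as np

def bandpass_filterbank(x, fs, centers, rel_bw=0.25):
    """Split x into channels using Gaussian-shaped bands in the FFT domain.
    Assumption: a simple stand-in for a cochlear (basilar membrane) filter bank."""
    n = len(x)
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    channels = []
    for fc in centers:
        sigma = rel_bw * fc  # assumed bandwidth, proportional to centre frequency
        gain = np.exp(-0.5 * ((freqs - fc) / sigma) ** 2)
        channels.append(np.fft.irfft(spectrum * gain, n=n))
    return np.array(channels)

def half_wave_rectify(channels):
    """Keep only the positive half of each channel (inner hair cell stage)."""
    return np.maximum(channels, 0.0)

def correlogram_frame(rectified, start, frame_len, max_lag):
    """Short-time autocorrelation of every channel for one analysis frame.
    Requires start + frame_len <= number of samples and max_lag <= frame_len."""
    frame = rectified[:, start:start + frame_len]
    acf = np.zeros((frame.shape[0], max_lag))
    for lag in range(max_lag):
        acf[:, lag] = np.sum(frame[:, :frame_len - lag] * frame[:, lag:], axis=1)
    return acf

# Usage on a synthetic 200 Hz tone (hypothetical test signal, not speech data).
fs = 8000
t = np.arange(fs // 4) / fs
x = np.sin(2 * np.pi * 200 * t)
centers = np.geomspace(100, 3000, 16)  # assumed centre frequencies in Hz
rect = half_wave_rectify(bandpass_filterbank(x, fs, centers))
cg = correlogram_frame(rect, start=0, frame_len=512, max_lag=160)
```

Each row of `cg` is one channel and each column one lag. For a periodic input, the rows show a peak at the lag matching the period, which is the structure an inversion procedure would have to exploit when recovering the firing-rate function.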
