Auditory System Model and Its Applications
Abstract: Drawing on physiological and psychoacoustic data, this paper proposes an auditory system model made up of three parts: a bank of nonuniformly spaced bandpass filters, a bank of detectors, and a main-frequency selection mechanism. These stages represent the basilar membrane, the inner hair cells, and the nerve fibers, respectively. A speech analysis system is built on this model together with modified critical-bandwidth parameters; its input simulates the sound pressure wave at the eardrum, and its output simulates various features of neural firing patterns. The speech synthesis system reconstructs speech by simple summation. Computer simulations show that the reconstructed speech is highly intelligible and natural, demonstrating the validity of the proposed model and the value of the modified critical-bandwidth parameters.

Keywords: auditory system; critical bandwidth; speech analysis/synthesis
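The three-stage structure described in the abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration, not the authors' implementation: the band layout uses the standard Zwicker and Terhardt (1980) critical-bandwidth approximation rather than the paper's modified parameters (which the abstract does not give), each channel is a generic second-order bandpass filter, the detector is a half-wave rectifier followed by averaging, and the main frequency is estimated from zero crossings in the strongest channel. All function names are hypothetical.

```python
import math

def critical_bandwidth_hz(f_hz):
    # Zwicker & Terhardt (1980) approximation; NOT the paper's modified parameters.
    return 25.0 + 75.0 * (1.0 + 1.4 * (f_hz / 1000.0) ** 2) ** 0.69

def band_layout(f_low=100.0, f_high=4000.0):
    """Tile the range with edge-to-edge bands, each one critical bandwidth wide."""
    bands, lo = [], f_low
    while lo < f_high:
        fc = lo
        for _ in range(50):  # fixed-point solve of fc = lo + CB(fc) / 2
            fc = lo + critical_bandwidth_hz(fc) / 2.0
        bw = critical_bandwidth_hz(fc)
        bands.append((fc, bw))
        lo = fc + bw / 2.0  # next band starts at this band's upper edge
    return bands

def bandpass(x, fc, bw, fs):
    """Second-order bandpass with unity gain at fc (assumed filter shape)."""
    w0 = 2.0 * math.pi * fc / fs
    alpha = math.sin(w0) / (2.0 * fc / bw)  # Q = fc / bw
    b0, b2 = alpha, -alpha
    a0, a1, a2 = 1.0 + alpha, -2.0 * math.cos(w0), 1.0 - alpha
    y, x1, x2, y1, y2 = [], 0.0, 0.0, 0.0, 0.0
    for xn in x:
        yn = (b0 * xn + b2 * x2 - a1 * y1 - a2 * y2) / a0
        x2, x1, y2, y1 = x1, xn, y1, yn
        y.append(yn)
    return y

def detect(y):
    """Half-wave rectify and average: a crude stand-in for the detector stage."""
    return sum(max(v, 0.0) for v in y) / len(y)

def zero_crossing_freq(y, fs):
    """Estimate frequency from the number of upward zero crossings."""
    ups = sum(1 for a, b in zip(y, y[1:]) if a < 0.0 <= b)
    return ups * fs / len(y)

def synthesize(channels):
    """Reconstruct by summing the channel signals sample by sample."""
    return [sum(samples) for samples in zip(*channels)]

fs = 16000
tone = [math.sin(2.0 * math.pi * 1000.0 * n / fs) for n in range(1600)]
bands = band_layout()
channels = [bandpass(tone, fc, bw, fs) for fc, bw in bands]
levels = [detect(ch) for ch in channels]
best = max(range(len(bands)), key=lambda i: levels[i])
main_freq = zero_crossing_freq(channels[best][400:], fs)  # skip filter start-up
rebuilt = synthesize(channels)
print(len(bands), bands[best], main_freq)
```

For the 1 kHz test tone, the strongest channel is the one whose passband lies around 1 kHz, and the zero-crossing estimate in that channel lands near the tone frequency. Summing raw channel outputs is only one reading of "simple summation"; the paper may instead sum signals regenerated from the selected main frequencies.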