基于最大后驗(yàn)相位估計(jì)的多帶譜減語(yǔ)音增強(qiáng)算法

李真; 吳文錦; 張勤; 任慧

doi:10.11999/JEIT161381

基于最大后驗(yàn)相位估計(jì)的多帶譜減語(yǔ)音增強(qiáng)算法

doi: 10.11999/JEIT161381

基金項(xiàng)目:

十二五國(guó)家科技支撐計(jì)劃重大項(xiàng)目(2012BAH38F00)

計(jì)量
- 文章訪問(wèn)數(shù): 1317
- HTML全文瀏覽量: 202
- PDF下載量: 276
- 被引次數(shù): 0
出版歷程
- 收稿日期: 2016-12-21
- 修回日期: 2017-04-25
- 刊出日期: 2017-09-19

Multi-band Spectral Subtraction of Speech Enhancement Based on Maximum Posteriori Phase Estimation

Funds:

The National Science and Technology Planning?Project (2012BAH38F00)

摘要

摘要: 傳統(tǒng)語(yǔ)音增強(qiáng)算法中因?yàn)樽V減法算法簡(jiǎn)單易于實(shí)現(xiàn)而得到廣泛研究，譜減法的原理是將帶噪語(yǔ)音幅度與估計(jì)的噪聲幅度進(jìn)行相減，并疊加帶噪語(yǔ)音相位，進(jìn)而重構(gòu)增強(qiáng)語(yǔ)音譜。該方法在低信噪比下因?yàn)闆]有進(jìn)行相位估計(jì)，會(huì)存在較大的估計(jì)誤差，并且因?yàn)閷?duì)噪聲估計(jì)的不準(zhǔn)確，會(huì)產(chǎn)生音樂噪聲?；谧V減法的缺點(diǎn)該文提出一種基于最大后驗(yàn)相位估計(jì)的多帶譜減法，其中多帶譜減法可減少音樂噪聲的影響，最大后驗(yàn)方法估計(jì)純凈語(yǔ)音相位，可以減少在低信噪比時(shí)的估計(jì)誤差。實(shí)驗(yàn)結(jié)果表明該方法在低信噪比時(shí)取得了較好的增強(qiáng)效果。
- 語(yǔ)音增強(qiáng) /
- 最大后驗(yàn)相位估計(jì) /
- 多帶譜減 /
- 低信噪比
Abstract: The spectral subtraction speech enhancement is extensively used due to its simplicity and easy to implement. The principle of this method is to subtract the estimated magnitude of the noise from the magnitude of the noisy signal, but the phase of the noisy signal is unchanged. This conventional method produces the estimating error because it exploits the noisy phase, especially in low SNR, and it produces musical noise because of the inaccuracy of the noise estimation. This paper proposes a multi-band spectral subtraction algorithm based on maximum posteriori phase estimation. Experimental results show that the proposed method can get better performance than the conventional method especially in low SNR.
- Speech enhancement /
- Maximum posteriori phase estimation /
- Multi-band spectral subtraction /
- Low SNR

HTML全文

參考文獻(xiàn)(14)

WJCICKI K, MILACIC M, STARK A, et al. Exploiting conjugate symmetry of the short-time fourier spectrum for speech enhancement[J]. IEEE Signal Processing Letters, 2008, 15: 461-464. doi: 10.1109/LSP.2008.923579.

WANG Jiaching, LIN Changhong, WANG Shufan, et al. Compressive Sensing-based speech enhancement[J]. IEEE Transactions on Audio, Speech and Language Processing, 2016, 24(11): 2122-2131. doi: 10.1109/TASLP.2016.2598306.

MOWLAEE P and KULMER J. Harmonic phase estimation in single-channel speech enhancement using phase decomposition and SNR information[J]. IEEE Transactions on Audio, Speech and Language Processing, 2015, 23(9): 1521-1532. doi: 10.1109/TASLP.2015.2439038.

KULMER J and MOWLAEE P. Phase estimation in single channel speech enhancement using phase decomposition[J]. IEEE Signal Processing Letters, 2015, 22(5): 598-602. doi: 10.1109/LSP.2014.2365040.

BOLLS F. Suppression of acoustic noise in speech using spectral subtraction [J]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1979, 27(2): 113-120. doi: 10.1109/TASSP.1979.1163209.

WIENER N. The Extrapolation, Interpolation, and Smoothing of Stationary Time Series With Engineering Applications[M]. Cambridge: Massachusetts, MIT, 1949: 81-101.

EPHRAIM Y and MALAH D. Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator[J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1984, 32(6): 1109-1121. doi: 10.1109/ TASSP.1984.1164453.

KAMATH S and LOIZOU P C. A multi-band spectral subtraction method for enhancing speech corrupted by colored noise[C]. IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, FL, USA, 2002: IV-4164-IV-4164.

VARY P. Noise Suppression by spectral magnitude estimation: Mechanism and theoretical limits[J]. Signal Processing, 1985, 8(4): 387-400. doi: 10.1016/0165-1684(85) 90002-7.

SAMUI S. Improved single channel phase-aware speech enhancement technique for low signal-to-noise ration signal[J]. IET Signal Processing, 2016, 10(6): 641-650. doi: 10.1049/ iet-spr.2015.0182.

KULMER J and MOWLAEE P. Harmonic phase estimation in single-channel speech enhancement using Von Mises distribution and prior SNR[C]. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, 2015: 5063-5067.

KAY S M. Fundamentals of Statistical Signal Processing, Volume I: Estimation Theory[M]. New Jersey: Prentice Hall PTR, 1993: 164-172.

TAAL C H, HENDRIKS R C, HEUSDENS R, et al. An algorithm for intelligibility prediction of time-frequency weighted noisy speech[J]. IEEE Transactions on Audio, Speech and Languages, 2011, 19(7): 2125-2136. doi: 10.1109/ TASL.2011.2114881.

GAICH A and MOWLAEE P. On speech quality estimation of phase-aware single-channel speech enhancement[C]. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP), Brisbane, Australia, 2015: 216-220.

相關(guān)文章

施引文獻(xiàn)

資源附件(0)

訪問(wèn)統(tǒng)計(jì)