碼激勵線性預測語音編碼器中的非均勻和部分搜索域代數(shù)碼書
Non-uniform and Part-searching-area Algebraic Codebook for Code Excited Linear Prediction Speech Coder
-
摘要: 該文基于代數(shù)碼激勵線性預測(ACELP)語音編碼算法提出了非均勻和部分搜索域代數(shù)碼書。非均勻代數(shù)碼書由代數(shù)碼書的脈沖非均勻統(tǒng)計特性確定,部分搜索域代數(shù)碼書則由代數(shù)碼書矢量的周期性確定,該方法有效地彌補了低比特率情況下代數(shù)碼書中脈沖數(shù)不足的缺點。在使用上述兩項技術時,為保持基音的連續(xù)性,該編碼器對語音段和非語音段采用了不同的基音估計方法。主觀和客觀的聽力測試表明,當該技術應用于4kb/s 散布脈沖碼激勵線性預測(DP-CELP)語音編碼器時,重建語音的質量得到明顯改善,尤其是對女性講話者。Abstract: This paper presents a non-uniform and part-searching-area algebraic codebook based on Algebraic Code Excited Linear Preiction(ACELP) speech coding algorithm. The non-uniform algebraic codebook is determined by the non-uniform statistical properties of the algebraic codebook, and the part-searching-area is determined by the periodicity of the algebraic codebook excitation vector, which makes up the insufficient numbers of signed pulses in algebraic codebook at low bit rate. In order to preserve the continuity of pitch, different pitch detection methods are employed for speech/silence frame when these two techniques are used. Subjective and objective test results indicate that the reconstructed speech quality of 4kb/s DP-CELP speech coder is improved based on these techniques, especially for the female speakers.
-
ITU-T Recommendation G.729. Coding of speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear-prediction (CS-ACELP), 1996.ITU-T Recommendation G.723.1. Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s, 1996.[2]Yasunaga K, et al.. Dispersed-pulse codebook and its application to a 4kb/s speech coder. IEEE Proc, ICASSP, 2000, Istanbul, Turkey, III : 1503-1506.[3]Gao Y, et al.. eX-CELP: A speech coding paradigm. IEEE Proc, ICASSP, 2001, Salt Lake City, Utah, II : 689-692.[4]Rao A V, Ahmadi S, et al.. Pitch adaptive windows for improved excitation coding in low-rate CELP coders[J].IEEE Trans. on Speech Audio Processing.2003, 11(6):648-659[5]鮑長春. 高質量的4kb/s散布脈沖CELP語音編碼算法. 電子學報, 2003, 31(2): 309-313.[6]李悅,唐昆等. 高質量3.35kb/s MPD-USACELP語音編碼算法研究. 清華大學學報(自然科學版), 2004, 44(10): 1410-1413.[7]Chu W C. Speech coding algorithmsFoundation and evolution of standardized coders. New Jersey: Wiley-Interscience, 2003: 471-474.[8]Bao Changchun. Harmonic excited LPC (HE-LPC) speech coding at 2.3kb/s. IEEE Proc. ICASSP, 2003, Hongkong, I : 784-787.[9]ITU-T Recommendation P.862. Perceptual evaluation of speech quality (PESQ), 2001. -
計量
- 文章訪問數(shù): 2590
- HTML全文瀏覽量: 139
- PDF下載量: 838
- 被引次數(shù): 0