基于分類特征空間高斯混合模型和神經(jīng)網(wǎng)絡(luò)融合的說話人識別

黃偉; 戴蓓蒨; 李輝

一级黄色片免费播放|中国黄色视频播放片|日本三级a|可以直接考播黄片影视免费一级毛片

留言板

尊敬的讀者、作者、審稿人, 關(guān)于本刊的投稿、審稿、編輯和出版的任何問題, 您可以本頁添加留言。我們將盡快給您答復(fù)。謝謝您的支持!

姓名

郵箱

手機號碼

標(biāo)題

留言內(nèi)容

驗證碼

基于分類特征空間高斯混合模型和神經(jīng)網(wǎng)絡(luò)融合的說話人識別

黃偉, 戴蓓蒨, 李輝

文章導(dǎo)航 > 電子與信息學(xué)報 > 2004 > 26(10): 1607-1612

黃偉, 戴蓓蒨, 李輝. 基于分類特征空間高斯混合模型和神經(jīng)網(wǎng)絡(luò)融合的說話人識別[J]. 電子與信息學(xué)報, 2004, 26(10): 1607-1612.

引用本文:

黃偉, 戴蓓蒨, 李輝. 基于分類特征空間高斯混合模型和神經(jīng)網(wǎng)絡(luò)融合的說話人識別[J]. 電子與信息學(xué)報, 2004, 26(10): 1607-1612.

Huang Wei, Dai Bei-qian, Li Hui. Speaker Identification Based on Classify Feature Sub-space Gaussian Mixture Model and Neural Net Fusion[J]. Journal of Electronics & Information Technology, 2004, 26(10): 1607-1612.

Citation:

Huang Wei, Dai Bei-qian, Li Hui. Speaker Identification Based on Classify Feature Sub-space Gaussian Mixture Model and Neural Net Fusion[J]. Journal of Electronics & Information Technology, 2004, 26(10): 1607-1612.

黃偉, 戴蓓蒨, 李輝. 基于分類特征空間高斯混合模型和神經(jīng)網(wǎng)絡(luò)融合的說話人識別[J]. 電子與信息學(xué)報, 2004, 26(10): 1607-1612.

引用本文:

黃偉, 戴蓓蒨, 李輝. 基于分類特征空間高斯混合模型和神經(jīng)網(wǎng)絡(luò)融合的說話人識別[J]. 電子與信息學(xué)報, 2004, 26(10): 1607-1612.

Citation:

基于分類特征空間高斯混合模型和神經(jīng)網(wǎng)絡(luò)融合的說話人識別

計量
- 文章訪問數(shù): 2810
- HTML全文瀏覽量: 153
- PDF下載量: 1057
- 被引次數(shù): 0
出版歷程
- 收稿日期: 2003-05-16
- 修回日期: 2003-12-04
- 刊出日期: 2004-10-19

Speaker Identification Based on Classify Feature Sub-space Gaussian Mixture Model and Neural Net Fusion

摘要

摘要: 該文提出了一種基于分類高斯混合模型和神經(jīng)網(wǎng)絡(luò)融合(FS-GMM/NN)的說話人識別方法，通過對特征矢量進行聚類分析，將說話人的訓(xùn)練語音分成若干類。然后根據(jù)各個類中含特征矢量的多少采用不同的模型混合度，訓(xùn)練建立分類高斯混合模型。并采用神經(jīng)網(wǎng)絡(luò)實現(xiàn)各個分類高斯混合模型輸出的融合。在100個男性話者的與文本無關(guān)的說話人識別實驗中，基于分類高斯混合模型和神經(jīng)網(wǎng)絡(luò)融合的方法在識別性能及噪聲魯棒性上都優(yōu)于不分類的GMM識別系統(tǒng)，并具有較高的模型訓(xùn)練效率，且可以有效地降低話者模型的混合度和測試語音長度。
- 說話人識別; 分類特征空間; 高斯混合模型; 神經(jīng)網(wǎng)絡(luò)融合
Abstract: In this paper, a speaker identification system is proposed based on classify Fea-ture Sub-space Gaussian Mixture Model and Neural Net fusion (FS-GMM/NN) . With clus-tering analysis of the feature vectors, the speakers training feature vectors can be classified to some subsets and training classify Gaussian Mixture Models (GMM) with different mix-tures according to the subsets feature vectorss number. Finally, the outputs of every classify GMM will be fused by Neural Net (NN). In the experiment of text-independent speaker iden-tification of 100 speakers (male), the system based on FS-GMM/NN overmatch the Baseline Gaussian Mixture Model (B-GMM) in identification performance and noise robustness with fewer mixtures and shorter test speech. Moreover, the training of FS-GMM/NN is more effective.

HTML全文

參考文獻(1)

Reynolds D A, Rose R C. Robust text-independent speaker identification using Gaussian mixture speaker models[J].IEEE Trans. on Speech Audio Process.1995, 3(1):72-83[2]Reynolds D A. Speaker identification and verification using Gaussian mixture speaker models[J].Speech Communication.1995, 17(1-2):91-108[3]Reynolds D A. Speaker verification using adapted Gaussian mixture models[J].Digital Signal Processing.2000, 10(1-3):19-41[4]Deller J R, Proakisa J G, Hansenm J H L. Discrete-Time Processing of Speech Signals. New York: Macmillan Publishing Company, 1993.[5]Reynolds D A. Experimental evaluation of features for robust speaker identification[J].IEEE Trans.on Speech Audio Process.1994, 2(4):639-643[6]Chang E, Shi Y, Zhou J, Huang C. Speech lab in a box: A mandarin speech toolbox to jumpstart speech related research. in EUROSPEECH, Aalborg, Denmark, 2001: 192-199.

相關(guān)文章

施引文獻

資源附件(0)

訪問統(tǒng)計