Title: An Improved Speaker Verification System Using Orthogonal GMM (正交式高斯混合模型之語者驗證系統)
Author: Hsieh, Chung-Ying (謝忠穎)
Keywords: text-independent speaker verification;Gaussian mixture model (GMM);vector quantization;MFCC (Mel-frequency cepstral coefficients);LPCC (linear prediction cepstral coefficients)
Speaker verification is a widely used technique in security and crime monitoring. This thesis proposes three methods to improve the performance of conventional text-independent and text-semidependent speaker verification systems. First, a combined MFCC-LPCC feature set replaces the conventional MFCC features in order to better capture speaker characteristics. Second, the VQ-based LBG algorithm replaces the K-means algorithm in order to shorten the time needed for feature clustering and model training without degrading verification performance. Third, an orthogonal GMM is used to better approximate the distribution of the features in each cluster. Experimental results show that the proposed methods improve performance on both text-independent and text-semidependent speaker verification systems while somewhat reducing the computation required.
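
As an illustration of the second method, the following is a minimal sketch of LBG codebook training by repeated splitting, of the kind that could supply initial cluster centers for GMM training. The abstract does not give the thesis's actual settings, so the function name `lbg`, the additive perturbation `eps`, the relative stopping threshold `tol`, and the restriction to power-of-two codebook sizes are all assumptions made for this example.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def centroid(points):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]


def lbg(vectors, size, eps=0.01, tol=1e-6, max_iter=100):
    """Grow a codebook by splitting (assumes size is a power of two)."""
    # Start from a single codeword: the centroid of all training vectors.
    codebook = [centroid(vectors)]
    while len(codebook) < size:
        # Split each codeword into two slightly perturbed copies (assumed eps).
        codebook = [[c + s * eps for c in cw]
                    for cw in codebook for s in (1, -1)]
        prev = float("inf")
        for _ in range(max_iter):
            # Assign every vector to its nearest codeword.
            cells = [[] for _ in codebook]
            total = 0.0
            for v in vectors:
                dists = [sq_dist(v, cw) for cw in codebook]
                k = dists.index(min(dists))
                cells[k].append(v)
                total += dists[k]
            # Move each codeword to the centroid of its cell;
            # an empty cell keeps its previous codeword.
            codebook = [centroid(cell) if cell else cw
                        for cell, cw in zip(cells, codebook)]
            distortion = total / len(vectors)
            # Stop refining once the average distortion no longer improves.
            if prev - distortion <= tol * distortion:
                break
            prev = distortion
    return codebook


# Toy usage: two well-separated groups of 2-D "feature vectors".
data = [(0, 0), (0, 1), (1, 0), (1, 1),
        (10, 10), (10, 11), (11, 10), (11, 11)]
codebook = lbg(data, 2)
```

In this sketch the codebook doubles at each stage, so every refinement pass starts from codewords already placed near the data, in contrast to K-means started from arbitrary initial centers.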
