Please use this identifier to cite or link to this item: http://hdl.handle.net/11455/37821
標題: Disambiguating the senses of non-text symbols for Mandarin TTS systems with a three-layer classifier
作者: Yu, M.S.
余明興 
Huang, F.L.
關鍵字: sense disambiguation;non-text symbol;three-layer classifier;Bayesian;theory;voting scheme;word
Project: Speech Communication
期刊/報告no:: Speech Communication, Volume 39, Issue 3-4, Page(s) 191-229.
摘要: 
Various kinds of non-text symbols appear in texts. The oral expressions. of these symbols may vary with their senses. This paper proposes a three-layer classifier (TLC) which can disambiguate the senses of these symbols effectively. The layers within TLC are employed in sequence. The 1st layer is composed of two components: pattern table and decision tree. if this layer can disambiguate the sense of the target symbol, the disambiguation task stops. Otherwise the next two layers will be triggered. In such a situation, the procedure will go through the TLC. Based on the Bayesian theory, the 2nd layer adopts the voting scheme to compute the disambiguation score. Several features of token, which may affect the effectiveness of our voting scheme, are analyzed and compared With each other to achieve better accuracy. According to the algorithm confidence of sense disambiguation, the 3rd layer may exploit an alter. native. model to enhance the performance. Experiments show that our approaches can learn well. even with only a small amount of data. The overall accuracies of training and testing sets are 99.8% and 97.5%, respectively. (C) 2002 Elsevier Science B.V. All rights reserved.
URI: http://hdl.handle.net/11455/37821
ISSN: 0167-6393
DOI: 10.1016/s0167-6393(02)00015-8
Appears in Collections:資訊科學與工程學系所

Show full item record
 

Google ScholarTM

Check

Altmetric

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.