DocumentCode :
2232729
Title :
Two-Level phoneme recognition based on successive use of monophone and diphone models
Author :
Somervuo, Panu
Author_Institution :
Neural Networks Res. Centre, Helsinki Univ. of Technol., Helsinki, Finland
fYear :
2002
fDate :
3-6 Sept. 2002
Firstpage :
1
Lastpage :
4
Abstract :
Two-level phoneme recognition method is proposed based on successive use of monophone and diphone models. In the first level of the recognition, computationally lighter (in terms of the number of the models) monophone models are used for selecting a subset of diphone models. For each input utterance, those diphone models are set active whose left or right contexts are present in the recognized monophone sequence. The chosen diphone models are then evaluated in the next level of the recognition. This substantially decreases the computational load compared to the case where all diphone models must be examined for each input utterance. In the Finnish speaker-independent phoneme recognition task on average half of the diphone models could be eliminated in the second level of the recognition per word utterance while still achieving the same recognition accuracy as when using all the models. Clustered monophone and diphone models were also experimented as the models in the first-level recognizer. This did not, however, bring any further improvement to the results obtained by using unclustered monophone and diphone models.
Keywords :
speech recognition; Finnish speaker independent phoneme recognition; clustered diphone model; clustered monophone model; phone sequence recognition; two-level phoneme recognition method; unclustered diphone model; unclustered monophone model; Abstracts; Hidden Markov models; Training; Tutorials; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2002 11th European
Conference_Location :
Toulouse
ISSN :
2219-5491
Type :
conf
Filename :
7071955
Link To Document :
بازگشت