Title :
A high performance Mandarin digit recognizer
Author :
Zang Bo ; Juan, Lei ; Wang, William S.-Y.
Author_Institution :
Dept. of Electron. Eng., City Univ. of Hong Kong, Kowloon
Abstract :
Digit recognition is important in applications such as automated banking systems and database information retrieval systems. To design a high-performance Mandarin digit recognizer, a Mandarin phonetic question set was first carefully designed and then used to cluster 846 gender-dependent cross-word triphones. To model the fine differences in the high-frequency region of Mandarin initials, inverse mel-frequency warping was used to calculate the IMFCC feature. The IMFCC feature was shown to be quite effective in recovering the substitution errors caused by the similarity of the Mandarin initials. Combined with triphone duration modeling, the recognizer achieved a word accuracy rate of 98.81% and a sentence correct rate of 95.20%.
Keywords :
decision trees; pattern clustering; speech recognition; IMFCC feature; Mandarin phonetic question set; automated banking systems; database information retrieving systems; gender dependent cross word triphones; high performance Mandarin digit recognizer; inverse mel-frequency warping; substitution errors; Australia; Banking; Data engineering; Databases; Decision trees; Frequency; Hidden Markov models; Information retrieval; Signal processing; Speech recognition;
Conference_Title :
Signal Processing and Its Applications, 1999. ISSPA '99. Proceedings of the Fifth International Symposium on
Conference_Location :
Brisbane, Qld.
Print_ISBN :
1-86435-451-8
DOI :
10.1109/ISSPA.1999.815751