DocumentCode :
3230796
Title :
Pitch dependent phone modelling for HMM based speech recognition
Author :
Singer, Harald ; Sagayama, Shigeki
Author_Institution :
ATR Interpreting Telephony Res. Lab., Kyoto, Japan
Volume :
1
fYear :
1992
fDate :
23-26 Mar 1992
Firstpage :
273
Abstract :
The authors propose a novel method of incorporating pitch information into a hidden Markov model (HMM) phoneme recognizer by exploiting the correlation between pitch and spectral parameters, e.g. cepstrum. Pitch patterns are not used explicitly; instead, spectral parameters are normalized framewise according to the pitch value. Evidence is given to show that the use of pitch information consistently improves the recognition performance. Experiments with 24 phoneme labels showed that the phoneme error rate for fast continuous speech could be improved by about 10%
Keywords :
hidden Markov models; spectral analysis; speech recognition; HMM based speech recognition; phoneme error rate; pitch-dependent phone modelling; Cepstrum; Data mining; Databases; Frequency; Hidden Markov models; Laboratories; Speech analysis; Speech recognition; Telephony; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
ISSN :
1520-6149
Print_ISBN :
0-7803-0532-9
Type :
conf
DOI :
10.1109/ICASSP.1992.225918
Filename :
225918
Link To Document :
بازگشت