DocumentCode :
2426332
Title :
The prompt of lip shape modification of cacology based on the speech evaluation techniques — a case of basic Chinese learning
Author :
Hsieh, Chi-Wen ; Lin, Chih-Huang ; Jong, Tai-Lang ; Hsieh, Chi-Yi
Author_Institution :
Dept. of Electrophys., Nat. Chiao-Tung Univ., Hsinchu
fYear :
2008
fDate :
7-9 July 2008
Firstpage :
708
Lastpage :
712
Abstract :
In this study, a Chinese learning assistance system based on speech recognition and lip shape image processing is proposed. The mel-frequency cepstral coefficients (MFCC), the pitch contour, and the energy curve were adopted as parameters representing the voiceprint, the speech tone, and the magnitude of the speech signal, respectively. In addition, the height and width of the lip shape were fed into the lip shape analysis. In the scoring stage for speech utterances, the dynamic time warping (DTW) algorithm and a probabilistic neural network (PNN) were applied to determine whether the test speech was qualified during the Chinese learning process. The simulation results indicated that combining the MFCC, pitch contour, and energy curve parameters of the speech signal slightly improved classification accuracy, which reached up to 90%. Finally, the receiver operating characteristic (ROC) curve was used to quantitatively evaluate the sensitivity and specificity of the proposed algorithm.
Keywords :
neural nets; probability; speech recognition; Chinese learning; energy curve parameters; mel-frequency cepstral coefficient; pitch contour algorithm; probabilistic neural network; receiver operating characteristic curve; speech evaluation techniques; Cepstral analysis; Image processing; Mel frequency cepstral coefficient; Neural networks; Shape; Spectrogram; Speech analysis; Speech processing; Speech recognition; Testing
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-1723-0
Electronic_ISBN :
978-1-4244-1724-7
Type :
conf
DOI :
10.1109/ICALIP.2008.4590196
Filename :
4590196