DocumentCode :
1613536
Title :
Pre-processing of input features using LPC and warping process
Author :
Sudirman, Rubita ; Salleh, Sh-Hussain ; Ming, Ting Chee
Author_Institution :
Biomedical Engineering Research Group, Faculty of Electrical Engineering, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia
fYear :
2005
Firstpage :
300
Lastpage :
303
Abstract :
This paper presents pre-processing of input features to artificial neural network (NN). This is for preparation of reliable reference templates for the set of words to be recognized. The first task is to extract pitch features using Pitch Scale Harmonic Filter (PSHF) algorithm. Another task is to align the input frames (test set) to the reference template (training set) using a modified DTW algorithm called DTW fixing frame (DTW-FF) algorithm. This proper time normalization is needed since NN is designed to compare data of the same length; same speech can varies in their duration. By performing frame fixing or time normalization, the test set and the training set is adjusted to a fix number of frames throughout the sets utilizing the local distance score of the matched features. Then those features can be adapted to NN for further recognition tuning.
Keywords :
Artificial neural networks; Biomedical engineering; Data preprocessing; Feature extraction; Hidden Markov models; Linear predictive coding; Neural networks; Speech processing; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computers, Communications, & Signal Processing with Special Track on Biomedical Engineering, 2005. CCSP 2005. 1st International Conference on
Conference_Location :
Kuala Lumpur, Malaysia
Print_ISBN :
978-1-4244-0011-9
Electronic_ISBN :
978-1-4244-0012-6
Type :
conf
DOI :
10.1109/CCSP.2005.4977211
Filename :
4977211
Link To Document :
بازگشت