Title :
Keyword word recognition using a fusion of spectral, cepstral and modulation features
Author :
Gopalan, Kaliappan ; Chu, Tao
Author_Institution :
Purdue Univ. Calumet, Hammond, IN, USA
Abstract :
We present the results of applying a combination of features for recognizing word utterances extracted from a continuous stream of speech. Three sets of features, namely, spectral energy in Bark bands, mel frequency cepstral coefficients, and parameters from an AM-FM model, were employed for training and testing a set of keywords in the CallHome telephone speech database. A pair-wise comparison between the feature set of an unknown word utterance and that of each of the reference utterances in a dynamic time warping process showed a false negative score of 4 out of 12, and a false positive score of 5 out of 132 for a subset of speech from the database. Long, multisyllabic words were spotted correctly while two short words in the word list contributed to errors.
Keywords :
audio databases; cepstral analysis; feature extraction; speech recognition; AM-FM model; Bark bands; CallHome telephone speech database; Mel frequency cepstral coefficients; cepstral features; dynamic time warping process; keyword word recognition; modulation features; multisyllabic words; reference utterances; spectral energy; spectral features; speech continuous stream; word utterance recognition; Feature extraction; Frequency modulation; Mel frequency cepstral coefficient; Speech; Speech recognition;
Conference_Titel :
Electrical Communications and Computers (CONIELECOMP), 2012 22nd International Conference on
Conference_Location :
Cholula, Puebla
Print_ISBN :
978-1-4577-1326-2
DOI :
10.1109/CONIELECOMP.2012.6189915