DocumentCode
1925274
Title
Keyword word recognition using a fusion of spectral, cepstral and modulation features
Author
Gopalan, Kaliappan ; Chu, Tao
Author_Institution
Purdue Univ. Calumet, Hammond, IN, USA
fYear
2012
fDate
27-29 Feb. 2012
Firstpage
234
Lastpage
238
Abstract
We present the results of applying a combination of features for recognizing word utterances extracted from a continuous stream of speech. Three sets of features, namely, spectral energy in Bark bands, mel frequency cepstral coefficients, and parameters from an AM-FM model, were employed for training and testing a set of keywords in the CallHome telephone speech database. A pair-wise comparison between the feature set of an unknown word utterance and that of each of the reference utterances in a dynamic time warping process showed a false negative score of 4 out of 12, and a false positive score of 5 out of 132 for a subset of speech from the database. Long, multisyllabic words were spotted correctly while two short words in the word list contributed to errors.
Keywords
audio databases; cepstral analysis; feature extraction; speech recognition; AM-FM model; Bark bands; CallHome telephone speech database; Mel frequency cepstral coefficients; cepstral features; dynamic time warping process; keyword word recognition; modulation features; multisyllabic words; reference utterances; spectral energy; spectral features; speech continuous stream; word utterance recognition; Feature extraction; Frequency modulation; Mel frequency cepstral coefficient; Speech; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical Communications and Computers (CONIELECOMP), 2012 22nd International Conference on
Conference_Location
Cholula, Puebla
Print_ISBN
978-1-4577-1326-2
Type
conf
DOI
10.1109/CONIELECOMP.2012.6189915
Filename
6189915
Link To Document