• DocumentCode
    1925274
  • Title

    Keyword word recognition using a fusion of spectral, cepstral and modulation features

  • Author

    Gopalan, Kaliappan ; Chu, Tao

  • Author_Institution
    Purdue Univ. Calumet, Hammond, IN, USA
  • fYear
    2012
  • fDate
    27-29 Feb. 2012
  • Firstpage
    234
  • Lastpage
    238
  • Abstract
    We present the results of applying a combination of features for recognizing word utterances extracted from a continuous stream of speech. Three sets of features, namely, spectral energy in Bark bands, mel frequency cepstral coefficients, and parameters from an AM-FM model, were employed for training and testing a set of keywords in the CallHome telephone speech database. A pair-wise comparison between the feature set of an unknown word utterance and that of each of the reference utterances in a dynamic time warping process showed a false negative score of 4 out of 12, and a false positive score of 5 out of 132 for a subset of speech from the database. Long, multisyllabic words were spotted correctly while two short words in the word list contributed to errors.
  • Keywords
    audio databases; cepstral analysis; feature extraction; speech recognition; AM-FM model; Bark bands; CallHome telephone speech database; Mel frequency cepstral coefficients; cepstral features; dynamic time warping process; keyword word recognition; modulation features; multisyllabic words; reference utterances; spectral energy; spectral features; speech continuous stream; word utterance recognition; Feature extraction; Frequency modulation; Mel frequency cepstral coefficient; Speech; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical Communications and Computers (CONIELECOMP), 2012 22nd International Conference on
  • Conference_Location
    Cholula, Puebla
  • Print_ISBN
    978-1-4577-1326-2
  • Type

    conf

  • DOI
    10.1109/CONIELECOMP.2012.6189915
  • Filename
    6189915