• DocumentCode
    417278
  • Title

    Integrating thumbnail features for speech recognition using conditional exponential models

  • Author

    Yu, Hua ; Waibel, Alex

  • Author_Institution
    Interactive Syst. Labs, Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    We describe a novel approach for modeling segmental information in speech recognition, through the use of thumbnail features. By taking into account dependencies at the segmental level, thumbnail features are more resistant to changes in speaking rates and other factors. While the traditional acoustic features are fixed for every utterance, one set of thumbnail features is computed for each hypothesis, which may violate the traditional scoring paradigm. To this end, we introduce a conditional exponential modeling framework. It allows better integration of various knowledge sources in a discriminative fashion. We present preliminary experiments on the Switchboard task.
  • Keywords
    feature extraction; hidden Markov models; learning (artificial intelligence); speech recognition; HMM; Switchboard task; conditional exponential models; segmental information; speech recognition; thumbnail features; Handwriting recognition; Hidden Markov models; Image segmentation; Interactive systems; Machine learning; Pattern matching; Proposals; Robustness; Speech analysis; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326130
  • Filename
    1326130