• DocumentCode
    542260
  • Title

    Acoustic-phonetic speech parameters for speaker-independent speech recognition

  • Author

    Deshmukh, Om ; Espy-Wilson, Carol Y. ; Juneja, Amit

  • Author_Institution
    University of Maryland, Department of Electrical & Computer Engineering, A. V. Williams Bldg., College Park, 20752, USA
  • Volume
    1
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    Coping with inter-speaker variability (i.e., differences in the vocal tract characteristics of speakers) is still a major challenge for Automatic Speech Recognizers. In this paper, we discuss a method that compensates for differences in speaker characteristics. In particular, we demonstrate that when continuous density hidden Markov model based system is used as the back-end, a Knowledge-Based Front End (KBFE) can outperform the traditional Mel-Frequency Cepstral Coefficients (MFCCs), particularly when there is a mismatch in the gender and ages of the subjects used to train and test the recognizer. This work was supported by NSF grant # SBR-9729688 and NIH grant # IK02DCOOI49.
  • Keywords
    Data models; Dentistry; Hidden Markov models; Speech; Speech recognition; Testing; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743787
  • Filename
    5743787