• DocumentCode
    2979280
  • Title

    Phone-context specific gender-dependent acoustic-models for continuous speech recognition

  • Author

    Neti, C. ; Roukos, S.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    1997
  • fDate
    14-17 Dec 1997
  • Firstpage
    192
  • Lastpage
    198
  • Abstract
    Gender dependent systems are usually created by splitting the training data into each gender and building two separate acoustic models for each gender. This method assumes that every state of a subphonetic model is uniformly dependent on the gender. We use the premise that the acoustic realizations of various sub phonetic units are dependent on gender in varying degrees across phones and more particularly context dependent. We show that this is indeed the case by using gender as a question in addition to phone context questions in the context decision trees. Using these trees we build phone specific gender dependent acoustic models and demonstrate a novel method to pick between genders during decoding based on a measure of confidence of the decoded hypothesis. An improvement of 6.3% in word error is achieved relative to a gender independent system
  • Keywords
    acoustic signal processing; decision theory; speech coding; speech recognition; trees (mathematics); acoustic models; acoustic realizations; context decision trees; context dependent; continuous speech recognition; decoded hypothesis; decoding; gender dependent systems; gender independent system; phone context questions; phone context specific gender dependent acoustic models; sub phonetic units; subphonetic model; training data; word error; Acoustic measurements; Context modeling; Decision trees; Decoding; Hidden Markov models; Speech analysis; Speech recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
  • Conference_Location
    Santa Barbara, CA
  • Print_ISBN
    0-7803-3698-4
  • Type

    conf

  • DOI
    10.1109/ASRU.1997.659005
  • Filename
    659005