• DocumentCode
    353735
  • Title

    Use of word level side information to improve speech recognition

  • Author

    Vergyri, Dimitra

  • Author_Institution
    Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA
  • Volume
    3
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1823
  • Abstract
    Word level information obtained from the output of a speech recognizer has been used in the past to extract confidence features for the hypothesized words. This work describes a post-recognition process which treats these word-level features as independent knowledge sources and combines them in one log linear model for the posterior probability of a word sequence. This model is used for rescoring the hypotheses. The parameters of the model are optimized using a discriminative model combination approach, where a simplex optimization method, known as amoeba search, is used to minimize the non-smooth function of empirical error rate on training data. The method is evaluated on the SWITCHBOARD database. After training 20 new parameters, we obtain a significant word error rate reduction over the baseline system. A correlation measure between features and word accuracy is defined to help analyze and explain the results
  • Keywords
    feature extraction; optimisation; search problems; speech recognition; SWITCHBOARD database; amoeba search; confidence features extraction; correlation measure; discriminative model combination approach; empirical error rate; hypotheses rescoring; independent knowledge sources; log linear model; model parameters; non-smooth function minimization; post-recognition process; posterior probability; simplex optimization method; speech recognition; training data; word accuracy; word error rate reduction; word level side information; word sequence; word-level features; Adaptation model; Data mining; Error analysis; Feature extraction; Natural languages; Optimization methods; Spatial databases; Speech processing; Speech recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.862109
  • Filename
    862109