• DocumentCode
    417256
  • Title

    Model complexity control and compression using discriminative growth functions

  • Author

    Liu, X. ; Gales, M.J.F.

  • Author_Institution
    Dept. of Eng., Cambridge Univ., UK
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    State-of-the-art large vocabulary speech recognition systems are highly complex. Many techniques affect both system complexity and recognition performance. The need to determine the appropriate complexity without having to build each possible system has led to the development of automatic complexity control criteria. In this paper further experiments are carried out using a recently proposed criterion based on marginalizing a maximum mutual information (MMI) growth function. The use of this criterion is much detailed for determining the appropriate dimensionality in a multiple HLDA system and the number of components per state. A scheme for also using this criterion for model compression is described. Experimental results on a spontaneous telephone speech recognition task are described. Initial system compression experiments are inconclusive. However, comparing a standard state-of-the-art system with one generated using complexity control shows a reduction in word error rate.
  • Keywords
    computational complexity; error statistics; speech recognition; vocabulary; MMI growth function; discriminative growth functions; large vocabulary speech recognition; maximum mutual information; model complexity control; model compression; multiple HLDA system; recognition performance; spontaneous telephone speech recognition; system complexity; word error rate reduction; Automatic control; Automatic generation control; Bayesian methods; Control systems; Error analysis; Maximum likelihood estimation; Speech recognition; Telephony; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326106
  • Filename
    1326106