• DocumentCode
    417277
  • Title

    Extended cluster information vector quantization (ECI-VQ) for robust classification

  • Author

    Arrowood, Jon A. ; Clements, Mark A.

  • Author_Institution
    Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    This paper presents a novel extension to vector quantization referred to as extended cluster information (ECI). In this method the decoder retains more general statistics about the VQ clusters found during codebook training than the single prototypical point of conventional VQ systems. Typically this information is unnecessary, however if the items being compressed are feature space vectors used as input to a statistical pattern classification system, the extra probabilistic information can be used during the classification as in Bayes predictive classification (BPC) to improve recognition results. To demonstrate ECI-VQ, a simple experiment is described where the Aurora2 distributed speech recognition front end is altered to provide more aggressive mel frequency cepstral coefficient (MFCC) compression. As the bit-rate drops, the corresponding recognition performance suffers. It is then shown that using ECI-VQ as the input to an uncertain observation (UO) speech recognizer, a number of errors due to compression can be corrected with no extra cost in bit-rate.
  • Keywords
    cepstral analysis; error correction; feature extraction; pattern classification; pattern clustering; speech recognition; statistical analysis; table lookup; vector quantisation; Aurora2 distributed speech recognition front end; ECI-VQ; aggressive MFCC compression; codebook training; error correction; extended cluster information; feature space vectors; mel frequency cepstral coefficient; recognition performance; robust classification; statistical pattern classification system; uncertain observation speech recognizer; vector quantization; Decoding; Error correction; Mel frequency cepstral coefficient; Pattern classification; Pattern recognition; Prototypes; Robustness; Speech recognition; Statistics; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326129
  • Filename
    1326129