• DocumentCode
    3428668
  • Title

    Development of a VQ-HMM continuous speech speaker-independent recognition system for small vocabularies

  • Author

    Velez, Edgar ; Cossette, Louis ; Cuperman, Vladimir

  • Author_Institution
    Sch. of Eng. Sci., Simon Fraser Univ., Burnaby, BC, Canada
  • fYear
    1991
  • fDate
    9-10 May 1991
  • Firstpage
    469
  • Abstract
    The authors describe the first phase in the development of a speech recognition system for small vocabularies. The system is designed to handle speaker-independent continuous speech and can be easily modified for different vocabularies. The system consists of a spectral analysis stage followed by vector quantization (VQ) and hidden Markov modeling (HMM). VQ is performed by multiple acoustic parameter codebooks, which are independent or dependent on speech units. The approach seeks to incorporate more knowledge about phoneme allophonic, linguistic, and speaker-dependent variations. The increase in number of codebooks is compensated by their decrease in size, minimizing effects on storage requirements. Phoneme HMM models permit an easy adaptation to new vocabulary requirements. A loop HMM structure with optional silences between words allows continuous speech recognition using a Viterbi search. Preliminary experiments on continuous phoneme and digit recognition were performed on an unrestricted-speaker telephone database
  • Keywords
    Markov processes; data compression; encoding; spectral analysis; speech recognition; telephony; HMM; VQ; Viterbi search; allophones; continuous digit recognition; continuous phoneme recognition; continuous speech recognition; hidden Markov modeling; linguistics; multiple acoustic parameter codebooks; small vocabularies; speaker-dependent variations; speaker-independent recognition system; spectral analysis; speech units; telephone database; vector quantization; Character recognition; Databases; Hidden Markov models; Loudspeakers; Spectral analysis; Speech recognition; Telephony; Vector quantization; Viterbi algorithm; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications, Computers and Signal Processing, 1991., IEEE Pacific Rim Conference on
  • Conference_Location
    Victoria, BC
  • Print_ISBN
    0-87942-638-1
  • Type

    conf

  • DOI
    10.1109/PACRIM.1991.160778
  • Filename
    160778