• DocumentCode
    312005
  • Title

    Inclusion of temporal information into features for speech recognition

  • Author

    Milner, Ben

  • Author_Institution
    Speech Technol. Unit, British Telecom Res. Labs., Ipswich, UK
  • Volume
    1
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    256
  • Abstract
    Conventional methods for incorporating temporal information into speech features apply regression to a series of successive cepstral vectors to generate differential cepstra, or apply a cosine transform to generate cepstral-time matrices. This paper aims to generalise these techniques such that a series of stacked cepstral vectors is multiplied by a temporal transform matrix to produce the final speech feature. This can made to incorporate both static and dynamic speech information. Using this method, the coding of temporal information is not restricted to regression or cosine coefficients-any suitable transform may be used. Results are presented for a variety of transforms, such as Legendre, Karhunen-Loeve, Cosine, Rectangle, where it is shown that the transform based techniques offer higher performance than conventional differential cepstrum
  • Keywords
    cepstral analysis; feature extraction; matrix algebra; speech recognition; statistical analysis; transforms; vectors; Cosine transform; Karhunen-Loeve transform; Legendre transform; Rectangle transform; cepstral-time matrices; cosine coefficients; cosine transform; differential cepstra; dynamic speech information; regression; speech features; speech recognition; stacked cepstral vectors; static speech information; successive cepstral vectors; temporal information; temporal transform matrix; Cepstral analysis; Cepstrum; Equations; Hidden Markov models; Laboratories; Probability density function; Speech processing; Speech recognition; State-space methods; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607093
  • Filename
    607093