• DocumentCode
    1096892
  • Title

    A parametric representation and a clustering method for phoneme recognition--Application to stops in a CV environment

  • Author

    Tanaka, Kazuyo

  • Author_Institution
    Ministry of International Trade and Industry, Ibaraki, Japan.
  • Volume
    29
  • Issue
    6
  • fYear
    1981
  • fDate
    12/1/1981 12:00:00 AM
  • Firstpage
    1117
  • Lastpage
    1127
  • Abstract
    A new method of representing phonemic categories and determining their standard values from a training sample distribution is presented. It is an essential part of a phoneme recognition system aiming at speaker-independent speech recognition. The phonemic value of a short-duration speech signal of up to 50 ms is represented by a matrix composed of acoustic parameters. Standard phonemic categories (SPC´s) are defined by a combination of several simple potential functions in this matrix space. The potential function set, as well as its number, is determined automatically by the proposed method. Processing is primarily by algebraic operation and is formulated according to an analogy to particle dynamics. The method is applied to voiceless and voiced stop consonant sets spoken by twelve speakers. The relationship between the classification rate and the number of SPC´s is investigated under several initial conditions. Stop consonant recognition tests in CV-syllables are made using derived SPC sets irrespective of following vowels. Recognition rates for the utterances of four speakers not included among the twelve speakers used for training were 84 percent for voiceless and 81 percent for voiced stops.
  • Keywords
    Clustering methods; Feature extraction; Helium; Instruments; Isolation technology; Loudspeakers; Speech recognition; Standards development; Testing; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Acoustics, Speech and Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0096-3518
  • Type

    jour

  • DOI
    10.1109/TASSP.1981.1163693
  • Filename
    1163693