DocumentCode
1096892
Title
A parametric representation and a clustering method for phoneme recognition--Application to stops in a CV environment
Author
Tanaka, Kazuyo
Author_Institution
Ministry of International Trade and Industry, Ibaraki, Japan.
Volume
29
Issue
6
fYear
1981
fDate
12/1/1981 12:00:00 AM
Firstpage
1117
Lastpage
1127
Abstract
A new method of representing phonemic categories and determining their standard values from a training sample distribution is presented. It is an essential part of a phoneme recognition system aiming at speaker-independent speech recognition. The phonemic value of a short-duration speech signal of up to 50 ms is represented by a matrix composed of acoustic parameters. Standard phonemic categories (SPC's) are defined by a combination of several simple potential functions in this matrix space. The potential function set, as well as its number, is determined automatically by the proposed method. Processing is primarily by algebraic operation and is formulated according to an analogy to particle dynamics. The method is applied to voiceless and voiced stop consonant sets spoken by twelve speakers. The relationship between the classification rate and the number of SPC's is investigated under several initial conditions. Stop consonant recognition tests in CV-syllables are made using derived SPC sets irrespective of following vowels. Recognition rates for the utterances of four speakers not included among the twelve speakers used for training were 84 percent for voiceless and 81 percent for voiced stops.
Keywords
Clustering methods; Feature extraction; Helium; Instruments; Isolation technology; Loudspeakers; Speech recognition; Standards development; Testing; Vocabulary
fLanguage
English
Journal_Title
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher
ieee
ISSN
0096-3518
Type
jour
DOI
10.1109/TASSP.1981.1163693
Filename
1163693