DocumentCode :
3522347
Title :
Phoneme environment clustering for speech recognition
Author :
Sagayama, Shigeki
Author_Institution :
Human Interface Lab., NTT, Tokyo, Japan
fYear :
1989
fDate :
23-26 May 1989
Firstpage :
397
Abstract :
A general principle is proposed to solve problems in context-dependent phoneme segment (or subword unit) based speech recognition, namely, how to choose the set of units and how to estimate the context effect missing in the training data. A phoneme environment clustering algorithm, which automatically selects an optimal set of allophones and estimates the missing context, is presented. This algorithm additionally gives the means to analyze coarticulation effects automatically and quantitatively. The problem is formulated as a clustering technique in phoneme environment space to approximate the mapping function from phoneme environment space to phoneme pattern space by a limited number of centroid patterns based on a distortion measure defined on the phoneme pattern space. The algorithm is tested for phoneme recognition and word recognition, and results are discussed
Keywords :
speech recognition; allophones; centroid patterns; coarticulation effects; context-dependent phoneme segment; distortion measure; mapping function; phoneme environment clustering algorithm; phoneme environment space; phoneme pattern space; phoneme recognition; speech recognition; subword unit; word recognition; Algorithm design and analysis; Clustering algorithms; Distortion measurement; Environmental factors; Extraterrestrial measurements; Humans; Laboratories; Speech analysis; Speech recognition; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location :
Glasgow
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.1989.266449
Filename :
266449
Link To Document :
بازگشت