DocumentCode
2018508
Title
Supervised and unsupervised clustering of the speaker space for connectionist speech recognition
Author
Konig, Yochai ; Morgan, Nelson
Author_Institution
Int. Comput. Sci. Inst., Berkeley, CA, USA
Volume
1
fYear
1993
fDate
27-30 April 1993
Firstpage
545
Abstract
One of the challenging problems of a speaker-independent continuous speech recognition system is how to achieve good performance with a new speaker, when the only available source of information about the new speaker is the utterance to be recognized. The authors propose a first step toward a solution, based on clustering of the speaker space. The study had two steps. The first was searching for a set of features to cluster speakers. Second, using the chosen features, two kinds of clustering were investigated: supervised-using two clusters, males and females-and unsupervised-using two, three, and five clusters. The cluster information was integrated into the connectionist speech recognition system by using the speaker cluster neural network (SCNN). The SCNN attempts to share the speaker-independent parameters and to model the cluster-dependent parameters. The results show that the best performance is achieved with the supervised clusters, resulting in an overall improvement in recognition performance.<>
Keywords
neural nets; search problems; speech recognition; connectionist speech recognition; performance; speaker cluster neural network; supervised clustering; unsupervised clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location
Minneapolis, MN, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.1993.319176
Filename
319176
Link To Document