DocumentCode
1938933
Title
A neural speaker model for speaker clustering
Author
Nakamura, Satoshi ; Akabane, Toshio
Author_Institution
Sharp Corp., Nara, Japan
fYear
1991
fDate
14-17 Apr 1991
Firstpage
853
Abstract
A speaker model using a neural network is proposed for reference speaker clustering in speaker-independent speech recognition. Speaker individuality is embedded not only in static features, namely the short-time spectrum and pitch frequency, but also in dynamic features, namely the spectral pattern and pitch pattern. Conventional modeling bases speaker individuality on the static features. The authors try to capture the dynamic features of a speaker with a neural speaker model. Two methods are considered for the speaker modeling: neural prediction modeling by a multilayer perceptron, and learning matrix vector-quantization. Using the measures produced by the speaker modeling, speaker clustering of the reference patterns based on mutual information is carried out for speaker-independent speech recognition.
Keywords
data compression; learning systems; neural nets; speech recognition; dynamic features; dynamic spectral pattern; learning matrix vector-quantization; multilayer perceptron; mutual information; neural network; neural prediction modeling; neural speaker model; pitch frequency; pitch pattern; reference patterns; reference speaker; speaker independent speech recognition; speaker individuality; static short time spectrum; Databases; Distortion measurement; Frequency; Information technology; Multilayer perceptrons; Mutual information; Neural networks; Predictive models; Speech recognition; Vector quantization
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location
Toronto, Ont.
ISSN
1520-6149
Print_ISBN
0-7803-0003-3
Type
conf
DOI
10.1109/ICASSP.1991.150472
Filename
150472