A neural speaker model for speaker clustering

Author

Nakamura, Satoshi ; Akabane, Toshio

Author_Institution

Sharp Corp., Nara, Japan

fYear

1991

fDate

14-17 Apr 1991

Firstpage

853

Abstract

A speaker model using a neural network is proposed for reference speaker clustering on speaker independent speech recognition. Speaker individuality is embedded in not only a static short time spectrum and a pitch frequency, but also a dynamic spectral pattern and pitch pattern. In conventional modeling, speaker individuality is based on the former static features. The authors try to capture the latter dynamic features, of speaker by a neural speaker model. Two methods, neural prediction modeling by multilayer perceptron and learning matrix vector-quantization, are considered for the speaker modeling. Using the measures of speaker modeling, speaker clustering of the reference patterns based on mutual information is carried out for speaker independent speech recognition

Keywords

data compression; learning systems; neural nets; speech recognition; dynamic features; dynamic spectral pattern; learning matrix vector-quantization; multilayer perceptron; mutual information; neural network; neural prediction modeling; neural speaker model; pitch frequency; pitch pattern; reference patterns; reference speaker; speaker independent speech recognition; speaker individuality; static short time spectrum; Databases; Distortion measurement; Frequency; Information technology; Multilayer perceptrons; Mutual information; Neural networks; Predictive models; Speech recognition; Vector quantization;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on

Conference_Location

Toronto, Ont.

ISSN

1520-6149

Print_ISBN

0-7803-0003-3

Type

conf

DOI

10.1109/ICASSP.1991.150472

Filename

150472