• DocumentCode
    1938933
  • Title

    A neural speaker model for speaker clustering

  • Author

    Nakamura, Satoshi ; Akabane, Toshio

  • Author_Institution
    Sharp Corp., Nara, Japan
  • fYear
    1991
  • fDate
    14-17 Apr 1991
  • Firstpage
    853
  • Abstract
    A speaker model using a neural network is proposed for reference speaker clustering on speaker independent speech recognition. Speaker individuality is embedded in not only a static short time spectrum and a pitch frequency, but also a dynamic spectral pattern and pitch pattern. In conventional modeling, speaker individuality is based on the former static features. The authors try to capture the latter dynamic features, of speaker by a neural speaker model. Two methods, neural prediction modeling by multilayer perceptron and learning matrix vector-quantization, are considered for the speaker modeling. Using the measures of speaker modeling, speaker clustering of the reference patterns based on mutual information is carried out for speaker independent speech recognition
  • Keywords
    data compression; learning systems; neural nets; speech recognition; dynamic features; dynamic spectral pattern; learning matrix vector-quantization; multilayer perceptron; mutual information; neural network; neural prediction modeling; neural speaker model; pitch frequency; pitch pattern; reference patterns; reference speaker; speaker independent speech recognition; speaker individuality; static short time spectrum; Databases; Distortion measurement; Frequency; Information technology; Multilayer perceptrons; Mutual information; Neural networks; Predictive models; Speech recognition; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
  • Conference_Location
    Toronto, Ont.
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0003-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1991.150472
  • Filename
    150472