• DocumentCode
    2018508
  • Title

    Supervised and unsupervised clustering of the speaker space for connectionist speech recognition

  • Author

    Konig, Yochai ; Morgan, Nelson

  • Author_Institution
    Int. Comput. Sci. Inst., Berkeley, CA, USA
  • Volume
    1
  • fYear
    1993
  • fDate
    27-30 April 1993
  • Firstpage
    545
  • Abstract
    One of the challenging problems of a speaker-independent continuous speech recognition system is how to achieve good performance with a new speaker, when the only available source of information about the new speaker is the utterance to be recognized. The authors propose a first step toward a solution, based on clustering of the speaker space. The study had two steps. The first was searching for a set of features to cluster speakers. Second, using the chosen features, two kinds of clustering were investigated: supervised-using two clusters, males and females-and unsupervised-using two, three, and five clusters. The cluster information was integrated into the connectionist speech recognition system by using the speaker cluster neural network (SCNN). The SCNN attempts to share the speaker-independent parameters and to model the cluster-dependent parameters. The results show that the best performance is achieved with the supervised clusters, resulting in an overall improvement in recognition performance.<>
  • Keywords
    neural nets; search problems; speech recognition; connectionist speech recognition; performance; speaker cluster neural network; supervised clustering; unsupervised clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
  • Conference_Location
    Minneapolis, MN, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1993.319176
  • Filename
    319176