• DocumentCode
    454682
  • Title

    On Maximizing the Within-Cluster Homogeneity of Speaker Voice Characteristics For Speech Utterance Clustering

  • Author

    Tsai, Wei-Ho ; Wang, Hsin-Min

  • Author_Institution
    Inst. of Inf. Sci., Acad. Sinica, Taipei
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    This paper investigates the problem of how to partition unknown speech utterances into clusters, such that the overall within-cluster homogeneity of speakers´ voice characteristics can be maximized. The within-cluster homogeneity is characterized by the likelihood probability that a cluster model, trained using all the utterances within a cluster, matches each of the within-cluster utterances. Such probability is then maximized by using a genetic algorithm, which determines the best cluster where each utterance should be located. For greater computational efficiency, also proposed is an alternative solution that approximates the likelihood probability with a divergence-based model similarity. The method is further designed to estimate the optimal number of clusters automatically
  • Keywords
    genetic algorithms; pattern clustering; probability; speaker recognition; divergence-based model; genetic algorithm; likelihood probability; speaker voice characteristics; speech utterance clustering; within-cluster homogeneity; Adaptation model; Audio recording; Computational efficiency; Design methodology; Genetic algorithms; Humans; Indexing; Information science; Performance evaluation; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660168
  • Filename
    1660168