• DocumentCode
    2800772
  • Title

    Covariance clustering on Riemannian manifolds for acoustic model compression

  • Author

    Shinohara, Yusuke ; Masuko, Takashi ; Akamine, Masami

  • Author_Institution
    Corp. R&D Center, Toshiba Corp., Kawasaki, Japan
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4326
  • Lastpage
    4329
  • Abstract
    A new method of covariance clustering for acoustic model compression is proposed. Since covariance matrices do not form a Euclidean vector space, standard vector clustering algorithms cannot be used effectively for covariance clustering. In this paper, we propose a novel clustering algorithm based on a Riemannian framework, where the covariance space is considered as a Riemannian manifold equipped with the Fisher information metric, and notions of distance and mean are defined on the manifold. The LBG clustering algorithm is naturally extended to the covariance space under the Riemannian framework. Experimental results show the effectiveness of the proposed method, reducing the acoustic model size nearly to the half without noticeable loss in recognition performance.
  • Keywords
    acoustic signal processing; covariance matrices; pattern clustering; speech recognition; Euclidean vector space; Fisher information metric; LBG clustering algorithm; Riemannian manifold; acoustic model compression; covariance clustering; covariance matrix; recognition performance; Acoustic devices; Automatic speech recognition; Clustering algorithms; Covariance matrix; Euclidean distance; Gaussian distribution; Information geometry; Performance loss; Research and development; Solid modeling; Automatic speech recognition; Fisher information metric; Riemannian geometry; acoustic model compression; covariance clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495661
  • Filename
    5495661