• DocumentCode
    1451880
  • Title

    A Large-Deviation Analysis of the Maximum-Likelihood Learning of Markov Tree Structures

  • Author

    Tan, Vincent Y F ; Anandkumar, Animashree ; Tong, Lang ; Willsky, Alan S.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Wisconsin-Madison, Madison, WI, USA
  • Volume
    57
  • Issue
    3
  • fYear
    2011
  • fDate
    3/1/2011 12:00:00 AM
  • Firstpage
    1714
  • Lastpage
    1735
  • Abstract
    The problem of maximum-likelihood (ML) estimation of discrete tree-structured distributions is considered. Chow and Liu established that ML-estimation reduces to the construction of a maximum-weight spanning tree using the empirical mutual information quantities as the edge weights. Using the theory of large-deviations, we analyze the exponent associated with the error probability of the event that the ML-estimate of the Markov tree structure differs from the true tree structure, given a set of independently drawn samples. By exploiting the fact that the output of ML-estimation is a tree, we establish that the error exponent is equal to the exponential rate of decay of a single dominant crossover event. We prove that in this dominant crossover event, a non-neighbor node pair replaces a true edge of the distribution that is along the path of edges in the true tree graph connecting the nodes in the non-neighbor pair. Using ideas from Euclidean information theory, we then analyze the scenario of ML-estimation in the very noisy learning regime and show that the error exponent can be approximated as a ratio, which is interpreted as the signal-to-noise ratio (SNR) for learning tree distributions. We show via numerical experiments that in this regime, our SNR approximation is accurate.
  • Keywords
    Markov processes; error statistics; maximum likelihood estimation; Euclidean information theory; Markov tree structures; discrete tree-structured distributions; empirical mutual information quantities; error probability; large-deviation analysis; maximum-likelihood learning; maximum-weight spanning tree; Approximation methods; Error probability; Graphical models; Markov processes; Maximum likelihood estimation; Mutual information; Signal to noise ratio; Error exponent; Euclidean information theory; Markov structure; large-deviations principle; maximum-likelihood (ML) distribution estimation; tree-structured distributions;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2011.2104513
  • Filename
    5714274