• DocumentCode
    1135023
  • Title

    Inference of Finite-State Probabilistic Grammars

  • Author

    Maryanski, Fred J. ; Booth, Taylor L.

  • Issue
    6
  • fYear
    1977
  • fDate
    6/1/1977 12:00:00 AM
  • Firstpage
    521
  • Lastpage
    536
  • Abstract
    The problem of the inference of finite-state probabilistic grammars is studied from two points of view. First, the theoretical aspects of grammatical inference are considered. Among the topics investigated are the structural and statistical properties of probabilistic grammars, methods for assigning probability measures to rewrite rules of probabilistic grammars, and statistical measures for determining how well an inferred probabilistic grammar approximates a sample set. The second concern of the study is the development and implementation of an algorithm for the inference of finite-state probabilistic grammars. This finite-state inference procedure produces a deterministic finite-state probabilistic grammar whose language approximates the sample set within a user-supplied acceptance region under the chi-square test. This procedure is enumerative. Heuristic tree-searching techniques are used to improve efficiency. The convergence of the procedure to an acceptable grammar is demonstrated and the steps of the procedure are theoretically justified. Test results of a PL/I implementation are presented. The inference procedure developed provides a means of synthesizing a probabilistic model of both physical and abstract systems from samples of their behavior.
  • Keywords
    Deterministic grammars, finite-state grammars, grammatical inference, Markov process, probabilistic grammars, statistical estimation.; Automata; Character generation; Computer science; Convergence; Inference algorithms; Markov processes; Probability; Process design; Production; Testing; Deterministic grammars, finite-state grammars, grammatical inference, Markov process, probabilistic grammars, statistical estimation.;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.1977.1674878
  • Filename
    1674878