• DocumentCode
    286259
  • Title

    Deriving a probabilistic grammar of semantic markers from unrestricted English text

  • Author

    Jost, Uwe ; Atwell, Eric

  • Author_Institution
    Centre for Comput. Anal. of Language & Speech, Leeds Univ., UK
  • fYear
    1993
  • fDate
    22-23 Apr 1993
  • Abstract
    The derivation is described of a probabilistic grammar for main subject field codes from the machine readable version of the Longman Dictionary of Contemporary English (LDOCE) (P. Procter, 1978). These codes are used in the dictionary to mark the subject area to which a certain sense of a word belongs. The grammar consists of the dictionary itself and a matrix that describes how closely two main subject fields are related to each other in a large training corpus of unrestricted English text
  • Keywords
    computational linguistics; grammars; natural languages; probabilistic logic; LDOCE; large training corpus; machine readable version; main subject field codes; matrix; probabilistic grammar; semantic markers; subject area; unrestricted English text;
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Grammatical Inference: Theory, Applications and Alternatives, IEE Colloquium on
  • Conference_Location
    Colchester
  • Type

    conf

  • Filename
    243120