Title of article
Inducing terminologies from text: A case study for the consumer health domain
Author/Authors
Smaranda Muresan1، نويسنده , , Judith L. Klavans2، نويسنده ,
Issue Information
ماهنامه با شماره پیاپی سال 2013
Pages
18
From page
727
To page
744
Abstract
Specialized medical ontologies and terminologies, such as SNOMED CT and the Unified Medical Language System (UMLS), have been successfully leveraged in medical information systems to provide a standard web-accessible medium for interoperability, access, and reuse. However, these clinically oriented terminologies and ontologies cannot provide sufficient support when integrated into consumer-oriented applications, because these applications must “understand” both technical and lay vocabulary. The latter is not part of these specialized terminologies and ontologies. In this article, we propose a two-step approach for building consumer health terminologies from text: 1) automatic extraction of definitions from consumer-oriented articles and web documents, which reflects language in use, rather than relying solely on dictionaries, and 2) learning to map definitions expressed in natural language to terminological knowledge by inducing a syntactic-semantic grammar rather than using hand-written patterns or grammars. We present quantitative and qualitative evaluations of our two-step approach, which show that our framework could be used to induce consumer health terminologies from text.
Keywords
Natural language processing , Terminology , semantic analysis
Journal title
Journal of the American Society for Information Science and Technology
Serial Year
2013
Journal title
Journal of the American Society for Information Science and Technology
Record number
994841
Link To Document