مرکز منطقه ای اطلاع رساني علوم و فناوري - Finding Difficult-to-Disambiguate Words: Towards an Efficient Workflow to Implement Word Sense Disambiguation

DocumentCode :

3706641

Title :

Finding Difficult-to-Disambiguate Words: Towards an Efficient Workflow to Implement Word Sense Disambiguation

Author :

Manabu Torii;Jung-Wei Fan;Daniel S. Zisook

Author_Institution :

Med. Inf., Syst. Solutions &

fYear :

2015

Firstpage :

448

Lastpage :

448

Abstract :

In the biomedical and clinical domain, valuable information is frequently represented in free-text documents. Natural language processing (NLP) is a powerful tool that can extract structured information from theses documents. Word sense disambiguation (WSD) is a critical component in an NLP pipeline that increases the accuracy of the extracted information. However, WSD is expensive to apply for all known ambiguous words. Given limited time and resources, one practical strategy is to prioritize easy-to-disambiguate words and efficiently maximize the coverage of disambiguation. To aid prioritization efforts, we studied two quantitative indicators that are associated with how easy/difficult it is to disambiguate any given word.

Keywords :

"Natural language processing","Vocabulary","Data mining","Training","Informatics","Benchmark testing","Bioinformatics"

Publisher :

ieee

Conference_Titel :

Healthcare Informatics (ICHI), 2015 International Conference on

Type :

conf

DOI :

10.1109/ICHI.2015.66

Filename :

7349727

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3706641