Title :
Word sense distribution in a web corpus
Author :
Chen, Ping ; Brown, David ; Tran, Andrew ; Ozoka, Noble ; Ortiz, Rafael
Author_Institution :
Dept. of Comput. & Math. Sci., Univ. of Houston-Downtown, Houston, TX, USA
Abstract :
The World Wide Web has become an important knowledge source for many research fields, and the quality of Web-acquired knowledge has a direct impact on their performance. While evaluating the vast amount of Web resources in full is out of the question, in this paper we examined thousands of sentences containing twelve preselected words and produced several quality measures, including sentence coherence and sense distribution information. Our goal is to provide some insight for the areas of Computational Linguistics that acquire knowledge from the Web.
Keywords :
Internet; Web sites; computational linguistics; knowledge acquisition; word processing; Web corpus; Web resources; Web-acquired knowledge quality; World Wide Web; knowledge source; sentence coherence; word sense distribution information; Coherence; Knowledge engineering; Search engines; Semantics; Speech; Syntactics; Sense annotation; Web corpus acquisition and quality analysis; Word sense distribution
Conference_Title :
Cognitive Informatics (ICCI), 2010 9th IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-8041-8
DOI :
10.1109/COGINF.2010.5599697