DocumentCode :
3368333
Title :
A Fast Algorithm of Computing Word Similarity
Author :
Xingyuan Chen ; Xia Yang ; Bingjun Su
Author_Institution :
Sch. of Comput. Sci., Leshan Normal Univ., Leshan, China
fYear :
2013
fDate :
14-15 Dec. 2013
Firstpage :
405
Lastpage :
408
Abstract :
Computing distributional similarity is an effective strategy for finding synonyms. The time complexity of the naive nearest-neighbor approach of computing distributional word similarity is O(n*n*m), it is inefficient for accurately representing synonymy using large corpus. We find a parse property of triple that the growth rate of average triples number of each word leveled off as corpus´s size increases. Using this property we design a fast algorithm for computing word similarity whose time complexity is O(n*n). We demonstrate the efficiency of this algorithm based on the English Gig word corpus.
Keywords :
computational complexity; natural language processing; English Gig word corpus; distributional word similarity computation; naive nearest-neighbor approach; natural language processing; parse triple property; synonym finding; synonymy representation; time complexity; Algorithm design and analysis; Computer science; Context; Educational institutions; Manuals; Time complexity; Vocabulary; computing complexity; distributional word similarity; triples;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Security (CIS), 2013 9th International Conference on
Conference_Location :
Leshan
Print_ISBN :
978-1-4799-2548-3
Type :
conf
DOI :
10.1109/CIS.2013.92
Filename :
6746428
Link To Document :
بازگشت