DocumentCode
3368333
Title
A Fast Algorithm of Computing Word Similarity
Author
Xingyuan Chen ; Xia Yang ; Bingjun Su
Author_Institution
Sch. of Comput. Sci., Leshan Normal Univ., Leshan, China
fYear
2013
fDate
14-15 Dec. 2013
Firstpage
405
Lastpage
408
Abstract
Computing distributional similarity is an effective strategy for finding synonyms. The time complexity of the naive nearest-neighbor approach of computing distributional word similarity is O(n*n*m), it is inefficient for accurately representing synonymy using large corpus. We find a parse property of triple that the growth rate of average triples number of each word leveled off as corpus´s size increases. Using this property we design a fast algorithm for computing word similarity whose time complexity is O(n*n). We demonstrate the efficiency of this algorithm based on the English Gig word corpus.
Keywords
computational complexity; natural language processing; English Gig word corpus; distributional word similarity computation; naive nearest-neighbor approach; natural language processing; parse triple property; synonym finding; synonymy representation; time complexity; Algorithm design and analysis; Computer science; Context; Educational institutions; Manuals; Time complexity; Vocabulary; computing complexity; distributional word similarity; triples;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Security (CIS), 2013 9th International Conference on
Conference_Location
Leshan
Print_ISBN
978-1-4799-2548-3
Type
conf
DOI
10.1109/CIS.2013.92
Filename
6746428
Link To Document