Title :
Context Dependent Bag of words generation
Author :
Jadhav, Swapnil Ashok ; Somayajulu, D.V.L.N. ; Bhattu, S. Nagesh ; Subramanyam, R.B.V. ; Suresh, Padmashri
Author_Institution :
Dept. of Comput. Sci. & Eng., NIT, Warangal, India
Abstract :
Query spelling correction is a crucial component in modern text mining systems such as Question-answering systems and Sentiment Analysis systems where noise can affect the query matching score. In many existing query matching systems Bag of Words (BoW) generation method is used to generate candidates for noisy words. But in these systems candidate generation do not depend upon context of a query sentence. BoW count for each noisy word may vary and selecting correct candidates from such list is not easy and may result in wrong selection. With our context dependent BoW generation method very few but highly probable candidates are generated which are easy for look up and process of query spelling correction would be easier and efficient.
Keywords :
data mining; query processing; text analysis; BoW generation method; context dependent bag-of-words generation; noisy words candidate generation; query matching score; query sentence; query spelling correction; question-answering systems; sentiment analysis systems; text mining systems; Accuracy; Context; Dictionaries; Hidden Markov models; Mobile communication; Noise; Noise measurement; faq retrieval; information retrieval; nlp; noise removal; question answering; sentiment analysis; spelling correction; text mining;
Conference_Titel :
Advances in Computing, Communications and Informatics (ICACCI), 2013 International Conference on
Conference_Location :
Mysore
Print_ISBN :
978-1-4799-2432-5
DOI :
10.1109/ICACCI.2013.6637406