Title :
Automated Assessment of Review Quality Using Latent Semantic Analysis
Author :
Ramachandran, Lakshmi ; Gehringer, Edward F.
Author_Institution :
Dept. of Comput. Sci., North Carolina State Univ., Raleigh, NC, USA
Abstract :
Quality of a review can be identified by reviewing a review. Quantifiable factors that help identify the quality of a review include quality and tone of review comments, and the number of tokens each contains. We use machine-learning techniques such as latent semantic analysis (LSA) and cosine similarity to classify comments based on their quality and tone. Our paper details experiments that were conducted on student review and metareview data by using different data pre-processing steps. We compare these pre-processing steps and show that when applied to student review data, they help improve data quality by providing better text classification. Our technique helps predict metareview scores for student reviews.
Keywords :
computer aided instruction; natural language processing; pattern classification; reviews; text analysis; data pre-processing; data quality; latent semantic analysis; machine learning techniques; metareview scores; quantifiable factors; review quality assessment; student reviews; text classification; Accuracy; Humans; Matrix decomposition; Semantics; Syntactics; Thumb; Training; automated metareviewing; latent semantic analysis; quality of reviews; text mining;
Conference_Titel :
Advanced Learning Technologies (ICALT), 2011 11th IEEE International Conference on
Conference_Location :
Athens, GA
Print_ISBN :
978-1-61284-209-7
Electronic_ISBN :
2161-3761
DOI :
10.1109/ICALT.2011.46