Title of article :
Conventional approaches to text analysis and information retrieval which measured document similarity by using considering all of the information in texts are a relatively inefficiency for processing large text collections in heterogeneous subject areas.
Author/Authors :
J. Morato، نويسنده , , J. Llorens، نويسنده , , G. Genova، نويسنده , , J. A. Moreiro، نويسنده ,
Keywords :
k-means , Co-wording , Discourse-model , Computational-linguistics , Context-analysis , Text-analysis-methods , n-grams , filtering
Journal title :
Astroparticle Physics