DocumentCode
600976
Title
2D visualization of terms and documents in Malay language
Author
Ismail, N.K. ; Saad, N.H.M. ; Omar, S.B.S. ; Sembok, T.M.T.
Author_Institution
Fac. of Comput. & Math. Sci., Univ. Teknol. MARA, Shah Alam, Malaysia
fYear
2013
fDate
26-27 March 2013
Firstpage
1
Lastpage
6
Abstract
In the technology era, information is just at our fingertip. Much information nowadays can be searched in digitize way. The output of the data is still in the listed form, and this linear form (one dimension) makes user hardly to find information related to the data requested. In addition, Malay document is still displayed in textual listed form. 676 documents from Jilid 1 of Hadith Al Tarmizi in Malay language are used to visualize the relationship. Method used to develop vector space model for term-document relationship is TF*IDF and Cosine Similarity technique used for document-document relationship. Prefuse toolkit is used as the visualization tool. From the 2D graphic, the relationship between Hadith can be found easily. From the questionnaire conducted, 90% participants agree that more relevant documents can be found using the 2D graphics system.
Keywords
data visualisation; natural language processing; text analysis; 2D graphic system; 2D visualization; Hadith Al Tarmizi; Jilid; Malay document; Malay language; TF*IDF; cosine similarity technique; digitize information; document-document relationship; prefuse toolkit; technology era; term-document relationship; textual listed form; vector space model; visualization tool; Computers; Data visualization; Electronic mail; Image color analysis; Prototypes; Vectors; graphical user interface; information retrieval; information visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Information and Communication Technology for the Muslim World (ICT4M), 2013 5th International Conference on
Conference_Location
Rabat
Print_ISBN
978-1-4799-0134-0
Type
conf
DOI
10.1109/ICT4M.2013.6518919
Filename
6518919
Link To Document