DocumentCode :
240382
Title :
Text normalization algorithm for facebook chats in Hausa language
Author :
Maitama, Jaafar Zubairu ; Haruna, Usman ; Gambo, Abdullahi Ya´u ; Thomas, Bimba Andrew ; Binti Idris, Norisma ; Gital, Abdulsalam Ya´u ; Abubakar, Adamu I.
Author_Institution :
Dept. of Artificial Intell., Univ. of Malaya, Kuala Lumpur, Malaysia
fYear :
2014
fDate :
17-18 Nov. 2014
Firstpage :
1
Lastpage :
4
Abstract :
The rapid increase in using non-standard words (NSWs) in communication through the social media is causing difficulties in understanding contents of the text messages. In addition, it affects the performance of several natural language processing (NLP) task such as machine translation, information retrievals, summarization and etc. In this study, we present an automatic text normalization system on Facebook chatting based on Hausa language. The proposed algorithm manually developed dictionary that employ normalization of each non-standard word with its equivalent standard word. This is accomplished through modification of the technique employed by [1] to fit Hausa NSWs´ formation. It was found that our proposed algorithm was able to normalized Hausa NSWs with an accuracy of 100%The results of this research can facilitate comprehensive communication via Facebook using Hausa language.
Keywords :
information retrieval; language translation; natural language processing; social networking (online); text analysis; Facebook chatting; Hausa language; NLP; automatic text normalization system; information retrievals; machine translation; natural language processing task; nonstandard words; social media; summarization; text messages; text normalization algorithm; Decision support systems; US Department of Defense; Facebook Chat; Hausa; Non-standard word; Text Normalization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Communication Technology for The Muslim World (ICT4M), 2014 The 5th International Conference on
Conference_Location :
Kuching
Type :
conf
DOI :
10.1109/ICT4M.2014.7020605
Filename :
7020605
Link To Document :
بازگشت