Title : 
Benchmark of Arabic morphological analyzers challenges and solutions
         
        
            Author : 
Jaafar, Younes ; Bouzoubaa, Karim
         
        
            Author_Institution : 
Mohammadia Sch. of Eng., Mohammed Vth Univ. - Adgal, Rabat, Morocco
         
        
        
        
        
        
            Abstract : 
Arabic Natural Language Processing (ANLP) has known an important development during the last decade. Nowadays, several ANLP tools are already developed such as morphological analyzers. These analyzers are often used in more advanced applications such as syntactic parsers, search engines, machine translation systems, etc. However, the choice of a morphological analyzer to use, among others, can be difficult for researchers if they ignore its metrics. In this article, we present the challenges of the benchmark of Arabic morphological analyzers. We present also our solution developed in Java, which allows the benchmark by returning the most common metrics, namely the accuracy, precision, f-measure and execution time. This solution has the advantage of being cross-platform, flexible and allows to be extended to cover new morphological analyzers to compare.
         
        
            Keywords : 
Java; natural language processing; ANLP tools; Arabic morphological analyzers; Arabic natural language processing; Java; accuracy metrics; cross-platform; execution time metrics; f-measure metrics; precision metrics; Accuracy; Benchmark testing; Gold; Java; Measurement; Standards; XML; Arabic morphological analyzers; Benchmark; SAFAR platform; Standard corpus;
         
        
        
        
            Conference_Titel : 
Intelligent Systems: Theories and Applications (SITA-14), 2014 9th International Conference on
         
        
            Conference_Location : 
Rabat
         
        
            Print_ISBN : 
978-1-4799-3566-6
         
        
        
            DOI : 
10.1109/SITA.2014.6847312