Title : 
Contextual behaviour features and grammar rules for Thai sentence-breaking
         
        
            Author : 
Tangsirirat, Nathacha ; Suchato, Atiwong ; Punyabukkana, Proadpran ; Wutiwiwatchai, Chai
         
        
            Author_Institution : 
Dept. of Comput. Eng., Chulalongkorn Univ., Bangkok, Thailand
         
        
        
        
        
        
            Abstract : 
Statistical approach with surrounding context around a space was widely used as a main feature for Thai sentence-breaking. However, it does not represent a contextual behaviour regarding an entire context in a sentence. Moreover, it does not take an advantage of Thai grammar rules to determine a sentence boundary. This paper proposes the use of a hybrid approach integrating between rule-based method and statistical approach using contextual behaviour features reflecting natural language behaviour for Thai sentence-breaking. The performance of Thai sentence-breaking using a number of words in a chunk, existence of verb, and rules are compared. Experimental results show that using a number of words in a chunk achieves higher accuracy than other features. Moreover, integration of those features and rule-based method achieves better accuracy. The space-correct and false-break scores are 93.54% and 2.99% respectively.
         
        
            Keywords : 
grammars; knowledge based systems; natural language processing; statistical analysis; Thai grammar rules; Thai sentence-breaking; contextual behaviour features; false-break score; natural language behaviour; rule-based method; sentence boundary; space-correct score; statistical approach; Accuracy; Classification algorithms; Context; Grammar; Natural language processing; Training data; rule-based; sentence boundary; sentence segmentation; sentence-breaking; statistical approach;
         
        
        
        
            Conference_Titel : 
Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), 2013 10th International Conference on
         
        
            Conference_Location : 
Krabi
         
        
            Print_ISBN : 
978-1-4799-0546-1
         
        
        
            DOI : 
10.1109/ECTICon.2013.6559581