DocumentCode
3489925
Title
A Systematic Comparison of SVM and Maximum Entropy Classifiers for Translation Error Detection
Author
Jinhua Du ; Sha Wang
Author_Institution
Sch. of Autom. & Inf. Eng., Xi´an Univ. of Technol., Xi´an, China
fYear
2012
fDate
13-15 Nov. 2012
Firstpage
125
Lastpage
128
Abstract
In recent years, the translation error detection or confidence estimation for SMT has been becoming a hot question, especially in the localization industry. This paper mainly focuses on a systematic comparison on two different classifiers Maximum Entropy (MaxEnt) and SVM over different features to illustrate their error detection capabilities. Three typical word posterior probabilities (WPP) and three linguistic features are introduced and fairly compared over two classifiers on Chinese to-English NIST datasets. Experimental results show that the combination of WPP with linguistic features can significantly reduce the CER, and the SVM classifier outperforms the MaxEnt classifier in terms of the CER and F measure.
Keywords
language translation; maximum entropy methods; natural language processing; pattern classification; probability; support vector machines; CER; Chinese to-English NIST datasets; F measure; MaxEnt; SM; SVM; WPP; confidence estimation; error detection capabilities; localization industry; maximum entropy classifiers; statistical machine translation; systematic comparison; translation error detection; word posterior probabilities; Estimation; Feature extraction; NIST; Pragmatics; Support vector machines; Syntactics; Systematics; Error Detection; Linguistic Features; Maximum Entropy Classifier; SVM classifier; Word Posterior Probability;
fLanguage
English
Publisher
ieee
Conference_Titel
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location
Hanoi
Print_ISBN
978-1-4673-6113-2
Electronic_ISBN
978-0-7695-4886-9
Type
conf
DOI
10.1109/IALP.2012.20
Filename
6473712
Link To Document