DocumentCode :
1954538
Title :
English-Hindi Automatic Word Alignment with Scarce Resources
Author :
Venkataramani, Eknath ; Gupta, Deepa
Author_Institution :
Dept. of Inf. Technol., Amrita Vishwa Vidyapeetham, Bangalore, India
fYear :
2010
fDate :
28-30 Dec. 2010
Firstpage :
253
Lastpage :
256
Abstract :
Many automatic word alignment techniques have been developed in Natural Language Processing (NLP). However, word alignment between English and Hindi has made little progress, for two main reasons: the complex structure of the two languages and the scarcity of Hindi-language resources. This paper presents a corpus-augmented method of word alignment that overcomes these limitations. We see this work as an improved approach to establishing a word alignment algorithm with scarce resources, for Indian languages in general and for English-Hindi in particular.
Keywords :
natural language processing; word processing; English-Hindi automatic word alignment; Hindi language resource scarcity; Indian languages; corpus augmented method; Computational linguistics; Conferences; Data models; Dictionaries; Hidden Markov models; Training; Training data; Giza++; NATools; Scarce resources; Word alignment; corpus-augmented approach
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Asian Language Processing (IALP), 2010 International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-9063-9
Type :
conf
DOI :
10.1109/IALP.2010.54
Filename :
5681567