DocumentCode
3160091
Title
A neuron model for documents containing multilingual Indian texts
Author
Kolla, Bhanu Prakash ; Dorairangaswamy, M.A. ; Rajaraman, A.
Author_Institution
Sathyabama Univ., Chennai, India
fYear
2010
fDate
17-19 Sept. 2010
Firstpage
451
Lastpage
454
Abstract
A two input neuron model for multi-lingual web pages for education Indian subcontinent is an area of multilingual and multi-cultural diversity. Introducing web-enabled education is a challenge as one needs content-based approaches for web pages with english and regional language texts. Translational tools many times get hampered due to time and context-based interpretations. So a generic tool to identify the content of web pages is needed and this will form the basis for later elaborations on the text. The aim of the paper is to present a 2-input neural model which captures salient features of a subject to identify the content in a regular web page containing both English and Telugu-one of the regional languages of India- and test it with computer generated, printed and handwritten formats. Typical words having common content are chosen and neural network is used to arrive at a normalised output. The example of a physics text discussing magnetism is used as an illustration.
Keywords
Internet; language translation; natural language processing; neural nets; text analysis; 2-input neural model; Web enabled education; Web pages content; content based approach; context based interpretations; document handling; education Indian subcontinent; handwritten formats; multicultural diversity; multilingual Indian texts; multilingual Web pages; neural network; neuron model; regional language texts; translational tools; Computational modeling; Computers; Education; Multimedia communication; Multimedia databases; Pixel; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Communication Technology (ICCCT), 2010 International Conference on
Conference_Location
Allahabad, Uttar Pradesh
Print_ISBN
978-1-4244-9033-2
Type
conf
DOI
10.1109/ICCCT.2010.5640489
Filename
5640489
Link To Document