International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 65 - Number 4
Year of Publication: 2013
Authors: Kolla Bhanu Prakash, M. A. Dorai Rangaswamy, Arun Raja Raman |
DOI: 10.5120/10909-5837
Kolla Bhanu Prakash, M. A. Dorai Rangaswamy, Arun Raja Raman. Neural Model for Content Extraction in Multilingual Web Documents. International Journal of Computer Applications. 65, 4 (March 2013), 1-3. DOI=10.5120/10909-5837
Neural models for multilingual web documents in the Indian subcontinent are gaining prominence in day-to-day life. While translation and transliteration are gaining importance on web pages, it remains difficult for the common user to understand what a web page is about, especially when the user does not know the regional language. Our effort here is therefore a generic tool that applies neural networks to overcome this problem. The model takes inputs in both English and Telugu, an Indian regional language, in both printed and handwritten formats. Words with common content are chosen, and a neural network is used to normalize the output. A sample page from a physics textbook dealing with magnetism is considered in this paper.