Research Article

Neural Model for Content Extraction in Multilingual Web Documents

by Kolla Bhanu Prakash, M. A. Dorai Rangaswamy, Arun Raja Raman
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 65 - Number 4
Year of Publication: 2013
Authors: Kolla Bhanu Prakash, M. A. Dorai Rangaswamy, Arun Raja Raman
DOI: 10.5120/10909-5837

Kolla Bhanu Prakash, M. A. Dorai Rangaswamy, Arun Raja Raman. Neural Model for Content Extraction in Multilingual Web Documents. International Journal of Computer Applications. 65, 4 (March 2013), 1-3. DOI=10.5120/10909-5837

@article{ 10.5120/10909-5837,
author = { Kolla Bhanu Prakash and M. A. Dorai Rangaswamy and Arun Raja Raman },
title = { Neural Model for Content Extraction in Multilingual Web Documents },
journal = { International Journal of Computer Applications },
issue_date = { March 2013 },
volume = { 65 },
number = { 4 },
month = { March },
year = { 2013 },
issn = { 0975-8887 },
pages = { 1-3 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume65/number4/10909-5837/ },
doi = { 10.5120/10909-5837 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:17:43.784085+05:30
%A Kolla Bhanu Prakash
%A M. A. Dorai Rangaswamy
%A Arun Raja Raman
%T Neural Model for Content Extraction in Multilingual Web Documents
%J International Journal of Computer Applications
%@ 0975-8887
%V 65
%N 4
%P 1-3
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Neural modeling of multilingual web documents is gaining prominence in day-to-day life on the Indian subcontinent. While translation and transliteration are becoming more important on web pages, it remains difficult for ordinary users to understand what a web page says, especially when they do not know the regional language. Our effort here is a generic tool, based on neural networks, to overcome this problem. The model takes inputs in both English and Telugu, an Indian regional language, in both printed and handwritten formats. Words having common content are chosen, and a neural network is used to normalize the output. A sample page from a physics textbook dealing with magnetism is considered in this paper.

Index Terms

Computer Science
Information Sciences

Keywords

Media mining, Multilingual, Web communication, Neural network