International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 146 - Number 3
Year of Publication: 2016
Authors: S. S. Bhamare, B. V. Pawar
DOI: 10.5120/ijca2016910657
S. S. Bhamare, B. V. Pawar. An Efficient Method of Web Page Noise Cleaning for Effective Web Mining. International Journal of Computer Applications. 146, 3 (Jul 2016), 18-22. DOI=10.5120/ijca2016910657
In the huge network of the World Wide Web, web pages contain a large amount of information. Web research typically requires the main content of a web page (e.g., the article text) to be gathered, processed, and stored quickly and efficiently. Mining data on the Web has therefore become a major task for locating useful information. However, the useful information on a Web page is usually accompanied by a large amount of noise, such as navigation bars, links, advertisements, and copyright notices. The performance of Web mining can be improved by identifying and removing this noise from Web pages. This paper proposes a new method that removes noise content tags and extracts the information in the main content tags of web pages.
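To make the idea of tag-level noise removal concrete, the following is a minimal sketch and not the method proposed in the paper, whose details are not given in this abstract. It assumes that noise can be approximated by a fixed, hypothetical set of HTML tags (`NOISE_TAGS`) and keeps only the text found outside them; the class and function names are illustrative.

```python
from html.parser import HTMLParser

# Assumption: a hand-picked set of tags treated as noise.
# The paper's actual noise criteria are not stated in the abstract.
NOISE_TAGS = {"nav", "header", "footer", "aside", "script", "style", "form"}


class MainTextExtractor(HTMLParser):
    """Collects text that is not nested inside any tag in NOISE_TAGS."""

    def __init__(self):
        super().__init__()
        self.noise_depth = 0  # how many noise tags are currently open
        self.chunks = []      # text fragments kept as main content

    def handle_starttag(self, tag, attrs):
        if tag in NOISE_TAGS:
            self.noise_depth += 1

    def handle_endtag(self, tag):
        if tag in NOISE_TAGS and self.noise_depth > 0:
            self.noise_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        # Keep text only when no noise tag encloses it.
        if text and self.noise_depth == 0:
            self.chunks.append(text)


def extract_main_text(html):
    """Return the page text with noise-tag content removed (hypothetical helper)."""
    parser = MainTextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


page = (
    "<html><body>"
    "<nav><a href='/'>Home</a></nav>"
    "<div><p>Article text.</p></div>"
    "<footer>Copyright 2016</footer>"
    "</body></html>"
)
print(extract_main_text(page))
```

A fixed tag list is only a rough stand-in: real pages often place navigation and advertisements inside generic `div` elements, so a practical cleaner would need additional cues beyond tag names.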