International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 103 - Number 6 |
Year of Publication: 2014 |
Authors: Anchal Garg, Bikrampal Kaur |
10.5120/18079-8632 |
Anchal Garg, Bikrampal Kaur . Enhancing Performance of Web Page by Removing Noises using LRU. International Journal of Computer Applications. 103, 6 ( October 2014), 23-27. DOI=10.5120/18079-8632
Data mining is the procedure of extracting or taking out the information from the huge set of data. Web Mining is an important application of data mining, which is to extract knowledge from Web data including Web documents, hyperlinks, usage logs of web sites, etc. A Web Page contains many blocks such as content blocks, copyrights, privacy notes and advertisements. These blocks like advertisements and copyrights etc. don't come under main content blocks. These blocks are known as noisy blocks or it can be said that these blocks contain noisy information. This noisy information adversely effects web data mining. Eliminating this noisy information will improve web data mining. In this paper, it will be discussed how to identify these noises and how to eliminate them to improve efficiency of web mining. There are many types of algorithms which are used in web mining i. e. Visitor method, Dom Tree. Visitor and Dom Tree both are complex and time consuming methods. We will also discuss removal of noises by using simple LRU algorithm and variants of LRU, which will result into less time consuming algorithm for web mining.