International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 96 - Number 16 |
Year of Publication: 2014 |
Authors: Rachit Goel |
10.5120/16880-6882 |
Rachit Goel . Enhanced Web Mining Technique To Clean Web Log File. International Journal of Computer Applications. 96, 16 ( June 2014), 25-29. DOI=10.5120/16880-6882
The arrival of the computer technology has contributed the ability to produce and store the massive amounts of data. Now the world is not confined only to manually generated files or reports, but has become a giant store where vast amounts of data are collected and exchanged daily. Web pages typically contain a large amount of information that is not part of the main content of the pages, e. g. banner ads, navigation bars, copyright notices, etc. Such noise on web pages usually leads to poor results in Web Mining which mainly depends upon the web page content. Therefore, it becomes very essential to extract information from the bulks of data and structure them into useful knowledge that will be helpful for some type of understanding. This leads to the birth of data mining. Web usage mining is the subject field of Data Mining which deals with the discovery and analysis of usage patterns from web data specifically web logs in order to improve the web based applications. The motive of mining is to find users' access models automatically and quickly from the vast Web log data, such as frequent access paths, frequent access page groups and user clustering. Through web usage mining, the server log, registration information and other relative information left by user provide foundation for decision making of organizations.