We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 November 2024
Reseach Article

Extraction of Frequent Patterns from Web Logs using Web Log Mining Techniques

by Rakesh Kumar, Kanwal Garg, Vinod Kumar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 59 - Number 10
Year of Publication: 2012
Authors: Rakesh Kumar, Kanwal Garg, Vinod Kumar
10.5120/9584-4063

Rakesh Kumar, Kanwal Garg, Vinod Kumar . Extraction of Frequent Patterns from Web Logs using Web Log Mining Techniques. International Journal of Computer Applications. 59, 10 ( December 2012), 19-25. DOI=10.5120/9584-4063

@article{ 10.5120/9584-4063,
author = { Rakesh Kumar, Kanwal Garg, Vinod Kumar },
title = { Extraction of Frequent Patterns from Web Logs using Web Log Mining Techniques },
journal = { International Journal of Computer Applications },
issue_date = { December 2012 },
volume = { 59 },
number = { 10 },
month = { December },
year = { 2012 },
issn = { 0975-8887 },
pages = { 19-25 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume59/number10/9584-4063/ },
doi = { 10.5120/9584-4063 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:03:49.729831+05:30
%A Rakesh Kumar
%A Kanwal Garg
%A Vinod Kumar
%T Extraction of Frequent Patterns from Web Logs using Web Log Mining Techniques
%J International Journal of Computer Applications
%@ 0975-8887
%V 59
%N 10
%P 19-25
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

World Wide Web is a huge repository of web pages and links. It provides profusion of information for the Internet users. The growth of web is tremendous as approximately one million pages are added daily. User's accesses are recorded in web logs. Because of the incredible usage of web, the web log files are growing at a faster rate and the size is becoming huge. Web data mining is the application of data mining techniques in web data. Web log Mining applies mining techniques in log data to extract the behaviour of users. Web log mining consists of three phases pre-processing, pattern discovery and pattern analysis. Web log data is usually noisy and ambiguous and pre-processing is an important process before web log mining. For discovering patterns sessions are to be constructed efficiently. This paper presents the existing work done to extracting patterns by using decision tree methodology in the technique of web log mining.

References
  1. Berry Michael J. A. and Linoff Gordon, "Data Mining Techniques: For Marketing, Sales and Customer Support, John Wiley & Sons Ltd. ", ISBN: 0-471-17980- 9, (1997).
  2. Decision Tree http://scikit-learn. sourceforge. net/stable/modules/tree. html
  3. Damianou Charalambos," Statistics, Symmetria Publications", (1998).
  4. Gao, "Research On Client Behavior Pattern Recognition System Based On Web Log Mining"," Proceedings of the Ninth International Conference on Machine Learning and Cybernetics, Qingdao, 11-14 July 2010", 978-1-4244-6527-9/10, (2010).
  5. Jean-Fran et. al. ,"Mining free-sets under constraints", In Michel E. Adiba, Christine Collet, and Bipin C. Desai, editors, "Proceedings of International Database Engineering & Applications Symposium (IDEAS'01)", pages 322 – 329, Grenoble, France, July (2001).
  6. Kimmo Hatonen, "Data mining for telecommunications network log analysis", (2009).
  7. Kimmo Hatonen et. al. ,"Comprehensive log compression with frequent patterns, "Proceedings of Data Warehousing and Knowledge Discovery, 5th International Conference", (DaWaK 2003), volume 2737 of LNCS, pages 360 – 370, Prague, Czech Republic, September (2003).
  8. Lekeas, "Data mining the web: the case of City University's Log Files", (2000).
  9. Loh, W. and Shih, Y. (1997), "Split Selection Methods for Classification Trees," Statistica Sinica, 7, 815-840. Introduces the QUEST algorithm. Refer to http://www. stat. wisc. edu/~loh.
  10. Nicolas Pasquier et. al. ,"Closed set based discovery of small covers for association rules", "In Christine Collet, editor, Proceedings of BDA'99", pages 361 – 381, Bordeaux, France, October (1999).
  11. Osmar R. Zaiane et. al. "Discovering Web Access Patterns and Trends by Applying OLAP and Data Mining Technology on Web Logs", (2001).
  12. Veeramalai et. al. "Efficient Web Log Mining Using Enhanced Apriori Algorithm with Hash Tree and Fuzzy", "International journal of computer science & information technology (IJCSIT) Vol. 2, No. 4, August 2010", 10. 5121/ijcsit. 2010. 2406, (2010).
Index Terms

Computer Science
Information Sciences

Keywords

Web Log mining Web Log Files World Wide Web (WWW) HTTP (Hyper Text Transfer Protocol) and CHAID (Chi-Squared Automatic Interaction Detection)