CFP last date
20 December 2024
Reseach Article

Preprocessing on Web Server Log Data for Web Usage Pattern Discovery

by Ketan D. Patel, Satyen M. Parikh
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 165 - Number 10
Year of Publication: 2017
Authors: Ketan D. Patel, Satyen M. Parikh
10.5120/ijca2017913978

Ketan D. Patel, Satyen M. Parikh . Preprocessing on Web Server Log Data for Web Usage Pattern Discovery. International Journal of Computer Applications. 165, 10 ( May 2017), 29-32. DOI=10.5120/ijca2017913978

@article{ 10.5120/ijca2017913978,
author = { Ketan D. Patel, Satyen M. Parikh },
title = { Preprocessing on Web Server Log Data for Web Usage Pattern Discovery },
journal = { International Journal of Computer Applications },
issue_date = { May 2017 },
volume = { 165 },
number = { 10 },
month = { May },
year = { 2017 },
issn = { 0975-8887 },
pages = { 29-32 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume165/number10/27610-2017913978/ },
doi = { 10.5120/ijca2017913978 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:12:05.925678+05:30
%A Ketan D. Patel
%A Satyen M. Parikh
%T Preprocessing on Web Server Log Data for Web Usage Pattern Discovery
%J International Journal of Computer Applications
%@ 0975-8887
%V 165
%N 10
%P 29-32
%D 2017
%I Foundation of Computer Science (FCS), NY, USA
Abstract

World Wide Web has gained popularity because of the fact that it acts as an effective communication medium between business and end users. Company needs to have a web site which satisfies the intended needs of their end users. Users like to revisit a web site which is usable in nature. Web usage patterns of end users must be identified to improve usability on any web site. It is done with analyzing web server log files. Web logs contain noisy, redundant and incomplete data in huge volume which restricts to identify precise usage pattern from it. So, the effective data pre-processing techniques are required. In this paper algorithms are proposed and implemented for pre-processing tasks includes Data Cleaning, User identifications and Session Identification. Pre-processing algorithms are implemented on web log files of two websites and results of these algorithms are useful to study usage pattern of end users.

References
  1. Suneetha, K. R. and Dr. Krishnamoorthi, R. “Identifying User Behavior by Analyzing Web Server Access Log File”, IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.4, April 2009.
  2. Pamutha, T., Chimphlee, S., Kimpon, C. and Sanguansat, P. “Data Preprocessing on Web Server Log Files for Mining Users Access Patterns”, International Journal of Research and Reviews in Wireless Communications (IJRRWC) Vol. 2, No. 2, June 2012.
  3. Kharwar, A., Naik, C. and Desai, N. “A Complete Pre Processing Method for Web Usage Mining”, International Journal of Emerging Technology and Advanced Engineering, October 2013.
  4. Punjani, M. and Gupta, V. “A Survey on Data Preprocessing in Web Usage Mining”, IOSR Journal of Computer Engineering (IOSR-JCE), 2013.
  5. Dr. Dhawan, S. and Lathwal, M. “Study of Preprocessing Methods in Web Server Logs”, International Journal of Advanced Research in Computer Science and Software Engineering, 2013.
  6. Verma, P. and Dr. Keswani, N. “Web Usage mining framework for Data Cleaning and IP address Identification”, IJASCSE, 2014.
  7. Muskan. and Dr. Garg, K., “An Efficient Algorithm for Data Cleaning of Web Logs with Spider Navigation Removal”, International Journal of Computer Application (2250-1797) Volume 6– No.3, May- June 2016.
  8. Pushpa, V. and Vidyapriya V., “An Efficient Preprocessing Method to Detect User Access Patterns from Weblogs”, International Journal of Computer Science and Mobile Computing, Vol.5 Issue.9, September- 2016.
  9. Meghwal, A. and Dr. Sharma A. “Identifying System Errors through Web Server Log Files in Web Log Mining”, IJCST Vol. 7, Issue 1, 2016.
  10. Neelima, G. and Dr. Rodda, S. “Predicting user behavior through sessions using the web log mining”, International Conference on Advances in Human Machine Interaction, IEEE 2016.
  11. Web crawler, ScienceDaily https://www.sciencedaily.com/terms/web_crawler.htm
Index Terms

Computer Science
Information Sciences

Keywords

Preprocessing Web Server log data Web Usage Mining Sessions Users Data Cleaning