International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 64 - Number 16
Year of Publication: 2013
Authors: S. Chitra, B. Kalpana
DOI: 10.5120/10716-5031
S. Chitra, B. Kalpana. A Novel Preprocessing Technique for Session Construction using Propositional DAGs. International Journal of Computer Applications. 64, 16 (February 2013), 8-12. DOI=10.5120/10716-5031
This paper continues the line of research on Web access log analysis, which examines patterns of website usage and features of user behavior. Raw log data is noisy and ambiguous, so it must be preprocessed before web usage mining can be carried out efficiently. Preprocessing comprises three phases: data cleaning, user identification, and session construction. Session construction is vital: numerous real-world problems can be modeled as traversals on a graph, and mining these traversals supplies what the preprocessing phase requires. Existing work, however, considers only traversals on unweighted graphs. This paper generalizes that setting to the case where the vertices of the graph are given weights that reflect their significance. The proposed method constructs each session as a Propositional Directed Acyclic Graph (PDAG) that contains pages with calculated weights. We identify a new property called simple-negation, which is an implicit restriction of all Negation Normal Forms (NNFs) and Binary Decision Diagrams (BDDs). Removing this restriction leads to PDAGs, a more general family of graph-based languages for representing Boolean functions or propositional theories. After each page is weighted according to its browsing time, a PDAG structure is constructed for each user session. This helps site administrators find the pages that interest users and redesign their web pages accordingly. Existing systems have a problem of learning with the Boolean function, which the proposed method can overcome.
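The two ideas in the abstract, weighting pages by browsing time and representing Boolean structure as a PDAG, can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract does not give the weighting formula or the rule that maps a session to a PDAG, so the share-of-total-time weight, the page names, the browsing times, and the example formula are all assumptions made for illustration. What the sketch does show is the structural point behind simple-negation: in a PDAG a negation node may sit above any sub-graph, whereas NNFs allow negation only directly on variables.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple, Union


@dataclass(frozen=True)
class Var:
    name: str  # a propositional variable, here "page was visited"


@dataclass(frozen=True)
class Not:
    child: "Node"  # may wrap ANY node, not only a variable


@dataclass(frozen=True)
class And:
    children: Tuple["Node", ...]


@dataclass(frozen=True)
class Or:
    children: Tuple["Node", ...]


Node = Union[Var, Not, And, Or]


def evaluate(node: Node, assignment: Dict[str, bool]) -> bool:
    """Evaluate the Boolean function rooted at `node` under `assignment`."""
    if isinstance(node, Var):
        return assignment[node.name]
    if isinstance(node, Not):
        return not evaluate(node.child, assignment)
    if isinstance(node, And):
        return all(evaluate(c, assignment) for c in node.children)
    if isinstance(node, Or):
        return any(evaluate(c, assignment) for c in node.children)
    raise TypeError(f"unknown node type: {type(node).__name__}")


def browsing_time_weights(visits: List[Tuple[str, float]]) -> Dict[str, float]:
    """Assumed weighting: each page's share of the session's total browsing time."""
    total = sum(seconds for _, seconds in visits)
    weights: Dict[str, float] = {}
    for page, seconds in visits:
        share = seconds / total if total else 0.0
        weights[page] = weights.get(page, 0.0) + share
    return weights


# Hypothetical session: (page, browsing time in seconds).
session = [("/home", 30.0), ("/products", 90.0), ("/cart", 60.0)]
weights = browsing_time_weights(session)

# Hypothetical formula. `shared` has two parents, so the structure is a DAG
# rather than a tree, and one parent is a negation over a compound node.
shared = And((Var("/home"), Var("/products")))
formula = Or((Not(shared), And((shared, Var("/cart")))))
```

Calling `evaluate(formula, {"/home": True, "/products": True, "/cart": False})` walks the graph from the root. The frozen dataclasses make nodes hashable, so a larger implementation could cache results for shared sub-graphs instead of re-evaluating them.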