International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 9 - Number 9 |
Year of Publication: 2010 |
Authors: Sunil Joshi, Dr. R. S . Jadon, Dr. R. C. Jain |
10.5120/1410-1904 |
Sunil Joshi, Dr. R. S . Jadon, Dr. R. C. Jain . Article:An Implementation of Frequent Pattern Mining Algorithm using Dynamic Function. International Journal of Computer Applications. 9, 9 ( November 2010), 37-41. DOI=10.5120/1410-1904
An Important Problem in Data Mining in Various Fields like Medicine, Telecommunications and World Wide Web is Discovering Patterns. Frequent patterns mining is the focused research topic in association rule analysis. Apriori algorithm is a classical algorithm of association rule mining. Lots of algorithms for mining association rules and their mutations are proposed on basis of Apriori Algorithm. Most of the previous studies adopt Apriori-like algorithms which generate-and-test candidates and improving algorithm strategy and structure but no one concentrate on the structure of database. A simple approach is if we implement in Transposed database then result is very fast. Recently, different works proposed a new way to mine patterns in transposed databases where a database with thousands of attributes but only tens of objects. In this case, mining the transposed database runs through a smaller search space. In this paper, we systematically explore the search space of frequent patterns mining and represent database in transposed form. We developed an algorithm (termed DFPMT—A Dynamic Approach for Frequent Patterns Mining Using Transposition of Database) for mining frequent patterns which are based on Apriori algorithm and used Dynamic function for Longest Common Subsequence [1]. The main distinguishing factors among the proposed schemes is the database stores in transposed form and in each iteration database is filter /reduce by generating LCS of transaction id for each pattern. Our solutions provide faster result. A quantitative exploration of these tradeoffs is conducted through an extensive experimental study on synthetic and real-life data sets.