We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 November 2024
Reseach Article

User Assistance for Effective Data Mining

by Pankaj S. Kulkarni, Varsha C. Belokar, S. S. Sane
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 58 - Number 5
Year of Publication: 2012
Authors: Pankaj S. Kulkarni, Varsha C. Belokar, S. S. Sane
10.5120/9282-3473

Pankaj S. Kulkarni, Varsha C. Belokar, S. S. Sane . User Assistance for Effective Data Mining. International Journal of Computer Applications. 58, 5 ( November 2012), 35-39. DOI=10.5120/9282-3473

@article{ 10.5120/9282-3473,
author = { Pankaj S. Kulkarni, Varsha C. Belokar, S. S. Sane },
title = { User Assistance for Effective Data Mining },
journal = { International Journal of Computer Applications },
issue_date = { November 2012 },
volume = { 58 },
number = { 5 },
month = { November },
year = { 2012 },
issn = { 0975-8887 },
pages = { 35-39 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume58/number5/9282-3473/ },
doi = { 10.5120/9282-3473 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:01:42.768104+05:30
%A Pankaj S. Kulkarni
%A Varsha C. Belokar
%A S. S. Sane
%T User Assistance for Effective Data Mining
%J International Journal of Computer Applications
%@ 0975-8887
%V 58
%N 5
%P 35-39
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Today, several tools are available for solving data mining problems, both in open source and commercial category. For solving classification problems these tools provide variety of strategies such as decision tree, neural networks, lazy classifiers etc. For each strategy, the tools allow the user to select specific values for large number parameters[1] for e. g. in case of a neural network classifier, user needs to provide values for parameters such as epochs, learning rate, momentum etc . Although default setting for such parameters is provided by tools, it is often found that the classifier performance (accuracy) may be enhanced by making series of experiments with different values for these parameters. Thus, for a novice user it is difficult to guess proper values for these parameters and the only option is to try with series of experiments which is time consuming. This paper aims at developing a database to record the nature of data such as number and type of attributes, presence or absence of missing values etc along with various values for building classifier models and the accuracy of the classifier. Such a data is then can be made available to novice users to build a model based on past experience. The work also aims at developing required forms reports

References
  1. Data Mining: A Knowledge Discovery Approach, K. Cios, W. Pedrycz, R. Swiniarski, L. Kurgan, Springer, ISBN: 978-0-387-33333-5, 2007.
  2. Data Mining: Concepts, Models, Methods, and Algorithms, Mehmed Kantardzic, ISBN: 0471228524, Wiley-IEEE Press, 2002.
  3. Ian Witten and Eibe Frank, Data Mining: Practical Machine Learning Tools and Techniques, 2nd Edition, Morgan Kaufmann, ISBN 0120884070,2005.
  4. WEKA manual.
  5. Zdravko Markov, Ingrid Russell, An Introduction to the Weka DataMining System
Index Terms

Computer Science
Information Sciences

Keywords

Data Mining classification classifier filter WEKA