CFP last date
20 December 2024
Reseach Article

Mining of High Dimensional Data using Efficient Feature Subset Selection Clustering Algorithm (WEKA)

by Lakshmi Sarika T, Ch. Samsonu, B Tarakeswara Rao, B. Satyanarayana Reddy
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 107 - Number 6
Year of Publication: 2014
Authors: Lakshmi Sarika T, Ch. Samsonu, B Tarakeswara Rao, B. Satyanarayana Reddy
10.5120/18753-0009

Lakshmi Sarika T, Ch. Samsonu, B Tarakeswara Rao, B. Satyanarayana Reddy . Mining of High Dimensional Data using Efficient Feature Subset Selection Clustering Algorithm (WEKA). International Journal of Computer Applications. 107, 6 ( December 2014), 7-12. DOI=10.5120/18753-0009

@article{ 10.5120/18753-0009,
author = { Lakshmi Sarika T, Ch. Samsonu, B Tarakeswara Rao, B. Satyanarayana Reddy },
title = { Mining of High Dimensional Data using Efficient Feature Subset Selection Clustering Algorithm (WEKA) },
journal = { International Journal of Computer Applications },
issue_date = { December 2014 },
volume = { 107 },
number = { 6 },
month = { December },
year = { 2014 },
issn = { 0975-8887 },
pages = { 7-12 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume107/number6/18753-0009/ },
doi = { 10.5120/18753-0009 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:40:20.598584+05:30
%A Lakshmi Sarika T
%A Ch. Samsonu
%A B Tarakeswara Rao
%A B. Satyanarayana Reddy
%T Mining of High Dimensional Data using Efficient Feature Subset Selection Clustering Algorithm (WEKA)
%J International Journal of Computer Applications
%@ 0975-8887
%V 107
%N 6
%P 7-12
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

We exhibited the thought of data mining through the free and open source programming Waikato Environment for Knowledge Analysis (WEKA), which allows you to burrow own data for examples and cases. We moreover depicted about the first methodology for data mining — backslide — which allows you to anticipate a numerical worth for a given set of insight qualities. This method for dismemberment is most easy to perform and the base fit system for data mining, yet it filled a not too bad need as a prolog to WEKA and gave a not too bad example of how unrefined data can be changed into convincing information. We will take you through two additional data mining techniques that are hardly more mind boggling than a backslide model, however all the more compelling in their individual goals. Where a backslide model could simply accommodate you a numerical yield with specific inputs, these additional models grant you to translate your data particularly data mining is about applying the right model to your data. You could have the best data about your customers (whatever that even means), however in case you don't have any kind of effect the right models to it, it will just be refuse. Consider this an exchange way: If you recently used backslide models, which make a numerical yield, how would Amazon have the ability to let you know "Distinctive Customers Who Bought X Also Bought Y?" There's no numerical limit that could accommodate you this kind of information. So we should delve into the two additional models you can use with your witness.

References
  1. Johannes grabmeier andreas rudolph: Techniques of Cluster Algorithms in Data Mining May 23, 2001 Kluwer Academic Publishers. Manufactured in the Netherlands.
  2. Karthikeyan. P1, Saravanan. P2, Vanitha. E: High Dimensional Data Clustering Using Fast Cluster Based Feature Selection, March 2014 Int. Journal of Engineering Research and Applications ISSN: 2248-9622.
  3. YongSeog Kim, W. Nick Street, and Filippo Menczer: Feature Selection in Data Mining, University of Iowa, USA.
  4. M. Ramaswami and R. Bhaskaran: A Study on Feature Selection Techniques in Educational Data Mining journal of computing, volume 1, issue 1, december 2009, issn: 2151-9617.
  5. Tarek Amr: Survey on Feature Selection.
  6. Michael Abernethy: Data mining with WEKA, Part 2: Classification and clustering IBM.
  7. Marcos Evandro Cintra, Trevor P. Martin, Maria Carolina Monard, and Heloisa de Arruda Camargo: Feature Subset Selection Using a Fuzzy Method 200 International Conference on Intelligent Human-Machine Systems and Cybernetics.
  8. Isabelle Guyon, Andre Elisseeff: An Introduction to Variable and Feature Selection, Journal of Machine Learning Research 3 (2003) 1157-1182.
  9. YongSeog Kim, W. Nick Street, and Filippo Menczer: Feature Selection in Data Mining, University of Iowa, USA.
  10. Marie Gaudard: Interactive Data Mining and Design of Experiments: the JMP® Partition and Custom Design Platforms, March 2006 North Haven Group.
Index Terms

Computer Science
Information Sciences

Keywords

WEKA Clustering Classification Feature.