CFP last date
20 January 2025
Reseach Article

Multicore Processing for Classification and Clustering Algorithms

Published on March 2012 by V. Vaitheeshwaran, Kapil Kumar Nagwanshi, T. V. Rao
2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013)
Foundation of Computer Science USA
NCIPET - Number 12
March 2012
Authors: V. Vaitheeshwaran, Kapil Kumar Nagwanshi, T. V. Rao
27399337-e836-4e01-97f1-bb8998ed87eb

V. Vaitheeshwaran, Kapil Kumar Nagwanshi, T. V. Rao . Multicore Processing for Classification and Clustering Algorithms. 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013). NCIPET, 12 (March 2012), 20-24.

@article{
author = { V. Vaitheeshwaran, Kapil Kumar Nagwanshi, T. V. Rao },
title = { Multicore Processing for Classification and Clustering Algorithms },
journal = { 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013) },
issue_date = { March 2012 },
volume = { NCIPET },
number = { 12 },
month = { March },
year = { 2012 },
issn = 0975-8887,
pages = { 20-24 },
numpages = 5,
url = { /proceedings/ncipet/number12/5280-1092/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013)
%A V. Vaitheeshwaran
%A Kapil Kumar Nagwanshi
%A T. V. Rao
%T Multicore Processing for Classification and Clustering Algorithms
%J 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013)
%@ 0975-8887
%V NCIPET
%N 12
%P 20-24
%D 2012
%I International Journal of Computer Applications
Abstract

Data Mining algorithms such as classification and clustering are the future of computation, though multidimensional data-processing is required. People are using multicore processors with GPU’s. Most of the programming languages doesn’t provide multiprocessing facilities and hence wastage of processing resources. Clustering and classification algorithms are more resource consuming. In this paper we have shown strategies to overcome such deficiencies using multicore processing platform OpenCL.

References
  1. C. Pizzuti and D. Talia, "P-AutoClass: Scalable Parallel Clustering for Mining Large Data Sets," IEEE Transactions On Knowledge And Data Engineering,vol. 15, no. 3, pp. 629-641, 2003.
  2. U. Fayyad, G. Piatesky-Shapiro and P. Smith, From Data Mining to Knowledge Discovery: An Overview, NY: AAAI/MIT Press, 1996.
  3. L. Hunter and D. States, "Bayesian Classification of Protein Structure," Expert, vol. 7, no. 4, pp. 67-75, 1992.
  4. C. Olson, " Parallel Algorithms for Hierarchical Clustering," Parallel Computing, vol. 21, pp. 1313-1325, 1995.
  5. D. Judd, P. McKinley and A. Jain, "Large-Scale Parallel Data Clustering," in Int'l Conf. Pattern Recognition, New York, 1996.
  6. J. Potts, Seeking Parallelism in Discovery Programs, Arlington: Master Thesis : Univ. of Texas, 1996.
  7. K. Stoffel and A. Belkoniene, "Parallel K-Means Clustering for Large Data Sets," in Parallel Processing, UK, 1999.
  8. R. Agrawal, T. Imielinski and A. Swami, "Database mining: A performance perspective," vol. 5, no. 6, p. 914–925, Dec 1993.
  9. S. Weiss and C. Kulikowski, Computer Systems that Learn. ,, vol. 1, New York: Morgan Kaufman, 1991.
  10. D. Michie, Machine Learning, Neural and Statistical Classification, vol. I, NJ: Ellis Horwood, 1994.
  11. J. Quinlan, Programs for Machine Learning, vol. I, New York: Morgan Kaufman, 1999.
  12. R. Agrawal, "An interval classifier for database mining applications," in VLDB Conference, New York, Aug 1992.
  13. J. Catlett, Megainduction Machine Learning on Very Large Databases. PhD thesis,, vol. I, Sydney: Univ. of Sydney, 1991.
  14. M. Mehta, R. Agrawal and J. Rissanen, "SLIQ: A fast scalable classifier for data mining," in 5th Intl. Conf. on Extending Database Technology, NJ, March 1996.
  15. J. Shafer, R. Agrawal and M. Mehta., "SPRINT: A scalable parallel classifier for data mining," in 22nd VLDB Conferenc, NJ, Sept 1996.
  16. K. Alsabti, S. Ranka and V. Singh, "CLOUDS: A decision tree classifier for large datasets," in 4th Intl. Conf. on Knowledge Discovery and DataMining, Aug 1998.
  17. D. Fifield, Distributed tree construction from large data-sets: Bachelor Thesis,, Australian Natl. Univ., 1992.
  18. L. Breiman, Classification and Regression Trees, National Conference on Innovative Paradigms in Engineering & Technology (NCIPET-2012) Proceedings published by International Journal of Computer Applications® (IJCA) 409 Belmont: Wadsworth, 1984.
  19. M. Joshi, G. Karypis and V. Kumar, ScalParC: A scalable and parallel classification algorithm for mining large datasets, Intl. Parallel Processing Symp, 1998.
  20. "Technical Brief: NVIDIA GeForce 8800 GPU architecture overview," [Online]. Available: www.nvidia.com.
  21. "NVIDIA’s next generation CUDA compute architecture: Fermi," [Online]. Available: http://www.nvidia.com/content/ PDF/fermi_white_papers/NVIDIAFermiComputeArchitectureWhitepaper.pdf.
  22. Z. Fan, F. Qiu, A. Kaufman and S. Yoakum-Stover, "GPU cluster for high performance computing," NY, 2004.
  23. "ATI Mobility Radeon HD 5870 GPU specifications," [Online]. Available: http://www.amd.com/us/products/notebook/graphics/ati-mobility-hd-5800/Pages/hd-5870-specs.aspx.
  24. D. Judd, P. McKinley and A. Jain, "Performance Evaluation on Large-Scale Parallel Clustering in NOW Environments," in Eighth SIAM Conf. Parallel Processing for Scientific Computing, Mar 1997.
Index Terms

Computer Science
Information Sciences

Keywords

Parallel Processing Clustering Classification OpenCL CUDA NVIDIA AMD GPU