Research Article

A Comparative Study on K Means and PAM Algorithm using Physical Characters of Different Varieties of Mango in India

by Bhaskar Mondal, J. Paul Choudhury
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 78 - Number 5
Year of Publication: 2013
Authors: Bhaskar Mondal, J. Paul Choudhury

Bhaskar Mondal, J. Paul Choudhury. A Comparative Study on K Means and PAM Algorithm using Physical Characters of Different Varieties of Mango in India. International Journal of Computer Applications. 78, 5 (September 2013), 21-24. DOI=10.5120/13485-1189

@article{ 10.5120/13485-1189,
author = { Bhaskar Mondal and J. Paul Choudhury },
title = { A Comparative Study on K Means and PAM Algorithm using Physical Characters of Different Varieties of Mango in India },
journal = { International Journal of Computer Applications },
issue_date = { September 2013 },
volume = { 78 },
number = { 5 },
month = { September },
year = { 2013 },
issn = { 0975-8887 },
pages = { 21-24 },
numpages = {9},
url = { },
doi = { 10.5120/13485-1189 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}

%0 Journal Article
%1 2024-02-06T21:50:48.870005+05:30
%A Bhaskar Mondal
%A J. Paul Choudhury
%T A Comparative Study on K Means and PAM Algorithm using Physical Characters of Different Varieties of Mango in India
%J International Journal of Computer Applications
%@ 0975-8887
%V 78
%N 5
%P 21-24
%D 2013
%I Foundation of Computer Science (FCS), NY, USA

Clustering is one of the most important and popular techniques for finding patterns and relationships in databases. This paper presents a comparative study of two clustering techniques, k-means and k-medoid (PAM), applied with different distance measures to classify varieties of mango based on the physical characters of the fruit. Because the purity of a clustering result depends on the distance measure used in the algorithm, the results have also been validated using different distance measures. Classification of agricultural data remains a challenge because of its high dimensionality and noise. This type of study may be helpful for agricultural research as well as for the field of science and technology.
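
The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the naive "first k points" initialisation, the choice of Euclidean and Manhattan distances, and the toy two-dimensional points are all assumptions made for illustration, and the points are not the mango measurements used in the paper. The sketch shows the main difference between the two methods: k-means places each centre at the coordinate mean of its cluster, while a PAM-style k-medoid search keeps each centre on an actual data point and accepts any distance function.

```python
import math

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def assign(points, centers, dist):
    """Return, for every point, the index of its nearest center."""
    return [min(range(len(centers)), key=lambda c: dist(p, centers[c]))
            for p in points]

def k_means(points, k, iters=100):
    """Lloyd-style k-means: centers are coordinate means (Euclidean distance)."""
    centers = [points[i] for i in range(k)]  # assumption: naive initialisation
    for _ in range(iters):
        labels = assign(points, centers, euclidean)
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append(
                    tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centers.append(centers[c])  # keep an empty cluster's center
        if new_centers == centers:
            break
        centers = new_centers
    return assign(points, centers, euclidean), centers

def total_cost(points, medoids, dist):
    """Sum of distances from each point to its nearest medoid."""
    return sum(min(dist(p, points[m]) for m in medoids) for p in points)

def pam(points, k, dist):
    """PAM-style k-medoids: swap a medoid for a non-medoid while cost drops."""
    medoids = list(range(k))  # assumption: naive initialisation
    best = total_cost(points, medoids, dist)
    improved = True
    while improved:
        improved = False
        for pos in range(k):
            for cand in range(len(points)):
                if cand in medoids:
                    continue
                trial = medoids[:pos] + [cand] + medoids[pos + 1:]
                cost = total_cost(points, trial, dist)
                if cost < best:
                    medoids, best, improved = trial, cost, True
    labels = assign(points, [points[m] for m in medoids], dist)
    return labels, medoids

# Toy points (illustrative only): two well-separated groups.
points = [(1.0, 1.0), (8.0, 8.0), (1.5, 2.0), (9.0, 8.5), (1.0, 0.5), (8.5, 9.5)]

km_labels, km_centers = k_means(points, 2)
print("k-means:", km_labels)
for name, dist in (("euclidean", euclidean), ("manhattan", manhattan)):
    labels, medoids = pam(points, 2, dist)
    print("PAM", name, labels, "medoids:", [points[m] for m in medoids])
```

On data this cleanly separated, both methods and both distance measures produce the same grouping. Differences of the kind the study examines would be expected to appear only on noisier, higher-dimensional data.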

Index Terms

Computer Science
Information Sciences


Clustering, k-means, k-medoid, PAM, distance