CFP last date
20 December 2024
Reseach Article

Article:Data Clustering Method for Discovering Clusters in Spatial Cancer Databases

by Ritu Chauhan, Harleen Kaur, M.Afshar Alam
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 10 - Number 6
Year of Publication: 2010
Authors: Ritu Chauhan, Harleen Kaur, M.Afshar Alam
10.5120/1487-2004

Ritu Chauhan, Harleen Kaur, M.Afshar Alam . Article:Data Clustering Method for Discovering Clusters in Spatial Cancer Databases. International Journal of Computer Applications. 10, 6 ( November 2010), 9-14. DOI=10.5120/1487-2004

@article{ 10.5120/1487-2004,
author = { Ritu Chauhan, Harleen Kaur, M.Afshar Alam },
title = { Article:Data Clustering Method for Discovering Clusters in Spatial Cancer Databases },
journal = { International Journal of Computer Applications },
issue_date = { November 2010 },
volume = { 10 },
number = { 6 },
month = { November },
year = { 2010 },
issn = { 0975-8887 },
pages = { 9-14 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume10/number6/1487-2004/ },
doi = { 10.5120/1487-2004 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T19:59:01.750364+05:30
%A Ritu Chauhan
%A Harleen Kaur
%A M.Afshar Alam
%T Article:Data Clustering Method for Discovering Clusters in Spatial Cancer Databases
%J International Journal of Computer Applications
%@ 0975-8887
%V 10
%N 6
%P 9-14
%D 2010
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The vast amount of hidden data in huge databases has created tremendous interests in the field of data mining. This paper discusses the data analytical tools and data mining techniques to analyze the medical data as well as spatial data. Spatial data mining includes discovery of interesting and useful patterns from spatial databases by grouping the objects into clusters. This study focuses on discrete and continuous spatial medical databases on which clustering techniques are applied and the efficient clusters were formed. The clusters of arbitrary shapes are formed if the data is continuous in nature. Furthermore, this application investigated data mining techniques such as classical clustering and hierarchical clustering on the spatial data set to generate the efficient clusters. The experimental results showed that there are certain facts that are evolved and can not be superficially retrieved from raw data.

References
  1. Rao, Y.N, Sudir Gupta and S.P. Agarwal 2003. National Cancer Control Programme:Current status and strategies, 50 years of cancer control in India,NCD Section, Director General of Health.
  2. Jain, A.K., Murty M.N., and Flynn P.J. (1999): Data Clustering: A Review.
  3. M. Ester, H.-P. Kriegel, J. Sander, and X. Xu.1996. A density-based algorithm for discovering clusters in large spatial databases. KDD'96.
  4. Ng R.T., and Han J. 1994. Efficient and Effective Clustering Methods for Spatial Data Mining, Proc. 20th Int. Conf. on Very Large Data Bases, Chile.
  5. W. Wang, J. Yang, and R. Muntz, STING: A Statistical Information grid approach to spatial data mining, Proc. 23rd1nt. Conf. on Very Large Databases, Morgan Kaufmann, pp. 186-195 (1997).
  6. T. Zhang, R. Ramakrishnan, and M. L1nvy, B1RCH: An Efficient Data C1ustering Method for Very Large Databases, Proc. ACM SIGMOD Int’L Conf. On Management of Data, ACM Press, pp. 103-114 (1996).
  7. J. Han and M. Kamber. Data Mining: Concepts and Techniques. Morgan Kaufmann, 2001.
  8. L. Kaufinan, and P.J. Rousseeuw, Finding Groups in Data: an Introduction to Cluster Analysis, John Wiley & Sons1990.
  9. Y. Zhao and G. Karypis. Evaluation of hierarchical clustering algorithms for document datasets. In CIKM, 2002.
  10. http://eric.univlyon2.fr/~ricco/tanagra/en/tanagra.html.
  11. Surveillance, Epidemiology, and End Results (SEER) Program (www.seer.cancer.gov)Public-Use Data (1973-2002), National Cancer Institute, DCCPS, Surveillance Research Program, Cancer Statistics Branch, released April 2005.
  12. Mihael Ankerst, Markus M. Breunig, Hans-Peter Kriegel, Jörg Sander (1999). "OPTICS: Ordering Points to Identify the Clustering Structure". ACM SIGMOD international conference on Management of data.
  13. U.M. Fayyad and P. Smyth. Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press, Menlo Park, CA, 1996.
  14. Kaur H, Wasan S K, Al-Hegami A S and Bhatnagar V, A Unified Approach for Discovery of Interesting Association Rules in Medical Databases, Advances in Data Mining, Lecture Notes in Artificial Intelligence, Vol. 4065, Springer-Verlag, Berlin, Heidelberg (2006).
  15. Kaur H and Wasan S K, An Integrated Approach in Medical Decision Making for Eliciting Knowledge, Web-based Applications in Health Care & Biomedicine, Annals of Information Systems (AoIS), ed. A. Lazakidou, Springer 2009.
  16. M. S. Chen, J. Han, and P. S. Yu. Data mining: an overview from database perspective. IEEE Trans. On Knowledge and Data Engineering, 5(1):866—883, Dec.1996
Index Terms

Computer Science
Information Sciences

Keywords

Data Mining Clustering K-means Hierarchical agglomerative clustering (HAC) SEER