International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 155 - Number 10
Year of Publication: 2016
Authors: V. Mahalakshmi, M. Govindarajan
DOI: 10.5120/ijca2016912451
V. Mahalakshmi, M. Govindarajan. Comparison of Outlier Detection Methods in Diabetes Data. International Journal of Computer Applications. 155, 10 (Dec 2016), 28-32. DOI=10.5120/ijca2016912451
An outlier is an observation that deviates markedly from the other observations in a data set. Identifying outliers can lead to the discovery of useful and meaningful knowledge, and outlier detection has been widely studied in recent decades. Most sophisticated data mining methods address the problem only partially and can be improved by treating it more directly. Outlier detection can reveal unexpected facts in areas such as credit card fraud detection, calling card fraud detection, criminal behavior discovery, and network intrusion detection. This paper discusses and compares outlier detection approaches from a data mining perspective, grouped into distance-based and density-based approaches.
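As a rough illustration of the two families named in the abstract, the sketch below scores a toy data set in two ways. Everything in it is an assumption made for illustration: the points, the function names, and the choice of k are hypothetical and not taken from the paper. The distance-based score is the distance to the k-th nearest neighbour, and the density-based score is a simplified local outlier factor (LOF) that uses exactly k neighbours and assumes no duplicate points. These may or may not correspond to the specific methods the paper compares.

```python
import math

def knn(points, i, k):
    """Return the k nearest neighbours of points[i] as (distance, index) pairs."""
    dists = sorted(
        (math.dist(points[i], q), j) for j, q in enumerate(points) if j != i
    )
    return dists[:k]

def kth_distance_scores(points, k):
    """Distance-based score: distance from each point to its k-th nearest neighbour."""
    return [knn(points, i, k)[-1][0] for i in range(len(points))]

def lof_scores(points, k):
    """Density-based score: simplified local outlier factor.

    Simplification (assumption): each neighbourhood holds exactly k points,
    so ties at the k-th distance are not expanded, and the data are assumed
    to contain no duplicate points (which would give a zero reachability sum).
    """
    n = len(points)
    neigh = [knn(points, i, k) for i in range(n)]
    k_dist = [nb[-1][0] for nb in neigh]

    # Local reachability density: inverse of the mean reachability distance
    # from a point to its neighbours.
    lrd = []
    for i in range(n):
        reach = [max(k_dist[j], d) for d, j in neigh[i]]
        lrd.append(len(reach) / sum(reach))

    # LOF: mean density of the neighbours divided by the point's own density.
    return [
        sum(lrd[j] for _, j in neigh[i]) / (len(neigh[i]) * lrd[i])
        for i in range(n)
    ]

# Hypothetical toy data: a tight cluster plus one isolated point (index 5).
points = [(1.0, 1.0), (1.1, 0.9), (0.9, 1.1), (1.0, 1.2), (1.2, 1.0), (5.0, 5.0)]
k = 2

dist_scores = kth_distance_scores(points, k)
lof = lof_scores(points, k)

for p, d, s in zip(points, dist_scores, lof):
    print(p, round(d, 3), round(s, 3))
```

In both scorings a larger value means a more outlying point. The distance-based score depends on a single global scale, whereas LOF compares each point's density with that of its neighbours, so values near 1 indicate a point about as dense as its surroundings.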