International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 58 - Number 10 |
Year of Publication: 2012 |
Authors: Pranjali Kasture, Jayant Gadge |
10.5120/9317-3549 |
Pranjali Kasture, Jayant Gadge . Cluster based Outlier Detection. International Journal of Computer Applications. 58, 10 ( November 2012), 11-15. DOI=10.5120/9317-3549
Outlier detection is a fundamental issue in data mining, specifically it has been used to detect and remove anomalous objects from data. mining. The proposed approach to detect outlier includes three methods which are clustering, pruning and computing outlier score. For clustering k-means algorithm is used which partition the dataset into given number of clusters. In pruning, based on some distance measure, points which are closed to centroid of each cluster are pruned. For the unpruned points, local distance based outlier factor (LDOF) measure is calculated. A measure called LDOF, tells how much a point is deviating from its neighbors. The high LDOF value of a point indicates that the point is deviating more from its neighbors and probably it may be an outlier.