CFP last date
20 January 2025
Reseach Article

Detecting and Revamping of X-Outliers in Time Series Database

by S. Sridevi, S. Abirami, S. Rajaram
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 60 - Number 19
Year of Publication: 2012
Authors: S. Sridevi, S. Abirami, S. Rajaram
10.5120/9809-4385

S. Sridevi, S. Abirami, S. Rajaram . Detecting and Revamping of X-Outliers in Time Series Database. International Journal of Computer Applications. 60, 19 ( December 2012), 28-33. DOI=10.5120/9809-4385

@article{ 10.5120/9809-4385,
author = { S. Sridevi, S. Abirami, S. Rajaram },
title = { Detecting and Revamping of X-Outliers in Time Series Database },
journal = { International Journal of Computer Applications },
issue_date = { December 2012 },
volume = { 60 },
number = { 19 },
month = { December },
year = { 2012 },
issn = { 0975-8887 },
pages = { 28-33 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume60/number19/9809-4385/ },
doi = { 10.5120/9809-4385 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:07:26.181469+05:30
%A S. Sridevi
%A S. Abirami
%A S. Rajaram
%T Detecting and Revamping of X-Outliers in Time Series Database
%J International Journal of Computer Applications
%@ 0975-8887
%V 60
%N 19
%P 28-33
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Dataset with Outliers causes poor accuracy in future analysis of data mining tasks. To improve the performance of mining task, it is necessary to detect and revamp of outliers which are there in the dataset. Existing techniques like ARMA (Auto-Regressive Moving Average), ARIMA (Auto- Regressive Integrated Moving Average) and Multivariate Linear Gaussian state space model don't consider the periodicity for outlier detection. The above methods are used to find out only Y Outliers which are present in Y axis. These methods are not applicable to detect the time at which the peculiar data occurs (so called X-Outliers). This paper focuses different methods for detecting and revamping of X-Outliers that have abnormal data according to a known periodicity. These are practically applied in fraud detection, Market-basket analysis and medical applications to detect certain abnormal diseases. First the data is modeled to get the trend of the data and to remove noises by means of kernel smoothing. Next the outliers are detected by similarity measurements. If the dataset has outliers it can be replaced by considering periodic indices from the historical dataset. The performance of system is measured by precision, recall and F Score. The proposed method is tested with three different time series datasets namely, Electricity power consumption dataset, Weather dataset and Electricity price market dataset. Experimental results have demonstrated that the proposed method is effective and accurate than the earlier methods.

References
  1. Ning Zhong, Yiyu (Y. Y. ) Yao, Muneaki Ohshima, "Peculiarity Oriented Multidatabase Mining", IEEE Trans on Knowledge and Data Engg, Vol. 15, No. 4,pp. 613-628, July/August 2003.
  2. J. Chen, W. Li, A. Lau, J. Cao, and K. Wang, "Automated load curve data cleansing in power systems",IEEE Trans. Smart Grid, vol. 1, no. 2, pp. 213-221, Sep. 2010.
  3. V. J. Hodge and J. Austin, "A survey of outlier detection methodologies", Artif. Intell. Rev. , vol. 22, no. 2, pp. 5-126, Oct. 2004.
  4. A. J. Fox, "Outliers in time series",J. Roy. Stat. Soc. B ,, Methodol. vol. 34, pp. 350-363, 1972.
  5. G. M. Ljung,"On outlier detection in time series",J. Roy. Stat. Soc. B, Methodol. , vol. 55, pp. 559-567, 1993.
  6. W. Hardle, "Applied Nonparametric Regression", Cambridge,U. K . Cambridge Univ. Prss,1990.
  7. B. Abraham and A. Chuang,"Outlier detection and time series modeling",Technometrics , vol. 31, pp. 241-248, 1989
  8. W. Schmid,"The multiple outlier problems in time series analysis",Australian. J. Statist. , vol. 28, pp. 400-413, 1986
  9. I. Chang and C. iao,"Estimation of time series parameters in the presence of outliers",Technometrics, vol. 30, pp. 193-204, 1988
  10. D. Gasgupta and S. Forrest,"Novelty detection in time series data using ideas from immunology", in Proc. Int. Conf. Intelligent Systems, 1996, pp. 82-87.
  11. V. Barnett and T. Lewis, " Outliers in Statistical Data",3 rd edition, New York, Wiley 1994,pp 397-415.
  12. Zhihui Guo, Wenyuan Li, Fellow, Adriel Lau, Tito Inga-Rojas, and Ke Wang, "Detecting X-Outliers in Load Curve Data in Power Systems",IEEE trans. on power systems, VOL. 27, NO. 2, MAY 2012.
Index Terms

Computer Science
Information Sciences

Keywords

Outlier Time series database Smoothing methods Revamping