We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

Load Shedding using Window Aggregation Queries on Data Streams

by S. Senthamilarasu, M. Hemalatha
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 54 - Number 9
Year of Publication: 2012
Authors: S. Senthamilarasu, M. Hemalatha
10.5120/8598-2362

S. Senthamilarasu, M. Hemalatha . Load Shedding using Window Aggregation Queries on Data Streams. International Journal of Computer Applications. 54, 9 ( September 2012), 42-49. DOI=10.5120/8598-2362

@article{ 10.5120/8598-2362,
author = { S. Senthamilarasu, M. Hemalatha },
title = { Load Shedding using Window Aggregation Queries on Data Streams },
journal = { International Journal of Computer Applications },
issue_date = { September 2012 },
volume = { 54 },
number = { 9 },
month = { September },
year = { 2012 },
issn = { 0975-8887 },
pages = { 42-49 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume54/number9/8598-2362/ },
doi = { 10.5120/8598-2362 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:55:37.256235+05:30
%A S. Senthamilarasu
%A M. Hemalatha
%T Load Shedding using Window Aggregation Queries on Data Streams
%J International Journal of Computer Applications
%@ 0975-8887
%V 54
%N 9
%P 42-49
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The processes of extracting knowledge structures for continuous, rapid records are known as the Data Stream Mining. The main issue in stream mining is handling streams of elements delivered rapidly which makes it infeasible to store everything in active storage. To overcome this problem of handling voluminous data we exposed a novel load shedding system using window based aggregate function of the data stream in which we accept those tuples in the stream that meet a criterion. Accepted tuples are conceded to another process as a stream, while further tuples are dropped. This proposed model conceivably segregates the data input stream into windows and probabilistically decides which tuple to drop based on the window function. The best window aggregate function used for dropping tuples is identified with the three prediction models used in data mining they are Decision Tree, Naïve Bayes and Logistic Regression. The result shows that the cumulative distance and density rank functions outperforms the remaining methods. Distinct to prior methods, our method preserves uniformity of windows all over a query plan, and constantly distributes subsets of the original query responds with insignificant denial in the excellence of the consequence.

References
  1. Gert Brettlecker, Heiko Schuldt , Peter Fischer , Hans-Jörg Schek ,: Integration of Reliable Sensor Data Stream Management into Digital Libraries:.
  2. Georges HEBRAIL. : Data stream management and mining. In: Mining Massive Data Sets for Security : version 1 - 30 Apr 2010, pages 89-101.
  3. Helmy, Y. M. El Zanfaly, D. S. ; Othman, N. A. . :Prioritized query shedding technique for continuous queries over data streams. In: IEEE, Computer Engineering & Systems, 2009. ICCES 2009. International Conference on 14-16 Dec. 2009, Page (s): 418 - 422 .
  4. Rusu, F. Dobra, A. . :Sketching Sampled Data Streams. In: Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on March 29 -April 2 2009. Page (s): 381 – 392.
  5. Yunyi Zhang , Deyun Zhang ; Chongzheng Huang . : A Novel Adaptive Load Shedding Scheme for Data Stream Processing. In: IEEE, Future Generation Communication and Networking (FGCN 2007) on 6-8 Dec. 2007 Volume: 1 , Page (s): 378 - 384 .
  6. Das, A. Gehrke, J. ; Riedewald, M. :Semantic approximation of data stream joins Knowledge and Data Engineering. In: IEEE Transactions on Jan. 2005 Volume: 17, Issue: 1, Page (s): 44 - 59 .
  7. Kuen-Fang Jea, Chao-Wei Li, Chih-Wei Hsu, Ru-Ping Lin, Ssu-Fan Yen. :A load-controllable mining system for frequent-pattern discovery in dynamic data streams In: IEEE, Machine Learning and Cybernetics, International Conference on 11-14 July 2010, Vol. IV, Page (s): 2466-2471.
  8. Chao-Wei Li, Kuen-Fang Jea, Chih-Wei Hsu, Ru-Ping Lin, Ssu-Fan Yen. :A load shedding scheme for frequent pattern mining in transactional data streams. :IEEE, Fuzzy Systems and Knowledge Discovery, Eighth International Conference on 26-28 July 2011, Vol. II, Page (s): 1294- 1299 .
  9. B. Babcock, M. Datar, R. Motwani. :Load shedding for aggregation queries over data streams. In: IEEE, Data Engineering, Proceedings. 20th International Conference on 30 March-2 April 2004, page (s): 350-361.
  10. Zhang Longbo,Li Zhanhuai,Wang Zhenyou,Yu Min. :Semantic Load Shedding for Sliding Window Join-Aggregation Queries over Data Streams. In: IEEE, Convergence Information Technology, 2007. International Conference on 21-23 Nov. 2007, Page (s): 2152-2155.
  11. Chao-Wei Li, Kuen-Fang Jea, Ru-Ping Lin, Ssu-Fan Yen, Chih-Wei Hsu. : Mining frequent patterns from dynamic data streams with data load Management. In: The Journal of Systems and Software 85 (2012) 1346– 1362.
  12. Manganaris S. , Christensen M. , Zerkle D. , Hermiz K. : A data mining analysis of RTID alarms: Computer Networks, 34, 2000, page(s). 571-577.
  13. Shengliang Xu ,Magdalena Balazinska. : Sensor Data Stream Exploration for monitoring applications: DMSN'2011.
  14. Department of Transporation Coordinated Highways Action Respose Team,Maryland, http://www. chart. state. md. us/default. asp.
  15. Carlos H. C. Teixeira, Gustavo H. Orair, Wagner Meira Jr. , Srinivasan Parthasarathy. : An Ef?cient Algorithm for Outlier Detection in High Dimensional Real Databases.
  16. Mohamed Medhat Gaber, Shonali Krishnaswamy, Arkady Zaslavsky: Adaptive mining Techniques for Data Stream Using Algorithm Output Granularity.
  17. Mohamed Medhat Gaber, Arkady Zaslavsky and Shonali Krishnaswamy . :Mining Data Streams: A Review. In: SIGMOD Record, Vol. 34, No. 2, June 2005,Page(s)18-26.
  18. Jia WU, Zhihua CAI. : Attribute Weighting via Differential Evolution Algorithm for Attribute Weighted Naive Bayes (WNB). : Journal of Computational Information Systems 7:5 (2011) page(s) 1672-1679.
  19. PostgreSQL Documentation, http://www. postgresql. org.
  20. Knowledge Discovery in Databases- Confusion Matrix, http://www2. cs. uregina. ca.
  21. F. Akyildiz, W. Su, Y. Sankarasubramaniam, and E. Cayirci, . A survey on sensor networks,. IEEE Communications Magazine, vol. 40, no. 8,pp. 102. 114, August 2002.
  22. T. Arampatzis, J. Lygeros, and S. Manesis, . A survey of applications of wireless sensors and wireless sensor networks,. in Mediterranean Control Conference (Med05), 2005.
  23. E. Elnahrawy, . Research directions in sensor data streams: Solutions and challenges,. Rutgers University, Tech. Rep. DCIS-TR- 527, May 2003.
  24. E. Elnahrawy and B. Nath. Cleaning and querying noisy sensors. In Submitted for review, 2003.
  25. S. Tilak, N. B. Abu-Ghazaleh, and W. Heinzelman, . A taxonomy of wireless micro-sensor network models,. ACM SIGMOBILE Mobile Computing and Communications Review, vol. 6, no. 2, pp. 28. 36, April 2002.
  26. A. Lins, E. F. Nakamura, A. A. Loureiro, and C. J. Coelho Jr. , . Beanwatcher: A tool to generate multimedia monitoring applications for wireless sensor networks,. in Management of Multimedia Networks and Services, ser. Lecture Notes in Computer Science, A. Marshall and N. Agoulmine, Eds. , vol. 2839. Belfast, Northern Ireland: Springer-Verlag Heidelberg, September 2003, pp. 128. 141.
  27. LIns. A,Semi automatic Generation of monitoring applications for wireless networks. In: Proceedings of the 9th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA 2003). Vol (1), pp. 506-511.
  28. D. J. Abadi, W. Lindner, S. Madden, and J. Schuler,: An integration framework for sensor networks and data stream management systems. In: Proceedings of the Thirtieth International Conference on Very Large Data Bases. VLDB 2004, September 2004, pp. 1361. 1364.
  29. B. Babcock, S. Babu, M. Datar, R. Motwani, and J. Widom, . Models and issues in data stream systems,. in Proceedings of the twentyfirst ACM SIGMOD. SIGACT. SIGART symposium on Principles of database systems, June 2002, pp. 1. 16.
  30. S. R. Madden, M. J. Franklin, J. M. Hellerstein, and W. Hong, . Tinydb: An acquisitional query processing system for sensor networks,. ACM Transactions on Database Systems (TODS), vol. 30, no. 1, pp. 122. 173, March 2005.
  31. Y. Yao and J. Gehrke, . Query processing for sensor networks,. in First Conf. on Innovative Data Systems Research (CIDR), January 2003.
  32. T. Arampatzis, J. Lygeros, and S. Manesis, . A survey of applications of wireless sensors and wireless sensor networks,. in Mediterranean Control Conference (Med05), 2005.
  33. A. Lins, E. F. Nakamura, A. A. Loureiro, and C. J. Coelho Jr. , . Beanwatcher: A tool to generate multimedia monitoring applications for wireless sensor networks,. in Management of Multimedia Networks and Services, ser. Lecture Notes in Computer Science,vol. 2839. Belfast, Northern Ireland: Springer- Verlag Heidelberg, September 2003, pp. 128-141.
  34. Hua-Fu Li , Suh-Yin Lee. : Mining frequent itemsets over data streams using efficient
  35. Window sliding techniques: Expert Systems with Applications 36 (2009) page(s). 1466- 1477.
  36. Yun Chi, Philip S. Yu, Haixun Wang, Richard R. Muntz :Loadstar: A Load Shedding Scheme for Classifying Data Streams, Proceedings of the 31st VLDB Conference,Trondheim, Norway, 2005.
Index Terms

Computer Science
Information Sciences

Keywords

Data stream mining Windows functions Load Shedding Scheme