CFP last date
20 December 2024
Reseach Article

Scaling Up for High Dimensional Data in Data Stores and Streams

Published on December 2013 by G. V. Sam Kumar, S. Ramakrishnan
International Conference on Computing and information Technology 2013
Foundation of Computer Science USA
IC2IT - Number 2
December 2013
Authors: G. V. Sam Kumar, S. Ramakrishnan
626d9b1b-f9ec-4ba5-bb11-102535cc47f0

G. V. Sam Kumar, S. Ramakrishnan . Scaling Up for High Dimensional Data in Data Stores and Streams. International Conference on Computing and information Technology 2013. IC2IT, 2 (December 2013), 21-23.

@article{
author = { G. V. Sam Kumar, S. Ramakrishnan },
title = { Scaling Up for High Dimensional Data in Data Stores and Streams },
journal = { International Conference on Computing and information Technology 2013 },
issue_date = { December 2013 },
volume = { IC2IT },
number = { 2 },
month = { December },
year = { 2013 },
issn = 0975-8887,
pages = { 21-23 },
numpages = 3,
url = { /proceedings/ic2it/number2/14396-1328/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference on Computing and information Technology 2013
%A G. V. Sam Kumar
%A S. Ramakrishnan
%T Scaling Up for High Dimensional Data in Data Stores and Streams
%J International Conference on Computing and information Technology 2013
%@ 0975-8887
%V IC2IT
%N 2
%P 21-23
%D 2013
%I International Journal of Computer Applications
Abstract

The data in engineering and science has been on a massive scale and stored in gigantic storage devices. The data is moved in and out in the form of data streams. Data storage levels are reaching Yottabytes in terms of storage. Science and engineering transforms such data into rich and resourceful data. Intensive methods have been researched for high dimensionality. Science also uses high speed images and video data types in applications where data streams are reaching huge volumes with dynamic data distribution. Storage and computing such data is a challenging activity and especially in terms of system interactions and communications. Mining data streams is extracting knowledge in non stopping data streams. Research in this area has gained attraction due to the importance of its applications and the increasing potential of enhancement in streaming information. This paper discusses these challenges of data mining with a focus on issues like domain specific data integration, mining unstructured data, mining data streams.

References
  1. Gaber, M. M. , Krishnaswamy, S. and Zaslavsky, A. (2004). Cost-Efficient Mining Techniques for Data Streams. In Proc. Australasian Workshop on Data Mining and Web Intelligence (DMWI2004), Dunedin, New Zealand. CRPIT, 32. Purvis, M. , Ed. ACS.
  2. Papadimitriou, J. Sun, C. Faloutsos, "Streaming Pattern Discovery in Multiple Time-Series", Proceedings of the 31st VLDB Conference, Trondheim, Norway, 2005,p697-708, Mohamed Medhat Gaber, Arkady Zaslavsky and Shonali Krishnaswamy. "Mining Data Streams: A Review", VIC3145, Australia, ACM SIGMOD Record Vol. 34, No. 2; June 2005
  3. Maria Halkidi, "Quality assessment and Uncertainty Handling in Data Mining Process" http://www. edbt2000. unikonstanz de/phd-workshop/papers/Halkidi. pdf
  4. Fayyad, U. M. , G. P. Shapiro, P. Smyth. "From Data Mining to Knowledge Discovery in Databases", 0738-4602-1996, AI Magazine (Fall 1996): 37–53
  5. Jiawei Han, Micheline Kamber. "Data Mining: Concepts and Techniques", Morgan Kaufmann Publishers, Champaign:CS497JH, fall 2001, www. cs. sfu. ca/~han/DM_Book. htm
  6. Claude Seidman. "Data Mining with Microsoft SQL Server 2000 Technical Reference", ISBN: 0-7356-1271-4,amazon. com/Mining-Microsoft-Server-Technical-Reference/dp/0735612714
  7. Gaber, M, M. , Zaslavsky, A. , and Krishnaswamy, S. , A Cost-Efficient Model for Ubiquitous Data Stream Mining, Accepted for publication in the Tenth International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU 2004), Perugia Italy, July 4-9.
  8. Gaber, M, M. , Zaslavsky, A. , and Krishnaswamy, S. , (2004), Towards an Adaptive Approach for Mining Data Streams in Resource Constrained Environments, Accepted for publication in the Proceedings of Sixth International Conference on Data Warehousing and Knowledge Discovery - Industry Track (DaWak 2004), Zaragoza, Spain, 30 August - 3 September, Lecture Notes in Computer Science (LNCS), Springer Verlag.
  9. Garofalakis M. , Gehrke J. , Rastogi R. : Querying and mining data streams: you only get one look a tutorial. SIGMOD Conference 2002: 635. (2002).
  10. Ganti V. , Gehrke J. , Ramakrishnan R. : Mining Data Streams under Block Evolution. SIGKDD Explorations 3(2): (2002) 1-10.
  11. Jiawei Han, Micheline Kamber. "Data Mining: Concepts and Techniques", Morgan Kaufmann Publishers, Champaign:CS497JH, fall 2001, www. cs. sfu. ca/~han/DM_Book. htm
  12. David Hand, Heikki Mannila, Padhraic Smyth. "Principles of Data Mining", ISBN: 026208290 MIT Press, Cambridge, MA, 2001.
  13. He, B. , Patel, M. , Zhang, Z. , Chang, K. C. : Accessing the deep Web: A survey. Communications of the ACM, 50(2):94{101, 2007. . Sa_s, F. , Pernelle, N. , Rousset, M. C. : Combining a logical and a numerical method for data reconciliation. J. Data Semantics, 12:66{94, 2009
  14. World Wide Web Consortium. W3C Semantic Web Activity, 1994. http://www. w3. org/2001/sw/
  15. Mohamed Medhat Gaber, Arkady Zaslavsky and Shonali Krishnaswamy. "Mining Data Streams: A Review", VIC3145, Australia, ACM SIGMOD Record Vol. 34, No. 2; June 2005.
  16. Fernando Crespoa, Richard Weberb. "A methodology for dynamic data mining based on fuzzy clustering", Fuzzy Sets and Systems 150 (2005) 267–284.
Index Terms

Computer Science
Information Sciences

Keywords

Data Streams Unbound Data Data Structures.