CFP last date
20 December 2024
Reseach Article

Data Dashboard- Integrating Data Mining with Data Deduplication

by Vitasta Abrol, Jyoti Malhotra
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 71 - Number 22
Year of Publication: 2013
Authors: Vitasta Abrol, Jyoti Malhotra
10.5120/12620-9332

Vitasta Abrol, Jyoti Malhotra . Data Dashboard- Integrating Data Mining with Data Deduplication. International Journal of Computer Applications. 71, 22 ( June 2013), 28-33. DOI=10.5120/12620-9332

@article{ 10.5120/12620-9332,
author = { Vitasta Abrol, Jyoti Malhotra },
title = { Data Dashboard- Integrating Data Mining with Data Deduplication },
journal = { International Journal of Computer Applications },
issue_date = { June 2013 },
volume = { 71 },
number = { 22 },
month = { June },
year = { 2013 },
issn = { 0975-8887 },
pages = { 28-33 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume71/number22/12620-9332/ },
doi = { 10.5120/12620-9332 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:36:23.145702+05:30
%A Vitasta Abrol
%A Jyoti Malhotra
%T Data Dashboard- Integrating Data Mining with Data Deduplication
%J International Journal of Computer Applications
%@ 0975-8887
%V 71
%N 22
%P 28-33
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Many applications deal with huge amount of data and that scattered data needs to be transformed into something relevant and meaningful. To make sense of such data is the need of many applications and areas of technology. The data that is already present is very huge, noisy and has a complex structure. We are working on the idea of integrating data mining with data deduplication. Data Dashboard is a tool which can take complex data involving various dimensions and simultaneously uses data deduplication algorithms that help in removing redundancy in the data up to 95%. Thus it provides high reliability, low disk space and high throughput. The tool then uses this data mined information and deploys it on dynamic platform such as web which provides ease to user to access huge database.

References
  1. Ming-Syan Chen, "Data Mining: An Overview from a Database Perspective", IEEE Transactions on Knowledge and Data Engineering, VOL 8, NO. 6.
  2. Michael Steinbach, Levent Ertöz, and Vipin Kumar "The Challenges of Clustering High Dimensional Data".
  3. Mamatha Nadikota, Satya P Kumar Somayajula,Dr. C. P. V. N. J. Mohan Rao ,CSE Department,Avanthi College of Engg &Tech ,Tamaram,Visakhapatnam,A,P. . ,India "Hashing and Pipelining Techniques for Association Rule Mining" International Journal of Computer Science and Information Technologies, Vol. 2 (4), 2011, 1448-1452.
  4. Ying-Hsiang Wen, Jen-Wei Huang, and Ming-Syan Chen,"Hardware-Enhanced Association Rule Mining with Hashing and Pipelining" IEEE Transactions on knowledge and data engineering, VOL. 20, NO. 6, JUNE 2008.
  5. Alissar NASSER, Denis HAMAD, Chaiban NASR, KAIST Ky Ho Park CORE Lab. KAIST "K-means Clustering Algorithm in Projected Spaces".
  6. R. Agrawal and al, "Automatic Clustering of High Dimensional Data used for Data Mining Applications," Proc. ACM SIGMOD conf. , pp. 94- 105, 1998.
  7. N. Mandagere, P. and S. Uttamchandani. "Demystifying Data Deduplication" presented In Companion '08: Proceedings of ACM/IFIP/USENIX Middleware '08, pages 12-17,2008
Index Terms

Computer Science
Information Sciences

Keywords

Data mining Data Deduplication Clustering Hashing API