CFP last date
20 January 2025
Reseach Article

Privacy Preserving Combinatorial Function for Multi-Partitioned Data Sets

by V. S. Prakash, A. Shanmugam, P. Murugesan
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 44 - Number 8
Year of Publication: 2012
Authors: V. S. Prakash, A. Shanmugam, P. Murugesan
10.5120/6281-8450

V. S. Prakash, A. Shanmugam, P. Murugesan . Privacy Preserving Combinatorial Function for Multi-Partitioned Data Sets. International Journal of Computer Applications. 44, 8 ( April 2012), 8-10. DOI=10.5120/6281-8450

@article{ 10.5120/6281-8450,
author = { V. S. Prakash, A. Shanmugam, P. Murugesan },
title = { Privacy Preserving Combinatorial Function for Multi-Partitioned Data Sets },
journal = { International Journal of Computer Applications },
issue_date = { April 2012 },
volume = { 44 },
number = { 8 },
month = { April },
year = { 2012 },
issn = { 0975-8887 },
pages = { 8-10 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume44/number8/6281-8450/ },
doi = { 10.5120/6281-8450 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:34:59.318582+05:30
%A V. S. Prakash
%A A. Shanmugam
%A P. Murugesan
%T Privacy Preserving Combinatorial Function for Multi-Partitioned Data Sets
%J International Journal of Computer Applications
%@ 0975-8887
%V 44
%N 8
%P 8-10
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

To preserve the electronic fitness records of data mining applications, Data perturbation is used. It is a form of privacy-preserving data mining. An improved amount and range of information stored in databases has direct to an enhancement in the desire for ranked and "best match" queries. Such queries are mainly applicable when dealing with privacy-sensitive information. To facilitate privacy preservation in data mining or machine learning algorithms over horizontally partitioned or vertically partitioned data, many protocols have been proposed using SMC and various secure building blocks. In this work, we plan to present an effective and efficient cluster based privacy preserving data perturbation technique to mine Multi-partitioned data sets that comprises of both vertical and horizontal data sets which is current demand of e-business and e-commerce data mining environment. To evaluate a trade of between data privacy and transparency of individual's data, data perturbation technique is presented with validation and authentication. In multi-partitioned data distribution, data perturbation raised ambiguity between vertical and horizontal partitions of the data. To overcome the ambiguity, we plan to introduce divisive k-neighbor clusters for multi-partitioned data sets. The performance is evaluated with bench data sets obtained from popular e-business / e-commerce sites. (Amazon, e-bay etc. ).

References
  1. Jaideep Vaidya,, Christopher W. Clifton , "Privacy-Preserving Kth Element Score over Vertically Partitioned Data", IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 21, NO. 2, FEBRUARY 2009
  2. Shuguo Han, Wee Keong Ng, Li Wan, and Vincent C. S. Lee" Privacy-Preserving Gradient-Descent Methods", IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 22, NO. 6, JUNE 2010
  3. W. Jiang and C. Clifton, "Ac-Framework for Privacy-Preserving Collaboration," Proc. Seventh SIAM Int'l Conf. Data Mining (SDM), 2007.
  4. M. Shaneck, Y. Kim, and V. Kumar, "Privacy Preserving Nearest Neighbor Search," Proc. ICDM Workshops, pp. 541-545, 2006.
  5. Dasgupta, A. ; Kosara, R. , "Adaptive Privacy-Preserving Visualization Using Parallel Coordinates" Visualization and Computer Graphics, IEEE Transactions, Vol. 17, Issue: 12, Page No. 2241 – 2248, 2011.
  6. H. Yu, J. Vaidya, and X. Jiang, "Privacy-Preserving SVM Classification on Vertically Partitioned Data," Proc. 10th Pacific- Asia Conf. Knowledge Discovery and Data Mining (PAKDD), pp. 647- 656, Apr. 2006.
  7. D. Metzler, "Using Gradient Descent to Optimize Language Modeling Smoothing Parameters," Proc. 30th Ann. Int'l ACM SIGIR, pp. 687-688, 2007.
  8. S. Han and W. K. Ng, "Privacy-Preserving Linear Fisher Discriminant Analysis," Proc. 12th Pacific-Asia Conf. Knowledge Discovery and Data Mining (PAKDD), May 2008
  9. S. Han and W. K. Ng, "Privacy-Preserving Genetic Algorithms for Rule Discovery," Proc. Ninth Int'l Conf. Data Warehousing and Knowledge Discovery (DaWak), pp. 407-417, Sept. 2007.
  10. Rozita A. Dara, Masoud Makrehchi, Mohamed S. Kamel" Filter-Based Data Partitioning for Training Multiple Classifier Systems", IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 22, NO. 4, APRIL 2010.
  11. Keng-Pei Lin and Ming-Syan Chen "On the Design and Analysis of the Privacy-Preserving SVM Classifier", IEEE Transactions on Knowledge and Data Engineering,Volume 23 Issue 11, November 2011
  12. Bhaduri, K. , Stefanski, M. D. , Srivastava, A. N. , "Privacy-Preserving Outlier Detection Through Random Nonlinear Data Distortion", Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on Vol. 41 , Issue: 1, Page No. 260 – 272, 2011
  13. R. Jenssen, D. Erdogmus, K. E. Hild, J. C. Principe, and T. Eltoft, "Information Cut for Clustering Using a Gradient Descent Approach," Pattern Recognition, vol. 40, no. 3, pp. 796-806, 2007.
Index Terms

Computer Science
Information Sciences

Keywords

Multi-partitioned Data Sets Privacy Preservation Data Perturbation Technique