Research Article

Multi-Relational Algebra and its Application to Unrealized Datasets used in C4.5

by Ranjan Baghel, Maitreyee Dutta
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 73 - Number 7
Year of Publication: 2013
Authors: Ranjan Baghel, Maitreyee Dutta
10.5120/12755-9710

Ranjan Baghel, Maitreyee Dutta. Multi-Relational Algebra and its Application to Unrealized Datasets used in C4.5. International Journal of Computer Applications. 73, 7 (July 2013), 25-28. DOI=10.5120/12755-9710

@article{ 10.5120/12755-9710,
author = { Ranjan Baghel and Maitreyee Dutta },
title = { Multi-Relational Algebra and its Application to Unrealized Datasets used in C4.5 },
journal = { International Journal of Computer Applications },
issue_date = { July 2013 },
volume = { 73 },
number = { 7 },
month = { July },
year = { 2013 },
issn = { 0975-8887 },
pages = { 25-28 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume73/number7/12755-9710/ },
doi = { 10.5120/12755-9710 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:39:27.812665+05:30
%A Ranjan Baghel
%A Maitreyee Dutta
%T Multi-Relational Algebra and its Application to Unrealized Datasets used in C4.5
%J International Journal of Computer Applications
%@ 0975-8887
%V 73
%N 7
%P 25-28
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Relational databases are grounded in relational algebra: every RDBMS operation derives from an operation of that algebra. The operations of relational algebra are defined on sets; data mining algorithms, however, generally require databases that follow the multiset philosophy, in which duplicate tuples are retained, to give better and more accurate results. Unrealized datasets ensure the confidentiality of the actual datasets in the data mining process. C4.5 is a classic algorithm that works on mixed real-world datasets. This paper proposes applying relational algebra for multisets to find the split criterion used in classification by the C4.5 algorithm. The results are obtained by modifying the original C4.5 algorithm in the Weka tool.
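
To make the idea of a multiset-based split criterion concrete, the following minimal sketch computes the gain ratio (the split criterion used by C4.5) over a bag of tuples, where each tuple carries a multiplicity. It is an illustration only and is not the authors' implementation or Weka code; the function names `entropy` and `gain_ratio` and the representation of a bag as a `collections.Counter` keyed by `(attribute_value, class_label)` are assumptions made for this example.

```python
from collections import Counter
from math import log2


def entropy(counts):
    """Shannon entropy in bits of a multiset given as value -> multiplicity."""
    total = sum(counts.values())
    return -sum((c / total) * log2(c / total) for c in counts.values() if c > 0)


def gain_ratio(bag):
    """Gain ratio of splitting on one attribute.

    `bag` is a Counter mapping (attribute_value, class_label) -> multiplicity,
    i.e. a multiset of tuples (hypothetical representation for this sketch).
    """
    total = sum(bag.values())

    # Bag projections: multiplicities are summed, not collapsed to 1.
    class_counts = Counter()
    attr_counts = Counter()
    for (attr_value, label), mult in bag.items():
        class_counts[label] += mult
        attr_counts[attr_value] += mult

    # Expected class entropy after partitioning on the attribute.
    conditional = 0.0
    for attr_value, n in attr_counts.items():
        partition = Counter(
            {label: m for (a, label), m in bag.items() if a == attr_value}
        )
        conditional += (n / total) * entropy(partition)

    gain = entropy(class_counts) - conditional
    split_info = entropy(attr_counts)
    return gain / split_info if split_info > 0 else 0.0


# A bag in which duplicate tuples are kept with their multiplicities.
bag = Counter({("sunny", "no"): 2, ("sunny", "yes"): 2, ("rain", "yes"): 4})
print(gain_ratio(bag))

# The same tuples under set semantics (every multiplicity collapsed to 1)
# yield a different value, which is why multiplicities matter here.
as_set = Counter({key: 1 for key in bag})
print(gain_ratio(as_set))
```

Because `Counter` addition sums multiplicities, two such bags can be combined with `bag_a + bag_b`, which corresponds to the additive union of multisets rather than the duplicate-eliminating union of sets.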

References
  1. Albert, J. 1991. Algebraic properties of bag data types. In VLDB '91: Proceedings of the 17th International Conference on Very Large Data Bases, pages 211–219, San Francisco, CA, USA. Morgan Kaufmann Publishers Inc.
  2. Grefen, P. W. P. J.; de By, R. A. 1994. A multi-set extended relational algebra: a formal approach to a practical issue. In Proceedings of the 10th International Conference on Data Engineering.
  3. Apers, P. M. G.; van den Berg, C. A.; Flokstra, J.; Grefen, P. W. P. J.; Kersten, M. L.; Wilschut, A. N. 1992. PRISMA/DB: a parallel, main memory relational DBMS. IEEE Transactions on Knowledge and Data Engineering, vol. 4, no. 6.
  4. Han, J.; Kamber, M. 2006. Data Mining: Concepts and Techniques, 2nd edition. Morgan Kaufmann Publishers.
  5. Fong, P. K.; Weber-Jahnke, J. H. Feb 2012. Privacy Preserving Decision Tree Learning Using Unrealized Data Sets. IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 2, p. 353.
  6. Williams, J. 2010. Unrealization Approaches for Privacy Preserving Data Mining. Thesis, Department of Computer Science, University of Victoria.
  7. Quinlan, J. R. 1993. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers.
  8. Quinlan, J. R. 1986. Induction of Decision Trees. Machine Learning, 1, 1, 81-106.
  9. Weka Primer. URL: http://weka.wikispaces.com/Primer
Index Terms

Computer Science
Information Sciences

Keywords

Multirelational algebra, relation, gain ratio