International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 127 - Number 2 |
Year of Publication: 2015 |
Authors: Ali El-Matarawy, Mohammad El-Ramly, Reem Bahgat |
10.5120/ijca2015906324 |
Ali El-Matarawy, Mohammad El-Ramly, Reem Bahgat . Code Clone Detection using Sequential Pattern Mining. International Journal of Computer Applications. 127, 2 ( October 2015), 10-18. DOI=10.5120/ijca2015906324
This paper presents a new technique for clone detection using sequential pattern mining titled EgyCD. Over the last decade many techniques and tools for software clone detection have been proposed such as textual approaches, lexical approaches, syntactic approaches, semantic approaches …, etc. In this paper, we explore the potential of data mining techniques in clone detection. In particular, we developed a clone detection technique based on sequential pattern mining (SPM). The source code is treated as a sequence of transactions processed by the SPM algorithm to find frequent itemsets. We run three experiments to discover code clones of Type I, Type II and Type III and for plagiarism detection. We compared the results with other established code clone detectors. Our technique discovers all code clones in the source code and hence it is slower than the compared code clone detectors since they discover few code clones compared with EgyCD.