CFP last date
20 January 2025
Reseach Article

Subject based Clustering for Digital Forensic Investigation with Subject Suggestion

by Sweedle Mascarnes, Joanne Gomes
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 102 - Number 11
Year of Publication: 2014
Authors: Sweedle Mascarnes, Joanne Gomes
10.5120/17856-8715

Sweedle Mascarnes, Joanne Gomes . Subject based Clustering for Digital Forensic Investigation with Subject Suggestion. International Journal of Computer Applications. 102, 11 ( September 2014), 1-6. DOI=10.5120/17856-8715

@article{ 10.5120/17856-8715,
author = { Sweedle Mascarnes, Joanne Gomes },
title = { Subject based Clustering for Digital Forensic Investigation with Subject Suggestion },
journal = { International Journal of Computer Applications },
issue_date = { September 2014 },
volume = { 102 },
number = { 11 },
month = { September },
year = { 2014 },
issn = { 0975-8887 },
pages = { 1-6 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume102/number11/17856-8715/ },
doi = { 10.5120/17856-8715 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:32:48.767527+05:30
%A Sweedle Mascarnes
%A Joanne Gomes
%T Subject based Clustering for Digital Forensic Investigation with Subject Suggestion
%J International Journal of Computer Applications
%@ 0975-8887
%V 102
%N 11
%P 1-6
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Recently digital forensics has become a prominent activity in crime investigation since computers are increasingly used as tools to commit crimes. During forensic investigation the digital devices such as desktops, notebooks, smart phones etc. found at the crime scene are collected for further investigation. Investigators have to go through humongous amount of data stored on these devices to gather evidence. This activity exceeds the expert's ability of analyzing and interpreting the data. In this context data mining techniques such as clustering are used for automated data analysis. This research work focuses on a novel document clustering model that allows an investigator to semantically cluster the documents stored on a suspect's digital devices with the help of subject suggestions initially provided to him. Providing subject suggestion improves the accuracy and speeds up the process of searching the evidence. Without subject suggestion, the investigators are heedless about the suspect's dataset and fail to give appropriate search query which may delay the process of investigation.

References
  1. G. L. Palmer. 2001. A Road Map for Digital Forensics Research. Technical Report. First Digital Forensics Research Workshop (DFRWS).
  2. J. Han and M. Kamber. 2006. Data mining: Concepts and Techniques. Second Edition. Elsevier.
  3. Tool: Guidance Encase. http://www. guidancesoftware. com/computer-forensics ediscover software-digitalevidence. htm.
  4. Tool: Access Data Forensic Toolkit. http://www. accessdata. com/forensictoolkit. html.
  5. Tool: Sleuth Kit & Authopsy. http://www. sleuthkit. org.
  6. S. L. Garfinkel. 2010. Digital forensics research: The next 10 years. Digital Investigation. pp. 64–73.
  7. G. Dagher and B. Fung, "Subject-based semantic document clustering for digital forensic investigations", Journal of Data & Knowledge Engineering, Vol. 86, pp. 224–241, 2013.
  8. A. K. Jain. Data Clustering: 50 Years Beyond K-Means. 2010. Pattern Recognition Letters, Vol. 31, pp. 651-666.
  9. B. K. L. Fei, J. H. P. Eloff, H. S. Venter and M. S. Oliver. 2005. Exploring forensic data with self-organizing maps. In Proc. IFIP International Conference on Digital Forensics. pp. 113–123.
  10. N. L. Beebe and J. G. Clark. 1997. Digital forensic text string searching: Improving information retrieval effectiveness by thematically clustering search results. Digital Investigation. Elsevier. Vol. 4. pp. 49–54.
  11. L. Nassif and E. Hruschka. Document Clustering for Forensic Analysis: An Approach for Improving Computer Inspection. 2013. IEEE Transactions on Information Forensic and Security, Vol. 8. pp 46-54
  12. S. Decherchi, S. Tacconi, J. Redi, A. Leoncini, F. Sangiacomo and R. Zunino. 2009. Text clustering for digital forensics analysis. Computat. Intell. Security Information System. Vol. 63. pp. 29–36.
  13. M. F. Porter. 1997. An Algorithm for Suffix Stripping, Morgan Kaufmann Publishers Inc. USA. pp. 313–316.
  14. Wu HC, Luk RWP, Wong KF and Kwok KL. 2008 Interpreting tf–idf term weights as making relevance decisions. ACM Transactions on Information Systems. pp. 1–37.
  15. G. A. Miller. 1995. WordNet: a lexical database for English, Communications of the ACM, pp. 39–41.
  16. G. Salton, A. Wong and C. S. Yang. 1975. A vector space model for automatic indexing. Communications of the ACM, Vol. 18. pp. 613-620.
  17. J. Becker and D. Kuropka. . 2003. Topic-based Vector Space Model. In Proc. of the 6th International Conference on Business Information Systems. Colorado Springs.
  18. G. Salton and M. J. 1986. Introduction to Modern Information Retrieval. McGraw-Hill.
  19. http://en. wikipedia. org/wiki/F1_score 8/07/2014
Index Terms

Computer Science
Information Sciences

Keywords

Crime Investigation Digital Forensic Semantic Clustering Subject Suggestion