International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 41 - Number 7 |
Year of Publication: 2012 |
Authors: A. Pandian, Mohamed Abdul Karim |
10.5120/5551-7619 |
A. Pandian, Mohamed Abdul Karim . Detection of Fraudulent Emails by Authorship Extraction. International Journal of Computer Applications. 41, 7 ( March 2012), 7-12. DOI=10.5120/5551-7619
Fraudulent emails can be detected by extraction of authorship information from the contents of emails. This paper presents information extraction based on unique words from the emails. These unique words will be used as representative features to train Radial Basis function (RBF). Final weights are obtained and subsequently used for testing. The percentage of identification of email authorship depends upon number of RBF centers and the type of functional words used for training RBF. One hundred and fifty authors with over one hundred files from the sent folder of Enron email dataset are considered. A total of 300 unique words of number of characters in each word ranging from three to seven are considered. Training and testing of RBF are done by taking different lengths of words. Our simulation shows the effectiveness of the proposed RBF network for email authorship identification. The accuracy of authorship identification ranges from 95% to 97%.