International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 165 - Number 4 |
Year of Publication: 2017 |
Authors: Masoome Esmaeili, Arezoo Arjomandzadeh, Reza Shams, Morteza Zahedi |
10.5120/ijca2017913842 |
Masoome Esmaeili, Arezoo Arjomandzadeh, Reza Shams, Morteza Zahedi . An Anti-Spam System using Naive Bayes Method and Feature Selection Methods. International Journal of Computer Applications. 165, 4 ( May 2017), 1-5. DOI=10.5120/ijca2017913842
Electronic mail is one of the important means of communication. Thus, this useful tool has invaded by invaders for different purposes. One such Invasion is the posting of useless, unwanted e-mails known as spam or junk e-mails. Several methods of spam detection exist, but each has certain weaknesses. This paper address these weaknesses by implementing and describing a spam detection system in text classification mode, which uses Bayesian method vs. PCA to filter out written spam mails from the user’s mail box. In the proposed method first extract all tokens that exist in body of emails for classifying emails based on them. But sum of these tokens aren’t useful. Sum of them are repeated in two categories spam and non-spam mails equally, so they aren’t appropriate for distinguishing two types of emails. So proposed method finds best tokens as main features using feature selection methods such as genetic algorithm (GA), forward and backward feature selection methods.