International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 181 - Number 8 |
Year of Publication: 2018 |
Authors: Esraa M. EL-Mohdy, A. F. El-Gamal, Hanan E. Abdelkader |
10.5120/ijca2018917622 |
Esraa M. EL-Mohdy, A. F. El-Gamal, Hanan E. Abdelkader . Web Mining Techniques to Block Spam Web Sites. International Journal of Computer Applications. 181, 8 ( Aug 2018), 36-42. DOI=10.5120/ijca2018917622
The aim of this paper is to introduce a system based on web mining techniques to prevent spamming web pages. The system relies on content analysis, used features are Uniform Resource Locator(URL), Number of words in page Title, Globally Popular Keywords(GPK) and N-GRAM. The proposed system used Decision Tree(DT) rules ; which is the best classifier to detect Web spam content. It produces accuracy of .97 % in detecting spam web sites.