National Conference on Networking, Cloud Computing, Analytics and Computing Technology |
Foundation of Computer Science USA |
NCNCCACT2017 - Number 1 |
August 2018 |
Authors: Harshita Tiwary, Indu Kashyap |
Harshita Tiwary, Indu Kashyap. Feature Subset Selection for Twitter Spam Detection. National Conference on Networking, Cloud Computing, Analytics and Computing Technology. NCNCCACT2017, 1 (August 2018), 24-28.
The rapid growth of social networking has had an immense effect on today's general public and on the Web. Social networking sites have grown quickly in both size and popularity in recent years, and Twitter is one of the fastest growing among them. As the volume of information on Twitter has increased, detecting spam in real time has become a challenging task for researchers as well as for Twitter itself. A great deal of work has been done on spam detection, but previous work did not give satisfactory results for content-based spam detection on Twitter. In this paper, accuracy is analyzed using classical approaches, namely the Naïve Bayes and Random Forest algorithms. It is observed that these algorithms do not give accurate results. To increase the accuracy of spam detection, Random Forest with Feature Subset Selection is used. The aim is to propose a Feature Subset Based Classification Approach in which a set of features is tested using a Random Forest classifier for Twitter spam detection. In this way, the capabilities of the Random Forest classifier are extended for spam detection by combining it with feature subset selection.
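The comparison described in the abstract can be illustrated with a minimal sketch, assuming scikit-learn is available. It is not the authors' implementation: the data are synthetic (`make_classification` stands in for a labelled tweet feature matrix), the selection criterion (`SelectKBest` with the ANOVA F-score) and the subset size `K` are illustrative assumptions, and the abstract does not say which features or selection method the paper uses.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.pipeline import make_pipeline

# Assumption: synthetic data replaces the real spam / non-spam tweet features.
X, y = make_classification(
    n_samples=400, n_features=20, n_informative=5, n_redundant=2, random_state=0
)


def mean_cv_accuracy(model):
    """Mean 5-fold cross-validated accuracy of a model on (X, y)."""
    return cross_val_score(model, X, y, cv=5, scoring="accuracy").mean()


# Baselines: the two classical classifiers named in the abstract, all features.
acc_nb = mean_cv_accuracy(GaussianNB())
acc_rf = mean_cv_accuracy(RandomForestClassifier(n_estimators=50, random_state=0))

# Random Forest on a feature subset. K and the F-score criterion are
# hypothetical choices made for this sketch only.
K = 8
rf_with_subset = make_pipeline(
    SelectKBest(f_classif, k=K),
    RandomForestClassifier(n_estimators=50, random_state=0),
)
acc_rf_subset = mean_cv_accuracy(rf_with_subset)

# Fit once on all data to see which feature columns were kept.
rf_with_subset.fit(X, y)
selected = rf_with_subset.named_steps["selectkbest"].get_support(indices=True)

print(f"Naive Bayes accuracy:          {acc_nb:.3f}")
print(f"Random Forest accuracy:        {acc_rf:.3f}")
print(f"Random Forest + subset (K={K}): {acc_rf_subset:.3f}")
print("Selected feature indices:", list(selected))
```

Placing the selector inside the pipeline means the subset is re-chosen within each cross-validation fold, so the reported accuracy is not inflated by selecting features on the held-out data. Whether the subset improves on the baselines depends on the data; the sketch makes no claim about the paper's results.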