International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 154 - Number 6 |
Year of Publication: 2016 |
Authors: Abha Tewari, Smita Jangale |
10.5120/ijca2016912153 |
Abha Tewari, Smita Jangale . Spam Filtering Methods and machine Learning Algorithm - A Survey. International Journal of Computer Applications. 154, 6 ( Nov 2016), 8-12. DOI=10.5120/ijca2016912153
Social networking websites are used by millions of people around the world. People express their views, opinions and share current topics. Millions of data generated every day. It’s a good platform to connect with the people. Now a day’s spammers used this platform to advertise spam content on the social networking websites. The proposed system used to classify tweets into different groups as spam and non spam tweets .The system will use 120 character tweets for analysis purpose. Various active and verified twitter accounts would be chosen to extract the tweets. Each tweet is to be classified into 2 category-spam and non-spam. These classified tweets then are used to train the various machine learning techniques. Words of each tweet considered as features and a feature vector was created using bag-of-words approach in order to create the instances. The data will be trained using SVM (Support Vector Machine).