National Conference on Advancements in Computer & Information Technology |
Foundation of Computer Science USA |
NCACIT2016 - Number 2 |
May 2016 |
Authors: Dhanashri S. Hulavale, Saurabh H. Deshmukh |
2f61370b-c52c-4519-b8ac-cd0981c5b126 |
Dhanashri S. Hulavale, Saurabh H. Deshmukh . A Three Stage Classifier for Efficient Website Categorization. National Conference on Advancements in Computer & Information Technology. NCACIT2016, 2 (May 2016), 22-24.
Website categorization is one of the challenging tasks in the world of ever increasing web technologies. There are different way to categorization of web pages using different features and approach. Website contains lot of information like text, images, animation, video and links. So this information is call as features of website. For the website categorization purpose all Feature have most important role. The web has a lot of information in the form of images, video, animation and text etc present in the document. In proposed System uses number of feature of website and use three different classifier for website classification are naive bays classifier, linear classifier- perceptron and stochastic classifier. Here eight major categories of website have been selected for categorization; these are business & economy, job search, and science, education, sports, news & media, government, entertainment. Proposed system gives ranking to website. It will be more helpful for software developer or website designer for evolution of their site using our system so that they can judge that their website belongs to respective category or not.