CFP last date
20 December 2024
Reseach Article

Classification of Text using Innovative Algorithm

Published on April 2016 by Nausheen Dange, V.v. Bag
National Seminar on Recent Trends in Data Mining
Foundation of Computer Science USA
RTDM2016 - Number 3
April 2016
Authors: Nausheen Dange, V.v. Bag
bfc67528-abe7-4bb0-9ee9-212443bab193

Nausheen Dange, V.v. Bag . Classification of Text using Innovative Algorithm. National Seminar on Recent Trends in Data Mining. RTDM2016, 3 (April 2016), 19-20.

@article{
author = { Nausheen Dange, V.v. Bag },
title = { Classification of Text using Innovative Algorithm },
journal = { National Seminar on Recent Trends in Data Mining },
issue_date = { April 2016 },
volume = { RTDM2016 },
number = { 3 },
month = { April },
year = { 2016 },
issn = 0975-8887,
pages = { 19-20 },
numpages = 2,
url = { /proceedings/rtdm2016/number3/24694-2590/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 National Seminar on Recent Trends in Data Mining
%A Nausheen Dange
%A V.v. Bag
%T Classification of Text using Innovative Algorithm
%J National Seminar on Recent Trends in Data Mining
%@ 0975-8887
%V RTDM2016
%N 3
%P 19-20
%D 2016
%I International Journal of Computer Applications
Abstract

The exponential growth of the internet has led to a great deal of interest in developing useful and efficient tools and software to assist users in searching the Web. Document retrieval, categorization, routing and filtering can all be formulated as classification problems. However, the complexity of natural languages and the extremely high dimensionality of the feature space of documents have made this classification problem very difficult. We have different methods for text classification: the Naive Bayes classifier, the nearest neighbor classifier, SVM (Support Vector Machine), Feature Selection, Feature Extraction Algorithms, decision trees and a subspace method. Each method involved has its own advantage and disadvantage. In order to avoid these ambiguities and redundancies, some of these methods can be combined together to produce highly accurate results. In addition to this, the produced algorithm will help to enhance the performance of the overall text classification system.

References
  1. W. Cohen. "Learning with set-valued features". AAAI Conference, 1996.
  2. Y. Yang, L. Liu. " A re-examination of text categorization methods", ACM SIGIR Conference, 1999.
  3. M. James. " Classi?cation Algorithms", Wiley Interscience, 1985.
  4. A, Basu, C. Watters and M. Shepherd faculty of computer science Support Vector machine for text classification , Delhousie University, Canada
  5. "Feature selection for text classification", Inoshika, Dilruksh
Index Terms

Computer Science
Information Sciences

Keywords

Text Classification Support Vectors Training Set Feature Set