International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 26 - Number 9 |
Year of Publication: 2011 |
Authors: S.Sagar Imambi, T.Sudha |
10.5120/3131-4315 |
S.Sagar Imambi, T.Sudha . A Novel Feature Selection Method for Classification of Medical Documents from Pubmed. International Journal of Computer Applications. 26, 9 ( July 2011), 29-33. DOI=10.5120/3131-4315
The exponential growth of online repositories in medical science has led to the development of various text mining tool . Theses tools assist the users in analyzing text data stored in the online repositories like Pubmed and medline. The pubmed repositories are growing at the rate of 500000 articles per year. Classification of Medline documents becomes very complex due to high dimensionality of feature space. In this study we discussed how dimensionality is reduced. We study and compared various dimensionality reduction techniques at the preprocessing stage. We introduce a novel feature weighting scheme ‘GRW ‘ and proved that this schema improves classification accuracy. Our experimental results indicate that existing feature weighting methods has less accuracy rate when compared to GRW schema and tested on medical data set.