International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 128 - Number 2 |
Year of Publication: 2015 |
Authors: Anagha N. Chaudhari |
10.5120/ijca2015906459 |
Anagha N. Chaudhari . A Novel Approach for Development of an Expert IR System using Dimensionality Reduction Techniques and Clustering Approaches for High Dimensionality Dataset. International Journal of Computer Applications. 128, 2 ( October 2015), 48-53. DOI=10.5120/ijca2015906459
In day to day life huge amount of electronic data is generated from various resources. Such data is literally large and not easy to work with for storage and retrieval. This type of data can be treated with various efficient techniques for cleaning, compression and sorting of data. Preprocessing can be used to remove basic English stop-words from data making it compact and easy for further processing; later dimensionality reduction techniques make data more efficient and specific. This data later can be clustered for better information retrieval. This paper elaborates the various dimensionality reduction and clustering techniques applied on sample dataset C50test of 2500 documents giving promising results, their comparison and better approach for relevant information retrieval.