International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 39 - Number 12 |
Year of Publication: 2012 |
Authors: G. Thavasi Raja, R. Malmathanraj, M. Arun |
10.5120/4872-7299 |
G. Thavasi Raja, R. Malmathanraj, M. Arun . Document Clustering using Learning from Examples. International Journal of Computer Applications. 39, 12 ( February 2012), 17-24. DOI=10.5120/4872-7299
Information filtering (IF) systems usually filter data items by correlating a set of terms representing the user’s interest with similar sets of terms representing the data items. Many techniques have been employed for constructing user profiles automatically, but they usually yield large sets of data. Various dimensionality-reduction techniques can be applied in order to reduce the number of terms in a user query. A new framework is described to classify large scale documents and retrieve the documents related to the user’s query based on the application of trained artificial neural network (ANN) model. Its novel feature is the identification of an optimal set of documents that are relevant to the user. As a case study the government orders issued by Tamil Nadu state government, a state in India are classified according to their semantic similarity. Various neural architectures such as back propagation neural network (BPN), radial basis function (RBF), Learning Vector Quantization (LVQ) and Support vector machines (SVM) are used and their performance evaluation is analyzed.