International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 1 - Number 9
Year of Publication: 2010
Authors: B. Janet, A. V. Reddy
DOI: 10.5120/192-330
B. Janet, A. V. Reddy. Cube Index: A Text Index Model for Retrieval and Mining. International Journal of Computer Applications. 1, 9 (February 2010), 88-92. DOI=10.5120/192-330
Text retrieval, analysis, mining, and knowledge management have gained importance at a time when we are drowning in information but starved for knowledge. In this paper, we propose a novel index that uses a Text Cube model to store text information, similar to a data cube in data mining. This model combines a direct index, a next-word index, and an inverted index in a single Cube Index, which is three-dimensional. The dimensions are first word, next word, and document. The measure of the cube is the frequency of occurrence of each word/next-word pair. The Cube Index has been tested by modifying the open-source code of Terrier 2.1.
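
The cube structure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' Terrier-based implementation: the function names (`build_cube`, `inverted_view`, `next_word_view`, `direct_view`), the whitespace tokenizer, and the dictionary layout are all assumptions made for illustration. Each cell is keyed by (first word, next word, document) and stores the pair frequency; the three conventional indexes are then read off as slices of the same cube.

```python
from collections import defaultdict


def build_cube(docs):
    """Build a sparse cube: (first_word, next_word, doc_id) -> pair frequency.

    `docs` maps a document id to its raw text. Tokenization by lowercased
    whitespace splitting is an assumption of this sketch.
    """
    cube = defaultdict(int)
    for doc_id, text in docs.items():
        tokens = text.lower().split()
        # Every adjacent token pair contributes one count to its cell.
        for first, nxt in zip(tokens, tokens[1:]):
            cube[(first, nxt, doc_id)] += 1
    return dict(cube)


def inverted_view(cube, word):
    """Inverted-index slice: documents in which `word` occurs in any pair."""
    return sorted({d for (w, n, d) in cube if word in (w, n)})


def next_word_view(cube, word):
    """Next-word slice: next word -> total frequency after `word`."""
    totals = defaultdict(int)
    for (w, n, _), count in cube.items():
        if w == word:
            totals[n] += count
    return dict(totals)


def direct_view(cube, doc_id):
    """Direct-index slice: distinct words that occur in `doc_id`."""
    words = set()
    for (w, n, d) in cube:
        if d == doc_id:
            words.update((w, n))
    return sorted(words)
```

For example, with `docs = {"d1": "text cube index text cube", "d2": "cube index model"}`, the cell `("text", "cube", "d1")` holds the frequency of that pair in `d1`, and `next_word_view(cube, "cube")` aggregates the words that follow "cube" across both documents. A single-word document produces no pairs and so does not appear in this sketch; a real index would need a sentinel or a separate entry for that case.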