Advanced Computing and Information Technology |
Foundation of Computer Science USA |
TACIT2016 - Number 1 |
August 2016 |
Authors: Priti Shende, V. B. Kute |
f2267d75-9d98-4aa3-8600-8ee75a9d5201 |
Priti Shende, V. B. Kute . Mining Text for Meaningful Words with Stemming Algorithm. Advanced Computing and Information Technology. TACIT2016, 1 (August 2016), 13-16.
With the growth of explosive Internet information, data availability is easy. However, raw data is useful when mined. Therefore, mining is an important research area. The text mining primarily aims at discovery and retrieval of useful and interesting patterns from a large database. Identification and understanding of appropriate words is important to retrieve appropriate documents. Referring dictionary is time consuming and tedious job for understanding meaning of words every time. This can be prevented by converting different occurrences of word forms to its root. Frequency of words occurrences in a file used to prioritized documents. This works target avoidance of incomplete and meaningless words generation using stemming. We propose a method to compare different forms of words present in the document up to certain length. Sixty percent length of the word considered for comparison. Words having common letters are considered as different forms of same root.