International Conference on Current Trends in Advanced Computing ICCTAC 2013
Foundation of Computer Science USA
ICCTAC - Number 1
June 2013
Authors: R. Menaha, G. Anupriya
R. Menaha, G. Anupriya. Semantic similarity measurement between words using SWD & Snippets. International Conference on Current Trends in Advanced Computing ICCTAC 2013. ICCTAC, 1 (June 2013), 31-35.
Semantic similarity plays a significant role in Web mining, information retrieval, natural language processing (NLP), and text mining. Although it is exploited in many applications, accurately measuring semantic similarity remains a challenging task. This paper proposes a method for measuring the semantic similarity between words that uses the Web as its information source and combines two existing approaches: Semantic Word Distance (SWD) and snippets. The SWD measure estimates semantic similarity from the frequency with which the words occur in Web pages (the corpus). The semantic relation between words is also captured through lexical patterns extracted from text snippets. A support vector machine is used to integrate these similarity scores robustly. For the experiments, 100 word pairs are used to train the support vector machine, which then classifies a word pair as either synonymous or non-synonymous with higher accuracy.
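The pipeline described in the abstract can be illustrated with a minimal sketch of how the two kinds of evidence might be turned into one feature vector for a classifier. Everything here is an assumption for illustration: the abstract does not give the SWD formula, so a Jaccard-style co-occurrence score over page counts stands in for it, and the function names, the `max_gap` parameter, the pattern vocabulary, and the page counts are all hypothetical. The resulting vector is the kind of input a support vector machine could be trained on; the training step itself is not shown.

```python
import re

def cooccurrence_score(count_a, count_b, count_ab):
    """Jaccard-style score from page counts.

    Stand-in only: this is NOT the paper's SWD formula, which the
    abstract does not specify.
    """
    denom = count_a + count_b - count_ab
    if count_ab == 0 or denom <= 0:
        return 0.0
    return count_ab / denom

def extract_patterns(snippet, word_a, word_b, max_gap=4):
    """Return lexical patterns linking the two words in a snippet.

    word_a is replaced by X and word_b by Y; up to max_gap words
    may appear between them (hypothetical limit).
    """
    tokens = re.findall(r"[a-z0-9]+", snippet.lower())
    a, b = word_a.lower(), word_b.lower()
    patterns = []
    for i, tok in enumerate(tokens):
        if tok not in (a, b):
            continue
        other = b if tok == a else a
        first, second = ("X", "Y") if tok == a else ("Y", "X")
        # Look ahead for the other word within the allowed gap.
        for j in range(i + 1, min(i + 2 + max_gap, len(tokens))):
            if tokens[j] == other:
                gap = tokens[i + 1:j]
                patterns.append(" ".join([first] + gap + [second]))
                break
    return patterns

def feature_vector(page_counts, snippets, word_a, word_b, vocabulary):
    """Concatenate the page-count score with per-pattern frequencies."""
    found = []
    for snippet in snippets:
        found.extend(extract_patterns(snippet, word_a, word_b))
    score = cooccurrence_score(*page_counts)
    return [score] + [float(found.count(p)) for p in vocabulary]

# Hypothetical page counts and snippets, for illustration only.
snippets = [
    "A car, also known as an automobile, is a wheeled vehicle.",
    "Automobile or car: a road vehicle.",
]
vocabulary = ["X also known as an Y", "Y or X", "X and Y"]
vector = feature_vector((100, 50, 30), snippets, "car", "automobile", vocabulary)
```

Keeping the page-count score and the pattern frequencies in one vector lets a single classifier weigh both sources of evidence, which is the role the abstract assigns to the support vector machine.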