International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 5 - Number 3
Year of Publication: 2010
Authors: H. S. Dhami, Raj Kishor Bisht |
DOI: 10.5120/895-1269
H. S. Dhami, Raj Kishor Bisht. Fuzzy Set Theoretic Approach To Collocation Extraction. International Journal of Computer Applications. 5, 3 (August 2010), 43-49. DOI=10.5120/895-1269
The fuzzy approach deals with linguistic properties of elements, such as beauty, coldness, and hotness. Collocations are linguistically motivated: whether a word combination counts as a collocation is a linguistic judgment, since the mere co-occurrence of words does not signify the presence of a collocation. Collocation extraction can therefore be approached through its linguistic aspect. In the present paper, two different fuzzy sets of word combinations to be considered as collocations are constructed, with mutual information and the t-test taken as the basis for their construction. Two fuzzy set theoretic models are proposed to identify collocations. It is shown that the fuzzy set theoretic approach works very well for collocation extraction. The working data is a corpus of about one million words drawn from different novels in Project Gutenberg, available at www.gutenberg.org.
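As a rough illustration of the two association measures named in the abstract, the following Python sketch computes pointwise mutual information and the standard t-score for adjacent word pairs, then maps a score to a membership grade in [0, 1]. The abstract does not give the membership functions used in the paper, so the piecewise-linear `membership` function and its `low` and `high` thresholds are assumptions made purely for illustration, as are the function names and the toy token list.

```python
import math
from collections import Counter


def bigram_scores(tokens):
    """Return {(w1, w2): (pmi, t_score)} for every adjacent word pair."""
    n = len(tokens)
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    scores = {}
    for (w1, w2), c12 in bigrams.items():
        p1 = unigrams[w1] / n
        p2 = unigrams[w2] / n
        p12 = c12 / n
        # Pointwise mutual information: log2 of observed over expected probability.
        pmi = math.log2(p12 / (p1 * p2))
        # t-score: (observed - expected) / sqrt(variance / n), with variance ~ p12.
        t_score = (p12 - p1 * p2) / math.sqrt(p12 / n)
        scores[(w1, w2)] = (pmi, t_score)
    return scores


def membership(value, low, high):
    """Hypothetical piecewise-linear membership grade (an assumption, not
    the paper's function): 0 at or below low, 1 at or above high."""
    if high <= low:
        raise ValueError("high must be greater than low")
    return min(1.0, max(0.0, (value - low) / (high - low)))


# Toy token list standing in for a real corpus.
tokens = "strong tea strong tea weak coffee".split()
scores = bigram_scores(tokens)
pmi, t_score = scores[("strong", "tea")]
grade = membership(pmi, low=0.0, high=3.0)  # thresholds are illustrative only
```

In this sketch each measure would yield its own fuzzy set of candidate word pairs; how the paper combines or thresholds the two sets is not stated in the abstract and is left open here.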