CFP last date
20 December 2024
Reseach Article

Fuzzy Logic for Document Clustering

Published on July 2016 by Bhushan Talekar, Saniket Kudoo, Pragati Patil, Pallavi Vartak
National Conference on Role of Engineers in National Building
Foundation of Computer Science USA
NCRENB2016 - Number 1
July 2016
Authors: Bhushan Talekar, Saniket Kudoo, Pragati Patil, Pallavi Vartak
d8904dc1-244d-400d-a90b-50c289b29529

Bhushan Talekar, Saniket Kudoo, Pragati Patil, Pallavi Vartak . Fuzzy Logic for Document Clustering. National Conference on Role of Engineers in National Building. NCRENB2016, 1 (July 2016), 28-31.

@article{
author = { Bhushan Talekar, Saniket Kudoo, Pragati Patil, Pallavi Vartak },
title = { Fuzzy Logic for Document Clustering },
journal = { National Conference on Role of Engineers in National Building },
issue_date = { July 2016 },
volume = { NCRENB2016 },
number = { 1 },
month = { July },
year = { 2016 },
issn = 0975-8887,
pages = { 28-31 },
numpages = 4,
url = { /proceedings/ncrenb2016/number1/25555-4034/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 National Conference on Role of Engineers in National Building
%A Bhushan Talekar
%A Saniket Kudoo
%A Pragati Patil
%A Pallavi Vartak
%T Fuzzy Logic for Document Clustering
%J National Conference on Role of Engineers in National Building
%@ 0975-8887
%V NCRENB2016
%N 1
%P 28-31
%D 2016
%I International Journal of Computer Applications
Abstract

This paper shows document clustering by applying fuzzy logic. The method involves cleaning up the text and stemming of words. Then, chose ‘m’ features which differ significantly in their word frequencies (WF), normalized by document length, between documents belonging to these two clusters. The documents to be clustered are represented as a collection of ‘m’ normalized WF values. Then use Fuzzy c-means (FCM) algorithm to cluster these documents into two clusters. After the FCM execution finishes, the documents in the two clusters are analyzed for the values of their respective ‘m’ features. By using fuzzy logic, we not only get the cluster name, but also the degree to which a document belongs to a cluster.

References
  1. Vishal Gupta, Gurpreet S. Lehal; “A Survey of Text Mining Techniques and Applications”;Journal of Emerging Technologies in Web Intelligence, Vol.1, No.1, August 2009
  2. K.Sathiyakumari, V.Preamsudha, G.Manimekalai; “Unsupervised Approach for Document Clustering Using Modified Fuzzy C mean Algorithm”; International Journal of Computer & Organization Trends –Volume 11 Issue3-2011.
  3. R. Rajendra Prasath, Sudeshna Sarkar: Unsupervised Feature Generation using Knowledge Repositories for Effective Text Categorization. ECAI 2010: 1101-1102
  4. Sumit Goswami, Sudeshna Sarkar, Mayur Rustagi: Stylometric Analysis of Bloggers' Age and Gender. ICWSM 2009
  5. Sumit Goswami, Mayank Singh Shishodia; “A fuzzy based approach to stylometric analysis of blogger?s age and gender”; HIS 2012: 47-51
  6. Ross, T. J. (2010); “Fuzzy Logic with Engineering Applications”, Third Edition, John Wiley & Sons, Ltd, Chichester, UK
  7. Fuzzy logic vs Proobability (Good Math, BadMath);http://scientopia.org/blogs /goodmath/2011 /02/02 / fuzzy-logic-vs-probability/, last checked on 28thJuly 2012.
  8. Nogueira, T.M. ; “ On The Use of Fuzzy Rules to Text Document Classification ”; 2010 10th International Conference on Hybrid Intelligent Systems (HIS),; 23-25Aug 2010 Atlanta, US.
Index Terms

Computer Science
Information Sciences

Keywords

Fuzzy c-means algorithm fuzzy logic Document clustering