Unsupervised Cluster Matching for Content Model

E. Suchitha; N. Venkata Subba Reddy; Prasanta Kumar Sahoo

Call for Paper

November Edition

IJCA solicits high quality original research papers for the upcoming November edition of the journal. The last date of research paper submission is 20 October 2025

Submit your paper

Know more

The week's pick

Zero Trust Architecture Implementation in Enterprise Networks: Evaluating Effectiveness Against Cyber Threats

Stephen Kofi Dotse Samuel Yao Sebuabe Augustus Obeng Silas Asani Abudu Edna Awisie Pappoe

Random Articles

Reseach Article

Unsupervised Cluster Matching for Content Model

by E. Suchitha, N. Venkata Subba Reddy, Prasanta Kumar Sahoo

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 176 - Number 20

Year of Publication: 2020

Authors: E. Suchitha, N. Venkata Subba Reddy, Prasanta Kumar Sahoo

10.5120/ijca2020920176

E. Suchitha, N. Venkata Subba Reddy, Prasanta Kumar Sahoo . Unsupervised Cluster Matching for Content Model. International Journal of Computer Applications. 176, 20 ( May 2020), 39-41. DOI=10.5120/ijca2020920176

@article{ 10.5120/ijca2020920176,

author = { E. Suchitha, N. Venkata Subba Reddy, Prasanta Kumar Sahoo },

title = { Unsupervised Cluster Matching for Content Model },

journal = { International Journal of Computer Applications },

issue_date = { May 2020 },

volume = { 176 },

number = { 20 },

month = { May },

year = { 2020 },

issn = { 0975-8887 },

pages = { 39-41 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume176/number20/31318-2020920176/ },

doi = { 10.5120/ijca2020920176 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-07T00:43:04.540189+05:30

%A E. Suchitha

%A N. Venkata Subba Reddy

%A Prasanta Kumar Sahoo

%T Unsupervised Cluster Matching for Content Model

%J International Journal of Computer Applications

%@ 0975-8887

%V 176

%N 20

%P 39-41

%D 2020

%I Foundation of Computer Science (FCS), NY, USA

Abstract

People are generating huge amount of data which user need to store data in the storage devices. But storing the data in cloud is unsecure and people are storing the same data again and again. to avoid waste of storing the data again and again on same documents, we are going to use clustering the data documents for same documents and transform them into single language and store them. Whenever user need the document then it translate into user defined language and shows results to user. The application of document clustering can be categorized to two types, online and offline. Online applications are usually constrained by efficiency problems when compared to offline applications. Text clustering may be used for different tasks, such as grouping similar documents (news, tweets, etc.) and the analysis of customer/employee feedback, discovering meaningful implicit subjects across all documents. Documents can be clustered based on structure or content based meaning of the documents. In the existing system when documents are translated into user defined language documents are getting different meaning and it can convert into only two languages only.

References

“PolylingualTopicModels” Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pages 880–889, Singapore, 6-7 August 2009.
“BILINGUAL SENTENCE MATCHING USING KERNEL CCA” 2010 IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2010) August 29 – September 1, 2010, Kittilä, Finland.
“Unsupervised Many-to-Many Object Matching for Relational Data” IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE.
“Document Recognition/Aut henticat ion Based on Medium-Embedded Random Patterns” https://ieeexplore.ieee.org/document/395774
” Scalability Analysis of Semantics based Distributed Document Clustering Algorithms “2017 International Conference on Intelligent Computing,Instrumentation and Control Technologies (ICICICT) https://ieeexplore.ieee.org/document/8342660
“An improved Document Clustering Approach with Multi-Viewpoint based on different similarity measures “Proceedings of the Second International Conference on Intelligent Computing and Control Systems (ICICCS 2018) IEEE Xplore Compliant Part Number: CFP18K74-ART; ISBN:978-1-5386-2842-3.
“Efficient Phrase-Based Document Indexing for Web Document Clustering” IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 16, NO. 10, OCTOBER 2004.
“Document Clustering Method using Weighted Semantic Features and Cluster Similarity” 2010 IEEE International Conference on Digital Game and Intelligent Toy Enhanced Learning.
“Clustering Web Retrieval Results Accompanied by Removing Duplicate Documents” 2010 International Conference on Web Information Systems and Mining.
N. Quadrianto, A. J. Smola, L. Song, and T. Tuytelaars, “Kernelized sorting,” IEEE Trans. on Pattern Analysis and Machine Intelligence,zvol. 32, no. 10, pp. 1809–1821, 2010.

Index Terms

Computer Science

Information Sciences

Keywords

Data Mining RSA HSIC(HILBERT-SCHMIDT INDEPENDENCE CRITERION).