We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

Unsupervised Cluster Matching for Content Model

by E. Suchitha, N. Venkata Subba Reddy, Prasanta Kumar Sahoo
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 176 - Number 20
Year of Publication: 2020
Authors: E. Suchitha, N. Venkata Subba Reddy, Prasanta Kumar Sahoo
10.5120/ijca2020920176

E. Suchitha, N. Venkata Subba Reddy, Prasanta Kumar Sahoo . Unsupervised Cluster Matching for Content Model. International Journal of Computer Applications. 176, 20 ( May 2020), 39-41. DOI=10.5120/ijca2020920176

@article{ 10.5120/ijca2020920176,
author = { E. Suchitha, N. Venkata Subba Reddy, Prasanta Kumar Sahoo },
title = { Unsupervised Cluster Matching for Content Model },
journal = { International Journal of Computer Applications },
issue_date = { May 2020 },
volume = { 176 },
number = { 20 },
month = { May },
year = { 2020 },
issn = { 0975-8887 },
pages = { 39-41 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume176/number20/31318-2020920176/ },
doi = { 10.5120/ijca2020920176 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:43:04.540189+05:30
%A E. Suchitha
%A N. Venkata Subba Reddy
%A Prasanta Kumar Sahoo
%T Unsupervised Cluster Matching for Content Model
%J International Journal of Computer Applications
%@ 0975-8887
%V 176
%N 20
%P 39-41
%D 2020
%I Foundation of Computer Science (FCS), NY, USA
Abstract

People are generating huge amount of data which user need to store data in the storage devices. But storing the data in cloud is unsecure and people are storing the same data again and again. to avoid waste of storing the data again and again on same documents, we are going to use clustering the data documents for same documents and transform them into single language and store them. Whenever user need the document then it translate into user defined language and shows results to user. The application of document clustering can be categorized to two types, online and offline. Online applications are usually constrained by efficiency problems when compared to offline applications. Text clustering may be used for different tasks, such as grouping similar documents (news, tweets, etc.) and the analysis of customer/employee feedback, discovering meaningful implicit subjects across all documents. Documents can be clustered based on structure or content based meaning of the documents. In the existing system when documents are translated into user defined language documents are getting different meaning and it can convert into only two languages only.

References
  1. “PolylingualTopicModels” Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pages 880–889, Singapore, 6-7 August 2009.
  2. “BILINGUAL SENTENCE MATCHING USING KERNEL CCA” 2010 IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2010) August 29 – September 1, 2010, Kittilä, Finland.
  3. “Unsupervised Many-to-Many Object Matching for Relational Data” IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE.
  4. “Document Recognition/Aut henticat ion Based on Medium-Embedded Random Patterns” https://ieeexplore.ieee.org/document/395774
  5. “ONLINE BINARY VISUALIZATION FOR PDF DOCUMENTS” 978-1-5386-4615-1/18/$31.00 ©2018 IEEE
  6. ” Scalability Analysis of Semantics based Distributed Document Clustering Algorithms “2017 International Conference on Intelligent Computing,Instrumentation and Control Technologies (ICICICT) https://ieeexplore.ieee.org/document/8342660
  7. “An improved Document Clustering Approach with Multi-Viewpoint based on different similarity measures “Proceedings of the Second International Conference on Intelligent Computing and Control Systems (ICICCS 2018) IEEE Xplore Compliant Part Number: CFP18K74-ART; ISBN:978-1-5386-2842-3.
  8. “Efficient Phrase-Based Document Indexing for Web Document Clustering” IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 16, NO. 10, OCTOBER 2004.
  9. “Document Clustering Method using Weighted Semantic Features and Cluster Similarity” 2010 IEEE International Conference on Digital Game and Intelligent Toy Enhanced Learning.
  10. “Clustering Web Retrieval Results Accompanied by Removing Duplicate Documents” 2010 International Conference on Web Information Systems and Mining.
  11. N. Quadrianto, A. J. Smola, L. Song, and T. Tuytelaars, “Kernelized sorting,” IEEE Trans. on Pattern Analysis and Machine Intelligence,zvol. 32, no. 10, pp. 1809–1821, 2010.
Index Terms

Computer Science
Information Sciences

Keywords

Data Mining RSA HSIC(HILBERT-SCHMIDT INDEPENDENCE CRITERION).