CFP last date
20 January 2025
Reseach Article

Performance Analysis for Crowdsourcing Context Submission using Hierarchical Clustering Algorithm and Classification

by S. P. Jadhav, M. R. Patil
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 92 - Number 11
Year of Publication: 2014
Authors: S. P. Jadhav, M. R. Patil
10.5120/16055-5275

S. P. Jadhav, M. R. Patil . Performance Analysis for Crowdsourcing Context Submission using Hierarchical Clustering Algorithm and Classification. International Journal of Computer Applications. 92, 11 ( April 2014), 33-37. DOI=10.5120/16055-5275

@article{ 10.5120/16055-5275,
author = { S. P. Jadhav, M. R. Patil },
title = { Performance Analysis for Crowdsourcing Context Submission using Hierarchical Clustering Algorithm and Classification },
journal = { International Journal of Computer Applications },
issue_date = { April 2014 },
volume = { 92 },
number = { 11 },
month = { April },
year = { 2014 },
issn = { 0975-8887 },
pages = { 33-37 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume92/number11/16055-5275/ },
doi = { 10.5120/16055-5275 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:14:03.974843+05:30
%A S. P. Jadhav
%A M. R. Patil
%T Performance Analysis for Crowdsourcing Context Submission using Hierarchical Clustering Algorithm and Classification
%J International Journal of Computer Applications
%@ 0975-8887
%V 92
%N 11
%P 33-37
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Very well know that the complexity and volume of the data is increasing rapidly in some Crowdsourcing websites. The term Crowdsourcing means the action of outsourcing tasks, traditionally performed by an employee or contractor, which are now performed by a large group of people. It is more expensive and more time consuming process because of increase in rate of submission and so short listing the winners. Data submitted by crowdsourcing websites can be noisy, inconsistent. To overcome this problems related to data one of the method was proposed which named as text mining method; this method performs the number of operations like extraction of data, preprocessing process, tf-idf calculation and calculation of similarity. Results obtained by existing system shows that k-means algorithm with text mining methods do not do the entire trick of evaluating submissions. Hence proposed system uses hierarchical clustering algorithm with text mining methods and classification for relation submission to overcome the problems present in the existing system.

References
  1. J. C. Bongard, Member, IEEE, Paul D. H. Hines, Member, IEEE, Dylan Conger, Peter Hurd, and Zhenyu Lu," Crowdsourcing Predictors of Behavioral Outcomes", IEEETRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEM, VOL 43, NO. 1, JAN2013.
  2. Thomas Walter, Andera Back," A Text Mining Approach to Evaluate Submissions to Crowdsourcing Contests", 2013, 46th Hawaii International Conference on System Sciences.
  3. E. A. Calvillo1, A. Padilla, J. Munoz, J. Ponce, J. T. Fernandez," Searching Research Papers Using Clustering and Text Mining",2013, IEEE Tavel, P. 2007 Modeling and Simulation Design. AK Peters Ltd.
  4. S. Subbaiah," Extracting Knowledge using Probabilistic Classifier for Text Mining", Proceedings of the 2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering, February 21-22
  5. Matthew Lease ,Emine Yilmaz," Crowdsourcing for information retrieval: introduction to the special issue",Springer Science Business Media New York, March 2013.
  6. Kai Kuikkaniemi," White paper: Crowdsourcing in Media Industry", 2010.
  7. Man-Ching Yuen, Irwin King and Kwong-Sak Leung," A Survey of Crowdsourcing Systems", 2011 IEEE International Conference on Privacy, Security, Risk, and Trust, and IEEE International Conference on Social Computing.
Index Terms

Computer Science
Information Sciences

Keywords

Apriori Algorithm Clustering Crowdsourcing Hierarchical clustering TDM (Term Document Matrix) IR (Information Retrieval).