International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 4 - Number 5 |
Year of Publication: 2010 |
Authors: K. Ramar, N. Sivaram |
10.5120/823-1165 |
K. Ramar, N. Sivaram . Applicability of Clustering and Classification Algorithms for Recruitment Data Mining. International Journal of Computer Applications. 4, 5 ( July 2010), 23-28. DOI=10.5120/823-1165
Recruitment of appropriate employees and their retention are the major concerns towards creating the competitive strength in the knowledge economy. Every year IT companies recruit fresh graduates through their campus selection programs after examining their skills by conducting tests, group discussion and a number of interviews. The recruitment process requires enormous amount of effort and investment. During each phase of the recruitment process, candidates are filtered based on some performance criteria. Intense analysis on the system indicates that a pattern exists among the candidates selected for an industry. The problem domain is complex and the aspects of candidates that impact the recruitment process is not explicit. In this research, the domain knowledge is extracted through knowledge acquisition techniques. Data mining techniques that fit the problem are determined. A study has been made by applying K-means and fuzzy C-means clustering and decision tree classification algorithms to the recruitment data of an industry. Experiments were conducted with the data collected from an IT industry to support their hiring decisions. Pruned and unpruned trees were constructed using ID3, C4.5 and CART algorithms. From the comparative study, it has been observed that the clustering algorithms are not much suitable for the problem and performance of the C4.5 decision tree algorithm is high. Using the constructed decision trees, discussions were made with the domain experts to deduce viable decision rules.