Cluster based Ranking Index for Enhancing Recruitment Process using Text Mining and Machine Learning

Mayuri Verma

Call for Paper

May Edition

IJCA solicits high quality original research papers for the upcoming May edition of the journal. The last date of research paper submission is 20 April 2026

Submit your paper

Know more

The week's pick

Evaluating Text-to-Text Generation from LLMs: A Case Study and Scalable Framework

Ziqiao Ao Juhi Singh Sebastian Antinome

Random Articles

Reseach Article

Cluster based Ranking Index for Enhancing Recruitment Process using Text Mining and Machine Learning

by Mayuri Verma

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 157 - Number 9

Year of Publication: 2017

Authors: Mayuri Verma

10.5120/ijca2017912812

Mayuri Verma . Cluster based Ranking Index for Enhancing Recruitment Process using Text Mining and Machine Learning. International Journal of Computer Applications. 157, 9 ( Jan 2017), 23-30. DOI=10.5120/ijca2017912812

@article{ 10.5120/ijca2017912812,

author = { Mayuri Verma },

title = { Cluster based Ranking Index for Enhancing Recruitment Process using Text Mining and Machine Learning },

journal = { International Journal of Computer Applications },

issue_date = { Jan 2017 },

volume = { 157 },

number = { 9 },

month = { Jan },

year = { 2017 },

issn = { 0975-8887 },

pages = { 23-30 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume157/number9/26860-2017912812/ },

doi = { 10.5120/ijca2017912812 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-07T00:03:29.017804+05:30

%A Mayuri Verma

%T Cluster based Ranking Index for Enhancing Recruitment Process using Text Mining and Machine Learning

%J International Journal of Computer Applications

%@ 0975-8887

%V 157

%N 9

%P 23-30

%D 2017

%I Foundation of Computer Science (FCS), NY, USA

Abstract

This paper presents an effective approach for extracting relevant words from the resumes using Term Document Matrix. The role of the candidate, various skills, familiarity with various frameworks, experienced skills and operating systems have been considered. A clustering methodology has been used to find the similar resumes. The importance of each word has been calculated according to the cluster which makes this paper unique. The appropriate rank of the resumes have been calculated. The experimental results shows that Cluster Based Ranking gives the potentially best candidate for a particular job profile. The weighted importance in calculating the ranks is the very first effort in itself. Further work can be done in this area for improving the productivity in the recruitment process.

References

https://www.linkedin.com
http://www.monsterindia.com/
https://www.naukri.com/
http://www.indeed.com/resumes
Yu, Kun, Gang Guan, and Ming Zhou. "Resume information extraction with cascaded hybrid model." Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics. Association for Computational Linguistics, 2005.
Chandola, Divyanshu, et al. "ONLINE RESUME PARSING SYSTEM USING TEXT ANALYTICS."
Kopparapu, Sunil Kumar. "Automatic extraction of usable information from unstructured resumes to aid search." Progress in Informatics and Computing (PIC), 2010 IEEE International Conference on. Vol. 1. IEEE, 2010.
Zhi Xiang Jiang, Chuang Zhang, Bo Xiao, Zhiqing Lin, “Research and Implementation of Intelligent Chinese Resume Parsing”, WRI International Conference on Communications and Mobile Computing, Jan 2009.
Zhang Chuang, Wu Ming, Li Chun Guang, Xiao Bo, “Resume Parser: Semi-structured Chinese Document Analysis”, WRI World Congress on Computer Science and Information Engineering, April 2009.
Celik Duygu, Karakas Askyn, Bal Gulsen, Gultunca Cem, “Towards an Information Extraction System Based on Ontology to Match Resumes and Jobs”, IEEE 37th Annual Workshops on Computer Software and Applications Conference Workshops, July 2013.
Feldman, Ronen, and James Sanger. The text mining handbook: advanced approaches in analyzing unstructured data. Cambridge University Press, 2007.
Manning, Christopher D., and Hinrich Schütze. Foundations of statistical natural language processing. Vol. 999. Cambridge: MIT press, 1999.
https://cran.r-project.org/web/packages/tm/tm.pdf
https://en.wikipedia.org/wiki/Uniform_Resource_Identifier
Hartigan, John A., and Manchek A. Wong. "Algorithm AS 136: A k-means clustering algorithm." Journal of the Royal Statistical Society. Series C (Applied Statistics) 28.1 (1979): 100-108.
Tibshirani, Robert, Guenther Walther, and Trevor Hastie. "Estimating the number of clusters in a data set via the gap statistic." Journal of the Royal Statistical Society: Series B (Statistical Methodology) 63.2 (2001): 411-423.
Liu, Huan, and Hiroshi Motoda, eds. Computational methods of feature selection. CRC Press, 2007.
https://en.wikipedia.org/wiki/Euclidean_distance

Index Terms

Computer Science

Information Sciences

Keywords

Resume K Means ReliefF