Enhancing Online Recruitment Fraud Detection: A Comparative Analysis of Gradient Boosting and Transformer Architectures under Severe Class Imbalance

Azizur Rahman; Nakib Uddin Ahmed

Research Article

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 187 - Number 91

Year of Publication: 2026

Authors: Azizur Rahman, Nakib Uddin Ahmed

10.5120/ijca2026926617

Azizur Rahman, Nakib Uddin Ahmed. Enhancing Online Recruitment Fraud Detection: A Comparative Analysis of Gradient Boosting and Transformer Architectures under Severe Class Imbalance. International Journal of Computer Applications. 187, 91 (Mar 2026), 1-10. DOI=10.5120/ijca2026926617

@article{10.5120/ijca2026926617,
  author = {Azizur Rahman and Nakib Uddin Ahmed},
  title = {Enhancing Online Recruitment Fraud Detection: A Comparative Analysis of Gradient Boosting and Transformer Architectures under Severe Class Imbalance},
  journal = {International Journal of Computer Applications},
  issue_date = {Mar 2026},
  volume = {187},
  number = {91},
  month = {Mar},
  year = {2026},
  issn = {0975-8887},
  pages = {1-10},
  numpages = {9},
  url = {https://ijcaonline.org/archives/volume187/number91/enhancing-online-recruitment-fraud-detection-a-comparative-analysis-of-gradient-boosting-and-transformer-architectures-under-severe-class-imbalance/},
  doi = {10.5120/ijca2026926617},
  publisher = {Foundation of Computer Science (FCS), NY, USA},
  address = {New York, USA}
}

%0 Journal Article
%1 2026-03-20T22:55:41.926473+05:30
%A Azizur Rahman
%A Nakib Uddin Ahmed
%T Enhancing Online Recruitment Fraud Detection: A Comparative Analysis of Gradient Boosting and Transformer Architectures under Severe Class Imbalance
%J International Journal of Computer Applications
%@ 0975-8887
%V 187
%N 91
%P 1-10
%D 2026
%I Foundation of Computer Science (FCS), NY, USA

Abstract

The rapid growth of online recruitment services has greatly simplified the job search, but it has also given rise to fraudulent job advertisements that threaten job seekers' data security and finances. Distinguishing legitimate from fraudulent postings is difficult because fake advertisements use sophisticated language and because real-world data is severely class-imbalanced. This paper presents an in-depth comparative analysis of Machine Learning (ML), Deep Learning (DL), and Transformer-based architectures for automatically detecting fraudulent job postings. A dataset of 17,883 records was used, with text preprocessing that included semantic representation through Word2Vec embeddings. The Synthetic Minority Over-sampling Technique (SMOTE) was applied to address the severe imbalance between legitimate (17,014) and fraudulent (866) samples. A broad range of classifiers was evaluated: Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Decision Tree (DT), XGBoost (XGB), and Logistic Regression (LR), along with Deep Learning models (ANN, LSTM) and state-of-the-art Transformers (BERT, RoBERTa). The experiments showed that ensemble learning and Transformer-based models substantially outperform traditional linear classifiers. In particular, XGBoost delivered the best results, with 99.44% accuracy and an F1-score of 0.99, followed closely by Random Forest (99.37%) and RoBERTa (98.81%). SVM, by contrast, performed poorly, with an accuracy of 50.44%. The results indicate that combining SMOTE with gradient-boosting algorithms or pre-trained Transformers offers a promising framework for protecting the online recruitment ecosystem against fraud.
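
To make the class-imbalance argument concrete, the following is a minimal sketch, not the authors' implementation (the abstract does not describe one). It uses the class counts reported above to show why plain accuracy is a weak metric here, and it illustrates the core interpolation idea behind SMOTE in pure Python. The function names, the toy 2-D vectors standing in for Word2Vec document embeddings, and the parameter values are all assumptions made for illustration.

```python
import math
import random


def majority_baseline_accuracy(n_majority: int, n_minority: int) -> float:
    """Accuracy of a trivial classifier that always predicts the majority class."""
    return n_majority / (n_majority + n_minority)


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic minority points (simplified SMOTE sketch).

    Each synthetic point lies on the line segment between a randomly chosen
    minority point and one of its k nearest minority-class neighbours.
    """
    if len(minority) < 2:
        raise ValueError("SMOTE needs at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority points by Euclidean distance to the base point.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Class counts as reported in the abstract.
n_legitimate, n_fraudulent = 17014, 866
baseline = majority_baseline_accuracy(n_legitimate, n_fraudulent)

# Hypothetical 2-D vectors standing in for embeddings of fraudulent postings.
toy_fraud = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
new_points = smote(toy_fraud, n_new=6, k=2)
```

With these counts, a model that labels every posting as legitimate already reaches roughly 95% accuracy while detecting no fraud at all, which is why the F1-score matters alongside accuracy. In practice, oversampling is normally applied to the training split only, so that synthetic points do not leak into the evaluation data; a maintained library implementation would usually be preferred over a hand-written sketch like this one.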

Index Terms

Computer Science

Information Sciences

Keywords

Online Recruitment Fraud, Natural Language Processing, Synthetic Minority Over-sampling Technique, XGBoost, Transformers