CFP last date
20 February 2025
Reseach Article

Gradient Edge: Advancing Predictive Modelling with Enhanced Gradient Boosting: A Multi-Dataset Approach

by Samuel Benny Varghese, Eldho K.J.
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 49
Year of Publication: 2024
Authors: Samuel Benny Varghese, Eldho K.J.
10.5120/ijca2024924160

Samuel Benny Varghese, Eldho K.J. . Gradient Edge: Advancing Predictive Modelling with Enhanced Gradient Boosting: A Multi-Dataset Approach. International Journal of Computer Applications. 186, 49 ( Nov 2024), 1-6. DOI=10.5120/ijca2024924160

@article{ 10.5120/ijca2024924160,
author = { Samuel Benny Varghese, Eldho K.J. },
title = { Gradient Edge: Advancing Predictive Modelling with Enhanced Gradient Boosting: A Multi-Dataset Approach },
journal = { International Journal of Computer Applications },
issue_date = { Nov 2024 },
volume = { 186 },
number = { 49 },
month = { Nov },
year = { 2024 },
issn = { 0975-8887 },
pages = { 1-6 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number49/gradient-edge-advancing-predictive-modelling-with-enhanced-gradient-boosting-a-multi-dataset-approach/ },
doi = { 10.5120/ijca2024924160 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-11-27T00:39:32.238912+05:30
%A Samuel Benny Varghese
%A Eldho K.J.
%T Gradient Edge: Advancing Predictive Modelling with Enhanced Gradient Boosting: A Multi-Dataset Approach
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 49
%P 1-6
%D 2024
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Using gradient boosting, it assesses the effectiveness of many machine learning algorithms on three datasets: water potability, diabetes, and heart disease. Its goal is to evaluate these models’ ability to forecast various environmental and health events. Since every dataset presents unique difficulties, a thorough understanding of the algorithms’ advantages and disadvantages is possible. The goal of this study’s introduction of Enhanced Gradient Boosting is to advance the field by improving prediction accuracy. The study’s enhanced method will overcome the drawbacks of the traditional Gradient Boosting approach in managing the complexities present in diverse datasets. Standard Gradient Boosting performs poorly on the water potability dataset, particularly when it comes to class separation. For example, class 0 and class 1 model precisions are equivalent to 0.66 and 0.64, respectively, with recall rates of 0.93 and 0.20 and F1-scores of 0.78 and 0.31. The typical Gradient Boosting model performed well, with an accuracy of 0.66, on both the diabetes and heart disease datasets. On the other hand, the new method outperformed the old one, particularly when handling the noisy water potability data. An organized method that includes the following phases is used to construct the Enhanced Gradient Boosting model: data collecting, data preprocessing, EDA, data splitting, and scaling. This end-to-end method shows how creative algorithm creation combined with comparison analysis may lead to tailored machine learning solutions. These findings show how well the algorithm performed on the two datasets and highlight how versatile Gradient Boosting is for solving various prediction issues. The discovered results have great significance not only for applications related to medical diagnostics and environmental monitoring, but also for paving the way for future advancements within the machine learning research framework.

References
  1. Wissam H Alawee, Luttfi A Al-Haddad, Ali Basem, Dheyaa J Jasim, Hasan Sh Majdi, and Abbas J Sultan. Forecasting sustainable water production in convex tubular solar stills using gradient boosting analysis. Desalination and Water Treatment, 318:100344, 2024.
  2. George Bai and Rohitash Chandra. Gradient boosting bayesian neural networks via langevin mcmc. Neurocomputing, 558:126726, 2023.
  3. KP Rasheed Abdul Haq and VP Harigovindan. Water quality prediction for smart aquaculture using hybrid deep learning models. Ieee Access, 10:60078–60098, 2022.
  4. Liangjun Jiang, Zhenhua Xia, Ronghui Zhu, Haimei Gong, Jing Wang, Juan Li, and Lei Wang. Diabetes risk prediction model based on community follow-up data using machine learning. Preventive Medicine Reports, 35:102358, 2023.
  5. Madhuri Kawarkhe and Parminder Kaur. Prediction of diabetes using diverse ensemble learning classifiers. Procedia Computer Science, 235:403–413, 2024.
  6. Wen Li, Wei Wang, and Wenjun Huo. Regboost: a gradient boosted multivariate regression algorithm. International Journal of Crowd Science, 4(1):60–72, 2020.
  7. Shihua Luo and Tianxin Chen. Two derivative algorithms of gradient boosting decision tree for silicon content in blast furnace system prediction. IEEE Access, 8:196112–196122, 2020.
  8. G Manikandan, B Pragadeesh, V Manojkumar, AL Karthikeyan, R Manikandan, and Amir H Gandomi. Classification models combined with boruta feature selection for heart disease prediction. Informatics in Medicine Unlocked, 44:101442, 2024.
  9. Subasish Mohapatra, Sushree Maneesha, Prashanta Kumar Patra, and Subhadarshini Mohanty. Heart diseases prediction based on stacking classifiers model. Procedia Computer Science, 218:1621–1630, 2023.
  10. Temidayo Oluwatosin Omotehinwa, David Opeoluwa Oyewola, and Ervin Gubin Moung. Optimizing the light gradientboosting machine algorithm for an efficient early detection of coronary heart disease. Informatics and Health, 1(2):70–81, 2024.
  11. FE Sghiouer, A Nahli, H Bouka, and M Chlaida. Analysis of the drought effects on the physicochemical and bacteriological quality of the inaouene river water (taza, morocco). Scientific African, page e02328, 2024.
Index Terms

Computer Science
Information Sciences

Keywords

Gradient Boosting Machine Learning Predictive Modelling Healthcare Diagnostics Environmental Monitoring