•  
  •  
 

Keywords

Software effort estimation, Machine learning, Feature selection, Predictive modeling, Data analysis

Article Type

Original Article

Abstract

Software effort estimation is a critical process in the management of software projects, essential for predicting the resources and time required for application development. Accurate estimation is paramount for delivering projects on time and computational resources. This paper investigates techniques that utilize machine learning models to ameliorate software effort estimation. We employ four popular machine learning models including the Linear Regression, Gradient Boosting, Random Forest, and Decision Tree and evaluate their performance across five diverse climate-related datasets: Antarctica Hotpoints/Climate Change NASA, Crop Production and Climate Change, Global Warming on Earth (Global Temp), Climate Change, and Climate Treasure (Marine). A key focus is the influence of correlation-based feature selection on model reliability and predictive accuracy. For evaluation, R-squared (R2) and Root Mean Squared Error (RMSE) are employed. The outcomes delineate that correlation-based feature selection consistently and substantially optimizes the performance of all models across these datasets. Linear Regression demonstrated exceptional performance on datasets with predominantly linear relationships, achieving a near-perfect R2 of 0.999 and an RMSE of 0.0000000007 on Crop Production, and an R2 of 0.98 and RMSE of 6.3 on Climate Treasure (Marine). Ensemble models, Gradient Boosting and Random Forest, consistently delivered strong results on more complex datasets, with Gradient Boosting achieving an R2 of 0.94 and RMSE of 0.12 on Antarctica Hotpoints, and Random Forest an R2 of 0.70 and RMSE of 22900.00 on Global Warming. This juxtaposed analysis provides a solid framework for future research, enabling project managers to build more fact-based effort predictions.

Creative Commons License

Creative Commons Attribution 4.0 International License
This work is licensed under a Creative Commons Attribution 4.0 International License.

Share

COinS