Automation in Construction, cilt.131, 2021 (SCI-Expanded)
© 2021 Elsevier B.V.The construction industry is among the riskiest industries around the world. Hence, the preliminary studies exploring the consequences of occupational accidents have received considerable attention in research society. This study aims to develop a comprehensive framework to predict the post-accident disability status of construction workers. The dataset comprising 47,938 construction accidents recorded in Turkey was subjected to a detailed multi-step feature engineering approach, including data encoding, data scaling, dimension reduction, and data resampling. Predictions were performed through four tree-based ensemble machine learning models: Random Forest, XGBoost, AdaBoost, and Extra Trees, as well as a state-of-the-art optimization method for hyperparameter tuning, Genetic Algorithm (GA). GA-XGBoost presented the highest prediction rate with 0.8292 in terms of accuracy and 0.8120 with respect to AUROC. The findings may aid in predicting construction workers' post-accident disability status, resulting in a safer working environment and productivity planning in construction projects.