TY - JOUR
T1 - A novel approach to explain the black-box nature of machine learning in compressive strength predictions of concrete using Shapley additive explanations (SHAP)
AU - Ekanayake, I. U.
AU - Meddage, D. P.P.
AU - Rathnayake, Upaka
N1 - Publisher Copyright:
© 2022 The Authors
PY - 2022/6
Y1 - 2022/6
N2 - Machine learning (ML) techniques are often employed for the accurate prediction of the compressive strength of concrete. Despite higher accuracy, previous ML models failed to interpret the rationale behind predictions. Model interpretability is essential to appeal to the interest of domain experts. Therefore, overcoming research gaps identified, this research study proposes a way to predict the compressive strength of concrete using supervised ML algorithms (Decision tree, Extra tree, Adaptive boost (AdaBoost), Extreme gradient boost (XGBoost), Light gradient boosting method (LGBM), and Laplacian Kernel Ridge Regression (LKRR). Alternatively, SHapley Additive exPlainations (SHAP) – a novel black-box interpretation approach - was employed to elucidate the predictions. The comparison revealed that tree-based algorithms and LKRR provide acceptable accuracy for compressive strength predictions. Moreover, XGBoost and LKRR algorithms evinced superior performance (R = 0.98). According to SHAP interpretation, XGBoost predictions capture complex relationships among the constituents. On the other hand, SHAP provides unified measures on feature importance and the impact of a variable for a prediction. Interestingly, SHAP interpretations were in accordance with what is generally observed in the compressive behavior of concrete, thus validating the causality of ML predictions.
AB - Machine learning (ML) techniques are often employed for the accurate prediction of the compressive strength of concrete. Despite higher accuracy, previous ML models failed to interpret the rationale behind predictions. Model interpretability is essential to appeal to the interest of domain experts. Therefore, overcoming research gaps identified, this research study proposes a way to predict the compressive strength of concrete using supervised ML algorithms (Decision tree, Extra tree, Adaptive boost (AdaBoost), Extreme gradient boost (XGBoost), Light gradient boosting method (LGBM), and Laplacian Kernel Ridge Regression (LKRR). Alternatively, SHapley Additive exPlainations (SHAP) – a novel black-box interpretation approach - was employed to elucidate the predictions. The comparison revealed that tree-based algorithms and LKRR provide acceptable accuracy for compressive strength predictions. Moreover, XGBoost and LKRR algorithms evinced superior performance (R = 0.98). According to SHAP interpretation, XGBoost predictions capture complex relationships among the constituents. On the other hand, SHAP provides unified measures on feature importance and the impact of a variable for a prediction. Interestingly, SHAP interpretations were in accordance with what is generally observed in the compressive behavior of concrete, thus validating the causality of ML predictions.
KW - Compressive strength
KW - Explainable machine learning
KW - Laplacian kernel Ridge Regression
KW - SHAP explanation
KW - Tree-based regression
UR - http://www.scopus.com/inward/record.url?scp=85127831520&partnerID=8YFLogxK
U2 - 10.1016/j.cscm.2022.e01059
DO - 10.1016/j.cscm.2022.e01059
M3 - Article
AN - SCOPUS:85127831520
SN - 2214-5095
VL - 16
JO - Case Studies in Construction Materials
JF - Case Studies in Construction Materials
M1 - e01059
ER -