TY - JOUR
T1 - Machine learning approaches to predict the photocatalytic performance of bismuth ferrite-based materials in the removal of malachite green
AU - Jaffari, Zeeshan Haider
AU - Abbas, Ather
AU - Lam, Sze Mun
AU - Park, Sanghun
AU - Chon, Kangmin
AU - Kim, Eun Sik
AU - Cho, Kyung Hwa
N1 - Funding Information:
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT) [No. 2020R1A4A1019568 and No. 2022R1A2C2006172 ].
Publisher Copyright:
© 2022 Elsevier B.V.
PY - 2023/1/15
Y1 - 2023/1/15
N2 - This study focuses on the potential capability of numerous machine learning models, namely CatBoost, GradientBoosting, HistGradientBoosting, ExtraTrees, XGBoost, DecisionTree, Bagging, light gradient boosting machine (LGBM), GaussianProcess, artificial neural network (ANN), and light long short-term memory (LightLSTM). These models were investigated to predict the photocatalytic degradation of malachite green from wastewater using various NM-BiFeO3 composites. A comprehensive databank of 1200 data points was generated under various experimental conditions. The ten input variables selected were the catalyst type, reaction time, light intensity, initial concentration, catalyst loading, solution pH, humic acid concentration, anions, surface area, and pore volume of various photocatalysts. The MG dye degradation efficiency was selected as the output variable. An evaluation of the performance metrics suggested that the CatBoost model, with the highest test coefficient of determination (0.99) and lowest mean absolute error (0.64) and root-mean-square error (1.34), outperformed all other models. The CatBoost model showed that the photocatalytic reaction conditions were more important than the material properties. The modeling results suggested that the optimized process conditions were a light intensity of 105 W, catalyst loading of 1.5 g/L, initial MG dye concentration of 5 mg/L and solution pH of 7. Finally, the implications and drawbacks of the current study were stated in detail.
AB - This study focuses on the potential capability of numerous machine learning models, namely CatBoost, GradientBoosting, HistGradientBoosting, ExtraTrees, XGBoost, DecisionTree, Bagging, light gradient boosting machine (LGBM), GaussianProcess, artificial neural network (ANN), and light long short-term memory (LightLSTM). These models were investigated to predict the photocatalytic degradation of malachite green from wastewater using various NM-BiFeO3 composites. A comprehensive databank of 1200 data points was generated under various experimental conditions. The ten input variables selected were the catalyst type, reaction time, light intensity, initial concentration, catalyst loading, solution pH, humic acid concentration, anions, surface area, and pore volume of various photocatalysts. The MG dye degradation efficiency was selected as the output variable. An evaluation of the performance metrics suggested that the CatBoost model, with the highest test coefficient of determination (0.99) and lowest mean absolute error (0.64) and root-mean-square error (1.34), outperformed all other models. The CatBoost model showed that the photocatalytic reaction conditions were more important than the material properties. The modeling results suggested that the optimized process conditions were a light intensity of 105 W, catalyst loading of 1.5 g/L, initial MG dye concentration of 5 mg/L and solution pH of 7. Finally, the implications and drawbacks of the current study were stated in detail.
KW - CatBoost
KW - Machine learning
KW - Malachite green
KW - Modeling
KW - Photocatalysis
UR - http://www.scopus.com/inward/record.url?scp=85138822762&partnerID=8YFLogxK
U2 - 10.1016/j.jhazmat.2022.130031
DO - 10.1016/j.jhazmat.2022.130031
M3 - Article
C2 - 36179629
AN - SCOPUS:85138822762
SN - 0304-3894
VL - 442
JO - Journal of hazardous materials
JF - Journal of hazardous materials
M1 - 130031
ER -