TY - JOUR
T1 - In-stream Escherichia coli modeling using high-temporal-resolution data with deep learning and process-based models
AU - Abbas, Ather
AU - Baek, Sangsoo
AU - Silvera, Norbert
AU - Soulileuth, Bounsamay
AU - Pachepsky, Yakov
AU - Ribolzi, Olivier
AU - Boithias, Laurie
AU - Cho, Kyung Hwa
N1 - Funding Information:
Acknowledgements. This work was supported by the Korea Environment Industry and Technology Institute (KEITI) through the Aquatic Ecosystem Conservation Research Program, funded by the Korea Ministry of Environment (MOE) (RE202001319). The authors sincerely thank the Lao Department of Agricultural Land Management (DALaM) for its support, including granting the per- mission for field access, and the M-TROPICS Critical Zone Observatory (https://mtropics.obs-mip.fr/, last access: 1 December 2021), which belongs to the French Research Infrastructure OZCAR (http: //www.ozcar-ri.org/, last access: 1 December 2021), for data access. The authors also thank Campus France (PHC STAR 41510WH) for their financial support.
Funding Information:
Financial support. This work was supported by the Korea Environment Industry and Technology Institute (KEITI) through the Aquatic Ecosystem Conservation Research Program, funded by the Korea Ministry of Environment (MOE) (RE202001319).
Publisher Copyright:
© 2021 The Author(s).
PY - 2021/12/6
Y1 - 2021/12/6
N2 - Contamination of surface waters with microbiological pollutants is a major concern to public health. Although long-term and high-frequency Escherichia coli (E. coli) monitoring can help prevent diseases from fecal pathogenic microorganisms, such monitoring is time-consuming and expensive. Process-driven models are an alternative means for estimating concentrations of fecal pathogens. However, process-based modeling still has limitations in improving the model accuracy because of the complexity of relationships among hydrological and environmental variables. With the rise of data availability and computation power, the use of data-driven models is increasing. In this study, we simulated fate and transport of E. coli in a 0.6gkm2 tropical headwater catchment located in the Lao People's Democratic Republic (Lao PDR) using a deep-learning model and a process-based model. The deep learning model was built using the long short-term memory (LSTM) methodology, whereas the process-based model was constructed using the Hydrological Simulation Program-FORTRAN (HSPF). First, we calibrated both models for surface as well as for subsurface flow. Then, we simulated the E. coli transport with 6gmin time steps with both the HSPF and LSTM models. The LSTM provided accurate results for surface and subsurface flow with 0.51 and 0.64 of the Nash-Sutcliffe efficiency (NSE) values, respectively. In contrast, the NSE values yielded by the HSPF were -0.7 and 0.59 for surface and subsurface flow. The simulated E. coli concentrations from LSTM provided the NSE of 0.35, whereas the HSPF gave an unacceptable performance with an NSE value of -3.01 due to the limitations of HSPF in capturing the dynamics of E. coli with land-use change. The simulated E. coli concentration showed the rise and drop patterns corresponding to annual changes in land use. This study showcases the application of deep-learning-based models as an efficient alternative to process-based models for E. coli fate and transport simulation at the catchment scale.
AB - Contamination of surface waters with microbiological pollutants is a major concern to public health. Although long-term and high-frequency Escherichia coli (E. coli) monitoring can help prevent diseases from fecal pathogenic microorganisms, such monitoring is time-consuming and expensive. Process-driven models are an alternative means for estimating concentrations of fecal pathogens. However, process-based modeling still has limitations in improving the model accuracy because of the complexity of relationships among hydrological and environmental variables. With the rise of data availability and computation power, the use of data-driven models is increasing. In this study, we simulated fate and transport of E. coli in a 0.6gkm2 tropical headwater catchment located in the Lao People's Democratic Republic (Lao PDR) using a deep-learning model and a process-based model. The deep learning model was built using the long short-term memory (LSTM) methodology, whereas the process-based model was constructed using the Hydrological Simulation Program-FORTRAN (HSPF). First, we calibrated both models for surface as well as for subsurface flow. Then, we simulated the E. coli transport with 6gmin time steps with both the HSPF and LSTM models. The LSTM provided accurate results for surface and subsurface flow with 0.51 and 0.64 of the Nash-Sutcliffe efficiency (NSE) values, respectively. In contrast, the NSE values yielded by the HSPF were -0.7 and 0.59 for surface and subsurface flow. The simulated E. coli concentrations from LSTM provided the NSE of 0.35, whereas the HSPF gave an unacceptable performance with an NSE value of -3.01 due to the limitations of HSPF in capturing the dynamics of E. coli with land-use change. The simulated E. coli concentration showed the rise and drop patterns corresponding to annual changes in land use. This study showcases the application of deep-learning-based models as an efficient alternative to process-based models for E. coli fate and transport simulation at the catchment scale.
UR - http://www.scopus.com/inward/record.url?scp=85121001663&partnerID=8YFLogxK
U2 - 10.5194/hess-25-6185-2021
DO - 10.5194/hess-25-6185-2021
M3 - Article
AN - SCOPUS:85121001663
SN - 1027-5606
VL - 25
SP - 6185
EP - 6202
JO - Hydrology and Earth System Sciences
JF - Hydrology and Earth System Sciences
IS - 12
ER -