TY - JOUR
T1 - Improving Bayesian Local Spatial Models in Large Datasets
AU - Lenzi, Amanda
AU - Castruccio, Stefano
AU - Rue, Haavard
AU - Genton, Marc G.
N1 - KAUST Repository Item: Exported on 2020-10-29
Acknowledged KAUST grant number(s): OSR-2018-CRG7-3742.
Acknowledgements: This publication is based on work supported by the King abdullah university of science and technology (KAUST), Office of sponsored research OSR under award no. OSR-2018-CRG7-3742.
PY - 2020/9/1
Y1 - 2020/9/1
N2 - Environmental processes resolved at a sufficiently small scale in space and time inevitably display nonstationary behavior. Such processes are both challenging to model and computationally expensive when the data size is large. Instead of modeling the global non-stationarity explicitly, local models can be applied to disjoint regions of the domain. The choice of the size of these regions is dictated by a bias-variance trade-off; large regions will have smaller variance and larger bias, whereas small regions will have higher variance and smaller bias. From both the modeling and computational point of view, small regions are preferable to better accommodate the non-stationarity. However, in practice, large regions are necessary to control the variance. We propose a novel Bayesian three-step approach that allows for smaller regions without compromising the increase of the variance that would follow. We are able to propagate the uncertainty from one step to the next without issues caused by reusing the data. The improvement in inference also results in improved prediction, as our simulated example shows. We illustrate this new approach on a dataset of simulated high-resolution wind speed data over Saudi Arabia. Supplemental files for this article are available online.
AB - Environmental processes resolved at a sufficiently small scale in space and time inevitably display nonstationary behavior. Such processes are both challenging to model and computationally expensive when the data size is large. Instead of modeling the global non-stationarity explicitly, local models can be applied to disjoint regions of the domain. The choice of the size of these regions is dictated by a bias-variance trade-off; large regions will have smaller variance and larger bias, whereas small regions will have higher variance and smaller bias. From both the modeling and computational point of view, small regions are preferable to better accommodate the non-stationarity. However, in practice, large regions are necessary to control the variance. We propose a novel Bayesian three-step approach that allows for smaller regions without compromising the increase of the variance that would follow. We are able to propagate the uncertainty from one step to the next without issues caused by reusing the data. The improvement in inference also results in improved prediction, as our simulated example shows. We illustrate this new approach on a dataset of simulated high-resolution wind speed data over Saudi Arabia. Supplemental files for this article are available online.
UR - http://hdl.handle.net/10754/660681
UR - https://www.tandfonline.com/doi/full/10.1080/10618600.2020.1814789
UR - http://www.scopus.com/inward/record.url?scp=85092726898&partnerID=8YFLogxK
U2 - 10.1080/10618600.2020.1814789
DO - 10.1080/10618600.2020.1814789
M3 - Article
SN - 1537-2715
SP - 1
EP - 11
JO - Journal of Computational and Graphical Statistics
JF - Journal of Computational and Graphical Statistics
ER -