Competition on Spatial Statistics for Large Datasets

Huang Huang, Sameh Abdulah, Ying Sun, Hatem Ltaief, David E. Keyes, Marc G. Genton

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

As spatial datasets are becoming increasingly large and unwieldy, exact inference on spatial models becomes computationally prohibitive. Various approximation methods have been proposed to reduce the computational burden. Although comprehensive reviews on these approximation methods exist, comparisons of their performances are limited to small and medium sizes of datasets for a few selected methods. To achieve a comprehensive comparison comprising as many methods as possible, we organized the Competition on Spatial Statistics for Large Datasets. This competition had the following novel features: (1) we generated synthetic datasets with the ExaGeoStat software so that the number of generated realizations ranged from 100 thousand to 1 million; (2) we systematically designed the data-generating models to represent spatial processes with a wide range of statistical properties for both Gaussian and non-Gaussian cases; (3) the competition tasks included both estimation and prediction, and the results were assessed by multiple criteria; and (4) we have made all the datasets and competition results publicly available to serve as a benchmark for other approximation methods. In this paper, we disclose all the competition details and results along with some analysis of the competition outcomes.

ASJC Scopus subject areas

  • General Agricultural and Biological Sciences
  • General Environmental Science
  • Applied Mathematics
  • Statistics and Probability
  • Statistics, Probability and Uncertainty
  • Agricultural and Biological Sciences (miscellaneous)

Fingerprint

Dive into the research topics of 'Competition on Spatial Statistics for Large Datasets'. Together they form a unique fingerprint.

Cite this