Prediction for Big Data Through Kriging: Small Sequential and One-Shot Designs

12Citations
Citations of this article
22Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Kriging—or Gaussian process (GP) modeling—is an interpolation method assuming that the outputs (responses) are more correlated, as the inputs (explanatory or independent variables) are closer. Such a GP has unknown (hyper)parameters that are usually estimated through the maximum-likelihood method. Big data, however, make it problematic to compute these estimated parameters, and the corresponding Kriging predictor and its predictor variance. To solve this problem, some authors select a relatively small subset from the big set of previously observed “old” data. These selection methods are sequential, and they depend on the variance of the Kriging predictor; this variance requires a specific Kriging model and the estimation of its parameters. The resulting designs turn out to be “local”; i.e., most selected old input combinations are concentrated around the new combination to be predicted. We develop a simpler one-shot (fixed-sample, non-sequential) design; i.e., from the big data set we select a small subset with the nearest neighbors of the new combination. To compare our designs and the sequential designs empirically, we use the squared prediction errors, in several numerical experiments. These experiments show that our design may yield reasonable performance.

References Powered by Scopus

Choosing the sample size of a computer experiment: A practical guide

506Citations
N/AReaders
Get full text

Big data analytics in supply chain management: A state-of-the-art literature review

404Citations
N/AReaders
Get full text

Local Gaussian Process Approximation for Large Computer Experiments

248Citations
N/AReaders
Get full text

Cited by Powered by Scopus

A Survey on High-dimensional Gaussian Process Modeling with Application to Bayesian Optimization

72Citations
N/AReaders
Get full text

On physics-informed data-driven isotropic and anisotropic constitutive models through probabilistic machine learning and space-filling sampling

72Citations
N/AReaders
Get full text

Local approximate Gaussian process regression for data-driven constitutive models: development and comparison with neural networks

51Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Kleijnen, J. P. C., & van Beers, W. C. M. (2020). Prediction for Big Data Through Kriging: Small Sequential and One-Shot Designs. American Journal of Mathematical and Management Sciences, 39(3), 199–213. https://doi.org/10.1080/01966324.2020.1716281

Readers over time

‘20‘21‘22‘23‘24‘2502468

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 11

79%

Professor / Associate Prof. 3

21%

Readers' Discipline

Tooltip

Engineering 8

57%

Computer Science 3

21%

Mathematics 2

14%

Energy 1

7%

Save time finding and organizing research with Mendeley

Sign up for free
0