Kriging is a regression technique used in geostatistics to approximate or interpolate data. The theory of Kriging was developed from the seminal work of its inventor, Danie G. Krige and further developed by Georges Matheron. In the statistical community, it is also known as Gaussian process regression. Kriging is also a reproducing kernel method (like splines and support vector machines).
Figure: example of one-dimensional data interpolation by Kriging, with confidence intervals
Kriging can be understood as linear prediction or a form of Bayesian inference. Kriging starts with a prior distribution over functions. This prior takes the form of a Gaussian process: samples from a function will be normally distributed, where the covariance between any two samples is the covariance function (or kernel) of the Gaussian process evaluated at the spatial location of two points.
A set of values are then observed, each value associated with a spatial location. Now, a new value can be predicted at any new spatial location, by combining the Gaussian prior with a Gaussian likelihood function for each of the observed values. The resulting posterior distribution is also a Gaussian, with a mean and covariance that can be simply computed from the observed values, their variance, and the kernel matrix derived from the prior.
From the geological point of view, Kriging uses prior knowledge about the spatial distribution of a mineral: this prior knowledge encapsulates how minerals co-occur as a function of space. Then, given a series of measurements of mineral concentrations, Kriging can predict mineral concentrations at unobserved points.
Kriging is a family of linear least squares estimation algorithms. The end result of Kriging is to obtain the conditional expectation as a best estimate for all unsampled locations in a field and consequently, a minimized error variance at each location. The conditional expectation minimizes the error variance when the optimality criterion is based on least squares residuals. The Kriging estimate is a weighted linear combination of the data. The weights that are assigned to each known datum are determined by solving the Kriging system of linear equations, where the weights are the unknown regression parameters. The optimality criterion used to arrive at the Kriging system, as mentioned above, is a minimization of the error variance in the least-squares sense.
There is some controversy over the assumption of spatial dependency in kriging. For example, Clarkâs hypothetical uranium datain Practical Geostatistics do not display a significant degree of spatial dependence but the author reports a kriged estimate for some selected coordinates within this sample space anyway. The practice of kriging lends itself to abuse, particularly when applied to a model ore distribution based on the assumption that ore concentrations display a significant degree of spatial dependency in the sample space under examination, which can then be modelled by a Gaussian process. However, some practitioners question the assumption that spatial dependence follows a stochastic process, and that the stochastic process can be correctly estimated from an empirical variogram. Other practitioners recommend using statistical tests to test the assumption of spatial dependency.. For example, in the figure above, the function fits the graph perfectly, but the primary data set may not display a statistically significant degree of spatial dependence. Failing to pass a test for spatial dependence would indicate that a constant model cannot be distinguished from a kriging model without further information or knowledge. Armstrong and Champigny's A Study on Kriging Small Blocks cautions against oversmoothing when the authors noticed that kriging variances converge on the zero kriging variance and kriging covariances on the unity kriging covariance as subsets of kriged estimates converge on the infinite set of kriged estimates. Unsurprisingly because kriged estimates are functionally dependent and deprived of degrees of freedom.
A series of related terms were also named after Krige, including kriged estimate, kriged estimator, kriging variance, kriging covariance, zero kriging variance, unity kriging covariance, kriging matrix, kriging method, kriging model, kriging plan, kriging process, kriging system, block kriging, co-kriging, disjunctive kriging, linear kriging, ordinary kriging, point kriging, random kriging, regular grid kriging, simple kriging and universal kriging.
See also: Sampling variogram, variogram (also known as a semivariogram).