Smoothing noisy data with spline functions - Estimating the correct degree of smoothing by the method of generalized cross-validation

2.6kCitations
Citations of this article
320Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Smoothing splines are well known to provide nice curves which smooth discrete, noisy data. We obtain a practical, effective method for estimating the optimum amount of smoothing from the data. Derivatives can be estimated from the data by differentiating the resulting (nearly) optimally smoothed spline. We consider the model yi(ti)+εi, i=1, 2, ..., n, ti∈[0, 1], where g∈W2(m)={f:f, f′, ..., f(m-1) abs. cont., f(m)∈ℒ2[0,1]}, and the {εi} are random errors with Eεi=0, Eεiεj=σ2δij. The error variance σ2 may be unknown. As an estimate of g we take the solution gn, λ to the problem: Find f∈W2(m) to minimize {Mathematical expression}. The function gn, λ is a smoothing polynomial spline of degree 2 m-1. The parameter λ controls the tradeoff between the "roughness" of the solution, as measured by {Mathematical expression}, and the infidelity to the data as measured by {Mathematical expression}, and so governs the average square error R(λ; g)=R(λ) defined by {Mathematical expression}. We provide an estimate {Mathematical expression}, called the generalized cross-validation estimate, for the minimizer of R(λ). The estimate {Mathematical expression} is the minimizer of V(λ) defined by {Mathematical expression}, where y=(y1, ..., yn)t and A(λ) is the n×n matrix satisfying (gn, λ (t1), ..., gn, λ (tn))t=A (λ) y. We prove that there exist a sequence of minimizers {Mathematical expression} of EV(λ), such that as the (regular) mesh {ti}i=1n becomes finer, {Mathematical expression}. A Monte Carlo experiment with several smooth g's was tried with m=2, n=50 and several values of σ2, and typical values of {Mathematical expression} were found to be in the range 1.01-1.4. The derivative g′ of g can be estimated by {Mathematical expression}. In the Monte Carlo examples tried, the minimizer of {Mathematical expression} tended to be close to the minimizer of R(λ), so that {Mathematical expression} was also a good value of the smoothing parameter for estimating the derivative. © 1979 Springer-Verlag.

References Powered by Scopus

Theory of reproducing kernels

4081Citations
N/AReaders
Get full text

Some comments on C<inf>p</inf>

2542Citations
N/AReaders
Get full text

Singular value decomposition and least squares solutions

2112Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Deep Learning in neural networks: An overview

14272Citations
N/AReaders
Get full text

Generalized Additive Models: An Introduction with R

6563Citations
N/AReaders
Get full text

Variable selection via nonconcave penalized likelihood and its oracle properties

6475Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Craven, P., & Wahba, G. (1978). Smoothing noisy data with spline functions - Estimating the correct degree of smoothing by the method of generalized cross-validation. Numerische Mathematik, 31(4), 377–403. https://doi.org/10.1007/BF01404567

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 154

64%

Researcher 49

20%

Professor / Associate Prof. 28

12%

Lecturer / Post doc 9

4%

Readers' Discipline

Tooltip

Engineering 48

31%

Computer Science 44

28%

Mathematics 40

25%

Agricultural and Biological Sciences 25

16%

Article Metrics

Tooltip
Mentions
News Mentions: 1
References: 4
Social Media
Shares, Likes & Comments: 8

Save time finding and organizing research with Mendeley

Sign up for free