The Nyquist–Shannon sampling theorem is a fundamental result in the field of information theory, in particular telecommunications and signal processing.
The theorem is commonly called Shannon's sampling theorem, and is also known as Nyquist–Shannon–Kotelnikov, Whittaker–Shannon–Kotelnikov, Whittaker–Nyquist–Kotelnikov–Shannon, WKS, etc., sampling theorem. In addition to its mathematical originator E. T. Whittaker, and its American engineering originators Claude Shannon and Harry Nyquist, it is also attributed to the Russian engineering originator V. A. Kotelnikov and sometimes to its German engineering originators Karl Küpfmüller and H. Raabe. It is often referred to as simply the sampling theorem. See the historical background section.
Sampling is the process of converting a signal (e.g., a function of continuous time or space) into a numeric sequence (a function of discrete time or space). The theorem states conditions under which the samples represent no loss of information and can therefore be used to reconstruct the original signal with arbitrarily good fidelity. It states that unambiguous reconstruction is possible if the signal is bandlimited and the sampling frequency is greater than twice the signal bandwidth.
A signal that is bandlimited is constrained in terms of how fast it can change and therefore how much detail it can convey in between discrete moments of time. The sampling theorem means that the discrete samples are a complete representation of the signal if the bandwidth is less than half the sampling rate, which is referred to as the Nyquist frequency. Frequency components that are at or above the Nyquist frequency are subject to a phenomenon called aliasing, which is undesirable in most applications. The severity of the problem depends on the relative strength of the aliased components.
To formalize these concepts, let represent a real-valued continuous-time signal and let represent its unitary Fourier transform (to the domain of ordinary frequency, Hz). That is:
Suppose is bandlimited, such that the highest nonzero is at frequency (as in the figure). That is:
Then the condition for alias-free sampling at rate (in samples per second) is:
or equivalently:
The time between successive samples is referred to as the sampling interval. It is given by:
And the samples of are denoted by:
The sampling theorem also states a procedure for obtaining the original from its samples , which works exactly when the above condition is satisfied.
From a signal processing perspective, the theorem describes two processes; a sampling process, in which a continuous time signal is converted to a discrete time signal, and a reconstruction process, in which the continuous signal is recovered from the discrete signal.
Let us assume that the continuous signal varies over time and that the sampling process is done by simply measuring the continuous signal's value every T seconds, which is called the sampling interval (in practice, the sampling interval is typically quite small, on the order of milliseconds or even microseconds). This results in a sequence of numbers which can be said to represent the original signal in one way or another. Let us call the elements of this sequence samples. Notice that each sample is associated to the specific point in time where it was measured. Notice also that 1/T can be interpreted as a sampling frequency, which is often represented by the symbol fs and measured in samples per second, or equivalently, hertz.
Let us also assume that the reconstruction process is done by somehow interpolating a continuous/analog signal from the samples.
A very practical question would be to ask: under what circumstances is it possible to reconstruct the original signal completely and exactly (perfect reconstruction)?
The answer is provided by the sampling theorem, which can be interpreted as two parts:
Note that if the original signal contains a frequency component exactly equal to one-half the sampling rate, the condition is not satisfied, and the resulting reconstructed signal may or may not have a component at that frequency, which will in general not match the original component.
This reconstruction with sinc functions is not the only, or necessarily best, reconstruction, but it is uniquely the one that satisfies the stated property. If the original signal is bandlimited to some other band away from zero (DC) frequency, then other procedures and conditions can apply to make an appropriate generalization of the sampling theorem to that situation.
A few practical conclusions can be drawn from the theorem:
To prevent or reduce aliasing, two things can be done:
The anti-aliasing filter is to restrict the bandwidth of the signal to satisfy the sampling condition. Such a restriction works in theory, but is not precisely satisfiable in reality, because real filters will always allow some leakage of high frequencies. However, the leakage energy can be small enough that the aliasing effects are negligible.
The sampling theorem is usually formulated for functions of a single variable. Consequently, the theorem is directly applicable to time-dependent signals and is normally formulated in that context. However, the sampling theorem can be extended in a straightforward way to functions of arbitrarily many variables. Grayscale images, for example, are often represented as two-dimensional arrays (or matrices) of real numbers representing the relative intensities of pixels (picture elements) located at the intersections of row and column sample locations. As a result, images require two independent variables, or indices, to specify each pixel uniquely — one for the row, and one for the column.
Color images typically consist of a composite of three separate grayscale images, one to represent each of the three primary colors — red, green, and blue, or RGB for short. Other colorspaces using 3-vectors for colors include HSV, LAB, XYZ, etc. Some colorspaces such as cyan, magenta, yellow, and black (CMYK) may represent color by four dimensions. All of these are treated as vector-valued functions over a two-dimensional sampled domain.
Similar to one-dimensional discrete-time signals, images can also suffer from aliasing if the sampling resolution, or pixel density, is inadequate. For example, a digital photograph of a striped shirt with high frequencies (in other words, the distance between the stripes is small), can cause aliasing of the shirt when it is sampled by the camera's image sensor. The aliasing appears as a Moiré pattern. The "solution" to higher sampling in the spatial domain for this case would be to move closer to the shirt or use a higher resolution sensor.
Another example is shown to the right in the brick patterns. The top image shows the effects when the sampling theorem's condition is not satisfied. When software rescales an image (the same process that creates the thumbnail shown in the lower image) it, in effect, runs the image through a low-pass filter first and then downsamples the image to result in a smaller image that does not exhibit the Moiré pattern. The top image is what happens when the image is downsampled without low-pass filtering: aliasing results.
The top image was created by zooming out in GIMP and then taking a screenshot of it. The likely reason that this causes a banding problem is that the zooming feature simply downsamples without low-pass filtering (probably for performance reasons) since the zoomed image is for on-screen display instead of printing or saving.
The application of the sampling theorem to images should not be made without care. For example, the sampling process in any standard image sensor (CCD or CMOS camera) is relatively far from the ideal sampling which would measure the image intensity at a single point. Instead these devices have a relatively large sensor area at each sample point in order to obtain sufficient amount of light. Also, it is not obvious that the analog image intensity function which is sampled by the sensor device is bandlimited. It should be noted, however, that the non-ideal sampling is itself a type of low-pass filter, although far from one that ideally removes high frequency components. Despite images having these problems in relation to the sampling theorem, the theorem can be used to describe the basics of down and up sampling of images.
As an example, consider this family of signals at the critical frequency:
Where the samples:
are in every case just alternating –1 and +1 for any phase θ or sine component amplitude A; thus there is no way to determine both the amplitude and the phase of the continuous-time sinusoid x(t) that x* was sampled from. This ambiguity is the reason for the strict inequality of the sampling condition.
The Nyquist-Shannon sampling and reconstruction theorem asserts that, given a bandlimited continuous-time signal, x(t), that is uniformly sampled at a sufficient rate, even if all of the information in the signal between samples is discarded, there remains sufficient information in the samples so that the original continuous-time signal can be mathematically reconstructed from only those samples. To prove this, a mathematical representation of the uniform sampled signal that effectively discards the information between samples must be constructed:
It is clear that ΔT(t) takes a value of zero except for values of t that are at the sampling instances, nT, for integer n. Equally clear that xs(t) takes on zero values for all t except for the sampling instances nT. Multiplying x(t) by ΔT(t) effectively discards all of the information between sampling instances and retains information only at the sampling instances nT. xs(t) can be represented in terms of the samples:
Where x(nT) are the samples. As shown above, xs(t) can also be written as:
Using the frequency shifting property of the continuous Fourier transform,
where X(f) is the Fourier transform of x(t). This says that the spectrum of the signal being sampled is shifted and repeated forever at integral multiples of the sampling frequency, fs. Now constrain x(t) or X(f) to be bandlimited to fH (i.e. X(f) = 0 for all |f| > fH) and consider what condition would allow no overlap of the tails of adjacent images X(f):
With that condition satisfied, there is no overlap of images and X(f) (and thus x(t)) can be reconstructed from Xs(f) (or xs(t)) by low pass filtering out all of the images of X(f) in Xs(f) except the original image at the baseband. To do that, fs > 2fH (to prevent overlap) and the frequency response of the reconstruction filter H(f) must be:
With H(f) so defined, it is clear that
and the spectrum of the original signal that was sampled is recovered from the spectrum of the sampled signal. This means, in the time domain, that the original signal that was sampled is recovered from the sampled signal. This is the sampling half of the Nyquist-Shannon sampling and reconstruction theorem. It says that the sampling frequency, fs, must be strictly greater than twice the bandwidth, fH, of the continuous-time signal, x(t), for no information to be lost (or "aliased"). The reconstruction half follows.
The impulse response of the reconstruction filter is the inverse Fourier transform of H(f):
This function is the impulse response of the reconstruction filter that has for its input, the sampled signal xs(t):
xs(t) is no more than a collection of dirac impulses, δ(t-nT), each delayed to the time of their sampling instance, nT and weighted by a value proportional to the continuous-time signal that was sampled at that instance, x(nT). Since the reconstruction filter is a linear, time-invariant system, each impulse at time nT generates its own impulse response delayed to the same time, and the output of the reconstruction filter is the sum of outputs driven by each weighted impulse separately. For each input impulse, the component of the output is the impulse response delayed by the same delay of that input impulse, h(t-nT), and weighted by the same scaling factor attached to that input impulse, T•x(nT). That is, the output of the reconstruction filter is:
This shows explicitly how the samples of the original continuous-time function, x(nT), are combined to reconstruct that function, x(t), at times that are between the samples when such data was discarded in the sampling process and is known as the Whittaker–Shannon interpolation formula. This completes the reconstruction half of the Nyquist-Shannon sampling and reconstruction theorem.
In summary, assuming an adequate sampling rate, fs > 2fH, the spectrum of the original input, X(f), is identical to the baseband image of the sampled signal, Xs(f), and the ideal reconstruction filter simply discards all of the other images in Xs(f), where k≠0, leaving the baseband image untouched.
When considering the effect of practical reconstruction (dirac impulses are decidedly not practical), the question one asks is "what device, perhaps an LTI system, must be placed between the ideal sampled signal xs(t), and the practical signal outputted (before the final reconstruction filter removes the images other than the baseband image) by some device such as a digital-to-analog converter?" This is the basis for modeling and understanding the effect of the zero-order hold on the most common practical means of reconstruction of the sampled signal.
And the corresponding interpolation function is:
The sampling theorem was implied by the work of Harry Nyquist in 1928 ("Certain topics in telegraph transmission theory"), in which he showed that up to 2B independent pulse samples could be sent through a system of bandwidth B; but he did not explicity consider the problem of sampling and reconstruction of continuous signals. About the same time, Karl Küpfmüller showed a similar result, and discussed the sinc-function impulse response of a band-limiting filter, via its integral, the step response Integralsinus; this bandlimiting and reconstruction filter that is so central to the sampling theorem is sometimes referred to as a Küpfmüller filter (but seldom in English).
The sampling theorem, essentially a dual of Nyquist's result, was proved by Claude E. Shannon in 1949 ("Communication in the presence of noise"). V. A. Kotelnikov published similar results in 1933 ("On the transmission capacity of the 'ether' and of cables in electrical communications", translation from the Russian), as did the mathematician E. T. Whittaker in 1915 ("Expansions of the Interpolation-Theory," "Theorie der Kardinalfunktionen"), J. M. Whittaker in 1935 ("Interpolatory function theory"), and Gabor in 1946 ("Theory of communication").
Others who have independently discovered or played roles in the development of the sampling theorem have been discussed in several historical articles, for example by JerriJerri, Abdul J., "The Shannon Sampling Theorem—Its Various Extensions and Applications: A Tutorial Review," Proceedings of the IEEE, 65:1565–1595, Nov. 1977. and by LükeLüke, Hans Dieter, "The Origins of the Sampling Theorem," IEEE Communications Magazine, pp.106–108, April 1999.. For example, Lüke points out that H. Raabe, an assistant to Küpfmüller, proved the theorem in his 1939 Ph.D. dissertation; the term Raabe condition came to be associated with the criterion for unambiguous representation (sampling rate greater than twice the bandwidth).
Digital signal processing | Information theory | Mathematical theorems | Claude Shannon
مبرهنة الاستعيان | Nyquist-Shannon-Abtasttheorem | Teorema de muestreo de Nyquist-Shannon | Théorème d'échantillonnage de Nyquist-Shannon | Teorema del campionamento di Nyquist-Shannon | תורת הדגימה (עיבוד אותות) | Bemonsteringstheorema van Nyquist-Shannon | 標本化定理 | Twierdzenie Kotielnikowa-Shannona | Теорема отсчётов Уиттакера — Найквиста — Котельникова — Шеннона | Nyquistin teoreema
This article is licensed under the GNU Free Documentation License.
It uses material from the
"Nyquist–Shannon sampling theorem".
Home Page • arts • business • computers • games • health • hospitals • home • kids & teens • news • physicians • recreation• reference • regional • science • shopping • society • sports • world