Scale space theory is a framework for multi-scale signal representation developed by the computer vision, image processing and signal processing communities. It is a formal theory for handling image structures at different scales in such a way that fine-scale features can be successively suppressed and a scale parameter can be associated with each level in the scale-space representation.
The notion of scale-space is general and applies in arbitrary dimensions. For simplicity of presentation, however, we here describe the most commonly used case with two-dimensional images. For a given image , its linear scale-space representation is a family of derived signals defined by convolution of with the Gaussian kernel
The motivation for generating a scale-space representation of a given data set originates from the basic fact that real-world objects are composed of different structures at different scales. This implies that real-world objects, in contrast to idealized mathematical entities such as points or lines, may appear in different ways depending on the scale of observation. For example, the concept of a "tree" is appropriate at the scale of meters, while concepts such as leaves and molecules are more appropriate at finer scales. For a machine vision system analysing an unknown scene, there is no way to know a prioiri what scales are appropriate for describing the data. Hence, the only reasonable approach is to consider descriptions at all scales simultaneously.
From the scale-space representation, a large variety of image processing and computer vision operations can be expressed, such as feature detection, feature classification, image segmentation, image matching, motion estimation and computation of shape cues, based on (possibly non-linear) combinations of Gaussian derivatives at multiple scales
A highly useful property of scale-space representation is that image representations can be made invariant to scales, in order to handle size variations that arise from objects of different size or varying distances between the object and the camera. Scale invariance can be achieved by performing scale selection Lindeberg, Tony "Feature detection with automatic scale selection", International Journal of Computer Vision, 30, 2, pp 77--116, 1998.Lindeberg, Tony "Edge detection and ridge detection with automatic scale selection", International Journal of Computer Vision, 30, 2, pp 117--154, 1998. based on local maxima over scales of normalized derivatives
The notion of scale-space representation has also been frequently used for expressing coarse-to-fine methods in particular for tasks such as image matching and for multi-scale image segmentation. For technical details when implementing scale-space smoothing in practice, please see the article on scale-space implementation.
Pyramid representation is a predecessor to scale-space representation, constructed by simultaneously smoothing and subsampling a given signalBurt, Peter and Adelson, Ted, "The Laplacian Pyramid as a Compact Image Code", IEEE Trans. Communications, 9:4, 532--540, 1983.Crowley, Jim and Sanderson, A. C. "Multiple resolution representation and probabilistic matching of 2-D gray-scale shape", IEEE Transactions on Pattern Analysis and Machine Intelligence, 9(1), 113-121, 1987.. In this way, computationally highly efficient algorithms can be obtained. In a pyramid, however, it is ususally algorithmically harder to relate structures at different scales, due to the discrete nature of the scale levels. In a scale-space representation, the existence of a continuous scale parameter makes it conceptually much easier to express this so-called deep structure. For features defined as zero-crossings of differential invariants, the implicit function theorem directly defines trajectories across scales, and at those scales where bifurcations occur, the local behaviour can be modelled by singularity theory.
There are interesting relations between scale-space representation and biological vision. Neurophysiological studies have shown that there are receptive field profiles in the mammalian retina and visual cortex, which can be well modelled by linear or non-linear scale-space operatorsYoung, R. A. "The Gaussian derivative model for spatial vision: Retinal mechanisms", Spatial Vision, 2:273--293, 1987.DeAngelis, G. C., Ohzawa, I., and Freeman, R. D., "Receptive-field dynamics in the central visual pathways", Trends Neurosci. 18: 451-458, 1995..
Extensions of linear scale-space theory concern the formulation of non-linear scale-space concepts more committed to specific purposesRomeny, Bart (Ed), Geometry-Driven Diffusion in Computer Vision, Kluwer Academic Publishers, 1994.Weickert, J Anisotropic diffusion in image processing, Teuber Verlag, Stuttgart, 1998.. There are strong relations between scale-space theory and wavelet theory, although these two notions of multi-scale representation have been developed from somewhat different premises. There has also been work on other multi-scale approaches, such as pyramids and a variety of other kernels, that do not exploit or require the same requirements as true scale-space descriptions do.
This article is licensed under the GNU Free Documentation License.
It uses material from the
"Scale space".
Home Page • arts • business • computers • games • health • hospitals • home • kids & teens • news • physicians • recreation• reference • regional • science • shopping • society • sports • world