Menu
Home Explore People Places Arts History Plants & Animals Science Life & Culture Technology
On this page
Scale-space axioms

In image processing and computer vision, a scale space framework can be used to represent an image as a family of gradually smoothed images. This framework is very general and a variety of scale space representations exist. A typical approach for choosing a particular type of scale space representation is to establish a set of scale-space axioms, describing basic properties of the desired scale-space representation and often chosen so as to make the representation useful in practical applications. Once established, the axioms narrow the possible scale-space representations to a smaller class, typically with only a few free parameters.

A set of standard scale space axioms, discussed below, leads to the linear Gaussian scale-space, which is the most common type of scale space used in image processing and computer vision.

We don't have any images related to Scale-space axioms yet.
We don't have any YouTube videos related to Scale-space axioms yet.
We don't have any PDF documents related to Scale-space axioms yet.
We don't have any Books related to Scale-space axioms yet.
We don't have any archived web articles related to Scale-space axioms yet.

Scale space axioms for the linear scale-space representation

The linear scale space representation L ( x , y , t ) = ( T t f ) ( x , y ) = g ( x , y , t ) ∗ f ( x , y ) {\displaystyle L(x,y,t)=(T_{t}f)(x,y)=g(x,y,t)*f(x,y)} of signal f ( x , y ) {\displaystyle f(x,y)} obtained by smoothing with the Gaussian kernel g ( x , y , t ) {\displaystyle g(x,y,t)} satisfies a number of properties 'scale-space axioms' that make it a special form of multi-scale representation:

linearity T t ( a f + b h ) = a T t f + b T t h {\displaystyle T_{t}(af+bh)=aT_{t}f+bT_{t}h} where f {\displaystyle f} and h {\displaystyle h} are signals while a {\displaystyle a} and b {\displaystyle b} are constants, shift invariance T t S ( Δ x , Δ y ) f = S ( Δ x , Δ y ) T t f {\displaystyle T_{t}S_{(\Delta x,\Delta _{y})}f=S_{(\Delta x,\Delta _{y})}T_{t}f} where S ( Δ x , Δ y ) {\displaystyle S_{(\Delta x,\Delta _{y})}} denotes the shift (translation) operator ( S ( Δ x , Δ y ) f ) ( x , y ) = f ( x − Δ x , y − Δ y ) {\displaystyle (S_{(\Delta x,\Delta _{y})}f)(x,y)=f(x-\Delta x,y-\Delta y)} semi-group structure g ( x , y , t 1 ) ∗ g ( x , y , t 2 ) = g ( x , y , t 1 + t 2 ) {\displaystyle g(x,y,t_{1})*g(x,y,t_{2})=g(x,y,t_{1}+t_{2})} with the associated cascade smoothing property L ( x , y , t 2 ) = g ( x , y , t 2 − t 1 ) ∗ L ( x , y , t 1 ) {\displaystyle L(x,y,t_{2})=g(x,y,t_{2}-t_{1})*L(x,y,t_{1})} existence of an infinitesimal generator A {\displaystyle A} ∂ t L ( x , y , t ) = ( A L ) ( x , y , t ) {\displaystyle \partial _{t}L(x,y,t)=(AL)(x,y,t)} non-creation of local extrema (zero-crossings) in one dimension, non-enhancement of local extrema in any number of dimensions ∂ t L ( x , y , t ) ≤ 0 {\displaystyle \partial _{t}L(x,y,t)\leq 0} at spatial maxima and ∂ t L ( x , y , t ) ≥ 0 {\displaystyle \partial _{t}L(x,y,t)\geq 0} at spatial minima, rotational symmetry g ( x , y , t ) = h ( x 2 + y 2 , t ) {\displaystyle g(x,y,t)=h(x^{2}+y^{2},t)} for some function h {\displaystyle h} , scale invariance g ^ ( ω x , ω y , t ) = h ^ ( ω x φ ( t ) , ω x φ ( t ) ) {\displaystyle {\hat {g}}(\omega _{x},\omega _{y},t)={\hat {h}}({\frac {\omega _{x}}{\varphi (t)}},{\frac {\omega _{x}}{\varphi (t)}})} for some functions φ {\displaystyle \varphi } and h ^ {\displaystyle {\hat {h}}} where g ^ {\displaystyle {\hat {g}}} denotes the Fourier transform of g {\displaystyle g} , positivity g ( x , y , t ) ≥ 0 {\displaystyle g(x,y,t)\geq 0} , normalization ∫ x = − ∞ ∞ ∫ y = − ∞ ∞ g ( x , y , t ) d x d y = 1 {\displaystyle \int _{x=-\infty }^{\infty }\int _{y=-\infty }^{\infty }g(x,y,t)\,dx\,dy=1} .

In fact, it can be shown that the Gaussian kernel is a unique choice given several different combinations of subsets of these scale-space axioms:1234567891011 most of the axioms (linearity, shift-invariance, semigroup) correspond to scaling being a semigroup of shift-invariant linear operator, which is satisfied by a number of families integral transforms, while "non-creation of local extrema"12 for one-dimensional signals or "non-enhancement of local extrema"131415 for higher-dimensional signals are the crucial axioms which relate scale-spaces to smoothing (formally, parabolic partial differential equations), and hence select for the Gaussian.

The Gaussian kernel is also separable in Cartesian coordinates, i.e. g ( x , y , t ) = g ( x , t ) g ( y , t ) {\displaystyle g(x,y,t)=g(x,t)\,g(y,t)} . Separability is, however, not counted as a scale-space axiom, since it is a coordinate dependent property related to issues of implementation. In addition, the requirement of separability in combination with rotational symmetry per se fixates the smoothing kernel to be a Gaussian.

There exists a generalization of the Gaussian scale-space theory to more general affine and spatio-temporal scale-spaces.1617 In addition to variabilities over scale, which original scale-space theory was designed to handle, this generalized scale-space theory also comprises other types of variabilities, including image deformations caused by viewing variations, approximated by local affine transformations, and relative motions between objects in the world and the observer, approximated by local Galilean transformations. In this theory, rotational symmetry is not imposed as a necessary scale-space axiom and is instead replaced by requirements of affine and/or Galilean covariance. The generalized scale-space theory leads to predictions about receptive field profiles in good qualitative agreement with receptive field profiles measured by cell recordings in biological vision.181920

In the computer vision, image processing and signal processing literature there are many other multi-scale approaches, using wavelets and a variety of other kernels, that do not exploit or require the same requirements as scale space descriptions do; please see the article on related multi-scale approaches. There has also been work on discrete scale-space concepts that carry the scale-space properties over to the discrete domain; see the article on scale space implementation for examples and references.

See also

References

  1. Koenderink, Jan J. (August 1984). "The structure of images". Biological Cybernetics. 50 (5): 363–370. doi:10.1007/bf00336961. PMID 6477978. S2CID 206775432. /wiki/Doi_(identifier)

  2. Babaud, Jean; Witkin, Andrew P.; Baudin, Michel; Duda, Richard O. (1986). "Uniqueness of the Gaussian Kernel for Scale-Space Filtering". IEEE Transactions on Pattern Analysis and Machine Intelligence. 8 (1): 26–33. doi:10.1109/TPAMI.1986.4767749. PMID 21869320. S2CID 18295906. http://portal.acm.org/citation.cfm?id=11298&dl=GUIDE&coll=GUIDE

  3. Yuille, Alan L.; Poggio, Tomaso A. (1986). "Scaling Theorems for Zero Crossings". IEEE Transactions on Pattern Analysis and Machine Intelligence. 8 (1): 15–25. doi:10.1109/TPAMI.1986.4767748. hdl:1721.1/5655. PMID 21869319. S2CID 14815630. http://portal.acm.org/citation.cfm?id=11297&dl=ACM&coll=ACM

  4. Lindeberg, T. (1990). "Scale-space for discrete signals". IEEE Transactions on Pattern Analysis and Machine Intelligence. 12 (3): 234–254. doi:10.1109/34.49051. http://kth.diva-portal.org/smash/record.jsf?pid=diva2%3A472968&dswid=-7386

  5. Lindeberg, Tony, Scale-Space Theory in Computer Vision, Kluwer, 1994, http://www.csc.kth.se/~tony/book.html

  6. Pauwels, E.J.; Van Gool, L.J.; Fiddelaers, P.; Moons, T. (1995). "An extended class of scale-invariant and recursive scale space filters". IEEE Transactions on Pattern Analysis and Machine Intelligence. 17 (7): 691–701. doi:10.1109/34.391411. http://portal.acm.org/citation.cfm?coll=GUIDE&dl=GUIDE&id=628701

  7. Lindeberg, Tony (May 1996). "On the axiomatic foundations of linear scale-space: Combining semi-group structure with causality vs. scale invariance". In Sporring, J.; et al. (eds.). Gaussian Scale-Space Theory: Proc. PhD School on Scale-Space Theory. Copenhagen, Denmark: Kluwer Academic Publishers. pp. 75–98. urn:nbn:se:kth:diva-40221. http://kth.diva-portal.org/smash/record.jsf?pid=diva2%3A456533&dswid=8345

  8. Florack, Luc, Image Structure, Kluwer Academic Publishers, 1997.

  9. Weickert, Joachim; Ishikawa, Seiji; Imiya, Atsushi (1999). "Linear Scale-Space has First been Proposed in Japan". Journal of Mathematical Imaging and Vision. 10 (3): 237–252. Bibcode:1999JMIV...10..237W. doi:10.1023/A:1008344623873. S2CID 17835046. http://portal.acm.org/citation.cfm?id=607668&dl=ACM&coll=ACM

  10. Lindeberg, Tony (2011). "Generalized Gaussian Scale-Space Axiomatics Comprising Linear Scale-Space, Affine Scale-Space and Spatio-Temporal Scale-Space". Journal of Mathematical Imaging and Vision. 40: 36–81. Bibcode:2011JMIV...40...36L. doi:10.1007/s10851-010-0242-2. S2CID 950099. http://kth.diva-portal.org/smash/record.jsf?pid=diva2%3A440633&dswid=3892

  11. Lindeberg, Tony (2013). Generalized Axiomatic Scale-Space Theory. Advances in Imaging and Electron Physics. Vol. 178. pp. 1–96. doi:10.1016/B978-0-12-407701-0.00001-7. ISBN 9780124077010. 9780124077010

  12. Lindeberg, T. (1990). "Scale-space for discrete signals". IEEE Transactions on Pattern Analysis and Machine Intelligence. 12 (3): 234–254. doi:10.1109/34.49051. http://kth.diva-portal.org/smash/record.jsf?pid=diva2%3A472968&dswid=-7386

  13. Lindeberg, T. (1990). "Scale-space for discrete signals". IEEE Transactions on Pattern Analysis and Machine Intelligence. 12 (3): 234–254. doi:10.1109/34.49051. http://kth.diva-portal.org/smash/record.jsf?pid=diva2%3A472968&dswid=-7386

  14. Lindeberg, Tony (May 1996). "On the axiomatic foundations of linear scale-space: Combining semi-group structure with causality vs. scale invariance". In Sporring, J.; et al. (eds.). Gaussian Scale-Space Theory: Proc. PhD School on Scale-Space Theory. Copenhagen, Denmark: Kluwer Academic Publishers. pp. 75–98. urn:nbn:se:kth:diva-40221. http://kth.diva-portal.org/smash/record.jsf?pid=diva2%3A456533&dswid=8345

  15. Lindeberg, Tony (2011). "Generalized Gaussian Scale-Space Axiomatics Comprising Linear Scale-Space, Affine Scale-Space and Spatio-Temporal Scale-Space". Journal of Mathematical Imaging and Vision. 40: 36–81. Bibcode:2011JMIV...40...36L. doi:10.1007/s10851-010-0242-2. S2CID 950099. http://kth.diva-portal.org/smash/record.jsf?pid=diva2%3A440633&dswid=3892

  16. Lindeberg, Tony (2011). "Generalized Gaussian Scale-Space Axiomatics Comprising Linear Scale-Space, Affine Scale-Space and Spatio-Temporal Scale-Space". Journal of Mathematical Imaging and Vision. 40: 36–81. Bibcode:2011JMIV...40...36L. doi:10.1007/s10851-010-0242-2. S2CID 950099. http://kth.diva-portal.org/smash/record.jsf?pid=diva2%3A440633&dswid=3892

  17. Lindeberg, Tony (2013). Generalized Axiomatic Scale-Space Theory. Advances in Imaging and Electron Physics. Vol. 178. pp. 1–96. doi:10.1016/B978-0-12-407701-0.00001-7. ISBN 9780124077010. 9780124077010

  18. Lindeberg, Tony (2013). "A computational theory of visual receptive fields". Biological Cybernetics. 107 (6): 589–635. doi:10.1007/s00422-013-0569-z. PMC 3840297. PMID 24197240. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3840297

  19. Lindeberg, Tony (2013). "Invariance of visual operations at the level of receptive fields". PLOS ONE. 8 (7): e66990. arXiv:1210.0754. Bibcode:2013PLoSO...866990L. doi:10.1371/journal.pone.0066990. PMC 3716821. PMID 23894283. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3716821

  20. Lindeberg, Tony (2021). "Normative theory of visual receptive fields". Heliyon. 7 (1): e05897. Bibcode:2021Heliy...705897L. doi:10.1016/j.heliyon.2021.e05897. PMC 7820928. PMID 33521348. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7820928