Abstract:
Abstract—Natural images contain an overwhelming number of visual patterns generated by diverse stochastic processes. Defining and modeling these patterns is of fundamental importance for generic vision tasks, such as perceptual organization, segmentation, and recognition. The objective of this epistemological paper is to summarize various threads of research in the literature and to pursue a unified framework for conceptualization, modeling, learning, and computing visual patterns. This paper starts with reviewing four research streams: 1) the study of image statistics, 2) the analysis of image components, 3) the grouping of image elements, and 4) the modeling of visual patterns. The models from these research streams are then divided into four categories according to their semantic structures: 1) descriptive models, i.e., Markov random fields (MRF) or Gibbs, 2) variants of descriptive models (causal MRF and “pseudodescriptive” models), 3) generative models, and 4) discriminative models. The objectives, principles, theories, and typical models are reviewed in each category and the relationships between the four types of models are studied. Two central themes emerge from the relationship studies. 1) In representation, the integration of descriptive and generative models is the future direction for statistical modeling and should lead to richer and more advanced classes of vision models. 2) To make visual models computationally tractable, discriminative models are used as computational heuristics for inferring generative models. Thus, the roles of four types of models are clarified. The paper also addresses the issue of conceptualizing visual patterns and their components (vocabularies) from the perspective of statistical mechanics. Under this unified framework, a visual pattern is equalized to a statistical ensemble, and, furthermore, statistical models for various visual patterns form a “continuous ” spectrum in the sense that they belong to a series of nested probability families in the space of attributed graphs.
Citations
|
2739
|
A mathematical theory of communication
– Shannon
- 1948
|
|
2322
|
Stochastic relaxation, Gibbs distributions and the Bayesian restoration of images
– Geman, Geman
- 1984
|
|
1943
|
Snakes: Active contour models
– Kass, Witkin, et al.
- 1987
|
|
1533
|
A theory for multiresolution signal decomposition: the wavelet representation
– Mallat
- 1989
|
|
615
|
Spatial interaction and the statistical analysis of lattice systems (with discussion
– Besag
- 1974
|
|
536
|
Visual Reconstruction
– Blake, Zisserman
- 1987
|
|
433
|
Optimal approximations by piecewise smooth functions and associated variational problems
– Mumford, Shah
- 1989
|
|
361
|
Texture Synthesis by Non-parametric Sampling
– Efros, Leung
- 1999
|
|
358
|
Perceptual Organization and Visual Recognition
– Lowe
- 1985
|
|
348
|
Relations between the statistics of natural images and the response properties of cortical cells
– Field
- 1987
|
|
336
|
Inducing Features of Random Fields
– Pietra, Pietra, et al.
- 1995
|
|
306
|
Shiftable multiscale transforms
– Simoncelli, Adelson, et al.
- 1992
|
|
279
|
Information theory and statistical mechanics
– Jaynes
- 1957
|
|
269
|
What is the Goal of Sensory Coding
– Field
- 1999
|
|
257
|
Spectral Analysis and Time Series
– Priestley
- 1981
|
|
247
|
Sparse coding with an overcomplete basis set: a strategy employed by v1
– Olshausen, Field
- 1997
|
|
235
|
Image Quilting for Texture Synthesis and Transfer
– EFROS, FREEMAN
|
|
223
|
Some information aspects of visual perception
– Attneave
- 1954
|
|
197
|
Markov random field texture models
– Cross, Jain
- 1983
|
|
188
|
The Perception of the Visual World
– Gibson
- 1950
|
|
180
|
Possible principles underlying the transformation of sensory messages
– Barlow
- 1961
|
|
153
|
Principles of Gestalt Psychology
– Koffka
- 1935
|
|
145
|
Ondelettes et Opérateurs
– Meyer
- 1990
|
|
145
|
Stochastic geometry and its applications
– Stoyan, Kendall, et al.
- 1995
|
|
135
|
Textons, the elements of texture perception and their interactions
– Julesz
- 1981
|
|
134
|
Minimax entropy principle and its application to texture modeling
– Zhu, Wu, et al.
- 1997
|
|
117
|
Inferring global perceptual contours from local features
– Guy, Medioni
- 1993
|
|
114
|
HANDS. A Pattern Theoretical Study of Biological Shapes
– Grenander, Chow, et al.
- 1991
|
|
107
|
What does the retina know about natural scenes?”, Neural Computation
– Atick, Redlich
- 1992
|
|
105
|
Stochastic completion fields: a neural model of illusory contour shape and salience
– Williams, Jacobs
- 1995
|
|
100
|
Deformable templates for face recognition
– Yuille
- 1991
|
|
98
|
Rates of convergence of the Hastings and Metropolis algorithms
– Mengersen, Tweedie
- 1996
|
|
93
|
Natural image statistics and neural representation”. Annual Review of Neuroscience
– Simoncelli, Olshausen
- 2001
|
|
88
|
Prior learning and Gibbs reaction-diffusion
– Zhu, Mumford
- 1997
|
|
87
|
Image segmentation by data driven markov chain monte carlo
– Tu, Zhu, et al.
- 2001
|
|
77
|
Data compression and harmonic analysis
– Donoho, DeVore, et al.
- 1998
|
|
73
|
Entropy based algorithms for best basis selection
– Coifman, Wickerhauser
- 1992
|
|
70
|
Organization in Vision
– Kanizsa
- 1979
|
|
68
|
Independent component analysis of natural image sequences yields spatio-temporal filters similar to simple cells in primary visual cortex
– Hateren, Ruderman
- 1998
|
|
65
|
Real-Time Texture Synthesis by Patch-Based Sampling
– Liang, Liu, et al.
- 2001
|
|
63
|
Introduction to Modern Statistical Mechanics
– Chandler
- 1987
|
|
59
|
From volumes to views: An approach to 3-D object recognition
– Dickinson, Pentland, et al.
- 1991
|
|
56
|
Multilevel computational processes for visual surface reconstruction
– Terzopoulos
- 1983
|
|
55
|
A non-parametric multi-scale statistical model for natural images
– Bonet, Viola
- 1997
|
|
55
|
CCCP algorithms to minimize the Bethe and Kikuchi free energies: Convergent alternatives to belief propagation
– Yuille
|
|
54
|
Ridgelets: a key to higher-dimensional intermittency
– Candès, Donoho
- 1999
|
|
54
|
Elementary Principles of Statistical Mechanics
– Gibbs
- 1902
|
|
52
|
Novel cluster-based probability model for texture synthesis, classification, and compression
– Popat, Picard
- 1993
|
|
41
|
Origins of scaling in natural images
– Ruderman
- 1997
|
|
40
|
Stochastic models for generic images
– Gidas, Mumford
- 2001
|