Results 1 - 10
of
13
Brightness Perception, Illusory Contours, and Corticogeniculate Feedback
, 1995
"... A neural network model is developed to explain how visual thalamocortical interactions give rise to boundary percepts such as illusory contours and surface percepts such as filled-in brightnesses. Top-down feedback interactions are needed in addition to bottom-up feed-forward interactions to simulat ..."
Abstract
-
Cited by 69 (39 self)
- Add to MetaCart
A neural network model is developed to explain how visual thalamocortical interactions give rise to boundary percepts such as illusory contours and surface percepts such as filled-in brightnesses. Top-down feedback interactions are needed in addition to bottom-up feed-forward interactions to simulate these data. One feedback loop is modeled between lateral geniculate nucleus (LGN) and cortical area VI, and another within cortical areas VI and V2. The first feedback loop realizes a matching process which enhances LGN cell activities that are consistent with those of active cortical cells, and suppresses LGN activities that are not. This corticogeniculate feedback, being endstopped and oriented, also enhances LGN ON cell activations at the ends of thin dark lines, thereby leading to enhanced cortical brightness percepts when the lines group into closed illusory contours. The second feedback loop generates boundary representations, including illusory contours, that coherently bind distributed cortical features together. Brightness percepts form within the surface representations through a diffusive filling-in process that is contained by resistive gating signals from the boundary representations. The model is used to simulate illusory contours and surface brightnesses induced by Ehrenstein disks, Kanizsa squares, Glass patterns, and cafe wall patterns in single contrast, reverse contrast, and mixed contrast configurations. These examples illustrate how boundary
Image segmentation based on oscillatory correlation
- Neural Computation
, 1997
"... We study image segmentation on the basis of locally excitatory globally inhibitory oscillator networks (LEGION), whereby the phases of oscillators encode the binding of pixels. We introduce a potential for each oscillator so that only those oscillators with strong connections from their neighborhood ..."
Abstract
-
Cited by 63 (18 self)
- Add to MetaCart
We study image segmentation on the basis of locally excitatory globally inhibitory oscillator networks (LEGION), whereby the phases of oscillators encode the binding of pixels. We introduce a potential for each oscillator so that only those oscillators with strong connections from their neighborhood can develop high potentials. Based on the concept of potential, a solution to remove noisy regions in an image is proposed for LEGION, so that it suppresses the oscillators corresponding to noisy regions, without affecting those corresponding to major regions. We show analytically that the resulting oscillator network separates an image into several major regions, plus a background consisting of all noisy regions, and illustrate network properties by computer simulation. The network exhibits a natural capacity in segmenting images. The oscillatory dynamics leads to a computer algorithm, which is applied successfully to segmenting real graylevel images. A number of issues regarding biological plausibility and perceptual organization are discussed. We argue that LEGION provides a novel and effective framework for image segmentation and figure-ground segregation. DeLiang Wang and David Terman Image Segmentation 1.
Fast learning VIEWNET architectures for recognizing 3D objects from multiple 2-D views.” Neural Networks
, 1995
"... Abstract--The recognition of three-dimensional ( 3-D) objects from sequences of their two-dimensional ( 2-D) views is modeled by a family of self-organizing neural architectures, called VIEWNET, that use View Information Encoded With NETworks. VIEWNET incorporates a preprocessor that generates a com ..."
Abstract
-
Cited by 46 (12 self)
- Add to MetaCart
Abstract--The recognition of three-dimensional ( 3-D) objects from sequences of their two-dimensional ( 2-D) views is modeled by a family of self-organizing neural architectures, called VIEWNET, that use View Information Encoded With NETworks. VIEWNET incorporates a preprocessor that generates a compressed but 2-D invariant representation of an image, a supervised incremental learning system that classifies the preprocessed representations into 2-1) view categories whose outputs are combined into 3-D invariant object categories, and a working memory that makes a 3-D object prediction by accumulating evidence from 3-D object category nodes us multiple 2-D views are experienced. The simplest VIEWNET achieves high recognition scores without the need to explicitly code the temporal order of 2-D views in working memory. Working memories are also discussed that save memory resources by implicitly coding temporal order in terms of the relative activity of 2-D view category nodes, rather than as explicit 2-D view transitions. Variants of the VIEWNET architecture may be used for scene understanding by using a preprocessor and classifier that can determine both what objects are in a scene and where they are located. The present VIEWNET preprocessor includes the CORT-X 2 filter, which discounts the illuminant, regularizes and completes figural boundaries, and suppresses image noise. This boundary segmentation is rendered invariant under 2-D translation, rotation, and dilation by use of a log-polar transform. The invariant spectra undergo Gaassian coarse coding to further reduce noise and 3-D foreshortening effects, and to increase generalization. These compressed codes are input into the
Synthetic Aperture Radar Processing by a Multiple Scale Neural System for Boundary and Surface Representation
, 1994
"... A neural network model of boundary segmentation and surface representation is developed to process images containing range data gathered by a synthetic aperture radar (SAR) sensor. The boundary and surface processing are accomplished by an improved Boundary Contour System (BCS) and Feature Contour S ..."
Abstract
-
Cited by 35 (16 self)
- Add to MetaCart
A neural network model of boundary segmentation and surface representation is developed to process images containing range data gathered by a synthetic aperture radar (SAR) sensor. The boundary and surface processing are accomplished by an improved Boundary Contour System (BCS) and Feature Contour System (FCS), respectively, that have been derived from analyses of perceptual and neurobiological data. BCS/FCS processing makes structures such as motor vehicles, roads, and buildings more salient and interpretable to human observers than they are in the original imagery. Early processing by ON cells and OFF cells embedded in shunting centersurround network models preprocessing by lateral geniculate nucleus (LGN). Such preprocessing compensates for illumination gradients, normalizes input dynamic range, and extracts local ratio contrasts. ON cell and OFF cell outputs are combined in the BCS to define oriented filters that model cortical simple cells. Pooling ON and OFF outputs at simple cel...
A neural network for enhancing boundaries and surfaces in synthetic aperture radar images
- NEURAL NETWORKS
, 1999
"... ..."
Neural dynamics of binocular brightness perception
- Vision Research
, 1999
"... ¶ The authors wish to thank C. Bourassa for providing the data from his ganzfeld experiments. *Acknowledgments: The author wishes to thank Diana Meyers for her valuable assistance in the prepara-tion of this manuscript. i-1 How does the visual cortex combine information from both eyes to generate pe ..."
Abstract
-
Cited by 13 (13 self)
- Add to MetaCart
¶ The authors wish to thank C. Bourassa for providing the data from his ganzfeld experiments. *Acknowledgments: The author wishes to thank Diana Meyers for her valuable assistance in the prepara-tion of this manuscript. i-1 How does the visual cortex combine information from both eyes to generate perceptual representa-tions of object surfaces? Important clues about this process may be derived from data about the perceived brightnesses of surface regions under binocular viewing conditions, including data about binocular bright-ness summation in response to ganzfelds, the U-shaped data of Fechner’s Paradox that violates binocular brightness summation, and the effects of different combinations of monocular and binocular contours and surface luminance differences on threshold sensitivity to monocular flashes of light. How to reconcile these apparently contradictory data properties has been a severe challenge to previous models, and none has explained them all. The present article quantitatively simulates them all by further developing the FACADE vision model. Key model processes discount the illuminant and compute image contrasts in each monocular channel using shunting on-center off-surround networks; binocularly fuse these discounted
Scene analysis by integrating primitive segmentation and associative memory
- IEEE Transactions on Systems, Man, and Cybernetics Part B
, 2002
"... Abstract—Scene analysis is a major aspect of perception and continues to challenge machine perception. This paper addresses the scene-analysis problem by integrating a primitive segmentation stage with a model of associative memory. Our model is a multistage system that consists of an initial primit ..."
Abstract
-
Cited by 7 (2 self)
- Add to MetaCart
Abstract—Scene analysis is a major aspect of perception and continues to challenge machine perception. This paper addresses the scene-analysis problem by integrating a primitive segmentation stage with a model of associative memory. Our model is a multistage system that consists of an initial primitive segmentation stage, a multimodule associative memory, and a short-term memory (STM) layer. Primitive segmentation is performed by locally excitatory globally inhibitory oscillator network (LEGION), which segments the input scene into multiple parts that correspond to groups of synchronous oscillations. Each segment triggers memory recall and multiple recalled patterns then interact with one another in the STM layer. The STM layer projects to the LEGION network, giving rise to memory-based grouping and segmentation. The system achieves scene analysis entirely in phase space, which provides a unifying mechanism for both bottom-up analysis and top-down analysis. The model is evaluated with a systematic set of three-dimensional (3-D) line drawing objects, which are arranged in an arbitrary fashion to compose input scenes that allow object occlusion. Memory-based organization is responsible for a significant improvement in performance. A number of issues are discussed, including input-anchored alignment, top-down organization, and the role of STM in producing context sensitivity of memory recall. Index Terms—Associative memory, grouping, integration, locally excitatory globally inhibitory oscillator network (LEGION), scene analysis, segmentation, short-term memory (STM). I.
Perceiving without Learning: from Spirals to Inside/Outside Relations
- in Advances in Neural Information Processing Systems
, 1997
"... As a benchmark task, the spiral problem is well known in neural networks. Unlike previous work that emphasizes learning, we approach the problem from a generic perspective that does not involve learning. We point out that the spiral problem is intrinsically connected to the inside /outside problem. ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
As a benchmark task, the spiral problem is well known in neural networks. Unlike previous work that emphasizes learning, we approach the problem from a generic perspective that does not involve learning. We point out that the spiral problem is intrinsically connected to the inside /outside problem. A generic solution to both problems is proposed based on oscillatory correlation using a time delay network. Our simulation results are qualitatively consistent with human performance, and we interpret human limitations in terms of synchrony and time delays, both biologically plausible. As a special case, our network without time delays can always distinguish these figures regardless of shape, position, size, and orientation. We conjecture that visual perception will be effortful if local activation cannot be rapidly propagated, as synchrony would not be established in the presence of time delays. 1 INTRODUCTION The spiral problem refers to distinguishing between a connected single spiral a...
Viewnet Architectures For Invariant 3-D Object Learning And Recognition From Multiple 2-D Views
"... 3 The recognition of 3-D objects from sequences of their 2-D views is modeled by a family of self-organizing neural architectures, called VIEWNET, that use View Information Encoded With NETworks. VIEWNET incorporates a preprocessor that generates a compressed but 2-D invariant representation of an i ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
3 The recognition of 3-D objects from sequences of their 2-D views is modeled by a family of self-organizing neural architectures, called VIEWNET, that use View Information Encoded With NETworks. VIEWNET incorporates a preprocessor that generates a compressed but 2-D invariant representation of an image, a supervised incremental learning system (Fuzzy ARTMAP) that classifies the preprocessed representations into 2-D view categories whose outputs are combined into 3-D invariant object categories, and a working memory that makes a 3-D object prediction by accumulating evidence over time from 3-D object category nodes as multiple 2-D views are experienced. VIEWNET was benchmarked on an MIT Lincoln Laboratory database of 128x128 2-D views of aircraft, including small frontal views, with and without additive noise. A recognition rate of up to 90% is achieved with one 2-D view and of up to 98.5% correct with three 2-D views. The properties of 2-D view and 3-D object category nodes are compar...
Perceiving Geometric Patterns: From Spirals to Inside-Outside Relations
- IEEE Trans. Neural Netw
, 2001
"... Since first proposed by Minsky and Papert, the spiral problem is well known in neural networks. It receives much attention as a benchmark for various learning algorithms. Unlike previous work that emphasizes learning, we approach the problem from a different perspective. We point out that the spiral ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
Since first proposed by Minsky and Papert, the spiral problem is well known in neural networks. It receives much attention as a benchmark for various learning algorithms. Unlike previous work that emphasizes learning, we approach the problem from a different perspective. We point out that the spiral problem is intrinsically connected to the inside--outside problem proposed by Ullman. We propose a solution to both problems based on oscillatory correlation using a time-delay network. Our simulation results are qualitatively consistent with human performance, and we interpret human limitations in terms of synchrony and time delays. As a special case, our network without time delays can always distinguish these figures regardless of shape, position, size, and orientation. Index Terms---Desynchronization, geometric patterns, inside-- outside relations, LEGION, oscillatory correlation, spiral problem, synchronization, time delays, visual perception. I.

