Characterization of the spatial-frequency spectrum in the perception of shape from texture. 1995

K Sakai, and L H Finkel
Department of Bioengineering, University of Pennsylvania, Philadelphia 19104-6392, USA.

The major cue to shape from texture is the compression of texture as a function of surface curvature [J. Exp. Psychol. 13, 242 (1987); Vision Res. 33, 827 (1993)]. A number of computational models have been proposed in which compression is measured by detection of changes in the spatial-frequency spectrum [Comput. Graphics Image Process. 5, (1976)]. We propose that the visual system uses a strategy of characterizing the frequency spectrum by a simple set of measures and of tracking the changes in this characterization rather than determining changes in the shape of the actual spectra. Our evidence is based on a number of psychophysical demonstrations that use stimuli with specifically tailored frequency spectra, constructed from white noise filtered in the frequency domain. Our evidence suggests that the visual system determines the average peak frequency of the spectrum and uses this measure as its characterization. Changes in fp are strongly correlated with the degree of surface curvature, and, over a range of stimuli, fp takes account of the variance in local estimates of the frequency spectrum. One computes fp by determining the peak frequency at each spatial location and then averaging these frequency values over a local spatial region. We show that fp is related to the second-order moment but is more biologically plausible and shows superior ability to function in the presence of noise. As a test of this model, we have constructed a neural network architecture for computing shape from texture. Our model is limited to orthographically projected, homogeneous textures without in-surface rotation. The early stages of the model consist of multiple simple-cell units tuned to different orientations and spatial frequencies. We show that these simple cells are inadequate for the determination of compression but that the outputs of complex-cell-like units after normalization generate estimates of surface slant and tilt. The network shows qualitative agreement with human perception of shape from texture over a wide range of real and artificial stimuli.

UI MeSH Term Description Entries
D008027 Light That portion of the electromagnetic spectrum in the visible, ultraviolet, and infrared range. Light, Visible,Photoradiation,Radiation, Visible,Visible Radiation,Photoradiations,Radiations, Visible,Visible Light,Visible Radiations
D008433 Mathematics The deductive study of shape, quantity, and dependence. (From McGraw-Hill Dictionary of Scientific and Technical Terms, 6th ed) Mathematic
D008954 Models, Biological Theoretical representations that simulate the behavior or activity of biological processes or diseases. For disease models in living animals, DISEASE MODELS, ANIMAL is available. Biological models include the use of mathematical equations, computers, and other electronic equipment. Biological Model,Biological Models,Model, Biological,Models, Biologic,Biologic Model,Biologic Models,Model, Biologic
D011601 Psychophysics The science dealing with the correlation of the physical characteristics of a stimulus, e.g., frequency or intensity, with the response to the stimulus, in order to assess the psychologic factors involved in the relationship. Psychophysic
D005556 Form Perception The sensory discrimination of a pattern, shape, or outline. Contour Perception,Contour Perceptions,Form Perceptions,Perception, Contour,Perception, Form,Perceptions, Contour,Perceptions, Form
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D013028 Space Perception The awareness of the spatial properties of objects; includes physical space. Perception, Space,Perceptions, Space,Space Perceptions
D016571 Neural Networks, Computer A computer architecture, implementable in either hardware or software, modeled after biological neural networks. Like the biological system in which the processing capability is a result of the interconnection strengths between arrays of nonlinear processing nodes, computerized neural networks, often called perceptrons or multilayer connectionist models, consist of neuron-like units. A homogeneous group of units makes up a layer. These networks are good at pattern recognition. They are adaptive, performing tasks by example, and thus are better for decision-making than are linear learning machines or cluster analysis. They do not require explicit programming. Computational Neural Networks,Connectionist Models,Models, Neural Network,Neural Network Models,Neural Networks (Computer),Perceptrons,Computational Neural Network,Computer Neural Network,Computer Neural Networks,Connectionist Model,Model, Connectionist,Model, Neural Network,Models, Connectionist,Network Model, Neural,Network Models, Neural,Network, Computational Neural,Network, Computer Neural,Network, Neural (Computer),Networks, Computational Neural,Networks, Computer Neural,Networks, Neural (Computer),Neural Network (Computer),Neural Network Model,Neural Network, Computational,Neural Network, Computer,Neural Networks, Computational,Perceptron

Related Publications

K Sakai, and L H Finkel
May 2001, Vision research,
K Sakai, and L H Finkel
May 2010, Journal of vision,
K Sakai, and L H Finkel
January 1988, Biological cybernetics,
K Sakai, and L H Finkel
January 1978, Medical & biological engineering & computing,
K Sakai, and L H Finkel
July 2014, Neuropsychologia,
K Sakai, and L H Finkel
May 1988, Nature,
K Sakai, and L H Finkel
October 1982, Perceptual and motor skills,
K Sakai, and L H Finkel
February 1984, IEEE transactions on pattern analysis and machine intelligence,
Copied contents to your clipboard!