From Neuron to Cognition via Computational Neuroscience by Michael A. Arbib & James J. Bonaiuto

From Neuron to Cognition via Computational Neuroscience by Michael A. Arbib & James J. Bonaiuto

Author:Michael A. Arbib & James J. Bonaiuto
Language: eng
Format: epub
Publisher: The MIT Press
Published: 2016-10-11T16:00:00+00:00


where the {bi} are the basis functions (which are the same for all images and could include sinusoids and impulse functions) and the {αi} are the coefficients of the bases (which depend on the image). The number N of bases is much bigger than the dimension of the image, and hence the bases are overcomplete. This differs from Fourier analysis where the data (e.g., an image) are expressed in terms of a set of basis functions which are mutually orthogonal, which enables the coefficients α for each image to be estimated by αi = ∑x bi (x) ∙ I (x). Overcompleteness implies that there are many ways to represent the image in terms of these basis functions (by different choices of the α’s), and we need an additional criterion to select the α’s. The sparsity criterion proposes that we favor representations which make small, which penalize the weights of the basis functions and encourages most coefficients to be 0.

More precisely, we represent an image I by the approximation , where the are chosen to minimize the function

(12.12)

The first term penalizes the error of the approximation, and the second term, whose strength is weighted by a parameter λ, penalizes the coefficients {αi}. The solution cannot be specified in closed form (unlike the case for orthogonal basis functions), but E(α) is a convex function of α, and efficient algorithms exist for minimizing it to estimate . The results of these algorithms can, for example, decompose an image into a sum of sinusoids and a sum of impulse functions.

These ideas give an alternative way to think about the receptive fields of cells in V1. Firstly, observe that V1 has far more cells than the retina or the LGN, and so it has enough neural machinery to implement overcomplete bases. Secondly, overcomplete bases can be designed for specific image structures of interest (e.g., impulse functions or edges), which enables us to start interpreting the image instead of simply representing it. Thirdly, it relates to the observation that cells in V1 fire sparsely, which suggests (Barlow, 1961) that they are tuned to specific stimuli and may relate to metabolic processes (firing a neuron takes energy which needs to be replenished). Hence the idea that the visual cortex seeks to obtain sparse, and hence presumably more easily interpretable, representations has intuitive appeal.

How does this discussion of overcompleteness and sparsity relate to our previous description of V1 cells in terms of Gabor filters? Gabor filters have some of the properties that this approach requires. Families of Gabor filters are built by taking a basic function and performing transformations on it which give an overcomplete basis. Hence they do not specify a unique representation of an image (i.e., any image can be represented many different ways in terms of Gabor functions). These issues, and the relations of Gabors to wavelets, are discussed in more detail in Lee (1996).



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.