Explicit Object Representation by Sparse Neural Codes

Citation

Waydo, Stephen J. (2008) Explicit Object Representation by Sparse Neural Codes. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/1XY7-2H19. https://resolver.caltech.edu/CaltechETD:etd-11022007-104734

Abstract

Neurons have been identified in the human medial temporal lobe (MTL) that display a strong selectivity for only a few stimuli (such as familiar individuals or landmark buildings) out of perhaps 100 presented to the test subject. While highly selective for a particular object or category, these cells are remarkably insensitive to different presentations (i.e., different poses and views) of their preferred stimulus. This invariant, sparse, and explicit representation of the world may be crucial to the transformation of complex visual stimuli into more abstract memories. In this thesis I first discuss the issue of how best to quantify sparseness, particularly in very sparse systems where biases are significant, and show the results of this analysis applied to human MTL data. I also provide an overview of existing results from other investigators on measuring sparseness both elsewhere along the primate visual pathway and in selected other sensory processing systems. From there I move into the computational realm. Sparse coding as a computational constraint applied to the representation of natural images has been shown to produce receptive fields strikingly similar to those observed in mammalian primary visual cortex. I apply sparse coding as a model for processing further along the visual hierarchy: not directly to images but rather to an invariant feature-based representation of images analogous to that found in the inferotemporal cortex. This combination of sparseness and invariance naturally leads to explicit category representation. That is, by exposing the model to different images drawn from different categories, units develop that respond selectively to different categories. After extending an existing model of sparse coding and providing some mathematical analysis of its operation, I show results obtained by applying this method both to unsupervised category discovery in images and to differentiation between images of different individuals.

Item Type:	Thesis (Dissertation (Ph.D.))
Subject Keywords:	Neural coding; representation; sparseness; unsupervised learning; vision
Degree Grantor:	California Institute of Technology
Division:	Engineering and Applied Science
Major Option:	Computation and Neural Systems
Thesis Availability:	Public (worldwide access)
Research Advisor(s):	Murray, Richard M. (advisor) Koch, Christof (co-advisor)
Thesis Committee:	Koch, Christof (chair) Murray, Richard M. (co-chair) Olshausen, Bruno Marsden, Jerrold E. Perona, Pietro
Defense Date:	21 September 2007
Record Number:	CaltechETD:etd-11022007-104734
Persistent URL:	https://resolver.caltech.edu/CaltechETD:etd-11022007-104734
DOI:	10.7907/1XY7-2H19
Default Usage Policy:	No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:	4374
Collection:	CaltechTHESIS
Deposited By:	Imported from ETD-db
Deposited On:	05 Dec 2007
Last Modified:	30 Aug 2022 22:56

Thesis Files

Preview

PDF (waydo-thesis-final-oneside.pdf) - Final Version
See Usage Policy.
5MB

Preview

PDF (waydo-thesis-final-twoside.pdf) - Final Version
See Usage Policy.
5MB

Repository Staff Only: item control page