CaltechTHESIS
  A Caltech Library Service

Object recognition in compressed imagery

Citation

Daniell, Cynthia Evors (2000) Object recognition in compressed imagery. Dissertation (Ph.D.), California Institute of Technology. http://resolver.caltech.edu/CaltechTHESIS:10052010-135012445

Abstract

It is often necessary to search for objects in large databases of compressed imagery. In the past, object recognition and image compression have generally been treated as separate problems, resulting in inefficient suboptimal performance. Moreover, computational and storage issues make it fundamentally prohibitive to uncompress large images prior to object recognition. We provide two complementary solutions to the problem of object recognition in compressed imagery, each of which integrates subband and correlation filtering in a unique manner. One key benefit of correlation filters is that, as linear systems, they are highly compatible with the subband filtering process. This enables us to provide a seamless operation in which object recognition and data compression are viewed as continuations of the same process. The public MSTAR data set illustrates our results on a three class problem of 79 Synthetic Aperture Radar images at one foot resolution. Our general framework, the Pattern Recognition Subband Coder (PRSC), provides simultaneous synthesis and recognition at full resolution in a computationally efficient architecture. Its parallelism enables a result 1.6 times faster, in the limit, than correlation on uncompressed imagery. Furthermore, by jointly optimizing the synthesis and recognition filters, the PRSC achieves 100% recognition accuracy on our compressed data set, improving performance over that produced from the original (uncompressed) data set, by 3.7%. We maintain this success for compression ratios up to 6:1. Addressing the issue of reduced resolution recognition, our Subband Domain Correlation Filters operate directly on the subband coefficients at multiple resolution levels. For compression ratios of at least 20:1, we achieve recognition performance of at least 90%, 85%, and 75%, respectively, on two, four, and eight foot resolution data. Thus, through our solutions with compressed imagery, we outperform correlation results on the equivalent original imagery in terms of both speed and accuracy, as well as provide success at reduced resolutions of the data.

Item Type:Thesis (Dissertation (Ph.D.))
Subject Keywords:Electrical Engineering
Degree Grantor:California Institute of Technology
Division:Engineering and Applied Science
Major Option:Electrical Engineering
Thesis Availability:Restricted to Caltech community only
Research Advisor(s):
  • Goodman, Rodney M.
Thesis Committee:
  • Koch, Christof
  • Psaltis, Demetri
  • Vaidyanathan, P. P.
  • Abu-Mostafa, Yaser S.
  • Goodman, Rodney M.
Defense Date:5 May 2000
Record Number:CaltechTHESIS:10052010-135012445
Persistent URL:http://resolver.caltech.edu/CaltechTHESIS:10052010-135012445
Default Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:6098
Collection:CaltechTHESIS
Deposited By: Benjamin Perez
Deposited On:11 Oct 2010 16:50
Last Modified:26 Dec 2012 04:31

Thesis Files

[img] PDF - Final Version
Restricted to Caltech community only
See Usage Policy.

42Mb

Repository Staff Only: item control page