CaltechTHESIS
  A Caltech Library Service

Searching large-scale image collections

Citation

Aly, Mohamed Alaa El-Dien Mahmoud Hussein (2011) Searching large-scale image collections. Dissertation (Ph.D.), California Institute of Technology. http://resolver.caltech.edu/CaltechTHESIS:04252011-145432540

Abstract

Searching quickly and accurately in a large collection of images has become an increasingly important problem. The ultimate goal is to make visual search possible: allowing users to search using images in addition to typing text. The typical approach is to index all the images of interest (e.g., images of landmarks, books, or DVDs) in a database and let users query the system with images. Such a database can reach billions of images, which poses challenges in terms of memory requirements, computational requirements, and recognition performance. In this work we provide an in-depth study of systems used for searching large-scale image collections. Specifically, we provide a thorough comparison of the two leading image search approaches: Full Representation (FR) vs. Bag of Words (BoW). We derive theoretical estimates of how the memory and computational costs scale with the number of images in the database, and empirically evaluate the performance and run time on four real-world datasets. Our experiments suggest that FR provides better recognition performance than BoW, though it requires more memory. We therefore address these shortcomings by presenting novel methods that increase the recognition performance of BoW and decrease the memory requirements of FR. Finally, we present a novel way to parallelize FR on multiple machines and scale up database sizes to 100 million images with interactive run time.

Item Type:Thesis (Dissertation (Ph.D.))
Subject Keywords:computer vision, visual object recognition, large-scale object recognition, large-scale image search, content-based image retrieval
Degree Grantor:California Institute of Technology
Division:Engineering and Applied Science
Major Option:Electrical Engineering
Thesis Availability:Public (worldwide access)
Research Advisor(s):
  • Perona, Pietro
Thesis Committee:
  • Perona, Pietro (chair)
  • Abu-Mostafa, Yaser S.
  • Burdick, Joel Wakeman
  • Munich, Mario E.
Defense Date:23 May 2011
Record Number:CaltechTHESIS:04252011-145432540
Persistent URL:http://resolver.caltech.edu/CaltechTHESIS:04252011-145432540
Related URLs:
  • http://vision.caltech.edu/malaa (URL Type: Publisher; Description: UNSPECIFIED)
Default Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:6353
Collection:CaltechTHESIS
Deposited By: Mohamed Aly
Deposited On:27 May 2011 21:09
Last Modified:26 Dec 2012 04:34

Thesis Files

PDF - Final Version (3655Kb)
See Usage Policy.
