Citation
Aly, Mohamed Alaa El-Dien Mahmoud Hussein (2011) Searching Large-Scale Image Collections. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/VRGJ-4J54. https://resolver.caltech.edu/CaltechTHESIS:04252011-145432540
Abstract
Searching quickly and accurately in a large collection of images has become an increasingly important problem. The ultimate goal is to make visual search possible: allowing users to search using images in addition to typing text. The typical approach is to index all the images of interest (e.g., images of landmarks, books, or DVDs) in a database and let users query the system with images. Such a database can reach billions of images, which poses challenges in terms of memory and computational requirements and recognition performance. In this work we provide an in-depth study of systems used for searching large-scale image collections. Specifically, we provide a thorough comparison of the two leading image search approaches: Full Representation (FR) vs. Bag of Words (BoW). We derive theoretical estimates of how the memory and computational costs scale with the number of images in the database, and empirically evaluate the performance and run time on four real-world datasets. Our experiments suggest that FR provides better recognition performance than BoW, though it requires more memory. We address these shortcomings by presenting novel methods that increase the recognition performance of BoW and decrease the memory requirements of FR. Finally, we present a novel way to parallelize FR on multiple machines and scale up database sizes to 100 million images with interactive run time.
Item Type: | Thesis (Dissertation (Ph.D.)) |
---|---|
Subject Keywords: | computer vision, visual object recognition, large-scale object recognition, large-scale image search, content-based image retrieval |
Degree Grantor: | California Institute of Technology |
Division: | Engineering and Applied Science |
Major Option: | Electrical Engineering |
Thesis Availability: | Public (worldwide access) |
Research Advisor(s): | |
Thesis Committee: | |
Defense Date: | 23 May 2011 |
Record Number: | CaltechTHESIS:04252011-145432540 |
Persistent URL: | https://resolver.caltech.edu/CaltechTHESIS:04252011-145432540 |
DOI: | 10.7907/VRGJ-4J54 |
Related URLs: | |
Default Usage Policy: | No commercial reproduction, distribution, display or performance rights in this work are provided. |
ID Code: | 6353 |
Collection: | CaltechTHESIS |
Deposited By: | Mohamed Aly |
Deposited On: | 27 May 2011 21:09 |
Last Modified: | 08 Nov 2023 00:44 |
Thesis Files
PDF - Final Version (3MB). See Usage Policy.