CaltechTHESIS
  A Caltech Library Service

Unsupervised learning of categorical segments in image collections

Citation

Andreetto, Marco (2011) Unsupervised learning of categorical segments in image collections. Dissertation (Ph.D.), California Institute of Technology. http://resolver.caltech.edu/CaltechTHESIS:04262011-213152111

Abstract

Which one comes first: segmentation or recognition? We propose a unified framework for carrying out the two simultaneously and without supervision. The framework combines a flexible probabilistic model for representing the shape and appearance of each segment, with the popular "bag of visual words" model for recognition. If applied to a collection of images, our framework can simultaneously discover the segments of each image, and the correspondence between such segments, without supervision. Such recurring segments may be thought of as the "parts" of corresponding objects that appear multiple times in the image collection. Thus, the model may be used for learning new categories, detecting/classifying objects, and segmenting images, without using expensive human annotation.

Item Type:Thesis (Dissertation (Ph.D.))
Subject Keywords:Computer Vision, Machine Learning, Image Segmentation, Object Recognition, Statistical Models, Montecarlo Methods
Degree Grantor:California Institute of Technology
Division:Engineering and Applied Science
Major Option:Electrical Engineering
Thesis Availability:Public (worldwide access)
Research Advisor(s):
  • Perona, Pietro
Thesis Committee:
  • Perona, Pietro (chair)
  • Abu-Mostafa, Yaser S.
  • Hassibi, Babak
  • Welling, Max
  • Belongie, Serge J.
Defense Date:12 January 2011
Author Email:marco (AT) vision.caltech.edu
Record Number:CaltechTHESIS:04262011-213152111
Persistent URL:http://resolver.caltech.edu/CaltechTHESIS:04262011-213152111
Related URLs:
URLURL TypeDescription
http://www.vision.caltech.edu/marco/AuthorUNSPECIFIED
Default Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:6355
Collection:CaltechTHESIS
Deposited By: Marco Andreetto
Deposited On:27 May 2011 20:34
Last Modified:26 Dec 2012 04:34

Thesis Files

[img]
Preview
PDF - Final Version
See Usage Policy.

17Mb

Repository Staff Only: item control page