CaltechTHESIS
  A Caltech Library Service

Contextual pattern recognition with applications to biomedical image identification

Citation

Song, Xubo (1999) Contextual pattern recognition with applications to biomedical image identification. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/F5YK-HM52. https://resolver.caltech.edu/CaltechETD:etd-09222005-111015

Abstract

This thesis studies two rather distinct topics: the incorporation of contextual information in pattern recognition, with applications to biomedical image identification, and the theoretical modeling of learning and generalization in machine learning.

In Part I of the thesis, we propose techniques to incorporate contextual information into object classification. In the real world, there are cases where the identity of an object is ambiguous because of noise in the measurements on which the classification is based. It is helpful to reduce this ambiguity by utilizing extra information referred to as context, which in our case is the identities of the accompanying objects. We investigate the incorporation of both full and partial context. The error probabilities of both, in terms of set-by-set error and element-by-element error, are established and compared with those of the context-free approach. The computational cost is studied in detail for the full-context, partial-context, and context-free cases. The techniques are applied to toy problems as well as real-world problems such as white blood cell image classification and microscopic urinalysis. It is demonstrated that superior classification performance is achieved by using context. In our particular application, it reduces the overall classification error as well as the false positive and false negative diagnosis rates.

In Part II of the thesis, we propose a novel theoretical framework, called the Bin Model, for learning and generalization. Using the Bin Model, a closed form is derived for generalization that estimates the out-of-sample performance in terms of the in-sample performance. We address the problem of overfitting and characterize the conditions under which it does not appear. The effect of noise on generalization is studied, and the extension of the Bin Model framework from classification problems to regression problems is discussed.

Item Type: Thesis (Dissertation (Ph.D.))
Subject Keywords: Compound Bayesian Theory; context; contextual information; image recognition; pattern recognition
Degree Grantor: California Institute of Technology
Division: Engineering and Applied Science
Major Option: Electrical Engineering
Thesis Availability: Public (worldwide access)
Research Advisor(s):
  • Abu-Mostafa, Yaser S.
Thesis Committee:
  • Abu-Mostafa, Yaser S. (chair)
  • Psaltis, Demetri
  • Kasdan, Harvey L.
  • Perona, Pietro
Defense Date: 23 November 1998
Non-Caltech Author Email: xubosong (AT) csee.ogi.edu
Record Number: CaltechETD:etd-09222005-111015
Persistent URL: https://resolver.caltech.edu/CaltechETD:etd-09222005-111015
DOI: 10.7907/F5YK-HM52
Default Usage Policy: No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code: 3690
Collection: CaltechTHESIS
Deposited By: Imported from ETD-db
Deposited On: 23 Sep 2005
Last Modified: 21 Dec 2019 01:57

Thesis Files

PDF (Song_x_1999.pdf) - Final Version
See Usage Policy.

4MB
