Citation
Kondapaneni, Neehar (2025) Aligning and Comparing Vision Representations to Improve Understanding and Performance. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/0crh-zb71. https://resolver.caltech.edu/CaltechTHESIS:06032025-000834048
Abstract
Recent advances in large artificial intelligence (AI) models have enabled these models to perform a wide range of real-world tasks with skill levels comparable to or surpassing those of humans. In this thesis, we develop methods to compare, analyze, and align data representations from these powerful models. In Part 1, we develop methods for estimating human knowledge during a learning task and for comparing various data representations. These methods are steps towards a system designed to help us learn from AI.
In Part 2, we show how aligning models can be useful in two separate domains. First, we discover and fix a misalignment in the inputs to a powerful foundation model and show how it improves performance. Second, we show that biologically inspired object manipulation tasks can be used as a training signal for learning human-aligned representations of number. Our results demonstrate the potential for alignment and comparison methods to improve the overall performance of AI models, improve our understanding of biological intelligence, and help us discover new patterns in the natural world.
Item Type: | Thesis (Dissertation (Ph.D.)) | ||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Subject Keywords: | computer vision, artificial intelligence, visual cognition, visual psychophysics, human cognition, machine teaching, visual categorization | ||||||||||||||||||
Degree Grantor: | California Institute of Technology | ||||||||||||||||||
Division: | Biology and Biological Engineering | ||||||||||||||||||
Major Option: | Computation and Neural Systems | ||||||||||||||||||
Awards: | Thomas A. Tisch Prize for Graduate Teaching in Computing and Mathematical Sciences, 2024. | ||||||||||||||||||
Thesis Availability: | Public (worldwide access) | ||||||||||||||||||
Research Advisor(s): |
| ||||||||||||||||||
Thesis Committee: |
| ||||||||||||||||||
Defense Date: | 27 May 2025 | ||||||||||||||||||
Non-Caltech Author Email: | neeharkondapa (AT) gmail.com | ||||||||||||||||||
Record Number: | CaltechTHESIS:06032025-000834048 | ||||||||||||||||||
Persistent URL: | https://resolver.caltech.edu/CaltechTHESIS:06032025-000834048 | ||||||||||||||||||
DOI: | 10.7907/0crh-zb71 | ||||||||||||||||||
Related URLs: |
| ||||||||||||||||||
ORCID: |
| ||||||||||||||||||
Default Usage Policy: | No commercial reproduction, distribution, display or performance rights in this work are provided. | ||||||||||||||||||
ID Code: | 17388 | ||||||||||||||||||
Collection: | CaltechTHESIS | ||||||||||||||||||
Deposited By: | Neehar Kondapaneni | ||||||||||||||||||
Deposited On: | 04 Jun 2025 00:36 | ||||||||||||||||||
Last Modified: | 13 Jun 2025 18:22 |
Thesis Files
![]() |
PDF
- Final Version
See Usage Policy. 83MB |
Repository Staff Only: item control page