New Algorithms for Programmatic Deep Learning with Applications to Behavior Modeling

Citation

Zhan, Eric (2022) New Algorithms for Programmatic Deep Learning with Applications to Behavior Modeling. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/5n5q-x203. https://resolver.caltech.edu/CaltechTHESIS:11302021-224628633

Abstract

Raw behavioral data is becoming increasingly more abundant and more easily obtainable in spatiotemporal domains such as sports, video games, navigation & driving, motion capture, and animal science. How can we best use this data to advance their respective domains forward? For instance, researchers for self-driving vehicles would like to identify the key features of the environment state that impact decision-making the most; game developers would like to populate their games with characters that have unique and diverse behaviors to create a more immersive gaming experience; and behavioral neuroscientists would like to uncover the underlying mechanisms that drive learning in animals. Machine learning, the science of developing models and algorithms to identify and leverage patterns in data, is well-equipped to aid in these endeavors. But how do we integrate machine learning with these spatiotemporal domains in a principled way? In this dissertation, we develop and introduce new algorithms in programmatic deep learning that tackle some of the new challenges encountered in behavior modeling.

Our work in programmatic deep learning comprises two main themes: in the first, we show how to use expert-written programs as sources of weak labels in domains where manually-annotated expert labels are scarce; in the second, we explore programs as a flexible function class with human-interpretable structure and show how to learn them via neurosymbolic program learning. Augmenting deep learning with programmatic structure allows domain experts to easily incorporate domain knowledge into machine learning models; we show that this results in significant improvements in many behavior modeling applications like imitation learning, controllable generation, counterfactual analysis, and unsupervised clustering.

Item Type:

Thesis (Dissertation (Ph.D.))

Subject Keywords:

Machine Learning, Deep Learning, Behavior Modeling, Neurosymbolic Program Learning, Program Synthesis, Imitation Learning, Controllable Generation, Representation Learning, Unsupervised Clustering, Generative Modeling

Degree Grantor:

California Institute of Technology

Division:

Engineering and Applied Science

Major Option:

Computing and Mathematical Sciences

Thesis Availability:

Public (worldwide access)

Research Advisor(s):

Yue, Yisong

Thesis Committee:

Murray, Richard M. (chair)
Perona, Pietro
Chaudhuri, Swarat
Yue, Yisong

Defense Date:

16 November 2021

Non-Caltech Author Email:

zhaneric94 (AT) gmail.com

Funders:

Funding Agency	Grant Number
NSF	1564330
NSF	1918655
DARPA PAI	UNSPECIFIED
Intel	UNSPECIFIED
Activision/Blizzard	UNSPECIFIED
Northrop Grumman	UNSPECIFIED
STATS	UNSPECIFIED

Record Number:

CaltechTHESIS:11302021-224628633

Persistent URL:

https://resolver.caltech.edu/CaltechTHESIS:11302021-224628633

DOI:

10.7907/5n5q-x203

Related URLs:

URL	URL Type	Description
https://openreview.net/forum?id=rkxw-hAcFQ	Publisher	Content adapted for Ch. 2
https://proceedings.mlr.press/v119/zhan20a.html	Publisher	Content adapted for Ch. 3
https://proceedings.neurips.cc/paper/2020/hash/342285bb2a8cadef22f667eeb6a63732-Abstract.html	Publisher	Content adapted for Ch. 4
https://arxiv.org/abs/2107.13132	arXiv	Content adapted for Ch. 5

ORCID:

Author	ORCID
Zhan, Eric	0000-0003-0521-5329

Default Usage Policy:

No commercial reproduction, distribution, display or performance rights in this work are provided.

ID Code:

14436

Collection:

CaltechTHESIS

Deposited By:

Eric Zhan

Deposited On:

07 Dec 2021 19:05

Last Modified:

14 Dec 2021 17:58

Thesis Files

PDF - Final Version
See Usage Policy.
8MB

Repository Staff Only: item control page