Predictions and Policy Optimization in Online Decision Making

Citation

Lin, Yiheng (2025) Predictions and Policy Optimization in Online Decision Making. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/37t0-7n77. https://resolver.caltech.edu/CaltechTHESIS:05292025-064811885

Abstract

Predictions are ubiquitous in modern systems, offering insights into how environments might evolve by encoding our prior knowledge and assumptions. Recent advances in artificial intelligence have significantly expanded the scope and accuracy of such models, creating vast new opportunities across domains. At the same time, online decision making remains a fundamental challenge in many real-world problems, concerned with challenges such as limited information, delayed feedback, and irrevocable actions. This dissertation focuses on the interplay between predictions and online decision making---how predictive information can be effectively leveraged to improve performance in dynamic, uncertain environments.

While incorporating predictions often enhances decision-making, the degree of improvement can vary substantially. This variability arises from two key factors. First, the potential benefit of using predictions is fundamentally determined by both the nature of the predictions (e.g., their targets, errors, and distributions) and the characteristics of the decision-making process (e.g., costs and dynamics). Second, standard predictive policies frequently fall short of realizing such potential, especially in changing environments or when critical system parameters are unknown.

This dissertation introduces a unified theoretical framework to quantify the benefit of leveraging predictions across a broad range of online decision-making problems. To close the gap between the maximum potential and achievable performance, we formulate a general policy optimization framework and design efficient algorithms capable of tracking optimal (predictive) policies in time-varying settings. Additionally, we address practical considerations such as scalability and computational efficiency, enabling the application of our methods in large-scale networks and on resource-constrained devices.

Item Type:

Thesis (Dissertation (Ph.D.))

Subject Keywords:

Predictions, policy optimization, online decision making

Degree Grantor:

California Institute of Technology

Division:

Engineering and Applied Science

Major Option:

Computing and Mathematical Sciences

Awards:

Amori Doctoral Prize in CMS, 2025.

Thesis Availability:

Public (worldwide access)

Research Advisor(s):

Wierman, Adam C. (advisor)
Yue, Yisong (co-advisor)

Thesis Committee:

Mazumdar, Eric V. (chair)
Wierman, Adam C.
Yue, Yisong
Srikant, Rayadurgam

Defense Date:

13 May 2025

Funders:

Funding Agency	Grant Number
U.S. National Science Foundation	2106403
U.S. National Science Foundation	2105648
U.S. National Science Foundation	1637598
U.S. National Science Foundation	2146814
U.S. National Science Foundation	2136197
U.S. National Science Foundation	2326609
Amazon AI4Science Fellowship	UNSPECIFIED
PIMCO Graduate Fellowship in Data Science	UNSPECIFIED
Kortschak Scholars program	UNSPECIFIED

Record Number:

CaltechTHESIS:05292025-064811885

Persistent URL:

https://resolver.caltech.edu/CaltechTHESIS:05292025-064811885

DOI:

10.7907/37t0-7n77

Related URLs:

URL	URL Type	Description
https://doi.org/10.1145/3651890.3672260	DOI	Article adapted for Chapter 3
https://proceedings.mlr.press/v247/lin24a.html	Publisher	Article adapted for Chapter 5
https://proceedings.neurips.cc/paper_files/paper/2023/file/a7a7180fe7f82ff98eee0827c5e9c141-Paper-Conference.pdf	Publisher	Article adapted for Chapter 5
https://proceedings.mlr.press/v162/lin22c/lin22c.pdf	Publisher	Article adapted for Chapter 3
https://proceedings.neurips.cc/paper_files/paper/2022/file/eadeef7c51ad86989cc3b311cb49ec89-Paper-Conference.pdf	Publisher	Article adapted for Chapter 3
https://proceedings.neurips.cc/paper_files/paper/2021/file/298f587406c914fad5373bb689300433-Paper.pdf	Publisher	Article adapted for Chapter 3
https://proceedings.neurips.cc/paper_files/paper/2021/file/412604be30f701b1b1e3124c252065e6-Paper.pdf	Publisher	Article adapted for Chapter 6

ORCID:

Author	ORCID
Lin, Yiheng	0000-0001-6524-2877

Default Usage Policy:

No commercial reproduction, distribution, display or performance rights in this work are provided.

ID Code:

17295

Collection:

CaltechTHESIS

Deposited By:

Yiheng Lin

Deposited On:

30 May 2025 23:30

Last Modified:

17 Jun 2025 17:42

Thesis Files

PDF - Final Version
See Usage Policy.
3MB

Repository Staff Only: item control page