CaltechTHESIS
  A Caltech Library Service

Predictions and Policy Optimization in Online Decision Making

Citation

Lin, Yiheng (2025) Predictions and Policy Optimization in Online Decision Making. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/37t0-7n77. https://resolver.caltech.edu/CaltechTHESIS:05292025-064811885

Abstract

Predictions are ubiquitous in modern systems, offering insights into how environments might evolve by encoding our prior knowledge and assumptions. Recent advances in artificial intelligence have significantly expanded the scope and accuracy of such models, creating vast new opportunities across domains. At the same time, online decision making remains a fundamental challenge in many real-world problems, concerned with challenges such as limited information, delayed feedback, and irrevocable actions. This dissertation focuses on the interplay between predictions and online decision making---how predictive information can be effectively leveraged to improve performance in dynamic, uncertain environments.

While incorporating predictions often enhances decision-making, the degree of improvement can vary substantially. This variability arises from two key factors. First, the potential benefit of using predictions is fundamentally determined by both the nature of the predictions (e.g., their targets, errors, and distributions) and the characteristics of the decision-making process (e.g., costs and dynamics). Second, standard predictive policies frequently fall short of realizing such potential, especially in changing environments or when critical system parameters are unknown.

This dissertation introduces a unified theoretical framework to quantify the benefit of leveraging predictions across a broad range of online decision-making problems. To close the gap between the maximum potential and achievable performance, we formulate a general policy optimization framework and design efficient algorithms capable of tracking optimal (predictive) policies in time-varying settings. Additionally, we address practical considerations such as scalability and computational efficiency, enabling the application of our methods in large-scale networks and on resource-constrained devices.

Item Type:Thesis (Dissertation (Ph.D.))
Subject Keywords:Predictions, policy optimization, online decision making
Degree Grantor:California Institute of Technology
Division:Engineering and Applied Science
Major Option:Computing and Mathematical Sciences
Awards:Amori Doctoral Prize in CMS, 2025.
Thesis Availability:Public (worldwide access)
Research Advisor(s):
  • Wierman, Adam C. (advisor)
  • Yue, Yisong (co-advisor)
Thesis Committee:
  • Mazumdar, Eric V. (chair)
  • Wierman, Adam C.
  • Yue, Yisong
  • Srikant, Rayadurgam
Defense Date:13 May 2025
Funders:
Funding AgencyGrant Number
U.S. National Science Foundation2106403
U.S. National Science Foundation2105648
U.S. National Science Foundation1637598
U.S. National Science Foundation2146814
U.S. National Science Foundation2136197
U.S. National Science Foundation2326609
Amazon AI4Science FellowshipUNSPECIFIED
PIMCO Graduate Fellowship in Data ScienceUNSPECIFIED
Kortschak Scholars programUNSPECIFIED
Record Number:CaltechTHESIS:05292025-064811885
Persistent URL:https://resolver.caltech.edu/CaltechTHESIS:05292025-064811885
DOI:10.7907/37t0-7n77
Related URLs:
URLURL TypeDescription
https://doi.org/10.1145/3651890.3672260DOIArticle adapted for Chapter 3
https://proceedings.mlr.press/v247/lin24a.htmlPublisherArticle adapted for Chapter 5
https://proceedings.neurips.cc/paper_files/paper/2023/file/a7a7180fe7f82ff98eee0827c5e9c141-Paper-Conference.pdfPublisherArticle adapted for Chapter 5
https://proceedings.mlr.press/v162/lin22c/lin22c.pdfPublisherArticle adapted for Chapter 3
https://proceedings.neurips.cc/paper_files/paper/2022/file/eadeef7c51ad86989cc3b311cb49ec89-Paper-Conference.pdfPublisherArticle adapted for Chapter 3
https://proceedings.neurips.cc/paper_files/paper/2021/file/298f587406c914fad5373bb689300433-Paper.pdfPublisherArticle adapted for Chapter 3
https://proceedings.neurips.cc/paper_files/paper/2021/file/412604be30f701b1b1e3124c252065e6-Paper.pdfPublisherArticle adapted for Chapter 6
ORCID:
AuthorORCID
Lin, Yiheng0000-0001-6524-2877
Default Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:17295
Collection:CaltechTHESIS
Deposited By: Yiheng Lin
Deposited On:30 May 2025 23:30
Last Modified:17 Jun 2025 17:42

Thesis Files

[img] PDF - Final Version
See Usage Policy.

3MB

Repository Staff Only: item control page