Do Robots Dream of Random Trees? Monte Carlo Tree Search for Dynamical, Partially Observable, and Multi-Agent Systems

Citation

Rivière, Benjamin Pierre (2024) Do Robots Dream of Random Trees? Monte Carlo Tree Search for Dynamical, Partially Observable, and Multi-Agent Systems. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/dbwa-we50. https://resolver.caltech.edu/CaltechTHESIS:06032024-152357240

Abstract

Autonomous robots are poised to transform various aspects of society, spanning transportation, labor, and scientific space exploration. A critical component to enable their capabilities is the algorithm that interprets sensor data to generate intelligent planned behavior. Although reinforcement learning methods that train parameterized policies offline from data have shown recent success, they are inherently limited when robots inevitably encounter situations outside their training domain. In contrast, optimal control techniques, which compute trajectories in real-time using numerical optimization, typically yield only locally optimal solutions.

This research endeavors to bridge the gap by developing algorithms that compute trajectories in real-time while converging towards globally optimal solutions. Building upon the Monte Carlo Tree Search (MCTS) framework—a stochastic tree search method that simulates future trajectories while balancing exploration and exploitation—the research focus is twofold: (i) constructing an efficient discrete representation of continuous systems in a decision trees, and (ii) searching on the resulting tree while balancing exploration and exploitation to achieve global optimality.

The study spans theoretical analysis, algorithmic design, and hardware demonstrations across dynamical, partially observable, and multi-agent systems. By addressing these critical questions, this research aims to advance the field of autonomous robotics, enabling the deployment of intelligent robots in complex and diverse environments.

Item Type:

Thesis (Dissertation (Ph.D.))

Subject Keywords:

Robotics; Planning; Dynamical Systems; Machine Learning

Degree Grantor:

California Institute of Technology

Division:

Engineering and Applied Science

Major Option:

Aeronautics

Awards:

William F. Ballhaus Prize, 2024.

Thesis Availability:

Public (worldwide access)

Research Advisor(s):

Chung, Soon-Jo

Thesis Committee:

Yue, Yisong (chair)
Chung, Soon-Jo
Hadaegh, Fred
Pellegrino, Sergio

Defense Date:

29 May 2024

Non-Caltech Author Email:

benjamin.p.riviere (AT) gmail.com

Funders:

Funding Agency	Grant Number
Supernal	UNSPECIFIED
Aerospace Corporation	UNSPECIFIED
JPL	UNSPECIFIED
Defense Advanced Research Projects Agency (DARPA)	UNSPECIFIED
United Technologies Corporation	UNSPECIFIED

Record Number:

CaltechTHESIS:06032024-152357240

Persistent URL:

https://resolver.caltech.edu/CaltechTHESIS:06032024-152357240

DOI:

10.7907/dbwa-we50

Related URLs:

URL	URL Type	Description
https://doi.org/10.2514/6.2023-0874	DOI	Article for Chapter 3
https://doi.org/10.1109/LRA.2021.3096758	DOI	Article for Chapter 4
https://doi.org/10.1109/LRA.2020.2994035	DOI	Article for Chapter 5
https://doi.org/10.1109/TITS.2021.3097297	DOI	Article for Appendix C

ORCID:

Author	ORCID
Rivière, Benjamin Pierre	0000-0002-0597-5400

Default Usage Policy:

No commercial reproduction, distribution, display or performance rights in this work are provided.

ID Code:

16482

Collection:

CaltechTHESIS

Deposited By:

Benjamin Riviere

Deposited On:

06 Jun 2024 22:14

Last Modified:

17 Jun 2024 18:38

Thesis Files

PDF - Final Version
See Usage Policy.
46MB

Repository Staff Only: item control page