Make the Most of What You Have: Resource-Efficient Randomized Algorithms for Matrix Computations

Citation

Epperly, Ethan Nicholas (2025) Make the Most of What You Have: Resource-Efficient Randomized Algorithms for Matrix Computations. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/pef3-mg80. https://resolver.caltech.edu/CaltechTHESIS:05272025-233435853

Abstract

In recent years, randomized algorithms have established themselves as fundamental tools in computational linear algebra, with applications in scientific computing, machine learning, and quantum information science. Many randomized matrix algorithms proceed by first collecting information about a matrix and then processing that data to perform some computational task. This thesis addresses the following question: How can one design algorithms that use this information as efficiently as possible, reliably achieving the greatest possible speed and accuracy for a limited data budget? This question is timely, as randomized algorithms are increasingly being deployed in production software and in applications where accuracy and reliability are critical.

The first part of this thesis focuses on the problem of low-rank approximation for positive-semidefinite matrices, motivated by applications to accelerating kernel and Gaussian process machine learning methods. Here, the goal is to compute an accurate approximation to a matrix after accessing as few entries of the matrix as possible. This part of the thesis explores the randomly pivoted Cholesky (RPCholesky) algorithm for this task, which achieves a level of speed and reliability greater than other methods for the same problem.
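As a rough illustration of the random pivoting idea, the following is a minimal sketch, assuming a dense NumPy array as input. The function name `rp_cholesky`, its arguments, and the example matrix are illustrative assumptions and are not taken from the thesis. Each step samples a pivot column with probability proportional to the diagonal of the current residual, so the routine reads only the diagonal of the matrix and `k` of its columns.

```python
import numpy as np

def rp_cholesky(A, k, rng):
    """Rank-k approximation A ~= F @ F.T of a PSD matrix via random pivoting.

    Illustrative sketch: only diag(A) and k selected columns of A are read.
    """
    n = A.shape[0]
    F = np.zeros((n, k))
    d = np.diag(A).astype(float)          # diagonal of the residual matrix
    pivots = []
    for i in range(k):
        # Sample a pivot with probability proportional to the residual diagonal.
        s = rng.choice(n, p=d / d.sum())
        # Column s of the residual A - F F^T.
        g = A[:, s] - F[:, :i] @ F[s, :i]
        F[:, i] = g / np.sqrt(g[s])
        # Update the residual diagonal; clip guards against tiny negative rounding.
        d = np.clip(d - F[:, i] ** 2, 0.0, None)
        pivots.append(s)
    return F, pivots

# Example on a synthetic PSD matrix of exact rank 5 (hypothetical test data).
rng = np.random.default_rng(0)
B = rng.standard_normal((50, 5))
A = B @ B.T
F, pivots = rp_cholesky(A, 5, rng)
rel_err = np.linalg.norm(A - F @ F.T) / np.linalg.norm(A)
```

Because the example matrix has rank 5, five pivoting steps should recover it up to rounding error; for matrices that are only approximately low rank, the approximation quality depends on the spectrum and on `k`.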

The second part of this thesis considers the task of estimating attributes of an implicit matrix accessible only by matrix–vector products, motivated by applications in quantum physics, network science, and machine learning. This thesis describes the leave-one-out approach to developing matrix attribute estimation algorithms, and develops optimized trace, diagonal, and row-norm estimation algorithms for this computational model.
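To make the matrix–vector product access model concrete, here is a minimal sketch of the classical Girard–Hutchinson trace estimator. This is a standard baseline, not one of the optimized leave-one-out estimators the thesis develops; the function name `hutchinson_trace` and the `matvec` callback are illustrative assumptions.

```python
import numpy as np

def hutchinson_trace(matvec, n, num_samples, rng):
    """Estimate tr(A) using only products v -> A @ v (Girard-Hutchinson baseline)."""
    total = 0.0
    for _ in range(num_samples):
        z = rng.choice([-1.0, 1.0], size=n)   # random sign (Rademacher) vector
        total += z @ matvec(z)                # z^T A z is an unbiased estimate of tr(A)
    return total / num_samples

# Example: the matrix is touched only through the matvec callback.
rng = np.random.default_rng(0)
M = rng.standard_normal((100, 100))
A = M @ M.T                                   # hypothetical test matrix
estimate = hutchinson_trace(lambda v: A @ v, 100, 200, rng)
```

Each sample costs one matrix–vector product, so the number of samples plays the role of the data budget.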

The third part of this thesis considers randomized algorithms for overdetermined linear least-squares problems, which arise in statistics and machine learning. Randomized algorithms for linear least-squares problems are asymptotically faster than any known deterministic algorithm, but recent work of [Meier et al., SIMAX '24] raised questions about the accuracy of these methods when implemented in floating-point arithmetic. This thesis shows these issues are resolvable by developing fast randomized least-squares algorithms that achieve backward stability, the gold-standard accuracy and stability guarantee for a numerical algorithm.
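For orientation, the following is a minimal sketch of one basic randomized approach to least squares, iterative sketching: a random embedding compresses the problem, the QR factor of the compressed matrix serves as a preconditioner, and a simple iteration refines the solution. It is not the backward stable algorithm of the thesis and makes no stability claim. The function name `iterative_sketching`, the dense Gaussian embedding, and the parameter values are assumptions chosen for simplicity; fast implementations typically use sparse or structured embeddings instead.

```python
import numpy as np

def iterative_sketching(A, b, sketch_dim, num_iters, rng):
    """Approximately minimize ||A x - b|| using a sketched preconditioner."""
    m, n = A.shape
    # Dense Gaussian embedding (simple but slow; an assumption of this sketch).
    S = rng.standard_normal((sketch_dim, m)) / np.sqrt(sketch_dim)
    _, R = np.linalg.qr(S @ A)            # reduced QR; R^T R approximates A^T A
    x = np.zeros(n)
    for _ in range(num_iters):
        grad = A.T @ (b - A @ x)          # negative gradient of 0.5 * ||A x - b||^2
        # Solve (R^T R) dx = grad. np.linalg.solve ignores the triangular
        # structure of R; a triangular solver would be cheaper.
        y = np.linalg.solve(R.T, grad)
        x = x + np.linalg.solve(R, y)
    return x

# Example on a synthetic overdetermined problem (hypothetical test data).
rng = np.random.default_rng(0)
A = rng.standard_normal((2000, 10))
b = rng.standard_normal(2000)
x = iterative_sketching(A, b, sketch_dim=400, num_iters=60, rng=rng)
x_ref = np.linalg.lstsq(A, b, rcond=None)[0]
```

The iteration converges only when the sketch dimension is sufficiently larger than the number of columns; here it is 40 times larger, which is a generous choice made for the sake of the example.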

Item Type:

Thesis (Dissertation (Ph.D.))

Subject Keywords:

randomized algorithms; low-rank approximation; Cholesky factorization; random pivoting; leave-one-out algorithm; trace estimation; least-squares problem; numerical stability

Degree Grantor:

California Institute of Technology

Division:

Engineering and Applied Science

Major Option:

Applied And Computational Mathematics

Awards:

W. P. Carey & Co., Inc., Prize in Applied Mathematics, 2025. Thomas A. Tisch Prize for Graduate Teaching in Computing and Mathematical Sciences, 2022.

Thesis Availability:

Not set

Research Advisor(s):

Tropp, Joel A.

Thesis Committee:

Chandrasekaran, Venkat (chair)
Tropp, Joel A.
Hoffmann, Franca
Lin, Lin

Defense Date:

1 May 2025

Non-Caltech Author Email:

epperlyethan (AT) gmail.com

Funders:

Funding Agency	Grant Number
Department of Energy Computational Science Graduate Fellowship	DE-SC0021110
Office of Naval Research BRC Award	N00014-18-1-2363
Office of Naval Research BRC Award	N00014-24-1-2223
NSF FRG	1952777
Caltech Carver Mead New Adventures Fund	UNSPECIFIED

Record Number:

CaltechTHESIS:05272025-233435853

Persistent URL:

https://resolver.caltech.edu/CaltechTHESIS:05272025-233435853

DOI:

10.7907/pef3-mg80

Related URLs:

URL	URL Type	Description
https://doi.org/10.1002/cpa.22234	DOI	Journal version of "randomly pivoted Cholesky" paper; major source for Chapters 3–6 and 9
https://doi.org/10.48550/arXiv.2410.03969	DOI	arXiv preprint of "embrace rejection" paper; major source for Chapter 8
https://doi.org/10.1137/23M1616790	DOI	Journal version of "fast and forward stable" paper; major source for Chapter 23
https://doi.org/10.48550/arXiv.2406.03468	DOI	arXiv preprint of "fast randomized least-squares" paper; major source for Chapters 23 and 24
https://doi.org/10.1137/23M1558537	DOI	Journal version of "efficient error and variance" paper; major source for Chapters 18 and 19
https://doi.org/10.1137/23M1548323	DOI	Journal version of "XTrace" paper; major source for Chapters 13–16
https://doi.org/10.48550/arXiv.2304.12465	DOI	arXiv preprint of "robust, randomized preconditioning" paper; major source for Chapter 6
https://doi.org/10.5555/3666122.3668997	DOI	Journal version of "kernel quadrature" paper; major source for Chapter 7

ORCID:

Author	ORCID
Epperly, Ethan Nicholas	0000-0003-0712-8296

Default Usage Policy:

No commercial reproduction, distribution, display or performance rights in this work are provided.

ID Code:

17278

Collection:

CaltechTHESIS

Deposited By:

Ethan Epperly

Deposited On:

02 Jun 2025 22:58

Last Modified:

26 Jun 2025 18:57

Full text not available from this repository.