CaltechTHESIS
  A Caltech Library Service

Make the Most of What You Have: Resource-Efficient Randomized Algorithms for Matrix Computations

Citation

Epperly, Ethan Nicholas (2025) Make the Most of What You Have: Resource-Efficient Randomized Algorithms for Matrix Computations. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/pef3-mg80. https://resolver.caltech.edu/CaltechTHESIS:05272025-233435853

Abstract

In recent years, randomized algorithms have established themselves as fundamental tools in computational linear algebra, with applications in scientific computing, machine learning, and quantum information science. Many randomized matrix algorithms proceed by first collecting information about a matrix and then processing that data to perform some computational task. This thesis addresses the following question: How can one design algorithms that use this information as efficiently as possible, reliably achieving the greatest possible speed and accuracy for a limited data budget? This question is timely, as randomized algorithms are increasingly being deployed in production software and in applications where accuracy and reliability is critical.

The first part of this thesis focuses on the problem of low-rank approximation for positive-semidefinite matrices, motivated by applications to accelerating kernel and Gaussian process machine learning methods. Here, the goal is to compute an accurate approximation to a matrix after accessing as few entries of the matrix as possible. This part of the thesis explores the randomly pivoted Cholesky (RPCholesky) algorithm for this task, which achieves a level of speed and reliability greater than other methods for the same problem.

The second part of this thesis considers the task of estimating attributes of an implicit matrix accessible only by matrix–vector products, motivated by applications in quantum physics, network science, and machine learning. This thesis describes the leave-one-out approach to developing matrix attribute estimation algorithms, and develops optimized trace, diagonal, and row-norm estimation algorithms for this computational model.

The third part of this thesis considers randomized algorithms for overdetermined linear least squares problems, which arise in statistics and machine learning. Randomized algorithms for linear-least squares problems are asymptotically faster than any known deterministic algorithm, but recent work of [Meier et al., SIMAX '24] raised questions about the accuracy of these methods when implemented in floating point arithmetic. This thesis shows these issues are resolvable by developing fast randomized least-squares problem achieving backward stability, the gold-standard accuracy and stability guarantee for a numerical algorithm.

Item Type:Thesis (Dissertation (Ph.D.))
Subject Keywords:randomized algorithms; low-rank approximation; Cholesky factorization; random pivoting; leave-one-out algorithm; trace estimation; least-squares problem; numerical stability
Degree Grantor:California Institute of Technology
Division:Engineering and Applied Science
Major Option:Applied And Computational Mathematics
Awards:W. P. Carey & Co., Inc., Prize in Applied Mathematics, 2025. Thomas A. Tisch Prize for Graduate Teaching in Computing and Mathematical Sciences, 2022.
Thesis Availability:Not set
Research Advisor(s):
  • Tropp, Joel A.
Thesis Committee:
  • Chandrasekaran, Venkat (chair)
  • Tropp, Joel A.
  • Hoffmann, Franca
  • Lin, Lin
Defense Date:1 May 2025
Non-Caltech Author Email:epperlyethan (AT) gmail.com
Funders:
Funding AgencyGrant Number
Department of Energy Computational Science Graduate FellowshipDE-SC0021110
Office of Naval Research BRC AwardN00014-18-1-2363
Office of Naval Research BRC AwardN00014-24-1-2223
NSF FRG1952777
Caltech Carver Mead New Adventures FundUNSPECIFIED
Record Number:CaltechTHESIS:05272025-233435853
Persistent URL:https://resolver.caltech.edu/CaltechTHESIS:05272025-233435853
DOI:10.7907/pef3-mg80
Related URLs:
URLURL TypeDescription
https://doi.org/10.1002/cpa.22234DOIJournal version of "randomly pivoted Cholesky" paper; major source for Chapters 3–6 and 9
https://doi.org/10.48550/arXiv.2410.03969DOIarXiv preprint of "embrace rejection" paper; major source for Chapter 8
https://doi.org/10.1137/23M1616790DOIJournal version of "fast and forward stable" paper; major source for Chapter 23
https://doi.org/10.48550/arXiv.2406.03468DOIarXiv preprint of "fast randomized least-squares" paper; major source for Chapters 23 and 24
https://doi.org/10.1137/23M1558537DOIJournal version of "efficient error and variance" paper; major source for Chapters 18 and 19
https://doi.org/10.1137/23M1548323DOIJournal version of "XTrace" paper; major source for Chapters 13–16
https://doi.org/10.48550/arXiv.2304.12465DOIarXiv preprint of "robust, randomized preconditioning" paper; major source for Chapter 6
https://doi.org/10.5555/3666122.3668997DOIJournal version of "kernel quadrature" paper; major source for Chapter 7
ORCID:
AuthorORCID
Epperly, Ethan Nicholas0000-0003-0712-8296
Default Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:17278
Collection:CaltechTHESIS
Deposited By: Ethan Epperly
Deposited On:02 Jun 2025 22:58
Last Modified:26 Jun 2025 18:57

Full text not available from this repository.

Repository Staff Only: item control page