CaltechTHESIS
  A Caltech Library Service

Computation Foundations of Spatial Transcriptomics

Citation

Moses, Lambda (2023) Computation Foundations of Spatial Transcriptomics. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/rt24-pq60. https://resolver.caltech.edu/CaltechTHESIS:05312023-213322223

Abstract

Single-cell and spatial transcriptomics have come of age in the past few years; datasets and data analysis software packages have proliferated. With the increasing sizes of datasets, proliferating new data collection technologies, and mainstreaming of high-throughput technologies, the software can be improved for better speed and memory efficiency, standardized and consistent user interface for multiple technologies, and in documentation to onboard new users. First, I collected a database of spatial transcriptomics literature and analyzed the data on trends and sociology in this field. Based on the database and data analyses, I wrote a comprehensive book both qualitatively and quantitatively documenting the history of the field since the 1960s and reviewing more recent developments, which informed the software and methods I later developed. Then, to address the challenges with the pre-processing large datasets, we developed \texttt{kallisto} \texttt{bustools} for fast and modular pseudoalignment of sequencing reads to the transcriptome in single-cell RNA-seq (scRNA-seq), giving consistent results with the established and much more computationally demanding alignment method Cell Ranger. Briefly summarized are my attempt to map dissociated cells in scRNA-seq to a spatial gene expression reference and to build a image processing pipeline for image based spatial transcriptomics data analysis. Finally, to address the challenges in downstream analyses of spatial -omics data, I first wrote the new \texttt{SpatialFeatureExperiment} (SFE) data structure to represent and operate on geometries in spatial transcriptomics data and to organize results from spatial analyses. Based on SFE, I wrote Voyager, which brings decades of research in geospatial data analysis to spatial transcriptomics, to better utilize the opportunities from spatial information to gain novel biological insights. To reduce user learning curve, Voyager conforms to SCE styles and conventions and has a comprehensive documentation website and consistent user interface to many geospatial methods.

Item Type:Thesis (Dissertation (Ph.D.))
Subject Keywords:spatial; single cell; transcriptomics; exploratory data analysis; history
Degree Grantor:California Institute of Technology
Division:Biology and Biological Engineering
Major Option:Biology
Thesis Availability:Public (worldwide access)
Research Advisor(s):
  • Pachter, Lior S.
Thesis Committee:
  • Van Valen, David A. (chair)
  • Thomson, Matthew
  • Wold, Barbara J.
  • Pimentel, Harold
  • Pachter, Lior S.
Defense Date:24 May 2023
Funders:
Funding AgencyGrant Number
National Institute of Mental HealthU19MH114830
Record Number:CaltechTHESIS:05312023-213322223
Persistent URL:https://resolver.caltech.edu/CaltechTHESIS:05312023-213322223
DOI:10.7907/rt24-pq60
Related URLs:
URLURL TypeDescription
https://doi.org/10.1038/s41592-022-01409-2DOIArticle adapted for Chapter 2
https://doi.org/10.1038/s41587-021-00870-2DOIFigure 2 in the paper used as Figure 11.4 in the thesis
ORCID:
AuthorORCID
Moses, Lambda0000-0002-7092-9427
Default Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:15247
Collection:CaltechTHESIS
Deposited By: Dongyi Lu
Deposited On:07 Jun 2023 15:16
Last Modified:14 Jun 2023 16:13

Thesis Files

[img] PDF - Final Version
See Usage Policy.

48MB

Repository Staff Only: item control page