CaltechTHESIS
  A Caltech Library Service

Autonomous Temporal Understanding and State Estimation during Robot-Assisted Surgery

Citation

Qin, Yidan (2022) Autonomous Temporal Understanding and State Estimation during Robot-Assisted Surgery. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/n58k-tr61. https://resolver.caltech.edu/CaltechTHESIS:05272022-171138586

Abstract

Robot-Assisted Surgery (RAS) has become increasingly important in modern surgical practice for its many benefits and advantages for both the patient and the healthcare professionals, as compared to traditional open surgeries and minimally invasive surgeries such as laparoscopy. Artificial intelligence applications during RAS and post-operative analysis can provide various surgeon-assisting functionalities and could potentially achieve a better surgery outcome. These applications, ranging from providing surgeons with advisory information during RAS and post-operative analysis to virtual fixture and supervised autonomous surgical tasks, share a necessary prerequisite of a comprehensive understanding of the current surgical scene. This understanding should include the knowledge of the current surgical task being performed, the surgeon's actions and gestures, the state of the patient, etc. Currently, there is yet to be a unified effort to achieve the autonomous temporal understanding and perception of an RAS at the high accuracy and efficiency required in the highly safety-critical field of medicine.

This thesis develops novel modeling methodologies and deep learning-based models for the autonomous perception and temporal segmentation of the current surgical scene during an RAS. An RAS procedure is modeled as a hierarchical system consisting of discrete surgical states at multiple levels of temporal granularity. These surgical states take the form of surgical tasks, operational steps, fine-grained surgical actions, etc. A broad range of computational experiments were performed to develop methods that achieve an accurate, robust, and efficient estimation of these surgical states. Multiple novel deep learning-based models for feature extraction, noise elimination, and efficient training were proposed and tested. This thesis also shows the significant benefits of incorporating multiple types of data streams recorded by the surgical robotic system to a more accurate surgical state estimation effort.

Two new RAS datasets that contains real-world RAS procedures and diverse experimental settings were collected and annotated--filling a gap in the data sets available for the development and testing of of robust surgical state estimation models. The performance and robustness of models in this thesis work were showcased with these highly complex and dynamic real-world RAS datasets and compared against state-of-the-art methods. A significant model performance improvement was observed in both surgical state estimation accuracy and efficiency. The modeling methodologies and deep learning-based models developed in this work have diverse potential applications to the development of a next-generation surgical robotic systems.

Item Type:Thesis (Dissertation (Ph.D.))
Subject Keywords:Robot-assisted Surgery, Machine learning
Degree Grantor:California Institute of Technology
Division:Engineering and Applied Science
Major Option:Mechanical Engineering
Awards:Demetriades - Tsafka - Kokkalis Prize in Biotechnology or Related Fields
Thesis Availability:Public (worldwide access)
Research Advisor(s):
  • Burdick, Joel W.
Thesis Committee:
  • Murray, Richard M. (chair)
  • Yue, Yisong
  • Tai, Yu-Chong
  • Burdick, Joel Wakeman
Defense Date:26 May 2022
Non-Caltech Author Email:asimoqin (AT) gmail.com
Record Number:CaltechTHESIS:05272022-171138586
Persistent URL:https://resolver.caltech.edu/CaltechTHESIS:05272022-171138586
DOI:10.7907/n58k-tr61
Related URLs:
URLURL TypeDescription
https://doi.org/10.1109/ICRA40945.2020.9196560DOIArticle adapted for Chapter 4.
https://doi.org/10.1109/IROS45743.2020.9340723DOIArticle adapted for Chapter 5.
https://doi.org/10.1109/LRA.2021.3063014DOIArticle adapted for Chapter 6.
https://doi.org/10.1109/LRA.2021.3091728DOIArticle adapted for Chapter 7.
ORCID:
AuthorORCID
Qin, Yidan0000-0002-7766-1021
Default Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:14637
Collection:CaltechTHESIS
Deposited By: Yidan Qin
Deposited On:31 May 2022 23:29
Last Modified:07 Jun 2022 21:55

Thesis Files

[img] PDF (Final version of the thesis) - Final Version
See Usage Policy.

26MB

Repository Staff Only: item control page