Citation
Qin, Yidan (2022) Autonomous Temporal Understanding and State Estimation during Robot-Assisted Surgery. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/n58k-tr61. https://resolver.caltech.edu/CaltechTHESIS:05272022-171138586
Abstract
Robot-Assisted Surgery (RAS) has become increasingly important in modern surgical practice for its many benefits and advantages for both the patient and the healthcare professionals, as compared to traditional open surgeries and minimally invasive surgeries such as laparoscopy. Artificial intelligence applications during RAS and post-operative analysis can provide various surgeon-assisting functionalities and could potentially achieve a better surgery outcome. These applications, ranging from providing surgeons with advisory information during RAS and post-operative analysis to virtual fixture and supervised autonomous surgical tasks, share a necessary prerequisite of a comprehensive understanding of the current surgical scene. This understanding should include the knowledge of the current surgical task being performed, the surgeon's actions and gestures, the state of the patient, etc. Currently, there is yet to be a unified effort to achieve the autonomous temporal understanding and perception of an RAS at the high accuracy and efficiency required in the highly safety-critical field of medicine.
This thesis develops novel modeling methodologies and deep learning-based models for the autonomous perception and temporal segmentation of the current surgical scene during an RAS. An RAS procedure is modeled as a hierarchical system consisting of discrete surgical states at multiple levels of temporal granularity. These surgical states take the form of surgical tasks, operational steps, fine-grained surgical actions, etc. A broad range of computational experiments were performed to develop methods that achieve an accurate, robust, and efficient estimation of these surgical states. Multiple novel deep learning-based models for feature extraction, noise elimination, and efficient training were proposed and tested. This thesis also shows the significant benefits of incorporating multiple types of data streams recorded by the surgical robotic system to a more accurate surgical state estimation effort.
Two new RAS datasets that contains real-world RAS procedures and diverse experimental settings were collected and annotated--filling a gap in the data sets available for the development and testing of of robust surgical state estimation models. The performance and robustness of models in this thesis work were showcased with these highly complex and dynamic real-world RAS datasets and compared against state-of-the-art methods. A significant model performance improvement was observed in both surgical state estimation accuracy and efficiency. The modeling methodologies and deep learning-based models developed in this work have diverse potential applications to the development of a next-generation surgical robotic systems.
Item Type: | Thesis (Dissertation (Ph.D.)) | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Subject Keywords: | Robot-assisted Surgery, Machine learning | |||||||||||||||
Degree Grantor: | California Institute of Technology | |||||||||||||||
Division: | Engineering and Applied Science | |||||||||||||||
Major Option: | Mechanical Engineering | |||||||||||||||
Awards: | Demetriades - Tsafka - Kokkalis Prize in Biotechnology or Related Fields | |||||||||||||||
Thesis Availability: | Public (worldwide access) | |||||||||||||||
Research Advisor(s): |
| |||||||||||||||
Thesis Committee: |
| |||||||||||||||
Defense Date: | 26 May 2022 | |||||||||||||||
Non-Caltech Author Email: | asimoqin (AT) gmail.com | |||||||||||||||
Record Number: | CaltechTHESIS:05272022-171138586 | |||||||||||||||
Persistent URL: | https://resolver.caltech.edu/CaltechTHESIS:05272022-171138586 | |||||||||||||||
DOI: | 10.7907/n58k-tr61 | |||||||||||||||
Related URLs: |
| |||||||||||||||
ORCID: |
| |||||||||||||||
Default Usage Policy: | No commercial reproduction, distribution, display or performance rights in this work are provided. | |||||||||||||||
ID Code: | 14637 | |||||||||||||||
Collection: | CaltechTHESIS | |||||||||||||||
Deposited By: | Yidan Qin | |||||||||||||||
Deposited On: | 31 May 2022 23:29 | |||||||||||||||
Last Modified: | 07 Jun 2022 21:55 |
Thesis Files
PDF (Final version of the thesis)
- Final Version
See Usage Policy. 26MB |
Repository Staff Only: item control page