Online Learning for the Control of Human Standing via Spinal Cord Stimulation

Citation

Sui, Yanan (2017) Online Learning for the Control of Human Standing via Spinal Cord Stimulation. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/Z9BK19DN. https://resolver.caltech.edu/CaltechTHESIS:04172017-163725367

Abstract

Many applications in recommender systems or experimental design need to make decisions online. Each decision leads to a stochastic reward with initially unknown distribution, while new decisions are made based on the observations of previous rewards. To maximize the total reward, one needs to balance between exploring different strategies and exploiting currently optimal strategies within a given set of strategies. This is the underlying trade-off of a number of clinical neural engineering problems, including brain-computer interface, deep brain stimulation, and spinal cord injury therapy. In these systems, complex electronic and computational systems interact with the human central nervous system. A critical issue is how to control the agents to produce results which are optimal under some measure, for example, efficiently decoding the user's intention in a brain-computer interface or performs temporal and spatial specific stimulation in deep brain stimulation. This dissertation is motivated by electrical sipnal cord stimulation with high dimensional inputs(multi-electrode arrays). The stimulation is applied to promote the function and rehabilitation of the remaining neural circuitry below the spinal cord injury, and enable complex motor behaviors such as stepping and standing. To enable the careful tuning of these stimuli for each patient, the electrode arrays which deliver these stimuli have become increasingly more sophisticated, with a corresponding increase in the number of free parameters over which the stimuli need to be optimized. Since the number of stimuli is growing exponentially with the number of electrodes, algorithmic methods of selecting stimuli is necessary, particularly when the feedback is expensive to get.

In many online learning settings, particularly those that involve human feedback, reliable feedback is often limited to pairwise preferences instead of real valued feedback. Examples include implicit or subjective feedback for information retrieval and recommender systems, such as clicks on search results, and subjective feedback on the quality of recommended care. Sometimes with real valued feedback, we require that the sampled function values exceed some prespecified ``safety'' threshold, a requirement that existing algorithms fail to meet. Examples include medical applications where the patients' comfort must be guaranteed; recommender systems aiming to avoid user dissatisfaction; and robotic control, where one seeks to avoid controls that cause physical harm to the platform.

This dissertation provides online learning algorithms for several specific online decision-making problems. \selfsparring optimizes the cumulative reward with relative feedback. RankComparison deals with ranking feedback. \safeopt considers the optimization with real valued feedback and safety constraints. \cduel is designed for specific spinal cord injury therapy.

A variant of \cduel was implemented in closed-loop human experiments, controlling which epidural stimulating electrodes are used in the spinal cord of SCI patients. The results obtained are compared with concurrent stimulus tuning carried out by human experimenter. These experiments show that this algorithm is at least as effective as the human experimenter, suggesting that this algorithm can be applied to the more challenging problems of enabling and optimizing complex, sensory-dependent behaviors, such as stepping and standing in SCI patients.

In order to get reliable quantitative measurements besides comparisons, the standing behaviors of paralyzed patients under spinal cord stimulation are evaluated. The potential of quantifying the quality of bipedal standing in an automatic approach is also shown in this work.

Item Type:

Thesis (Dissertation (Ph.D.))

Subject Keywords:

Online Learning, Spinal Cord Injury

Degree Grantor:

California Institute of Technology

Division:

Engineering and Applied Science

Major Option:

Computation and Neural Systems

Minor Option:

Applied And Computational Mathematics

Thesis Availability:

Public (worldwide access)

Research Advisor(s):

Burdick, Joel Wakeman

Thesis Committee:

Burdick, Joel Wakeman (chair)
Murray, Richard M.
Perona, Pietro
Yue, Yisong

Defense Date:

15 December 2016

Record Number:

CaltechTHESIS:04172017-163725367

Persistent URL:

https://resolver.caltech.edu/CaltechTHESIS:04172017-163725367

DOI:

10.7907/Z9BK19DN

ORCID:

Author	ORCID
Sui, Yanan	0000-0002-9480-627X

Default Usage Policy:

No commercial reproduction, distribution, display or performance rights in this work are provided.

ID Code:

10138

Collection:

CaltechTHESIS

Deposited By:

Yanan Sui

Deposited On:

04 May 2017 23:46

Last Modified:

04 Oct 2019 00:15

Thesis Files

Preview

PDF - Final Version
See Usage Policy.
12MB

Repository Staff Only: item control page