research into coordinating sensing, control and

34
AFRL-RV-PS- TR-2019-0102 AFRL-RV-PS- TR-2019-0102 RESEARCH INTO COORDINATING SENSING, CONTROL AND COMPUTATIONS FOR AUTONOMOUS VEHICLES Ilya Kolmanovsky and Anouck Girard Regents of the University of Michigan Office of Research and Sponsored Projects 503 Thompson Street Ann Arbor, MI 48109-1340 30 August 2019 Final Report APPROVED FOR PUBLIC RELEASE; DISTRIBUTION IS UNLIMITED. AIR FORCE RESEARCH LABORATORY Space Vehicles Directorate 3550 Aberdeen Ave SE AIR FORCE MATERIEL COMMAND KIRTLAND AIR FORCE BASE, NM 87117-5776

Upload: others

Post on 06-Dec-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

AFRL-RV-PS- TR-2019-0102

AFRL-RV-PS- TR-2019-0102

RESEARCH INTO COORDINATING SENSING, CONTROL AND COMPUTATIONS FOR AUTONOMOUS VEHICLES

Ilya Kolmanovsky and Anouck Girard

Regents of the University of Michigan Office of Research and Sponsored Projects 503 Thompson Street Ann Arbor, MI 48109-1340

30 August 2019

Final Report

APPROVED FOR PUBLIC RELEASE; DISTRIBUTION IS UNLIMITED.

AIR FORCE RESEARCH LABORATORY Space Vehicles Directorate 3550 Aberdeen Ave SE AIR FORCE MATERIEL COMMAND KIRTLAND AIR FORCE BASE, NM 87117-5776

NOTICE AND SIGNATURE PAGE

Using Government drawings, specifications, or other data included in this document for any purpose other than Government procurement does not in any way obligate the U.S. Government. The fact that the Government formulated or supplied the drawings, specifications, or other data does not license the holder or any other person or corporation; or convey any rights or permission to manufacture, use, or sell any patented invention that may relate to them.

Qualified requestors may obtain copies of this report from the Defense Technical Information Center (DTIC) (http://www.dtic.mil).

AFRL-RV-PS-TR-2019-0102 HAS BEEN REVIEWED AND IS APPROVED FOR PUBLICATION IN ACCORDANCE WITH ASSIGNED DISTRIBUTION STATEMENT.

//SIGNED// DAVID WILT, Ph.D

//SIGNED// CHRISTOPHER PETERSEN Program Manager Technical Advisor, Space Based Advanced Sensing

And Protection

//SIGNED//JOHN P. BEAUCHEMIN Chief Engineer, Spacecraft Technology Division Space Vehicles Directorate

Approved for public release; distribution is unlimited

REPORT DOCUMENTATION PAGE Form Approved OMB No. 0704-0188

Public reporting burden for this collection of information is estimated to average 1 hour per response, including the time for reviewing instructions, searching existing data sources, gathering and maintaining the data needed, and completing and reviewing this collection of information. Send comments regarding this burden estimate or any other aspect of this collection of information, including suggestions for reducing this burden to Department of Defense, Washington Headquarters Services, Directorate for Information Operations and Reports (0704-0188), 1215 Jefferson Davis Highway, Suite 1204, Arlington, VA 22202-4302. Respondents should be aware that notwithstanding any other provision of law, no person shall be subject to any penalty for failing to comply with a collection of information if it does not display a currently valid OMB control number. PLEASE DO NOT RETURN YOUR FORM TO THE ABOVE ADDRESS. 1. REPORT DATE (DD-MM-YYYY)30-08-2019

2. REPORT TYPEFinal Report

3. DATES COVERED (From - To)14-Feb-2018 – 30-Aug-2019

4. TITLE AND SUBTITLEResearch Into Coordinating Sensing, Control and Computations for AutonomousVehicles

5a. CONTRACT NUMBER

5b. GRANT NUMBER FA9453-18-1-0010 5c. PROGRAM ELEMENT NUMBER 62601F

6. AUTHOR(S)Ilya Kolmanovsky and Anouck Girard

5d. PROJECT NUMBER 8809 5e. TASK NUMBER PPM00036957

5f. WORK UNIT NUMBER EF130350 7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES)

AND ADDRESS(ES)8. PERFORMING ORGANIZATION REPORT

NUMBERRegents of the University of Michigan Office of Research and Sponsored Projects 503 Thompson Street Ann Arbor, MI 48109-1340

9. SPONSORING / MONITORING AGENCY NAME(S) AND ADDRESS(ES) 10. SPONSOR/MONITOR’S ACRONYM(S)Air Force Research LaboratorySpace Vehicles Directorate AFRL/RVSV 3550 Aberdeen Ave, SE 11. SPONSOR/MONITOR’S REPORTKirtland AFB, NM 87117-5776 NUMBER(S)

AFRL-RV-PS-TR-2019-0102 12. DISTRIBUTION / AVAILABILITY STATEMENTApproved for public release; distribution is unlimited.13. SUPPLEMENTARY NOTES 14. ABSTRACTThe report addresses several strategies developed for coordinating sensing, estimation, control and computations in systemsthat must operate autonomously and satisfy constraints. The potential of the proposed algorithms is demonstrated inspacecraft simulation case studies. The developed algorithms and strategies contribute to advancing capabilities ofautonomous vehicles, in particular, in the areas of handling constraints and non-standard sensing-estimation-actuationrequirements. The proposed strategies also facilitate incorporating optimization-based control technology for onboardoperation in autonomous vehicles.15. SUBJECT TERMSAutonomous Vehicles, Constrained Control Algorithms, Model Predictive (MPC), Spacecraft Control

16. SECURITY CLASSIFICATION OF: 17. LIMITATIONOF ABSTRACT

18. NUMBEROF PAGES

19a. NAME OF RESPONSIBLE PERSON

Christopher Petersen a. REPORTUnclassified

b. ABSTRACTUnclassified

c. THIS PAGEUnclassified Unlimited 25

19b. TELEPHONE NUMBER (include areacode)

Standard Form 298 (Rev. 8-98) Prescribed by ANSI Std. 239.18

(This page intentionally left blank)

Approved for public release; distribution is unlimited

Approved for public release; distribution is unlimited i

TABLE OF CONTENTS Section Page

List of Figures .................................................................................................................................ii ACKNOWLEDGEMENTS AND DISCLAIMER ........................................................................ iii 1. SUMMARY ................................................................................................................................1

2. INTRODUCTION.......................................................................................................................2

3. METHODS, ASSUMPTIONS, AND PROCEDURES ..............................................................2

3.1 Optimal Strategies for Disjunctive Sensing and Control ...................................................... 2 3.1.1 Disjunctive Sensing and Actuation for Linear Systems. ............................................................... 2 3.1.2 Admissible Periodic Switching Sequences. .................................................................................. 3 3.1.3 Chance Constraints. ..................................................................................................................... 6 3.1.4 Selecting an Optimal Admissible Sequence. ................................................................................ 7

3.2 Computational Reference Governors for Safe Online Optimization .................................... 7 3.2.1 Assumptions.................................................................................................................................. 8 3.2.2 Methods........................................................................................................................................ 9 3.2.2.1 Nonlinear Model Predictive Control........................................................................................ 9

3.2.2.2 A Computational Reference Governor for Suboptimal MPC. .................................................. 11 3.3 Constrained control framework using a virtual net of chance constraint admissible sets and output measurements only ...........................................................................................................13

4. RESULTS AND DISCUSSION ................................................................................................14 4.1 Disjunctive-Based Constrained Spacecraft Relative Motion Planning ............................... 14

4.1.2 Theoretical Results. .................................................................................................................... 14 4.2 A Computational Reference Governor for Safe Online Optimization ................................ 18 4.3 Constrained Control Framework Using a Virtual Net of Chance Constraint Admissible Sets and Output Measurements Only.................................................................................................20

5. CONCLUSION ...........................................................................................................................21

6. REFERENCES............................................................................................................................22

LIST OF SYMBOLS, ABBREVIATIONS, AND ACRONYMS..................................................24

Approved for public release; distribution is unlimited ii

LIST OF FIGURES

Figure Page

Figure 1 A Comparison of Suboptimal MPC and Ideal MPC ............................................................... 11 Figure 2 Architecture of the Computational Reference Governor ......................................................... 12 Figure 3 Disjunctive Sensing and Control Trajectory Under a Length 8 Periodic Sequence ................ 16 Figure 4 Comparison of Length 4 and Length 7 Optimal Switching Sequences................................... 17 Figure 5 Demonstration of Satisfaction of Both Box and Chance Constraints ..................................... 18 Figure 6 The Time Histories of Control Torques, Angular Velocities and Euler Angles in Closed-loop

Simulation ....................................................................................................................................... 19 Figure 7 Closed-loop Traces of the Residuals and References with and without the CRG. ................. 20 Figure 8 Closed-loop Path of Spacecraft in CW Frame. Red Cone is the Obstacle and Black Dots are

the Reference Point Nodes in the Graph. ....................................................................................... 21

Approved for public release; distribution is unlimited iii

ACKNOWLEDGEMENTS AND DISCLAIMER

Acknowledgement This material is based on research sponsored by Air Force Research Laboratory under agreement number FA9453-18-1-0010. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright notation thereon.

Disclaimer The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of Air Force Research Laboratory or the U.S. Government.

Approved for public release; distribution is unlimited iv

(This page intentionally left blank)

Approved for public release; distribution is unlimited 1

1. SUMMARY

The objective of the research was to advance algorithms for coordinating sensing, control and computations in systems that must operate autonomously and satisfy constraints. The focus was on situations where sensing, estimation, control and computations are inherently coupled and on developing strategies that are effective and could be straightforward to implement in real-world applications.

Specific progress has been made in the following areas:

1. Developing optimal periodic sequences of sensing and actuation actions in situations wheresensing and actuation cannot be performed simultaneously,

2. Establishing a controller that guarantees the boundedness of state and state estimationerrors and the eventual satisfaction of the imposed probabilistic chance constraints,

3. Demonstrating the effectiveness of this controller in simulations of an autonomousspacecraft application,

4. Developing a computational reference governor for safe online optimization.Demonstrating the potential of the computational reference governor in simulations whenpaired with a nonlinear model predictive controller,

5. Extending the virtual net constrained motion planning framework to the case when onlyoutput measurements are available, there is random measurement and process noise, andthe states of the system are estimated. Demonstrating the potential of the proposedapproach for spacecraft maneuvering while avoiding obstacle collision.

The following are publications which have appeared, been accepted or are in preparation related to the subject matter of this report:

• Sutherland, R., Kolmanovsky, I., Girard, A., Leve, F., Petersen, C., “Optimal strategies fordisjunctive sensing and control” [1],

• Sutherland, R., “Advances in disjunctive and time-optimal predictive control methods,”[15],

• Berning, A., Kolmanovsky, I., Girard, A., Leve, F., and Petersen, C., “Maneuverplanning using chained chance constrained admissible sets,” [14],

• Sutherland, R., Kolmanovsky, I., Girard, A., Leve, F., Petersen, C., “Minimum-time modelpredictive spacecraft attitude control for waypoint following and exclusion zoneavoidance” [2],

• Sutherland, R., Kolmanovsky, I., Girard, A., Leve, F., and Petersen, C., “On closed-loopLyapunov stability with minimum-time MPC feedback laws for discrete-time systems,”[13]

2. INTRODUCTION

A key consideration in the design of control schemes for autonomous vehicles in general, and for spacecraft in particular, is the enforcement of constraints. Control actuation limits and pointwise in time and terminal state constraints must be satisfied. In this report, we summarize several research developments and outcomes in developing strategies for control of systems with constraints in situations when there is an interplay between sensing, actuation, estimation, control and computations. In particular, a non-standard setting of disjunctive sensing and control, when simultaneous actuation and sensing is not possible will be addressed. Furthermore, the notion of computational reference governor (CRG) to monitor and supervise the computational optimization within the nonlinear model predictive controller (NMPC) will be introduced and illustrated with simulation results. Finally, the previously proposed framework for motion planning/maneuvering based on chained invariant sets will be extended to the setting of output measurements and state estimation.

3. METHODS, ASSUMPTIONS, AND PROCEDURES

3.1 Optimal Strategies for Disjunctive Sensing and Control

3.1.1 Disjunctive Sensing and Actuation for Linear Systems.

Consider a system represented by a discrete-time linear model with stochastic state disturbance and measurement noise inputs, given by

𝑥𝑥𝑘𝑘+1 = 𝐴𝐴𝑥𝑥𝑘𝑘 + 𝐵𝐵𝑢𝑢𝑘𝑘 + 𝑤𝑤𝑘𝑘, 𝑦𝑦𝑘𝑘 = 𝐶𝐶𝑥𝑥𝑘𝑘 + 𝑣𝑣𝑘𝑘,

(1)

where 𝑥𝑥𝑘𝑘 is a vector state and 𝑢𝑢𝑘𝑘 is a vector control. The variables 𝑤𝑤𝑘𝑘 and 𝑣𝑣𝑘𝑘 are, respectively, the state disturbance and the measurement noise inputs, each assumed to be a sequence of zero-mean, independent (and jointly independent) identically distributed (i.i.d.) random variables with 𝔼𝔼[𝑤𝑤𝑘𝑘𝑤𝑤𝑘𝑘

𝑇𝑇] = Σ𝑤𝑤 = Σ𝑤𝑤𝑇𝑇 , 𝔼𝔼[𝑣𝑣𝑘𝑘𝑣𝑣𝑘𝑘𝑇𝑇] = Σ𝑣𝑣 = Σ𝑣𝑣𝑇𝑇.

We consider a fixed-gain state feedback,

𝑢𝑢𝑘𝑘 = 𝑢𝑢𝑇𝑇 + 𝐾𝐾(𝑥𝑥�𝑘𝑘 − 𝑥𝑥𝑇𝑇), (2)

where 𝑥𝑥�𝑘𝑘 is the state estimate generated by a fixed-gain observer of the form

𝑥𝑥�𝑘𝑘+1 = 𝐴𝐴𝑥𝑥�𝑘𝑘 + 𝜂𝜂𝑘𝑘𝐵𝐵𝑢𝑢𝑘𝑘 + (1 − 𝜂𝜂𝑘𝑘)𝐿𝐿(𝑦𝑦𝑘𝑘 − 𝐶𝐶𝑥𝑥�𝑘𝑘). (3)

The binary variable 𝜂𝜂𝑘𝑘 ∈ {0, 1} represents the system operating mode. When 𝜂𝜂𝑘𝑘 = 1, the control is applied to the system. When 𝜂𝜂𝑘𝑘 = 0, the control is deactivated (𝑢𝑢 𝑘𝑘 = 0), and the sensed output 𝑦𝑦𝑘𝑘 is obtained. The target equilibrium is denoted by 𝑥𝑥𝑇𝑇, and it is assumed that the feedforward control input, 𝑢𝑢 , supports it in steady-s tate in the absence of 𝑤𝑤 𝑘𝑘 and with 𝜂𝜂 𝑘𝑘 = 1, i.e., 𝑥𝑥 𝑇𝑇 = 𝑇𝑇 + .

Approved for public release; distribution is unlimited 2

In a typical control design process, due to the Separation Principle, the gains 𝐾𝐾 and 𝐿𝐿 would be determined without consideration of the interference between actuation and sensing (possibly by different engineers) and then a coordination mechanism introduced by specifying 𝜂𝜂𝑘𝑘, 𝑘𝑘 ≥ 0. In this setting, offline-generated 𝑁𝑁-periodic switching sequences, {𝜂𝜂𝑘𝑘}, with 𝜂𝜂𝑘𝑘 = 𝜂𝜂𝑘𝑘+𝑁𝑁 for all 𝑘𝑘 ∈ℤ≥0, 𝑁𝑁 ∈ ℤ≥0, are of particular interest, so that their repeated application leads to the attainment of the control objectives, including eventual satisfaction of state constraints. This approach, based on the application of the offline generated periodic sequence, has low computational footprint and is appealing in view of limited computing power and restrictive electrical power consumption budget onboard of small spacecraft.

3.1.2 Admissible Periodic Switching Sequences.

Suppose that {𝜂𝜂𝑘𝑘} is fixed and define �̅�𝐴𝑘𝑘 = 𝐴𝐴 + 𝜂𝜂𝑘𝑘𝐵𝐵𝐾𝐾, �̃�𝐴𝑘𝑘 = 𝐴𝐴 + (1 − 𝜂𝜂𝑘𝑘)𝐿𝐿𝐶𝐶. The evolution of the state, 𝑥𝑥𝑘𝑘, and of the state estimation error, 𝑒𝑒𝑘𝑘 = 𝑥𝑥𝑘𝑘 − 𝑥𝑥�𝑘𝑘, are driven by

𝑥𝑥𝑘𝑘+1 = 𝐴𝐴𝑥𝑥𝑘𝑘 + 𝜂𝜂𝑘𝑘𝐵𝐵𝐾𝐾𝑥𝑥�𝑘𝑘 + 𝑤𝑤𝑘𝑘 = �̅�𝐴𝑘𝑘𝑥𝑥𝑘𝑘 − 𝜂𝜂𝑘𝑘𝐵𝐵𝑒𝑒𝑘𝑘 + 𝜂𝜂𝑘𝑘𝐵𝐵𝑢𝑢𝑇𝑇 + 𝑤𝑤𝑘𝑘,

(4)

and

𝑒𝑒𝑘𝑘+1 = [𝐴𝐴 + (1 − 𝜂𝜂𝑘𝑘)𝐿𝐿𝐶𝐶]𝑒𝑒𝑘𝑘 + 𝑤𝑤𝑘𝑘 + (1 − 𝜂𝜂𝑘𝑘)𝐿𝐿𝑣𝑣𝑘𝑘 = �̃�𝐴𝑘𝑘𝑒𝑒𝑘𝑘 + 𝑤𝑤𝑘𝑘 + (1 − 𝜂𝜂𝑘𝑘)𝐿𝐿𝑣𝑣𝑘𝑘.

(5)

In simulations, we will assume that 𝑥𝑥𝑇𝑇 = 0 (and so 𝑢𝑢𝑇𝑇 = 0). Based on the error dynamics and assumed independence and zero mean properties of the stochastic disturbance, 𝑤𝑤𝑘𝑘, and measurement noise, 𝑣𝑣𝑘𝑘, the error covariance matrix 𝑃𝑃𝑘𝑘 = 𝔼𝔼[𝑒𝑒𝑘𝑘𝑒𝑒𝑘𝑘𝑇𝑇] satisfies

(6)

Definition: Let 𝐴𝐴, 𝐵𝐵, 𝐾𝐾, 𝐿𝐿, �̅�𝐴𝑘𝑘, and �̃�𝐴𝑘𝑘 be defined as above, with none of �̅�𝐴𝑘𝑘, �̃�𝐴𝑘𝑘 nilpotent. An 𝑁𝑁-periodic sequence of binary integers {𝜂𝜂0, 𝜂𝜂1, … , 𝜂𝜂𝑁𝑁−1}, where, for all 𝑘𝑘 ∈ ℤ≥0, 𝜂𝜂𝑘𝑘 ∈ {0, 1} and 𝜂𝜂𝑘𝑘+𝑁𝑁 = 𝜂𝜂𝑘𝑘, is called admissible if the following contractivity conditions hold:

(7)

(8)

where 𝜌𝜌(⋅) denotes the spectral radius operator. A non-periodic sequence of binary integers {𝜂𝜂 0, 𝜂𝜂 1, … } is admissible if

Approved for public release; distribution is unlimited 3

Approved for public release; distribution is unlimited 4

(9)

(10)

The constraint against nilpotent matrices ensures that the above limits approach zero rather than “jump” to zero. These definitions are consistent with the properties of discrete-time state transition matrices found in, for example, Chen [4]. Note that the existence of a non-periodic admissible sequence implies the existence of periodic admissible sequences.

In the existence of a periodic admissible sequence, the state and error dynamics coupled with 𝔼𝔼[𝑣𝑣𝑘𝑘] = 0, 𝔼𝔼[𝑤𝑤𝑘𝑘] = 0 imply that the state and estimation error mean, 𝜇𝜇𝑥𝑥,𝑘𝑘 = 𝔼𝔼[𝑥𝑥𝑘𝑘] and 𝜇𝜇𝑒𝑒,𝑘𝑘 =𝔼𝔼[𝑒𝑒𝑘𝑘], respectively, satisfy,

(11)

where �̅�𝐴𝑘𝑘 and �̃�𝐴𝑘𝑘 are periodic with the same period, 𝑁𝑁, as 𝜂𝜂𝑘𝑘. The following results can be proven (see [1] for details):

Proposition 1: Suppose that the contractivity conditions hold. Then, as 𝑘𝑘 → ∞, 𝜇𝜇𝑒𝑒,𝑘𝑘 → 0 and 𝜇𝜇𝑥𝑥,𝑘𝑘 → 𝜇𝜇𝑥𝑥,𝑘𝑘

𝑠𝑠 exponentially, where {𝜇𝜇𝑥𝑥,𝑘𝑘𝑠𝑠 } is the unique 𝑁𝑁-periodic solution of 𝜇𝜇𝑥𝑥,𝑘𝑘 with 𝜇𝜇𝑒𝑒,𝑘𝑘 ≡ 0.

Furthermore, if 𝑢𝑢𝑇𝑇 = 0, then 𝜇𝜇𝑥𝑥,𝑘𝑘𝑠𝑠 = 0.

Proposition 2: Suppose that the second contractivity condition holds. Then, the error covariance matrix, 𝑃𝑃𝑘𝑘, is bounded and, as 𝑘𝑘 → ∞, converges to the unique 𝑁𝑁-periodic solution, {𝑃𝑃𝑘𝑘𝑠𝑠}, with 𝑃𝑃𝑘𝑘+𝑁𝑁𝑠𝑠 = 𝑃𝑃𝑘𝑘𝑠𝑠. In addition, for any 𝑛𝑛 ∈ ℤ≥0,

(12)

where

(13)

Remarks: 1) The steady-state periodic solution, {𝑃𝑃𝑘𝑘𝑠𝑠}, can be computed by solving the conventional

discrete-time Lyapunov equation for the evolution of the lifted system error covariancematrix, i.e., of 𝑃𝑃𝑙𝑙 = diag{𝑃𝑃0𝑠𝑠 ,⋯ ,𝑃𝑃𝑁𝑁−1𝑠𝑠 }.

2) limsupk→∞ ‖𝑃𝑃𝑁𝑁𝑘𝑘‖ ≤ 𝛾𝛾 / (1 − 𝑞𝑞�𝐴𝐴2).

3) The results in Propositions 1 and 2 generalize to non-constant 𝑁𝑁-periodic feedback andobserver gains. However, the analysis results benefit from both �̅�𝐴𝑘𝑘 and �̃�𝐴𝑘𝑘 having onlytwo possible values each, which is the case when the gains are constant.

We can take advantage of the dwell time conditions for stability analysis of hybrid systems to develop simpler sufficient conditions that can inform procedures for faster determination of admissible switching sequences. The discussion of the dwell time conditions follows Theorem 4.1 in Yu [5] and its proof.

Let Ω�0 = 𝐴𝐴 and Ω�1 = 𝐴𝐴 + 𝐵𝐵𝐾𝐾, and consider the first contractivity condition. By Gelfand’s theorem [6], lim𝑘𝑘→∞ ‖Ω𝑘𝑘‖1\𝑘𝑘 = 𝜌𝜌(Ω), for any matrix Ω and norm ||⋅||. This implies that there exist constants 𝑐𝑐0, 𝑐𝑐1 that do not depend on 𝑘𝑘 such that for any 𝑘𝑘 ≥ 1,

(14)

For any consistent matrix norm, there exists a finite 𝑘𝑘∗ such that �𝐴𝐴𝑘𝑘∗�1/𝑘𝑘∗

≥ ‖𝐴𝐴𝑘𝑘‖1/𝑘𝑘 for all 𝑘𝑘 ≥1. Then,

(15)

Let 𝑐𝑐 = max {𝑐𝑐0, 𝑐𝑐1}, 𝑛𝑛0 < 𝑁𝑁 be the total time spent in the mode 𝜂𝜂 = 0, 𝑛𝑛1 < 𝑁𝑁 be the total time spent in the mode 𝜂𝜂 = 1, and 𝑛𝑛𝑠𝑠 be the number of switches plus one. Then, the first contractivity condition holds if

(16)

Taking the logarithm of each side, it follows that this contractivity condition holds if

(17)

where �̅�𝜌0 = 𝜌𝜌(Ω�0) and �̅�𝜌1 = 𝜌𝜌(Ω�1). Similar analysis can be done in the case of the second contractivity condition to yield the complementary inequality

(18)

When seeking admissible sequences, it is possible to restrict the search to sequences for which 𝑛𝑛0, 𝑛𝑛1, and 𝑛𝑛𝑠𝑠 satisfy both logarithmic conditions, which together are sufficient (but not necessary) to ensure admissibility.

Approved for public release; distribution is unlimited 5

The search for admissible sequences can be made faster still by discarding sequences that replicate a known inadmissible sequence.

Definition: A sequence {𝑠𝑠𝑁𝑁} of length 𝑁𝑁 ∈ ℤ is called reducible if there exists a subsequence {𝑠𝑠𝑘𝑘} of length 𝑘𝑘 ∈ ℤ>0 such that 𝑘𝑘 < 𝑁𝑁, 𝑁𝑁 is a multiple of 𝑘𝑘, and {𝑠𝑠𝑁𝑁} = {𝑠𝑠𝑘𝑘} ⊕ {𝑠𝑠𝑘𝑘} ⊕⋯⊕ {𝑠𝑠𝑘𝑘}, 𝑁𝑁/𝑘𝑘 times, where ⊕ is used to denote sequence concatenation. A sequence which is not reducible is called irreducible.

Proposition 3: Every (non-empty) sequence {𝑠𝑠𝑁𝑁} contains a unique irreducible subsequence {𝑠𝑠𝑛𝑛}.

Proposition 4: A binary sequence {𝑠𝑠𝑁𝑁} is admissible if and only if the associated irreducible subsequence {𝑠𝑠𝑛𝑛} is admissible.

Any reducible admissible sequence can be formed by propagating an irreducible admissible sequence forward in time. Thus, when investigating sequences of length 𝑁𝑁 for admissibility, all sequences for which we have already evaluated the associated irreducible sub-sequence can be discarded.

3.1.3 Chance Constraints.

Consider the chance constraint,

(19)

where 𝑋𝑋 is the set prescribed by the constraints of the problem, 0 ≤ 𝛿𝛿 < 1, and 𝑘𝑘∗ is sufficiently large. Define 𝑧𝑧 = [𝑥𝑥𝑇𝑇 𝑒𝑒𝑇𝑇]𝑇𝑇 and 𝜁𝜁 = [𝑣𝑣𝑇𝑇 𝑤𝑤𝑇𝑇]𝑇𝑇. Then, 𝑧𝑧𝑘𝑘+1 = �̌�𝐴𝑘𝑘𝑧𝑧𝑘𝑘 + Γ�𝑘𝑘𝜁𝜁𝑘𝑘 + 𝐺𝐺�𝑘𝑘𝑢𝑢𝑇𝑇, where

(20)

and 𝐺𝐺�𝑘𝑘 = [𝟎𝟎𝑇𝑇 𝜂𝜂𝑘𝑘𝐵𝐵𝑇𝑇]𝑇𝑇. Under the contractivity conditions, we repeat the analysis in Propositions 1 and 2 for 𝑧𝑧𝑘𝑘+1. Let 𝑃𝑃�𝑘𝑘 = 𝔼𝔼 ��𝑧𝑧𝑘𝑘 − 𝜇𝜇𝑧𝑧,𝑘𝑘��𝑧𝑧𝑘𝑘 − 𝜇𝜇𝑧𝑧,𝑘𝑘�

𝑇𝑇�, where 𝜇𝜇𝑧𝑧,𝑘𝑘 = 𝔼𝔼[𝑧𝑧𝑘𝑘]. Then,

(21)

𝑘𝑘

and 𝑃𝑃�𝑘𝑘 → 𝑃𝑃�𝑘𝑘𝑠𝑠 as 𝑘𝑘 → ∞, where 𝑃𝑃�𝑘𝑘𝑠𝑠 is the unique periodic solution to the above system. Then, the steady-state covariance matrix satisfies 𝑃𝑃�𝑥𝑥𝑠𝑠,𝑘𝑘 = [ 𝐈𝐈 𝟎𝟎 ]𝑃𝑃�𝑘𝑘𝑠𝑠 [ 𝐈𝐈 𝟎𝟎 ]T. Thus, 𝑥𝑥𝑘𝑘 converges to 𝑥𝑥𝑠𝑠, which is a cyclostationary process with the 𝑁𝑁-periodic mean 𝜇𝜇𝑥𝑥𝑠𝑠 ,𝑘𝑘 and 𝑁𝑁-periodic covariance matrix 𝑃𝑃𝑥𝑥𝑠𝑠,𝑘𝑘.

We have made no assumptions on the probability density functions of 𝑤𝑤𝑘𝑘 and 𝑣𝑣𝑘𝑘, so to treat the chance constraint we resort to the multivariate Chebyshev’s inequality [7][8], which states

Approved for public release; distribution is unlimited 6

(22)

where 𝑛𝑛𝑥𝑥 is the dimension of 𝑥𝑥 and 0 < 𝛼𝛼𝑥𝑥 ≤ 1. Given 𝛿𝛿 > 0, choose 𝛼𝛼𝑥𝑥 = �nx/𝛿𝛿. Suppose that for 𝑘𝑘 = 0, … ,𝑁𝑁 − 1, the following condition is verified:

(23)

where ℇ(𝟎𝟎, 𝑆𝑆) = {𝑥𝑥: 𝑥𝑥𝑇𝑇𝑆𝑆𝑥𝑥 ≤ 1} is an ellipsoidal set and ~ denotes the Pontryagin’s set difference. Then, in steady state, the chance constraint holds.

3.1.4 Selecting an Optimal Admissible Sequence.

Since multiple admissible sequences may exist, we can select one by minimizing a cost functional. Consider the blended cost 𝐽𝐽 that penalizes the estimation, the control objective, and the control effort:

(24)

where 𝑅𝑅𝑒𝑒 = 𝑅𝑅𝑒𝑒𝑇𝑇 and 𝑅𝑅 𝑥𝑥 = 𝑅𝑅𝑥𝑥𝑇𝑇 are positive semi-definite weighting matrices and 𝑟𝑟𝜂𝜂 ≥ 0 is a scalar weight. The 𝑁𝑁−1 factor in the cost functional normalizes for sequence length, ensuring that a given reducible/irreducible sequence pair will yield the same cost. To find the optimal sequence, fix 𝑁𝑁, check sequences for admissibility, then select the admissible sequence that yields the lowest cost. If no admissible sequence exists, then extend the sequence length and search again. The equivalence of admissibility between a sequence and its associated irreducible subsequence is invoked after incrementing the sequence length; if the subsequent search produces a reducible sequence for which the irreducible subsequence has already been evaluated, then the sequence can be skipped.

3.2 Computational Reference Governors for Safe Online Optimization

As spacecraft become more autonomous they will need increasingly complex online decision-making capabilities. One of the dominant paradigms for autonomous decision making is online optimization where actions are computed by solving a suitably formulated optimization problem. Optimization based frameworks have the major advantage that constraints can be naturally incorporated into the decision making process. In the context of control, the most common type of online optimization method is Model Predictive Control (MPC) wherein a control action is computed by solving a constrained Optimal Control Problem (OCP) at each sampling instance. If the dynamics are nonlinear, the OCP is necessarily non-convex and solving it in real-time is a significant challenge. In addition, the question of optimizer reliability is paramount.

Approved for public release; distribution is unlimited 7

Approved for public release; distribution is unlimited 8

Our research into safe online optimization has led to the development of a computational reference governor (CRG). By treating the optimizer as a dynamic system, we can impose constraints on it, e.g., that its optimality error be small, using the reference governor framework [9]. Our CRGessentially extends the continuous time computational reference governor developed in [3].

to the discrete time setting common in MPC applications. We have demonstrated the efficacy of the CRG for spacecraft orientation control benchmark problem and are in the process of developing the underlying mathematical theory.

3.2.1 Assumptions.

We consider a rigid spacecraft, actuated by three orthogonal thrusters. The orientation of the spacecraft is parametrized by 3-2-1 Euler angles. It is assumed that the attitude representation does not approach singularity; this is consistent with the limitations of the target physical testbed.

The equations of motion are given by

𝐼𝐼�̇�𝑤+ 𝑤𝑤×𝐼𝐼𝑤𝑤 = 𝑢𝑢 + 𝑑𝑑 , �̇�𝜃 = 𝑆𝑆(𝜃𝜃)𝑤𝑤

(25)

where 𝑤𝑤 is the vector of angular velocities, 𝜃𝜃 are the Euler angles, 𝑆𝑆(𝜃𝜃) is the inverse kinematic matrix, 𝐼𝐼 is the moment of inertia matrix in a body-fixed frame, 𝑢𝑢 are the control torques and 𝑑𝑑 are unknown, possibly state dependent, disturbances. Collected into a compact form, the model is written as

�̇�𝑥 = 𝑓𝑓𝑐𝑐(𝑥𝑥,𝑢𝑢,𝑑𝑑), (26)

where 𝑥𝑥 = [𝑤𝑤𝑇𝑇 𝜃𝜃𝑇𝑇]𝑇𝑇 .

The prediction model for NMPC is a discretized approximation of the continuous time model (using the Forward Euler integrator and zero order hold assumption) and is given by

𝑥𝑥𝑘𝑘+1 = 𝑥𝑥𝑘𝑘 + 𝑡𝑡𝑠𝑠𝑓𝑓𝑐𝑐(𝑥𝑥𝑘𝑘,𝑢𝑢𝑘𝑘, 0), (27)

where 𝑡𝑡𝑠𝑠 is the discretization time step. The discrete-time prediction model is then considered in the following form,

𝑥𝑥𝑘𝑘+1 = 𝑓𝑓(𝑥𝑥𝑘𝑘,𝑢𝑢𝑘𝑘). (28)

Approved for public release; distribution is unlimited 9

Note that since the disturbances are unknown, they cannot be incorporated into the prediction model and are assumed to be zero. At time 𝑡𝑡 = 𝑘𝑘𝑡𝑡𝑠𝑠, the state estimate is denoted by 𝑥𝑥0 = 𝑥𝑥�(𝑡𝑡).

3.2.2 Methods.

3.2.2.1 Nonlinear Model Predictive Control.

The NMPC optimal control problem (OCP) to be solved for the control to be applied, 𝑢𝑢(𝑡𝑡) = 𝑢𝑢0 has the following form,

min𝑧𝑧

𝑒𝑒𝑖𝑖𝑇𝑇𝑄𝑄𝑓𝑓𝑒𝑒𝑖𝑖 + ��|𝑒𝑒𝑖𝑖|�𝑄𝑄2

+ �|𝑢𝑢𝑖𝑖|�𝑅𝑅2

+ 𝑠𝑠𝑖𝑖+1

𝑁𝑁−1

𝑖𝑖=0𝑠𝑠. 𝑡𝑡. 𝑥𝑥𝑖𝑖+1 = 𝑓𝑓(𝑥𝑥𝑖𝑖 ,𝑢𝑢𝑖𝑖) + 𝑥𝑥𝑖𝑖 − 𝑓𝑓(𝑥𝑥𝑖𝑖−1,𝑢𝑢𝑖𝑖−1), 𝑖𝑖 = 0 …𝑁𝑁 − 1,

𝑐𝑐(𝑥𝑥𝑖𝑖) ≤ 𝑠𝑠𝑖𝑖, 𝑖𝑖 = 1 …𝑁𝑁, 𝑝𝑝(𝑢𝑢𝑖𝑖) ≤ 0, 𝑖𝑖 = 0 …𝑁𝑁 − 1,

𝑠𝑠𝑖𝑖 ≥ 0, 𝑖𝑖 = 1 …𝑁𝑁

(29)

where 𝑒𝑒𝑖𝑖 = 𝑥𝑥𝑖𝑖 − 𝑟𝑟 is the deviation from the target, 𝑧𝑧 = [𝑢𝑢0𝑇𝑇 𝑥𝑥1𝑇𝑇 𝑢𝑢1𝑇𝑇 … 𝑥𝑥𝑁𝑁𝑇𝑇]𝑇𝑇 is the primal optimization variable, 𝑁𝑁 is the length of the prediction horizon, 𝑃𝑃 = 𝑃𝑃𝑇𝑇 ≽ 0 is the terminal penalty, 𝑄𝑄 = 𝑄𝑄𝑇𝑇 ≽ 0 is the state penalty weight, 𝑅𝑅 = 𝑅𝑅𝑇𝑇 ≻ 0 is the control penalty weight, and 𝑐𝑐,𝑝𝑝 are the stage state and control constraints respectively.

Note that the state constraints are softened using slack variables in order to ensure the problem is always feasible. The prediction model is written in rate-based form and will automatically compensate for low frequency unmodeled disturbances.

To solve the above OCP, we first write it in compact form as

min𝑧𝑧

𝐹𝐹(𝑧𝑧, 𝜉𝜉), 𝑠𝑠. 𝑡𝑡. 𝑔𝑔(𝑧𝑧, 𝜉𝜉) = 0,

𝑐𝑐(𝑧𝑧) ≤ 0, (30)

and introduce the Lagrangian,

𝐿𝐿(𝑧𝑧, 𝜆𝜆, 𝑣𝑣, 𝜉𝜉) = 𝐹𝐹(𝑧𝑧, 𝜉𝜉) + 𝜆𝜆𝑇𝑇𝑔𝑔(𝑧𝑧, 𝜉𝜉) + 𝑣𝑣𝑇𝑇𝑐𝑐(𝑧𝑧), (31)

where the tuple 𝜉𝜉 = (𝑥𝑥�(𝑡𝑡), 𝑟𝑟(𝑡𝑡)) collects the time varying parameters of the OCP and 𝜆𝜆 and 𝑣𝑣 are Lagrange multipliers. The following quadratic programming problem (QPP) is then solved at the 𝑘𝑘th time-step

min∆𝑧𝑧

12∆𝑧𝑧𝑇𝑇𝐻𝐻 ∆𝑧𝑧 + 𝑞𝑞𝑇𝑇∆𝑧𝑧,

𝑠𝑠. 𝑡𝑡. 𝐺𝐺 𝑧𝑧 = ℎ, 𝐴𝐴 𝑧𝑧 ≤ 𝑏𝑏,

(32)

Approved for public release; distribution is unlimited 10

where 𝐻𝐻 ≈ ∇𝑧𝑧2𝐿𝐿(𝑧𝑧𝑘𝑘−1, 𝜆𝜆𝑘𝑘−1, 𝑣𝑣𝑘𝑘−1), 𝑞𝑞 = ∇𝑧𝑧𝐿𝐿(𝑧𝑧𝑘𝑘−1, 𝜆𝜆𝑘𝑘−1, 𝑣𝑣𝑘𝑘−1), 𝐺𝐺 = ∇𝑧𝑧𝑔𝑔(𝑧𝑧𝑘𝑘−1), ℎ = 𝑔𝑔(𝑧𝑧𝑘𝑘−1),𝐴𝐴 = ∇𝑧𝑧𝑐𝑐(𝑧𝑧𝑘𝑘−1), and 𝑏𝑏 = −𝑐𝑐(𝑧𝑧𝑘𝑘−1). The solution estimate is then updated as

𝑧𝑧𝑘𝑘 = ∆𝑧𝑧∗ + 𝑧𝑧𝑘𝑘−1, (33)

and the multipliers are updated as

𝑣𝑣𝑘𝑘 = 𝑣𝑣∗, 𝜆𝜆𝑘𝑘 = 𝜆𝜆∗, (34)

where (∆𝑧𝑧∗, 𝜆𝜆∗, 𝑣𝑣∗) is the solution of the QP. Under certain assumptions, this strategy generates a stabilizing feedback law and will converge to the optimum solution of the original nonlinear program [9]. The convergence of the SQP algorithm can be monitored using the natural residual of the KKT conditions

𝜋𝜋(𝑥𝑥, 𝜉𝜉) = ���∇𝑥𝑥𝐿𝐿(𝑧𝑧, 𝜆𝜆, 𝑣𝑣, 𝜉𝜉)

𝑔𝑔(𝑥𝑥, 𝜉𝜉)min(−𝑐𝑐(𝑥𝑥),𝑣𝑣)

���. (35)

Written out explicitly, the feedback control law is computed by repeatedly solving QPs of the form:

min𝑥𝑥,𝑢𝑢,𝑠𝑠

�∆𝑢𝑢∆𝑥𝑥�𝑇𝑇�𝐻𝐻𝑢𝑢 0

0 𝐻𝐻𝑥𝑥� �∆𝑢𝑢∆𝑥𝑥� + �𝑑𝑑𝑢𝑢𝑑𝑑𝑥𝑥

�𝑇𝑇�∆𝑢𝑢∆𝑥𝑥� + 𝛾𝛾11𝑇𝑇𝑠𝑠

𝑠𝑠. 𝑡𝑡. 𝐴𝐴∆𝑥𝑥 + 𝐵𝐵∆𝑢𝑢 + 𝑔𝑔(�̅�𝑥,𝑢𝑢�) = 0 𝐶𝐶∆𝑥𝑥 + 𝑐𝑐(�̅�𝑥) ≤ Γ𝑠𝑠 𝑃𝑃∆𝑢𝑢 + 𝑝𝑝(𝑢𝑢�) ≤ 0

𝑠𝑠 ≥ 0

(36)

where Γ = 𝐼𝐼𝑁𝑁⨂ 1𝑁𝑁 , 𝐶𝐶 = ∇𝑥𝑥𝑐𝑐(�̅�𝑥), 𝑃𝑃 = ∇𝑢𝑢𝑝𝑝(𝑢𝑢�), 𝐴𝐴 = ∇𝑥𝑥𝑔𝑔(�̅�𝑥,𝑢𝑢�), 𝐵𝐵 = ∇𝑢𝑢𝑔𝑔(�̅�𝑥,𝑢𝑢�), 𝐻𝐻𝑥𝑥 =𝑑𝑑𝑖𝑖𝑑𝑑𝑔𝑔�𝐼𝐼𝑁𝑁−1 ⨂𝑄𝑄 ,𝑄𝑄𝑓𝑓�, 𝐻𝐻𝑢𝑢 = 𝐼𝐼𝑁𝑁 ⨂𝑅𝑅.

To speed up computations we can condense the problem. Since for any sequence 𝑢𝑢 the state sequence x is uniquely determined 𝐴𝐴 must be invertible, allowing us to write

(37) ∆𝑥𝑥 = −𝐴𝐴 −1𝐵𝐵 ∆𝑢𝑢 − 𝐴𝐴 −1𝑔𝑔 (𝑥𝑥 ̅, �) = 𝐺𝐺 ∆𝑢𝑢 + 𝑤𝑤

min∆𝑢𝑢,𝑠𝑠

�∆𝑢𝑢𝑠𝑠�𝑇𝑇�𝐻𝐻 00 0� �

∆𝑢𝑢𝑠𝑠 � + � 𝑑𝑑𝛾𝛾1 �

𝑇𝑇�∆𝑢𝑢𝑠𝑠 �

𝑠𝑠. 𝑡𝑡. �𝑃𝑃 0𝐶𝐶𝐺𝐺 −Γ0 −𝐼𝐼

� �∆𝑢𝑢𝑠𝑠 � + ��̅�𝑝

𝑐𝑐̅ + 𝐶𝐶𝑤𝑤0

� ≤ 0, (38)

where 𝐻𝐻 = 𝐻𝐻𝑢𝑢 + 𝐺𝐺𝑇𝑇𝐻𝐻𝑥𝑥𝐺𝐺, and 𝑑𝑑 = 𝑑𝑑𝑢𝑢 + 𝐺𝐺𝑇𝑇(𝐻𝐻𝑥𝑥𝑤𝑤 + 𝑑𝑑𝑥𝑥). The iterates can then be updated as

𝑥𝑥 ← 𝑥𝑥̅ + 𝐺𝐺∆𝑢𝑢 + 𝑤𝑤, 𝑢𝑢 ← 𝑢𝑢� + ∆𝑢𝑢. (39)

3.2.2.2 A Computational Reference Governor for Suboptimal MPC.

Suboptimal MPC (SOMPC), where optimization algorithm iterations are distributed over time, have the potential to reduce computation times and improve the reliability of numerical algorithms for MPC controller. In SOMPC an estimate of the OCP solution is maintained within the controller and a finite number of optimization algorithm iterations are performed at each sampling instance. This practice is commonly known as warmstarting. The solution updating process can be seen Figure 1, the result is a pair of coupled dynamic systems representing the plant and optimizer respectively.

FIGURE 1 A COMPARISON OF SUBOPTIMAL MPC AND IDEAL MPC

Model predictive control imposes constraints on the plant, the objective of the CRG is to impose constraints on the optimizer. Specifically, we’re interested in imposing the constraint

𝜋𝜋(𝑧𝑧, 𝑥𝑥) ≤ 𝜋𝜋𝑢𝑢𝑢𝑢 , (40)

Approved for public release; distribution is unlimited 11

We can then write the condensed problem as

Approved for public release; distribution is unlimited 12

�|𝑧𝑧𝑘𝑘 − 𝑧𝑧∗(𝜉𝜉𝑘𝑘)|� ≤ 𝜂𝜂�|𝑧𝑧𝑘𝑘−1 − 𝑧𝑧∗(𝜉𝜉𝑘𝑘−1)|� + 𝛽𝛽�|∆𝑥𝑥𝑘𝑘|� + 𝛽𝛽||∆𝑟𝑟𝑘𝑘||, (41)

where 𝑧𝑧 is the primal dual solution estimate, 𝑧𝑧∗(𝜉𝜉) is the parameter to solution mapping for the OCP and 𝜉𝜉 = (𝑥𝑥, 𝑟𝑟) is the OCP parameter which consists of the measured state 𝑥𝑥 and the reference command 𝑟𝑟.

The idea of the computational RG is to limit the residual 𝜋𝜋 by controlling ∆𝑟𝑟𝑘𝑘. To accomplish this, we use an add-on unit. The CRG is attached to the NMPC controller as shown in Figure 2 below:

FIGURE 2 ARCHITECTURE OF THE COMPUTATIONAL REFERENCE GOVERNOR

The CRG takes in the user commanded reference 𝑟𝑟, and the current OCP residual and outputs a modified auxiliary reference 𝑣𝑣. Internally, the CRG solves the following optimization problem

min∆𝑣𝑣

�|𝑟𝑟𝑘𝑘 − 𝑣𝑣𝑘𝑘−1 + ∆𝑣𝑣|�22

𝑠𝑠. 𝑡𝑡. |∆𝑣𝑣𝑖𝑖| ≤ ∆𝑣𝑣ub, 𝑖𝑖 = 1 … 3,𝜃𝜃𝑙𝑙𝑢𝑢 ≤ 𝑣𝑣𝑘𝑘−1 + ∆𝑣𝑣 ≤ 𝜃𝜃𝑢𝑢𝑢𝑢 ,

(42)

where 𝑟𝑟𝑘𝑘 is the reference, 𝑣𝑣𝑘𝑘−1 is the virtual reference at the previous step, 𝜃𝜃𝑙𝑙𝑢𝑢,𝑢𝑢𝑢𝑢 are the upper and lower bounds on the Euler angles. This is a quadratic program with only 3 decision variables; its computational cost is negligible compared to the MPC controller. The maximum step length is calculated as

∆𝑣𝑣𝑢𝑢𝑢𝑢 = max�0, 𝛿𝛿𝑣𝑣(𝜋𝜋𝑢𝑢𝑢𝑢 − 𝜋𝜋𝑘𝑘−1)

𝜋𝜋𝑢𝑢𝑢𝑢� (43)

where 𝜋𝜋𝑘𝑘−1 is the residual of the OCP at the previous sampling instance and the reference is updated as

𝑣𝑣𝑘𝑘 = 𝑣𝑣𝑘𝑘−1 + ∆𝑣𝑣. (44)

where is the residual function for the OCP defined in (35). Enforcing this constraint ensures the quality of the numerical solution. The solver obeys the following convergence bounds

This forms a kind of nonlinear filter; the underlying intuition is to reduce the step size proportionally to the distance from the constraint 𝜋𝜋(𝑧𝑧, 𝑥𝑥) ≤ 𝜋𝜋𝑢𝑢𝑢𝑢. The maximum reference step length 𝛿𝛿𝑣𝑣 and the maximum allowable residual 𝜋𝜋𝑢𝑢𝑢𝑢 are user chosen parameters.

3.3 Constrained control framework using a virtual net of chance constraint admissible sets and output measurements only

Consider a system represented by a discrete-time linear model with stochastic state disturbance and measurement noise inputs, given by

𝑥𝑥𝑘𝑘+1 = 𝐴𝐴𝑥𝑥𝑘𝑘 + 𝐵𝐵𝑢𝑢𝑘𝑘 + 𝛤𝛤𝑤𝑤𝑘𝑘, 𝑦𝑦𝑘𝑘 = 𝐶𝐶𝑥𝑥𝑘𝑘 + 𝐹𝐹𝑣𝑣𝑘𝑘,

(45)

where 𝑥𝑥𝑘𝑘 is a vector state and 𝑢𝑢𝑘𝑘 is a vector control. The variables 𝑤𝑤𝑘𝑘 and 𝑣𝑣𝑘𝑘 are, respectively, the state disturbance and the measurement noise inputs, each assumed to be a sequence of zero-mean, independent (and jointly independent) identically distributed (i.i.d.) random variables with 𝔼𝔼[𝑤𝑤𝑘𝑘𝑤𝑤𝑘𝑘

𝑇𝑇] = Σ𝑤𝑤 = Σ𝑤𝑤𝑇𝑇 , 𝔼𝔼[𝑣𝑣𝑘𝑘𝑣𝑣𝑘𝑘𝑇𝑇] = Σ𝑣𝑣 = Σ𝑣𝑣𝑇𝑇.

We consider a fixed-gain state feedback,

𝑢𝑢𝑘𝑘 = 𝐾𝐾𝑥𝑥�𝑘𝑘 + 𝐺𝐺𝑟𝑟, (46)

where 𝑟𝑟 is the reference and 𝑥𝑥�𝑘𝑘 is the state estimate generated by a fixed-gain, L, observer. Then the error, 𝑒𝑒 = 𝑥𝑥𝑘𝑘 − 𝑥𝑥�𝑘𝑘 is normally distributed with covariance

𝑃𝑃∞ = 𝐴𝐴𝑜𝑜𝑃𝑃∞𝐴𝐴𝑜𝑜𝑇𝑇 + 𝐵𝐵𝑜𝑜𝐵𝐵𝑜𝑜𝑇𝑇 𝐴𝐴𝑜𝑜 = 𝐴𝐴 + 𝐿𝐿𝐶𝐶,𝐵𝐵𝑜𝑜 = [𝛤𝛤, 𝐿𝐿𝐹𝐹]

(47)

Consider now enforcing a state constraint of the form,

𝐻𝐻𝑥𝑥𝑘𝑘 ≤ ℎ,∀𝑘𝑘 (48)

in a probabilistic manner. The resulting chance constraint can be shown to have the form (see [14] for details):

𝑃𝑃𝑟𝑟𝑃𝑃𝑏𝑏 �[𝐻𝐻 𝐻𝐻] �𝑥𝑥�𝑘𝑘𝑒𝑒𝑘𝑘� ≤ ℎ − 𝐻𝐻(𝐼𝐼 − 𝐴𝐴𝑐𝑐)−1𝐵𝐵𝑐𝑐𝑟𝑟� ≥ 1 − 𝛼𝛼, (49)

where 𝛼𝛼 is the probability threshold and 𝑒𝑒𝑘𝑘 is the state estimation error. For this chance constraint, a chance constrained admissible set, 𝑂𝑂∞(𝑟𝑟), can be defined which is the set of initial state estimates such that the subsequent trajectories for the given reference, 𝑟𝑟, satisfy the chance constraint. In defining 𝑂𝑂∞(𝑟𝑟) certain technical assumptions need to be made such that the observed dynamics have reached the steady-state [16].

Approved for public release; distribution is unlimited 13

Following the same approach as in [10], now a virtual net of reference points 𝑟𝑟 may be built to ensure safe transitions between reference points by declaring a connection between two nodes on the graph only exists if the a priori node lies within the chance constrained admissible set of a posteriori node. The maneuver planning problem on this graph is then solved using any standard graph search algorithm.

4. RESULTS AND DISCUSSION

4.1 Disjunctive-Based Constrained Spacecraft Relative Motion Planning

We consider a case study of spacecraft three-dimensional relative motion control. The relative motion dynamics are modeled with the linearized Clohessy-Wiltshire [11] equations,

(50)

where 𝑚𝑚 is the chaser vehicle’s mass and 𝜔𝜔 is the mean motion of the target vehicle’s orbit. We form a system of first-order equations with state vector 𝑥𝑥𝑘𝑘 = �𝑥𝑥1,𝑘𝑘,𝑥𝑥2,𝑘𝑘,𝑥𝑥3,𝑘𝑘, �̇�𝑥1,𝑘𝑘, �̇�𝑥2,𝑘𝑘, �̇�𝑥3,𝑘𝑘�

𝑇𝑇 and

discretize using a Zero-Order Hold [12] with sampling period of 30 seconds, chaser vehicle mass of 140 kg, and target vehicle mean motion of 0.0010 radians per second. We choose 𝐶𝐶 =[𝑰𝑰3 𝟎𝟎3×3], which corresponds to relative position measurements. The goal in this scenario is to rendezvous the chaser vehicle with the target vehicle, i.e., to bring the chaser's state to the origin. For this controller and observer, the feedback gain matrix 𝐾𝐾 is computed using LQR by solving the Discrete Time Algebraic Riccati Equation (DARE) and the observer gain matrix 𝐿𝐿 computed by solving the dual DARE, with 𝑄𝑄 = 𝑰𝑰6 and 𝑅𝑅 = 𝑰𝑰3 in each case.

4.1.2 Theoretical Results.

We consider a case study of spacecraft three-dimensional relative motion control. The relative motion dynamics are again modeled with the linearized CW equations,

(51)

where 𝑚𝑚 represents the chaser vehicle’s mass and 𝜔𝜔 is the mean motion of the target vehicle’s orbit. We form an equivalent system of first-order equations and discretize using Zero-Order Hold [12] with sampling period 30 sec, chaser vehicle mass of 140 kg, and target vehicle mean motion of 0.0010 rad/sec. We choose 𝐶𝐶 = [ 𝐈𝐈 𝟑𝟑 𝟎𝟎 3×𝟑𝟑 ], which corresponds to

Approved for public release; destribution is unlimited14

Approved for public release; distribution is unlimited 15

relative position measurements. The goal in this scenario is to rendezvous the chaser vehicle with the target vehicle, i.e., to bring the chaser’s state to the origin.

For this controller and observer, the feedback gain matrix 𝐾𝐾 is computed using LQR by solving the Discrete Time Algebraic Riccati Equation (DARE) and the observer gain matrix 𝐿𝐿 computed by solving the dual DARE, with 𝑄𝑄 = 𝐈𝐈6 and 𝑅𝑅 = 𝐈𝐈3 in each case . The dist ributions of the measurement noise, 𝑣𝑣𝑘𝑘, an d of th e st ate disturbance, 𝑤𝑤𝑘𝑘, are assu med to b e G aus sian with covariance matrices Σ𝑣𝑣 = 10−2 ⋅ 𝐈𝐈3 and Σ𝑤𝑤 = 10−4 ⋅ 𝐈𝐈6. The cost function is defined with 𝑟𝑟𝜂𝜂 = 0, 𝑅𝑅𝑒𝑒 = 𝐈𝐈, and 𝑅𝑅𝑥𝑥 = 𝟎𝟎. These choices ensure accurate estimates of the relative position states.

We will now construct a sequence that satisfies the sufficient conditions, and demonstrate that it leads to stable behavior. For this example, �̅�𝜌0 = 1.0063, 𝜌𝜌 1̅ = 0.2016, 𝜌𝜌� 0 = 0.0332, and 𝜌𝜌�1 = 1.0063. Working with the Frobenius norm, 𝑘𝑘∗ = 1 and

(52)

We begin with the (inadmissible) sequence 𝑆𝑆2 = {0, 1}, with 𝑛𝑛𝑠𝑠 = 2, 𝑛𝑛0 = 1, and 𝑛𝑛1 = 1. To modify the sequence such that it satisfies the sufficient conditions, increase 𝑛𝑛1 by 1, and iterate until the first logarithmic condition is satisfied, then iterate on 𝑛𝑛0 until the second logarithmic condition is satisfied, and repeat as necessary until both inequalities are true. For this example, the iterative process yields 𝑛𝑛0 = 3 and 𝑛𝑛1 = 5. The resulting sequence, 𝑠𝑠8 = {0, 0, 0, 1, 1, 1, 1, 1}, will converge in relative position and state error estimates when propagated forward periodically, and a representative simulation is shown in Figure 3.

FIGURE 3 DISJUNCTIVE SENSING AND CONTROL TRAJECTORY UNDER A LENGTH 8 PERIODIC SEQUENCE

The constructed sequence satisfies the contractivity conditions but it not the optimum sequence under the chosen cost functional. To determine the optimum such sequence, we apply the algorithm discussed in Section 3.2.4. There are no admissible sequences of length 2 or length 3, thus we extend the search to sequences of length 4. The optimum such sequence is 𝑆𝑆4 = {0, 0, 1, 1}, with 𝑞𝑞�𝐴𝐴 = 0.5879 and 𝑞𝑞� 𝐴𝐴 = 0.0130. This sequence, however, is optimal only over sequences of length 4; longer sequences may exist that exhibit faster convergence. When the algorithm treats sequences of length 7, the optimum such sequence is 𝑆𝑆7 = {0, 0, 1, 1, 1, 0, 0}, with 𝑞𝑞�𝐴𝐴 = 0.07594 and 𝑞𝑞�𝐴𝐴 = 3.796 × 10−5. Si mulated re sults of propagating ea ch op timal sw itching se quence forward periodically are shown in Figure 4; the sequence 𝑆𝑆7 converges more quickly than the sequence 𝑆𝑆 4, which goes through several additional oscillations before settling to the origin.

Approved for public release; distribution is unlimited 16

FIGURE 4 COMPARISON OF LENGTH 4 AND LENGTH 7 OPTIMAL SWITCHING SEQUENCES

Of note, when treating sequences of length 8, the optimum such sequence is {0, 0, 1, 1, 0, 0, 1, 1}, a reducible sequence that has the previous length 4 optimal sequence 𝑆𝑆4 a s the corresponding irreducible subsequence.

The remaining objective is the treatment of chance constraints. Suppose that we wish to establish a constraint on the steady-state of the form 𝑋𝑋 = {𝑥𝑥: ‖𝑥𝑥 ‖∞ ≤ 𝑏𝑏}, so t hat the chas er spac ecraft remains within a box centered at the origin, of side lengths 2𝑏𝑏, with Prob(𝑋𝑋) ≥ 0.95. Consider the length 4 sequence 𝑆𝑆4; invoking the Chebyshev inequality with 𝛼𝛼 𝑥𝑥 = �3/0.05, for each of 𝑃𝑃𝑥𝑥𝑠𝑠,0, …, 𝑃𝑃𝑥𝑥𝑠𝑠,3 yields four spheres, with minimum radius 𝑏𝑏 = 2.79 and maximum radius 𝑏𝑏 = 9.54. Only the maximum radius, the loosest of the four bounds, is guaranteed by the Chebyshev inequality, but even the tightest bound remains somewhat conservative. When setting 𝑏𝑏 = 2.5, t he c hance constraints are violated in no more than 4% of trajectories at any given time step. Figure 5 illustrates the satisfaction of the chance constraints over the course of 200 simulation runs.

Approved for public release; distribution is unlimited 17

Approved for public release; distribution is unlimited 18

FIGURE 5 DEMONSTRATION OF SATISFACTION OF BOTH BOX AND CHANCE CONSTRAINTS

4.2 A Computational Reference Governor for Safe Online Optimization

We placed CRG and NMPC combination in closed-loop with the model described in Section 3.2.1 and commanded a pair of reorientation maneuvers. Figure 6 shows the closed-loop traces, the constraints are satisfied, and the system successfully completes the maneuver.

FIGURE 6 THE TIME HISTORIES OF CONTROL TORQUES, ANGULAR VELOCITIES AND EULERANGLES IN CLOSED-LOOP SIMULATION

Figure 7 compares the angle references and OCP residual with and without the CRG. Without the CRG the OCP residual violates the maximum residual constraint (𝜋𝜋𝑘𝑘 ≤ 10−4). With the CRG active the residual constraint is satisfied; the CRG filters the reference commands to limit the change in the reference. While these simulations provide a proof of concept, further development of the theory of computational reference governors will be pursued in future research.

Approved for public release; distribution is unlimited 19

FIGURE 7 CLOSED-LOOP TRACES OF THE RESIDUALS AND REFERENCES WITH AND WITHOUT THE CRG.

4.3 Constrained Control Framework Using a Virtual Net of Chance Constraint Admissible Sets and Output Measurements Only

For illustration, a spacecraft constrained relative motion maneuver is considered. The equations of motion are based on the Clohessy-Wiltshire equations introduced in Section 4.1 modified with random disturbance terms and measurement noise. Only the spacecraft relative position coordinates are measured and an observer has been constructed to estimate the state. A virtual net of stationary reference points in the relative frame and of the corresponding chance constrained admissible sets has then been constructed. The graph search is performed with fuel usage as the cost. Figure 8 illustrates the resulting maneuver, where the constraints correspond to avoiding a cone emanating from the origin and staying with the specified box.

Approved for public release; distribution is unlimited 20

Approved for public release; distribution is unlimited 21

FIGURE 8 CLOSED-LOOP PATH OF SPACECRAFT IN CW FRAME. RED CONE IS THE OBSTACLEAND BLACK DOTS ARE THE REFERENCE POINT NODES IN THE GRAPH.

5. CONCLUSION

The outcomes of research into coordinating sensing, control and computations for autonomous vehicles are as follows:

• Strategies to ensure stability and eventual chance constraint enforcement have been defined forsystems where simultaneous sensing and actuation cannot be performed and thus switchingbetween sensing and actuation is necessary.

• A computational reference governor has been defined to supervise and modify if necessaryreference commands to a nonlinear model predictive controller and ensure constraints on theresiduals of the optimization problems to ensure acceptable accuracy of the solution within limitedcomputational time.

• For the case of systems operating with output measurements only and a state observer used toreconstruct the full state, a methodology for constructing and chaining chance constraint admissiblesets has been defined which reduces the problem of constructing constraint admissible maneuversto a standard graph search.

• Simulations of spacecraft relative motion and attitude control have been used to illustrate thepotential of the proposed strategies.

Approved for public release; distribution is unlimited 22

6. REFERENCES

[1] Sutherland, R., Kolmanovsky, I., Girard, A., Leve, F., Petersen, C., “Optimal strategies for disjunctive sensing and control,” 2018 American Control Conference, AIAA, Milwaukee, WI, June 2018.

[2] Sutherland, R., Kolmanovsky, I., Girard, A., Leve, F., Petersen, C., “Minimum-time model predictive spacecraft attitude control for waypoint following and exclusion zone avoidance,” 2019 AIAA SciTech Forum, AIAA, San Diego, CA, January 2019.

[3] Liao-McPherson, D., Nicotra, M., and Kolmanovsky, I., “Embedding constrained model predictive control in a continuous-time dynamic feedback,” IEEE Transactions on Automatic Control, 64, 5, pp. 1932-1946, 2019.

[4] Chen, C. Linear System Theory and Design, Oxford University Press, 3e, 1999.

[5] Yu, Qiang and Zhao, Xudong, “Stability analysis of discrete-time switched linear systems with unstable subsystems,” Applied Mathematics and Computation, 273, pp. 718-725, 2016.

[6] Lax, Peter D., Functional Analysis, John Wiley and Sons, pp. 195-197, 2002.

[7] Navarro, Jorge, “A simple proof for the multivariate Chebyshev inequality,” arXiv preprint, arXiv:1305.5646, 2013.

[8] Chen, Xinjia, “A new generalization of Chebyshev inequality for random vectors,” arXiv preprint, arXiv:0707.0805, 2007.

[9] Garone, E., Di Cairano, S., and Kolmanovsky, I., “Reference and command governors for systems with constraints: A survey on theory and applications,” Automatica, 75, pp. 306–328, 2017.

[10] Weiss, A., Kolmanovsky, I., Baldwin, M., Erwin, R.S., “Safe positively invariant sets for spacecraft obstacle avoidance,” AIAA Journal of Guidance, Control, and Dynamics, 36, 5, pp. 720-732, 2014.

[11] Clohessy, W. H., and Wiltshire, R. S., “Terminal guidance system for satellite rendezvous,” Journal of the Aerospace Sciences, 27, 9, pp. 653–658, 1960.

[12] Toth, R., Heuberger, P.S.C., and Van den Hof, P.M.J., “Discretisation of linear parameter-varying state-space representations,” IET Control Theory and Applications, 4, pp. 2082–2096, 2010.

[13]Sutherland, R., Kolmanovsky, I., Girard, A., Leve, F., and Petersen, C., “On closed-loop Lyapunov stability with minimum-time MPC feedback laws for discrete-time systems,” IEEE Conference on Decision and Control, Nice, France, 2019.

Approved for public release; distribution is unlimited 23

[14] Berning, A., Kolmanovsky, I., Girard, A., Leve, F., and Petersen, C., “Maneuver planningusing chained chance constrained admissible sets,” American Control Conference, Denver,Colorado, in preparation, 2020.

[15] Sutherland, R., “Advances in disjunctive and time-optimal predictive control methods,”Ph.D. Dissertation, The University of Michigan, 2019.

[16] Gilbert, Elmer G., Ilya Kolmanovsky, and Kok Tin Tan. "Discrete‐time referencegovernors and the nonlinear control of systems with state and control constraints."International Journal of robust and nonlinear control 5.5, pp. 487-504, 1995.

Approved for public release; distribution is unlimited 24

LIST OF SYMBOLS, ABBREVIATIONS, AND ACRONYMS

CRG Computational Reference Governor

DARE Discrete-Time Algebraic Riccati Equation

DSC Disjunctive Sensing and Control

IID Independent and Identically Distributed

MILP Mixed-Integer Linear Program

MPC Model Predictive Control

NMPC Nonlinear Model Predictive Control

OCP Optimal Control Problem

DISTRIBUTION LIST

1 cy

1 cy

DTIC/OCP 8725 John J. Kingman Rd, Suite 0944 Ft Belvoir, VA 22060-6218

AFRL/RVIL Kirtland AFB, NM 87117-5776

Official Record Copy AFRL/RVS/Christopher Petersen 1 cy

25Approved for public release; distribution is unlimited

(This page is intentionally left blank)

26Approved for public release; distribution is unlimited