switching control

Upload: rnoshirvani

Post on 05-Apr-2018

224 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/31/2019 Switching Control

    1/10

    2012 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 53, NO. 9, OCTOBER 2008

    Safe Adaptive Switching Control:Stability and Convergence

    Margareta Stefanovic, Member, IEEE, and Michael G. Safonov, Fellow, IEEE

    AbstractA formal theoretical explanation of the model-mis-match instability problem associated with certain adaptive controldesign schemes is proposed, and a solution is provided. To addressthe model-mismatch problem, a primary task of adaptive controlis formulated as finding an asymptotically optimal, stabilizingcontroller, given the feasibility of adaptive control problem. Aclass of data-driven cost functions called cost-detectable is intro-duced that detect evidence of instability without reference to priorplant models or plant assumptions. The problem of designingadaptive systems that are robustly immune to mismatch instabilityproblems is thus placed in a setting of a standard optimizationproblem. We call the result safe adaptive controlbecause it robustly

    achieves adaptive stabilization goals whenever feasible, withoutprior assumptions on the plant model and, hence, without the riskof model-mismatch instability. The result improves the robustnessof previous results in hysteresis switching control, both for discreteand for continuously-parameterized candidate controller sets.Examples are provided.

    Index TermsAdaptive control, convergence, robustness, sta-bility, switching.

    I. INTRODUCTION

    T

    HE book Adaptive Control [2] begins in the followingway: In everyday language, to adapt means to change a

    behavior to conform to new circumstances. Intuitively, an adap-tive controller is thus a controller that can modify its behavior inresponse to changes in the dynamics of the plant and the char-acter of the disturbances. Whether it is conventional, contin-uous adaptive tuning or more recent adaptive switching, adap-tive control has an inherent property that it orders controllersbased on evidence found in data. Any adaptive algorithm canthus be associated with a cost function, dependent on availabledata, that it minimizes, though this may not be explicitly present.The differences among adaptive algorithms arise in part dueto the specific algorithms employed to approximately computecost-minimizing controllers. And, major differences arise due

    to the extent to which additional assumptions are tied with thiscost function. The cost function needs to be chosen to reflect

    ManuscriptreceivedFebruary 6, 2007; revised January 30,2008. Current ver-sion published October 8, 2008. This work was supported in part by the AirForce Office of Scientific Research under Grant F49620-01-1-0302. Recom-mended by Associate Editor J. Hespanha.

    M. Stefanovic is with the Department of Electrical and Computer En-gineering, University of Wyoming, Laramie, WY 82071 USA (e-mail:[email protected]).

    M. G. Safonov is with the Department of Electrical Engineering-Systems,University of Southern California, Los Angeles, CA 90089-2563 USA (e-mail:[email protected]).

    Color versions of one or more of the figures in this paper are available onlineat http://ieeexplore.ieee.org.

    Digital Object Identifier 10.1109/TAC.2008.929395

    control goals. The perspective adopted in this paper hinges onthe notion of feasibility of adaptive control. An adaptive con-trol problem is said to be feasible if the plant is stabilizable andat least one (a priori unknown) stabilizing controller exists inthe candidate controller set that achieves the specified controlgoal for the given plant. Given feasibility, our view of a pri-mary goal of adaptive control is to recognize when the accumu-lated experimental data shows that a controller fails to achievedesired stability and performance objectives. If a destabilizingcontroller happens to be the currently active one, adaptive con-

    trol should eventually switch it out of the loop, and replace itwith an optimal, stabilizing one. An optimal controller is onethat optimizes the controller ordering criterion (cost function)given the currently available evidence. This perspective rendersthe adaptive control problem in a form of a standard constrainedoptimization. A concept similar to this feasibility notion can befound in [15].

    To address the emerging need for robustness for larger uncer-tainties or achieve tighter performance specifications, severalrecent important advances have emerged, such as [13] andmulti-model controller switching formulations of the adaptivecontrol problem, e.g., supervisory based control design in [8],

    [10], [18], and [19], or data-driven unfalsified adaptive controlmethods of [23] (based on criteria of falsifiability [20], [31],[32]), which exploit evidence in the plant output data to switcha controller out of the loop when the evidence proves that thecontroller is failing to achieve the stated goal. In both cases,the outer supervisory loop introduced to the baseline adaptivesystem allows fast discontinuous adaptation in highly uncer-tain nonlinear systems, and thus leads to improved performanceand overcomes some limitations of classical adaptive control.These formulations have led to improved optimization-basedadaptive control theories and, most importantly, significantlyweaker assumptions of prior knowledge. Both indirect [18],[19], [33] and direct [6], [15], [33] switching methods have

    been proposed for the adaptive supervisory loop. Recently,performance issues in switching adaptive control have beenaddressed in robust multiple model adaptive control schemes(RMMAC) [3]. The results of this paper build on the result ofMorse et al. [17], [18] and Hespanha et al. [9][11], and widenthe theoretical ground in our paper [25] by allowing the class ofcandidate controllers to be infinite so as to allow considerationof continuously parameterized adaptive controllers, in additionto finite sets of controllers. It is shown that, under some mild ad-ditional assumptions on the cost function (designer-based, notplant-dependent), stability of the closed-loop switched systemis assured, as well as the convergence to a stabilizing controller

    in finitely many steps. We show, via an example, that, when0018-9286/$25.00 2008 IEEE

  • 7/31/2019 Switching Control

    2/10

    STEFANOVIC AND SAFONOV: SAFE ADAPTIVE SWITCHING CONTROL: STABILITY AND CONVERGENCE 2013

    Fig. 1. Switching adaptive control system .

    there is a mismatch between the true plant and the assumptions

    on the plant (priors), a wrong ordering of the controllers canin some cases give preference to destabilizing controllers. This

    phenomenon is called model mismatch instability. However,

    prior knowledge and plant models, when they can be found,can be incorporated into the design of the candidate controllers,

    which could be used together with the safe switching algorithm.

    In certain ways, the paradigm of adaptive control problem

    cast as a constrained optimization problem bears similarities

    with the ideas found in machine learning algorithms [16],

    [26]. As a direct, data-driven switching adaptive method, it

    is more similar to the reinforcement learning algorithms than

    supervised/unsupervised learning. In reinforcement learning,

    the algorithm learns a policy of how to act given an observation

    of the world. Every action has some impact in the environment,

    and the environment provides feedback that guides the learning

    algorithm.

    The organization of the paper is as follows. Preliminary facts

    are given in Section II. Section III contains the main results as

    well as an example of the cost function satisfying sufficient con-ditions for stability and finiteness of switches. An illustrative ex-ample is presented in Section IV. Section V provides some finalconclusions and directions for future work.

    II. PRELIMINARIES

    Consider the adaptive system shown in

    Fig. 1, where and are the plant input and output vector sig-

    nals, and is the linear vector space of functions whose

    norm, defined as , exists forany finite . For any , a truncation operatoris a linear projection operator that truncates the signal at .

    The symbol will be used for the truncated signal [22].

    The adaptive controller switches the currently active controller

    at times with . For brevity,

    we also denote the controller switched in the loop

    during the time interval . If finite, the total numberof switches is denoted by , so the final switching time isand the final controller is .

    We define the setwhere is an unknown plant. Unknown disturbances and noises

    may also affect the plant relation . Let

    represent the output data signals measured in one experiment,defined on the time-interval .

    We consider a possibly infinite set (e.g., containing a con-tinuum) of the candidate controllers. The finite controller setcase is included as a special case. The parametrization of ,

    denoted , will initially be taken to be a subset of ; the

    treatment of the infinite dimensional spaces will be discussed inthe Comment 5.

    Recall some familiar definitions from the stability theory. Afunction belongs in class if iscontinuous, strictly increasing and . The -norm of

    a truncated signal is given as .

    The Euclidean norm of the parameterization of the

    controller is denoted . A functional is said

    to be coercive [4] if when .

    Definition 1: A system with input and

    output is said to be stable if for every such input

    there exist constants such that

    (1)

    Otherwise, is said to be unstable. Furthermore, if (1) holdswith a single pair for all , then the system

    is said to be finite-gain stable, in which case the gain of is the

    least such .

    Comment 1: In general, can depend on the initial state.

    Specializing to the system in Fig. 1, and (without loss of gen-

    erality) disregarding , stability of the closed-loop system

    means that for every , there exist such that

    .

    Definition 2: The system is said to be incrementally stable

    if, for every pair of inputs and outputs

    , there exist constants such that

    (2)

    and the incremental gain of , when it exists, is the least

    satisfying (2) for some and all .

    Definition 3: The adaptive control problem is said to be fea-

    sible if a candidate controller set contains at least one con-

    troller such that the system is stable. A controller

    is said to be a feasible controller if the system is

    stable.

    Safe adaptive control problem goal is then formulated as

    finding an asymptotically optimal, stabilizing controller, giventhe feasibility of the adaptive control problem. Under this con-

    dition, safe adaptive control should recognize a destabilizingcontroller currently present in the loop, and replace it with an

    as-yet-unfalsified controller. Hence, we have the following (andonly) assumption on the plant that we will use in this paper.

    Assumption 1: The adaptive control problem is feasible.

    A similar notion of the safe adaptive control appears in [1],

    where Anderson et al. define the safe adaptive switching controlas one that always yields stable frozen closed-loop; the solution

    to this problem is achieved using the -gap metric. Limitations

    of the -gap metric are discussed in [12].

    Prevention from inserting a destabilizing controller in the

    loop is not assured (since the adaptive switching system cannot

    identify with certainty a destabilizing controller beforehand,

    based on the past data), but if such a controller is selected, itwill quickly be switched out as soon as the unstable modes

  • 7/31/2019 Switching Control

    3/10

    2014 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 53, NO. 9, OCTOBER 2008

    are excited. Under the feasibility condition, an unfalsifiedcontroller will always be found and placed in the loop. Whether

    the optimal, robustly stabilizing controller will eventually be

    found and connected, depends on whether the unstable modes

    are sufficiently excited.It follows from the above definition of the adaptive control

    goal that feasibility is a necessary condition for the existenceof a particular that robustly solves the safe adaptive

    control problem. This paper aims to show that feasibility is also

    a sufficient condition to design a robustly stable adaptive systemthat converges to a , even when it is not known a priori

    which controllers in the set are stabilizing.

    Definition 4: Stability of the system is said to

    be unfalsified by the data if there exist such

    that (1) holds; otherwise, we say that stability of the system

    is falsified by .

    Unfalsified stability is determined from (1) based on the datafrom one experiment for one input, while stability requiresadditionally that (1) hold for the data from every possible input.

    Any adaptive control scheme has a cost index inherently tiedto it, which orders controllers based on evidence found in data.

    This index is taken here to be a cost functional , de-

    fined as a causal-in-time mapping

    (3)

    An example of the cost function according to the above defi-nition, which satisfies the desired properties introduced later inthe text, is given in Section III-A. Various examples of the cost

    are discussed in [24], [29], among others.

    The switched system comprised of the plant and the cur-

    rently active controller , where is de-

    noted (Fig. 1). For the switched system inFig. 1, the is defined as

    , where .

    Definition 5: Given the pair , a controller

    is said to be falsified at time by the past measurement if. Otherwise it is said to

    be unfalsified at time by .Then, a robust optimal controller is one that stabilizes

    (in the sense of the Def. 1) the given plant and minimizes the

    true cost .

    Therefore, (and is not nec-

    essarily unique). Due to the feasibility assumption, at least one

    such exists, and .Definition 6: [23] For every , a fictitious reference

    signal is defined to be an element of

    In other words, is a hypothetical reference signal that

    would have exactly reproduced the measured data had the

    controller been in the loop for the entire time period over

    which the data was collected.

    Definition 7: [28] When for each and there is a unique

    , then we say is causally left invertible (CLI)

    and we denote by the induced causal map . The

    causal left inverse is called the fictitious reference signalgenerator(FRS) for the controller . When is incremen-

    tally stable, is called stably causally left invertible controller

    (SCLI).

    Definition 8: Let denote the input and

    denote the resulting plant data collected with as the current

    controller. Consider the adaptive control system of

    Fig. 1 with input and output . The pair is said to

    be cost-detectable if, without any assumption on the plantand for every with finitely many switching times, thefollowing statements are equivalent:

    1) is bounded as increases to infinity;2) stability of the system is unfalsified by the input-

    output pair .

    Comment 2: With cost-detectability satisfied, we can use thecost to reliably detect any instability exhibited by

    the adaptive system, even when initially the plant is completely

    unknown.

    Comment 3: Cost-detectability is different from the plant de-

    tectability. Cost-detectability is determined from the knowledge

    of the cost function and candidate controllers, without refer-

    ence to the plant. In [9], a problem similar to ours is approachedusing the following assumptions: 1) the plant itself is detectable

    and 2) the candidate plant models are stabilized by the corre-

    sponding candidate controllers. The difference between the ap-

    proach in [9], [7] and this paper lies in the definition of cost-de-tectability introduced in this paper, which is the property of the

    cost function/candidate-controller-set pair, but is independent of

    the plant.

    In the following, we use the notation

    for a family of functionals

    with the common domain , with .

    Let

    denote the level set in the controller space corresponding tothe cost at the first switching time instant. With the family offunctionals with a common domain , a restriction to the

    set is associated, defined as a family of functionalswith a common domain .

    Thus, is identical to on , and is equal to

    outside .

    Consider now the cost minimization hysteresis switching

    algorithm reported in [17], together with the cost functional

    . The algorithm returns, at each , a controller

    which is the active controller in the loop:

    -HYSTERESIS SWITCHING ALGORITHM A1 [17]

    where is the Kroneckers , and is the limit of frombelow as .

    The switch occurs only when the current unfalsified cost re-lated to the currently active controller exceeds the minimum

    (over ) of the current unfalsified cost by at least (Fig. 2).The hysteresis step serves to limit the number of switches on

    any finite time interval to a finite number, and so prevents thepossibility of the limit cycle type of instability. It also ensures a

    nonzero dwell time between switches.

    The hysteresis switching lemma of [17] implies that a

    switched sequence of controllers that min-imize (over ) the current unfalsified cost at each

  • 7/31/2019 Switching Control

    4/10

    STEFANOVIC AND SAFONOV: SAFE ADAPTIVE SWITCHING CONTROL: STABILITY AND CONVERGENCE 2015

    Fig. 2. Cost versus control gain time snapshots.

    switch-time , will also stabilize the plant if the cost related to

    each fixed controller has the following properties: first, it isa monotone increasing function of time and second, it is uni-

    formly bounded above if and only if is stabilizing. But, these

    properties were demonstrated for the cost functions

    in [17] only by introducing prior assumptions on the plant,

    thereby also introducing the possibility of model-mismatch

    instability.

    Definition 9: [30]. Let be a topological space. A family

    of complex functionals with a common

    domain is said to be equicontinuous at a point if for

    every there exists an open neighborhood such that

    . The family is

    said to be equicontinuous on if it is equicontinuous at each

    . is said to be uniformly equicontinuous on if

    such that

    , where denotes an open neighborhoodof size .

    In a metric space with a metric , uniform equiconti-

    nuity means that

    .

    Lemma 1: If isa compactmetricspace, then any family

    that is equicontinuous on is uniformly

    equicontinuous on .

    Proof: In [21].

    III. MAIN RESULT

    The results on stability and finiteness of switches are devel-

    oped in the sequel.Lemma 2: Consider the feedback adaptive control system

    inFig. 1 withinput (generality is not lostif istakeninsteadof

    the input ) and output , together with

    the hysteresis switching algorithm A1. Suppose there are finitelymany switches. If the adaptive control problem is feasible (Def.

    3), candidate controllers are SCLI, and the following properties

    are satisfied: is cost-detectable (Def. 8) is monotone increasing in time

    then the final switched controller is stabilizing. Moreover, thesystem response with the final controller satisfies the perfor-mance inequality

    Proof: It suffices to consider the final controller . De-note the final switching time instant . Then, by the definitionof , and feasibility of the control problem (Def. 3), it

    follows that for all

    (4)

    (5)

    Further, by monotonicity in of , it follows that (5)

    holds for all . Due to the cost-detectability, stability of

    with is not falsified by , that is, there exist constantscorresponding to the given such that

    (6)

    According to Lemma 6 in Appendix A, there exist

    such that . This, along with (6),

    implies , for some .

    Lemma 3: Let be a continuous and coercive

    function on . Then for any scalar , the levelset

    is compact.

    Proof: Since , we show that is closed and

    bounded: Let be a convergent sequence, and

    . Since is continuous, .

    Also, . Then,

    , so . Hence, is closed. To

    show that is is bounded, proceed by contradiction. As-

    sume that is not bounded; then there exists a sequence

    such that . Since is co-

    ercive, ; in particular, such that

    , for any fixed . Then, ,

    which contradicts the above assumption. Thus, is closedand bounded in , therefore compact.

    Lemma 4: Consider the feedback-adaptive control system in

    Fig. 1, together with the switching algorithm A1. If the adap-

    tive control problem is feasible (Def. 3), and the associated

    cost functional/controller set pair is cost-detectable,

    is monotone increasing in time and, in addition:

    For all , the cost functional iscoercive on

    (i.e., ), and

    The family of restricted cost functionals with a common domain

    is equicontinuous on ,

    then the number of switches is uniformly bounded above for all

    by some .

    Proof: Our proof is similar to the convergence lemmas of

    [10], [17]. A graphical representation of the switching process,

    giving insight to the derivation presented below, is shown in

    Fig. 3. Due to Lemma 3, the level set is compact. Then,

    the family is uniformly

    equicontinuous on (Lemma 1), i.e., for a hysteresis step

    such that for all

    (i.e.,

    is common to all and all ). Since is

    compact, there exists a finite open cover ,with such that ,

  • 7/31/2019 Switching Control

    5/10

    2016 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 53, NO. 9, OCTOBER 2008

    Fig. 3. Derivation of the upper bound on switches in the continuum case.

    where depends on the chosen hysteresis step (this is a di-

    rect consequence of the definition of a compact set). Let bethe controller switched into the loop at the time , and the corre-sponding minimum cost achieved is .

    Consider that at the time a switch occurs at the

    same cost level , i.e., where

    . Therefore,

    is falsified, and so are all the controllers . Letbe the index set of the as-yet-unfalsified -balls of controllersat the time . Since , for some

    ( is not necessarily a singleton as may belong to more

    than one balls , but it suf fices for the proof that thereis at least one such index ), also falsified are all the controllers

    , so that . Thus, is up-

    dated according to the following algorithm ( is the index of the

    switching time ).

    Unfalsified index set algorithm:

    1) Initialize: Let

    2) . If : Set //

    Optimal cost increases

    Else ,where is such that

    .

    3) go to (2)

    Therefore, the number of possible switches to a single

    cost level is upper-bounded by , the number of -balls

    in the cover of . The next switch (the very first after theth one), if any, must occur to a cost level higher than

    , due to the monotonicity of . But then, according

    to Algorithm 1, ,

    with and

    . Combining the two bounds, the overall

    number of switches is upper-bounded by

    Equicontinuity assures that the cost functionals in the said

    family have associated -balls offinite, non-zero radii, which isused to upper bound the number of switches. If holds,

    the set is compact; otherwise, an additional requirement that

    the set is compact is needed. The finite controller set caseis obtained as a special case of the Lemma 4, with being the

    number of candidate controllers instead of the number of -ballsin the cover of . The main result follows.

    Theorem 1: Consider the feedback adaptive control system

    in Fig. 1, together with the hysteresis switching algorithm A1.

    Suppose that the adaptive control problem is feasible, the asso-

    ciated cost functional is monotone in time, the pair

    is cost-detectable, candidate controllers are SCLI, and

    the conditions of Lemma 4 hold. Then, the switched closed-loop

    system is stable, according to Def. 1. In addition, for each , the

    system converges after finitely many switches to the controllerthat satisfies the performance inequality

    (7)

    Proof: Invoking Lemma 4 proves that there are finitelymany switches. Then, Lemma 2 shows that the adaptive con-

    troller stabilizes, according to Def. 1, and that (7) holds.

    Comment 4: Note that, due to the coerciveness of

    is bounded below (by a nonnegative

    number, if the range of is a subset of ), for all .

    Comment 5: The parametrization of the candidate controller

    set can be more general than ; in fact, it can belong

    to an arbitrary infinite dimensional space; however has to becompact in that case, in order to ensure uniform equi-continuity

    property.

    Note that the switching ceases after finitely many steps for all. If the system input is sufficiently rich so as to increase

    the cost more than above the level at the time of the latest

    switch, a switch to a new controller that minimizes the current

    cost will eventually occur at some later time. The values of these

    cost minima at any time are monotone increasing and bounded

    above by . Thus, suf ficient richness of the systeminput (external reference signal, disturbance or noise signals)

    will affect the cost to approach .

    Comment 6: The minimization of the cost functional over the

    infiniteset istractableif the compact set can berepresentedas a finite union of convex sets, i.e., the cost minimization is aconvex programming problem.

    A. Cost Function Example

    An example of the cost function and the conditions under

    which it ensures stability and finiteness of switches accordingto Theorem 1 may be constructed as follows. Consider (a not

    necessarily zero-input zero-output) system in

    Fig. 1. Choose as a cost functional

    (8)

    where are arbitrary positive numbers. The constant is

    used to prevent division by zero when (unless

    has zero-input zero-output property), ensures evenwhen , and ensures coerciveness of .

  • 7/31/2019 Switching Control

    6/10

    STEFANOVIC AND SAFONOV: SAFE ADAPTIVE SWITCHING CONTROL: STABILITY AND CONVERGENCE 2017

    Fig. 4. MFD ofa controller inclosed loopwiththe plant [14]:(a) feedbackloop

    with the current controller written in MFD form and (b) fictitious feedbackloop associated with the candidate controller written in MFD form (in bothcases, are the actually recorded data).

    Alternatively, in order to avoid the restriction to the minimum

    phase (SCLI) controllers (which would assure causality and in-

    cremental stability of the map ), the denominator of

    (8) can contain instead of [14], [5], where is definedvia the matrix fraction description (MFD) form of the controller

    , as and

    (Appendix B), where the mentioned signals are shown in Fig. 4

    (9)

    Both (8) and (9) satisfy the required properties of Theorem 2,

    i.e.,: monotonicity in time, coerciveness on , equicontinuity

    of the restricted cost family , and cost detectability. The firsttwo properties are evident by inspection of (8) and (9). The jus-

    tification for the last two properties is as follows.Since in (8) and (9) is continuous in , then ,

    defined as the family of the cost functionals restrictedto the level set , is equicon-

    tinuous (since for any and are either equal to , or

    clamped at ).

    Lemma 5: Consider the cost functions (8) and (9) with

    . For to be cost-detectable, it is sufficientthat the candidate controllers in the set are SCLI, or that

    they admit matrix fraction description (MFD) form considered

    in [14].

    Proof: Cost-detectability of with in (8) follows

    from the following: 1) the fact that is bounded asif and only if stability is unfalsified by the input-output

    pair [28]; 2) SCLI property of the controllers; 3) sta-

    bility of the mapping (Lemma 6 in Appendix A); and

    4) unfalsified stability by the data (see Appendix C).These results can be elaborated further using [14] for the class

    of non-SCLI controllers and the cost function (9), which also

    ensure internal stability of the adaptive system designed usingcost-detectable cost-functions of the forms (8) or (9).

    IV. SIMULATION EXAMPLE

    The algorithm A1 in Section II originated as the hysteresis

    switching algorithm in [17]. We emphasized that the power ofthe hysteresis switching lemma was clouded in the cited work by

    imposing unnecessary assumptions on the plant in the demon-

    strations of the algorithm functionality. One of the plant prop-

    erties required in [17] for ensuring tunability was the min-imum phase property of the plant. We have shown in theory

    that the cost detectability is assured by properly choosing a cost

    function, and is not dependent on the plant or exogenous sig-

    nals. In the following, we present a simulation example thatdemonstrates these findings. Assume that a true, unknown planttransfer function is given by .

    Given is the set of three candidate controllers:

    and , each of which

    stabilizes a different possible plant model. The task of the

    adaptive control is to select one of these controllers, based on

    observed data. The problem is complicated by the fact that in

    this case, as is often the case in practice, the true plant is

    not in the model set, i.e., there exists a model mismatch.A simple analysis of the nonswitched system (true plant in

    feedback with each of the controllers separately) shows that

    is stabilizing (yielding a nonminimum phase but stable closed

    loop) while and are destabilizing. Next, a simulation was

    performed of a switched system, where A1 was used to select

    the optimal controller, and the cost function was chosen to be a

    combination of the instantaneous error and a weighted accumu-

    lated error

    (10)

    where is the fictitious error of the th controller, defined as

    (11)

    and and , where is astable, minimum phase reference model. This is the same cost

    function used in the multiple model switching adaptive control

    scheme [19], with replaced by , the identification error ofthe th plant model [for the special case of the candidate con-

    trollers designed based on the MRAC method, is equivalent

    to the control error and to the fictitious error (11)].The simulations assume a band-limited white noise at the

    plant output and the unit-magnitude square reference signal. The

    stabilizing controller has initially been placed in the loop,

    and the switching, which would normally occur as soon as the

    logic condition of Algorithm A1 is met, is suppressed during the

    initial 5 s of the simulation. That is, the adaptive control loop isnot closed until time s. The reason for waiting some pe-

    riod of time before engaging the switching algorithm will be

    explained shortly. The forgetting factor is chosen to be 0.05.

    Figs. 5(a) and 6(a) show the cost dynamics and the reference

    and plant outputs, respectively.

    Soon after the switching was allowed (after ), the al-

    gorithm using cost function (10) discarded the stabilizing con-

    troller initially placed into the loop and latched onto a desta-

    bilizing one, despite the evidence of instability found in data.

    Even though the stabilizing controller was initially placed in the

    loop, and forced to stay there for some time (5 s in this case), as

    soon as the switching according to (10) was allowed, it promptly

    switched to a destabilizing one. This model-match instabilityhappens because the cost function (10) is not cost-detectable.

  • 7/31/2019 Switching Control

    7/10

    2018 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 53, NO. 9, OCTOBER 2008

    Fig. 5. Without cost-detectability, the optimal cost-minimizing controller

    destabilizes the system. (a) Non-cost-detectable case; cost (10). (b) Cost-de-tectable case; cost (12).

    Note that the initial idle period of 5s is used only to empha-

    size that, even when the data are accumulated with the stabi-

    lizing controller, the switching algorithm based on (10) can dis-

    regard this data, and latch onto a destabilizing controller. Thisidle period is not the dwell time in the same sense used in the

    dwell-time switching control literature.

    Next, a simulation was performed of the same system, but

    using a cost-detectable cost function (i.e., one that satisfies con-ditions of Theorem 1)

    (12)

    viz., an -gain type cost function (factor added for co-

    erciveness is not necessary, since the set of candidate controllers

    is finite in this example). Cost-detectability of followsfrom Lemma 5. The modified fictitious reference signal isused, instead of , due to the presence of the non-minimum

    Fig. 6. Switching using cost function (a) (10) and (b) (12). Reference and plantoutputs. (a) Non-cost-detectable case; (10), (b) Cost-detectable case; (10).

    phase controller . It is calculated from the on-line data as

    [according to (9)], where

    and for ; and

    for ; and and for .

    The corresponding simulation results are shown in Figs. 5(b)

    and 6(b). The initial controller was chosen to be (a desta-

    bilizing one). The constant was chosen to be 0.01. The fore-

    going example shows that closing an adaptive loop that is de-signed using a non-cost-detectable cost function like (10) can

    destabilize, even when the initial controller is stabilizing. In the

    example, this happens because there is a large mismatch be-

    tween the true plant and the plant models used to design the

    candidate controllers. On the other hand, Theorem 1 implies,

    and the example confirms, that such model-mismatch instabilitycannot occur when the adaptive control loop is designed using

    the cost-detectable -gain type cost function (12).

    V. CONCLUSION

    The goal of stabilizing an uncertain plant by means of

    switching through an infinite candidate controller set is solvedin the paper, provided that feasibility (existence of at least

  • 7/31/2019 Switching Control

    8/10

    STEFANOVIC AND SAFONOV: SAFE ADAPTIVE SWITCHING CONTROL: STABILITY AND CONVERGENCE 2019

    Fig. 7. Generators of the true and fictitious reference signals.

    one stabilizing solution in the candidate controller set) holds.

    Noting that every adaptive scheme is optimal with respect to

    some data-driven controller-ordering cost function, sufficientconditions are derived on the data-driven cost function to ensure

    stability and performance. To this end, we have re-examined

    the Morse-Mayne-Goodwin hysteresis algorithm for adaptivecontrol from the perspective of unfalsified control theory. Anupper bound on the number of switches for a general continuum

    controller set case is calculated. The result is a solution to the

    problem of model mismatch instability. Our cost- detectability

    approach to the adaptivecontrollerswitching robustly eliminates

    the possibility of model-mismatch instability that inevitably

    accompanies the introduction of any prior assumption on the

    plant. It does so by replacing the plant-dependent/controller-

    independent tunability requirements with the plant-independent/

    controller-dependent cost-detectability requirements.

    In the present paper, our focus has been on how a given can-

    didate controller set should be pruned based on data in orderto adaptively converge to a controller in the candidate set that

    achieves and maintains stable behavior and acceptable perfor-

    mance. Though this data driven process requires no prior knowl-

    edge of the plant model, this does not mean that our formulation

    is model-free. Plant models, when available, play an essential

    role in determining and designing the candidate controller set

    . The issue of how one might optimally utilize data for con-

    tinuously and adaptively generating new candidate controllers

    on the fly remains an open question.

    APPENDIX A

    RELATION BETWEEN AND

    Lemma 6: Consider the switching feedback adaptive control

    system (Fig. 1), where uniformly bounded reference input

    , as well as the output are given. Suppose there

    are finitely many switches. Let and denote the finalswitching instant and the final switched controller, respectively.Suppose that the final controller is SCLI (i.e., the fictitiousreference signal is unique and incrementally stable).

    Then

    (13)

    Proof: By the assumption there are finitely many switches.Consider the control configuration in Fig. 7. The top branch gen-

    erates the fictitious reference signal of the controller . Its in-puts are the measured data , and its output is . The

    output is generated by the fictitious reference signal generatorfor the controller , denoted . Inthe middle interconnec-

    tion, the signal , generated asthe outputof the final controllerexcited by the actual applied signals and , is simply in-

    verted by passing through the causal left inverse . Finally,the bottom interconnection has the identical structure as the top

    interconnection (series connection of and ), except that

    it should generate the actually applied reference signal . To this

    end, another input to the bottom interconnection is added (de-

    noted ), as shown in Fig. 7. This additional input can be

    thought of as a compensating (bias) signal, that accounts for the

    difference between the subsystems generating and before

    the time of the last switch. In particular, it can be shown (as seen

    in Fig. 7) that (due to the fact that

    ).

    By definition, is incrementally stable. Thus, there existconstants such that

    Whence by the triangle inequality for norms, inequality (13)

    holds with

    APPENDIX BDefinition 10: [27]. The ordered pair is a left ma-

    trix fraction description (MFD) of a controller if and

    are stable, is invertible, and .

    Comment 7: To avoid restricting our attention to only those

    controllers that are stably causally left invertible. i.e., controllers

    whose FRS generator is stable, we can use the MFD represen-

    tation of the controllers and write a modifiedfictitious reference

    signal as and [14].

    Similarly, would represent the modified applied ref-erence signal , that is related to the active controller in the loop

    . Thus, although may not be stable, which is the case with

    nonminimum phase controllers, is, by construction.

    APPENDIX C

    UNFALSIFIED STABILITY VERIFICATION FOR

    THE COST FUNCTION (8) IN SECTION III-A

    Recall that the controller switched in the loop at time is

    denoted ; and are the

    indices of the switching instants. When , we have

    . Let the controller switched

    at time be denoted (so, ). Then, due to the

    cost minimization property of the switching algorithm,

    , and .

    Denote by the time of the final switch, and the cor-responding controller . Consider the time interval .

  • 7/31/2019 Switching Control

    9/10

    2020 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 53, NO. 9, OCTOBER 2008

    During this time period, the active controller in the loop is

    since .

    Since ris uniformly bounded, .

    At , the cost of the current controller exceeds the cur-

    rent minimum by

    (14)

    and so, according to the hysteresis switching algorithm, a switch

    occurs to the controller . Expres-sion in (14) is finite since is finite and

    where is finite due to the feasibility assumption.Denoting the sum by , we have

    Now considerthe nextswitchingperiod, . The activecon-

    troller in the loop is . We have:

    where the second inequality from the left follows from the

    monotone increasing property of .

    Therefore, . Now

    Thus, is finite, and so are . By induction,we conclude that

    where is the final switching time. Since was the finalswitching time

    Thus, is finite for any finite

    From here we conclude that stability of the closed-loop switched

    system with the final controller is unfalsified by .

    ACKNOWLEDGMENT

    The authors would like to thank Prof. J. Hespanha for

    insightful comments and referral to the hysteresis switching

    lemma of Morse et al. [17] and the related references. They

    also thank anonymous reviewers for their valuable comments

    that helped improve the manuscript quality.

    REFERENCES

    [1] B. D. O. Anderson, T. S. Brinsmead, D. Liberzon, and A. S. Morse,

    Multiple model adaptive control with safe switching, Int. J. Adapt.Control and Signal Process., vol. 15, pp. 445470, 2001.

    [2] K. J. strm and B. Wittenmark, Adaptive Control. Reading, MA:Addison-Wesley, 1995 .

    [3] M. Athans et al., Issues, progress and new results in robust adaptivecontrol, Int. J. Adapt. Control Signal Process., vol. 20, pp. 519579,Dec. 2006.

    [4] D. P. Bertsekas, Nonlinear Programming, 2nd ed. Nashua, NH:Athena, 1999.

    [5] A. Dehghani, B. D. O. Anderson, and A. Lanzon, Unfalsified adap-tive control: A new controller implementation and some remarks, in

    European Control Conf. ECC07, Kos, Greece, Jul. 2007.[6] M. Fu and B. R. Barmish, Adaptive stabilization of linear systems via

    switching control, IEEE Trans. Automat. Control, vol. AC-31, no. 12,pp. 10971103, Dec. 1986.

    [7] J. P. Hespanha and A. S. Morse, Certainty equivalence implies de-tectability, Syst. Contr. Lettr., vol. 1, no. 1, pp. 113, Jan. 1999.

    [8] J. P. Hespanha, Logic-Based Switching Algorithms in Control, Ph.D.dissertation, Dept. Elect. Eng., Yale University, New Haven, CT, 1998.

    [9] J.P. Hespanha, D.Liberzon, and A.S. Morse, Supervision of integral-input-to-state stabilizing controllers, Automatica, vol. 38, no. 8, pp.13271335, Aug. 2002.

    [10] J. P. Hespanha, D. Liberzon, and A. S. Morse, Hysteresis-basedswitching algorithms for supervisory control of uncertain systems,

    Automatica, vol. 39, no. 2, Feb. 2003.[11] J. P. Hespanha, D. Liberzon, and A. S. Morse, Overcoming the lim-

    itations of adaptive control by means of logic-based switching, Syst.Control Lett., vol. 49, pp. 4965, 2003.

    [12] G. C. Hsieh and M. G. Safonov, Conservatism of the gap metric,IEEE Trans. Automat. Control, vol. 38, no. 4, pp. 594598, Apr. 1993.

    [13] E. Kosmatopoulos and P. A. Ioannou, Robust switching adaptive con-trol of multi-input nonlinear systems, IEEE Trans. Automat. Control,vol. 47, no. 4, pp. 610624, Apr. 2002.

  • 7/31/2019 Switching Control

    10/10

    STEFANOVIC AND SAFONOV: SAFE ADAPTIVE SWITCHING CONTROL: STABILITY AND CONVERGENCE 2021

    [14] C. Manuelli, S. G. Cheong, E. Mosca, and M. G. Safonov, Stability ofunfalsified adaptive control with non SCLI controllers and related per-formance under different prior knowledge, in Proc. European ControlConf. (ECC07), Kos, Greece, Jul. 2007.

    [15] B. Mrtensson, The order of any stabilizing regulator is sufficient in-formation for adaptive stabilization, Syst. Control Lettr., vol. 6, no. 2,pp. 8791, 1985.

    [16] T. Mitchell, Machine Learning. New York: McGraw-Hill, 1997.

    [17] A. S. Morse, D. Q. Mayne, and G. C. Goodwin, Applications of hys-teresis switching in parameter adaptive control, IEEE Trans. Automat.Control, vol. 37, no. 9, pp. 13431354, Sep. 1992.

    [18] A. S. Morse, Supervisory control of families of linear set-point con-trollersPart I: Exact matching, IEEE Trans. Automat. Control, vol.41, no. 10, pp. 14131431, Oct. 1996.

    [19] K. S. Narendra and J. Balakrishnan, Adaptive control using multiplemodels, IEEE Trans. Automat. Control, vol. 42, no. 2, pp. 171187,Feb. 1997.

    [20] K. R. Popper, Logic of Scientific Discovery 1934.[21] W. Rudin, Principles of Mathematical Analysis, 3rd ed. New York:

    McGraw-Hill, 1976.[22] M. G. Safonov, Stability and Robustness in Multivariable Feedback

    Systems. Cambridge, MA: MIT Press, 1980, Signal Processing, Op-timization and Control.

    [23] M. G. Safonov and T. Tsao, The unfalsified control concept andlearning, IEEE Trans. Automat. Control, vol. 42, no. 6, pp. 843847,

    Jun. 1997.[24] M. Stefanovic, Safe Switching Adaptive Control: Stability and Con-

    vergence, Ph.D. dissertation, Univ. Southern California, Los Angeles,2005.

    [25] M. Stefanovic, R. Wang, and M. G. Safonov, Stability and conver-gence in adaptive systems, in Proc. American Control Conf., Boston,MA, 2004.

    [26] R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduc-tion. Cambridge, MA: MIT Press, 1998.

    [27] M. Vidyasagar, Control System Synthesis: A Factorization Ap-proach. Cambridge, MA: The MIT Press, Sep. 1988.

    [28] R. Wang, A. Paul, M. Stefanovic, and M. G. Safonov, Cost-de-tectability and stability of adaptive control systems, in Proc. 44th

    IEEE Conf. Decision and Control, Seville, Spain, Dec. 2005.[29] R. Wang, Cost Detectability and Safe MCAC, Ph.D. dissertation,

    Univ. Southern California, Los Angeles, 2005.[30] R. L. Wheeden and A. Zygmund, Measure and Integral: An Introduc-

    tion to Real Analysis. New York: Marcel Dekker, 1977.[31] J. C. Willems, Paradigms and puzzles in the theory of dynamical sys-

    tems, IEEE Trans. Automat. Control, vol. 36, no. 3, pp.259294, Mar.1991.

    [32] G. Zames, On the input-output stability of nonlinear time-varyingfeedback systems, part 1: Conditions derived using concepts of loopgain, conicity and positivity, IEEE Trans. Automat. Control, vol.AC-11, pp. 465476, Apr. 1966.

    [33] P. Zhivoglyadov, R. H. Middleton, and M. Fu, Localization basedswitching adaptive control for time-varying discrete-time systems,

    IEEE Trans. Automat. Control, vol. 45, no. 4, pp. 5275, Apr. 2000.

    Margareta Stefanovic (S03M05) received theDipl.-Ing. degree from the University of Nis, Yu-goslavia, in 1996, and the M.Sc. and Ph.D. degreesin electrical engineering from the University ofSouthern California, Los Angeles, in 2002 and 2005,respectively.

    Shejoined theUniversityof Wyoming, Laramie, in2005 as an Assistant Professor in the Electrical and

    Computer Engineering Department. Her current re-search interests include switching adaptive control ofuncertain systems and coordinated control and deci-

    sion making in hyper-spectral imaging nano-satellite networks for Space Situ-ational Awareness.

    Michael G. Safonov (F89) was born in Pasadena,CA, on November 1, 1948. He received the B.S.,M.S., Eng., and Ph.D. degrees in electrical en-gineering from the Massachusetts Institute ofTechnology, Cambridge, in 1971, 1971, 1976, and1977, respectively.

    From 1972 to 1975, he served with the U.S. Navyas an Electronics Division Officer aboard the aircraftcarrier U.S.S. Franklin D. Roosevelt (CVA-42).Since1977, he has been with the University of SouthernCalifornia, Los Angeles, where he is presently a

    Professor of Electrical Engineering. He has been a Consultant to The AnalyticSciences Corp., Honeywell Systems and Research Center, Systems Control,Systems Control Technology, Scientific Systems, United Technologies, TRW,Northrop Aircraft, Hughes Aircraft, and others. His consulting and universityresearch activities have involved flight control system design studies in whichmodern robust multivariable control techniques were applied to a variety ofaircraft including the CH-47 Chinook helicopter (Analytic Sciences Corp.,1976), the NASA HiMAT aircraft (Honeywell/USC, 1980) and the F/A-18Hornet (Northrop, 19871991). During the academic year 19831984, he wasa Senior Visiting Fellow with the Department of Engineering, Cambridge Uni-versity, Cambridge, U.K., and in summer 1987 he held a similar appointment atImperial College of Science and Technology, London, U.K. During 1990 1991,he was a Visiting Faculty Member at Caltech, Pasadena, CA. He has authoredor co-authored more than 150 journal and conference papers and the book

    Stability and Robustness of Multivariable Feedback Systems (Cambridge, MA:MIT Press, 1980). Additionally, he is co-author of the MATLAB Robust ControlToolbox (Natick, MA: MathWorks), a software package for use with MATLAB.He is presently an Editor of the International Journal of Robust and NonlinearControl and Systems and Control Letters. His research interests include robustcontrol, infinity-norm optimal control theory, and nonlinear system theory withapplications to aerospace control design problems.

    Dr. Safonov served as an Associate Editor of the IEEE TRANSACTIONS ON

    AUTOMATIC CONTROL from 1985 to 1987.He was elected IFAC Fellowin 2008.From 1993 to1995, he was Chair of the AACC Awards Committee of the Amer-ican Automatic Control Council.