
PROSPECTS OF GRADIENT METHODS FOR NONLINEAR CONTROL

Ivo BUKOVSKÝ1

Jiří BÍLA1

Noriyasu HOMMA2

Ricardo RODRIGUEZ1

1Czech Technical University in Prague

2Tohoku University, Japan

PROSPECTS OF GRADIENT METHODS FOR NONLINEAR CONTROL

• We consider sample-by-sample adaptation of discrete-time models and controllers by gradient descent

$$w_i(k+1) = w_i(k) - \mu\,\frac{\partial Q(k)}{\partial w_i}\,,\qquad Q(k) = e(k)^2$$

w_i … the i-th adaptable parameter of a model or controller

(weight update system)
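As a sketch, this sample-by-sample update for a model that is linear in its parameters can look as follows (names are illustrative; the quadratic cost Q(k) = e(k)^2 is assumed):

```python
import numpy as np

def gd_step(w, x, y_real, mu=0.01):
    """One sample-by-sample gradient-descent update of the weights w.

    For a linear-in-parameters model y = w @ x and Q(k) = e(k)^2,
    dQ/dw_i = -2*e*x_i, so w_i(k+1) = w_i(k) - mu * dQ(k)/dw_i.
    """
    e = y_real - w @ x        # output error e(k)
    return w + 2 * mu * e * x, e
```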

• Monitoring and maintaining the stability of the weight update system of adaptively tuned models and controllers contributes significantly to a stable and convergent control loop

PROSPECTS OF GRADIENT METHODS FOR NONLINEAR CONTROL

• In the paper, we derive a stability condition for gradient-descent-tuned models and controllers

• The approach is valid for models and controllers that are nonlinear (incl. linear), but linear in their parameters
– Not suitable for conventional neural networks (MLP, RBF)
– Suitable for Higher-Order Neural Units (HONU, also known as polynomial neural networks), but not limited to them

PROSPECTS OF GRADIENT METHODS FOR NONLINEAR CONTROL

Further in this presentation

• Fundamental gradient descent schemes for adaptive identification and control

• Static or dynamic Higher Order Neural Units (HONU)

• Stability conditions for static and dynamic HONU and their maintenance at every adaptation step

• Demonstration of achievements with HONU (NOx prediction – EME I, lung motion prediction, nonlinear control loop of a laboratory system)

Fundamental gradient descent schemes for adaptive identification and control

Plant Identification by Gradient Descent

[Figure: block diagram; the control variable u(k) drives both the plant and its adaptive model (linear, or a neural network), the plant output y_real(k) is compared with the model output y(k), and the error e(k) = y_real(k) - y(k) drives the weight update system

$$w_i(k+1) = w_i(k) - \mu\,\frac{\partial e(k)^2}{\partial w_i}$$

w … neural weights (adaptable parameters), u … control variable]
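A minimal sketch of this identification scheme, wrapping the same update in a loop over recorded data (the regressor built from past control inputs is an assumption for illustration):

```python
import numpy as np

def identify_plant(u, y_real, n_w=4, mu=0.01):
    """Sample-by-sample plant identification by gradient descent.

    u, y_real : recorded control input and plant output sequences
    n_w       : number of adaptable model weights
    """
    w = np.zeros(n_w)
    for k in range(n_w, len(u)):
        x = u[k - n_w:k]           # regressor of past control inputs
        e = y_real[k] - w @ x      # identification error e(k)
        w = w + 2 * mu * e * x     # w_i(k+1) = w_i(k) - mu * d e(k)^2 / d w_i
        # here: monitor stability of the weight update system (see later slides)
    return w
```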

(Fundamental schemes of adaptive identification and control by gradient methods)

Automatic Tuning of an Adaptive State Controller

[Figure: block diagram; the setpoint v(k) feeds a reference model (the desired behavior of the controlled plant) that produces y_desired(k), which is compared with the plant output y_real(k), and the error e(k) drives the weight update system

$$w_i(k+1) = w_i(k) - \mu\,\frac{\partial e(k)^2}{\partial w_i}$$

of the adaptive controller (linear, polynomial, or a classical neural network); w … adaptable parameter (weights of neural networks), v … desired value]

Fundamental gradient descent schemes for adaptive identification and control (continue)

Tuning of an Adaptive Controller in a Feedback Control Loop by Gradient Descent

[Figure: block diagram; the desired value v(k) feeds a model of desired behavior that produces y_desired(k), which is compared with the plant output y_real(k), and the error e(k) drives the weight update system

$$w_i(k+1) = w_i(k) - \mu\,\frac{\partial e(k)^2}{\partial w_i}$$

of the adaptive controller (linear PID, or a neural network); w … neural weight (adaptable parameter), v … desired value]

Fundamental gradient descent schemes for adaptive identification and control (continue)

Updating Control Inputs Directly by Gradient Descent

[Figure: block diagram; the control input u(k) drives both the plant and its adaptive model (linear, or a neural network); the model error e(k) = y_real(k) - y(k) adapts the model weights as above, while the control error e_C(k) = y_desired(k) - y(k) updates the control input directly:

$$u_c(k+1) = u_c(k) - \mu\,\frac{\partial e_C(k)^2}{\partial u_c}$$
]
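A sketch of the direct input update, under the assumption that the adaptive model is linear in parameters with the current input u_c as one regressor entry, so the gradient of e_C(k)^2 with respect to u_c is available through the model (names are illustrative):

```python
import numpy as np

def update_control_input(u_c, w, x, i_u, y_desired, mu=0.01):
    """One gradient-descent update of the control input u_c.

    The adaptive model is y = w @ x with x[i_u] = u_c, hence
    dy/du_c = w[i_u] and d e_C^2/du_c = -2 * e_C * w[i_u].
    """
    x = np.asarray(x, dtype=float).copy()
    x[i_u] = u_c
    e_c = y_desired - w @ x              # control error e_C(k)
    return u_c + 2 * mu * e_c * w[i_u]   # u_c(k+1) = u_c(k) - mu * d e_C(k)^2 / d u_c
```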

The question is:

• How do we assure stability of a nonlinear adaptive control loop?
• The way is to assure stability and convergence of the adaptive components in the control loop (plant model + controller)
• What nonlinear model to use?

• MLP or RBF networks as models and controllers
– Not linear in parameters
– Guaranteeing stability is complicated (not suitable for the undergraduate level, difficult for PhD students from schools without a heavy mathematical background)
– Guaranteeing stability is complicated and theoretically heavy for practitioners (thus not attractive for practice)

Static & Dynamic Higher-Order Neural Units

How do we assure stability of the nonlinear adaptive control loop? What model to choose?


Weight-update system:

$$w_{i,j}(k+1) = w_{i,j}(k) - \mu\,\frac{\partial e(k)^2}{\partial w_{i,j}}$$

Example of a 2nd-order HONU (quadratic neural unit, QNU):

$$y(k+n_s) = \sum_{i=0}^{n}\sum_{j=i}^{n} x_i\, x_j\, w_{i,j}\,,\qquad \mathbf{x}(k) = [\,1,\ y(k-1),\ldots,\ u(k),\ldots\,]^T$$

In long-vector notation:

$$y = [\,x_0x_0,\ x_0x_1,\ x_0x_2,\ \ldots,\ x_ix_j,\ \ldots,\ x_nx_n\,]\cdot[\,w_{0,0},\ w_{0,1},\ w_{0,2},\ \ldots,\ w_{i,j},\ \ldots,\ w_{n,n}\,]^T$$
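A sketch of the QNU forward pass in this long-vector notation; rowx collects the upper-triangular quadratic terms and colW the corresponding weights (the construction is assumed from the formula above):

```python
import numpy as np

def rowx_quadratic(x):
    """Row vector of quadratic terms x_i * x_j for j >= i (x[0] = 1 is the bias)."""
    n = len(x)
    return np.array([x[i] * x[j] for i in range(n) for j in range(i, n)])

def qnu_output(x, colW):
    """QNU output y = rowx . colW."""
    return rowx_quadratic(x) @ colW

# Usage: x(k) = [1, y(k-1), y(k-2), u(k)] gives 4 inputs -> 10 quadratic terms
x = np.array([1.0, 0.5, -0.2, 0.8])
y = qnu_output(x, np.zeros(10))
```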

[Figure: sketch of the error measure e(k)² along the “axis of adapted neural weights” for an LNU, a HONU, and a conventional NN.]

The approximation strength of neural networks can be improved by adding more neurons or even layers, or by GA, PSO, …

Static & Dynamic Higher-Order Neural Units (continue)

Sketch of optimization error surfaces: Linear vs. MLP networks vs. HONU

Static & Dynamic Higher-Order Neural Units (continue)

Static MLP vs. QNU as MISO models of hot steam turbine averaged data (“steady states”, batch training by Levenberg-Marquardt)

[Figure: model outputs vs. measured data; legend: double hidden layer FFNN, single hidden layer FFNN, static QNU, measured data.]

Static & Dynamic Higher-Order Neural Units (continue)

Respiration time series: training accuracy for predicting exhalation time; instances of neural architectures trained from different initial conditions by the L-M algorithm

[Figure: training accuracy of 2-hidden-layer static MLPs (static feedforward networks), 1-hidden-layer static MLPs (static feedforward networks), and static QNUs.]

[Figure: training error J vs. number of training epochs on three tasks: training of Mackey-Glass prediction, training of lung position prediction, and training of nonlinear periodic signal prediction; curves J_RNN, J_DLNU, J_DQNU.]

Static & Dynamic Higher-Order Neural Units (continue)

$$y = w_{0,0}x_0x_0 + w_{0,1}x_0x_1 + w_{0,2}x_0x_2 + \ldots + w_{i,j}x_ix_j + \ldots + w_{n,n}x_nx_n$$

$$y = [\,x_0x_0,\ x_0x_1,\ x_0x_2,\ \ldots,\ x_nx_n\,]\cdot\begin{bmatrix}w_{0,0}\\ w_{0,1}\\ w_{0,2}\\ \vdots\\ w_{n,n}\end{bmatrix} = \mathbf{rowx}\cdot\mathbf{colW}$$

$$y(k+n_s) = \sum_{i=0}^{n}\sum_{j=i}^{n} x_i\, x_j\, w_{i,j}\,,\qquad \mathbf{x}(k) = [\,1,\ y(k-1),\ldots,\ u(k),\ldots\,]^T$$

Static & Dynamic Higher-Order Neural Units (continue)

Stability of weight-update system

• Condition for a STATIC HONU:

$$\big\|\,\mathbf{1} - \mu\,\mathbf{colx}(k)\cdot\mathbf{rowx}(k)\,\big\| \le 1$$

• Condition for a DYNAMIC HONU: an analogous bound on the transition matrix of the weight-update system, into which the error e(k + n_s) and the derivatives of rowx(k) with respect to colW additionally enter, because the recurrent inputs of a dynamic HONU depend on the weights.
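A sketch of maintaining the static-HONU condition at every adaptation step; the transition matrix of the weight-update system is checked by its spectral radius, and the learning rate mu is shrunk when the bound is violated (the halving strategy is an assumption for illustration; rowx_quadratic is the QNU sketch above):

```python
import numpy as np

def stable_qnu_step(colW, x, y_real, mu=0.01):
    """One GD step for a static QNU with stability monitoring.

    With e(k) = y_real - rowx @ colW and colW += mu * e * rowx, the
    weight-update system is colW(k+1) = (1 - mu*colx@rowx) colW(k) + const,
    so it is kept stable while rho(1 - mu*colx@rowx) <= 1.
    """
    rowx = rowx_quadratic(x)
    M = np.eye(len(rowx)) - mu * rowx[:, None] * rowx   # transition matrix
    while max(abs(np.linalg.eigvals(M))) > 1.0:         # spectral radius check
        mu *= 0.5                                       # maintenance: shrink mu
        M = np.eye(len(rowx)) - mu * rowx[:, None] * rowx
    e = y_real - rowx @ colW
    return colW + mu * e * rowx, e, mu
```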


[Figure: one epoch of GD adaptation of a recurrent QNU to predict the Mackey-Glass equation (training data vs. neural output); the prediction error during the epoch; and the spectral radius during the epoch (stability of the weight update system at each adaptation step), staying between 0.98 and 1.04.]

[Figure: GD adaptation of a recurrent QNU to predict the Mackey-Glass equation over k = 0 … 800 (training data vs. neural output); the prediction error during adaptation; and the spectral radius during adaptation (stability of the weight update system at each adaptation step).]

[Figure: detail of the same adaptation for k = 600 … 800: training data vs. neural output, the prediction error, and the spectral radius (stability of the weight update system at each adaptation step).]

Achievements with QNU

NOx, CO prediction – EME I

[Figure: measured vs. predicted NOx signal over t = 250 … 400 min, with training and testing portions marked.]

Fig. 1: A well-trained TptRNN network for the 3-minute prediction of moving 3-minute averages of NOx; the external measured inputs are the dampers and the power output. (The moving averages are computed as averages of the previous, current, and following values; with a 3-minute prediction interval this means that the external inputs are already available, while the model in principle predicts the 3-minute average that is due in 2 minutes.) At around time 415 the network ignores an outage of the NOx measurement, and the model output substitutes the measurement well.

Lung Tumor Motion Prediction

[Figure: measured tumor motion components y1, y2, y3 over k = 0 … 3500 along the lateral axis [mm], the cephalocaudal axis [mm], and the anteroposterior axis [mm].]

Lung Tumor Motion Prediction by static QNU

sampling 15 Hz, epochs = 100, Ntrain = 360, 492 neural weights

[Figure: predicted vs. measured motion and the absolute value of the prediction error over t = 20 … 120 s; testing MAE = 0.8531 mm, RMSE = 1.1414, treatment time = 86 s, computing time = 83.385 s.]

[Figure: averaged normalized SSE of retrainings vs. epochs (10^0 … 10^2).]

Nonlinear Control Loop of a Laboratory System

[ ] Ladislav Smetana: Nonlinear Neuro-Controller for Automatic Control, Laboratory System, Master’s Thesis, Czech Tech. Univ. in Prague, 2008.

Nonlinear Control Loop of a Laboratory System

PID Control and Nonlinearity of the Plant

[Figure: PID control responses y [cm] vs. t [s] as a function of the bathyscaphe immersion depth (5, 10, 15, 20, 25 cm); left: PID controller tuned for a depth of 20 cm, right: PID controller tuned for a depth of 10 cm.]

Nonlinear Control Loop of a Laboratory System

[Figure: comparison of the linear PID controller (responses as above, tuned for 20 cm and for 10 cm) with a QNU as an adaptive controller (simplest gradient descent); neuro-controller control responses y [cm] vs. t [s] for bathyscaphe immersion depths of 5, 10, 15, 20, 25 cm.]

False Neighbor Analysis is a single-scale analysis

[Figure: a mapping y = f(x); distances ||x(j) - x(i)|| between input points are compared with distances ||y(j) - y(i)|| between the corresponding outputs; x … input data, y … output data.]

To train neural networks, the input (state) vector must be estimated so as to minimize uncertainty in the training data.

False Neighbors

IF ||x1 - x2|| < Rx AND ||y1 - y2|| > Ry, THEN x1 and x2 are False Neighbors

=> How large should Rx and Ry correctly be? We do not know.

=> Let’s characterize false neighbors over the whole intervals of Rx and Ry, and not just for a single setup.
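A direct sketch of this definition for one fixed setup (Rx, Ry) (array shapes are illustrative; scalar outputs are assumed):

```python
import numpy as np

def count_false_neighbors(X, Y, Rx, Ry):
    """Count pairs (i, j) with ||x_i - x_j|| < Rx AND |y_i - y_j| > Ry."""
    fn = 0
    for i in range(len(X)):
        for j in range(i + 1, len(X)):
            if np.linalg.norm(X[i] - X[j]) < Rx and abs(Y[i] - Y[j]) > Ry:
                fn += 1   # x_i and x_j are False Neighbors
    return fn
```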

False Neighbor Analysis is a single-scale analysis

Slope of FN in a log-log plot:

FN = 4.2239*log2(id) - 4.5879

[Figure: log2(FN) vs. log2(id) with the linear fit above.]

$$q(k) = c \cdot r(k)^{H}$$

$$\log q(k) = \log c + H\,\log r(k)$$

MULTI-SCALE ANALYSIS approach (MSA)

• What is the fundamental idea?
• To characterize a system over the range of setups
• The power law $q(k) = c \cdot r(k)^{H}$

[Figure: bar chart of the number of false neighbours on the main diagonal; FN vs. id … index of a diagonal cell, id = 1 … 6.]

MULTI-SCALE ANALYSIS approach (MSA)

• What is the fundamental idea?

$$q(k) = c \cdot r(k)^{H}$$

q … quantity, H … characterizing exponent, r(k) … discretely growing radius, e.g. r(k) = 2, 4, 8

• To characterize a system over the range of intervals
• The power-law concept

MULTI-SCALE ANALYSIS approach (MSA)

• What is the fundamental idea?

k   r(k)   qA   qB
1    2      4    2
2    4     13   11
3    8     44   44

log2(qA) = 1.7297*log2(r) + 0.2605
log2(qB) = 2.2297*log2(r) - 1.1531

[Figure: log-log plot of both fits over log2(r(k)) = 1 … 3.]

$$q(k) = c \cdot r(k)^{H}$$
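As a sketch, the characterizing exponent H can be recovered from such data by a least-squares fit in log-log coordinates; here with the qA column of the table above:

```python
import numpy as np

r = np.array([2.0, 4.0, 8.0])     # discretely growing radius r(k)
qA = np.array([4.0, 13.0, 44.0])  # measured quantity q(k), column qA above

# log2 q = log2 c + H * log2 r  ->  straight line in the log-log plot
H, log2_c = np.polyfit(np.log2(r), np.log2(qA), 1)
print(H, log2_c)  # ~1.7297 and ~0.2605, matching the fit on the slide
```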

MULTI-SCALE ANALYSIS approach (MSA) (cont.)

• How can MSA help to create better neural network models?

False Neighbors Matrix: FN(i, j) … count of False Neighbors for Rx(i) and Ry(j)

[Matrix sketch: rows Rx(i), i = 1 … 5; columns Ry(j), j = 1 … 5; maximum FN at i = j = 1 (highest chance that y1 ≠ y2 when x1 = x2), minimum FN at i = j = 5 (lowest chance that y1 ≠ y2 when x1 = x2); FN decreases along the diagonal through FN(2,2), FN(3,3), FN(4,4).]

Smallest Rx … maximum of different states of a system
Largest Rx … minimum of different states of a system
Smallest Ry … maximum of recognized different outputs
Largest Ry … minimum of recognized different outputs

Multiscale False Neighbor Approach
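A sketch of the False Neighbors Matrix over whole intervals of Rx and Ry, reusing the single-setup counter sketched earlier (the ranges are illustrative):

```python
import numpy as np

def fn_matrix(X, Y, Rx_values, Ry_values):
    """FN(i, j) ... count of False Neighbors for Rx(i) (rows) and Ry(j) (columns)."""
    return np.array([[count_false_neighbors(X, Y, Rx, Ry) for Ry in Ry_values]
                     for Rx in Rx_values])

# Usage with 5 radii per axis, as in the matrix sketch above:
# FN = fn_matrix(X, Y, np.linspace(0.1, 1.0, 5), np.linspace(0.1, 1.0, 5))
```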

MULTI-SCALE ANALYSIS approach (MSA) (cont.)

• What are other potentials of MSA for signal processing?

• MSA-based signal processing
• Variance Fractal Dimension Trajectory (VFDT)
• Mutual Information
– A multiscale approach to calculating mutual information itself
– Mutual information of VFDT-processed signals
• Everywhere where a common analysis is subject to a single-parameter setup and changing the setup disqualifies the analysis results.
