2008 "an overview of methods for analysis of identifiability and observability in nonlinear...
DESCRIPTION
2008. Trial lecture, pHd-defence Steinar M. ElgsæterTRANSCRIPT
1
An overview of Methods for analysis of Identifiability and Observability in Nonlinear State and Parameter Estimation
Steinar M. ElgsæterTrial Lecture - October 14 2008
2
Emphasis of this presentation
• identifiability and observability in the context of industrial processes, where even simplified models may have 30 or more states.
• emphasis on recent advances in computational methods (many motivated by applications in systems biology)
3
Presentation overview
• what is identifiability and observability?• methods for testing identifiability
– structural identifiability
– ”practical” identifiability
• methods for testing observability• discussion/conclusions
4
What is Identifiability and Observability?
5
Example: An oil well
Reservoir
Oil,water and gas rates
Choke
Well
Pipeline
Pressures and temperature
Density
Chokeopening
Unmeasured
Measured
6
Internal system
dynamics
un-modeled disturbances
output(y)input(u)Map
internal states (x)
System
7
internal states (x)
modeled internal states (x)
An open-loop ”ballistic” state estimator
Internal dynamics
measured output (y)Map
Modeled internal
dynamics
modeled output(y)
input(u)
Map
Plant
Model
Fitted parameters (θ)
8
Closing the loop improves estimates in the face of uncertainty or disturbances
Plant
Model
State Estimator
-
input output
state injection term
fitted state
Plant
Model
Parameter Estimator
-
input output
parameter injection term
fitted parameter
Duality
Feedback loop
9
Parameter estimation and identifiability• ”Whether the identification procedure will yield a
unique value of the parameter θ, and/or whether the resulting model is equal to the true system.”1
• identifiability both a property of the model structure (1) and the data set (2) 1 :
1. no two parameter values should give the same model
2. the data should be informative enough to distinguish between non equal models
1. Ljung,(1999) System Identification: Theory for the user
10
State estimation and observability• observability
– loosely: ”is it possible to determine the initial state of a system by observing its input and output over some period”?
– different definitions for nonlinear systems depending on the theoretical approach chosen
11
Observability in linear systems
• definition: A linear system is observable if any unknown initial state can be determined uniquely by observing inputs and outputs over some time interval
• observability can be tested for linear systems using linear algebra (for instance by the rank of the observability matrix or Gramian)
• note that observability does not depend on trajectory for linear systems
1. Ljung,(1999) System Identification: Theory for the user
12
Kalman Filter• often used to simultaneously estimate both parameters
and states • assumes white, zero-mean measurement and process
noise,– implies no structural mismatch
• requires linear state and measurement models• ad-hoc adjustment of noise covariances often required
when applied to complex, nonlinear chemical processes– divergence problems often cited(1 and references therein)
1. Wilson, D.I., Agarwal, M, Rippin, D.W.T “Experiences implementing the extended Kalman filter on an industrial batch reactor”(1998), Computers Chem Engng, Vol 22, No.11, pp.1653-1672
13
Flavors of nonlinear filtersType Constrained Solution Model considered
Linearized Kalman Filter
Unconstrained Explicit 1.Order local approximation
Extended Kalman Filter
Unconstrained Explicit 1. Order local approximation
Unscented Kalman Filter
Unconstrained Explicit 2.Order local approximation
Moving Horizon Estimator
Constrained Numerical Full model
14
Parameter estimation a special case of state estimation
• theoretically θ is a special kind of state where• the distinction between identifiability and observability
may be slightly artificial, – many methods discussed are extendable to both types of analysis
– parameters are estimated alongside states in Kalman filters in practice anyway...
15
Why is observability and identifiability an issue?• less instruments than one would like
– extreme pressures, temperatures or inaccessibility
• overparametrized models• many simplified process models are still nonlinear
and have 30 states or more– while linear models can be easily both solved and analyzed
analytically, nonlinear process control models can usually not even be solved analytically
16
Methods for testing identifiability
17
Different methods for different aspects of identifiability
1. a priori ”structural identifiability” 1. sensitivity analysis
2. empirical observability Gramians
2. a posteriori ”practical identifiability” including data1. asymptotic analysis (local)
2. Alternating Conditional Expectation Algorithm
3. algebraic analysis
18
Structural identifiability
19
Local Sensitivity Analysis1
• which parameters can be uniquely fitted against data?
• sensitivity matrix:
– can rank parameters to find sets which have high sensitivity and low collinearity
– can be assessed locally and numerically for a range of (u,θ) for nonlinear systems
1. Lund, B.F. (2005) ”Rigorous simulation models for improved process operation” (PhD thesis)
20
Local Sensitivity Analysis(2)• advantage:
– based on intuitive principles
– sensitivity matrices are calculated quickly and easily
• disadvantages:– physical insight required to choose candidate operating points (u,θ)
– interpreting sensitivity matrices may get technical
– an iterative approach may be required to find a suitable parameter set
– linear and local analysis of nonlinear systems
21
Empirical Observability Gramians(1)• a linear stable system is observable if the observability
Gramian, an energy function, is of full rank • applying energy functions directly to nonlinear systems
is computationally intensive• if nonlinear systems could be linearized, to determine
linear observability Gramians, but significant dynamic effects could be neglected
• empirical Gramians originally introduced for model reduction1
1. Singh, A. K. and Hahn, J. On the use of empirical gramians for controllability and observability analysis. In Proc. 2005 American Control Conference (ACC). Portland,OR, USA, 2005 pp. 140-141
22
Empirical Observability Gramians(2)1
• approximation of analytical Gramian, valid for a region
• reduces to linear Gramian for linear systems• both observability and identifiability2
1. J. Hahn, T.F. Edgar, (2002)“An improved method for nonlinear model reduction using balancing of empirical gramians”, Computers and Chemical Engineering, Vol. 26, 2002, pp. 1379-1397.
2. Geffen, D. (2008) Parameter Identifiability of Biochemcial Reaction Networks in Systems Biolgy (Master Thesis) , Queens University, Ontario
”Regional” approximation found from simulated perturbations of nonlinear model
23
”Practical” identifiability
24
Asymptotic analysis1
• distribution of covariance for– asymptotic analysis (large N)
– covariance matrix found from a second order Taylor approximation of the model around an specific parameter value
– quality tag or confidence intervals for fitted parameters
– estimated from data
– for linear systems with invertible covariance matrix:
1. Ljung,(1999) System Identification: Theory for the user
25
Shortcomings of asymptotic analysis• N is never infinite, it is finite and often small• for nonlinear models, analysis is only local
– which operating point(s) to analyze?
• calculations require inverting matrices which are close to singular if information content is low, – covariance estimates may be inaccurate when covariance is large
26
Parameter uncertainty estimates can assess how close to identifiability a model/dataset is
• covariance-matrix analysis• Leave-Out-Sign-Dominant Correlation Regions1
• brute-force computational methods (bootstrapping2)
1. Campi, M.C. and Weyer, E. (2005) ”Guaranteed non-asymptotic confidence regions in system identification”, Automatica, 41(10), 1751-1764
2. Efron, B. and Tibshirani, R. J. (1993) An Introduction to the Bootstrap, Chapmann & Hall
27
Alternating Conditional Expectation Algorithm1
• principle: non-identifiabilities in nonlinear dynamical model reveal themselves in functional dependencies between parameters– model fitted repeatedly in bootstrapping to
data to generate a set of fitted parameters
– functional dependencies in this set are identified using ACE methods2
1. S. Hengl, C. Kreutz, J. Timmer and T. Maiwald (2007) Data-based identifiability analysis of non-linear dynamical models,Bioinformatics, Vol. 23 no. 19, pp 2612–2618
2. Breiman,L. and Friedman,J. (1985) Estimating optimal transformations for multiple regression and correlation. J. Am. Stat. Assoc., 80, 580–598.
28
Methods for testing observability*
* and identifiability as a special case
29
Observability tests in nonlinear systems• directions:
– differential geometry: Indistinguishable Dynamics rank condition (not considered)
– differential algebra: algebraic observability rank condition
• note: observability of nonlinear systems will depend on trajectory in general
30
Differential algebra: algebraic observability• ”The state variable xi is observable if there exists an
algebraic relation that binds xi to inputs, outputs and a finite number of their derivatives.”1
• Example2
1. Diop, S. and Fliess, M. On nonlinear observability In Proceedings of the first european control conference (Grenoble, France, July 2-5 1991), C.Commault and coll., vol. 1., Hermès, pp.152-157
2. Anguelova 2004, Nonlinear Observability and Identifiability: General Theory and a Case Study of a Kinetic Model for S. cerevisiae (Master Thesis), Chalmers, Göteborg University
31
Differential algebra: algebraic observability• the algebraic observability of the
system
Σ requires the full rank of matrix,
where n is the number of states.1
• possible to pinpoint unobservable states by removing columns of matrix and re-calculating rank iteratively
1. Diop, S. and Wang.Y (1993) ”Equivalence between algebraic observability and local generic observability ”, in Proc. 32nd Conference on Decision and Control
32
Local Algebraic Observability (3)...• for bilinear models, if more than two derivations are
required express a model in terms of derivates of y and u, numerical solution may be required (3.order polynomials)
33
Computational algebraic observability analysis• symbolic calculations of Lie-derivatives not possible in
polynomial time– instead ”semi-numerical” algorithm, implemented in Maple1
– applies only to rational control systems– specializes parameters to random integer values and inputs to power series of
t with integer coefficients– achieves polynomial run-time
• algorithm output– a positive observability test is always correct– a negative observability test is correct with a high probability– non-observable states and parameter symmetries determined in exponential
time
1. Sedoglavic A. 2002, A probabilistic algorithm to test local algebraic observability in polynomial time, Journal of Symbolic Computation 33:735-755
34
Computational algebraic observability analysis(2)• only one application of CAOA found in
literature1
– case study of kinetic model for metabolic dynamics in biochemistry
• Sedoglavic algorithm applied:– 21 states, all states measured, ~100 parameters– 2 non-identifiable parameters identified– two ”symmetries” identified, including 11
parameters, – one parameter in each symmetry must be fixed
to make model identifiable
1. Anguelova 2004, Nonlinear Observability and Identifiability: General Theory and a Case Study of a Kinetic Model for S. cerevisiae (Master Thesis), Chalmers, Göteborg University
35
Determining observability robustness through simulations1
• observability Jacobian determined for different initial conditions and different parameter values in simulations
• did not relate findings to individual states
1. Dafis C.J, Nwankpa, C.O. (2005) Characteristics of Degree of Observability Measure for Nonlinear Power Systems, Proc. 38th Hawaii Int. Conf. on System Sciences
36
Discussion/Conclusions
37
”Observability in practice”
• observability is currently treated as– an ”on-off” property’
– given perfect measurements
• in practice states are not equally observable.• it would be useful to develop a ”measure of
observability” for each individual state which assesses for instance – sensitivity to non-idealness of measurements (noise)
– sensitivity to operating point/parameter value
– sensitivity to excitation
38
Conclusion
• several recent advances in computational methods for observability analysis have been made:– empirical Observability Gramian
– Sedoglavic algorithm for algebraic observability
– Alternating Conditional Expectation algorithm
• these methods appear well-suited to process control – computing time is ”cheap”
– ”turnkey methods”: computational methods are not model-specific, applying methods in practice need to be laborious
39
Conclusion(2)
• application of computational observability analysis in process control– no reference of such applications found in literature
– advances in field mean that timing may be good for a case study in process control
– a research topic well-suited to industry-university collaboration (Master thesis topic?)
– mainly of interest for industry which has design of nonlinear observers for process control as core business (narrow)
40
Thank you for your attention