a plea for adequate data anali thdlysis methodsconf.ncree.org.tw/proceedings/0-i0990506/05...
TRANSCRIPT
-
A plea for adequate data l i th danalysis methods
Norden E. Huang
Research Center for Adaptive Data AnalysisNational Central University
Chungli, Taiwan
MCMD Conference Taipei, Taiwan 2010
-
A plea for adequate data analysisA plea for adequate data analysis
• Whatever we want to do related to MCMD:– On forecast hazard occurrence– On increase Infrastructure resiliency
On quantify risk and– On quantify risk, and – On minimize hazard impact
• We need good data and more importantly• We need good data and, more importantly, adequate data analysis methods.
-
With the advance of IT and sensor technologies, there is an explosion of dataan explosion of data.
We are now drowning in data yet still thirsty for informationyet still thirsty for information.
-
Scientific ActivitiesScientific Activities
Collecting, analyzing, synthesizing, and theorizing are the core of scientifictheorizing are the core of scientific activities.
Data analysis is a key link in this y ycontinuous loop.
Data is the link to reality
-
Data analysis for engineeringData analysis for engineering
• Data for design specifications
• Data for health monitoring
• Data for quantify risk
• Data to minimizing hazard impacts
-
Example: Health monitoringExample: Health monitoring
• Most natural hazards are hard to predict; therefore, it is critical that we should increase the structures resiliency to withstand the impacts of the hazards. The easiest and most logic ways to maximize structural resiliency are these:
– To design the structure intelligently, and – To keep the structure in healthy conditions– To keep the structure in healthy conditions
and good repair.
-
Land Slide damagesLand Slide damages
Photograph by R.L. Schuster, U.S. Geological Survey, 1995.) g y )
-
Earthquake Damage(www.cedim.de)
-
Hurricane DamagegSouthern Grenada after Hurricane Ivan (Photo WW-2004).
-
Data AnalysisData Analysis
Data analysis is too important to be left to th th ti ithe mathematicians.
Why?!y
-
Different Paradigms IMathematics vs. Science/Engineering
• Mathematicians • Scientists/Engineers
• Absolute proofs • Agreement with observations
• Logic consistency • Physical meaning
• Mathematical rigor • Working Approximations
-
Data Processing vs AnalysisData Processing vs. Analysis
In pursue of mathematic rigor and certainty, however, we lost sight of physics and are forced to idealize, but also deviate from, the reality.
As a result, we are forced to live in a pseudo-real world, in which all processes areworld, in which all processes are
Li d St tiLinear and Stationary
-
On nonlinear oscillations
The key to structure health monitoringy g
-
How to define nonlinearity?
Based on Linear Algebra: nonlinearity isBased on Linear Algebra: nonlinearity is defined based on input vs. output.
But in reality, such an approach is not practical. The alternative is to define pnonlinearity based on data characteristics.
-
Linear Systems
Linear systems satisfy the properties of superposition and scaling Given two valid inputsand scaling. Given two valid inputs
x ( t ) and x ( t )1 2
as well as their respective outputs
y ( t ) H{ x ( t )} and=y ( t ) H{ x ( t )} and y (t) = H{ x ( t )}
=1 1
2 2
then a linear system must satisfy
y ( t ) y ( t ) H{ x ( t ) x ( t )}α β α β+ +
for any scalar values α and β.
y ( t ) y ( t ) H{ x ( t ) x ( t )} α β α β+ = +1 1 1 2
-
Characterizing Nonlinear Processes C a acte g o ea ocessesfrom Data
2d x 32
d x x cos tdt
x γ ωε+ + =
( )2
22d x x cos t
dt1 x γ ωε⇒ + =+
Spring with position dependent constai f d l
nt,i
⇒intra-wave frequency m odula ;therefore, we need instantaneous fre
tio.
nquency
-
Duffing Pendulum: Intra-Wave Frequency Modulation
x
2
22( co .) s1d x x tx
d tε γ ω=++
-
Duffing Equation : Datag q
-
Duffing Type WaveDuffing Type WavePerturbation Expansion
For 1 , we can haveε
-
Duffing Type WaveData: x = cos(wt+0.3 sin2wt)
-
Hilbert Transform : DefinitionpFor any x( t ) L ,∈
1 x( )y( t ) d ,tτ
τ τπ τ
= ℘−∫
then , x( t )and y( t ) form the analytic pairs:
i ( t )z( t ) x( t ) i y( t ) ,a( t ) e θ= + =
( )2 2 1 / 2 1where
y( t )a( t ) x y and ( t ) tan .θ −= + =( )( ) y ( ) x( t )
-
Instantaneous Frequency
distanceVelocity ; mean velocitytime
=
dxNewton vdt
⇒ =dt
1Frequency ; mean frequency=Frequency ; mean frequencyperiod
dθ
=
dHHT defines the phase functiondtθω⇒ =
So that both v and can appear in differential equations.ω
-
The combination of Hilbert Spectral Analysis and Empirical Mode Decomposition is designated as
HHT(HHT FFT)(HHT vs. FFT)
-
Comparison between FFT and HHT
1 . F F T :
ji tj
j
x ( t ) a e .ω= ℜ ∑j
2 . H H T :
jt
i ( ) d
x ( t ) a ( t ) eω τ τ∫
ℜ ∑ tjj
x ( t ) a ( t ) e .= ℜ ∑
-
Comparisons: pFourier, Hilbert & Wavelet
-
Speech AnalysisSpeech AnalysisHello : Data
-
Four comparsions D
-
Hilbert’s View onHilbert s View on Nonlinear Data
-
Duffing Type WaveDuffing Type WavePerturbation Expansion
For 1 , we can haveε
-
Duffing Type WaveDuffing Type WaveWavelet Spectrum
-
Duffing Type WaveDuffing Type WaveHilbert Spectrum
-
Duffing Type Wave: Marginal SpectrumDifferent harmonics from different Method: not physical
-
Degree of Nonlinearity
Quantify rather than qualifyQuantify, rather than qualify,nonlinearity
-
Degree of nonlinearityDegree of nonlinearity
Consider x( t ) cos( t sin t ) ω ωε η= +
( )cdIF os t= 1 ηε ηθ ωω⇒ = +( )c IF os t= 1dt
ηε ηωω⇒ = +
1 / 22
zIF IFDN (Degree of Nolinearity )I 2F
ηε⎛ ⎞−= ⎜ ⎟
⎝=
⎠zI 2F⎝ ⎠
-
Degree of NonlinearityDegree of Nonlinearity
• We can determine DN precisely with HilbertWe can determine DN precisely with Hilbert Spectral Analysis.
• We can also determine ε and η separately• We can also determine ε and η separately.• η can be determined from the frequency
mod lationmodulation.• ε can be determined from DN with η given.
-
Stokes Waves
22d x x x cos tε γ ω+ + =2 x x cos tdt
ε γ ω+ +
( )2d 1x ( )2 1x cos tdt x ωε γ+ + =
-
Stokes ee= +0.375; DN=0.0407
2S to k e s : e e = + 0 .3 7 5
1
0
1
e
1
0
mpl
itude
- 1Am
-2
0 5 0 1 0 0 1 5 0 2 0 0-3
T im e : s e c
-
Stokes ee= +0.375; DN=0.0893
2Stokes : ee=+0.375; IF,RZF;IWM
1.5
2
1
ency
: H
z
0.5
us F
requ
e
0
stan
tane
o
-0.5
Ins
IF QuadIF RZFIWM
0 20 40 60 80 100 120 140 160 180 200-1
Tim e : sec
IWM
-
Stokes ee=-0.375; DN=0.0407
3Stokes : ee=-0.375
2
1
2
e
0
1
mpl
itude
0Am
-1
0 50 100 150 200-2
Time : sec
-
Stokes ee=-0.375; DN=0.0407
2S to k e s : e e = -0 .3 7 5 ; IF ,R ZF ; IW M
1 .5
1
uenc
y : H
z
0 .5neou
s Fr
equ
IF Q u a dIF R ZFIW M
0 .5
Inst
anta
n
0
0 2 0 4 0 6 0 8 0 1 0 0 1 2 0 1 4 0 1 6 0 1 8 0 2 0 0-0 .5
T im e : s e c
-
Duffing Wavesg
23d x x x cos tε γ ω+ + =2 x x cos tdt
ε γ ω+ +
( )2
2d x 1( )2 2x cos1 x tdt ε γ ω++ =
-
6New Duffing : ee= 10, DN = 0.0470
5
4
y : H
z
2
3
Freq
uenc
y
IF Q uadIF ZC
1
2
ntan
eous
DataIF-IFz
0Inst
an
-1
0 10 20 30 40 50-2
Tim e : sec
-
Case Study
A Bridge in Central TaiwanA Bridge in Central Taiwan
-
A Bridge in Southern Taiwan
-
Data : Hsin Nan Bridge Span 2 Vertical
-
Marginal Hilbert Spectra : Hsin Nan Bridge Span 2 V ti lVertical
-
Data : Hsin Nan Bridge Span 3 Vertical
-
Marginal Hilbert Spectra : Hsin Nan Bridge Span 3 VerticalVertical
-
Data : Hsin Nan Bridge Span 4 Vertical
-
Marginal Hilbert Spectra : Hsin Nan Bridge Span 4 V ti lVertical
-
The foundations: the Piers
Check the pilesCheck the piles
-
Data : Bridge Piers 1 & 2
-
Data : Bridge Piers 3 & 4
-
Marginal Hilbert & Fourier Spectra : Bridge Piers 1 & 2 Vertical
-
What This Means
• Instantaneous Frequency offers a total different view for nonlinear data: instantaneousview for nonlinear data: instantaneous frequency with no need for harmonics and unlimited by uncertainty.
• Adaptive basis is indispensable for t ti d li d t l inonstationary and nonlinear data analysis
HHT t bli h di f d t• HHT establishes a new paradigm of data analysis with which we can quantify nonlinearity
-
ComparisonsComparisons
Fourier Wavelet Hilbert
Basis a priori a priori AdaptiveBasis a priori a priori Adaptive
Frequency Integral transform: Global
Integral transform: Regional
Differentiation:Local
Presentation Energy-frequency Energy-time-frequency
Energy-time-frequency
Nonlinear no no yes
Non-stationary no yes yes
Uncertainty yes yes no
H iHarmonics yes yes no
-
On Hazard Occurrence
Global WarmingGlobal Warming
HHT: Trend and prediction
-
IPCC Global Mean Temperature Trend
-
“Note that for shorter recent periods, th l i t i di tithe slope is greater, indicating
accelerated warming.”g
IPCC 4th Assessment Report 2007
-
The State-of-the arts: Trend
“One economist’s trend is another economist’s l ”cycle”
Engle, R. F. and Granger, C. W. J. 1991 Long-run Economic Relationships. Cambridge University Press.
Regression method is arbitrary and ad hoc.
-
Philosophical Problem Anticipatedp p
名不正則言不順名不正則言不順
言不順則事不成言不順則事不成
——孔夫子
-
On Definition
Without a proper definition, logic discourse would be impossible.
Without logic discourse, nothing can be accomplished.
Confucius
-
Definition of the TrendH t l P R S L d 1998Huang et al, Proc. Roy. Soc. Lond., 1998
Wu et al. PNAS 2007
Within the given data span, the trend is an intrinsically fitted monotonic function, or a function in which there can be at most one extremum.
The trend should be an intrinsic and local property of the data; it is determined by the same mechanisms that generate the data.
Being local, it has to associate with a local length scale, and be valid only within that length span, and be part of a full wave length.
Th th d d t i i th t d h ld b i t i i B i i t i iThe method determining the trend should be intrinsic. Being intrinsic, the method for defining the trend has to be adaptive.
All traditional trend determination methods are extrinsicAll traditional trend determination methods are extrinsic.
-
IPCC Global Mean Temperature Trend
-
Comparison between non‐linear rate with multi‐rate of IPCC
0.025Various Rates of Global Warming
0.015
0.02
r
0.005
0.01
Rat
e: o
C/Y
r
-0.005
0
War
min
g R
-0.015
-0.01
1840 1860 1880 1900 1920 1940 1960 1980 2000 2020-0.02
Time : Year
Blue shadow and blue line are the warming rate of non‐linear trend.Magenta shadow and magenta line are the rate of combination of non‐linear trend and AMO‐like components.Dashed lines are IPCC rates.
-
GSAT Data and Various Trends
0.8G STA and v arious tre nds
D ata
0.4
0.6 Tre ndTre nd8Tre nd7
0.2
: o C
-0 .2
0
mpe
ratu
re
-0 6
-0.4Tem
-0 .8
-0 .6
1850 1900 1950 2000-1
Tim e : Y e ar
-
Annual Temperature Ranking : 2008
GISS NCDC CRU Rank
2005 2005 1998 11998 1998 2005 21998 1998 2005 22002 2002 2003 32007 2003 2002 42007 2003 2002 42003 2006 2004 52006 2007 2006 62001 2004 2001 72004 2001 2007 82008 2008 1997 91997 1997 2008 10
-
ConclusionConclusion
Ad ti th d i th l i tifi llAdaptive method is the only scientifically meaningful way to analyze data.
It is the only way to find out the underlyingIt is the only way to find out the underlying physical processes; therefore, it is indispensable in scientific researchindispensable in scientific research.
It is physical, direct, and simple; it is ideal for engineering applications.