a plea for adequate data anali thdlysis methodsconf.ncree.org.tw/proceedings/0-i0990506/05...

68
A plea for adequate data l i th d analysis methods Norden E. Huang Research Center for Adaptive Data Analysis National Central University Chungli, Taiwan MCMD Conference Taipei, Taiwan 2010

Upload: others

Post on 24-Jul-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

  • A plea for adequate data l i th danalysis methods

    Norden E. Huang

    Research Center for Adaptive Data AnalysisNational Central University

    Chungli, Taiwan

    MCMD Conference Taipei, Taiwan 2010

  • A plea for adequate data analysisA plea for adequate data analysis

    • Whatever we want to do related to MCMD:– On forecast hazard occurrence– On increase Infrastructure resiliency

    On quantify risk and– On quantify risk, and – On minimize hazard impact

    • We need good data and more importantly• We need good data and, more importantly, adequate data analysis methods.

  • With the advance of IT and sensor technologies, there is an explosion of dataan explosion of data.

    We are now drowning in data yet still thirsty for informationyet still thirsty for information.

  • Scientific ActivitiesScientific Activities

    Collecting, analyzing, synthesizing, and theorizing are the core of scientifictheorizing are the core of scientific activities.

    Data analysis is a key link in this y ycontinuous loop.

    Data is the link to reality

  • Data analysis for engineeringData analysis for engineering

    • Data for design specifications

    • Data for health monitoring

    • Data for quantify risk

    • Data to minimizing hazard impacts

  • Example: Health monitoringExample: Health monitoring

    • Most natural hazards are hard to predict; therefore, it is critical that we should increase the structures resiliency to withstand the impacts of the hazards. The easiest and most logic ways to maximize structural resiliency are these:

    – To design the structure intelligently, and – To keep the structure in healthy conditions– To keep the structure in healthy conditions

    and good repair.

  • Land Slide damagesLand Slide damages

    Photograph by R.L. Schuster, U.S. Geological Survey, 1995.) g y )

  • Earthquake Damage(www.cedim.de)

  • Hurricane DamagegSouthern Grenada after Hurricane Ivan (Photo WW-2004).

  • Data AnalysisData Analysis

    Data analysis is too important to be left to th th ti ithe mathematicians.

    Why?!y

  • Different Paradigms IMathematics vs. Science/Engineering

    • Mathematicians • Scientists/Engineers

    • Absolute proofs • Agreement with observations

    • Logic consistency • Physical meaning

    • Mathematical rigor • Working Approximations

  • Data Processing vs AnalysisData Processing vs. Analysis

    In pursue of mathematic rigor and certainty, however, we lost sight of physics and are forced to idealize, but also deviate from, the reality.

    As a result, we are forced to live in a pseudo-real world, in which all processes areworld, in which all processes are

    Li d St tiLinear and Stationary

  • On nonlinear oscillations

    The key to structure health monitoringy g

  • How to define nonlinearity?

    Based on Linear Algebra: nonlinearity isBased on Linear Algebra: nonlinearity is defined based on input vs. output.

    But in reality, such an approach is not practical. The alternative is to define pnonlinearity based on data characteristics.

  • Linear Systems

    Linear systems satisfy the properties of superposition and scaling Given two valid inputsand scaling. Given two valid inputs

    x ( t ) and x ( t )1 2

    as well as their respective outputs

    y ( t ) H{ x ( t )} and=y ( t ) H{ x ( t )} and y (t) = H{ x ( t )}

    =1 1

    2 2

    then a linear system must satisfy

    y ( t ) y ( t ) H{ x ( t ) x ( t )}α β α β+ +

    for any scalar values α and β.

    y ( t ) y ( t ) H{ x ( t ) x ( t )} α β α β+ = +1 1 1 2

  • Characterizing Nonlinear Processes C a acte g o ea ocessesfrom Data

    2d x 32

    d x x cos tdt

    x γ ωε+ + =

    ( )2

    22d x x cos t

    dt1 x γ ωε⇒ + =+

    Spring with position dependent constai f d l

    nt,i

    ⇒intra-wave frequency m odula ;therefore, we need instantaneous fre

    tio.

    nquency

  • Duffing Pendulum: Intra-Wave Frequency Modulation

    x

    2

    22( co .) s1d x x tx

    d tε γ ω=++

  • Duffing Equation : Datag q

  • Duffing Type WaveDuffing Type WavePerturbation Expansion

    For 1 , we can haveε

  • Duffing Type WaveData: x = cos(wt+0.3 sin2wt)

  • Hilbert Transform : DefinitionpFor any x( t ) L ,∈

    1 x( )y( t ) d ,tτ

    τ τπ τ

    = ℘−∫

    then , x( t )and y( t ) form the analytic pairs:

    i ( t )z( t ) x( t ) i y( t ) ,a( t ) e θ= + =

    ( )2 2 1 / 2 1where

    y( t )a( t ) x y and ( t ) tan .θ −= + =( )( ) y ( ) x( t )

  • Instantaneous Frequency

    distanceVelocity ; mean velocitytime

    =

    dxNewton vdt

    ⇒ =dt

    1Frequency ; mean frequency=Frequency ; mean frequencyperiod

    =

    dHHT defines the phase functiondtθω⇒ =

    So that both v and can appear in differential equations.ω

  • The combination of Hilbert Spectral Analysis and Empirical Mode Decomposition is designated as

    HHT(HHT FFT)(HHT vs. FFT)

  • Comparison between FFT and HHT

    1 . F F T :

    ji tj

    j

    x ( t ) a e .ω= ℜ ∑j

    2 . H H T :

    jt

    i ( ) d

    x ( t ) a ( t ) eω τ τ∫

    ℜ ∑ tjj

    x ( t ) a ( t ) e .= ℜ ∑

  • Comparisons: pFourier, Hilbert & Wavelet

  • Speech AnalysisSpeech AnalysisHello : Data

  • Four comparsions D

  • Hilbert’s View onHilbert s View on Nonlinear Data

  • Duffing Type WaveDuffing Type WavePerturbation Expansion

    For 1 , we can haveε

  • Duffing Type WaveDuffing Type WaveWavelet Spectrum

  • Duffing Type WaveDuffing Type WaveHilbert Spectrum

  • Duffing Type Wave: Marginal SpectrumDifferent harmonics from different Method: not physical

  • Degree of Nonlinearity

    Quantify rather than qualifyQuantify, rather than qualify,nonlinearity

  • Degree of nonlinearityDegree of nonlinearity

    Consider x( t ) cos( t sin t ) ω ωε η= +

    ( )cdIF os t= 1 ηε ηθ ωω⇒ = +( )c IF os t= 1dt

    ηε ηωω⇒ = +

    1 / 22

    zIF IFDN (Degree of Nolinearity )I 2F

    ηε⎛ ⎞−= ⎜ ⎟

    ⎝=

    ⎠zI 2F⎝ ⎠

  • Degree of NonlinearityDegree of Nonlinearity

    • We can determine DN precisely with HilbertWe can determine DN precisely with Hilbert Spectral Analysis.

    • We can also determine ε and η separately• We can also determine ε and η separately.• η can be determined from the frequency

    mod lationmodulation.• ε can be determined from DN with η given.

  • Stokes Waves

    22d x x x cos tε γ ω+ + =2 x x cos tdt

    ε γ ω+ +

    ( )2d 1x ( )2 1x cos tdt x ωε γ+ + =

  • Stokes ee= +0.375; DN=0.0407

    2S to k e s : e e = + 0 .3 7 5

    1

    0

    1

    e

    1

    0

    mpl

    itude

    - 1Am

    -2

    0 5 0 1 0 0 1 5 0 2 0 0-3

    T im e : s e c

  • Stokes ee= +0.375; DN=0.0893

    2Stokes : ee=+0.375; IF,RZF;IWM

    1.5

    2

    1

    ency

    : H

    z

    0.5

    us F

    requ

    e

    0

    stan

    tane

    o

    -0.5

    Ins

    IF QuadIF RZFIWM

    0 20 40 60 80 100 120 140 160 180 200-1

    Tim e : sec

    IWM

  • Stokes ee=-0.375; DN=0.0407

    3Stokes : ee=-0.375

    2

    1

    2

    e

    0

    1

    mpl

    itude

    0Am

    -1

    0 50 100 150 200-2

    Time : sec

  • Stokes ee=-0.375; DN=0.0407

    2S to k e s : e e = -0 .3 7 5 ; IF ,R ZF ; IW M

    1 .5

    1

    uenc

    y : H

    z

    0 .5neou

    s Fr

    equ

    IF Q u a dIF R ZFIW M

    0 .5

    Inst

    anta

    n

    0

    0 2 0 4 0 6 0 8 0 1 0 0 1 2 0 1 4 0 1 6 0 1 8 0 2 0 0-0 .5

    T im e : s e c

  • Duffing Wavesg

    23d x x x cos tε γ ω+ + =2 x x cos tdt

    ε γ ω+ +

    ( )2

    2d x 1( )2 2x cos1 x tdt ε γ ω++ =

  • 6New Duffing : ee= 10, DN = 0.0470

    5

    4

    y : H

    z

    2

    3

    Freq

    uenc

    y

    IF Q uadIF ZC

    1

    2

    ntan

    eous

    DataIF-IFz

    0Inst

    an

    -1

    0 10 20 30 40 50-2

    Tim e : sec

  • Case Study

    A Bridge in Central TaiwanA Bridge in Central Taiwan

  • A Bridge in Southern Taiwan

  • Data : Hsin Nan Bridge Span 2 Vertical

  • Marginal Hilbert Spectra : Hsin Nan Bridge Span 2 V ti lVertical

  • Data : Hsin Nan Bridge Span 3 Vertical

  • Marginal Hilbert Spectra : Hsin Nan Bridge Span 3 VerticalVertical

  • Data : Hsin Nan Bridge Span 4 Vertical

  • Marginal Hilbert Spectra : Hsin Nan Bridge Span 4 V ti lVertical

  • The foundations: the Piers

    Check the pilesCheck the piles

  • Data : Bridge Piers 1 & 2

  • Data : Bridge Piers 3 & 4

  • Marginal Hilbert & Fourier Spectra : Bridge Piers 1 & 2 Vertical

  • What This Means

    • Instantaneous Frequency offers a total different view for nonlinear data: instantaneousview for nonlinear data: instantaneous frequency with no need for harmonics and unlimited by uncertainty.

    • Adaptive basis is indispensable for t ti d li d t l inonstationary and nonlinear data analysis

    HHT t bli h di f d t• HHT establishes a new paradigm of data analysis with which we can quantify nonlinearity

  • ComparisonsComparisons

    Fourier Wavelet Hilbert

    Basis a priori a priori AdaptiveBasis a priori a priori Adaptive

    Frequency Integral transform: Global

    Integral transform: Regional

    Differentiation:Local

    Presentation Energy-frequency Energy-time-frequency

    Energy-time-frequency

    Nonlinear no no yes

    Non-stationary no yes yes

    Uncertainty yes yes no

    H iHarmonics yes yes no

  • On Hazard Occurrence

    Global WarmingGlobal Warming

    HHT: Trend and prediction

  • IPCC Global Mean Temperature Trend

  • “Note that for shorter recent periods, th l i t i di tithe slope is greater, indicating

    accelerated warming.”g

    IPCC 4th Assessment Report 2007

  • The State-of-the arts: Trend

    “One economist’s trend is another economist’s l ”cycle”

    Engle, R. F. and Granger, C. W. J. 1991 Long-run Economic Relationships. Cambridge University Press.

    Regression method is arbitrary and ad hoc.

  • Philosophical Problem Anticipatedp p

    名不正則言不順名不正則言不順

    言不順則事不成言不順則事不成

    ——孔夫子

  • On Definition

    Without a proper definition, logic discourse would be impossible.

    Without logic discourse, nothing can be accomplished.

    Confucius

  • Definition of the TrendH t l P R S L d 1998Huang et al, Proc. Roy. Soc. Lond., 1998

    Wu et al. PNAS 2007

    Within the given data span, the trend is an intrinsically fitted monotonic function, or a function in which there can be at most one extremum.

    The trend should be an intrinsic and local property of the data; it is determined by the same mechanisms that generate the data.

    Being local, it has to associate with a local length scale, and be valid only within that length span, and be part of a full wave length.

    Th th d d t i i th t d h ld b i t i i B i i t i iThe method determining the trend should be intrinsic. Being intrinsic, the method for defining the trend has to be adaptive.

    All traditional trend determination methods are extrinsicAll traditional trend determination methods are extrinsic.

  • IPCC Global Mean Temperature Trend

  • Comparison between non‐linear rate with multi‐rate of IPCC

    0.025Various Rates of Global Warming

    0.015

    0.02

    r

    0.005

    0.01

    Rat

    e: o

    C/Y

    r

    -0.005

    0

    War

    min

    g R

    -0.015

    -0.01

    1840 1860 1880 1900 1920 1940 1960 1980 2000 2020-0.02

    Time : Year

    Blue shadow and blue line are the warming rate of non‐linear trend.Magenta shadow and magenta line are the rate of combination of non‐linear trend and AMO‐like components.Dashed lines are IPCC rates.

  • GSAT Data and Various Trends

    0.8G STA and v arious tre nds

    D ata

    0.4

    0.6 Tre ndTre nd8Tre nd7

    0.2

    : o C

    -0 .2

    0

    mpe

    ratu

    re

    -0 6

    -0.4Tem

    -0 .8

    -0 .6

    1850 1900 1950 2000-1

    Tim e : Y e ar

  • Annual Temperature Ranking : 2008

    GISS NCDC CRU Rank

    2005 2005 1998 11998 1998 2005 21998 1998 2005 22002 2002 2003 32007 2003 2002 42007 2003 2002 42003 2006 2004 52006 2007 2006 62001 2004 2001 72004 2001 2007 82008 2008 1997 91997 1997 2008 10

  • ConclusionConclusion

    Ad ti th d i th l i tifi llAdaptive method is the only scientifically meaningful way to analyze data.

    It is the only way to find out the underlyingIt is the only way to find out the underlying physical processes; therefore, it is indispensable in scientific researchindispensable in scientific research.

    It is physical, direct, and simple; it is ideal for engineering applications.