support inference in linear statistical inverse problems · consider the test statistic m = max i 2...

54
Support Inference in linear Statistical Inverse Problems Frank Werner 1 Statistical Inverse Problems in Biophysics Group Max Planck Institute for Biophysical Chemistry, G¨ ottingen and Felix Bernstein Institute for Mathematical Statistics in the Biosciences University of G¨ottingen Applied Inverse Problems Conference 2017, Hangzhou 1 joint work with Katharina Proksch and Axel Munk Frank Werner, MPIbpC G¨ ottingen Support Inference May 31, 2017 1 / 30

Upload: others

Post on 25-Aug-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Support Inference in linear Statistical Inverse Problems

Frank Werner1

Statistical Inverse Problems in Biophysics GroupMax Planck Institute for Biophysical Chemistry, Gottingen

and

Felix Bernstein Institute for Mathematical Statistics in the BiosciencesUniversity of Gottingen

Applied Inverse Problems Conference 2017, Hangzhou

1joint work with Katharina Proksch and Axel MunkFrank Werner, MPIbpC Gottingen Support Inference May 31, 2017 1 / 30

Page 2: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Outline

1 Introduction

2 Theory

3 Simulations and real data example

4 Conclusion

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 2 / 30

Page 3: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Introduction

Outline

1 Introduction

2 Theory

3 Simulations and real data example

4 Conclusion

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 3 / 30

Page 4: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Introduction

Setting and goal

• X and Y Hilbert spaces

• ϕii∈N ⊂ X a dictionary

• T : X → Y bounded and linear

Can we identify the active components (=’support’) of an unknown ffrom noisy measurements of Tf ?

More precisely:

Given noisy measurements Y ≈ Tf we want to generate a set Ia such thatat a controlled error level all i ∈ Ia satisfy 〈ϕi , f 〉X > 0

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 4 / 30

Page 5: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Introduction

Setting and goal

• X and Y Hilbert spaces

• ϕii∈N ⊂ X a dictionary

• T : X → Y bounded and linear

Can we identify the active components (=’support’) of an unknown ffrom noisy measurements of Tf ?

More precisely:

Given noisy measurements Y ≈ Tf we want to generate a set Ia such thatat a controlled error level all i ∈ Ia satisfy 〈ϕi , f 〉X > 0

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 4 / 30

Page 6: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Introduction

Support inference?

Special situation: Suppose that

• X is a space of functions (i.e. X = L2 (Ω)),

• and the functions ϕi have compact support (e.g. wavelet dictionary)

Then:〈ϕi , f 〉X > 0 ⇒ f|supp(ϕi )

6≡ 0

Consequently, we obtain information about the support of f !

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 5 / 30

Page 7: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Introduction

Related problems and methodsSuppose f is sparse w.r.t. ϕii∈N. Then for the recovery of f manymethods are used:

• `1-penalized Tikhonov regularization / LASSO

fα = argminf ∈X

[‖Tf − Y ‖2

Y + α

∞∑i=1

|〈ϕi , f 〉X |

]

• Residual method / Danzig selector

fα = argminf ∈X

∞∑i=1

|〈ϕi , f 〉X | subject to ‖Tf − Y ‖Y ≤ ρ

• Orthorgonal matching pursuit

• ...

But none of these methods can identify the true support at a controllederror level!

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 6 / 30

Page 8: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Introduction

Methodology

Inference for a single i :

• compute a function Φi such that 〈Φi ,Tf 〉Y = 〈ϕi , f 〉X , i.e.ϕi = T ∗Φi

• compute the (asymptotic) distribution of 〈Φi ,Y 〉Y (test statistic)under the hypothesis 〈ϕi , f 〉X = 0

• with the (1− α)-quantile q1−α of this (asymptotic) distribution itholds under 〈ϕi , f 〉X = 0 (asymptotically)

P[〈Φi ,Y 〉Y ≥ q1−α

]≤ α

• consequently if 〈Φi ,Y 〉Y ≥ q1−α we have 〈ϕi , f 〉X > 0 withprobability ≥ 1− α.

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 7 / 30

Page 9: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Introduction

Methodology

Inference for a single i :

• compute a function Φi such that 〈Φi ,Tf 〉Y = 〈ϕi , f 〉X , i.e.ϕi = T ∗Φi

• compute the (asymptotic) distribution of 〈Φi ,Y 〉Y (test statistic)under the hypothesis 〈ϕi , f 〉X = 0

• with the (1− α)-quantile q1−α of this (asymptotic) distribution itholds under 〈ϕi , f 〉X = 0 (asymptotically)

P[〈Φi ,Y 〉Y ≥ q1−α

]≤ α

• consequently if 〈Φi ,Y 〉Y ≥ q1−α we have 〈ϕi , f 〉X > 0 withprobability ≥ 1− α.

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 7 / 30

Page 10: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Introduction

Methodology

Inference for a single i :

• compute a function Φi such that 〈Φi ,Tf 〉Y = 〈ϕi , f 〉X , i.e.ϕi = T ∗Φi

• compute the (asymptotic) distribution of 〈Φi ,Y 〉Y (test statistic)under the hypothesis 〈ϕi , f 〉X = 0

• with the (1− α)-quantile q1−α of this (asymptotic) distribution itholds under 〈ϕi , f 〉X = 0 (asymptotically)

P[〈Φi ,Y 〉Y ≥ q1−α

]≤ α

• consequently if 〈Φi ,Y 〉Y ≥ q1−α we have 〈ϕi , f 〉X > 0 withprobability ≥ 1− α.

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 7 / 30

Page 11: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Introduction

Methodology

Inference for a single i :

• compute a function Φi such that 〈Φi ,Tf 〉Y = 〈ϕi , f 〉X , i.e.ϕi = T ∗Φi

• compute the (asymptotic) distribution of 〈Φi ,Y 〉Y (test statistic)under the hypothesis 〈ϕi , f 〉X = 0

• with the (1− α)-quantile q1−α of this (asymptotic) distribution itholds under 〈ϕi , f 〉X = 0 (asymptotically)

P[〈Φi ,Y 〉Y ≥ q1−α

]≤ α

• consequently if 〈Φi ,Y 〉Y ≥ q1−α we have 〈ϕi , f 〉X > 0 withprobability ≥ 1− α.

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 7 / 30

Page 12: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Introduction

Methodology (cont’)Inference for a all i :

• if we infer for each i individually, the multiplicity adjustment makesthe statements weak (if the statements are true for each single i withprobability 90%, then they hold true for two i at the same time onlywith probability 81% etc.)

remedy: simultaneous testing

• consider the test statistic

M = maxi

wi

〈Φi ,Y 〉Y√V[〈Φi ,Y 〉Y

] − wi

.• compute the asymptotic distribution of M under f ≡ 0 and its

(1− α)-quantile q1−α

• mark each i for which 〈Φi ,Y 〉Y > (q1−α/wi + wi )√V[〈Φi ,Y 〉Y

]as

active

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 8 / 30

Page 13: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Introduction

Methodology (cont’)Inference for a all i :

• if we infer for each i individually, the multiplicity adjustment makesthe statements weak (if the statements are true for each single i withprobability 90%, then they hold true for two i at the same time onlywith probability 81% etc.)

remedy: simultaneous testing

• consider the test statistic

M = maxi

wi

〈Φi ,Y 〉Y√V[〈Φi ,Y 〉Y

] − wi

.• compute the asymptotic distribution of M under f ≡ 0 and its

(1− α)-quantile q1−α

• mark each i for which 〈Φi ,Y 〉Y > (q1−α/wi + wi )√V[〈Φi ,Y 〉Y

]as

activeFrank Werner, MPIbpC Gottingen Support Inference May 31, 2017 8 / 30

Page 14: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Introduction

Methodology (cont’)

• by taking the max in the statistic the statements become uniform in i :

Ia :=

i∣∣∣ 〈Φi ,Y 〉Y >

(q1−αwi

+ wi

)√V[〈Φi ,Y 〉Y

]satisfies (asymptotically)

P [〈ϕi , f 〉X > 0 for all i ∈ Ia] ≥ 1− α.

we can infer on all i at a controlled level!

Are the quantiles q1−α well-defined? How to compute them?

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 9 / 30

Page 15: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Introduction

Methodology (cont’)

• by taking the max in the statistic the statements become uniform in i :

Ia :=

i∣∣∣ 〈Φi ,Y 〉Y >

(q1−αwi

+ wi

)√V[〈Φi ,Y 〉Y

]satisfies (asymptotically)

P [〈ϕi , f 〉X > 0 for all i ∈ Ia] ≥ 1− α.

we can infer on all i at a controlled level!

Are the quantiles q1−α well-defined? How to compute them?

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 9 / 30

Page 16: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Outline

1 Introduction

2 Theory

3 Simulations and real data example

4 Conclusion

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 10 / 30

Page 17: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Specific setting

• Y = L2([0, 1]d), discrete measurements: Yj = (Tf ) (xj) + ξj ,

j ∈ 1, ..., nd

• ξj are independent, centered (E [ξj ] = 0) and satisfy a momentcondition (especially all moments need to exist)

• the dictionary has at most N = N(n) elements ϕi which satisfyϕi = T ∗Φi , and there is a transformed mother wavelet Φ such that

Φi =

Φ

(· − tihi

) ∣∣ 1 ≤ i ≤ N (n)

with scales hi ∈ [0, 1]d and positions ti ∈ [0, 1]d

• the function Φ has compact support in [0, 1]d

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 11 / 30

Page 18: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Specific setting

• Y = L2([0, 1]d), discrete measurements: Yj = (Tf ) (xj) + ξj ,

j ∈ 1, ..., nd

• ξj are independent, centered (E [ξj ] = 0) and satisfy a momentcondition (especially all moments need to exist)

• the dictionary has at most N = N(n) elements ϕi which satisfyϕi = T ∗Φi , and there is a transformed mother wavelet Φ such that

Φi =

Φ

(· − tihi

) ∣∣ 1 ≤ i ≤ N (n)

with scales hi ∈ [0, 1]d and positions ti ∈ [0, 1]d

• the function Φ has compact support in [0, 1]d

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 11 / 30

Page 19: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Specific setting

• Y = L2([0, 1]d), discrete measurements: Yj = (Tf ) (xj) + ξj ,

j ∈ 1, ..., nd

• ξj are independent, centered (E [ξj ] = 0) and satisfy a momentcondition (especially all moments need to exist)

• the dictionary has at most N = N(n) elements ϕi which satisfyϕi = T ∗Φi , and there is a transformed mother wavelet Φ such that

Φi =

Φ

(· − tihi

) ∣∣ 1 ≤ i ≤ N (n)

with scales hi ∈ [0, 1]d and positions ti ∈ [0, 1]d

• the function Φ has compact support in [0, 1]d

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 11 / 30

Page 20: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Specific setting

• Y = L2([0, 1]d), discrete measurements: Yj = (Tf ) (xj) + ξj ,

j ∈ 1, ..., nd

• ξj are independent, centered (E [ξj ] = 0) and satisfy a momentcondition (especially all moments need to exist)

• the dictionary has at most N = N(n) elements ϕi which satisfyϕi = T ∗Φi , and there is a transformed mother wavelet Φ such that

Φi =

Φ

(· − tihi

) ∣∣ 1 ≤ i ≤ N (n)

with scales hi ∈ [0, 1]d and positions ti ∈ [0, 1]d

• the function Φ has compact support in [0, 1]d

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 11 / 30

Page 21: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Test statistic• 〈Φi ,Y 〉Y is not available, approximate it by

〈Φi ,Y 〉n := n−d∑

j∈1,...,ndYjΦi (xj)

• the variance σ2i := V

[〈Φi ,Y 〉Y

]might also be unknown, consider a

family of uniformly consistent estimators σ2i

• we have to investigate the asymptotic distribution of

S (Y ) := maxi

[wi

(〈Φi ,Y 〉n

σi− wi

)]• the calibration values are only scale-dependent and chosen as

wi =

√2 log

(CΦi∏hi

)+ Cd

log

(√2 log

(CΦi∏hi

))√

2 log(

CΦi∏hi

)

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 12 / 30

Page 22: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Test statistic• 〈Φi ,Y 〉Y is not available, approximate it by

〈Φi ,Y 〉n := n−d∑

j∈1,...,ndYjΦi (xj)

• the variance σ2i := V

[〈Φi ,Y 〉Y

]might also be unknown, consider a

family of uniformly consistent estimators σ2i

• we have to investigate the asymptotic distribution of

S (Y ) := maxi

[wi

(〈Φi ,Y 〉n

σi− wi

)]• the calibration values are only scale-dependent and chosen as

wi =

√2 log

(CΦi∏hi

)+ Cd

log

(√2 log

(CΦi∏hi

))√

2 log(

CΦi∏hi

)

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 12 / 30

Page 23: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Test statistic• 〈Φi ,Y 〉Y is not available, approximate it by

〈Φi ,Y 〉n := n−d∑

j∈1,...,ndYjΦi (xj)

• the variance σ2i := V

[〈Φi ,Y 〉Y

]might also be unknown, consider a

family of uniformly consistent estimators σ2i

• we have to investigate the asymptotic distribution of

S (Y ) := maxi

[wi

(〈Φi ,Y 〉n

σi− wi

)]

• the calibration values are only scale-dependent and chosen as

wi =

√2 log

(CΦi∏hi

)+ Cd

log

(√2 log

(CΦi∏hi

))√

2 log(

CΦi∏hi

)

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 12 / 30

Page 24: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Test statistic• 〈Φi ,Y 〉Y is not available, approximate it by

〈Φi ,Y 〉n := n−d∑

j∈1,...,ndYjΦi (xj)

• the variance σ2i := V

[〈Φi ,Y 〉Y

]might also be unknown, consider a

family of uniformly consistent estimators σ2i

• we have to investigate the asymptotic distribution of

S (Y ) := maxi

[wi

(〈Φi ,Y 〉n

σi− wi

)]• the calibration values are only scale-dependent and chosen as

wi =

√2 log

(CΦi∏hi

)+ Cd

log

(√2 log

(CΦi∏hi

))√

2 log(

CΦi∏hi

)Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 12 / 30

Page 25: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Gaussian Approximation

Suppose that

• there are only polynomially many probe functions in the dictionary(N(n) 6 nκ for some κ > 0)

• the smallest scale tends to zero not too fast(mini minentries hi > log(n)p/n with some specific p)

• the largest scale tends to zero sufficiently fast(maxi maxentries hi 6 n−δ for some δ > 0)

Gaussian Approximation

Then there are i.i.d. standard normal random variables ζj such that underf ≡ 0 it holds

limn→∞

|P [S (Y ) > q]− P [S (ζ) > q]| = 0 for all q

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 13 / 30

Page 26: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Gaussian Approximation

Suppose that

• there are only polynomially many probe functions in the dictionary(N(n) 6 nκ for some κ > 0)

• the smallest scale tends to zero not too fast(mini minentries hi > log(n)p/n with some specific p)

• the largest scale tends to zero sufficiently fast(maxi maxentries hi 6 n−δ for some δ > 0)

Gaussian Approximation

Then there are i.i.d. standard normal random variables ζj such that underf ≡ 0 it holds

limn→∞

|P [S (Y ) > q]− P [S (ζ) > q]| = 0 for all q

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 13 / 30

Page 27: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Gaussian Approximation (cont’)

Continuous Gaussian Approximation

There exists a Brownian sheet W such that S (Y ) can be approximated

S (W ) := maxi

[wi

(∫Φi (x) dWx

‖Φi‖L2

− wi

)],

i.e. under f ≡ 0 it holds

limn→∞

|P [S (Y ) > q]− P [S (W ) > q]| = 0 for all q.

Moreover

• S (W ) is a.s. bounded from below and above,

• S (W ) is asymptotically non-degenerate, i.e. does not concentrate toany point,

• S (W ) does not depend on any unknown quantities.

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 14 / 30

Page 28: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Gaussian Approximation - Implications

This means that we can use quantiles q1−α from

S (ζ) = maxi

[wi

(〈Φi , ζ〉n‖Φi‖2

− wi

)]to hold the asymptotic level.

• S (ζ) is ’distribution free’, i.e. it depends only on known quantities

quantiles can be simulated easily

• quantiles are meaningful as n→∞

If 〈Φi ,Y 〉n >(q1−αwi

+ wi

)σi mark i as active (i.e. i ∈ Ia)!

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 15 / 30

Page 29: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Gaussian Approximation - Implications

This means that we can use quantiles q1−α from

S (ζ) = maxi

[wi

(〈Φi , ζ〉n‖Φi‖2

− wi

)]to hold the asymptotic level.

• S (ζ) is ’distribution free’, i.e. it depends only on known quantities

quantiles can be simulated easily

• quantiles are meaningful as n→∞

If 〈Φi ,Y 〉n >(q1−αwi

+ wi

)σi mark i as active (i.e. i ∈ Ia)!

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 15 / 30

Page 30: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Detection properties

So far: whenever i ∈ Ia, then asymptotically P [〈ϕi , f 〉X > 0] ≥ 1− α.

But how large must 〈ϕi , f 〉X be to be detected?

Lower detection bound

If 〈ϕi , f 〉X ≥ 2(q1−αwi

+ wi

)σi , then

limn→∞

P [i ∈ Ia] ≥ 1− α

uniformly in i .

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 16 / 30

Page 31: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Detection properties

So far: whenever i ∈ Ia, then asymptotically P [〈ϕi , f 〉X > 0] ≥ 1− α.

But how large must 〈ϕi , f 〉X be to be detected?

Lower detection bound

If 〈ϕi , f 〉X ≥ 2(q1−αwi

+ wi

)σi , then

limn→∞

P [i ∈ Ia] ≥ 1− α

uniformly in i .

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 16 / 30

Page 32: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Detection properties

So far: whenever i ∈ Ia, then asymptotically P [〈ϕi , f 〉X > 0] ≥ 1− α.

But how large must 〈ϕi , f 〉X be to be detected?

Lower detection bound

If 〈ϕi , f 〉X ≥ 2(q1−αwi

+ wi

)σi , then

limn→∞

P [i ∈ Ia] ≥ 1− α

uniformly in i .

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 16 / 30

Page 33: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Special case: deconvolution

• Suppose d = 2 and T is a convolution operator, i.e.

(Tf ) (y) = (k ∗ f ) (y) :=

∫[0,1]2

k (x − y) f (y) dy .

This implies that if we choose ϕi of Wavelet-type we obtain

Φi =

Φhi

(· − tihi

) ∣∣ 1 ≤ i ≤ N (n)

where Φh depends on h.

• Suppose the Fourier transform of the kernel k has a polynomial decay,i.e.

c(1 + ‖ξ‖2

2

)−a ≤ |Fk(ξ)| ≤ C(1 + ‖ξ‖2

2

)−a.

This implies that the functions ϕi can be chosen such that they arenon-negative and have compact support, and ‖Φh‖L2 behaves likemaxentries h

2a.

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 17 / 30

Page 34: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Special case: deconvolution

• Suppose d = 2 and T is a convolution operator, i.e.

(Tf ) (y) = (k ∗ f ) (y) :=

∫[0,1]2

k (x − y) f (y) dy .

This implies that if we choose ϕi of Wavelet-type we obtain

Φi =

Φhi

(· − tihi

) ∣∣ 1 ≤ i ≤ N (n)

where Φh depends on h.

• Suppose the Fourier transform of the kernel k has a polynomial decay,i.e.

c(1 + ‖ξ‖2

2

)−a ≤ |Fk(ξ)| ≤ C(1 + ‖ξ‖2

2

)−a.

This implies that the functions ϕi can be chosen such that they arenon-negative and have compact support, and ‖Φh‖L2 behaves likemaxentries h

2a.

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 17 / 30

Page 35: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Special case: deconvolution

• Suppose d = 2 and T is a convolution operator, i.e.

(Tf ) (y) = (k ∗ f ) (y) :=

∫[0,1]2

k (x − y) f (y) dy .

This implies that if we choose ϕi of Wavelet-type we obtain

Φi =

Φhi

(· − tihi

) ∣∣ 1 ≤ i ≤ N (n)

where Φh depends on h.

• Suppose the Fourier transform of the kernel k has a polynomial decay,i.e.

c(1 + ‖ξ‖2

2

)−a ≤ |Fk(ξ)| ≤ C(1 + ‖ξ‖2

2

)−a.

This implies that the functions ϕi can be chosen such that they arenon-negative and have compact support, and ‖Φh‖L2 behaves likemaxentries h

2a.

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 17 / 30

Page 36: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Special case: deconvolution

• Suppose d = 2 and T is a convolution operator, i.e.

(Tf ) (y) = (k ∗ f ) (y) :=

∫[0,1]2

k (x − y) f (y) dy .

This implies that if we choose ϕi of Wavelet-type we obtain

Φi =

Φhi

(· − tihi

) ∣∣ 1 ≤ i ≤ N (n)

where Φh depends on h.

• Suppose the Fourier transform of the kernel k has a polynomial decay,i.e.

c(1 + ‖ξ‖2

2

)−a ≤ |Fk(ξ)| ≤ C(1 + ‖ξ‖2

2

)−a.

This implies that the functions ϕi can be chosen such that they arenon-negative and have compact support, and ‖Φh‖L2 behaves likemaxentries h

2a.

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 17 / 30

Page 37: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Special case: deconvolution (cont’)

• S (Y ) can be approximated by a Gaussian version which is a.s.bounded and non-degenerate

whenever i ∈ Ia, then asymptotically P [〈ϕi , f 〉X > 0] ≥ 1− α.

• If 〈ϕi , f 〉X is sufficiently large, then i will be detected with probability≥ 1− α.

Optimality of the lower detection bound

In d = 1 this lower detection bound is optimal in the sense that noestimator for a β-Holder-continuous function f can distinguish betweenf|[t,t+h]

= 0 and f|[t,t+h]≥ hβ at a faster rate in h = h(n).

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 18 / 30

Page 38: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Theory

Special case: deconvolution (cont’)

• S (Y ) can be approximated by a Gaussian version which is a.s.bounded and non-degenerate

whenever i ∈ Ia, then asymptotically P [〈ϕi , f 〉X > 0] ≥ 1− α.

• If 〈ϕi , f 〉X is sufficiently large, then i will be detected with probability≥ 1− α.

Optimality of the lower detection bound

In d = 1 this lower detection bound is optimal in the sense that noestimator for a β-Holder-continuous function f can distinguish betweenf|[t,t+h]

= 0 and f|[t,t+h]≥ hβ at a faster rate in h = h(n).

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 18 / 30

Page 39: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Simulations and real data example

Outline

1 Introduction

2 Theory

3 Simulations and real data example

4 Conclusion

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 19 / 30

Page 40: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Simulations and real data example Simulations

Considered problem

• T deconvolution problem, i.e.

Yj = (k ∗ f ) (xj) + ξj , j ∈ 1, ..., n2

with an equidistant grid xj on [0, 1]2.

• The kernel k is chosen from the family

(Fka,b) (ξ) = (1 + b2 ‖ξ‖22)−a, ξ ∈ R2.

• The variance is considered to be known.

• The mother wavelet ϕ is chosen tominimize the variance ‖Φ‖2

L2

( Tensor product of Beta-Kernels)

0

1

Testfunction f

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 20 / 30

Page 41: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Simulations and real data example Simulations

Considered problem

• T deconvolution problem, i.e.

Yj = (k ∗ f ) (xj) + ξj , j ∈ 1, ..., n2

with an equidistant grid xj on [0, 1]2.

• The kernel k is chosen from the family

(Fka,b) (ξ) = (1 + b2 ‖ξ‖22)−a, ξ ∈ R2.

• The variance is considered to be known.

• The mother wavelet ϕ is chosen tominimize the variance ‖Φ‖2

L2

( Tensor product of Beta-Kernels)

0

1

Testfunction f

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 20 / 30

Page 42: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Simulations and real data example Simulations

Considered problem

• T deconvolution problem, i.e.

Yj = (k ∗ f ) (xj) + ξj , j ∈ 1, ..., n2

with an equidistant grid xj on [0, 1]2.

• The kernel k is chosen from the family

(Fka,b) (ξ) = (1 + b2 ‖ξ‖22)−a, ξ ∈ R2.

• The variance is considered to be known.

• The mother wavelet ϕ is chosen tominimize the variance ‖Φ‖2

L2

( Tensor product of Beta-Kernels)

0

1

Testfunction f

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 20 / 30

Page 43: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Simulations and real data example Simulations

Considered problem

• T deconvolution problem, i.e.

Yj = (k ∗ f ) (xj) + ξj , j ∈ 1, ..., n2

with an equidistant grid xj on [0, 1]2.

• The kernel k is chosen from the family

(Fka,b) (ξ) = (1 + b2 ‖ξ‖22)−a, ξ ∈ R2.

• The variance is considered to be known.

• The mother wavelet ϕ is chosen tominimize the variance ‖Φ‖2

L2

( Tensor product of Beta-Kernels)

0

1

Testfunction f

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 20 / 30

Page 44: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Simulations and real data example Simulations

Some empirical levels for α = 0.1

Noise scenario Parameters false positives %

Gaussian noise 8.8

Student’s t noise

ν = 3 100ν = 6 94.7ν = 7 72.3ν = 11 21.8ν = 15 15.7ν = 19 13.0ν = 23 13.3

CCD noise (Sneyder ’93, ’95):obs. time t, background b,read-out errors N

(0, σ2

) t = 100, b = 0.5, σ = 0.01 9.8t = 1000, b = 0.005, σ = 0.01 8.1t = 100, b = 0.005, σ = 0.01 14.5

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 21 / 30

Page 45: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Simulations and real data example Simulations

Support recovery - result

−2

0

2

3120

1560

1040

780

624

data (σ = 0.5) 90% significance map

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 22 / 30

Page 46: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Simulations and real data example Simulations

Support recovery - result

−0.2

0

0.2

0.4

0.6

800

400

266.7

200

160

data (σ = 0.05) 90% significance map

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 23 / 30

Page 47: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Simulations and real data example Simulations

Support recovery - result

0

1

0

0.2

0.4

0.6

exact solution data (σ = 0.005)

240

120

80

60

48

240

120

80

60

48

90% significance map zoomFrank Werner, MPIbpC Gottingen Support Inference May 31, 2017 24 / 30

Page 48: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Simulations and real data example Real data example

Real data example - Setup

• we analyze fluorescent dyeson single DNA Origami

• imaging is performed by aSTED microscope

• each of the two strands canat most hold 12 markers

Observations (photon counts)500nm

0

50

100

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 25 / 30

Page 49: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Simulations and real data example Real data example

ModelingThe observations can perfectly modeled by

Yjindependent∼ Bin (t, (k ∗ f ) (xj)) , j ∈ 1, ..., n2 .

• Bin (t, p): Binomial distribution with parameters t ∈ N and p ∈ [0, 1]• f (x): probability that a photon emitted at grid point x is recorded at

the detector in a single excitation pulse

100nm

0

2 · 10−3

4 · 10−3

6 · 10−3100nm

0

2 · 10−3

4 · 10−3

6 · 10−3

experimental kernel k2,0.016

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 26 / 30

Page 50: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Simulations and real data example Real data example

Result

500nm

8000

4000

2667

2000

1600

200nm

??

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 27 / 30

Page 51: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Simulations and real data example Real data example

Comparison of the result with the data

(1)

(2)

0

50

100

8000

4000

2667

2000

1600

data excerpt result excerpt

0

50

100

0

50

100

zommed data (1) zommed data (2)

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 28 / 30

Page 52: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Conclusion

Outline

1 Introduction

2 Theory

3 Simulations and real data example

4 Conclusion

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 29 / 30

Page 53: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Conclusion

Outlook

• Methodology and theory:• inference on active coefficients 〈ϕi , f 〉X w.r.t. a dictionary ϕi

• techniques from multiscale testing yield uniform inference at acontrolled (asymptotic) error level

• detection power is optimal in a suitable sense

• Application:• the method can be used to determine the support of a function

observed in a convolution model

• performs well in a real data example

Thank you for your attention!

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 30 / 30

Page 54: Support Inference in linear Statistical Inverse Problems · consider the test statistic M = max i 2 4w i 0 @ h i;Yi Y q V h i;Yi Y w i 1 A 3 5: compute the asymptotic distribution

Conclusion

Outlook

• Methodology and theory:• inference on active coefficients 〈ϕi , f 〉X w.r.t. a dictionary ϕi

• techniques from multiscale testing yield uniform inference at acontrolled (asymptotic) error level

• detection power is optimal in a suitable sense

• Application:• the method can be used to determine the support of a function

observed in a convolution model

• performs well in a real data example

Thank you for your attention!

Frank Werner, MPIbpC Gottingen Support Inference May 31, 2017 30 / 30