a multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · a multigrid method...

28
A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of Earth and Ocean Science University of British Columbia [email protected] July 4, 2003 E.Haber: Multigrid methods for full-space formulation

Upload: trinhdat

Post on 30-Jul-2018

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

A multigrid method for largescale inverse problems

Eldad Haber

Dept. of Computer Science,Dept. of Earth and Ocean Science

University of British Columbia

[email protected]

July 4, 2003

E.Haber: Multigrid methods for full-space formulation

Page 2: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Collaborator

Uri Ascher

E.Haber: Multigrid methods for full-space formulation 1

Page 3: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Main Outline

• Preliminaries - all-at-once methods.

• Multigrid for the linearized system

• Discretization

• Smoothers

• The multigrid iteration

• Solving the nonlinear system using an inexactNewton SQP method

• Example

• Conclusions

E.Haber: Multigrid methods for full-space formulation 2

Page 4: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Preliminaries

• In many applications we need to evaluatecoefficients from noisy measured data

• Examples:

– Resistivity/hydrology measurements:

∇ · σ∇φk = qk k = 1...N

– Electromagnetics:

∇× µ−1∇× Ejk − iωjσEjk = iωjqk

• Discretize PDE plus BC and write forwardproblem as

A(m)u = q

• In these problems we need to infer the model mgiven measurements of the field u

E.Haber: Multigrid methods for full-space formulation 3

Page 5: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Formulation of the inverse problem

We have:

• The PDE+BC A(m)u = q (physical process)

• Measured data bobs = Qu

• A priori information

What are we after?

• Solve the differential problem A(m)u = q

• Fit the data, ||Qu− bobs|| ≤Tol

• Get a reasonable model: ||W (m−mref)|| nottoo large

E.Haber: Multigrid methods for full-space formulation 4

Page 6: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Formulation cont.

There are two (major) ways to approach theproblemUnconstrained :

φ(m) = ||Qu− b||2 + β||Wm||2= ||QA(m)−1q − b||2 + β||Wm||2

Constrained or - simultaneous; all-at-once:

Lm,u,λ = ||Qu− b||2 + β||Wm||2 + λT (A(m)u− q)

λ - Vector of Lagrange multipliers.

E.Haber: Multigrid methods for full-space formulation 5

Page 7: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

All-at-once - Unconstrained: Thegeneral idea

A(m)u-q=0

m

u

Constrained - Unconstrained

E.Haber: Multigrid methods for full-space formulation 6

Page 8: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Linear systems which evolve fromall-at-once methods

The Gradients:

Lu = QT (Qu− b) + A(m)Tλ = 0

Lm = βWTW (m−m0) + G(m,u)Tλ = 0

Lλ = A(m)u− q = 0

where

G(m,u) =∂(A(m)u)

∂m.

Gauss-Newton iteration1:

A 0 GQTQ AT 0

0 GT βWTW

δuδλδm

=

Lu

Lm

1In the Gauss-Newton case we do not use second order information

E.Haber: Multigrid methods for full-space formulation 7

Page 9: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

The linearized system

It is easy to see that the permuted KKT matrix

A 0 GQTQ AT 0

0 GT βWTW

corresponds to discretizing the differential operator

∇ · (em∇(·)) 0 ∇ · ((em∇u)(·))∑

δ(x− xi) ∇ · (em∇(·)) 00 −(∇u)T (em)(∇(·)) −β∇2

plus BC

This PDE system can be efficiently solved usingmultigrid methods

E.Haber: Multigrid methods for full-space formulation 8

Page 10: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Ellipticity of the PDE system

Assuming we have data everywhere, consider onemode

δuδλδm

=

δu

δλδm

eıθ·x

Denote σ = em and obtain the symbol

C =

−σ|θ|2 0 ıσ(∇u)Tθ

1 −σ|θ|2 00 −ıσ(∇u)Tθ β|θ|2

Thus

det(C) = σ2(β|θ|6 + [(∇u)Tθ]2)

The system is therefore elliptic for any β > 0

E.Haber: Multigrid methods for full-space formulation 9

Page 11: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Discretization of the system - I

• Care must be taken to ensure that the discreteversion of the Euler-Lagrange equations is h-elliptic.

• Given m at cell centers, there are two possibleapproaches to the discretization of the forwardproblem

– Cell-centered– Nodal

−2 −1.5 −1 −0.5 0 0.5 1 1.5 2−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

OO

O O

m

0

0

0 0

0

0 0

0

0

E.Haber: Multigrid methods for full-space formulation 10

Page 12: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Discretization of the system I - cont.

The continuous linearized Euler-Lagrange equationsare

∇ · (em∇(δu))−∇ · ((em∇u)δm) = gu∑δ(x− xi)δu +∇ · (em∇(δλ)) = gλ

(∇u)T (em)(∇(δλ))− β∇2δm = gm

If we use a cell-centered discretization for theforward problem, then the term (∇u)Tem(∇(δλ))must be calculated using long differences in bothu and λ

In this case, for very small regularization parameterβ , the h-ellipticity of the system can be lost oncoarser grids.

E.Haber: Multigrid methods for full-space formulation 11

Page 13: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Discretization of the system I - cont.

m,u,λ

∂u/∂x

∂u/∂y

(∇u) [exp(m)] (∇λ)

Note - Compact discretization for the forwardproblem may still lead to non-compact discretizationof the inverse problem!

E.Haber: Multigrid methods for full-space formulation 12

Page 14: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Discretization of the system I - cont.

To avoid the difficulty associated with longdifferencing when the regularization parameteris small, we may introduce an O(h2) “artificialregularization” (similar to Levenberg-Marquardt forthe reduced Hessian and to pressure correctiontechniques in CFD)

∇ · (em∇(δu))−∇ · ((em∇u)δm) = gu∑δ(x− xi)δu +∇ · (em∇(δλ)) = gλ

(∇u)T (em)(∇(δλ))− (β + ωh2)∇2δm = gm

E.Haber: Multigrid methods for full-space formulation 13

Page 15: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Discretization of the system II

The problem of long differences can be solved bynodal discretization for u and λ while mremains at cell centers.

In this case we use only short differences andtherefore obtain an h-elliptic discretization even forvery small regularization parameters.

∂u/∂x

∂u/∂y

(∇u) [exp(m)] (∇λ)

m

E.Haber: Multigrid methods for full-space formulation 14

Page 16: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Discretization of the system andoptimization

• We discretize the optimization problem ratherthan the Euler-Lagrange equations directly,thereby retaining KKT symmetry in the discretenecessary conditions.

• We obtain a discrete optimization problem, anduse optimization techniques (e.g. variants ofSQP [Nocedal & Wright 1999] )

E.Haber: Multigrid methods for full-space formulation 15

Page 17: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Smoothing I

Cell-centered discretization

• In this case all unknowns are at the same point.

• For small regularization parameter values, thesystem is tightly coupled.

• We therefore use Collective Symmetric Gauss-Seidel relaxation.

• During a relaxation sweep invert a 3 × 3 block ateach grid point.

m u λ

E.Haber: Multigrid methods for full-space formulation 16

Page 18: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Smoothing II

Cell-node discretization

• The grid is staggered and we therefore use boxrelaxation.

• Box relaxation I [Vanka 1986]

• Box relaxation II - Eight-color box relaxation (inthe spirit of red-black boxes)

Box Eight− color box

m

u,λ

E.Haber: Multigrid methods for full-space formulation 17

Page 19: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

The Multigrid iteration

• Prolongation and restriction

– Nodal unknowns - Trilinear interpolation andfull-weight restriction.

– Cell unknowns - Trilinear interpolation and full-weight restriction.

• Coarse grid operator by Galerkin coarsening.

We perform a classical W(2,2) cycle.

E.Haber: Multigrid methods for full-space formulation 18

Page 20: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

The Multigrid iteration in an InexactNewton solver

ReminderWe are solving a nonlinear problem where at eachiteration the linear system

Hδy = −g

must be solved.

In inexact Multigrid-Newton methods, we solve thissystem to a very rough tolerance (say 0.1 [Kelley1998] ).

This can be accomplished by one W(2,2) cycle perInexact Newton iteration.

NoteThe cost of calculating the Jacobian is negligiblecompared with one W-cycle.

E.Haber: Multigrid methods for full-space formulation 19

Page 21: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

The Inexact Newton solver,Optimization issues.

We incorporate the method into an inexactSequential Quadratic Programming (SQP) version[Heinkenshlus 1996] .

Add a globalization strategy (using a line search andthe l1 merit function) [Flecher 1981] .

Further stabilization is obtained by inexact projectionto the constraint [Nocedal & Wright 1999]

E.Haber: Multigrid methods for full-space formulation 20

Page 22: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Numerical experiments I - Thelinearized problem

The Setting

• The PDE is∇ · em∇u = q

in the interval [−1, 1]3. Apply natural BC.

• We use cell-centered and nodal finite volumediscretizations for u , where m is at cellcenters.

• We measure u on a 63-grid uniformly spaced inthe interval [−0.6, 0.6] (contaminated with noise)

• The fine grid has 323 cells which leads to roughly105 unknowns

E.Haber: Multigrid methods for full-space formulation 21

Page 23: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Numerical experiments I - Thelinearized problem

To test the multigrid scheme, we solve the systemwhich arises in the first iteration to accuracy of 10−6

Rate of convergence for different β’s and differentdiscretizations

β Levels CC SCC CN10−2 4 0.11 0.06 0.04

3 0.05 0.05 0.0410−3 4 NC 0.09 0.04

3 0.35 0.09 0.0410−4 4 NC 0.12 0.04

3 NC 0.12 0.04

CC - Cell CenteredSCC - Stabilized Cell CenteredCN - Cell-Node

E.Haber: Multigrid methods for full-space formulation 22

Page 24: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Numerical experiments I - Thelinearized problem, cont.

• The stabilization of the cell-centered discretizationis needed for small values of the regularizationparameter

• Cell-Node performs better than Cell Centered(possible due to h-ellipticity effects)

• Using cell-node, the multigrid method is onlyslightly sensitive to the size of the regularizationparameter

E.Haber: Multigrid methods for full-space formulation 23

Page 25: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Solving the nonlinear problem

In the second experiment we solve the nonlinearproblem using inexact Newton.

Number of nonlinear iterations and of W-cyclesneeded to solve the nonlinear problem when usingan inexact Newton method.

β Newton iterations SCC CN10−1 8 8 810−2 8 9 810−3 14 17 1410−4 17 21 17

Note that we manage to solve the entire nonlinearinverse problem in a total cost which is not muchlarger than that of solving a few forward problems.

E.Haber: Multigrid methods for full-space formulation 24

Page 26: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Solving the nonlinear problem cont.

The solutions

2 3 4 5

Smooth training model

20 40 60 80

10

20

30

40

50

60

2 3 4 5

True smooth model

20 40 60 80

10

20

30

40

50

60

2 3 4 5

Rough training model

20 40 60 80

10

20

30

40

50

60

2 3 4 5

True rough model

20 40 60 80

10

20

30

40

50

60

Slices in the true model (lest) and the reconstructedmodel for z = 0.3, 0.55, 0.75.

E.Haber: Multigrid methods for full-space formulation 25

Page 27: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Solving the nonlinear problem cont.

Convergence of the iteration

1 2 3 4 5 6 7 810

−8

10−7

10−6

10−5

10−4

10−3

10−2

10−1

100

||g||/||g0||

||A(m)u−q||/||q||

Relative gradient and residual of the constraint(PDE) as a function of iteration for β = 10−2.

E.Haber: Multigrid methods for full-space formulation 26

Page 28: A multigrid method for large scale inverse problemshaber/pubs/pcopper02.pdf · A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of

Conclusions & some related questions

Using an h-elliptic discretization, it is possible tosolve the inverse problem in a cost which is notmuch larger than of a forward problem

Compact discretizations for the forward problem donot necessarily lead to compact discretizations ofthe inverse problem.

Related questions

• Incorporating a-priori information

• Robust statistics - choosing β using GCV, L-curve, etc.

• Incorporation of better SQP-type schemes

• Parallelism in the case of multi-experiments

E.Haber: Multigrid methods for full-space formulation 27