accuracy and convergence of finite-element/ …oden/dr._oden...with the exception of a number of...

ACCURACY AND CONVERGENCE OF FINITE-ELEMENT/

GALERKIN APPROXIMATIONS OF TIME-DEPENDENT

PROBLEf-,ISWITH EMPHASIS ON DIFFUSION AND CONVECTION

by

L. C. Wellford, Jr. and J. T. Oden

Department of Aerospace Engineering andEngineering Mechanics

Division of Engineering Mechanics

Texas Institute for Computational Mechanics

The University of Texas at Austin

AUSTIN, TEXAS

December, 1973

ACCURACY AND CONVERGENCE OF FINITE-ELEMENT/

GALERKIN APPROXIMATIONS OF TIME-DEPENDENT

PROBLEMS WITtI EMPHASIS ON DIFFUSION AND CONVECTION

L. C. Wellford Jr. and J. T. Oden

1. Introduction

With the exception of a number of important papers in the

engineering literature of the early 1960's, the mathematical

theory of finite element approximations reached a degree of mat-

urity only in 1969 and 1970. It was around this time that the

strength and elegance of the method and its relationship to con-

temporary research areas in interpolation theory began to be

appreciated by the mathematical world. There has since been a

flood of literature on mathematical features of the method, and a

number of books and expository articles have already appeared

[1-5]. The latest chapter in the theory, time-dependent problems,

is naturally a very new and incomplete addition. However, portions

of this chapter stand to be developed relatively quickly, for,

with minor modifications, they can be drawn from the existing and

more extensively developed theory of finite-differences, much of

which has bearing on certain finite-element approximations. In

fact, it is now widely accepted that the natural way to approach

many time-dependent problems is to approximate the spatial varia-

2

tion of the dependent variables using finite-element methods and

to approximate the temporal behavior by finite-differences. This

practice should make it possible to optimally exploit inherent ad-

vantages in both methods.

In very recent times, several distinct methods have emerged

for studying the central mathematical questions surrounding finite

element schemes for time-dependent problems: accuracy, convergence,

and stability. The ultimate utility of the method in applications

to evolution problems hinges on the answers to these questions. In

the present paper, we attempt to assess the degree to which these

questions have been answered thus far, as well as to describe sev-

eral techniques that can be used to arrive at them.

More specifically, this paper is largely an exposition aimed

at analyzing a number of techniques for arriving at error estimates

of finite-element/finite-difference approximations of certain time-

dependent problems. We shall emphasize linear diffusion equations

for two reasons: first, to keep the scope of the paper within

reasonable limits; second, because the underlying theory is more

completely developed for this class of problems. However, we shall

also consider certain finite-element approximations of the convec-

tion problem which, to date, has not been treated deeply in the

literature. Indeed, much of what we consider can be applied to

finite-element approximations of all types of linear boundary/

initial-value problems.

In the exposition to follow, we describe a number of basic

techniques for determining rates-of-convergence of approximations

of a class of linear time-dependent problems. We choose to cate-

gorize these techniques as follows:

(i)

(ii)

(iii)

(i v)

Semigroup Theoretic Estimates

Energy Methods

L2 Methods

Other Methods

3

(i) It is a widely known result in the theory of partial-

differential equations that for a broad class of evolution equations

the fundamental solution operator is a member of a semigroup of .

operators (e.g. [6,7]). By using certain basic properties of semi-

groups, error estimates can often be easily obtained, particularly

in those cases in which it is possible to transform the problem into

a one-parameter family of elliptic problems. This makes the problem

of estimating the spatial rate-of-convergence relatively straight-

forward. The rate-of-convergence of the temporal approximations can

be established by exploiting certain properties of fundamental solu-

tions. For example, if the fundamental solution is a member of a

semigroup, then it has a matrix exponential form which can be approx-

imated using the Pade~ matrix-approximation theory. The use of semi-

groups in difference approximations of time-dependent problems has

been discussed by several authors (e.g. Peetre and Thomee [8], Widlund

[9]). Some features of the methods we discuss were used by Babuska

andAzizin collaboration with Fix [10].

(ii) In most mathematical models of physical problems a norm

can be developed which is equivalent to the total energy in the

system. Normally, the study of convergence of various Galerkin

approximations in an appropriate energy norm is a very natural under-

taking. This is due to the fact that the weak forms of most boundary-

value problems of mathematical physics can be interpreted physically

in terms of changes in energy. The use of energy methods can be found

4

in the fini te- difference 1itera ture (e.g. [11]), and variants have

been used by Fujii [12] and Oden and Fost [13] for finite-element

analyses.

(iii) A fairly extensive literature has accumulated in recent

years on Galerkin approximations of the diffusion equation in which

L2-estimates in spatial variations and L2- or Loo-estimates are ob-

tained in the temporal variations. This "L2-theory" has been largely

developed by Douglas and Dupont [14, 15, 16], Varga [17], Wheeler [18],

and others. Interestingly enough, the methods do not involve energy-

error estimates and enable one to go directly to stronger L2-estimates.

(iv) There are, of course, several other techniques in use for

studying finite element approximations of time-dependent problems.

The projection methods of Thomee [19], for example, make use of the

fact that the finite-element technique produces a system of differ-

ence equations in Rn. By using standard difference concepts and

projections from Rn back into the space V in which the original prob-

lem is posed, error-estimates for certain finite-element approxima-

tions can be obtained. Alternately, certain parabolic problems can

be shmvn to have coercive properties under an appropriate choice of

norm (see, e.g., Lions and Magenes [20]). Thus, "elliptic -type"

error estimates can be obtained, as shown by Cella and Cecchi [21].

It is only a matter of interpretation as to whether or not these

"projection" and "coercive-operator" techniques do not actually be-

long to the semigroup and the energy methods described previously.

As in elliptic theory, the study of convergence of finite

element approximations rests firmly on certain results from inter-

polation theory. For this reason we discuss in the section follow-

ing this introduction certain features of interpolation theory which

(Z.l)

...5

are essential for our investigation. Next, in Section 3, we des-

cribe finite-element/Galerkin models of a general class of diffusion

problems, and in Sections 4,5, and 6 we obtain error estimates for

these models using the semigroup theory, energy methods, and LZ-

methods, respectively. In section 7 we describe finite-element

approximations of certain linear convection problems, and in Sections

8 and 9 we apply semigroup and LZ techniques to obtain error-estimates

associated with these approximations.

2. Finite Element Approximation and Interpolation

Consider a Hilbert space H whose elements are functions u(~)

of points x = (x ,x ,... ,xn) in some bounded domain Q of Rn. In~ 1 2

subsequent sections, the context shall make clear the specific

properties of H, but for the moment we need only assume that it

is endowed with an inner product, (u ,u ). A finite-element model1 2

~of Q (and H) is another region Q which is partitioned into a fin-

ite number E of disjoint open sets Qe called finite elements:

/\ EQ = LJ"ITe

e=l

Here "ITeis the closure of Qe'

a set of local basis functions

that

Within each element we identifya

~~(e)(~) which have the property

~~(e) (x) = 0N ~

x :f Qe

M,N = 1,2, ••• ,Ne;

e,f = 1,Z, ... ,E ; lal :. k (Z.Z)

Here ~~ is a nodal point labelled M in element Qf, o~~, o~, o~,are Kronecker deltas, N is the number of nodes in element Q ,e e

6

and we have used multi-index notation; i.e., ~ and ~ are ordered

n-tuples of non-negative integers, a =

following conventions are used:

(a , a , ... , a ), and the1 2 n

+ C/. + ••• + a2 n

(2. 3)

The local representation of a function in terms of the basis

functions ~~(e)(~) is of the form

Na~(e)

(2.5)

is of the formG

E E A~Xg(x)lal:'k ~=l ~ ~ -

and the global representationE

Vex) = LJ V (x ) =- 1 e -ee=

X~ (x)fi -

Here X~(x) are global basis functions given byfi -

E N (e)= U re Q N~~ (e) (x )

e=l N=l fi N -e(2.6)

(e)Nwhere Q fi defines a Boolean transformation of the disconnected

(e)Nsystem of elements into the connected model Q (i.e., Q ~ = 1 if

(e)node N of Q coincides with node x~ of Q and Q ~ = 0 if other-e _ u

wise).~ a G

Suppose Q = Q. Then the set of functions {X-ex)} -l;lal<kfi - ~- -

defines a finite-dimensional subspace of H which we shall denote

Sh(Q). Here h is the mesh parameter of the finite element mesh

(i.e., if he = dia(Q)e' h = max(h1,h

2, ••• ,hE)). We define Sh(Q)

more precisely subsequently. For economy in notation, we shall

re-label the global basis functions X~(x) as ~N(x) N=1,2, ... ,NOfi - -

7

(. . d h' ~al.e., we lntro uce an automorp lsm aN

-

global representation is of the form

Vex) (2.7)

Returning now to the space H, consider a typical element

u(=u(x)). The pair {(. ,.) '{~N}NO } define an orthogonal projec-- N=l

tion Qh : H+Sh(Q) such that

_ No NQhu = W = E (u,~ )~N(x)

N=l

where ~M(~) ::;/~N'~M)-l~M(~)' The function

(2.8)

(2.9)

is referred to as the (pointwise) interpolation error of the finite

element approximation W(x) = Qhu(~). Its properties depend ex-

plicitly on the properties of the subspace Sh(Q).

To appreciate the importance of the interpolation error ln

finite-element/Galerkin theory, consider the abstract boundary-

value problem, find uEH such that

(P(u),v) = (f,v) 'f v E M (2.10)

where P:H+M is a linear operator. The finite-element/Galerkin

approximation of the solution u of (2.10) is the function UESh such

that

The function

(P(U),V) = (f,V) (2.11)

e(~) :: u(~) - U(~) (2.12)

is the (pointwise) approximation error of U(~). Since U(~) £ Sh(Q),

there is a mapping ITh : H+Sh(Q) such that IThU = U; but ITh is not ~n

8

orthogonal projection into Sh(Q) relative to {(. ,.) '{~N}}' Indeed,

by setting v = V in (2.10) and subtracting (2.11) from the result,

we see that (Pe,V) = 0; thus, Pe is orthogonal to She

The function

£(~) = TIhu(~) - Qhu(~) = U(~) - Qhu(~) (2.13)

is referred to as the projection error. In most of the develop-

ments to follow, we show that it is possible to bound certain

norms of the projection error by the corresponding norm of the"

"interpolation error; i.e., we derive relationships of the type

(2.14)

Since e = E-£, use of the triangle inequality glves

(2.15)

Thus, (i) if the coefficient (1 + C(h)) remains bounded as h+O, and

(ii) if I lEI I +0 as h+O, we have proved convergence of the methodH

in the I 1·1 IH norm. Criteria (i) is a question of stabil~ of the

approximation, while (ii) is a question of consistency. The latter

question depends explicitly on H and the properties of Sh(Q), so

that the convergence of the method is connected to the interpolation

error in a fundamental way.

In many instances, the space H is a Sobolev space Hm(Q), the

elements of which are functions whose partial derivatives of order

< m are square integrable on Q. The inner-product in Hm(Q) lS

then

and the norm is

f E D~uD~vdQQ I al<m~ -

E f (D~u)2dQ1~I~m Q

(2.16)

(2.17)

where dQ = dx dx ...dx .1 2 n In such cases we construct the finite-

9

element subspaces S (Q) so as to have the following properties:h

(i) For every u£Hm(Q), there is a constant C such

that IIQhullm:' Cllullm for all It > O.

(ii) If p(~) is a polynomial of degree < k,

QhP(~) = p(~)

(iii) Let h+O uniformly (i.e., for each refinement of

the mesh, let the radius Pe of the largest sphere that can be In-

scribed in Qe be proportional to he)' Then there is a constant K

independent of h such that

IIElln:' Khk+l-nlulk+l

for n < m, where lulk+l is the semi-norm

lul2:: ~ f (D~u)2dQm I a l2,m Q

(2.18)

(2.19)

Interpolation results such as (2.18) were derived by Strang [22],

Ciarlet and Raviart [23], and others. We shall henceforth assume

that the spatial interpolation spaces Sh(Q) have properties (i)-

(iii) .

3. Finite-Element Approximation of the

Diffusion Equation

We now consider a class of time-dependent problems charac-

terized by equations of the form

au(~,t) + A(~)u(~,t) = f(~,t)

~ £ Q; t £ (O,T]

D~u(x,t) = 0 , X £ an; lal < m - 1~ - -

= u (x)o -

X £ Q (3.1)

where A is the 2m-th order differential operator

A(x) = E (-1) I~ID~A (x)D~- I a/ , I B 1< m g~- ~-

10

(3.2)

and the coefficients A (x) are such that A is m-elliptic (stronglya B --,-

elliptic); i.e., there exists a sesqui1inear form

a(u,v) :: (Au,v) = J E A B(X)D~UD~VdQQlal,IBI<m ~ ~-~ ~-

such that there are positive constants µ and µ for whicho 1

a(u,u) > µ IIul12- 0 m

and

(3. 3)

(3.4)

a(u,v)< µ Ilullllvll (3.5)- 1 m m

We then replace (3.1) by the equivalent (weaker) variational prob-

lem,

(dU(t) ,v) + a(u(t) ,v) = (f(t) ,v)at 0 0

(u(',O),v) = (u ,v) , V V E Hm (Q)o 0 0 0

V t E (O,T] (3.6)

~here Hm(Q) is the Sobolev space of Hm(Q) functions with compacto

support in Q.

Now consider a Galerkin approximation of (3.5) which involves

seeking a function U(~,t) E Sh(Q)XC1(0,T) such that

( au ( t) V) + a (U ( t) ,V) = ( f (t) ,V)at ' 0 0

t E (O,T]

(3. 7)(U ( • ,0) ,V) = (u , V)o 0 0

Since a continuous dependence on t is still assumed, U(~,t) lS

referred to as a semidiscrete Galerkin approximation. Now, the

coefficients AN in (2.7) are functions of time t. Thus, intro-

ducing (2.7) into (3.7) we obtain a system of first-order differ-

ential equations for the specific coefficients AN(t) corresponding

to the finite element approximation:

11

~oc AM(t) + ~oK Ar>'[(t) = f (t)~1=1 NM M=l NM N

~oc AM(O) = 9- (3 8)NM N .M=l

Here

fN = 9-(~N) ;

9-N = (u , ~N)I 0 1

and AM(t) = dAfv1(t)/dt.

In practical calculations, we introduce the partition P of

[0,T] comp 0sed 0 f the set {t ,t ,..., tR} \II her e 0 = t < t <o 1 0 1

< tR = T with t 1 - t = ~t; and we introduce the sequence {Un}Rn+ n n=o

to denote the value of the function U(t) E Sh(Q)XCl(O,T) at the

time points of partition P. Thus {Un}R = {Vet )}R . Then wen=o n n=o

construct a family of finite difference-Galerkin approximations

associated with parameter B (0 ~ e ~ 1) which represent solutions

to the following equation:

(OtUn,V) ° + (l-e)a(Un+l,V):+ Ba(Un,V) = (f (t) ,V)

°t E CO,T]

(3.9)v V E S h (Q)

l.e., 0tUn =

Evaluation of (3.10) with the finite element approxi-

(UO,V) = (tio,V)° °

denotes the forward difference operator;where 0t

Un+l _ Un~t

mation (2.7) leads to a system of algebraic equations for the

coefficients An = AN(t ).N n .

N ° n+lM~l [CNM + ~t(l- e) °NQKQM]A~l

N .EOC AO = 9-

M=l N~r-M N(3.10)

12

4. Error Estimates for the Diffusion Equation

Using Semigroup Theoretic Results

The calculation of error estimates for equation (3.6) can be

embedded in the theory of the approximation of elliptic partial

differential equations by using semigroup tools. In particular,

the traditional characterization of the semigroup through the resol-

vent operator can be used to transform the parabolic diffusion prob-

lem into a one-parameter family of elliptic problems. The error ln

the spatial discretization for the parabolic problem can then be

related to errors encountered in modeling elliptic problems. On

the other hand, errors due to the discretization in time can be

determined through the exponential function representation for the

semigroup and its relationship to the Pade~ approximations.

Initially, we must define the components of the error of the

approximation scheme. u(~,t) is the exact solution to (3.6);

U(~,t) is the solution to the semidiscrete Galerkin approximation

(3.7); and Un(x) is the solution to the finite difference-Galerkin-approximation (equation (3.~)) at time t = n~t. We introduce the

definitions,

e(x,n~t)-o(~,n~t)

= u(x,nfit) - Un(~) = approximation error (4.1)

= u(~,nfit) - U(~,nfit) = semidiscrete approxi-

mation error (4.2)

Un(x) = temporal approximation

error (4.3)

Then, for any choice of norm on ~,

Ile(t)11 = Ilu(·,n~t) - Un(o) + U(',nfit) - U(',nfit) I 1

= 110 +TII ~ 11011 + IITII (4.4)

g(O) = So

13

The evolution problem (3.1) is reminiscent of the linear

dynamical system

g - ~g = f

the solution of which is

q = e~tq + fte~(t-s)f(s)ds- -0 0 -

For simplicity, let us assume that ~(s) = O. Then the matrix

~(t) given by

is called the fundamental solution operator and

q(t) = E(t)q- "'-0

The operator ~(t) is a member of a multiplicative semigroup, G;

i.e., if E (t) and E (t) are in G, then the semigroup properties,-1 -2

(i) E'E E G-1 ~2

(closure) (4.5)

(ii) E •(E •E ) = (E 1 • E ).E (assoc iat ivi ty)-1 -2 ~3 - -2 ~3

are satisfied. In fact, we also have the important properties,

§(t)§(s) = ~(t + s)

limt+O E (t) = I- - (4.6)

How does one construct the fundamental solution operator ~(t)

from a given matrix A? Let L(g(t))

form of get). Then

g denote the Laplace trans-

Thus, if s is not an eigenvalue of ~,

- )-19 = (s!-~ goE(t) = L-l(sI-A)-l- - - (4.7)

NOlV a similar situation is encountered -in the use of the

weak formulation of partial differential equations of the type

(3.6). In this case we define a fundamental solution operator

E(x,t) such that- -

14

u(x,t) = E(x,t)u (x)- - - 0-

Then it is clear from (4.7) and the definition of the resolvent-1operator R(x,s) = (sI-A) that

....., ....., - -

Hence, if u = L[u]

R(x,s) = f~e-stE(x,t)dt- - 0 --

(4.8)

and, of course,

R(x,s)u (x)- .....,0-

(4.9)

(4.11)

where r is a contour in the complex plane. Now we take the Laplace

transform of the weak parabolic partial differential equation (3.6)

and set f(t) = 0, for convenience. Then

L[(()1(t) ,v) ] + L[a(u(t) ,v)] = 0at 0

Nmv

and

L[( a~(t) ,v)].at

= f ft€stdu(x,t)v(x)dtdQQ 0 dt -

= sfQu(~;s)v(~)dQ - fQuo(~)v(~)dQ

= s(u(s) ,v) - (u ,v)000

(4.12)

L [a(u(t) ,v)] = a(u(s),v) (4.13)

Hence introducing (4.12) and (4.13) into (4.11), we obtain the

boundary-value problem

s(u(s),v) + a(u(s),v) = (u ,v)000

(4.14)

Similarly, if U is the Galerkin approximation to the exact

15

solution, then we define the approximate fundamental solution

operator ~h(~,t) so that

(4.15)

If we introduce the approximation ~h to matrix 6, an approx-

imate resolvent operator ~h(~'s) is defined such that

~h(~ ' s) = (s I-~h) -1

Then the approximate resolvent is related to the approximate fun-

damental solution operator through

~h(~,t) = L-l[(S!-~h)-l] = L-l[~h(~'s)]

Thus,

(4.16)

Hence, 'ifU = L[U], then the transformed version of (4.15) glves

iJ(~,s) = ~h (~,s)uo (~)

and, of course,

U(~,t) = 2;' fr~(~,s)estdsuo(~)1

(4.17)

Now if we take the Laplace transform of the semidiscrete

Galerkin equation (3.7) and set f(t) = 0 for convenience, we ob-

tain an approximate boundary-value problem for (4.14)

s (U(s) ,V) + a(U(s) ,V) = (u ,V) V V£Slt(Q) (4.18)a a a

Subtracting (4.18) from (4.14) and defining the transformed approx-

L[a(t)] = L[u(t)-U(t)], we get

V V £ Sh(Q) (4.19)

a(s) by u(s) - U(s) =S(0(5) ,V) + a(o(s) ,V) = a

a

Thus it is clear that if we can estimate the approximation error

imatiori error

a(s) for the boundary-value problem (4.14), then we can deduce the

approximation error for problem (3.6) by using the inverse

16

Laplace transform, (4.10), and (4.17) ~

a (t) = L -1 [0 (s)] = L -1 [u(s) - IT(s) ]

To insure that the inverse Laplace transform exists, we assume

that the operator ~ is the infinitesimal generator of a strongly

continuous semigroup. The Hille-Yosida-Phillips theorem (Friedman

[24]) guarantees that if the operator ~ is the infinitesimal gen-

erator of a strongly continuous semigroup, there exist real numbers

M and w such that for every R s > w, s is in the resolvent set ofeA (i.e., (s !-6)-1 exists) and

IIR(x,s)nll < M- - - (s-w)lI n = 1,2,3, ... (4.21)

Thus if we select the contour r in the complex plane so that if

S E r, S is-i~ the resolvent set, Res> B, and Res = B as s+oo

where B is a small negative constant, then the integrand in (4.20)

is bounded in accordance with. (a:2l) as s increases. Thus the

Laplace transform (4.20) exists. We choose B so that lsi < µ .u

The operator A satisfies the m-elliptic condition (3.4).

Thus the operator ~ + Re sl occurring in (4.14) also satisfies

an m-elliptic condition since

= a(v, v) + Res (v,v) 0

= a(v,v) + Res Ilvll~

> a(v,v) + Bllvl12

m> µ II v II

2 m (4.22)

Thus the boundary value problem (4.14) is strongly elliptic for

17

each s € r. Equation (4.22) leads to the following error estimate.

Theorem 4.1. If TI(s) is the Galerkin approximation in Sh to ~(s)

the solution of problem (4.14), then

(4.23)

Proof: From the m-elliptic property of operator A + Re s~ (4.22)

we have that if TI*(s) is an arbitrary element of Sh

µ211~(s) - U(s) II~:. Res(~(s) - U(s) ,u(s) - U(s))

+ a(~(s) - TI(s), ~(s) - TI(s))

= R s(~(s) - TI(s), ~(s) - U*(s) +eU* (s) - U ( s)) + a (u(s) - IT(s),

u(s) - U*(s) + U*(s) - U(s))

= Res (~(s) - TI(s), ~(s) - U* (s))

+ a(~(s) - U(s), ~(s) - U*(s))

(4.24)

To obtain the last result, we have used the identity

Res(~(s) - TI(s), U*(s) - U(s)) + a(u(s) - U(s), U*(s) - TI(s))

= 0

obtained by taking the real part of (4.19) and setting V = U*(s)

TI(s). Now using the Schwarz inequality (3.5) in (4.24)

µ.llu(s) - U(s)hI2 < ResM Ilu(s) - U(s)llo Ilu*(s) - U(s)llo2 m - 1

+ µ 11~(s) - U(s) II IIU*(s) - U(s) II1 m m

< ( R sM. M + µ ) II ~ (s) - U(s) I te 1 21m

IIU*(s) - TI(s)II (4.25)mThus, simp1if~ing (4.25), we obtain

11~(s) - TI(s)II < (ResM1M2 + µl) IIIT*(s) - TI*(s)IIm - m

µ 2

(4.26)

Now let U*(s) be the arbitrary element of Sh which interpolates

li(s). Then we see from the interpolation result (2.18) that

18

Ilu(s) - U(s)11 <m -

This completes the proof.

Thus the error estimate for the semidiscrete approximation (3.7)

to (3.6) can be determined.

Theorem4.2. If u(x,n~t) is the solution to (3.6) at time t = nfit-and U(x,nfit) is the solution to (3.7) at time t = n~t, then

Ilcr(x,nfit)II = Ilu(x,nfit) - U(x,nfit) II- m - - m

:. C2hk+1-mlu(x,nfit) I

k+

l(4.27)

Proof: 'Using the trans form (4.20) -be tween the error involved in

the approximation of equation (3.6) and (4.14) the result follows

Ilcr(~,nfit)11m = Ilu(~tnfit) - U(~,n~t) 11m

= II 1 J (u(s) - U(s)) estds II2lTi r m

< C hk+ 1- mJ Iu(s) I est ds- 2lTi r k+l

= C hk+l-mlu(x t) I2 _, k+l

Now let E} (X;t,T) be the fundamental solution operator assoc-I -

iated with the semidiscrete Galerkin approximation (3.7) and

E}~t,e(X;(V+l)fit,vfit) be the fundamental solution operator (oftenI _

called the amplification matrix) associated with the finite dif-

ference-Galerkin approximation (3.9). Then

(4.28)and

Un+l(x) = Eh~t,e(~;(V+l)fit,v~t)un(~) (4.29)

The operators Eh and Eh6t,e have the semigroup property (4.6).

Thus

(4.31)

19

(4.32)

Now the semidiscrete problem (3.7) is assumed to be well-posed and

the finite difference-Ga1erkin problem (3.9) is assumed to be

stable (depends continuously on the initial data). Thus the fun-

damental solution operator and the amplification matrix are bounded

IIEh(~;(V-l)fit,O)u(~)llm:' C31IU(~)"m v = O,l, ... ,N

V U £ Sh(Q) (4.33)

IIEh~t,e(~;n~t,vfit)V(~) 11m:' C41IV(~) 11m 0:. v:.n:.N

V V £ S h (Q)

(4.34)

Equation (4.34) implies that

IIEhfit,e(~;n6t,vfit) II =

o < v < n ::. N (4.35)

Consider the semidiscrete finite element-Galerkin equation

(3.8). Solving (3.8) for the value of the solution vector AM(v~t)

based on the "initial value" of AM((v-l)fit) and setting fn=O

AM(V6t) = EM8A8((V-l)~t) (4.36)

where E~,18= e-fitBM~ and B~lB = C~,ln-lKn 8. The matrix EM8 is a dis-

crete approximation for the fundamental solution operator Eh. Sim-

ilarly, solving the finite difference-Galerkin equation (3.10) for

the value of A~ in terms of A~-l and setting fn =0, lve obtain

( 4. 37)

where

20

The operator F~S in (4.37) is a discrete analogue for Ehfit,e.

A useful approximation for the term EMS in (4.36) can be ob-

tained through the Pade~ approximations. The Pade~ approximations

Rp,q (~tBMS) are a rational matrix approximation for e-fitBMS de-

fined by

where

Rp,q(~tB~ ..IB) = nptq(~tB~IS) [dp,q(~tBMS)]-l

= e-fitB~I13+ O(~tBrl13)r (4.38)

r =

q 1- kE (p+q-k)! [(p+q) !k! (q-k)! ]-l.-LitBM13)

k=OpE (p+q-k)! [(p+q) !k! (p-k)! ]-l(~tBt'IS)k

k=O

p+q+l

From (4.36) ,'(4.37), and (4.38) we see that the following re-

lationships hold

RO,l = F~IS = EMS + 0(~t)2

R = F~13 = EMS + 0(~t)2 (4.39)1 , 0

1R 1 , 1 = Ft113= EMS + 0 (~t) 3

Here-the' choice of e corresponds to the forward difference, back-.ward difference, and Crank-Nicholson schemes, respectively. The

notation indicates that each term of the matrix FM~ can be expressed

as the sum of the corresponding term in the matrix EMS plus a term

of order of magnitude ~tr. Equation (4.39) implies that

No\V

EMS

Ei'-IB

EMS

F~B = 0(~t)2

F~lS = 0(~t)2

FAs = O(fit)3

(4.40)

(4.41)

21

Using (4.40), and (4.41), we find that

IIEh(~;v~t,(v-l)fit)

IIEh(~;vfit,(v-l)~t)

IIEh (~;vfit,(v-I) fit)

- E~t,l(~;V~t,(V-l)~t) II <

- E~t'O(~;v~t,(V-l)~t) II <1

- E~t , 'Z (~ ; v fi t , (v - 1) ~t) II <

(4.42)

Based on the previous results, the temporal error estimate theorem

can be introduced. We will prove the temporal error estimate the-

orem only for the forward difference approximation (8=1). Error

estimates for other temporal operators can be derived in similar

fashion.

Theorem 4.3. Let U(x,n~t) be the solution to the semidiscrete-

Galerkin equation (3.7) at time t=nfit and Un(x) be the solution to

the forward differenced Galerkin approximation, (3.9) with 6=1, at

time t=n~t. In addition let (4.33), (4.35), and (4.42) hold. Then

the temporal approximation error is

IIT(x,nfit)11 = IIU(x,nfit) - Un(x)11 < C ~t IIUO(x)11- m - - m- 8 - m

(4.43)

Proof: Using the semigroup property (4.31) and (4.32)

II T(x,nfit)II = II U(x,n~t) - Un(x) II- m - - m

II(~h (~;nfit,O) - E~t, 1 (~;nfit,O))UO(~) I'm

nI I z: Eh~t,l(x;n~t,v~t) [Eh~t,l(x;v~t,(V-l)~t)

1 ~ ~v=

n< E IIE}~t,l(x;n~t,vfit)11 IIEhfit,l(x;v~t,(V-l)~t)

1 1 - ~v=

22

And using (4.33), (4.35), and (4.42)

IIT(x,nfit)11 < nC C ~t2C IIUo(x)11~ m- 45 3 - m

The total error estimate of the approximation is given in the

following theorem.

Theorem 4.4. If the components of the error of the approximation

are given by (4.1), (4.2), and (4.3), then

Ile(~,n~t) 11m :. C2hk+l-mlu(~,n~t) Ik+l + C/~t IIUu(~) 11m

(4.44)

Proof: From (4.4)

Ile(~,n~t) 11m:' Ilcr(~,n~t)11m + IIT(~,n~t) 11m

But introducing (4.27) and (4.43)

Ile(~,nfit)11m:' C2hk+l-mlu(~,n~t) Ik+l + Ca~t

S. Error Estimates for the Diffusion

Equation Using the Energy Method

We often find it mathematically convenient and physically

appealing to seek an error estimate that indicates the error in

energy of an approximation. This estimate of the error in energy

of the approximation requires the definition of an appropriate

energy norm. For the diffusion problem the most natural energy

norm can be constructed using the bilinear form a(u,v) connected

with the operator ~ (3.3). The bilinear form a(u,v) satisfies

the m-ellipticity condition (3.4). Thus effectively, an inner

product space HA is introduced in association with the operator

~ and the inner-product a(u,v). The norm associated with this

23

inner-product is called the energy ~,

IluliA = la(u,u) (5.1)

the properties of which depend intrinsically on the operator A.

The energy space HA is topologically equivalent to the Sobolev

space Hm(Q); i.e., positive constants y and y exist such thato 1

y 0 I Iu I 1m:" I I u I I A :. y 11 I u I Im (5.2)

In the subsequent developments we find it convenient to replace

the energy norm II' II A by the Sobolev norm II' II m us ing (5.2).

Now suppose ul . is the exact solution to (3.6) evaluated atn

time point t=n~t and Un is the solution to the finite difference-

Galerkin equation (3.9) at time point t=n~t. We seek an estimate

in energy for the difference between uln and Un. In this section

we consider only the case in which the temporal operator is the

forward difference operator in time. Thus we set 8=1 in (3.9).

Now let Wn be an arbitrary function in the subspace Sh(Q). Then

we define the error components

en ::: U 1'1 - Unn

En = Wn _ Un

En = u In - Wn

n ,', IE =-~ul - 0tU it,. at n

(5.3)

We can now cite a theorem which specifies the behavior of the

error.

Theorem 5.1. Let en, En, En, and En be the error components de-

fined in (5.3). Then

a(En En) :::-(0 en En) - a(En En) - (En En), t' 0 ' , 0(5.4)

24

Proof: Subtracting (3.9) with 8=1 from (3.6) evaluated at t=nfit

- 0tul + 0tul. - 0tUn,V) + a(ul: - wn + wn - Un V) = 0n non '

V E Sh(Q)

(5.5)

(aulat n

(~In - 0tUn,V)o + a(uln - Un,V) = 0

where~ln denotes }f evaluated at time point

in the form

t=nfit.

(5.5)

Rewriting

and introducing (5.3), we get

(En + 0 en V) + a (En + En,V) = 0 V E Sh (Q) (5.6)t ' a

Now setting V=En and using the bilinearity of (.,.) and a(','),0

we obtain (5.4).

The error component En can be estimated using the following

theorem.

Theorem 502. If the error components en, En, En, and En are re-

related by (5.4), then there exist positive constants µ , y , µ , andall

M such that2

µ IIEnl1 < y~IIEnll + 2µ IIEnl1 + 2~.IIIEnll (5.7)o m- m 1 TIl 2 0

Proof: The last term on the right hand side of (5.4) can b~ es-

timated using the Schwarz inequality for the L2 inner-product and

the embedding result IIEn 110 :.. c II En II m' There exist positive con-

stants M and M such that1 2

(En, En) a :. M 1 II En II a II En II 0 :. ~\ II En II a II En II m

Then introducing (3.5) and (5.8) into (5.4)

a(En,En) < _(Oten,En) + µ IIEnl1 liEn/1m- 0 1 m+ H II En II II En II m

2 a

Now to estimate (Oten,En)o set V=En in (5.6)

_(Oten,En)o = a(En,En) + a(En,En) + (En,En)o

(5.8)

(5.9)

(5.10)

2S

Introducing (5.1), (5.2), and (5.8) into (5.10)

_(Oten,En) < y211Enili + µ IIEnllm IIEnllm + t-I IIEnl1

o - 1 1 2 0

Now introducing (5.11) into (5.9) and using the m-elliptic property

of a(' ,.) (equation (3.4)), we obtain the energy estimate.

µ IIEnl12 < a(En En) < y211Enl12 +o m- , - 1 m

+ 2M II En II2 0

2 µ II En II IIEn II1 m m

II En II m

The resul t (5. 7) follows by dividing by II En IIm'

Then the estimate of the approximation error en is contained

ln the following theorem.

Theorem 5.3. If the error components en, En, En, and En are defined

by (5.3) and En satisfies (5.7), then there exist positive constants

M , M , M , and M such that4 5 6 7

and

II en II <!\I IIEn II + M IIe;nIIm- 5 m 4 0(5.12)

Ilenll < M hk+l-mlulk+l + M fit II lull I (5.13)m - 6 7 2

where IIIull12 = f(d2U)2dQ.

2 Q dt2

Proof: The first result follows from (5.3) and (5.7). Equation

(5.7) implies that

IIEn IIm :. M 3 IIEn IIm + M4 IIe;nII 0

But from (5.3)

Ilenllm= I IEn+Enl 1m

< II En II m + II En I 1m< (1 +M ) II En II + M IIe;nII

3 m 4 0

= M IIEn II +;"1 IIe;nII5 m 4 0

(5.14 )

26

The second result can be obtained by expanding u in a Taylor's

series about t=n6t, introducing the result into (5.3) , and taking4

the L2 norm.

II £n II < cfit III u III (5.15)0- 2

Then introducing (Z.18) and (5.15) into (5.14), we obtain (5.13).

6. Error Estimates for the Diffusion Equation

Using L2 Methods

The derivation of LZ error estimates for finite element-Galerkin

models for the diffusion equation follows very naturally from the

interpolation theory developed in section 2. The key point in the

derivation of the L2 error estimate is establishing a bound on the

approximation error for all time points based on the pointwise error

estimate in time. Traditionally this has been carried out through

a discrete version of Gronwall's Inequality (Lee [26]). The method

used here to derive error estimates for the finite element-Galerkin

solution circumvents the complex arguments involved in using the

discrete Gronwall's inequality. For simplicity, the finite element-

Galerkin solution will be defined as piecewise linear between dis-

cretization points in time. Then we identify the nodal values in

time, effectively, through a collocation procedure at the temporal

node points. Thus, we embed the approximate solution in the temporal,

as well as the spatial, variables in a subspace of the original sol-

ution space. This procedure leads to the use of the previously

developed interpolation results in time, as well as space.

We consider the solution of the weak parabolic problem (3.6)

with f(t) = O. The exact solution to this problem has been charac-

terized by Goldstein [27]

27

(6.1)

However if the coefficients AaSin equation (3.2) are such that~-A~~ £ COO(Q) and Dr(A~~) £ Loo(Q) for all ~, ~, and y, then

u £ COO[O,T;Hm(Q)] (6.2)

We assume in this section that the coefficients AaS are constants~

so that (6.2) holds. However, this choice is not basic to the

method of derivation of error estimates to be introduced here. In

fact, the methods used here are valid as long as the regularity

of the exact solution u is such that

Let us define the space ~1 [O,T;Sh(Q)]. If PC1[0,T;Sh(Q)]

is the space of functions which are continuous with piecewise con-

tinuous derivatives between the points of the partition P of [O,T]

introduced in Section 3, then pel [O,T;Sh(Q)] is the subspace of

PC1[0,T;Sh(Q)] of functions which are piecewise linear; Sh(Q)

is the finite dimensional subspace of Hm(Q) described in Section 2.

We seek an approximate Galerkin solution Un to (3.6) using the

finite difference-Galerkin approximation (3.9) with 8=1 (this cor-

responds to the forward difference in time case). The solution of

this problem is fully equivalent to the solution of the following

problem. Find a U £ ~l[O,T;Sh(n)] such that

(!¥(!l,V)~=n6t + a(U(t) ,V)t=n~t = 0 V V £ PC1[0,T;Sh(Q)]

n = O, ... ,R

(U ( • ,0) , V) = (u ,V)o 0 0

(6.3)

where the notation indicates that a collocation is performed at

time points t=n~t, n=O, ... ,R. Our subsequent developments pertain

to (6. 3) .

28

Now let Wet) be that element of 'P"C1 [O,T;Sh(Q)] which inter-

polates u in the sense of (2.8), and let Un = U(n~t). We can

define the error components of the approximation by

e = u(t) - U(t)

E 1 = Wet) - U(t)

E = Wet) - Un(6.4)

2

E = u(t) - Wet)

Then the following theorem describes the behavior of the error.

Theorem 6.1. If u(t) is the exact solution to (3.6), U(t) is the

approximate solution to (3.6) defined by (6.3), Wet) is the inter-

polant of u(t) defined by (2.8), and the error components are

defined by (604)" then.

rtfit:.t :. (n+l)fit

Proof: Subtracting (6.3) from (3.6)

(6.5)

(duCt) Vet)) _(dUet) V(t))t=n~t + a(u(t) ,Vet))at.' 0 dt ' 0

- a(U(t),V(t))t=n~t = 0

Tf V E F'C'l[o,T; h(Q)] (6.6)

But (dU(t) V(t)t=n~t = (dU(t) Vet)) for nfit< t < n+l(~t), sinceat' 0 '\... ' - '-

U(tl is piecewise linear in t. Thus.1.1.-

( d u.~~)_d Uit),V (t)) + a (u(t)-Un ,V (t)) = 0 Tf V E ~ 1 [0,T ;Sh (Q) ]o

(6. 7)

29

Rewriting (6.7)

(au(t)_aW(t)+aW(t)_aU(t) Vet)) + a(u(t)-W(t)+W(t)-Un,V(t))=Oat at at at' 0

Then introducing (6.4) into (6.8) and setting V = El

(U1,E) + aCE ,E ) = _(aE,E "\ - a(E,E )()L 1 2 1 at 1-'0 1

(6.9)

The result C6~5) follows directly from (6.9) by splitting up the

second term on the left:.hand side.

The following theorem describes the variation of E with time:1

Theorem 6.2. Let the hypotheses of Theorem 6.1 hold, then2

~ LIIE 112 :.~11~112 + nilE 112 +!:l. IlaU(t)(t_n~t)112dt 1 0 4nat 0 1 0 4µ at mo

n~t < t < (n+l) fit (6.10)

Proof: E -E = U (t) -un = a ¥ ( t) (t-:n~t) for nfit < t < (n+1) 6t .2 1 t-

Thus using the Cauchy-Schwarz inequality (3.5)

a (E -E E) = a (au (t) (t-nfit) E )2 l' 1 at ' 1

< µ II au (t) (t-nfit) II II E II nfit < t < (n+1) fit1 ~r m 1 m

Now using the elementary inequality ab < ~ [Ea2+(.!.)b2] for E > a- £

(with a = µ IlaU(t)(t_n~t) II b = liE II and E = ~)1 at m' 1 m' 2µ o

aCE -E ,E ) < ~II aU(t) (t-nfit)112 + µ II E 112 n~t < t < (n+l) fit2 1 1 - 4µ at mOl m

o(6.11)

Now by the definition of the interpolation operation (2.8) used

to define Wet) £ 'i'5t'l[O,T;Sh(Q)]and the best approximation property

of Hilbert spaces (O'de'n1[28]), we have that

(E, V) = 0o

(6.12)

30

Now for the moment we suppress the dependence on time. If V E

IZl[O,T;Sh(Q)], then for any particular time point, V E Sh(Q). Now

Sh(Q) contains all polynomials of degree ~k, and A is the

operator of order 2m introduced in (3.2). In this development we

restrict the subspace Sh to the class of subspaces which allow inter-

polation in the sense of (2.8) of derivatives through the m-lth order

of the solution U. And assume that A maps Sh(Q) into Sh(Q) .. This implies

-. -1if V £ Pc1 [O,T;'Sh(Q)], then AV £ PC [O,T;Sh(Q)J. Thus

a(E,E ) = (E,AE )1 1 0

= 0

Introducing (6.11) and (6.13) into (6.5) gives

(6.13)

+ ~II au (t) (t-nfit)112

4µ Clt mo

+ µ liE 112

o 1 mn~t < t < (n+l)fit (6.14 )

Now using the definition of the LZ norm and the m-elliptic property

of (3.4), we get

l.-dIIE 112 + µ liE 1122 dt 1 0 0 1 m

+ µ 'IE 112

o 1 mnfit < t < (n+l)fit (6.15)

2Thus cancelling µo IIE 111m on each side of (6.15) and

inequality (a,b) < !..../laI12+ nllbl12 to estimate the

- 4non the right hand side in (6.15), we obtain (6.10).

using the

first term

31

The following theorem gives an estimate for E at the dis-1

cretization points in time.

Theorem 6.3. Let the hypothesis of Theorem 6.1 hold. Then

II E "L ( ) = sup I I E (M~t) I I ~ c [I I E (0) I I + I IE I IH 1 (L )1 00 L2 O<M<R 1 0 It 1 0 2

+ fit II duet) II ],\, LZ(HITI)

(6.16)

where, for example, the notation L2(Hm) indicates the function lS

in LZ in the time variable and Hm in the spatial variable.

Proof: Integrating (6.10) from t=nfit to t=(n+l)~t gives

liE (n+l)~tI12- liE (nfit)I 12 < ~ f(n+l)fifldEI12dt

1 0 1 0 - Zn nfit dt 0

(n+l)~t 2+ Zn fnfit IIElllodt

2 (+ 1)+ ~ f n tJtII dUet) (t-n~t) 112dt

2u 0 n~ t d t m(6.17)

Now simplifying the last term on the right hand side

(n+1) tJtII dU (t) (t-n~t) II J~dtf atnH "(n+1) 6t au (t) 112 (t-nH) 2dt

= f II dt mnfit

I" ,t !

r ' t I

d ~t2 f(n+l)~tlldU(t)112dt3 - n4! ',_ d t m

)-... -- --- J.......... s-

t I • '-::1 (6.18)

Introducing (6.17) into (6.18)

liE (M~t) II 2 - lIE ( 0) I I 21 0 1 0

and summing from n=O to n=M, we obtain

< L ll£lt I I~ II 2 d t + Zn lIt:. l I I E II 2 d t- 2n 0 dt 0 0 1 0

+ U ~ fit2 /'tJ t II dU (t) II 2dt (6 .19)6u 0 dt mo

•

32

But II aU(t) II < c II au(t) II for all t £ [O,M t]. C is of theat m - 2 at m 2

form l+f(h,~t). We do not introduce the explicit expression for

f into our equations because it can only result in higher order

terms in the error estimate. This result then implies that

T+ 2n f IIE II2dto 1 0

(6.20)

2 1 II aE II 2II E 1 (M~ t) II ~ :. [II E 1 (0) II 0 + 2n" at L 2 (L 2)222

11 C2 2 lau(t)11 ]+ --L-~t I at L2(Hm)6110

But the classical Gronwall's inequality (Bellman [29]) implies

that if Ix(t) 1< a + ft1x(s) Ie ds where a and C are positive con-- 0

stants, then Ix(t) I~ a ect. Thus

I I E ( M~t) I I 2 < [I I E ( 0) I I 2 + Lz I I ~E I IL (L )1 0 - 1 0 n at 2 2

+ µ~C 22~ t 21 I aU ( t) I I 2 ] e Z n T

6110 at L2(Hm)

or uSlng the embedding result II~EIIL (L ) ~ C /IEIIH

1(at 2 2 3 LZ)

IIE1(M~t)"0:' C4[IIE1(0)llo + IIEIIl-p(L2) + fit·/Ia~~t)IIL2(Hm)]

(6.21)

The result (6.16) follows by taking the supremum of (6.21) for all

integers M ,,,ith0 < M < R. Note that PCl [0,T;S (Q)] is a space ofh

piecewise linear functions. Thus since El £ PLl[O,T;Sh(Q)], El

attains;its maximum at one of the discretization points in time.

Thus the supremum of the El at the discretization points in time

is exactly the Loo norm.

33

The approximation error can then be established through the

following theorem.

Theorem 6.4. Let the hypotheses of theorem 6.1 hold. Then

IleIILz(Lz) :. cs[lle(O) 110 + IIEIIH1(L2

) + 6t '1Ia~~t)IILz(Hm)]

(6.22)

Proof: Using (6.4),

IleIIL2(L2) = IIEIIL2{L2) + IIE11IL2(L2)k

< IIEll L (L ) + Til E 1IIL (L )Z 2 00 2

(6.23)

Then introducing (6.16) into (6.23), we have~ ~

I Ie IIL (L ) < TC IIE (0) II +·1 I E II . + T C I IE II 12 2 - ~ 1 0 _L2(L2) 4 H (Lz)

k+ TC fit:llau(t)1I

4 at L2 (HID)

< C [lle(O)11 + IIEII 1 + ~t:llau(t)IJ ]soH (LZ) at L2 (Hm)

We can now use the interpolation results of section Z to

derive the final error estimate.

Theorem 6.5. If the space pel[O,T;Sh(Q)] satisfies conditions

(i), (ii), and (iii) in section 2, then

IIell L (L ) :. c II e (0) II + C hk+lll u II k 12 2 5 e, 8 H ~(H + )

+ C ~t II au II5 at L2 (Hm)

(6.24)

Proof: Th~ interpolation error E is caused by an orthogonal pro-

jection Qx in space and Qt in time. The projection operator Qh

34

(equation (2.8)) is the composition

Qh = QxQt

Then using the triangle inequality and the fact that IIQ I I < 1x

But from the interpolation result (2.18)

k+lII u - Qxu II I-[l (L ) :... C 6h II u I IH1 (IIk + 1)

" 2

Ilu-QtuIIH~(L2):' C7f.t IluIIH2(L2)

(6.25)

(6.26)

Then introducing (6.25) and (6.26) into (6.22), we obtain the

estimate (6.24).

7. Finite Element Approximations

for the Convection Equation

We now consider a class of time-dependent problems character-

ized by equations of the form

au(~,t) + L(x)u(x,t) = f(x,t)at X E Q t £ [O,T]

X £ aQ- (7.1)

X £ Q

where L is the first order differential operator

L(x) = E Aa(x)D~1~1=1

(7.2)

35

Now we replace (7.1) by the weaker but equivalent problem,

(dU(t) v) + (Lu(t) ,v) = (f(t) ,v)dt ' 0 0 0

(u(',O),v) =(u ,v)o 0 0

'if t £ (O,T] (7 .3)

Now we construct a semidiscrete Galerkin approximation of

(7.2) which involves seeking a function U(~,t) E Cl [O,T;Sh(Q)]

( d ~ (t) ,V) + (L U(t) ,V) = (f (t) ,V)tOO 0

( U ( • , 0) , V) = ( u ,V)o 0 0

'rJ t E (O,T] (7.4)

Now introducing the finite element approximations (2.7) into

(7.3) we obtain a system of first order differential equations

for the coefficients AN(t)

(7.5)

where

<PN

temporal operator is replaced by a finite difference operator in

time through a procedure similar to that introduced in Section 3 ..In this manner we construct a family of finite difference - Galerkin

approximations associated with parameter 8 (0 :. 8 ~ 1) which rep-

resent solutions to the following equation:

+ (1-8) (LUn+l V), 0 + 8(LUn,V) = (f(t),V)o 0

(U 0 , V) 0 = (u0 ' V) 0

t £ (O,T) V V E Sh(Q)

(7.6)

..36

where t denotes the forward difference operator in time; i.e.,

8tUn = Un+l_Un. Evaluation of (7.5) with the finite elementfit

approximation (2.7) leads to a system of algebraic equations for

the coefficients A~ = AN(tn)

N Na n+l 0 nE [CNM + fit(1-8)0NQDQM]Ai,f = E [CNM - ~t80NQDQM]AN + fitfN

~I=l ~I=l(7.7)

8. Error Estimates for the Convection Equation

Using Semigroup Theoretic Methods

We seek an estimate for the error encountered in the approx-

imation of (7:3) by the finite difference-Ga1erkin approximation

(7.6) with 8=1 (the forward difference in time case). Let u(x,n~t)

be the solution to (7.3) at t=nfit, U(x,nfit) be the solution to' the

semidiscrete Galerkin approximation (7.4) at time t=nfit, and Un(x)-be the solution to the finite difference-Galerkin approximation at

t=n~t. Then we can define the error components of the approxima-

tion

e(~,nfit) = u(~,nfit)

o(~,nfit) = u(~,nfit)

Un(~) = approximation error

U(x,nfit) = semidiscrete approx-

imation error

(8.1)

(8.2)

temporal approximation

error (8.3)

Now to establish the semidiscrete approximation error we use

a technique which is essentially the same as adopted in section 4.

37

We take the Laplace transform of the weak hyperbolic equation

(7.3) and denote the transformed solution by u(~,s) = L[u(~,t)]

s(u(s),v)o + (Lu(s),v)o = (uo,v)o (8.4)

Note that in developing this equation we have assumed that f(t) = 0

for convenience. Similarly taking the Laplace transform of the

semidiscrete Galerkin approximation (7.4) and denoting the trans-

formed solution by U(~,s) = L[U(~,t)]

s(U(s),V)o + (LU(s),V)o = (uo,V)o (8.5)

Now subtracting (8:5) from (8.4) and defining the transformed

approximation error by u(s) - U(s) = a(s) = L[aCt)] = L[u(t) - U(t)]

s(er(s),V) a + (La(s),V) = 0 (8.6)

We can now introduce a theorem establishing the magnitude of

erthe transformed semidiscrete approximation error.

Theorem 8.1. Let u(x,n~t) by the solution to (7.~, U(x,n~t) be- -the solution to (7.~ with e = 1, and a(~,n~t) be the semidiscrete

approximation error, then there exists a constant C1such that

(8. 7)

Proo~: Let U* be the element of Sh(Q) which interpolates U in the

sense of (2.18); and let ~ = u - U* and I = U* - U. Then a = ~+ E. - Thus (8.6) with V = E can be rewritten

s(~,E) + s(t,t) + (L~,t) + (LE,F) = 0

or

-- -- 1 -- 1--(E,E) = -(E,E) - s(LE,E) - s(LE,E) (8.8)

Note that (LI,t) can be expressed as an integral over the boundary.

38

Thus since the boundary conditions are homogeneous (LE,E) = o.Using this result and the Schwarz inequality, (8.8) becomes

II Ell 0 :. II If II 0 + } I ILE II 0

< IIE/lo + S.IIElilS

Now using the triangle inequality, we obtain (8.7).

The semidiscrete approximation error is established

through the following theorem.

Theorem 8.2. Let the hypotheses of Theorem 8.1 hold, then

/lcr(',nfit)llo:' 21IE(',nfit)llo + Clf~fitIIE(.,t)llldt (8.9)

Proof: This result follows from Theorem 8.1 by taking the inverse,

Laplace transform.

The semidiscrete approximation error can be evaluated

for finite element models by using the interpolation results of

section 2.

Theorem 8.3. Let the hypotheses of Theorem 8.1 hold. Then there

exist positive constants C and C such that2 3

II cr(.,nfit) II 0 < C hk+ 11u Ik+ 1 + C 3hk fn~ t Iu I dt- 2 0 k+l

(8.10)

Proof: This result can be obtained from (8.9) by using the inter-

polation result (2.18).

Note that the rate of convergence is determined by the

term involving hk. Thus the rate of convergence of the finite

element approximation for the convection equation 1S one power

10\ver than the rate of convergence for the finite element approx-

imation of,the diffusion equation.

The temporal approximation error is established by exactly

39

the same procedure as was used in section 4 and is given in

Theorem 4.3. We now cite the theorem establishing the total

error for the finite element-Galerkin approximation for the con-

vection equation.

Theorem 8.4. If the components of the error of the approximation

are given by (8.1-3), then there exist constants C , C , and C234

such that

II e ( • , nfi t) II < C h k + 11 u Io - 2 k+l + C hkfn~tlul dt

3 0 k+l

+ C fit II Uo II 04

(8. 11)

Proof: The result follows by using the triangle inequality 1n

conjunction with (8.10) and (4.43).

9. Error Estimates for the Convection Equation

Using L2 Methods

The L2 error estimates for the convection equation will be

developed using the technique introduced in section 6. Accordingly

we will use the space PC1[0,T;Sh(Q)] here. Reference should be

made to section 6 for its definition.

We consider the solution of the weak hyperbolic problem (7.3).

The exact solution to this problem has been characterized by Gold-

stein [27]. If the coefficients Aa in (7.2) are constants, then-

For convenience we set Aa equal to a constant for each a. We

note that the method of proof developed here is valid for a much

weaker specification of A~.

Now we seek an approximate Galerkin solution Un to (7.3) uS1ng

the finite difference-Galerkin approximation (7.6) with 8=1 (the

forward difference in time case). The solution of this problem is

40

fully equivalent to the solution of the following problem. Find

a U £ PC1[0,T;Sh(Q)] such that

(~~(t) ,V)t=nfit+ (LU(t),V)t=nfit = 0

(U ( • , 0) , V) 0 = (u 0 , V) 0

'if V £ PC1[O,T;Sh(Q)]

n = O, ... ,R

(9.1)

The notation indicates that a collocation is performed at t=n~t,

n=O, ... , R.

Now let Wet) be that element of PC1[0,T;Sh(Q)] which inter-

polates u in the sense of (Z.8). Then we define the error com-

ponents of the approximation by

e = u(t) - U(t)

El = Wet)

Ez = Wet)

U( t)

Un(9.2)

E = u(t) - Wet)

The following theorem relates the error components.

Theorem 9.1. If u(t) is the exact solution to (7.3), U(t) is the

function defined by (9.1), and Wet) is the interpolant of u(t)

defined by (2.8), and the error components are defined by (9.2),

then

nfit< t < (n+l)fit (9.3)

Proof: Subtracting (9.1) from (7.3)

(duCt) V) - (dUCt) V)t=n~t+ (Lu(t) V) - (LU(t) V)t=n~t = 0'. , 0 " ... ' 0 ' 0 ' 0

'if V £ PC1[O,T;Sh(Q)]

But

(a~(t) ,V)t=nfito CdU(t) V) for n~t < t < (n+l)fit

dt ' 0

41

Thus

(au (t) _ au (t) ,V) + (Lu (t) - Lun ,V) 0 = 0at at 0

~ V £ PC1[0,T;Sh(Q)]

Rewriting (9.4)

eel u (t) - aW (t) + aw ( t) - au ( t) V)at at at at' 0

+ (Lu(t) - LW(t) + LW(t) - LUn,V) = 0o

(9.4)

v V £ PC1[0,T;Sh(Q)] (9.5)

Equation (9.3) can be obtained by introducing (9.2) into (9.5).

The theorem governing the behavior of El can now be stated.

Theorem 9.2. Let the hypotheses of Theorem 9.1 hold, then there

exist positive constants 6, £, n, Cl' and Cz such that

1 .d.....IIE112 < ~llaEl12 + C211EI12 +2 dt 1 0 - 4n at 0 - 1

4£

+ £!..II au (t) (t _n~ t) II 248 at 1

n~t < t < (n+l)~t (9.6)

Proof: E - E = U(t) - Un = aU(t) (t-n6t) for nfit < t < (n+l) ~t.2 1 at

Now using the elementary inequality

(a,b):' ~Bllal12 + 811bl12

we find that

(L(E~-El),El) = (L(aU(t)(t-n~t),El), 0 at 0

< !.-IIL( aU(t] (t-nfit))112 + 811 E 112

- 46 at 0 1 0

< ~llaU(t)(t_n~t)112 + I3I1E 112

- 413 at 1 1 u

nfit< t < (n+l) fit

Similarly, using (9.7) we haveC

(LE,E) < 211EII2 + £IIE 1121 0 - 4£ 1 1 0

(9. 7)

(9 .8)

(9.9)

(9.10)

\I

42

and

-(~, E) < LII ~112 + nilE 112at 1 0 - 4n at 0 1 0

The term (LE ,E ) can be expressed in terms of an integral over1 1

the boundary. Thus since the boundary conditions are homogeneous,

(LE ,E ) = o. Introducing this resul t in conjunction wi th (9.3),1 1

(9.9), and (9.10) into (9.3), we obtain (9.6).

We are now ready to introduce the theorem which glves an

estimate for E at the discretization points in time.1

Theorem 9.3. Let the hypotheses of Theorem 9.1 hold, then there

exists a positive constant C such that6

sup liE (Mfit)II0< M< R 1 0

< c6[IIE (0)11 + IIEII + ~t Ilau(t)11 1]1 0 Hl(Hl) at L2(H)

(9.11)

Proof: Integrating (9.6) from t=n6t to t=(n+l)~t

liE ((n+l)~t)112 - liE (n~t)1121 0 1 0

< .L!(n+l)~tll~112dt + ~!(n+l)6tlIEI12dt- 2n n~t at 0 2£ n~t 1

(n+I)6t I 112+ 2(n+£+S) ! IE dtn6t 1 0

C (n+l)~t ()+ ~ ! II au t (t -nfi t) II 2 dt

26 n~t at 1

Now there exists a constant C such that I laU(t) I I <"t 1

for all t £ [0,M6t]. C is of the form 1 + f(h,fit).3

(9.12)

C IlauCt)113 at 1

Since the

function f(h,~t) causes only higher order terms in the error es-

timate, we do not include an explicit expression for it in our re-

sults. Use of this inequality in conjunction with the Schwarz

inequality gives the following result

43

j(n+l)~tll dUet) (t-n~t) 112dtn~t dt 1

< fit2 jCn+l)fitllaUCt)112dt- 3 n~t at 1

< C:fit2 j(n+l)~tllau(t)112dt (9.13)- n~t at 1

3Introducing (9.13) into (9.12), summing from n=O to n=M, and using

the embedding result II~II < C IIEIIH1( )' we find thatat LZ(LZ)- ~ L2

2

II E (Mfi t) II 2 < [II E ( 0) II 2 + C4 I lEI IH21 (L )

1 0 - 1 0 2n 2

+ SllEI12 + clC~~e Ildu(t) 112 ]2£ L2(H1) 6, Bat L2(H1)

+ 2(n+£+B) jMfitII E IIo2dt (9.14)o 1

Applying the Gronwall's inequality to (9.14) and using the embedding2

result 21lEI12l + SllEIIL

2(Lll)< CsIIEII1211(Lll)we obtain (9.11).2n H (L2) 2£ 2 r - 'I

The approximation error can be established through the following

theorem.

Theorem 9.4. Let the hypothesis of Theorem 9.1 hold, then

(9.15)

Proof: This proof is the same as the one given in Theorem 6.4.

We can now give the final error estimate which is based on the

interpolation results of Section 2.

Theorem 9.5. If the space ptl[O,T;Sh(Q)] satisfies conditions (i),

(ii), and (iii) in Section 2, then

IleIIL2(L2) ~ C71Ie(0) 110 + ClohklluIIH1(Hk+d

+ C fit IIau II7 at L 2 (Hl ) (9.15)

•44

Proof: The proof of this theorem is essentially the same as the

proof of Theorem 6.5. However, we should note that in this case

the spatial interpolation error is different. Using (2.18)

I IU-Qxul IH1(Hl) :. Cahk+l-llluIIHl(I-Ik+l)

CahkllullHl (Hk+l)

Thus for the convection problem the rate of convergence in the

spatial variable is one power lower than that obtained in the

diffusion problem.

Acknowledgement. Support of the U.S. Air Force Office of Scientific

Research under Contract F44-69-C-0124 to the University of Alabama

1n Huntsville is gratefully acknowledged. We also express thanks

to the Engineering Mechanics Division of the ASE-EM Department,

The University of Texas at Austin, for providing certain facilities

used during the course of the work reported here.

References

1. Aziz, A. K. (Editor), The Mathematical Foundations of theFinite Element Method with Appllcatlons to Part1al D1fferent1alEquations, Academic Press, N.Y., 1973.

2. Whiteman, J. (Editor), The Mathematics of Finite Elementswith Applications, Academic Press Ltd., London, 1973.

3. Strang, G. and Fix, G. An Analysis of the Finite ElementMethod, Prentice-Hall, Englewood Cliffs, N.J., 1973.

4. Oden, J. T., "Mathematical Aspects of Finite Element Approx-imations in Theoretical Mechanics," Mechanics Today - 1973, Edited byS. Nemmat-Nasser, Pergamon Press Ltd., Oxford, (to appear).

5. Aubin, J. P., Approximation of Elliptic Boundary-ValueProblems, Wiley Interscience, New York, 1972.

6. Hille, E., and Phillips, R., Functional Analysis and Semi-groups, Amer. Math. Soc. Colloq. Pub., Vol. 31, 1957.

4S

7. Butzer, P. L., and Berens, H., Semigroups of Operatorsand Approximations, Springer-Verlag, New York, 1967.

8. Peetre, J., and Thomee, V., "On the Rate of Convergencefor Discrete Initial-Value Problems," Mathematics of Scandinavia,21, 1967, pp. 159-176.

9. Widlund, o. B., "On the Rate of Convergence for ParabolicDifference Schemes, II," Communications on Pure and Applied Mathe-matics, 23, 1970, pp. 79-96.

11. Richtmyer, R., and Morton, K., Difference Methods forInitial-Value Problems, Interscience, 1967.

13. Oden, J. T., and Fost, R. B., "Convergence, Accuracy, andStability of Finite Element Approximations of a Class of Non-LinearHyperbolic Equations," International Journal for Numerical Methodsin Engineering, 6, 1973, pp. 357-365.

14. Douglas, J., and Dupont, T., "Galerkin Methods for Para-bolic Equations," SIAM Journal on Numerical Analysis, Vol. 7, No.4, 1970, pp. 575-626.

15. Dupont, T., "Some L2 Error Estimates for Parabolic GalerkinMethods," The Mathematical Foundations of the Finite Element Methodwith A lications to Partial Differential Equations, Ed. A. K. Aziz,

ca em1C ress, ew Yor , , pp. 2

16. Douglas, J., and Dupont, T., "Galerkin :Methods for ParabolicEquations with Nonlinear Boundary Conditions," Numerische Mathematik,20, 1973, pp. 213 -237 .

17. Varga, R. S., Functionalin Numerical Analhsis, Societymatics, Philadelp ia, 1971.

18. Wheeler, M. F., "A Priori LZ Error Estimates for GalerkinApproximations to Parabolic Partial Differential Equations," SIAMJournal on Numerical Analysis, Vol. 10, No.4, 1973, pp. 723-~

19. Thomee, V., "Spline Approximation and Difference Schemesfor the Heat Equation," The Mathematical Foundations of the FiniteElement Method with Application to Partial D1fferent1al EquatIons,A. K. Aziz (ed.), Academic Press, New York, 1972, pp. 711-746.

.'

46

20. Lions, J. L., and Magenes, E., Non-Homogeneous BoundaryValue Problems and Applications, Vol. I, Springer-Verlag, NewYork, 1972.

21. Cella, A., and Cecchi, M. M., "An Extended Theory for theFinite Element Method," Variational Methods in Engineering, Vol. T.,Ed. C. A. Brebbia and H. Tottenham, Southampton University Press,1973, pp. 74-84.

22. Strang, G., "Approximation in the Finite Element Method,"Numerische Mathematik, 19, 1972, pp. 81-98.

23. Ciarlet, P. G., and Raviart, P. A., "General Lagrangeand Hermite Interpolation in Rn with Applications to Finite ElementMethods,l1 Archives for Rational Mechanics and Analysis, 46, 1972,pp. 177-199.

24. Friedman, A., Partial Differential Equations, Holt, Rhine-hart, and Winston, 1969.

25. Schultz, M., Spline Analysis, Prentice-Hall, 1973.

26. Lees, M., "A Priori Estimates for the Solution of DifferenceApproximations to Parabolic Partial Differential Equations,l1 DukeMathematical Journal, 27, 1960, pp. 297-311.

27. Goldstein, J., semigrou§s of Operators and Abstract CauchyProblems, Tulane University, 197 .

28. Oden, J. T., Finite Elements of Nonlinear Continua, McGraw-Hill, 1972.

29. Bellman, R., Stability Theory of Differential Equations,McGraw-Hill, 1952.

.-

accuracy and convergence of finite-element/ …oden/dr._oden...with the exception of a number of...

Documents