c. elliott to cite this version - accueil · stories of undulator light ... finally it points out...

HAL Id: jpa-00222553https://hal.archives-ouvertes.fr/jpa-00222553

Submitted on 1 Jan 1983

HAL is a multi-disciplinary open accessarchive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come fromteaching and research institutions in France orabroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, estdestinée au dépôt et à la diffusion de documentsscientifiques de niveau recherche, publiés ou non,émanant des établissements d’enseignement et derecherche français ou étrangers, des laboratoirespublics ou privés.

STORIES OF UNDULATOR LIGHTC. Elliott

To cite this version:C. Elliott. STORIES OF UNDULATOR LIGHT. Journal de Physique Colloques, 1983, 44 (C1),pp.C1-255-C1-287. <10.1051/jphyscol:1983124>. <jpa-00222553>

https://hal.archives-ouvertes.fr/jpa-00222553

https://hal.archives-ouvertes.fr

JOURNAL DE PHYSIQUE

Colloque Cl, supplément au n°2, Tome 44, février 1983 page Cl-255

STORIES OF UNDULATOR LIGHT

C.J. E l l i o t t

Los Alamos National Laboratory, Los Alamos, lieu Mexico 87546, U.S.A.

Résumé.- Nous passons en revue les travaux effectués sur les lasers à électrons libres incluant : un exposé des principales idées destiné aux profanes; un récit historique des travaux accomplis au cours de ces dernières années; une caractérisation mathématique de divers projets; une formulation du problême mathématique comme problème non linéaire à point fixe; une description des techniques de calcul.

Abstract. - We review optical details of free-electron lasers including: a layman's account of many of the ideas; a historical account of work over the last two years; a mathematical characterization of various schemes; a formulation of the mathematical problem as a non-linear fixed-point problem; and a description of computational techniques.

A. Introduction.

The purpose of this paper is to summarize and categorize work which has

occurred over the last two years that treats the optical details of the Compton

regime free-electron laser (FEL). These details are important in assessments of

optical quality and optical-resonator design. By the term optical details I mean

the description of the dependence of the optical field on coordinates transverse to

the axis of the optical resonator. Virtually no discussion is given here on the

longitudinal dependence of the field (the pulse train structure).

The audience at which this paper is directed is primarily those who are

contemplating undertaking or utilizing such calculations, but this review is also

useful for those who simply want to know what is occurring in the field. Section B

should be understandable to a non—technical person who is interested in an overview

of the ideas as opposed to the implementation of them.

The remainder of this section gives a brief description of the parts of this

paper, and then it goes on to define the problem and concepts used. Section B

serves to translate the major ideas into non-technical terms; it can be skipped by

those well versed in the field. Section C reviews work on various physical models;

it starts with oversimplified models (the classic models) and goes on to describe

the work of the last two years (the 1981-1982 models) and finally gives a preview of

work to come. Section D examines these models from the point of view of the

underlying mathematical ideas; this section starts with a description of the flow of

Article published online by EDP Sciences and available at http://dx.doi.org/10.1051/jphyscol:1983124

http://www.edpsciences.org

http://dx.doi.org/10.1051/jphyscol:1983124

Cl-256 JOURNAL DE PHYSIQUE

ideas from Maxwell's equations to the actual computations; it then abstracts a

general non-linear description that allows the various techniques to be categorized

and points to new techniques; finally it points out that optical propagation in

cavities can be formulated as a non-linear fixed-point problem. Section E delves

into the digital operations necessary to carry out computations of optical beam

quality; it starts by looking at available computer power, and then it goes on to

describe ghost cell requirements that occur in finite grid techniques; finally it

looks at the various methods of handling the optical part of the problem and

concludes that the choice should often be based on convenience as opposed to

efficiency of the algorithm.

Mathematics is a language that enables us to state precisely what problem we

are addressing. We start with the vector potential,!, and the transverse driving

current J1 that is produced in the interaction region by interplay between the

radiation field, the static magnetic field, and the relativistic electron beam. The

wave equation, in the radiation gauge, is

2 1 a!? v2A - - =

- c2 at2 -'JoJ1

and is subject to reflecting boundary conditions at the mirrors, and radiative

boundary conditions,transversely, at infinity. The well studied empty cavity modes

are the solutions, for JL = 0 , to the eigenvalue problem that requires the optical

field on one mirror to be a complex-number multiple (the eigenvalue) of that same

field after it has propagated a round trip in the cavity. The loaded cavity mode is

the solution for finite JJ. The transverse current, .JI, can itself be thought of as

radiating into a superposition of the empty-cavity modes, and it is the feedback

enhancement of these empty cavity modes that determines their contribution to the

loaded-cavity mode.

It is only in low gain devices that empty-cavity modes form a good basis for

describing the loaded-cavity problem. (Note one often has strong signal even though

the gain per pass is low.) When the gain is high and the device is long, non-linear

effects similar to self-focusing can overcome diffractive spreading of the optical

beam and entrain the optical pulse in the electron beam. In these cases the

electron dynamics, that lead to lensing, and the optical propagation are tightly

coupled together.

In describing optical quality from an optics point of view, we can think in

terms of the index of refraction and the absorption or gain. Both the gain and the

index of refraction may be thought of as complex quantities and the relationship

between them is given here. Suppose we have a complex gain coefficient, gc. Then

we expect a free-space plane wave having z dependence, exp[i(w/c)z], to be modified

by the multiplicative factor, exp(gcz). The modified wave can be described by a

complex index of refraction,nc, where

Another convention is to use

or equivalently,

where Re and Im are the signed lengths of the real and imaginary parts respectively.

Although neither the complex conductivity nor the complex dielectric constant have

appeared in the literature on the optical quality of FELs, they too are simply

related concepts and might be preferred in future publications.

In the case of a low gain FEL, to understand the possibility of good optical

beam quality, it is useful to think in terms of the incident field, E (the incident field that we often think of in of an oscillator is that at the design point), and

the excited field k . If we look at the field far away from the interaction region

(ignoring the mirrors) the energy density is proportional to l + ~ 1 ' . With low gain,

the radiated intensity increment is the cross term 24*E, and if we think of the FEL

interaction region as a single entity this term is the (in-phase) stimulated

emission. The spatial dependence of this intensity-increment cross term is the

product of the spatial dependences of the excited wave and the incident wave. Thus,

even though the excited wave may radiate over a large spatial region, the intensity

increment associated with the stimulated emission may be found over a smaller region

because the incident wave is nearly a cavity mode whose far-field dimensions may be

considerably smaller than the excited wave.


The procedures for attacking the problem of describing the FEL optical details

are many. There are also a great variety of tastes as to the physical description.

The particle motion is described elsewhere, the starting point being a Lorentz force

equation, or a relativistic Hamiltonian. The starting point for optical details is

Maxwell's equations. These details are complicated enough without including the

time-dependent pulse structures; i.e. only one fourier component of the field is

studied. In these single frequency pulse cases all methods that have evolved are,

in principle, equivalent to each other and to the paraxial-wave-equation description

(see Section D). Of course, the numerical implementations which are the steps in

going from the paraxial wave equation to a discrete calculation differ according to

the representational grid and the propagation methods.

One final comment on terminology comes from David Deacon. In the United States

the term wiggler has often been used to describe the static magnetic field of the

free-electron laser interaction region. In Europe wiggler often has an additional

restriction: it is applied only to magnetic fields that are weak. To avoid

confusion we do not distinguish between strong and weak fields and refer only to

undulators.

B. Happenings Of Light In Servicing The Interaction Chamber: A Holistic Approach.

\ r e f ~ e c t i n ' ~ surfoce reflecting surface interact ion

chamber

Fig.1 - The optical wave bounces back and forth between the pair of reflecting mirrors. The wave of undulator light is produced originally by fast electrons that are pushed and pulled (wiggled) by the static magnetic field, the undulator. The fast electrons come from an accelerator that has increased the electrons' mass many fold above their rest mass. These electrons arrive in bursts that are separated by the time it takes light to make a round trip. The undulator light they produce, the excited wave, gets added to the incident wave to form the sum wave that gets reflected in the mirrors and becomes the new incident wave. The intensity of the undulator light grows until the burst of electrons can only give it the small fraction of its energy lost on reflection.

The device we call an EEL is shown in Eig.1. The excitation of the optical

wave begins slowly as the interaction begins, and gradually over-massed electrons

become trapped by the interplay of the optical wave and the static magnetic field

that wiggles the electrons back and forth in transverse fashion. These electrons

become excited and give off light. This process of stimulated emission is a source

of energy to the optical wave and causes the electrons to lose energy and reduce

their speed in accordance with the laws of Nature. This energy comes by way of the

excited wave that has taken the time and energy of electrons to help create. The

time it takes the electrons delays the peak intensity of the excited optical wave

and the resultant optical wave is then characterized by lethargy, a term that also

describes the enervated motion of the electrons that are still significantly

over-massed. Various schemes to enable the electrons to give more of their mass per

pass and to recover left-over energy of the electrons as they are brought to the

rest-mass condition are actively under experimental study.

The mathematical model has an intrinsic beauty when couched in the terms of the

powerful wave equation driven by well known Lorentz-gauge electron currents. The

waves that occur are of two types: permanent (or static) fields and naturally free

(or propagating) fields. The naturally-free fields that propagate in the EEL are

refered to as optical (light) waves and these are of two types. The incident

optical wave is curl-free everywhere, and the excited optical wave has a non-zero

curl as it is produced in the interaction chamber, but the excited wave loses its

curl when abandoned by the electrons. Upon summation to the incident wave and after

losing a little energy upon further reflection in several mirrors, it may be

regarded as the new-incident wave, more energetic than the last and ready for a go

around with another group of electrons that are well prepared to enter the

interaction chamber. Where does this lead you may ask? It is almost like a fairy

tale in which the wicked witch is punished by seven years of bad luck. Things are

not that grim; as the incident wave becomes more and more powerful it soon

aggregates much more energy than any single group of electrons can give it; it

eventually stops growing when the relatively small amount of energy lost in

reflection becomes as large as the amount supplied by the electrons.

You may wonder if after this explosive growth of the incident wave, whether it

is capable of settling down to a predictable stable existence or will it lose and

gain randomly, sometimes on the left side, sometimes on the right side, sometimes

here, and sometimes there? You will see that as the story unfolds it all depends on

the group of electrons with which the incident wave interacts. It also depends on

how the incident wave began in the dark. There are some unusual incident waves that

can interact with a number of electrons so large that it would drive most waves

chaotic. These have been named the strange attractors, and they exist on the

islands of stability, changing character as they roam the island but always staying

there. Between these isles the sea of chaos reigns. If even more electrons are

used in a particular reduced mathematical description, intended for electrons that

C 1-260 JOURNAL DE PIIYSIQUE

do not influence each other's motion, there comes a point where the domains of the

strange attractors no longer exist. This space of incident waves is completely

covered by a turbulent sea of chaos. Here the incident waves come and go in

unpredictable fashion. They might remind you of a strange attractor you had

previously encountered and then on the next pass they may resemble a completely

different strange attractor. Yet with enough electrons the behavior is completely

aperiodic and utterly wild.

The realm of fancy possibility is truly mind boggling, so we will describe the

more predictable aspects of the theory where the incident wave settles down to a

fixed shape. In describing these cases the entire space of wave functions is much

too extensive for our purposes. We are interested primarily in those optical waves

traveling through the interaction chamber in nearly the same direction as the

electrons. It is these waves that can grow at the expense of the electrons. The

interaction chamber can be thought of, for Platonic simplicity, as a narrow tube

that is long but finite in the direction the electrons are traveling. The chamber

is regarded as narrow because the electron beam is narrow; the chamber is an

artificial delimitation of space telling us where the over-massed electrons are

located when they interact with the optical fields. We can imagine many types of

electron beams: some fill the cylindrical chamber uniformly in the narrow

(tranverse) direction, some have more electrons at the center of the chamber, others

might be contained by a rectangular cylinder. Of course, all of these chambers are

invisible to the optical waves because the chambers are simply figments of our

imagination. Another aspect of the class of problems we address is that the mirrors

for the optical waves are at either end of the interaction chamber. If the optical

wave leaves through the narrow dimensions of the interaction chamber, we do not

allow it to reenter; we give it only one chance in the interaction chamber per pass.

By placing these restrictions, some progress can be made on a reduced mathematical

description, but even this problem is more difficult than that currently attempted.

The mathematical idealizations that have been attempted so far do not describe

actual physical situations, but rather they extract a subset of features of the

physical problem that they describe faithfully. The remainder of the features are

described in a distorted fashion that may even be unnatural in the physical world.

An analogy might be the study of the flight of an airplane by means of a physical

model of only the wings. Clearly a restricted set of questions may be asked of this

model problem; e.g., questions of lift are expected to be reasonably described;

questions of tire wear are not addressed; and questions relating to drag are only

partially answered.

The questions we may ask involve nouns and verbs. We have already become

acquainted with the elements of the FEZ. problem: the electrons, the incident wave,

the excited wave, and the permanent field. Each of these elements have attributes

that may be described by nouns such as the length and breadth of each of the

elements. It is the wave equation that tells us which nouns are important. It

relates how each of the waves vary in space and time at a particular point and time,

with the electron-current value. The electron current depends on the instantaneous

position, speed, and direction of all the electrons in the immediate vicinity of the

point under examination in the wave equation. Given an initial description of the

nouns, the wave equation describes what happens.

The nouns that appear in the wave equation are the electron current and

derivatives of the gradients in the size of the optical fields in each of three

directions, whereas the verb is the rate of change in the field with time. Given

the nouns that describe the element of the problem, the wave equation provides the

verbs. The wave equation may be thought of as the weaver of stories about how

optical pulses behave in the presence of electron currents, and it has many tales to

tell.

Not only must we pay attention to the wave equation story teller, we must also

simultaneously listen to other tales that are woven by the particle equations of

motion. For the class of problems we are describing, the electrons do not alter

each others motion significantly, but rather move according to the single permanent

magnetic field and the propagating fields in which they are immersed. Our situation

is somewhat like watching an opera where the stories of all the characters

(electrons) are evolving simultaneously as the orchestra plays the music (incident

optical field), and the characters add their own arias (excited optical field)

hopefully in harmony with the orchestra. But rather than the situation in most

musical operas where the characters influence each other strongly, in our optical

opera the characters (electrons) react only to the musical ambiance (the optical

waves) of the moment, and disregard what the other electrons are doing except

through their emitted waves (arias). In most of the work we will describe, the

incident wave (orchestra) dominates over the excited wave (that was emitted by the

electrons). An exception is the work of Prosnitz, Doss, Haas, and Gelinas [I] where

the incident wave (orchestra) starts the process off and then is soon out done by

the excited wave; at that time the orchestra (incident wave) simply cannot be heard

and might as well stop playing. The opera analogy describes the type of interaction

between the waves and particles, but it fails to describe the spatial distribution

and motion of the waves and particles.

The stories of the wave equation that I have selected to tell you have several

things in common. First they all start out in a similar fashion with the incident

optical wave entering the interaction chamber from the left, apparently in a hurry

with a speed called c. The electrons also enter from the left, simultaneously with

the wave, traveling almost as fast. Although the electrons had been pushed a great

deal, the faster they went the more mass they obtained, and the more mass they'had,

the harder it was to make them go faster; such was their lot: if they started out up

front on the wave soon they would slip toward its rear.

C1-262 JOURNAL DE PHYSIQUE

This slippage along with careful timing is the key to successful EEL action by

electrons that do work on (for) the optical field. The electron's work depends on

the electrons being able to move in a direction opposite to that of the optical

field. Their motion depends on the presence of the static wave, the magnetic field,

that perturbs the electrons and sets them in a spin or causes them to totter as if

inebriated. Ideal timing by the most efficient electron occurs if that electron

totters to the left when the optical field is pointed to the right and is at her

peak positive value. Then at the moment the electron has slipped back toward the

rear of the optical wave where the wave is most negative and is pointed to the left,

our efficient electron is tottering to the right, always opposing the field.

Electrons that do work lose energy and slow down so that they slip at an even

greater rate toward the rear of the optical wave. The rate of electron energy loss

depends on where the electrons are in the interaction region. As they progress

through the interaction region, the static magnetic field is designed to cause the

electrons to totter back and forth even faster to compensate for the increased rate

at which the optical wave changes its sign from the greater slippage rate. These

magnetic fields are called tappered because they gradually change their period to

accommodate to the reduction of speed of the efficient electrons.

The other common aspect of the stories is an almost timeless quality. The

stories we describe are actually not those of the wave equation itself. We employ a

model that makes the story repetitious and permits the optical wave to have only a

single color (monochromatic). Our model is simpler than the,actual storyteller

(wave equation), and it is a matter of speculation whether the model will ever be

close to the seasoned teller, that is, will the full wave equation exhibit solutions

that the simple model can describe in most of their important details?

We don't place sole blame on our apprentice (model) storyteller of the wave

equation for possible inconsistencies because the electron storyteller is not

genuine either. The story of the electrons also repeats itself after the same short

interval, and who is to say whether a periodic set of electrons drove the optical

wave periodic or vice versa. As suggested above it is not clear that the true

raconteurs could ever have gotten into this recurrent mess. Either the model for

the wave equation or the tyro particle teller said to the other, "If you will do it

again, I will too." And since it was a lot easier to do so, they went off and did

it, forgetting the boundary conditions specified by law. The legal problem is that

a consequence of this cabal led to an infinitely long pulse and an infinite amount

of energy. Now it can be argued that perhaps a very long pulse of finite energy

will not vary in peak amplitude (i.e. it would be monochromatic or periodic)

halfway through, but that raises start-up and shut-down questions. A partial

reassurance to assuage our annoyance is that physicists do not regard similar

transgressions in scattering theory as serious, although there the situation does

not involve as many interesting beams. When physicists discuss the infinite, it is

a most difficult topic.

Several of the stories continue, as we recounted earlier, with reflections in

the mirrors. We might like to think of these as additional stories about thin

slices, one optical period long, taken from very long beams that travel nearly

forwards (or upon reflection sometimes backwards). The electron beam can be

described as a thin slice of the interaction chamber crossection; it moves from the

beginning of the chamber to the end at the speed of light, slightly faster than the

electrons within. The electrons that slip out the back end are replaced by

electrons with identical transverse location, speed and direction of travel, at the

front end of the slice, so our repetitious story teller says. (There are other

important details of the electron motion we don't have time to recount.) Similarly

the optical slice is one wavelength long, with speed c, but because the substitute

story teller insists on a monotone, the description of the optical waves in the

optical slice is considerably simpler than the description of the electrons in the

spatially coincident electron slice.

The incident wave is sinusoidal as shown in Fig.2. In this three dimensional

representation, the height represents the amplitude of the wave, left to right

represents the propagation direction, and back into the plane of the figure and out

Fig.2 - A single period of a Gaussian optical beam. Left to right is the propagation direction. Away and towards us represents the radial distance from the center line of the optical system; r-0 occurs halfway back. The height is the amplitude of the optical field. If we look along the edge of a single stripe, the up and down motion we see is sinusoidal. The peak height on stripes occurs further to the right (are advanced) as we move out from the center plane and this indicates that the focal point is to the right.

C1-264 JOURNAL DE PHYSIQUE

of the plane represent a transverse dimension. As we look into and out of this

plane at the similar parts of the wave, they appear to be more advanced the further

we go away from the center plane (the center plane has directions of up and down and

left to right) of the wave. Also as the incident wave propagates it moves forward

or backward in the slice, but still keeps its overall sinusoidal shape in the

direction (axial) of propagation. By this, I mean that at the same transverse

position (as we slide along a stripe) the wave goes up and down in the same way it

does at another transverse position, except for possible shifts or delays. The only

data we need to know to describe the information of this wave everywhere in the cell

(slice) is the height to which it rises (as we slide along a stripe) and its

retardation (an advance is negative retardation); these quantities are also known as

the amplitude and phase of the wave. We now shift from the context of the space of

Fig.2 where height is the amplitude of the wave to a space where the height is the

other transverse direction of real space. The retardation or phase is measured by

the distance from a fixed transverse reference plane in the cell such as the length

from the end plane (containing the ridge at the right) of the slice to the location

of the peak of the wave. From each point (transverse location) on the reference

plane there will be a different measurement of the height and retardation or delay,

and we may think of these quantities as being stored on this transverse plane, two

values for each point. We could imagine a chess board as the transverse plane and

in each square two numbers one for the height and one for the delay would be written

at the center of the square. We might want to subdivide the squares finer and write

smaller so we could get a better description of the wave. In a computer, the chess

board numbers may be stored, two numbers for each square, and given a big enough

memory you may make the squares as fine as you wish.

The excited wave when added to the incident wave is known as the sum wave.

Both the sum wave and the excited wave are sinusoidal but have heights and

retardations that are usually different from the incident wave. We can either think

of there being two separate waves, the excited wave and the incident wave, or we can

think of just one, the sum. For some problems, one of these two representations is

more convenient. To obtain the sum height at a point in the cell we take the sum of

the two separate heights at that point and then we repeat the process for every

point in the cell. It is a matter of simple trigonometry to show that the sum of

two shifted sinusoidal waves with different amplitudes give another sinusoidal wave,

whose amplitude and retardation are algebraically related to those of waves added.

In the chess board description, this same algebraic formula could be used to do the

height and retardation representation of the sum. How and under what conditions the

excited wave generated by the electrons is actually sinusoidal is a story to be told

another time.

The full numerical description of the electrons requires more than one

chessboard, it requires many chessboards discretely placed at various axial

locations or phases as in three dimensional chess (or the Inner Circle game).

Coders have used between 30 and 500 stacked chessboards for special over-simplified

calculations in which the chessboard has only one square, and computations for

many-square chess boards are now in their infancy. By counting the many chess

boards, we can see that the description of the electrons is considerably more

involved than the single board description of the optical waves. A simplified model

for the electrons reduces the computation time considerably by requiring only two

numbers (rather than many pairs) on each square. One of the numbers on a chess

board square is the phase of the electron and the other is its energy (or mass or

alternately momentum). Another oversimplification is the selection of one chess

board, the resonant-phase board, to represent many electron chessboards. The less

efficient electrons on this chess board slip at a slower rate than the more

efficient ones. As the chess board for the electrons moves through the chamber,

each square has a different slippage so that the chess board disassembles into 64

(or more) separate squares, but since we have written the phase on the square, we

may think of the chess board as being reassembled and the axial position (phase) at

which each square would be located as being coded information stored on the

individual squares, should anyone ever want to actually see the real disassembled

locations.

In this resonant phase model the numerical calculation can be viewed as a two

board game. If the outcome depends on who (the electrons or the incident wave) goes

first then the game does not correspond to the physical problem. We want to take

small enough axial steps so we will describe the problem smoothly. The situation is

similar to film making where LO frames per second results in jerky motions, and 24

frames per second appears natural. For large steps the motions appear jerky, with

small enough steps we describe small changes in the motion and very little happens

in one step, so it makes little difference who starts. Although the picture of the

electrons and the sum wave alternately taking a turn (small step) is somewhat over

simplified, it contain the idea of how the story is told.

When the electrons move, they move according rules that depend on the sum-wave

chess board, and when the sum wave moves the rules depend on the electron board. It

might seem this game would be considerably harder than chess, but that is not so.

We are not taxing the power of present day computer in playing this two board game

(or even the many board game). We are not using game theory to guide the plays. It

is essentially different than chess in that there is no freedom of choice in the

moves: our story is automatically woven. Who would believe electrons had free

choice in the free electron laser? For our classical calculations God does not play

dice, but we wonder if there will ever be a Solon or Moses for electrons, or will

the stories remain just so.

JOURNAL DE PHYSIQUE

C. History of the Models.

This section summarizes work done on transverse optical-detail models for FELs

utilizing large-mass (relativistic) electrons. Here a description of what work has

been done is given without emphasis on conclusions or speculations. Much of the

language used is aimed at those who have had a good grounding in classical

electrodynamics [2] and who are familiar with the variable-parameter FEL

calculations [3] that do not describe transverse optical details. If you get bogged

down, going to the sources will be helpful.

1. The Classic Models.

We may obtain a fuzzy picture, see Fig.3, of how the optical wave travels

through the interaction chamber by balancing the loss of energy to the electrons

with a gain in the optical energy and by introducing a filling factor. We think of

this optical wave as being flat (a plane wave). After moving the electrons a small

step through the interaction region and computing the average energy lost by the

electrons, we may compute the total energy lost from all electrons with

multiplication by the total number of electrons. Then the volume of the optical

pulse together with the energy density (energy per unit volume) of the incident wave

permits us to compute the energy density of the new-incident wave. If we think of

the electron number density as being fixed, it follows that the total energy lost by

the electrons is proportional to the size of the volume containing the electrons.

Because this source of optical energy is fixed, the increase in optical energy

density varies inversely with the optical volume. Thus, the optical energy denpity

increment is proportional to the electron-beam volume and inversely proportional to

the optical-beam volume. The ratio of these two volumes is known as the filling

/ electron volume

pt ical ol u me

Fig.3 - The energy balance model takes the filling factor to be the ratio of the electron volume to the optical volume.

factor, one number that in the theory replaces two numbers, the separate volumes of

the beams, and the filling factor is the only geometrical concept in this simple

model.

The above energy balance technique is elementary but incomplete; it leaves many

questions about the evolution of the optical pulse unanswered by providing only one

piece of information. Is it as if we have applied a projection operator to the wave

equation that permits it to ask only one question in question space? (See Fig.4.) In

satisfying this one question, it is clear that other dimensions of the description

are missing: e.g., how is the energy distributed? Do we keep the same energy per

unit volume and use a larger volume, or do we keep the volume the same and increase

the energy density? Do we concentrate the energy in globs or do we distribute it

uniformly?

To project answers to these other questions more information needs to be

extracted from the wave equation. Better physical pictures are achieved by more and

more elaborate models that answer more and more questions. Beyond the energy

balance approach comes a geometric optics treatment. By use of rays parallel to the

center axis of the interaction region (going left to right), we may arrive at a

filling factor description that also gives us the phase retardation of the optical

wave. By use of Wigner distributions or other means that incorporate non-parallel

rays we can describe optical beams that shrink and grow (and mock diffraction) as

they pass through the chamber. With this approach the concept of the filling factor

can be defined at the center of the chamber where we expect the optical volume to be

nearly minimum (and the optical energy density, a maximum). Alternately, an average

filling factor could be defined.

,,...THIS POINT IS THE ' E N E M BALANCE

QUESTION.

Fig.4 - Question space showing energy balance as a central question in one portion of the space.


The optical cavity can be modeled as a device that takes the sum wave, projects

out a lowest order Gaussian mode, and multiplies that @art by a reflection

coefficient. This suggests a model in which the excited wave is replaced by that

part of the excited wave radiated into the fundamental Gaussian mode, or

equivalently, by producing an operator that projects out that portion of the

electron current radiating into such a mode. This approach can be handled in the

context of the paraxial wave equation and is similar to work published by Moore and

Scully (MS) [4] in 1980, more recent work of Colson and Elleaume [5], and work

reported at Santa Barbara in March 1982 by Tang and Sprangle.

2. The 1981 - 1982 Models. The rest of the work described here brings us up to date with the next level of

abstraction that uses the paraxial wave equation to treat non-Gaussian, transverse

variations. These treatments do not remove the monochromatic (periodic) requirement

nor do they treat many slices of time. They do, however, use realistic boundary

conditions at the mirrors, for oscillator (cavity) problems. Questions involving

the impact of lethargy or the effects from electrons slipping a substantial portion

of the optical beam on optical beam quality are ignored.

The focus is on solving the paraxial wave equation by digital calculations on

computers. Although similar work has been done for conventional laser systems, not

everyone interested in this level of detail is familiar with that work. Prior

methods of solving the partial differential equation, the paraxial wave equation,

feature finite differentials 16-81, Gauss-Laguerre polynomials [9,10], Gauss-Hermite

polynomials [10,11], Fresnel-Kirchhoff integration [12], quasi-fast Hankel transform

(QFHT) [13-151, and fast fourier transform (FFT)[16-181. In application to our

laser, all of these methods incorporate production of the excited wave with

distribution by carrier transport in a left-to-right or right-to-left fashion, much

like an automobile carrier transporting newly made sedans on a freeway.

The latest innovations in electron engineering in these models include tapered

magnetic (mag.) fields and bucket traps designed for a single electron.

Discriminating consumers who are looking for a vehicle to transport a family of

electrons will have to wait because these models are still in the test stage.

A prior innovation by Sprangle and Tang (ST) [19,20], in which they sacrificed

no generality, resulted in an integral Lagrangian formalism. Although this model

was highly original, its deployment required knowledge of the entire futures of

electron families, futures that depended on the predictions of the model. For

specialized uses where the excited wave had little impact on the electrons, and

where the electrons followed a known Gaussian with a fixed radial width, the ST

model predicted lensing effects that distorted the appearance of the (sum) optical

wave. These effects were also pointed out by others [3]. Although the ST model

crammed all the electrons into the bucket trap, rather than just those that would

have been caught with an adibatic model, results (see Fig.5) stirred the

imaginations of the designers who have worked with the 81-82 models.

cos yR=0.6 f PHASE (rad)

Fig.5 - Phase of a Gaussian electron source of excited waves at various distances from the beginning of the interaction region. The phase may be thought of as a retardation of the wave in time or space.

The inherent difficulties of the ST model were sidestepped by Slater, Lowenthal

and Quimby (SLQ) [21] who turned to a partial differential approach with finite grid

implementation late in 1980. This transverse grid effectively traveled back and

forth through the interaction region and reflected from the mirrors until a steady

state successfully ensued. It enables evaluation of tradeoffs due to inability to

match the optical field to the magnetic field at all radii. SLQ also reported a

comprehensive design of the FEL chamber that included tradeoffs between optical beam

quality, magnetic field strength, and magnetic field inhomogenities. On the one

hand the magnetic field strength was to be made as intense as possible, by

decreasing the separation between permanent magnets, but on the other hand an

aperture that is too small will destroy optical beam quality and gives unacceptably

large transverse variations of the magnetic field in the interaction chamber.

Figure 6 shows a map of optimum optical beam size and electron beam radius for

various interaction chamber lengths for 10 pm wavelength light (The optimum optical

beam size decreases for shorter wavelengths.)

Additional work by Quimby and Slater(QS) in 1982 [22] included graphical

presentation of their results in a cleafly written paper that was a delight to read.

They utilize a criterion that requires taking an electron out of its bucket for good

(detrapping without retrapping) when its contribution falls off due to too much

resonant phase shift. They also use many radial lanes (grid points) and describe

the effect of the magnet aperturing that produces the sharp protuberances shown in

JOURNAL DE PHYSIQUE

PHOTON BEAM

- E " - "l - n w

100

WIGGLER LENGTH ( m )

~ig.6 - Design map of electron-beam radius and optical-beam radius at the waist (minimum) and end (maximum) for lOpm light.

Fig.7 - Calculations of Quimby and Slater showing pulse intensity as a function of radius and axial location.

Fig.7. These self-consistent calculation use apodized mirrors to remove numerical

instabilities.

The SLQ results were a stimulus that led ' Elliott [ 2 3 ] to undertake a mode

expansion along the lines suggested by MS, but where the resonator modes in the

formalism are chosen to satisfy sharp-edged mirror boundary conditions. Cavity

modes are similar to the modes of a string instrument: both are enhanced by the

feedback from the resonator. The strong enhancement of the dominant mode and the

lesser enhancement of the higher order cavity modes can be looked at as a relative

suppression of these higher modes. This comparison can be made graphically as shown

in Fig.8. Here the inverse of the length of the line segment, drawn from the

location of the squared (round-trip) eigenvalue for a particular mode to the

operating point, represented by the triangle, is a measure of the resonance

enhancement of that mode. The other aspect of the problem is the extent to which

each of the cavity modes are produced, were the resonator not present. The

analogous question for the instrument is: what sounds are produced by the bare

string? Elliott's treatment of the mode source demonstrates the near independence of

mode quality upon electron beam size, but in its present form fails to adequately

treat the longitudinal dependence. In particular Elliott's model could be easily

extended to show a difference in mode composition of pure-gain and pure-refraction

cases described by QS.

This theory expressed in the language of empty cavity modes, was

extraordinarily useful in that the empty cavity properties appear naturally and give

rise to a separate factor, the cavity suppression ratio, in determining the overall

suppression of higher empty-cavity modes. It answered questions related to the

effects on the mode selectivity due to cavity reflectivity or other cavity-quality

Fig.8 - Graphical determination of the suppression by the cavity. The square of the cavity eigenvalue A are plotted as complex numbers. The cavity phase is-chosen so that the triangular ftoint, the point on the circle, is closest to A~'. The suppression -of the q'th mode is the ratio of the length of the line drawn from the triangle to Ao2 to the length of that drawn from the triangle to .lo2.

factors. It also pointed to a new method of mode selectivity when modes have

different phase shifts from the dominant mode, even though these modes may have

little loss to begin with. It also appears to be the only work so far that has

checked to see that empty cavity properties are in agreement with published values,

and it extends the number of reported eigenvalues and should be a useful source for

checking other codes. This work also confirms and extends SLQ results that good

optical beam quality is achievable.

Prosnitz, Hass, Doss, and Gelinas (PHDG) [ l ] carried out work conceptually

similar to SLQ, but in an amplifier configuration (rather than an optical

resonator). They assessed the ability of effects similar to self-focussing to

overcome diffraction and to confine the optical beam to small radii over long

pathlengths and large gains. Rather than select a single electron in each bucket,

they use an average electron whose behavior represents a composite behavior of all

electrons. They also describe special problems that test their code for its optical

propagation and its ability to reproduce the electron description calculated in the

absence of diffraction.

PHDG describe two calculations for a proposed amplifier. The first calculation

matches the optical field to the magnetic field on axis and results in premature

focussing on axis. The second calculation matches the fields off axis at the

estimated radius of synchronism and shows that detrimental focusing can be avoided,

see Fig.9. The sensitivity of the performance of this device to the design-radius

parameter raises questions of how reproducible these results will be due to

shot-to-shot variations in the electron beam under actual experimental conditions.

Mani, Korf, and Blimmel (MKB) 1241 attacked the loaded cavity problem in a new

way. They started with the Fresnel-Kirchhoff integral method concocted by Fox and

Li (121; to this they added descriptions of gain and phase that served the role of

the wave equation sources. This procedure may be bitter for new initiates; I myself

acquired a taste for it only after finding out that indeed the wave equation (story

teller) was satisfied with this particular mix (Appendix A). In their first

attempts at putting the theory together MKB did not follow their own prescription:

t'wit, they skimped on the ingredients but still obtained a whole, natural product

as seen in Fig.10. Note that the draped surface (100% gain) is quite different than

those calculated at lower gain. The assumptions MKB use in calculating the field,

however, break down at 100% gain.

Because of computer time requirements, MKB have made several simplifications in

the physics they utilize. For one, they presume the gain is a separable function of

x and y, the rectangular, transverse coordinates. They also use Gauss-Hermite free-

space wave functions instead of prolate-spheroid wave functions to compute the loss

of the minors for use as an eigenvalue of the empty cavity. They also presume the

gain is small. We refer to treating the physics in full as the extended MKR method.

MKR also point out they could formulate the extended MKB method in cylindrical

coordinates. The resulting computation time would be reasonable.

Fig.9 - Calculations by PHDG with fields matched off axis. It shows that as the pulse continues to move toward the output port it gains energy instead of focusing in a sharp peak.

Fig.10 - MKB calculations of optical field on a rectangular grid at the center of the interaction region. Gains are left to right: I%, lo%, and 100%. The electron-beam waist radius is equal to the optical-beam waist radius and the cavity is confocal.

3. Sneak Previews of 1983.

Elias and Gallardo (EG) at the '82 Santa Barbara Conference turned to an electron-

oriented method where they add up the Lignard-Wiechert fields of the individual

electrons on their field-counting machine. Exactly how this marvelous machine works

is still inadvertently a trade secret. What we do know is that it has described

C l - 2 7 4 JOURNAL DE PtlYSIQUE

fourier components of the optical fields for amplifier configurations in terms of

Gauss-Laguerre functions. We may begin to appreciate hov such machine may work by

regarding it as a special case of the retarded-time potential technique.

Colson and Richardson (CR), and Hiddleston, Colson and Richardson (HCR) have

reported progress at the '82 Santa Rarbara Conference with a rectangular-coordinate

fast-fourier-transform (FFT) propagation scheme. They have attained a technique

they describe as unitary (perhaps energy conserving in free space) and efficient

(for calculations in rectangular geometries with right-angle-polygon mirrors). CR

have used a parabolic profile for the electrons and have calculated mode patterns

for off-axis electron beams as well as the usual on-axis type.

Coffee, Lax, and Elliott(CLE) ( 2 5 1 describe propagation using a new

implementation of the well known fourier-Bessel technique applied to the paraxial

wave equation. In this CLE algorithm, the Hankel transforms are evaluated using

Gauss-Laguerre-integration (radial) locations and weights rather than by the quasi

fast Hankel transform (QFHT), the analogue of FFT, because of inefficiencies and

instabilities found in QFHT implementation. The finite mirror size was not included

in any of the multipass calculations, but I expect work by Elliott and Skokowski

will be incorporated in '83 to remedy this situation.

A. Bhowmik and R. Cover reported calculations for stable hole-coupled

resonators at a Rockwell International meeting in August 1982. The work is now

preliminary and uses simplified FEL models with a single gain sheet in a QFHT code.

D. Mathematical Ideas.

The mathematical formulation of the pulse propagation problem naturally divides

into two parts. The first part of the problem is the description of the

amplification process as the pulse moves through the interaction region. The second

part includes the cavity or resonator properties; it is concerned with what happens

to the pulse after it leaves the interaction region, up until it reenters the

interaction region; it describes the effect of the mirrors and apertures of the

system.

A summary of the theoretical work that has been done on the transverse

dependence (see below) is summarized in Fig.11. In principle all approaches began

with Maxwell's equations written in the wave equation form. The wave equation is a

partial differential equation having a source (differential source) that describes

the effect of the electrons or currents in incrementing the wave. Going from

Maxwell's equations to the paraxial wave equation has been discussed by Lax,

Louisell, and McKnight (261. The wave equation can also be written in an equivalent

form, as an integral over a relativistic volume source [ 2 ] , and this can be

expressed in terms of the well known retarded time potential. The retarded time

potential can be specialized to the case of a single electron in motion and gives

rise to the well known Lidnard-Wiechert potential for that electron. Less well

Fig.11 - Summary of theoretical work on optical quality showing logical flow from Maxwell's equations to various computations. Dashed lines represent work that either was not done or has not been reported.

known is the fact that the retarded time formalism, by using a number of paraxial

assumptions, can directly lead to the integral form utilized by Tang and Sprangle

[ 1 9 ] . They arrived at their integral form by way of the well known paraxial wave

equation.

This equation has many names, for instance, the parabolic wave equation, the

Gaussian wave equation, the quasi-optical wave equation, the envelope wave equation.

depending upon the emphasis one wishes to place on particular properties or uses.

The term paraxial emphasizes the physical requirement that the rays described by the

equation are all nearly traveling in the same (axial) direction. The term envelope

suggests that the modulation of the envelope must be slow compared to the carrier

frequency. All of these names refer to the following equation written for the

vector potential envelope,As,

JOURNAL DE PHYSIQUE

where ks= ws/c, and us is the carrier frequency, and V? is the transverse Laplacian.

The third major way the wave equation has been utilized is through the

free-space Fresnel-Kirchhoff propagator [12,24]. Although the results only apply to

free-space propagation, the formulation leads to an interpretation of the

propagator, and it is easy on a conceptual basis to generalize the interpretation to

include modification to rays by the effects of gain and phase from the source. This

method has been described as ad hoc phase and gain in Fig.11, but it has been shown

equivalent to the paraxial wave equation in Appendix A.

The three major ways the wave equation has been attacked are all closely

related to the paraxial wave equation. Although the details of the Lienard-Wiechert

potential method have not been made known, it is known that it may lead to a

Gauss-Laguerre formalism. Presumably this formalism is identical to that arrived at

by expressing the paraxial wave equation in cylindrical coordinates and finding its

free-space modes.

The paraxial wave equation expressed in cylindrical coordinates gives rise to a

(r,t) description, where r is the radial distance from the optical axis and t is a

time parameter describing the evolution of the pulse. The radial dependence can

arise due to rotational symmetry about the optical axis; the fact that we do not

have a z dependence comes about from taking a single z slice of the pulse and

ignoring coupling of the various z slices. The rectangular coordinate description

(x,y,t) involves breaking the transverse grid up into rectangles or squares rather

than rings.

There are three major ways the calculations can be implemented in either

rectangular or cylindrical coordinates. The first way is by finite differencing the

partial differential equations [6-81. The second way is to fourier transform

[13-181 (or Hankel transform in cylindrical coordinates). The third way is to

expand the solution in a series of functions having transverse dependence and

meeting boundary conditions in the transverse direction [9-11,251. In this later

category fall the boxes labeled Gauss-Laguerre, hybrid, partial waves, and

Gauss-Hermite. The finite difference boxes are labeled by A = > ; and the transform

methods are EFT and QFHT (the hybrid method may also be regarded as a transform

method).

As seen above there are many ways to formulate the general solution to the

paraxial wave equation . The initial conditions are the specification of the field

everywhere upon a plane. This may be done by specifying the field at a select N

points that become dense everywhere as N+m. Alternately we may choose N basis

functions and specify N coefficients. In either of these approaches we require the

additional physical condition that the field modulus be square integrable over the

initial plane; other than that we are free to choose the field at will.

From a mathematical point of view all of the representations are equivalent,

i.e. may be transformed back and forth into each other as N becomes infinite.

However some mathematical ideas can be expresses easier in particular

representations.

We can simplify the description of the solution to the paraxial wave equation

by choosing a representation in which free space propagation through any distance is

described by the idenity operation. The free-space mode coefficients are the

components of a vector in this representation, anc they don't change in traversing

free space.

We write the vector of coefficients for the incident field as E , and we

describe the generated, excited wave that gets added to the incident field as i. The value of ; is found by propagating the field and electrons through the

interaction region (the two board game described earlier) and depends on the

incident field in a non-linear way, so that we write :(E), to indicate the

functional relationship. The sum wave that leaves the interaction region is E+A(E),

and this wave reflects from two mirrors in the cavity whose total effect is

described by the linear operator M, a matrix. If we denote the new incident field

by E' to distinguish it from the old, we then have

as the description of our problem.

The second part of the problem involves specifying what the cavity or mirrors

do. We may categorize each of the schemes in terms of their mathematical

idealization. The most common scheme is shown as scheme number 1 in Table I. This

scheme takes the field vector on the n+l'th pass to be given by adding the excited

wave on the n'th pass to the incident wave and by multiplying by the mirror matrix.

Indeed this method follows the physics of the problem but is by no means the fastest

way to arrive at an equilibrium field vector, Em, assuming it exists. Scheme number

2 shown in the table is a relaxation technique, commonly used in numerical

algorithms, which takes the field on the n+l'th pass to be a weighted average of

that on the n'th pass and that predicted by scheme number 1. Here f is a free

parameter that is adjusted to achieve best convergence. These first two schemes are

explicit schemes because the field on the n+l'th pass depends directly on that of

the n'th pass.

The second group of schemes utilize the new value (n+l'th pass) of the field on

the right hand side of the equation and these are known as implicit schemes which

require solution of linear equations for their implementation. Scheme number three

is the IlKB [ 2 4 ] method (generalized to arbitrary order). That scheme utilizes the

JOUKNAL DE PHYSIQUE

Table I. I I I

Characterization Non-Linear Approach I 1 (driven by) I Algorithm Explicit

Schemes i.7 Like 1 but with relaxation

Schemes / 4 ( (old currents) ( Fn+l = M[E~+I + !(En) 1

1

En+l = f~. + (I - ~)M[E, + g ~ ~ ) I*

Implicit

Successive passes (old field)

3

Like 4 but

Error length I 2 Optimization/ 6 1 minimization h=min{l~-~[~+_h(E)]I }

E ~ + ~ = M[E, + @ " ) . I

"+l ' MCIEn+l 4- !(En) +

5

* f adjusted for best convergence t non-unity A is equivalent to A=l and a new reflectance

A times the old value because M is proportional to reflectance

= G(E)E

(old gain)

old (n'th pass) field values to determine the complex gain function matrix G; given

the gain an elgenvalue problem is solved for the field that would be self-consistent

were G the right value, The fourth scheme is similar to that used by Elliott [ 2 3 ]

and takes the currents to remain the fixed quantity based upon the old field. These

last two schemes have the advantage that when the current or gain is small, the

natural modes of the empty cavity appear. Scheme number 5 is similar to scheme

number 4, except that a linear correction term has been applied to ; the next order

taylor series term has been added making the expression for implicit itself. This

method and scheme number 6 in its usual implementation involve evaluation of xi(!,). A rule of thumb is that six place precision is required in evaluation of in order

to make gradient methods practical; this kind of precision is rarely seen in

calculations of FEL properties. Scheme number six is an optimization approach; it

is quite efficient in some implementations in that very few evaluations of ?(E) need

to be made.

t = +,+I + G(En)En+~ 1

higher order (En+l - En) -... VA(gn) 1

The introduction of the mathematical notation above has led to the suggestion

of several new schemes for achieving convergence of the mode problem, but it can

also be applied to advantage in characterizing the transverse mode problem as a

non-linear fixed-point problem. To do this we simply note that if an equilibrium

solution exists by numerical scheme number 1, we may write the problem as

We are also interested in the question of stability of this solution, and this

aspect can be studied by introduction of the error vector,

With this definition we may examine local stability in the neighborhood of a fixed

point Em by the taylor series expansion,

From this equation we see that the eigenvalues of !f(E,) determine stability. If

all the eigenvalues have modulus less than unity, then any error vector we may

construct as linear combinations of the eigenvectors will get smaller in modulus on

each successive pass.

What we have done here is to make a connection between a physical problem, the

free electron laser mode quality, and a well studied mathematical problem, the

non-linear fixed-point problem 1271. (A simple extension of the above arguments

also shows that (x,y,z,t) or (z,t) or (r,z,t) problems can be cast in a similar

fashion.) This fixed-point problem has recieved alot of attention and suggests a

general scenerio that has already been observed in the (z,t) problem. One usually

looks for a parameter of the physical problem that represents the drive of the

system. In the FEL problem the electron density or current may play that role. At

sufficiently low values of the driving parameter, the fixed point problem is

observed to exhibit stable solutions. As the driving parameter is increased, there

often comes a point at which the non-linear fixed point problem has no solutions.

What happens instead is that successive applications of f lead to limit cycle

behavior; i.e. the field vector varies between two solutions. Application of f to

one gives the other and vice versa. This switching between one solution and the

other is also known as a bifurcation of the solution. As the value of the parameter

increases further bifurcations can occur, and eventually at very high levels of the

JOURNAI. DE PHYSIQUE

parameter, the field vector will never reproduce itself. This kind of behavior has

not been verified numerically yet in the (r,t) mode quality schemes.

E. Digital Operations.

Given coders clever enough, And computers with the power of Hermes, We can calculate the world.

paraphrase of Archimedes

To show how quickly machines are gaining power, I have plotted the computer

power of the fastest digital calculator at the Los Alamos National Laboratory up to

the year 1980. The power is measured by the number of machine instructions per

second, where a floating point operation was arbitrarily defined to be three

instructions. These data are from Jack Worlton of the Computer Division; the points

for 1983 come from Chris Barnes also of Los Alamos, and the point labeled NSCP is

the projection of the National Super Computer Project in Japan. The graph (Fig.12)

shows that the trend for computers is to increase their power by a factor of ten

every four years, and the trend seems to be persisting into the future although

there is some disagreement about exactly how to extrapolate these data and what size

the error bars are.

Fig.12 - Computing power over the years at. Los Alamos. Computer growth and projections are consistent with a straight line fit that indicates computers increase their power a factor of ten in a little over four years.

10' - A&gu;tiig - lo0 1 I I I

1940 1950 1960 1970 1980 1990 Year

Those doing optical beam quality calculations will want to use the available

computer power or collaborate with those who have it. On the other hand, personal

computers may be useful for simple models, for building and polishing codes, and for

determining relevant physics. Computer codes are getting larger and more complex,

and rarely is it possible to present graphically all possible outcomes of a code.

We must pick and choose output diagnostics and input quantities in our numerical

experiment much the same way an experimentalist chooses the conditions he will study

and determines what measurements he will make. Our paraphrase of Archimedes

suggests that almost anything is computable, but in practice we require art in being

able to extract the relevant features of a problem and to attack those alone. The

skill of contriving a simple model having most of the physical features of the

problem is similar to the skill of an experimentalist who isolates a particular

physical effect from the influence of many others. Ideally the goal of

experimenting is to understand what is occuring and to be able to anticipate what

the results will be. It will be increasingly important for those running codes to

plan ahead and determine strategies to achieve that goal. It will also be important

to ensure that all the required skills in doing the numerical experiments are

assembled together in a working team.

Implementation of codes requires attention to the numerical algorithms in order

to avoid the introduction of spurious (non-physical) features. First we will

I I RAY PATH WITH! /

E M S 1 6 R I D I \ ' I I

/ /

/

,~$G,;ho' REFLECTION ! ORIGINAL GRID - 1 T;LATED 1

REAPPERANCE PER'ODlC OH O R I G I N A L A b C ~ ~ ~ ~ , ! , ~

I I I ORlGlNAl

GRlD

Fig. 13 - Ghost grid requirements. If the upper and lower ghost grids were not present, the boundary conditions on the original grid would result in spurious reflection or translation of rays.

JOURNAL DE PHYSIQUE

discuss the requirement of ghost cells in finite grid problems. Then assuming these

requirements are understood, we give a brief discussion of various algorithms.

The requirement of ghost cells arises from non-physical boundary conditions.

In Fig.13 we can see the idea of a criterion that can be applied a posteriori.

Suppose we have performed a calculation on an original grid and find that the

calculated pulse has a structure size of dimensions ds. (These arguments become

more complicated when focusing elements are present.) This structure will diffract

through an angle 8=X/ds, where X is the optical wavelength. In the case shown in

the figure, the structure is at the center of the original grid and it will diffract

at angle 8 and will be spread out to a width z8 after traveling a distance z. If the

spread is greater than the width of the original grid, additional grid points called

ghost cells are needed. Without ghost cells, the ray will impinge on the boundary

of the original grid. In a finite differencing code, the ray will reflect on the

original grid and change direction unlike the. real problem which has infinite

boundaries. With periodic boundaries in FFT algorithms, rather than reflecting on

the original grid, the ray reappears on the opposite boundary (suffering a

translation) but also in an unnatural fashion. The extent to which violation of the

ghost cell requirement will impact precision depends of course on the amplitude and

dimension of the diffracting structure relative to the rest of the pulse. Also,

depending upon the requirements, one may wish to have higher precision and more

confidence in the answer; in that case he might consider a new definition of the

defracting angle that is several times the above value.

Ghost cell requirements occur for FFT and finite differencing methods but not

for methods using a set of basis functions that meet the boundary conditions at m.

For finite difference methods B.R. Suydam has found that changing the boundary

conditions from reflecting to totally absorbing was not possible. However, Don

Prosnitz has suggested that the use of a grid with points that become spaced further

apart as the distance from the optical axis increases, ameliorates the ghost cell

requirements. In the EFT method the grid is fixed and ghost cells must be carefully

planned for.

The FFT technique is attractive because of its speed and the availability of

packaked routines that have been thoroughly tested. For a rectangular mesh of N by

N cells, the number of operations in the FFT is proportional to N~ log2N in going

one step. This few operations is sometimes an advantage, but there are several

things to keep in mind before deciding upon this algorithm. First ghost cells may

make N larger than in other methods. Second there may be symmetry present that can

not easily be taken advantage of in FFT. As an example, for a problem with

cylindrical symmetry it roughly requires N~ FFT points to do what can be done with N

points on a radial grid. Third sharp edges of mirrors are difficult to properly

model. Fourth another method may be more accessible and convenient to use. In

spite of these considerations, FFT should be given serious attention for problems

that are natural to rectangular coordinates.

Although the finite difference method has more flexibility in its grid, it is

somewhat more restrictive than EFT for free space propagation. With EFT one may

propagate the field arbitrarily far and get the same answer as if he took many

smaller steps, a definite advantage in problems with substantial free space

propagation. In the finite difference method an additional criterion occurs that

guarentees the solution will be sufficiently precise if the step size is small

enough. The criterion simply guarentees that the variation in the z (propagation)

direction should be small in a single step. For a smallest structure distance, ds,

in the transverse direction, the variation in the z direction occurs over a distance

that doubles the structure size i.e. z(X/ds) = ds or z = (ds)'/l. For a nearly

Gaussian mode 1281, this distance over which the area of the beam waist doubles is

known as the Rayleigh range or (twice this length) as the diffraction length.

Frequently, the criterion is that there are many steps in a Rayleigh range. Since a

FEL often has a length nearly equal the Rayleigh range, the criterion simply states

that many steps need to be taken in going through the interaction region.

In addition to the above criterion, the effect of gain and phase must give only

a small change to the field. For small gain FELs now in vogue this criterion

presents less of an obstacle than the free space criterion. When electron dynamics

are being calculated along with the optical propagation a step size is chosen in z

that describes the motion properly, and often it is the electrons that determine the

actual z step. In this case the interaction-region z step for the finite

differencing technique could be the same as that for FFT. Assuming a tridiagonal

scheme such as that used by Rensch 161, or Suydam [7], the number of operations

required for finite differencing of N radial grid points is roughly 3N per z step

and is actually less than an EFT technique (QFHT) would be for this problem.

The last technique we describe is to use free space modes as a set of basis

functions for the field. Like FFT there is no problem in taking arbitrarly large

steps in free space, and better yet there are no ghost cell requirements, but in the

interaction region the field must be evaluated at the locations of the electrons.

If there are N electron locations and N basis functions, that evaluation requires

2~~ numerical operations. Likewise, translation of the optical source at the N

electron positions into a source of mode coefficients also requires 2~~ operations.

(Another way to approach this problem is to work with grid quantities, but then

propagation requires a matrix multiplication having 2~' operations.) At first this

N' requirement might seem to rule out this technique, but upon closer inspection it

turns out for FEL problems that the number of basis functions in this technique is

considerably smaller than the number of radial grid points needed in finite

differencing. For instance, in the work reported by Elliott[Z3]only 20 basis

functions were required to get good eigenvalues for sharp-edged mirrors, whereas QS

was implemented with 200 radial cells, and in the QS technique the sharp-edged

mirrors had to be replaced with mirrors having a gradual fall-off of reflectivity.


Another method closely related to that of free space modes is to use basis

functions that have the proper transverse boundary conditions. The requirements for

this approach include the condition that little variation in the field occurs in one

step, but as before with finite differencing this requirement is usually met

naturally in practice.

One final comment is that when we use groups of electrons at the same radial

locations at which the optical field is evaluated, the time required to do the

optical part of the problem is dwarfed by the time to push the electrons. Here

optical techniques should be chosen for their convenience rather than the number of

digital operations needed for their implementation.

Acknowledgements. - The author would like to thank: D. Quimby and J. M. Slater

for an advanced copy of their work, D. Deacon for inviting the topic C. Brau for

permitting the work to be done, John Goldstein and Paul Skokowski for suggested

improvements to the manuscript, Paula Adams and Joyce Schuster for assistance in

graphics, and Sarah Martinez for assistance in word processing.

APPENDIX A

We may see the practical equivalence of the MKB technique to the paraxial wave

equation by a simple argument. MKB relies upon the intuitive notion that with a

complex-gain coefficient, g+i6k E gc, which may be calculated, propagation is

described by,

where FP is the Fresnel propagator moving free space waves from the point x' where

Uo is evaluated, to the point 5 , where U is evaluated; here the integral over ds is

carried out along the straight line joining 5' to 5 . (Note that there appears to be

an inconsistency in the sign MKB use for 6k; their g-ibk convention is consistent

with exp(-ikz) waves whereas their propagator is consistent with exp(ikz) waves; we

are using the latter convention.) At the point 2 , FP is given by,

In the intuitive statement of the problem, the complex-gain integral occurs over the

same ray (straight line) that joins x to 5'; the statement is: To compute the

propagated field in the presence of an active medium that gives rise to a

complex-gain coefficient, we calculate the propagated field in the absence of gain

and multiply the result by the amplification of that ray (the amplification being

the complex exponential term appearing in Eq.Al). This statement is more

complicated than it might first appear in that the gain coefficient along a ray may

depend on the value of the field at all points along that ray; i.e., the field may

need to be calculated at all intermediate points.

To make the connection between MKB and the paraxial wave equation we need to

show that Eq.Al correctly includes the medium effects and does the free space

propagation. First we will verify that Eq.A2 satisfies the paraxial wave equation

and the boundary condition that as z+z', FP is the idenity operator. We proceed by

fourier transforming Eq.A2 from x and y to p and q. To achieve the interchange of

the order of integration we take w+w(l+i~+) and note that in this limit the new

inner integral converges. We complete squares in x and then y and note for Im(a) > 0, and (exp 18)'~~ = exp (i8/2),

00

I exp(iax2)dx = (in~a)'/~ . (A3) 4

We thus arrive at the tranformed equation:

where TV is V(x,y,z) fourier transformed in the first two variables. This

expression shows that the propagator in Eq.A2 meets the boundary conditions at z=z'

(note the difference with MKB), and it is an easy matter to show it satisfies the

transformed paraxial equation in free space,

where in real space p2=-a/ax2 and q2=-a/ay2.

The active medium effects are even simpler to establish. We develop the

partial differential equation for U by using values as given by Eq.Al at z and z+Az,

starting with Uo at z=0. We will remain at the same x,y point in forming the

partial derivative with respect to z, and will only indicate the axial location,

v.i.2. the z value. We subtract the following two equations,


where we have further approximated the ray length by its projection on the z-axis;

this approximation is a paraxial assumption (MKB already have used a paraxial

approximation in using FP). By using the inverse of the first equation to

substitute out Uo in the difference equation we have,

Now in the limit that Az+0, the first term on the right side of Eq.A5 must give the

free-space terms, i.e. the transverse Laplacian terms, and the second gives the

gain term we were trying to establish, i.e.

_ = transverse Laplacian terms + u gc . a (A9)

The MKB method is thus seen to be practically identical to the paraxial wave

equation approach. MKB appears slightly more general in that they use the

ray-length in the gain expression rather than its projection onto the z-axis.

However, this is the only one of several uses of the paraxial approximation they

have eliminated, and it is not clear whether the others are more or less important.

In practice MKB also assume that the gain is small, making the above distinctions

unimportant for their application. Practically speaking the MKB approach is the

same as the paraxial wave equation.

References.

[l] Prosnitz(D.), Haas(R.A.), Doss(S.) and Gelinas(R.J.), Physics of Quan. Electronics, 1982, 9, 1047.

(21 Jackson(J.D.) , ~lassical Electrodynamics 2nd Edition, John Wiley and Sons Inc., New York, 1975.

[3] Kroll(N.M.), Morton(P.L.) and Rosenbluth(M.N.), Physics of Quantum Electronics, 1980, 7, 89. A more elegant version appears as Jason Technical Report, JSR-79-51, 1980.

[4] Moore(G.T.) and Scully(M.O.), Phys. Rev., 1980, g, 2000. [5] Colson(W.B.) and Elleaume(P.), Appl. Phys., 1982, E, 1. [6] Rensch(D.B.), Appl. Optics, 1974, 12, 2546. [7] Suydam(B.R.), LA-5607-MS, Los Alamos National Lab, Los Alamos NM, 1974. [8] Goldstein(J.G.) and Dickman(D.O.), LA-6955, Los Alamos National Lab, Los Alamos

NN, 1978. [9] Goubau(G.) and Schwering(P.), IRE Trans. on Antennas and Prop., 1961, 2, 248.

[lo] Kogelnik(H.) and Li(T.), Proc. IEEE, 1966, 54, 1312. [ll] Siegman(A.E.) and Sziklas(E.A.), Appl. Optics, 1974, 12, 2775. [12] Fox(A.G.) and Li(T.), Bell. Sys. Tech. J., 1961, 40, 489. See also

Eqs.(8.2.1) and (8.3.20) in Born(M.) and Wolf(E.), ~ r i n a ~ l e s of Optics, Pergamon, New York, 1965.

[13] Siegman(A.E.), Optics Lett., 1977, 1, 13. . [14] Agrawal(G.P.) and Lax(M.), Optics Lett., 1981, 6, 171. [15] Latham(W.P.) jr and Salvi(T.C.), Optics Lett., 1q80, 5, 219.

(161 Sziklas(E.A.) and Siegman(A.E.), Appl. Opt., 1975, 14, 1874. [17] Fleck(J.A.)jr, Morris(J.R.) and Feit(M.D.), Appl. Phys., 1976, 10, 129. (181 Goldstein(J.G.), LA-6833-MS, Los Alamos National Lab, Los Alamos NM, 1977. [19] Sprangle(P.) and Tang(C.-M.), Appl. Phys. Lett., 1981, 2, 677. Also: AIAA

Journal, 1981, 2, 1164. [20] Tang(C.-M.) and Sprangle(P.), Physics of Quantum Electronics, 1982, 9, 627.

Also: Proc. of the International Conference on Lasers '80, Dec. 15-19, 1980, Collins(C.B.) Editor, STS Press, McLean VA, 1981.

(211 Slater(J.M.), Quimby(D.) and Lowenthal(D.), Proc. of the International Conference on Lasers '80, Dec. 15-19, 1980, Collins(C.B.) Editor, STS Press, McLean VA, 1981. Also: Slater(J.M.) and Lowenthal(D.D.), 3. Appl. Phys., 1981, 52, 44.

(221 Quimby(D.C.) and Slater(J.M.), TEEE J. Quant. Elect., to appear 1982. [23] Elliott(C.J.), Physics of Quantum Electronics, 1982, 8, 531. (241 Mani(S.A.), Korff(D.A.) and Blimmel(J.), Physics of quantum Electronics, 1982,

9, 557. [251 ~ofTee(~.~.), Lax(M.) and Elliott(C.J.), IEEE J. Quant. Elect., to appear

1982. (261 Lax(M.), Louisell(W.H.) and McKnight(W.B.), Phys. Rev. A, 1975, 2, 1365. [27] Feigenbaum(M.J.), Los Alamos Science, 1980, 1, 4. [28] Yariv(A.), Quantum Electronics 2nd Edition, John Wiley and Sons Inc., New York,

1975, Chapter 6.6.

c. elliott to cite this version - accueil · stories of undulator light ... finally it points out...

Documents