pattern recognition problems in geology and paleontology

Lecture Notes in Earth Sciences

Edited by Gerald M. Friedman and Adolf Seilacher

2

UIf Bayer

Pattern Recognition Problems in Geology and Paleontology

Springer-Verlag Berlin Heidelberg New York Tokyo

Author Dr. UIf Bayer Instltut f(3r Geologie und Pal~ontologie der Unlversit~t T0bingen S~gwartstr. 10, D-7400 TfJbmgen, FRG

ISBN 3-540-13983-4 Spnnger-Verlag Berlin Heidelberg New York Tokyo ISBN 0-387-13983-4 Sprmger-Verlag New York Heidelberg Berlin Tokyo

This work is subject to copyright All rights are reserved, whether the whole or part of the material ~s concerned, specifically those of translation, repnntmg, re-use of illustrations, broadcasting, reproduction by photocopying machme or similar means, and storage in data banks Under § 54 of the German Copyright Law where cop~es are made for other than prtvate use, a fee ~s payable to "Verwertungsgesellschaft Wort", Munich

© by Sprmger-Verlag Berlin Heidelberg 1985 Printed in Germany

Printing and binding' Beltz Offsetdruck, Hemsbach/Bergstr. 2132/3140-543210

To

Dorothee

Julia and Vincent

P r e f a c e

The research on m a t h e m a t i c a l methods and compute r

appl icat ions in geology since 1977 was supported by the "Sonder-

forschungsbere ich 53, PalOkologie" T{ibingen, d i rec ted by A.

Sei lacher . During the years, several "Tei lprojekte" were in-

volved: "Konstrukt ionsmorphologie , Fossi ldiagenese, Fossi lver-

gesel lschaf tungen, Foss i l -Lagers ta t t en" . During the last period

of the "Sonderforscbungsbereich" a special pro jec t "Quan t i t a t ive

Methoden der PalOkologie" was establ ished: Chapte rs 1 to 3

serve as a final repor t of the sc ien t i f ic ac t iv i t ies . Fur the r

in format ion is avai lable in the repor t s of the "Sonderforschungs-

bere ich 53".

The ideas on the seismic record in chap te r 4 arose during

ac t iv i t i e s on Leg 71 of the DSDP-program in 1980, and I am

indebted for valuable discussions to W. Gtl t t inger , G. Dangel-

mayr, D. Armbrus te r , H. Eikenmeier of the "Ins t i tu t far Infor-

mat ionsverarbe i tung" , Tt~bingen.

During the years, a considerable number of people was

engaged somewhere in the research ac t iv i t ies . Here I want

to express my special thanks to E. A l the imer and W, Deutschle ,

which were ac t ive in programming problems during several

years.

Ttlbingen Ulf Bayer

C O N T E N T

1. INTRODUCTION

1.1 Ma thema t i ca l Geology and Algor i thmiza t ion

1.2 Syntax and Semant ics

1.3 Stabi l i ty

2. NOISY SYSTEMS AND FOLDED MAPS

2.1 Recons t ruc t ion of Sed iment -Accumula t ion

2.1.1 Accumula t ion Ra tes and Deformat ions of the Time-Scale

2.1.2 Es t ima t ion of Original Sediment Thickness

2.1.3 Underconsol idat ion of Sediments - - a History Ef fec t

2.2 In t raspec i f ic Variabi l i ty of Paleontological Species

2.2.1 Al lomet r i c Relat ionships

2.2.2 The ~Ontogenetic Morphospace ~

2.2.3 Discont inui t ies in the Observed Morphospace

2.3 Analysis of Direc t ional Data

2.3.1 The Smoothing Error in Two Dimensions

2.3.2 Stabi l i ty of Local Ex t rema

2.3.3 Approximat ion and Averaging of Data

2.3.4 A Topological Excursus

2.3.5 Densi t ies , Folds and the Gauss Map

2.4 Recons t ruc t ion of Surfaces from Sca t t e r ed Da ta

2.4.1 The Regular Grid

2.4.2 Global and Local Ext rapola t ions

2.4.3 Linear In terpola t ion by Minimal Convex Polygons

2.4.4 Stabi l i ty Problems with Minimal Convex Polygons

2.4.5 Cont inuat ion of a Local Approximat ion A) A local cont inuous approximat ion B) Cont inua t ion of a local sur face approximat ion

3, NEARLY CHAOTIC BEHAVIOR ON FINITE POINT SETS

3.1 I t e r a t ed Maps

3,1,1 The Logist ic Di f fe rence Equat ion

3,1.2 The Numerical Approximat ion of a Pa r t i a l Di f fe ren t ia l Equat ion

3.1.3 Inf ini te Series of Caust ics

3.2 Chi2-Tes t ing of Direc t ional Da ta

8

10

i1

12

16

19

21

23

26

29

30

35

40

45

46

51

51

54

56

57

62 62 65

70

72

73

76

79

82

Vt

3.3 Problems with Sampling S t ra teg ies in Sedimentology 86

3.3.1 Markov Chains in Sedimentology 86 A) Disc re te Signals 88 B) Equal In terval Sampling 89

3.3.2 Art i f ic ia l P a t t e r n Format ion in S t ra t ig raph ic Pseudo-Time Series 92 A) Sampling of periodic funct ions 92 B) The analysis of 'bed th ickness ' by equal d is tance samples 95

3.4 Cent ro id Clus te r S t ra t eg ies - - Chaos on Fin i te Point Sets 98

3.4.1 Binary Trees 99

3.4.2 Image Concepts t00

3.4.3 Stabi l i ty Problems with Cent ro id Cluster ing 103 Cent ro id c lus te r s t r a t eg ies 104 Ins tabi l i t ies be tween c lus te rs t06 I n s t a b i l i t i e s w i t h i n c l u s t e r s 106

3.5 Tree P a t t e r n s be tween Chaos and Order 112

3.5.1 Topological Proper t i es of Open Network P a t t e r n s 115

3.5.2 P a t t e r n Genera to r s for Open Networks 120 A) Algebra ic models -- pro to types of branching pa t t e rn s 120 B) A met r i c model -- the Honda t r ee 123

3.5.3 Morphology of Branches in Honda Trees 127 A) Length of b ranches 127 B) Branching angles -- s imi lar i ty and se l f - s imi la r i ty 129 C) Branches and b i furca t ions - - a quasi-cont inuous approximat ion 131

3.5.4 Evolution of Shape 134 A) Trees, Peano and Jordan curves 135 B) The out l ine of Honda t rees 138 C) Chance and de te rmin ism 141

4. STRUCTURAL STABLE PATTERNS AND ELEMENTARY CATASTROPHES 144

4.1 Image Recogni t ion of Three-Dimensional Objec ts 146

4,1,1 The Two-Dimensional Image of Three-Dimensional Objec ts 147

4.1.2 The Skeleton of P lane Figures 151

4.1.3 Theore t ica l Morphology of Worm-Like Objec ts 153

4.1.4 Continuous Trans format ions of Form 155

4.2 Surface Inversions in the Seismic Record -- the Cusp and Swallowtail Ca t a s t rophes 157

4.2.1 Compute r Simulat ions of Rays, Wave Fronts and Trave l t ime Records 160

Linear rays 160 Successive wave f ronts 161 Trave l t ime record 163

4.2.2 Local Surface Approximat ion 165

4.2.3 Linear Rays, Caust ics and the Cusp Ca ta s t rophe 167

4.2.4 Wave Fronts and the Swallowtail Ca tas t rophe 172

4.2.5 Wave Front Evolution and the Trave l t ime Record 174

4.2.6 The Trave l t ime Record as a Plane Map 176

VII

4.2.7 Singular i t ies on the Ref l ec to r Line

4.2.8 Genera l i zed Ref lec t ion P a t t e r n s in Two and Three Dimensions A) The deformed c i rc le and the dual cusp B) Three-d imens ional p a t t e r n s -- the double cusp

4.2.9 Dis t r ibu ted Rece ive r s

4.3 "Para l le l Systems" in Geology

4.3.1 Some Examples of Para l le l Systems

4.3.2 Similar and Paral le l Folds

4.3.3 Bending a t Fold Hinges - - the Hyperbolic Umbil ic

4.3.4 Nota t ion of S t ra in

4.3.5 Genera l ized Plane Strain in Layered Media

4.4 SUMMARY

REFERENCES

179

183 183 189

191

198

199

201

206

210

211

214

217

INDEX 226

1. I N T R O D U C T I O N

Theoret ical modelling and the use of mathemat ical methods are presently gaining

in importance since progress in both geology and mathematics offers new possibilities

to combine both fields. Most geological problems are inherently geometr ical and morpholog-

i c a l , and, therefore, amenable to a classification of forms from a "Gestalt point of

view". Geometr ical objects have to possess an inherent stability in order to preserve

their essential quality under slight deformations. Otherwise, we could hardly conceive

of them or describe them, and today's observation would not reproduce yesterday's result

(DANGELMAYR & GOTTINGER, 1982). This principle has become known as ' s t ructural

s tabil i ty ' (THOM, 1975), i.e. the persistence of a phenomenon under all allowed perturba-

tions. Stability is also, of course, an assumption of classical Newtonian physics, which

is essentially the theory of various kinds of smooth behavior (POSTON &STEWART, 1978).

However, things sometimes "jump". A new species with a different morphology appears

suddenly in the paleontological record (EI.DREDGE & GOULD, 1972), a fault develops,

a landslide moves, a computer program becomes unstable with a certain data configura-

tion, e tc . It is, surprisingly, the topological approach which permits the study of a broad

range of such phenomena in a coherent manner (POSTON &STEWART, 1978; LU, 1976;

STEWART, 1982). The universal singularit ies and bifurcation processes derived from the

concept of structural stabiIity determine the spontaneous formation of quali tat ively similar

spat io-temporal s tructures in systems of various geneses exhibiting cri t ical behavior

( DANGELMAYR & GI~TTINGER, 1982; THOM, 1975; POSTON & STEWART, 1978; GI21T -

TINGER & EIKEMEIER, t979; STEWART, 1981). In addition, this return to a ' g eome t r i z a -

tion of p h e n o m e n a " - - af ter decades of a lgo r i t hmiza t i on - - comes much closer to the

geologist 's intuitive geometr ic reasoning. It is the aim of this study to elucidate, by

examples, how the quali tat ive geometr ical approach allows one to classify forms and

to control the behavior of complex computer algorithms.

1.1 MATHEMATICAL GEOLOGY AND ALGORITHMIZATION

The geometr ical approach dominated the "mathematizat ion" of geology until recently

the computer "changed the world". As VISTELIUS (1976) summarized in his discussion

of mathemat ica l geology:

"the restoration (or 18th century) of axiomatic ideas is due to maturation

of @eoloEical sciences .... The more mature the geological ideas in the problem

are, the more the mathematical tool is determined by the Ecological meanin E

of the problem. Less mature geological problems make it necessary to introduce

more routine mathematical means with restricted foundation to form geology".

Here, two types of "ma thema t i ca l applicat ion" occur: The m a t h e m a t i c a l a t t e m p t

to "model" a speci f ic o b j e c t - - the classical method of theore t i ca l physics which can

lead to the formulat ion of physical l a w s - - and "rout ine m a t h e m a t i c a l means" which

can most ly be t r ans l a t ed as " s ta t i s t i ca l methods". And, case s tudies by compute r are

widely viewed as the " m a t h e m a t i c a l methods". Usually, descr ip t ive s t a t i s t i ca l resul t s

are t r e a t ed like "physical laws", a s i tuat ion s trongly cr i t ic ized by THOM (1979). He

gave the following reasons why the tool of m a t h e m a t i c s looses i ts s t reng th as one goes

down the scale of sciences:

"... the first is that those sciences which do not have as efficient tools

at hand as physical laws would like to be like physics and try to appear in

the eyes of other people as precise as physics. Every science wants to become

mathematized because it believes that way it would be put on the same footin E

as fundamental physics .... The second, internal reason now works in the re-

verse sense: Inasmuch as a given science does not allow for precise mathemati-

zation it opens practically indefinite workin E possibilities to scientists

in that field, because they can make models of all kinds, with approximations,

statistical hypotheses, and so on, and there is practically no limit to the

possibility of buildin E models in situations which actually do not allow for

specific, exact quantitative models .... And the third reason, of course,

is the computer industry's lobby: Every laboratory wants to have its own com-

puter workin E even in situations where a priori there is no reason to believe

that you can extract any kind of useful information out of the things you

have put into the computer. "

It was not Thom's aim to b lame those sciences which are not as precise as phys-

ics. R a t h e r this was d i rec ted against the degradat ion of the m a t h e m a t i c a l t o o l - - may

be the reason is tha t topologists like Thom "want q u a l i t i e s - - though these somet imes

acquire a fearsomely a lgebraic , even numerical , expression" (POSTON & STEWART, 1978).

Of special in t e res t is Thorn's second argument , the indef ini te working possibili t ies. It

is always a very s tr iking exper ience in applying "computer methods" tha t some of these

methods allow for various and con t rad ic to ry in te rp re ta t ions of the same d a t a - - and,

fu r the rmore , tha t some methods can even be inf luenced by the ordering of the input

data . Such observat ions were the s ta r t ing point to analyze the qual i ta t ive behavior of

propagated a lgor i thms in geology.

In geology and paleontology s t a t i s t i ca l and approximat ion methods are general ly

used as s t r a t eg ies of p a t t e r n recogni t ion. A densi ty dis t r ibut ion is e s t ima ted from a

sample, a sur face is r econs t ruc t ed from s c a t t e r e d da ta points, per iodici ty pa t t e rns of

profi les are analyzed by means of s t a t i s t i ca l t ime series analysis, e tc . Al te rna t ive ly ,

data are sorted, grouped and classified by using factor analysis, cluster or discriminant

analysis, and so forth. These are the fields where "routine mathemat ica l methods" domi-

nate, and it is the field where the computer allows one to analyze everything without

regard to any a priori scient i f ic meaning and without the formulation of a scient i f ic

hypothesis. During several years of work with the computer, and implementing computer

programs at the 'Sonderforschungsbereich 53, PalOkologie -- University Tt~bingen', it

was a challenge to accept that ra ther sophist icated pat tern-recogni t ion programs may

become ra ther unstable if some initial conditions, e.g. the input data, do not sat isfy

the proper conditions, and that it is, in general, not known what the "proper conditions"

are. On the other hand, such computer work allowed me to collect and to analyze exam-

ples of instable procedures and problems of in terpreta t ion. A collection of such examples

is presented here together with the 'qual i tat ive ' analysis of instabilities.

1.2 SYNTAX AND SEMANTICS

Af te r decades of algorithmization in science the computer provides a valuable

and indispensable tool. Much work has been invested in computer science to find rules

for the verif icat ion of program correctness . The idea is to solve the programming problem

"by decomposing the overall problem into precisely specified

subproblems and then verifying that if each subproblem is solved

correctly, and if the solutions are fitted together in a speci-

fied way, then the original problem will be solved correctly"

(ALAGACIC & ARBIB, 1978}.

Thus, it seems not very difficuIt to construct "correct" programs -- as far as the

syntax is concerned (WIRTH, 1972). The other problem, however, is a semant ic one:

The meaning of a computer output is not defined -- no ma t t e r how cor rec t the syntax

may be -- until the meaning of the input is defined and until the input is consistent

with the operations within the algorithm. In the same sense, the formulation of a program

is usually not only a syntac t ic problem, as in most cases semantics is initially involved to

some extent . The problem, however, is not res t r ic ted to computer applications in a narrow

sense: It occurs whenever "formulas" are applied to data. In addition to the "correctness"

of algorithms, therefore , the problem of the cor rec t application of algori thms arises and,

fur thermore, the question of how to "control" the computat ions. These are quali tat ive

problems because semantics i tself is qualitative.

The problem and the necessi ty of a lgori thm-control in the field of geological appli-

cat ions will be elucidated by a collection of examples. The material is ordered in three

chapters . These a t t empt to re la te the observed instabili t ies with current areas of research

in topological, i.eo geometr ical , areas. Somet imes the examples are only weakly connected

with the theoret ica l introduction to each chapter , as a theoret ical classif icat ion is not

yet available for f inite point sets from which most of the examples arose. However, it

will be e lucidated that it is commonly a question of the viewpoint - - the question what

we assume as variables and what as parameters - - if we classify a problem as a d iscre te

or a d i f ferent iable system. Such systems are commonly accounted whenever ~stability v

problems arise:

Branching solutions, i.e. bifurcations, can be de tec ted in many classical procedures:

like Chi2-test ing of directional data, surface reconst ruct ion from sca t t e red data points

and equal distance sampling in sedimentology. The widely used centroid clustering methods

turn out to provide an excel lent example of chaot ic behavior on finite point sets, Their

s ta t is t ica l value is strongly questioned because they lack structural stability. Smoothing

of directional data on a sphere and the classical Chi2- tes t for or ientat ion data fur ther

provide examples of a degenera ted bifurcation problem. A br ief discussion of i te ra ted

maps gives a connect ion to present areas of research.

The application of the concepts of s t ructural stabili ty and of ca tas t rophe theory

to ref lec t ion seismics provides a classif icat ion of s t ructural ly stable singularities in two

dimensions. The analysis of image inversions in t e rms of the local curvature of the re-

f lector and its depth produces a catalogue of images which allows a detailed, semiquanti-

ta t ive on-si te survey of the t rave l t ime record. For the geologist it can provide a f rame-

work for his quali tat ive s t ructural in terpreta t ion.

The concept of s t ructural s tabil i ty also provides new insights in paleontological

and evolutionary problems, e.g. the analysis of the "morphospace" of paleontological

species or the "bifurcation" of species. However~ such models are quali tat ive in the

narrow sense of this term and provide ra ther a framework for further analyses which

may te rmina te in models which can be t e s t ed exper imental ly or s tat is t ical ly .

Mathemat ica l details are ignored as far as possible: The object is to convey the

Vspirit~ of s t ructural s tabil i ty and re la ted fields, and its application to geological pa t te rn

recognition problems. As far as mathemat ics is required, it is kept to a minimal l e v e l -

examples of various fields are thought to be of more interest than the mathemat ica l

theory which has been summarized in various textbooks.

1.3 STABILITY

~Pattern recognition problems v as used here, cover a wide field of ~deformations t

and VinstabilitiesL Various types of pa t te rn recognition problems -- which are usually

solved by computer methods - - are analyzed in te rms of ttopological stabil i ty v. The term

ttopological stabil i ty ~ or ~structural s tabil i ty t means that the pa t te rn does n o t drast ical ly

change under a small disturbance (ANDRONOV et aL, 1966; THOM, 1974; NICOLtS &

PRIGOGINE; 1977). However, pa t t e rn recognit ion problems may result even if the disturb-

t 0

i m

m

0

0

a ~ - - m i g r a t i o n ~ [..) ~ " - - ~

Fig, 1.1: The phylogenet ic history of horses: (a) the classical gradual phylogeny a f t e r SIMPSON {1951); (b) the same phylogeny redrawn along a modern t ime scale. In t e rms of evolut ionary veloci t ies two quite d i f fe ren t "modes of evolution" are represen ted by the two figures. However, in t e rms of p h y l o g e n y - the relat ionship be tween s p e c i e s - - the t r ans format ion is s t ruc tura l ly s table as all pa thways remain the same.

a n c e , the t rans format ion , is s t ruc tura l ly s table . Such an example is given in Fig. 1.1. The

phylogeny -- the evolut ionary history -- of horses is one of the most ce leb ra ted examples

of gradual Darwinian evolution, and to some ex ten t of direct ional se lect ion {Fig. 1.1 a).

However, if the t ime scale used by SIMPSON (1951 and others) is replaced by the absolute

t ime scale under cur ren t use, the phylogenet ic pa t t e rn changes dramat ica l ly with respec t

to mode and veloci ty of evolution (Fig, 1.1 b). All s ignif icant "evolut ionary events" are

now concen t r a t ed within very narrow t ime intervals . What does not chang% is the princi-

pal s t ruc tu re of the phytogenet ic lineages, i.e. the ances to r -descendan t relat ionships

are s t ruc tura l ly s table . The ttime~ axis in Fig. 1.1 b is deformed like a rubber s t r ip

which is d i f fe ren t ia l ly s t r e t ched without folding - - a purely topological deformat ion .

Although this deformat ion is purely topological and s t ruc tura l ly s table , it changes

the ' s eman t i c ' i n t e rp re ta t ion of the evolut ionary mode. While Fig. 1.1 a indicates a

slow gradual evolut ion under a long- term changing envi ronment , Fig. 1.1 b indica tes

periods of ' s tas is ' with l i t t le morphological evolution which are in te r rup ted by short t e rm

in tervals of rapid evolut ionary change and associa ted speciat ion events . A simple, however

not t r ivial , t r ans fo rmat ion of the scale, thus, may t ransform a gradual is t ic p ic ture into

a punc tua ted one (cf. STANLEY, 1979).

S t ruc tura l s tabi l i ty in a more precise sense can be r e l a t ed to the topological simi-

lar i ty of the " t ra jec to r ies" of a process in this phase-space (NICOLIS & PRIGOGINE, 1977;

HAKEN, 1977). The " internal dynamic" of a process is usually descr ibed by d i f fe ren t ia l

equat ions which usually depend on some pa rame te r s . In many physical i n t e rp re t a t ions

these p a r a m e t e r s can be ident i f ied with some s t a t e of the env i ronment of the system,

i.e. they depend on various kinds of d is turbance act ing continuously on the system (HA-

KEN, 1977). As the sys tem and/or its env i ronment evolves, some of these p a r a m e t e r s

can change smoothly or suddenly, and during such a change the principal behavior of

the system can change.

homogeneous ~ ~" ed

A B strain stra/n

Fig. 1.2: Stress-strain diagrams of deep-sea sediments: a) homogeneous sediments; b) s t r a t i f i ed sediments . Within each sequence compac t ion and overload increase(modi f ied from BAYER, 1983).

In physical sys tems not uncommonly a threshold occurs which, when passed, causes

a sudden change of the behavior of the system. The most d ramat i c change in a dynamic

sys tem is tha t its t r a jec to r ies in the phase-space change the i r topological configurat ion.

By a small pe r tu rba t ion of a pa rame te r , the system then devia tes widely from the initial

s i tuat ion. Fig. t .2 i l lus t ra tes this s i tuat ion roughly by the " t ra jec to r ies" of a s t r ess - s t ra in

diagram. The exper iments were per formed with a ro ta t ing vane (with the vanes inser ted

paral lel to the bedding planes of sediments; cf. BOYCE, 1977; BAYER, 1983). In the

s t r e s s - s t r a in diagram -- the 'phase plane ' of the process -- two qual i ta t ive ly very d i f fe r -

ent p a t t e r n s were observed (Fig. 1.2; BAYER, 1983) depending on the " s t r a t i f i ca t ion"

of the sediments . In homogeneous sediments the s t ress - s t ra in curves are smooth while

in s t r a t i f i e d s e d i m e n t s a sudden b reak occu r s . Wi th in t he r a n g e of m e a s u r e m e n t s t h e s e

two t y p e s a r e i n d e p e n d e n t o f t h e c o m p a c t i o n (preloading) o f t h e s e d i m e n t s , i .e. w i t h in

e v e r y se t t h e t r a j e c t o r i e s evo l ve s m o o t h l y and s t r u c t u r a l l y s t a b l e wi th i n c r e a s i n g c o m -

pac t ion . However , t h e o t h e r o b s e r v e d p a r a m e t e r , t h e l a m i n a t i o n or s t r a t i f i c a t i o n o f t h e

s e d i m e n t s , c a u s e s an e s s e n t i a l c h a n g e in t h e m o d e o f fa i lu re : A ve ry d i s t i n c t po in t o f

f a i l u re a p p e a r s in w e l l - s t r a t i f i e d s e d i m e n t s wi th a " sudden j ump" in t h e s t r e s s va lues .

T h e r e is no c o m m o n s e n s e w i t h r e g a r d to t he t e r m ' s t a b i I i t y ' . As H O C H -

STADT (1964} no tes :

"Often i t . is not necessary to determine the explicit solution

to a problem, but it is important to be able to say something

about the solution ... In many physical problems one is motivated

by the feeling that a small change in the conditions of the

problem should result in a comparably small change in the solu-

tion .... The word stability is a very tricky word. No one defi-

nition seems to be adequate for all purposes."

. • . " . % 0 . ° . ~

, A " ~ ' ~ " ~ ' " - . 4 " t .~.:--...

2. N O I S Y S Y S T E M S A N D F O L D E D M A P S

Many geological and paleontological problems are r e l a t ed to the recons t ruc t ion

of some anc ien t s t a t e from the present remains. The present s ta te , however, is usually

noisy as various fac tors may have inf luenced the sys tem during its history. This s i tuat ion

comes very close to the r econs t ruc t ion of de formed signals in informat ion theory. In

the theory of informat ion the d is turbance of signals by random (white) noise pIays an

impor tan t role (YOUNG, 1975). The s tab i l i ty problems of such noisy systems can be

NOISE

X

Fig. 2.1: Dis tu rbance of signals by random, whi te noise. The init ial signals are well s epa ra ted points or suff ic ient ly small c i rc les on the {x,y)-plane at t ime t=0. With increasing t ime, whi te noise is added, and the area increases where the signals are found with some probabil i ty. Where these areas in te rsec t , two signals are in Competi t ion for the r econs t ruc t ion process.

visual ized in a three-d imens ional model like Fig. 2.1. An init ial signal is cha rac t e r i zed

as a point (or as a suff ic ient ly small circle) on a space plane (x,y). On its way to the

r ece ive r whi te noise is added. As a result , the signal is dr iven out of its original position.

When the random noise sums up during t ime, the signal will be found with a specif ic

probabi l i ty within a ce r t a in area which surrounds the original position of the signal.

A serious recogni t ion problem occurs when the probabi l is t ic neighborhoods of d i f fe ren t

signals s t a r t t o overlap. Indeed, a signal found within an in te r sec t ion area cannot be

r econs t ruc t ed with ce r ta in ty .

The signals, discussed so far, are isolated points which are originally located in

disjunct areas. Now, if we cover parts of the (x,y)-plane densely with signals so that

their initial areas of definition are connected along boundary lines, then another way

to formulate the recognition problem is more appropriate. The evolution of the signal--

space along the t ime axis can be described as a map

source ---> receiver .

The overlapping of the probability areas, in which a signal will be found {Fig. 2.1}, can

then be described as local folding of the original definition space. Fig. 2.2 i l lustrates

the local folding of the (x,y)-plane. Within the folded areas it is not possible to solve

" 7

: : . ' . .

a Fig. 2.2: A folded sheet as a model of a locally folded map (a). A continuous curve on a sheet may develop se l f - in tersect ions in the project ive plane of a folded sheet (b).

uniquely the inverse problem, the reconstruct ion of the 'original signal'. Fig. 2.2b illus -

t ra tes the distortion which can be caused by a local fold. A regular curve on the original

plane develops a se l f in tersect ion on the projection of the local fold. However, deforma-

tions of this type will be discussed in more detail in the last chapter .

In an informat ion- theore t ic approach the disturbance of signals is due to random

forces, and the reconstruct ion process is mainly a s tochast ic problem. The possible drift

of signals or par t ic les is governed by a probability d i s t r i b u t i o n - - the classical example

is the Brownian movement of part icles in a fluid. In geology and paleontology problems

of this type arise mainly if global propert ies of distributions are recons t ruc ted from

s tochas t ic samples by local es t imat ion methods. In this first chapter , pa t t e rn recognition

problems will be elucidated by examples which are related to the superposition of densi ty

functions and to double- (multiple-) valued local solutions of reconstruct ion processes.

In the first example s ta t is t ica l problems are discussed in terms of the reconstruct ion

of original sediment volumes and sediment accumulation rates. The system bears three

10

types of disturbances: Uncer ta in ty about the datum points; s tochast ic components , and

a sys temat ic t rend to under-consolidation of sediments with low overburden.

Then the analysis of ' in t raspeci f ic variability of paleontological species ~ is discussed

in te rms of a probabilistic ontogenet ic morphospace. It will turn out that the covarianee

s t ructure between (measurable) features within the ontogenet ic morphospace can be help-

ful for the taxonomist . But, we will see further tha t one cannot expect linear relat ion-

ships between the f e a t u r e s - - a nonlinear theory seems appropriate ra ther than just

an analysis by the so-ca l led higher s ta t is t ica l methods.

In a quali tat ive discussion of the analysis of directional data the problem will be

to find an optimal weighting function for the reconst ruct ion of a smooth density distribu-

tion from sca t t e red data. It will become clear that the cri t ical areas of the reconst ruc-

tion are in tersect ions of the areas~ on which the weighting function is defined,

In the next example the reconstruct ion of surfaces from sparse point pa t te rns by

computer methods is discussed. It will be shown that the problems which arise in this

con tex t are mainly of a geometr ical nature. Therefore , the algori thmizat ion of the recon-

s t ruct ion process is not trivial. The local es t imat ion methods in use provide no unique

solution, i.e. they are very sensit ive to small changes of the initial conditions. The

example, therefore , leads over to the next chap te r where this type of instabil i t ies is

discussed in more detail .

2.1 RECONSTRUCTION OF SEDIMENT-ACCUMULATION

The problem to reconst ruct accumulat ion- and sedimentat ion ra tes arises in sedimen-

tology in order to gather information about sea-level changes, c l imat ic changes, and

the evolution of basins. Fur thermore, accumulation and sedimentat ion ra tes clearly indi-

ca te hiatuses in the sedimentary sequence on the base of which local and global events

of the past are recognized (e.g. VAIL et aI., t977). However, several processes and as-

sumptions are involved in the reconstruct ion of which the most important ones are

** the dating of the sediment sequence

** the es t imat ion of the original sediment thickness without compact ion.

Both reconst ruct ions are biased and usually involve specif ic assumptions about the datum

points, the initial porosit ies and the consolidation s t a te of the sediments.

11

2.1.1 Accumulation Rates and Deformations of the Time-Scale

The computat ion of accumulation ra tes requires es t imates of the absolute t ime

scale, i.e. a suff ic ient number of datum points along the sediment column. As soon as

the datum points are given, the computation of the accumulation ra tes is ra ther simple,

age s tages accumulation rate cm/kyr stages accumul, ra te cm/kyr

0 I 2 3 4 t

14o [ K I M M E R I D - / GIAN

_1 O X F O R D I A N

15o

CALLOVIAN

160 BATIqONIAN

BAJ OCIAN

I70

AALENIAN

TOARCIAN

-- 180 PLIENS- BACHIAN

S1NEMURIAN

- - 1 9 0 1 HETTANGIAN

- . J r -

t

I _ _ J

m

m

l _ _ _ I

I - - - -

t- a

K I M M E R I D G I A N

OXFORD,

! L . . . . . . . . . , ,

I . . . . . . . . . . . . . . '

CALLOV0

BATHON.

BAJOCIAN

AALENIAN

TOARCIAb

PLIENS- BACHIAN

SINEMUR.

HETTANG.

[]

HI

t l . , .

~ Cyclic accumulation rates of sediments in the South German Jurassic. on the t ime scale of VAN HINTE (1976); b) based on an a l tered

Jurassic t ime scale {see text for explanation). On both scales a cycl iei ty is obvious, however, in (b) the cycles are much more regular (about 4 Ma), and a superimposed megatrend appears.

b

12

i.e. it is the quot ient

sediment thickness

t ime interval of depos i t i on .

Fig. 2.3 gives such accumula t ion ra tes for the South German Jurass ic and i l lus t ra tes

how they subdivide the Jurass ic sequence into gener ic deposit ional cycles. However,

the pa t t e rn is not invar ian t against deformat ions of the t ime scale. Essent ial ly the same

s i tuat ion arises as was discussed for the revolution of horses v if the t ime scale is a l te red

(Fig. 2,a A,B).

The accumula t ion r a t e s in Fig. 2.3A have been ca lcu la ted using the t ime scale

of VAN HINTE (1976). However, for the t ime- in te rpo la t ion within s tages a more r ecen t

b ios t ra t ig raphic subdivision (COPE e t al., 1980) was used (McGHEE & BAYER, in press}.

A deformat ion of this t ime scale (Fig. 2.aB) a l te rs the cycl ic i ty drast ical ly, and the

pa t t e rn becomes much more regular . In addition, a well pronounced megacycle becomes

visible (Fig. 2.3 B).

Van Hin te ' s Jurass ic t ime scale is based on e s t ima tes of the upper and lower bound-

ary of the Jurass ic and of one addit ional ' ca l ib ra t ion point v at the middle of the Jurass ic

(base of the Bathonian). Between these points he divided the t ime scale l inearly by the

number of ammoni t e Z o n e s - - with the resul t of an average durat ion t ime of IMy for

each ammon i t e Zone. Now, s ince he published his t ime scale, the b ios t ra t ig raphic scheme

has been a l tered, and the re fo re the durat ion t ime var ies from Zone to Zone. However,

by shif t ing the addit ional -- middle Jurass ic -- ca l ibra t ion point into the Callovian the

original assumption of 1My/ammoni te Zone can be res tored for the Lower and Middle

Jurassic . This has been done in Fig. 2.3B, and this simple t r ans fo rmat ion genera tes the

except ional cycl ic pa t t e rn which agrees with o therwise es tabl i shed cycles of s imilar

phase length (cf. EINSELE or McGHEE & BAYER in BAYER & SEILACHER, eds., in

press}.

As discussed ear l ie r for the evolut ion of horses, the cycl ic i ty per se is a s t ruc tu r -

al s table p a t t e r n which is preserved under topological t r ans fo rmat ions of the t ime scale

(even if o the r proposed scales are used, e.g. HARLAND et al., 1978), while the regular i ty

of the cycles, the i r phase length and m a g n i t u d e - - i.e. all p roper t ies from which we

can ga ther in format ion about the veloci ty of the p r o c e s s - - change as the scale is

changed.

2.1.2 Es t ima t ion of Original Sediment Thickness

One impor tan t process ~ which a l t e r s the physical proper t ies of sediments~ is compac-

13

Sediment Water Content (w) Bulk Coml~sition and Porosity (p) Density

{%) (%) (g/cm 3)

0

c~ 4OO

5 0 0 ~ ~ 1

Fig. 2.4: Depth-logs for sediment

20 40 60 80 100 20 40 60 80 1.4 1.8 2.2 I I ¥ ] I I r "

Computed Grain

Density (g/cm 3}

2.4 2.7 3.0

fi

Sonic Velocity

(m/s)

1.4 2.0 2.6

composi t ion and physical proper t ies for DSDP-si te 511 (adapted from BAYER, 1983). Data are mean values for cores (D: diatoms, N: nannofossils, c: clay content , O: o ther components) .

tion under the overburden of la te r deposits. Especially in clays the physico-chemical evolu-

t ion is dominated by compact ion. Fig. 2.4 i l lus t ra tes how the physical proper t ies in a

sediment column change with depth {i.e. overburden). In the example given, the sediment

column below 200 m depth is dominated by clay, and within this column the physical

pa rame te r s porosity and wa te r - con t en t decrease continuousIy as the sed iment becomes

increasingly compacted . In the same course the densi ty of the sediment increases and

tends slowly towards the mean grain density of the sediment .

tf one assumes tha t the void volume of the sediments is in equilibrium with the

overburden, then a f irst approximation for the equilibrium curve of compact ion can be

given by the equat ion

dn dp + cn = 0 (2.1)

{where n: porosity = re la t ive void space; p: pressure or overburden). The cons tan t 'c '

can be in te rp re ted as a coef f ic ien t of volume change (TERZAGHI, 1943), which is specif-

ic for par t icu lar mater ia ls . In tegra t ion gives a simple declining exponent ia l funct ion

n = no exp (-cp). (2.2)

14

n 1.o

0.8

0.5

0.,~

0 2

0 .0 m

0 100 200 300 ZOO 500 500 200 800

Fig. 2.5: Porosi ty data from Fig. 2.4 with least square f i t ted t rend lines (see tex t for explanat ion) .

This model has been used in Fig. 2.5 to e s t i m a t e t h e decl ine in porosity with respec t

to depth (i.e. the change in bulk densi ty has been neglected, cf. Fig. 2.4). Empir ical ly

the da ta are well approximated. Besides the mean t rend line some more t r a j ec to r i e s

are given in Fig. 2.5~ which have been cons t ruc ted under the assumption tha t the coeff i -

c ient of volume change is cons tan t while the init ial porosity of the sediments may have

been var iable and, thus, cause the s ca t t e r ing of the da ta points.

If we assume tha t the sed iment is everywhere in equil ibrium with the overload,

it is no problem to e s t i m a t e the thickness of the sediment column which would resul t

wi thout compac t ion (e.g. MAGARA, 1968; HAMILTON, 1976). Fig, 2.6 i l lus t ra tes how

the two principal componen ts of a sediment change under pure compact ion: The volume

vn

vs

Fig, 2.6: The two components of a s e d i m e n t - - voids and s o l i d s - - during

compact ion. Vn: volume of voids, Vs: volume of solids, p: pressure = overload.

15

of solids remains constant while the volume of voids decreases. The porosity is defined

as the re la t ive volume of the voids so that

V n = nV and Vs= ( 1 - n ) V . (2.3)

where n: porosity, Vn: volume of voids, and Vs: Volume of solids. Because the volume

of the solids is not changed by compaction, one has

and, therefore ,

Vs(t=O ) = Vs(t )

(l-no)Vo = (l-n)V

from which we immediately have the compact ion number

V 1-no C = Vo - 1 , n (2.4)

and the decompact ion number

D = Vo l-n V - 1-no -

The decompaction number allows to compute

layer if we know its original porosity no, i.e.

1 C ' (2.5)

the original thickness of any sediment

V o = Dr. (2.6}

The thickness of the ent i re sediment column then can be computed by summing up all

sediment layers or, if regression curves are used as in Fig. 2.5 by evaluation of the inte-

gral

! - n V f z l - n ( z ) dz (2.7) i=l l-no i or zo 1-no

Both techniques have been used in Fig. 2.7 whereby the original porosities of the samples

have been es t imated from the intersect ion of their associated t ra jec tory (from Fig. 2.5)

with the zero-depth line. In a s ta t is t ical sense the t ra jec tor ies of Fig. 2.5 are error

bounds to the mean regression line (probabilities can be a t tached to them by standard

s ta t is t ica l techniques}, and so the curves in Fig. 2.7 can be interpreted. Thus, the recon-

s truct ion of the original sediment amount is simply a s ta t is t ica l process. However, it

closely resembles the si tuation of Fig. 2.1. The data are biased by the sampling tech-

nique as well as by the laboratory technique. Now, if we add a small error to a data

point, it will not a f fec t the results much if the overburden (or depth) is small. However,

as the overburden increases, the t ra jector ies in Fig. 2.5 come closer and closer. An error

of the same magnitude, therefore , biases the results increasingly.

16

15

km t 4

1.2

1.0

0.8

0.6

0.4

0 2

0.0

thickness

n = 0 9

/

depth

o 1oo 20o 3oo 4oo 500 6o0 zoo m

0 8 5

0.8

075

Fig. 2.7: Decompac ted sediment thickness based on the data of Fig. 2.5 (m: mean trend, n=0.9 etc . : in tegrals of the t rend lines in Fig. 2.5.

While the recons t ruc t ion of the init ial s t a t e of the sediments depends on how far

the "compact ion machine was run", we can, on the o ther hand, atways find the output

if the "machine would work unti l infinity". This s table limit, of course, is simply the

volume of solids, and the "dry sed imenta t ion ra tes" (of. SWIFT, 1977) are, of course,

only biased by the t ime scale and the labora tory technique.

2.1.3 Underconsol idat ion of Sediments -- a History Ef fec t

Es t imat ions of the original sed iment volume are usuatly based on the assumption

tha t the consol idat ion s t a t e of the sediment is in equilibrium with the overburden. In

this case, as was pointed out, the e r ror of the es t imat ion should increase with increasing

overburden. However, if burial depth is small, then the t ime-dependen t flow of the pore-

wa te r cannot be neglected; it bears on our unders tanding of the widespread underconsol-

idat ion of r ecen t sediments , which is observed even under slow sed imenta t ion ra t e s (MAR-

SAL & PHILIPP, 1970; EINSELE, 1977).

The consol idat ion of sed iments is descr ibed by Terzaghi ' s model (e.g. TERZAGHI,

1943; CHILINGARIAN & WOLF, eds., 1975, 1976; DESAI & CHRISTIAN, 1977). In a

one-dimensional sed iment column and under the assumption of a cons tan t coef f ic ien t

of consol idat ion Terzaghi ' s model takes the form

17

3p = a2p

- - m -

a t 3x 2 ' (2.8)

where p: the excess pore water pressure due to overload, m: the consolidation coeff i -

cient, and t: t ime. If this model is discret is ized in space, i.e. if the sediment column

is divided into small discrete e lements , then the partial different ial equation is t rans-

formed into a set of ordinary different ia l equations:

dP x d--~ = m(Px-Ax- 2Px + Px+fXx)" (2.9)

Now, if one reduces the system to a single e lement -- a situation which occurs in labora-

tory exper iments - - then we can rewri te equation {2.9) as

dp --+ cp = i(t), (2.10) dt

where the right side describes the " i n p u t " - - i.e. the f l u x e s - - at the boundaries of the

e lement as a function of time, and with free boundary conditions (I(t)=O) a suddenly

imposed pressure declines exponentially with time.

The idea of Terzaghi 's model is that a sudden imposed load increases initially the

pore-water pressure (excess hydrostat ic pressure) and that this pressure decreases a f t e r -

ward due to a loss of pore-water from the e lement whereby the excess hydrostat ic pres-

sure is t ransformed into a pressure at grain contacts . Associated with the loss of pore--

water is an increase in the number of grain contacts . Therefore, the sediment approaches

a new equilibrium s ta te a f te r compression which, of course, is usually not reversible.

The reduction of volume is res t r ic ted to the volume of voids, and the change in pore

volume is simply proportional to the decline in the excess hydrostat ic pressure:

~n ap __dz = m--dz . (2.11) at ~t

Thus, we can solve equation (2.10) in terms of the pore volume, which in case of free

boundary conditions takes the form:

-ct V(t) = Ve+ (Vo- V e) e (2.12)

for a load which is suddenly applied. The load is here represented by the equilibrium

volume V e (cf. equation 2.2), and the excess hydrostat ic pressure is proportional to the

reducible void volume (Vo-Ve). Now, if at t ime t=t 1 an additional load is applied, then

equation (2.12} takes the form

V(t) = Ve2+ (V(t I) - Ve2) e-C(t-tl) , (2.13)

18

which can be r ewr i t t en if V(t l ) is inser ted from equat ion (2.t2):

V(t) = Ve2+ (Vel- Ve2)eCt2 e-at+ (Vo-Vel)e -ct " (2.14)

As this equat ion shows, the re is some remaining reducible porevolume from the f irst

loading even t , which has to be taken into considera t ion. If fur ther load is added in dis-

c r e t e s teps, we ar r ive finally at

I"I

V ( t ) = V i + ( [ ( V i _ l - V i ) e C t i ) e - c t

±=1

(2.15)

which i l lus t ra tes how the ear l ie r loading s t a t e s con t r ibu te to l a t e r s ta tes . The equil ibrium

can only be approached if the t ime in tervals be tween Ioading are suff ic ient ly long, o ther -

wise the sed iment layer will be underconsol idated. This history e f f e c t of loading is illus-

t r a t e d in Fig. 2.8 for various t ime in tervals be tween loading events . The excess hydros ta t -

ic pressure {p in Fig. 3) develops c lear ly a maximum which degene ra t e s to a s imple

declining exponent ia l funct ion for a single loading event and to a sequence of such single

events , as the t ime in tervals be tween loading become large.

10ad. ~ t i i, i i 1 i j i i i i l i i i i i

V

\

i

k

m , . .

t t 1 t

Fig. 2.8: Responce of a single sed iment layer under s tepwise loading when loads are apptied a t d i f f e ren t t ime intervals : t ime in terva l dec rease from lef t to r ight; r ight: a single load. V: void volume; P: momen ta ry reducible void volume which will vanish even if no addi t ional load is applied; C:equi l ibrium void volume for every loading event . At the top the loading in te rva ls are marked, the to ta l applied load is cons tan t for all ' exper imen t s ' .

With respec t to the previous discussion we have, the re fore , to expect tha t e s t ima te s

of original sediment volume are biased by the t ime-de lays in the consol idat ion process,

the p a r a m e t e r s t i in equat ions (2.13) and (2.14) have, of course, the s t ruc tu r e of a t ime

delay. Fur the rmore , if the pressures at the boundaries of the sed iment layer are not

zero, i.e. if the sediment layer is a segment within a sediment column, then the t ime-

19

delay e f f ec t increases further . In case, the permeabil i ty of the sediment is low, the

excess hydrosta t ic pressure will s tay for ra ther long t ime near the values of the over-

burden, and the t ime lack between loading and equilibrium consolidation causes a cont in-

uation of pore-water flow when sedimentat ion has stopped. On the other hand, if we

consider a two- or three-dimensional system of strongly underconsolidated sediments,

any spatial disturbance like unequal loading can initialize an instable flow of pore-water ,

which may lead to fluidization or Iiquidization of the upper sediment layers.

2.2 INTRASPECIFIC VARIABILITY OF PALEONTOLOGICAL SPECIES

~n 1966, WESTERMANN observed that in several ammonite stocks -- a group of

cephalopods (Fig. 2 . 9 ) - - a specif ic intercorrelat ion of morphological fea tures occurs:

~Of particular interest is the inter-correlation between costa-

tion, whorl section, and coiling which has been observed in dif-

ferent, unrelated ammonoid stocks and cannot he satisfactorily

explained" (WESTERMANN, 1966).

Fig. 2.9: Ectocochl ia te cephalopods, lef t recent Nautflus and two ammonites with well marked ontogenet ic changes in morphology.

Because BUCKMAN (1892) observed, probably for the first t ime, this part icular type

of covariat ion (intercorrelation) between the ornament and the whorl section in ammonites,

Westermann named this relationship 'B u c k m a n' s 1 a w o f c o v a r i -

a n c e ' . In some cases, the 'covariance ' extends to other features:

"in general the complexity of the suture-line increases in pro-

portion to the decrease of ornament"

This caused Westermann to establish 'Buckman's second law of covariance'. Proceeding

in this way, any correlation between features, which cannot be satisfactorily explained,

would lead to a new 'law', and 'experiments' with other ammonite stocks would soon

disprove the specific correlation sufficiently 'to be a law'.

20

1 D 1-8

1.0

• e e • • •

• • • •

, e e , ~ •

• e e e •

• • • •

• , • e

e

• j

,I o •

e • •

o

• • •

DSP. b ' ~ - - - ~ ' ~ - ~ - ~ - ~ 2 6 o I 50 100 m m

Fig. 2.10: Covar ia t ion of o rnamen t and c ross -sec t ion of Sonninia (Euhoploceras) -

adicra (Waagen), modified from WESTERMANN (1966). The s c a t t e r g r a m shows tha t the morphotypes cover a cont inuous area in the p a r a m e t e r space; D: Raup 's morphological p a r a m e t e r " ra t io of whorl height to whorl width", DSP: end d i ame te r of the spinous s tage.

On the o ther hand, Wes te rmann was able to show, by means of the eovar ia t ion s t ruc-

ture, t ha t 80 descr ibed species of the subgenus S o n n i n i a ( E u h o p l o c e r a s ) belong to a single

species and tha t the observed var iabi l i ty must be viewed as an in t raspec i f ic property .

His b iomet r ica l study {cf. Fig. '2.10) shows tha t the specimens of this lumped species

fill a cont inuous area in the p a r a m e t e r space (DSP, I/D) and tha t the cos ta t ion types

or ' f o rma ' are regular ly a r ranged within this p a r a m e t e r space (of. Fig. 2.10 for explanat ion

of parameters ) .

The covar ia t ion pa t t e rn described by WBuckman's law w is not unique within the ammo-

ni tes , but it is also not universal . Addit ional s tudies (e.g. BAYER, t977) show tha t in

some cases Wage e f f ec t s w may play some role and tha t there are some special condi t ions

which make 'Buckman ' s law' easi ly visible. One of these condi t ions is t ha t the morphology

changes s t rongly during ontogeny (cf. Fig. 2.9 for cases of r a t h e r s t rong on togene t i c

changes). The avai lable in format ion makes i t likely tha t the observed cor re la t ion is due

to oblique sec t ions through the on togene t i c morphospaee because t ime is not accessible .

The problem tha t age is not avai lable in paleontology is well known; GOULD (1977} discus-

ses in deta i l the problems, which arise, if equal sizes but d i f fe ren t ages of specimens (and

species) are compared by the a l lomet r ic relat ionship.

21

Evidence for an age control of 'Buckman's law' comes from additional features of

the s h e l l s - - the spacing of growth lines and s e p t a - - which both are likely formed in

ra ther regular t ime intervals. Especially the spacing of septa (which is more easily ana-

lyzed) shows a close relationship to cross-sect ion and sculpture in cer tain ammonites

(BAYER, 1972, 1977). Fig. 2.11 i l lustrates such a relationship between spacing of septa

and shell morphology.

S

70°

5o

3o

to

0:2 oL5

:g

r l i i

"t 5 lo 20 m m

Fig. 2.11: Relationship be tween spacing of septa and morphology in ammoni tes (modified from BAYER, 1972). s: angular distance of septa; r: radius of the shell.

2.2.1 Allometric Relationships

If one accep ts the hypothesis that 'Buckman's law' describes a phenomenon of intra-

specif ic variation, we should be able to deduce it from more basic biological principles.

Everyday exper ience on living organisms shows that most morphological features change

with age and that the relationship be tween two morphological features (which can be

quantified) leads usualiy to an al lometric relationship, i.e. a relationship of the form:

y = ax b or log(y) = log(a) + bx. {2.16)

Actually, the al lometr ic relationship can be t raced fur ther down to the, say,

' f i r s t p r i n c i p 1 e s o f g r o w t h' (HUXLEY, 1932), The term ' f i rs t

22

principle ' is here used in the sense tha t i t is very likely to observe such an a l lomet r i c

re la t ionship. As HUXLEY not iced, two measu remen t s (organs etc.) are in an a l lometr ic

re la t ionship when they both grow exponential ly, i.e. let Yl' Y2 be the two measurements ,

which grow exponent ia l ly

Yl = a l e e l t ; y2 = a 2 e C 2 t , (2.17}

then by e l imina t ing t ime we find the a l lomet r i e re la t ionship

= (Y2) c l / c 2 Yl a l ~ '2 " (2.18)

Now, s t r ic t ly a l lomet r i c growth resul ts also in more sophis t ica ted growth models like

the "Oomper tz model". In this model one assumes tha t the p a r a m e t e r ' c ' is not cons tan t

but decreases with age. Growth then can be descr ibed by a pair of d i f fe rent ia l equat ions

dy dc d--'t + c ( t ) y = 0 and an equat ion like d--t = -c . (2.t9)

The growth p a r a m e t e r ' c ' can be any funct ion of t ime, which goes to zero for large t ime

values {ideally as t ime approaches infinity). Especially, any s tab le output of a l inear con-

t rol system (e.g. homogeneous l inear d i f fe ren t ia l equations) provides a possible input for

the growth pa rame te r . A pe r fec t a l lomet r ic re la t ionship resul ts whenever the two organs

under cons idera t ion are cont ro l led by the same mechanism, i.e. if the i r growth equat ions

take the form:

dy dx dt c(t)*ay = O; d-'t- c(t)*by = O; (2.20)

by e l imina t ing t ime one finds the pe r fec t a l lomet r ic relat ionship

dy a y o r y = XoX a / b . dx - bx

In both cases considered so far the a l lomet r ic re la t ionship descr ibes the re la t ionship b e -

tween two growing organs in the phase-plane, i.e. the t r a jec to r ies of growth without con-

s idera t ion of the veloci ty of growth. Indeed, we may still fu r ther genera l ize the re la t ion-

ship to pairs of l inear d i f fe ren t ia l equat ions like

dy dx f(t)~-~ = ax + by; f(t)~ T = cx + dy, (2.21)

and the re la t ionship be tween the two measu remen t s takes the form

dy ax + by (2.22) d-x = cx + dy

23

- (a + d)

ters

Fig.2.12: Relat ionship be tween type of equilibrium a n d coef f ic ien t s of a pair of f i rs t order d i f fe ren t ia l equat ions {equation 2.21}. The type of equilibrium depends on the eigenvalues of the coef f ic ien t mat r ix of equat ion 2.21. The e igenvatues are given by the root

),l,)t2 = { (a+d) -+/((a+d) 2 - 4(ad-cb)) } /2

(e.g. HOCHSTADT, 1964; JACOI3S, 1974; HADELER, 1974).

which provides a l lometr ic relat ionships for a wide range of p a r a m e t e r values (cf. Fig.

2.12).

Huxley's a l lomet r i c relat ionship, there fore , appears as a r a the r likely f irst order

approximat ion of the relat ionship be tween growing organs or measu remen t s taken on a

growing organism. However, t he re are numerous except ions especial ly in ontogeny. Such

an example is given in Fig. 2.13 -- the non-l inear on togenet ic t rend in a Paleozoic ammo-

ni te which, however, can be approximated by a l lomet r ic relat ionships in d i f fe ren t intervals .

2.2.2 The tOntogenetic Morphospace f

If one picks individuals of a ce r t a in age class from a species~ then the morphological

24

lo

r

I i i t I i t I

2 4 6 8 w h o r l N °

Fig. 2.13: Nonlinear ontogenet ic relationships in a Paleozoic ammonite which can be s tepwise approximated by simple a l lometr ic relationships (modified from KANT & KULLMANN, 1980).

fea tures show usually a typical intraspecif ic variability, and in most cases the d i f fe rent

fea tures are cor re la ted within every age class, e.g. size and weight are corre la ted and

can be described by a two-dimensional Gaussian distribution for every age class. In the

most simple case one needs two sources of variation to describe the ontogenet ic mopho-

space of a species:

a) for every age class a description of the variabili ty of all features under consider-

ation and their covariances. As a first approximation one can assume that the

t ime sect ions through the ontogenet ic morphospace are multi-dimensional Gaussian

distributions;

b} a description how the mean of these distributions moves with increasing age

through the morphospace. This gives a charac te r i s t i c {mean) ontogenet ic t r ace for

the ent i re species -- for measurements , the mean {multidimensional) a l lometr ic

relationship.

Fig. 2.14 i l lustrates this description of the morphospace whereby the 'mean ontogenet -

ic t race ' is approximated by a s traight line {e.g. an ideal al lometric relationship in loga-

r i thmic coordinates), and the age sect ions are idealized as ellipsoids {ideal Gaussian distr i-

bution). It is obvious that this description cannot be used only for continuous ontogenet ic

development {as in the ammoni te example), it also holds for growth in f ini te s teps like

in crus tacea . Thus, this kinematic model provides a relat ively general description of the

25

IU N'," . - -

, • ~ - ~ o • • • • t , : m @@

• ; '~:" 4 , • . • •

Oj

O

l m O

$

Fig. 2.14: A linear modeI for the 'ontogenet ic morphospace' of a species. The variables u,v,s are parameters or measurements which charac te r ize the morphology. The ellipsoids are t ime sections, i.e. they are the probability distributions for a cer ta in age cIass. They are dislocated within the (u,v,s)-space with t ime t e i ther continuously or in d iscre te steps. The hull of these ellipsoids {in the linear model a cone) is the probabilistie boundary of the ontogenet ic morphospace. Sections through this morphospace by another variable than time, e.g. size (s), are ellipses which contain various age classes which may appear to be strongly corre la ted.

ontogenet ic development as well as a definition of a probabilistic morphospace for the

whole ontogeny.

If this ontogenet ic morphospace is now sect ioned by another variable than by age,

e.g. by constant size which is an accessible con t ro l -parameter in p a l e o n t o l o g y - - then

the section contains parts of the ontogenet ic trend. Thus, even if the fea tures under

consideration are uneorrelated within an age-sect ion, it is possible to find a strong corre la-

tion within the s ize--sect ions (Fig. 2.14). l~low strong this correlat ion will be, depends

on the specif ic ontogenet ic t race, on the correlat ion of fea tures in the age-sect ions and

on the angle between' the principal axis of the age distribution and the ontogenet ic t race .

26

'Buckmanfs law' was observed in those ammoni tes which show specially strong mor-

phological changes through ontogeny, and the observed variability for a constant size

consists of morphotypes which are found as ontogenet ic growth s ta tes in all specimens.

Therefore , it is likely that this Vlaw' results simply from the oblique sect ions through

the age dependent morphospaces; whereby a high correla t ion be tween fea tures on the

level of the age sect ions may inforce the strong correla t ion within the size sections.

2.2.3 Discontinuit ies in the Observed Morphospace

So far, the mean ontogenet ic t race has been assumed to be a s traight line or can

be t ransformed into a s traight line (i.e. if it is ideally allometric). However, even al lometry

is only an idealized first order approximation. Especially in ontogeny more complex rela-

tionships commonly occur,, which only allow an a l lometr ic approximation through cer ta in

intervals (Fig. 2.13, cf. KANT & KULLMANN; 1980). Non-linear relationships are usually

found if morphology is described by some index numbers -- as it is the case in t theoret ical

morphology ~ (e.g. RAUP, 1966). Thus, in the general case one has to expect that the

mean ontogenet ic t race is a th ree- or more-dimensional curve. This causes complicat ions

if t ime is not available as the controll ing variable; e.g. the size sect ions will show defor-

mations as a function of age. A mathemat ica l description of the morpho-space without

t ime can, therefore , lead to ra ther complicated nonlinear equations.

In addition, one has to expect complicat ions in any project ion of the n-dimensional

ontogenet ic morphospace (all possible relationships) onto a subspace, say the two-dimension-

al subspace of a point plot. Fig. 2.15 gives a sketch of such a curved ontogenet ic morpho-

space. In the convex area of its hull a singularity appears due to the projection into

......... . ......... :..:.:.:z::,-:::::.::.::

Fig. 2.15: The hull of a curved ontogenet ic morphospace, a single ontogenet ic t race and the probabilistic neighborhood of this t race. Other trajectories~ which s ta r t close to the sketched trace, will be within this probabilistic neighborhood. In the concave area of the hull a swallowtail singularity appears, which will be discussed in chapter 4.

27

the plane. Such structural ly stable singularities will be discussed in detail in chapter 4,

however, some aspects of the deformations in subspaces can be already discussed here

by the analysis of the ontogenet ic t races of single specimens.

If one picks a cer ta in set of ontogenet ic t races for single specimens from the proba-

bilistic ontogenet ic morphospace, then, by experience, one can expect that they evolve

in a regular manner and that they do not depart too much from their original relat ive

position within the age section:

Experience shows that a juvenile ' pyknic ' human will, in general,

not turn into a FleptosomeF one during its ontogeny.

Now, we can describe the evolution of the ontogenet ic morphospace as an i te ra ted

(or continuous) map which describes the change of the age dependent probability distribu-

tion and the dislocation of its mean. And, one can assume that the map, which genera tes

the probabilistic ontogenet ic morphospace of a species from some initial distribution,

also describes the ontogenet ic t races of single specimens up to some error term. If one

neglects the error term, which causes the representa t ion of ontogenet ic t races by tubes

rather than by lines (Fig. 2.15), then a significant regular disturbance within a family

of ontogenet ic t races can result only from the projection of the multi-dimensional space

onto a subspace. What then reasonably can be expected, without further analysis, are

local folds of the map (Fig. 2.2).

A simple model of such a fold in two dimensions is the tangent space of a parabola

(Fig. 2.16c) whereby the tangents are local linear approximations of the ontogenet ic t ra-

jectories . The concave side of the fold line, the parabola, is empty, no tangents pass

through this area. In contrary, on the convex side of the fold line two tangents pass

through every point of the plane. Naturally, such a fold model can be valid only as a

local model. In this sense Fig. 2.16 provides a paleontological example of a local fold

in the ontogenet ic morphospace.

The ontogenet ic t races of several individuals of the ammonite genus Hyperlioceras

are drawn in a two-dimensional parameter space (non-al lometric) which includes size (=Dm).

The specimens belong to d i f ferent species of this genus (BAYER, 1970)~ but this should

not be a serious problem because the idea is only to show that local folds can be expected

in paleontological 'growth ' d a t a - - under the aspects of the previous discussion these

species may well be lumped into a single species. During late ontogeny, measured by

size, an inversion of the morphological trend occurs (Fig. 2.16). Specimens with ra ther

high relat ive whorl height (N) turn into forms with moderate values of this parameter

and vice versa. This pa t tern is very regular with respect to the precision of the measure~

ments, and the inversion occurs within a relat ive small size interval. Thus, the local behav-

28

50"

~0"

r..-..: ...... :; /

/ u r n i 16 2b mm

Fig. 2.16: Ontogenet ic t ra jec tor ies of ammoni tes of the genus Hyperlioceras (a: H. desori, b: H. subsectum, c: H. d~d t e s ) , modified from BAYER (1969). N: relat ive height of whorl, Dm: d iameter of the shell. During the late ontogeny, measured by size, an image inversion occurs, which can be in terpre ted as a local fold. In the model t he fold causes local in tersect ions of the t ra jec to- ries and an empty area. If age (t) is used as an additional variable, one can expect that the t ra jec tor ies are well separated, i.e. that the in tersect ions are due to the projection onto the two-dimensional pa ramete r space.

ior of the morphological t ra jec tor ies can well be compared with a local fold. If age

could be added as an independent variable, the t ra jec tor ies would be l if ted into the third

dimension. However, if age is re la ted to the ear l ier development in the pa ramete r space

(Dm,N), then the t ra jec tor ies will be arranged in a more or less regular manner within

the three-dimensional space {Dm,N,t). The local singularity, where the t ra jec tor ies inter-

sec t , may then appear like a piece of a ruled hyperbolic surface (Fig. 2.16). The rulings

model locally the ontogenet ic t races, and their projection onto the (Dm,N)-ptane is the

discussed tangent space of a parabola.

This is not the place to say that this is the way to study and to describe the pat-

tern of Fig. 2.16. But it is a way to i l lustrate and perhaps to overcome the diff icul t ies

which arise from singular s t ructures like the regular in tersect ion of the t ra jec tor ies . In

chap te r 4 it will be shown that singularity theory or, more specific, e lementary ca tas t rophe

theory provides a very elegant method to analyze such pat terns . Anyway, it became clear

that the ontogenet ic development of morphology cannot always be considered to be linear,

nei ther on the probabilistic level of the ontogenet ic morphospace nor on the level of

individual ontogenet ic t races. The ce lebra ted analysis of morphology by the s o - c a l l e d

higher s ta t i s t ica l methods {like factor analysis) has, therefore , to be used with caution.

Pa t t e rns like in Fig. 2.16 cannot be l inearized within the observed paramete r space, and,

therefor% they cannot be analyzed with linear models. On the other hand, the ear l ier

29

discussion of the on togene t ic morphospace shows that , even within the most simple l inear

model, the on togene t ic t rend cannot be ruled out for a l inear f ac to r analysis as is some-

t imes assumed (BLACKITH & REYMENT, 1971). If the covar iance s t ruc tu re is a l t e red

by age e f f ec t s within the size sect ions, then we cannot r econs t ruc t the original dis t r ibut ion

from this sect ions without addit ional informat ion -- in paleontology qual i ta t ive informat ion

will then be pre fe rab le against any quan t i t a t ive measurement .On the o ther hand, the discussed

models provide tools for the taxonomist . They give qua l i ta t ive a rguments for the var iabi l i -

ty of species and, there fore , for the defini t ion of a species. In addition, they allow to

fo rmula te speci f ic quan t i t a t ive models.

2.3 ANALYSIS OF DIRECTIONAL DATA

The analysis of three-dimensional direct ional da ta by means of the ' s te reographic

projec t ion ' (Fig. 2.17) is a s tandard procedure in tec ton ics and sedimentology. The aim

of the procedure is usually to e s t ima te a densi ty funct ion of unknown form from data

points on the sphere (el. MARSAL, 1970). The recons t ruc t ion of the densi ty dis t r ibut ion

requires a smoothing process, in general a moving average. The classical hand method

works with a count ing a rea (circle) of 1% of the sur face of the half sphere (or of i ts

C

Fig. 2.17: a) Represen ta t ion of a t angen t plane in the unit sphere: by the ' c i rc le of in te r sec t ion ' wi th the sphere, i ts unit normal and a point on the sphere ( in tersec t ion of the normal with the sphere), b) a pair of idealized shear planes and a system of real shear plains in the s te reographic projection: r epresen ta t ion by the 'c i rc les of in te r sec t ion ' and the normals, c) Two s te reographic project ions of the same set of joints; above: Schmidt ' s grid (equal area); below: Wulf's grid (equal angles).

30

projection into the plane). When the first computer programs for the analysis of direction-

al data appeared (e.g. SPENCER & CLABAUGH, 1967; ADLER et al., 1968; BONYUM

& STEPHENS, 1971; ADLER, 1970), they did not only simplify the analysis of directional

data, but they added new 'degrees of f reedom' : to choose the size of the counting circle,

to use various weighting functions or projections of the sphere (Fig. 2.17), and the com-

puter allows to handle a ra ther large number of data. A question, which arose early

{KRAUSE, 1970), was, therefore , whether there exists an optimaI size of the counting

area with respec t to the number and to the distribution of data points on the sphere.

Alternat ively, new ' inf luence functions ' like an exponential decay function have been

introduced {BONYUM & STEVENS, 1971).

The problems associated with the smoothing process can be divided into more quan-

t i t a t ive and more qual i ta t ive ones. The variation of the influence area (either by chang-

ing the d iamete r of the 'counting c i rc le ' or by d i f fe rent 'weighting functions') a l ters

the total number of expec ted values at a grid point. The classical way to s tandardize

this number to a percen tage of all observed data points causes deformat ions of the

distribution in the way that the maxima are s t r e t c h e d - - the sum over all grid points

is g rea te r than 100%. The counted data need to be normalized into 'densi t ies per unit

area ' , or the area of influence has to be replaced by a weighting function for which

the integral over the area of influence equals one (BAYER, 1982), A more quali tat ive

aspect is that the smoothing process a f fec t s the variance of the distribution (GEBELEIN,

1951). This de fec t is mainly a function of the size of the area of influence. These prob-

lems are briefly discussed in the first section. However, while they are important in

a s ta t is t ica l sense, they are less significant for the geological in terpre ta t ion of orienta-

tion data. In geology only the position of ex t rema may play a role for the s t ructural

in terpreta t ion, and in this case the described deformat ions of the global distribution

do not a f fec t the local in terpreta t ion. Therefore , most of the following discussion will

focus on the question whether the local ex t rema are stably es t imated by the methods

current ly in use. In the final sect ion we will return to a more general problem and

analyze under which conditions we can suspect a density distribution at all.

2.3.1 The Smoothing Error in Two Dimensions

The es t imat ion of a density function from sca t t e r ed data on the sphere or its

project ion onto the plane involves a moving average. For one-dimensional histograms

the result ing errors and the deformat ion of the moments have been discussed in detail

by GEBELEIN (1951). Fig. 2.18 i l lustrates how a one-dimensional histogram is deformed

if a moving average is used. Two-dimensional data and data on the sphere behave in

the same way (Fig. 2.19), and what we will do here is to e s t ima te the error of the

smoothing process, i.e. the expec ted d i f fe rence between the true and the computed

density distribution. Technically this requires Taylor expansions and integrations, however,

31

1 f = ~Xf i J

/ '7= 3

I

f = Xf i

' 1

d

~ 2 . 1 8 : Smoothing a histogram by a moving average: a to c: normalized averages; d: not normalized histogram of a three point moving average.

the mathemat ic remains ra ther simple.

The way to es t imate the error is to compare the observed densities with a theoret ica l den-

sity function f(x,y) which is analytic (i.e. continuous and differentiable) with the values which

result from averaging over a small interval. The error is the d i f ference between the

true value of the density function and the average. In the plane we choose an interval

{Ax,~y) in the way that its c e n t e r - - t h e ar i thmet ic m e a n - - has coordinates (0,0). We

can do this for any interval by simply shifting the coordinate system. To find the mean

density within the interval we have to sum over all points within the interval and to

divide by the area of the interval, i.e. -Ay <=y __<A~2 2

f(O,O) = _I__ Y'fX; f(x,y)dxdy; AxAy -Ax ~X =<A~2 2 (2.23)

32

Fig. 2. t9: Smoothing a ' h i s togram' (a) on the sphere, b-d: increasing ' a rea of inf luence ' ( t r iangular weight ing funct ion -- see tex t for discussion).

We assume fur ther tha t the densi ty funct ion f(x,y) can locally be developed in a Taylor

series:

f(O+h,O+k) = f(O,O) + hf x + kfy + (h2fxx+2hkfxy+k2fyy)/2 + "'" (2.24)

If f(x,y) in equat ion (2.23)is replaced by the approximat ion (2.24}, we can eva lua te the

in tegral and find

2 ~.~4 2 (2.2a) ~ ( 0 , 0 ) = f ( o , o ) + -~{ f xx + fyy + "'"

whereby it is worth noting tha t all in tegrals over odd p o w e r s in equat ion (2.24) vanish,

the la rges t e r ror term, the re fore , depends on the local curva tu re of the densi ty funct ion

33

f(x,y) and is approximately

8f = f(O,O)-f(O,O) 2 2

= - { 2 ~ f x x + -~¼ f y y ) " (2.26)

The error te rms have a negative sign, i.e. maxima are lowered, minima are f i l l e d - -

depending on the local curvature and on the area of influence. Thus, if we have enough

information about the curvature of the density function, a good s t ra tegy would be to

use a small e lement for averaging where the curvature is strong, and a large one where

the function is flat, in order to combine a good approximation with fast computation.

In the empirical problem, however, this will not be possible.

If the normalization by the area of the interval {~x.~y) is ignored, then it makes

not even sense to speak about an error {cf. Fig. 2.17d), the averaging process then gener-

ates a totally new distribution as defined by equation (2.25). A safe s t ra tegy, which

easily can be used by the computer, is to transform the initial data first into percent -

ages (densities} and to normalize them again by the counting area a f te r averaging.

In the case of a circular counting area the error is of similar form. In this case

one has to evaluate the integral

1 f(O,O) .... x2 f ; f(x,y)dydx;

~ XG

or

-/(Ax2-x2)= < G < /(Ax2-x 2)

(2.27)

f(O,O) .... 1 2 ~r }~Irf(rcos@,rsin~)rdrd~)

0 -r

using again a Taylor approximation, the integral can be evaluated, and the error is of

order d 2

~f = f(O,O)-r(O,O) = ~ (fxx+fyy)

where d: the diameter of the 'counting area'. (2.28)

Even for averaging on the sphere the error is of the same magnitude. Averaging on

the sphere is easily evaluated with a computer if the data points are t rea ted as vectors .

The angular dis tance (a spherical cap) can be defined by the scalar product of vector

pairs. The computat ion of the smoothing error proceeds as before, however, it is slightly

more tricky to evaluate the integrals. We assume that the density function can be wri t ten

as f(x,y) = g(x,y,z}, because we are dealing with unit vectors, we have z = (x2+y2) 1/2.

Thus, we can again use the Taylor expansion (2.24}, and the mean density within the

area F (area e lement dr) can be wri t ten

34

Fig. 2.20: The ' a r ea of inf luence ' on the unit sphere a f t e r i ts c e n t e r has been ro t a t ed to coincide with the z-axis (see tex t for discussion),

t Z

F F F I f(O,O) = ~ {f(O,O)fdf + fxfhdf + fyfkdf

+ + + 2- (fxx h2df + 2fxy yy

Transforming the coordina te system to spherical coord ina tes yields the integral

(2.29)

i 2Y a

f(0,0) - 2~(l-cosa) { f(0,0~]o] sin~d~d~ +

+ fx ~ ~ cos~sin26d~d~ + fy~ ~ sin~sin2~d~d~

I + ~ [fxx ~ ~ cos2~sin3~d~d~

(2.30)

+ 2fxyl; cos~sin~sin3~d~d~

+ fyy] ~ sin2~sin36d6d@ ) + ...}

The coord ina te system has been chosen so tha t the cent ro id of the spherical ' count ing

cap ' coincides with the z-axis of the sphere. The angle ' a ' defines a cone with the

z-axis i ts cen t r a l line (Fig. 2.20} 7 whereby all in tegrals conta ining cos a and sin a vanish.

The in te r sec t ion of this cone with the sphere bounds the averaging area. The e s t ima ted

densi ty value is of magni tude

I f (O,O)2~(l-cosa) ~(0,0) =~(1 -c0sa )

1 c o s 3 a ) {2.31) + f x x l r ( - c o s a + ~-

2 + fyy~(~ - rosa + } cos3a)

or f(O,O) = f(O,O) + (fxx+fyy)(# o - u I cosa(l + rosa)).

35

The averaging error is approximately

1 1 ~T(0 ,O) = (Tj - g c o s a (1 + c o s a ) ) ( f x x + f y y ) ,

(2,32)

2.3.2 Stability of Local Extrema

The major problem of a geological analysis of directional data is not so much

the "stat is t ical stability", but it is the stabili ty of local extrema. However, in this context

it turns out that the classical approach causes problems, i.e. the smoothing process by

a s tep function over an influence area or by a rectangular weighting function. Let the

area of the rectangular weighting function increase until it reaches the surface area

of the half sphere, then the whole distribution becomes equalized, independent of the

original data pat tern. In other words, the ext rema are smeared out with increasing area

of influence. The instability of the local maxima becomes more obvious if one studies

sparse data s t ructures . If the area of influence, the counting circle, is varied on such

a data set (Fig. 2.21), then new maxima are generated whenever two areas of influence

I . . . . . . . . !

Fig. 2.21: Two stereographic projections for the same sparse data pa t tern but with d i f ferent sizes of the 'counting c i rc le ' or ' rectangular weighting function' (5% and 20% of the area of the half sphere). The location of maxima (dark areas) depends strongly on the size of the counting circle. The lower figures i l lustrate how the maxima arise over the intersect ion area of the rectangular weighting functions (Wcounting circles ') .

36

begin to overlap. Furthermore, the transit ion to such a new maximum is not a smooth

proeess~ but it is a 'sudden jump'. The smoothing process is, therefore , highly instable

with respect to the position of the local maxima and to small size changes of the in-

f luence area.

However, it is not only the size of the influence area which may cause the sudden

occurrence and disappearance of local maxima. The smoothing process is executed on

a finite net e i ther on the sphere or on a two-dimensional project ion of the sphere. There-

fore, any change of the grid s t ruc ture will locally change the overlapping pa t te rn of

the influence areas, and the identical instabil i t ies will arise. Because there exists no

'equally spaced grid ~ on the sphere, even a rigid rotat ion of the data may al ter the

distribution of maxima. Thus, the position of local maxima is not invariant against (even

small) changes of the grid s t ruc ture or of the size of the counting circle. Indeed, the

classical method has only two stable s ta tes with respect to local extrema:

a) the si tuation where the counting area is identical with the surface area

of the half sphere, in which case all ex t rema vanish, and there fore all

information is lost,

b) the case where the counting area goes to zero and the distribution resem-

bles a plot of the original data points. This case, however, does not summa-

rize the s t ructural information.

Now, one could go back to the old hand method and hope that all problems can

be solved by a ' s tandardized ' counting area and grid s t ruc ture because then the system

is forced to be without variations or irregulari t ies. However, this would only cover the

problem because any addition or subtract ion of a data point (or of a group of data

points} can change the local pa t te rn of ex t rema in the same way as discussed above.

The reason is that the method does not preserve a data point as the smallest possible

maximum, but that the maxima appear in the intersect ion areas of the counting circles

(Fig. 2.21). The equivalence between the grid and the data can be i l lustrated by two

s t ra tegies~which can be used for computer -a lgor i thms (Fig. 2.22):

1) The classical method t r ea t s a data point as a point and defines a grid on

the sphere (or its projection). To every grid point a counting circle is assigned,

and one has to find the number of data which fall within this circular area,

i.e. aIl points are summed up at a grid point which are within a dis tance limit.

2) Another s t ra tegy is to assign the counting circle to every data point. Then

one has to find all grid points which fall inside the distance limit (which surrounds

the data point} and to add the ( w e i g h t e d ) value of the data point to the

grid points which sat isfy the dis tance property.

37

Fig. 2.22: Two viewpoints of the smoothing process, a ) A data point belongs to a grid point with a ce r t a in probabili ty. Several grid points are in 'compe- t i t ion ' , b} One or several da ta points have a - c e r t a i n probabil i ty to be co r r ec t - ly l o c a t e d - - the spherical cap, which is associated with a data point, is an area of confidence. In the project ion the grid points have to be found which belong to the area of confidence.

Both s t ra teg ies are possible implementa t ions . In the first case, one usually has

to s tore all data in the cen t ra l memory while the grid points can be t r e a t ed one a f t e r

the other; in the second case, one stores the grid and can call one data point a f t e r

the o ther from some ex te rna l memory, tt is this second method which i l lus t ra tes tha t

the da ta points are not preserved as the smal les t possible maximum. Actually, they

are replaced by a rec tangular (density) function over a f ini te area. Variat ion of the

area of the count ing circle, thus, resembles the d is turbance of a signal {Fig. 2.1). The

count ing c i rc le defines the area in which the data point will be found with a ce r ta in

probabili ty, and this probabil i ty is everywhere equal (within the area of influence). The

rec tangu la r weighting function over the count ing circle, therefore , can be in t e rp re t ed

as a rec tangula r probabil i ty distr ibution. A consequence is tha t one can not e i the r assume

tha t ano ther (sparse) sample from the identical universe shows the same ex t rema.

It turnes out tha t the smoothing process by a r e c t a n g u l a r weighting function is

very sensi t ive and locally highly unstable with respect to the position of the ex t rema .

The s tr iking point is tha t o ther weighting functions give much b e t t e r resul ts with regard

to the considered s tabi l i ty problem. Such a family of non-s tandardized weighting funct ions,

which includes asymptot ica l ly the point plot and the rec tangula r weighting function, is

given by the family of polynomials (Fig. 2.23)

w(r) = (i - r)n; 0_- < r= < I

w(r) = 0 r>l (2.33)

r = R/Ro, n = 1,2,3 ..... 1/2, 1/3 ....

38

(l-x) n

Fig. 2.23: The family of functions y=(l-x) n in the interval 0 <x _-< 1 provides a possible family of weighting functions (not s tandardized). They include asymptot ical ly the point plot and the rectangular weighting function.

X

where Ro is the d iameter of the counting circle, and R is the distance of a grid point

from the data point. For every grid point one has to form the sum E w(r) over the

data points which are inside the dis tance R o. In the case of a rectangular weighting

function one has w=l within the counting circle and w=0 outside. For the point plot

one has ro=0, w=l at the data point, To normalize the es t imated distribution one has

to sum over all counts and then to divide the grid point values by this total sum,

Figs. 2.24 and 2.25 i l lustrate that the major maxima are well preserved, even for

large counting areas if one of these functions is used, especially if the exponent 'n '

is chosen small e n o u g h - - while for large 'n' the weighting function resembles the rec-

tangular weighting function. A similar result was found by BONYUM & STEVENS (1970).

They used an exponential decay function as a weighting function which was defined

over the ent i re grid area, i.e. every data point contr ibuted to every grid point. Their

computer studies showed that the resulting frequency distributions are as useful as those

genera ted by the classical ~hand ~ method. The point is that the stabil i ty of the locaI

maxima increases if the weighting function has a weII pronounced maximum at the cen te r

of the counting circle in local coordinates or at the data points in global coordinates.

As this maximum of the weighting function vanishes - - like in the case of a rectangular

weighting f u n c t i o n - - the es t imated ex t rema become instable in the sense that they

do not fur ther resemble the original data points. Fig. 2.26 i l lustrated how a triangular

weighting function preserves the local data s t ruc ture in contras t to the rectangular weight-

ing function of Fig. 2.21.

In summary, it seems worthwhile to discuss the two technical s t ra tegies given

above in more general terms. If the counting circle is associated with the grid points~

then we are working in local coordinates. The question is whether a data point belongs

with some probability to this grid point. In general, one assumes that this probability

39

Nr 1.8 stEP 1.6 NORM m x l

Fig. 2.24:_ Estimated frequency distributions on the sphere by use of various weighting functions: point plot (data); std: rectangular weighting function (classical counting circle);

sqrt: w=(l-r /ro)l /2; sqr: w=(1-r/ro)2; cub: w=r~r2(3-2r). The size of the counting circle is 1.8% of the surface area of the half sphere for all smoothed distributions.

0 ~ N r l ~

\

1 /

. . . . . 9 0

Fig. 2.25: Estimated frequency distribution on the sphere for counting circles of different size and for various weighting functions, Data and symbols like in Fig. 2.24. Counting areas 10% and 90% of the surface area of the half sphere,

40

- ,

Fig. 2.26: Tr iangular weight ing funct ions over a c i rcular area of inf luence preserve the da ta point as a local maximum. As the overlapping of the circles of inf luence increases , e i the r by increasing densi ty of data points or by a larger area of influence, the new maxima develop smoothly be tween the grid points.

dec reases cont inuously wi th the d i s tance from the grid point. The o the r possibility is

to assign the count ing c i rc le to the data point. In this case, we are working in global

coordinates , i.e. we have to find the grid points which fall inside the count ing circle.

The quest ion is now what is the probabi l i ty for the data point to fall on a specif ic

grid point -- and again the probabi l i ty should decrease with increasing d i s tance be tween

the da ta point and the grid point. In any case, the smoothing procedure focuses on the

"sampling error" , i.e. the problem tha t we have only a small sample from a large universe

of usually unknown s t ruc ture .

In a s t a t i s t i ca l sense, the weight ing funct ion should assure tha t a random sample:

which is t aken from the smoothed distr ibut ion, wilt not depar t too much from the original

da ta set , a t least if the e s t ima ted dis t r ibut ion is co r r ec t ed for the variance(GEBELEIN,

1951). MARSAL {1970) even t r ied to in t roduce a t e s t - s t a t i s t i c s on this argument . For

the r ec tangu la r weight ing funct ion this condit ion does obviously not hold, as Fig. 2.21

i l lus t ra tes . .

2.3.3 Approximat ion and Averaging of Da ta

Another useful aspect for the previous discussion can be developed from the

methods for local sur face f i t t ing. The local Shepard method (e.g. SCHUMAKER, 1976)

uses a weight ing funct ion for the approximat ion of surfaces:

w(r;R) =

i/r ; O< r~ R/3

27 r 2 R (~ - !) ; 3- <=r <R (2.34)

0 ;R< r

4]

and the local es t imat ion at any point (x,y) is given by

[ ~ Fi(w(ri;R))~ ; ri ~ 0

(w(r;R)) U f(x,y)

t F. ; r . = 0 ( 1 1

(2.35)

where the F i are the observed z-values at the data points (or frequencies); r i is the

dis tance between a grid point and a data point; la : is a parameter (a 'metr ic ' ) which

can be freely chosen; R: is the radius of the area of influence (the 'counting circle ') .

Formula (2.35) is defined at all points (x,y) in the plane. It interpolates the ob-

served values correc t ly at the data points (the values F. at r.=0) while the values at 1 1

non-data points are weighted averages of the data points which lie within a distance

R of the grid point. The weights are defined by equation {2.34}. The local Shepard

method provides an approximation method which is based on a 'counting circle ' as

discussed above. SCHUMAKER (1976) shows that the local Shepard method is an optimal

approximation s t ra tegy for the local surface reconstruct ion. The relatively complicated

definition of the weighting function ensures that the observed F.-values, i.e. the data 1

points, are preserved.

The local surface fit t ing method can be used to es t imate a density function if

the samples become ra ther large. For a large sample, with space coordinates measured

on a discrete scale (with fixed precision), one can expect that the resulting histogram

is a good es t imat ion of the s ta t is t ical universe. The local surface approximation (with

normalization over the area of influence R) gives a first order approximation of the

density function. As the sample size decreases, the histogram becomes noisy, and the

local surface approximation produces local ex t rema which are due to the sampling error.

A smoothing process is then required, and this procedure should be capable to el iminate

the sampling noise. On the other hand, the smoothing process should not deviate too

much from the local approximation. If one takes Shepard's method as a model for the

local approximation, then the transition to a smoothing procedure requires to drop 1/r

term for the interval 0 < r < R/3 in equation 2.34 because this term would cause infinite

values (if ri=0). The remaining parts are

r (g - 1) ; o _<-r_-<R

w(r) = (2.36)

0 ; R< r

This is the earl ier discussed polynomial weighting function (2.33), which together with

the first term in equation (2.35) provides a smoothing procedure. It is not hard to see

42

tha t the rec tangu la r weight ing funct ion is the most degenera ted case of the family

of weight ing funct ions defined by equat ion (2.36); it is approached as lJ ~ .

Another question is how to choose the p a r a m e t e r lJ , or which funct ion of the

family w{r; !J) is opt imal . In order to see what we can achieve by the p a r a m e t e r ]j

of equat ion (2.36) or by the p a r a m e t e r 'n ' of equat ion (2.33), one may discuss the most

general form of these weight ing functions:

w(x) = (I - x) ; x =<i (2.37)

x = r/R

Equation (2.3,7) takes the values w(r=0)=l at the data point and w(r=R)=0 at the boundary

of the a rea of inf luence, as required. The first de r iva t ive takes the values

w'(X) = -~/(I - x) ~-I with I/>0

with t l>0 (2.38)

w'(O) = -~

w'(1) = -I if ~=0 else w'(1) = O.

The p a r a m e t e r !J allows to adjust the slope of the weighting funct ion at the grid point.

However, to use this addit ional degree of freedom requires addit ional informat ion about

the proper v a l u e for the slope a t x=0. From a general viewpoint, the re fore , the s t ra igh t

line approximat ion (n=l) is nearly opt imal (DeBOOR, 1978). It does not require any

addit ional assumptions, and it solves the problem to connec t the da ta point with the

boundary of the a rea of inf luence by a cont inuous function.

However, a na tu ra l boundary condit ion for the weight ing funct ion could be tha t

the f i rs t de r iva t ives vanish at the data point and at the boundary of the count ing circle .

If one considers the weighting funct ion as a probabi l i ty dis t r ibut ion which assigns a

data point to a grid point , then the Gaussian dis t r ibut ion would, of course, be a model

with an inf in i te a rea of inf luence. In t e rms of polynomial weight ing funct ions this condi-

t ion cannot be sa t is f ied by equat ion {2.37), at least we need a cubic polynomial like

w(x) = (x 3 + ax 2 + bx + c). (2.39)

The boundary condi t ions w(0 )> 0, w(R) = 0, and w'(0)=w'(R)=0 are only sat isf ied by

the polynomial

3 3 2 i w(x) = x - ~'x + ~ (2.40)

or

w(x) = s(l + x2(2x - 3)).

43

--R-- R Fig. 2.27: Interpola t ion by spline functions: a) the l inear Euler spline; b) the cubic spline with vanishing der iva t ives at the grid point and the boundary of the area of inf luence (R).

It turns out tha t there is a l imited number of opt imal weighting functions depending

on the boundary condit ions (Fig. 2.27). Equation (2.39) allows a l t e rna t ive ly to adjust

the slope of the weighting function in an arb i t ra ry way at the grid point and at the

boundary of the area of influence. It is the cubic Hermi te in terpola t ion and can be

used for cubic spline in terpola t ion (De BOOR, 1978). The ear l ie r discussed t r iangular

weight ing funct ion is simply the l inear Euler spline.

The t inear and the cubic spline provide a simple in te rp re ta t ion of the radius of

inf luence (R). Assume the s i tuat ion of two data points with d is tance R: Without loss

of genera l i ty we can locate them on the real line and s i tua te one of them a t the origin,

the o ther at the point x=l {i.e. x=r/R). The weighting funct ions for the two points

are

w1(x ) = (I - x)

(2.41) w 2 ( x ) = (1 - (1 - x ) ) = x

for the l inear spline and

W l ( X ) = (1 + x 2 ( 2 x - 3 ) ;

w2(x) = (i + (l-x)2(2(l-x) - 3) = x2(3 - 2x) (2.42)

for the cubic spline. In both cases we find tha t Wl+W2=l within the interval (0,R).

The approximated values within this in terval are

f ( x ) = f ( 0 ) w 1 + f ( 1 ) w 2 (2.4a)

= f(O)(l - w2) + f(1) w 2

= f(O) + (f(1) - f(O))w2 ~

tha t is a s t ra ight line in the case of the l inear spline and a cubic funct ion in the second

case. However, if f{0}=f(R} (e.g. single measurement s at both points), then the connect ion

be tween the two data points is simply the hor izontal line f(0), cf. Fig. 2.27. The radius

44

of inf luence R, there fore , def ines a threshold of resolution: Data points with dis tance

less or equal R are subsumed in a single maximum while data points with dis tances

larger than R are preserved as dis t inct maxima. As far as it is possible to r e l a t e R

to the number of da ta (cf. KRAUSE, 1970), the spline funct ions provide opt imal approxi-

mat ion and smoothing capabi l i t ies .

It is ins t ruc t ive to inver t the above argument . If one defines the threshold proper ty

as the opt imal solution~ then one can discuss any polynomial weight ing function in t e rms

of the opt imal solution, i.e. we have to find coef f i c ien t s so t ha t w(x)+w{1-x) = cons tan t .

In the case of an a rb i t r a ry polynomial

w I = a 0 + alx + a2 x2 + ... + an xn

w 2 = a 0 + al(l-x ) + ... + an(l-x)n

(2.44)

the condi t ion can only be sa t is f ied i f a(-x) n = -ax n, tha t is, if the leading power te rm

is an odd number . The polynomials of equat ions (2.33) and (2.36), the re fore , are not

opt imal . Now, we may analyze the special case of the cubic weight ing function:

w I

w 2

3 bx 2 = ax + + cx + d

= a ( 1 - x ) 3 + b ( 1 - x ) 2 + c ( 1 - x ) + d

3 =-ax + (3a+b)x 2 - (3a+2b+c) +,(a+b+c+d).

(2.45)

To sa t i s fy the condit ion a l l t e rms which conta in powers of x have to vanish in the

sum Wl+W 2. This gives the following re la t ions be tween the coeff ic ients :

b + (3a+b) = 0 --~ 3a = -2b

c - (3a+2b+c) = 0 --~ 3a = -2b, (2.46)

and we have twice the same re la t ionship be tween ' a ' and 'b ' which, of course, appeared

a l ready in equat ion (2.39}. The Pa rame te r ' c ' is a rb i t r a ry -- if a=b=0 and c~0, the cubic

funct ion degenera tes to the l inear Euler spline.

The general solution for a cubic spline under the condi t ion w(x,R)+w(R-x,R)=constant

is lust the l inear combina t ion of the special cubic spline of equat ion (2.39) and the

l inear Euler spline. We can rewr i te equat ion {2.45) with the p a r a m e t e r s of equat ion

(2.46) as d d +cx} w ( x ) = { a ( x 3 3 x 2 ) + ~} + { ~ , (2.47)

45

i.e. as a l inear combinat ion of the spline functions. To see what happens at the special

points x=0 and x=R we take the der iva t ive of equat ion (2.47) at these points

w' = 3ax 2 - 3ax + c

w ' ( O ) = +- c (2.48}

w ' ( 1 ) = +- c

and find tha t the weighting function has identical slopes at the points under considera-

tion. Some weighting functions~ which sa t is fy the discussed condit ions, are given in Fig.

2.28~ and it turns out tha t these functions are no proper weight ing functions as they

Fig. 2.28: Symmet r ic cubic weighting funct ions which sa t is fy the condit ion w(x)+w(1-x)=constant.

have ex t r ema inside the 'count ing c i rc le ' and may even assume negat ive values. The

only remaining weighting funct ion is the cubic spline of equat ion (2.39), however, this

function tends to produce local ' p la t fo rms ' at the data points (cf. Fig. 2.27). Thus,

the l inear Euler spline remains the opt imal solution for a smoothing procedure.

2.3.4 A Topological Excursus

The previous discussion focussed on the problem to find a proper weighting funct ion

which takes values g rea t e r than zero inside a ce r t a in area and the value zero outside

this area. A similar problem occurs in topology and is solved there by Urysohn's

l e m m a - - for deta i ls see J~NICH (1980). Formal ly we can formula te our problem in

the following way (of. Fig. 2,29):

What we are looking for is a function f: x--* [0,1] on U ( V (~W which has

value 1 on U and value 0 on W. On V = W\U we are looking for a cont inuous

connection° The simple idea is (J~NICH, 1980) to cons t ruc t a s tep funct ion which

46

wbz,. ....... -J/" ,1

Fi~. 2.29: (a) The problem is to find a continuous connect ion be tween the areas U and W. (b) A possible solution is to const ruct a s tep function in V = W U. (c) However, if the boundaries of the subsets A,, A2, ... are Iocally in contac t , a further re f inement of the s tep function is not possible.

decreases from U to W (Fig. 2.29 B), i.e. one has to find a chain of sets

A = A I ( . . . ( A n ( W\U.

The s tep function takes the values

1 on U, 1 - i/n on A. and 0 on W. 1

By stepwise ref inement one can const ruct the continuous connection between

V and W. The only problem is that the boundaries of U i and Ui_ 1 do not meet

(Fig. 2.29 C). In this case, it would not be possible to const ruct a continuous

connection, i.e. we cannot insert an additional s tep between the boundaries

(JNNICH, t980).

If one applies this to the discussion of weighting function G it becomes immediately

clear that the rectangular weighting function is the ex t reme solution which does not

allow any fur ther re f inement between the areas V and W because U = V\W = 0. On

the o ther hand, the linear Euler spl int is a possible connection, even when V shrinks

to a point.

2.3.5 Densit ies, Folds and the Gauss Map

So far, the reconst ruct ion of density distr ibutions was discussed without regard

to the question whether there exis ts a density distribution for or ientat ion data. In the

case of current or iented obstacles in sedimentology and of lineations and cleavages in

47

tectonics , we can use standard arguments. One can expect that in these cases the orien-

tat ion is control led by a potential (BAYER, 1978). The systems stay at the minima of

the potentials , i.e. in the position of minimal drag in a current; c leavage will occur

in the direction of maximal shear s t ress (minimal normal stress) e tc . To arrive at a

probability distribution or a density distribution, one assumes that the objects are driven

out of the minimum position by random forces. One way to find the density distribution

is provided by the s ta t ionary solution of the Fokker-Planck equation (e.g. HAKEN, 1977}.

The situation is d i f ferent when orientat ion data are taken from surfaces, e.g. from

deformed bedding planes in tectonics . The orientat ion data are then the normals of the

surface, and therefore the surface needs to be regular, that is, there exists a tangent

plane at every point of the surface. If the surface is given as a map R2 ~ R 3, e.g.

as

X ( u , v ) = { x ( u , v ) , y ( u , v ) , z ( u , v ) } , (2.49)

then the unit normal vector at each point p of the surface is given by

N ( p ) = (X u A X v ) / ( I X u A X v l ) ;

(2.50) A is the vector product.

Now, in different ia l geometry the map N: S - - R 3 (S: the surface) is studied which

takes its values on the unit sphere

S 2 = { ( x , y , z ) R 3 [ x 2 + y 2 + z 2 = 1}. (2.51)

The map N:S ~ S 2 is called the Gauss map of the surface S (DoCARMO, 1976), and

this map is equivalent to the standard representa t ion of or ientat ion data on the unit

sphere. The Gauss map has various propert ies , which can be interest ing for geological

in terpre ta t ion of directional data. Detai led discussions are given by DoCARMO (1976)

and DANGELMAYR & ARMBRUSTER {1983).

The first point to be discussed is how a surface e lement ~S' maps onto the unit

sphere. If only local propert ies of a surface are considered, that is a small neighborhood

of a surface point, then typically three situations o c c u r - - the local surface s t ruc ture

is e i ther elliptic, parabolic or hyperbolic as i l lustrated in Fig. 2.30. With regard to the

positive normals of the surface one finds that the Gauss map preserves orientat ion at

an elliptic point and reverses it at a hyperbolic point (Fig. 2.30) - - at a parabolic point

a degenera ted situation arises, the normal vectors are all aligned in a plane which inter-

sec ts the sphere. In geological problems one has also to consider the inverse surfaces

{synclines). In this case, the closed pathways around the surface points in Fig. 2.30 are

inverted in the Gauss map.

48

e l l i p t i c

@@ p a r a b o l i c

r. , ~ , , ' ; : . ~ ,¢." ;..:...¢." ...

Fig. 2.30: The local s t r uc tu r e of a surface is e i the r of ell iptic, parabolic or hyperbolic type. The o r ien ta t ion of a closed pathway (surrounding the c r i t i - cal po in t ) i s preserved at an el l ipt ic point and reversed at hyperbolic and parabolic points (a: posi t ive normals, anticl ines) . For synclines (b) the en t i re pat- t e rn is reversed. Al te rna t ive ly , (a,b) can be in t e rp re t ed as project ions in the upper and lower h e m i s p h e r e - - both methods are used in geology. Grids: in- cl ined L a m b e r t ' s equal area project ion; adapted from HOSCHEK, 1969).

Af t e r these geomet r i ca l considera t ions we analyze how a surface e l e m e n t ' S ' maps

onto the unit sphere, and which value the area of the surface e l ement takes on the

unit sphere. If we choose a very small sur face e l ement dB, the expec ted densi ty of

normals on the sphere is proport ional to dB/dS (where dS is the corresponding surface

e l emen t on the sphere). The area of a small sur face e l emen t is

B = / . f l X u ^ x v I du d r , (2.52) R

and the image of B on the unit sphere has a rea

S = f f i N u A NvJ du d v . R (2.53)

(R is the area in the (u,v)-plane which corresponds to the surface e l ement B). *S* can

49

be expressed by

s = ; ; K lXu A Xvl du dv 12.541 R

where K is the Gaussian curvature of the surface. The relation between the surface

e lements B and their image on the unit sphere S is finally

l i m B/S = ( IX u A X v l ) / ( K J X u A X v l ) = 1 / K . {2.55) B + 0

For a detai led proof see DoCARMO (1966). The local density on the sphere is propor-

tional to l /K, or for a larger surface area we expect the density at a point p to be

I Xu A Xvl du dv (2.56)

that is the inverse 'weighted average ' of the Gaussian curvatures. There is one point

one has to take care of. The Gaussian curvature K can assume positive and negative

v a l u e s - - thus, it is necessary that the Gaussian curvature does not change sign on the

surface e lement under consideration. Otherwise we cross a point where K=0, and at

that point the density is not defined (equation 2.56).

However, if K=0 everywhere at the surface, one has to distinguish two cases. If

the surface is planar~ then there exists no density distribution; all normals are mapped

onto the identical point on the sphere. In the case of a parabolic surface e lement , e.g.

a cylindrical or conical one, we can define densities if the Gaussian curvature K is

replaced by the curvature k of the generating curve of the surface; the surface e lements

are replaced by arc length. The concept of a density distribution, therefore , is ra ther

complex for or ientat ion data from surfaces. One has to distinguish various cases, and

one can handle only finite surface e lements that sat isfy cer tain conditions.

Anyway, it is as usually, the most interest ing situations are those which cause

trouble. In this case, interest ing situations arise when the surface contains points at

which the Gaussian curvature changes sign. Such situations arise in the most simple

cases, e.g. if the surface is a sinusoidal cylinder - - in this case, the Gaussian curvature

K changes sign at the inflection points of the generating sine function. Somewhat more

complicated surfaces are given in Fig. 2.31. The lines of parabolic points, which divide

the surfaces, appear also on the Gauss map (stereographic projection) where they bound

the area of normal vectors . The density of normals on the sphere is especially high

at (or near) the boundary of parabolic points which define a fold line of the Gauss map

(cf. Fig. 2.2). The concept of folded maps applies also to the s tereographic projection

in the vicinity of parabolic points on a surface. The high ( theoret ical ly infinite) density

50

Fig. 2.31: Complex "sinusoidal" s u r f a c e s - - the sets of parabol ic points on the sur faces map onto ' caus t i c lines' {with high densit ies) bn the unit sphere. These ~caustics ~ are typical ~eatastrophe s ingular i t ies ~.

of normals at these s ingular i t ies has i ts analogy in caust ics , and an in te res t ing appl ica t ion

to phonon focusing {cf. DANGELMAYER & ARMBRUSTER, 1983, for a thorough theo re t i -

cal discussion). These pa t t e rn s likely apply to the focal mechanism da ta in seismology

with minor modif icat ions . Fur ther , such s ingular i t ies occur at leas t in the theore t i ca l

s imulat ion of p las t ic deformat ions {Fig. 2.32) as they were studied by LISTER e t al.

{1977). The singulari t ies , which bound the dis t r ibut ion of normals are closely re la ted

to the s ingular i t ies s tudied in ca t a s t rophe theory (DANGELMAYR & ARMBRUSTER,

t983), and some of them will be analyzed in more deta i l in the 4 th chapte r . In geology

a common problem is to find the ' fo ld-axis ' from a set of t angen t planes. The l inear

approach is to find a "regression c i rc le" under the assumption tha t the t ec ton ica l folds

are cyl indric {or conic). The approach by "caust ic lines" probably allows to ga the r addi-

t ional in format ion as soon as the surfaces are more compl ica ted .

Fig. 2.32: c-axis pole f igures for model qua r t z i t e under plane s t ra in {modified from LISTER e t al. 1978).

51

2.4 RECONSTRUCTION OF SURFACES FROM SCATTERED DATA

Contouring by compute r became a widely used method in geology (e.g. DAVIS &

McCULLAGH, 1975; KRUMBEIN & GRAYBILL, 1965; HARBAUGH et al., 1977; FREE-

MAN & PILRAU, 1980}. The use of computers , however, causes special problems which

do e i the r not occur within the classical hand method or are simply not recognized when

contours are in te rpola ted from a hand made t r iangula t ion net . Fig. 2.3a i l lus t ra tes with

a ' ca tas t rophic ' example how di f fe rent compute r resul ts may be dependent on the es t ima-

t ion method. Fig. 2.34 gives a much ear l ie r hand made es t imat ion for the identical da ta

set (BAYER, 1975).

The t r iangula t ion method (cf. CAVENDISH, 1974) is, in general , not used for

compute r procedures because it needs too much ' in tu i t ion ' to c r e a t e a t r iangula t ion

net from s c a t t e r e d data points. On the o ther hand, it is re la t ive ly simple to in te rpo la te

and draw contours from a regular grid {e.g. SCHUMAKER, 1976). Therefore , contouring

procedures for computers use mostly a two pass process:

in a f irst step, the surface data are e s t ima ted for the points of a regular

grid,

** in the second step, the contour lines are in te rpola ted from the regular grid.

The final computa t ion of the contour lines is a r a the r s table process. Useful methods

are well known from f ini te e l ement s (ZIENKIWFI'Z, 1975) and from spline interpolat ion.

Even more complex methods like global or semilocal surface f i t t ing can be used to de-

rive contour lines from regular grids {SCHUMAKER, 1976). Anyway, on the level of

the local grid cell the es t imat ion of contour lines is not uniquely de te rmined , an aspect

which wilt be discussed in some detail .

The major problems, however, derive during the es t imat ion of the grid values,

and this will be the major theme of the following sections. The cen t ra l example is an

es t imat ion method by minimal convex polygons, which is capable to i l lus t ra te the prob-

lems arising during the es t imat ion of the grid values.

2.4.1 The Regular Grid

The main problem with s ca t t e r ed data is to eva lua te the surface values for the

regular grid. In geology, in par t icular , this process can be compl ica ted because the data

commonly show a natural ordering along outcrop lines such as r iver beds, t ec ton ic zones

e tc . Therefore , very sophis t ica ted methods have been developed for the gridding process.

For ins tance , weight ing funct ions are used tha t include di rect ional searching of data

points; also r a the r compl ica ted s t a t i s t i ca l methods have been developed for the prior

52

E

\ C !

Fig. 2~Y3: 'Catastrophic' contour maps from a data set with strongly fluctuating surface values, a,b: regular grids of different roughness; minimal polygon method (see text). c,d: the same estimations on an irregular spaced rectangular grid, every data point is located on a grid point, e: grid point estimation by Shepard's weighting function (see section 2.3.3). f: point distribution and bounding polygon.

Ca "H" und +~'

l,,ig"H'_~j b e:t,- schu~

r~val

2®°

Fig. 2.34: A ~hand' estimation of contour ' ~ ' ~ / ~ ' ~ lines for the data set of Fig. 2.33. I~ / ' < ~

53

i ........ l

Fig. 2.35: Three possible solutions of isolines for the ident ical da ta set° a} Approximat ion by Shepard 's method; b} minimal polygons; c) minimal polygons when the grid is genera ted from the data points. Grid points are marked. Modified from ALTHEIMER et al., 1982.

and poster ior analysis of the grid values (JOURNEL & HUIJBREYTS, 1978; HARBAUGH

et al., 1977; HUIJBREYTS, 1975).

In principle, the gridding technique is a map from one f ini te point set onto another

f ini te point set , whereby no one to one correspondence exists. There may be more grid

points than data points, or there could be less grid points than data points. This re la t ion-

ship may also change locally within the global gridding s t ruc ture , as i l lus t ra ted in Fig.

2.35 in t e rms of the grid points. Therefore, one may expect s tabi l i ty problems to occur,

i.e. t ha t the map becomes locally folded. Fig. 2.33 shows a ' ca t a s t roph ic v resul t of com-

puter contouring, due to the special, very inhomogeneous data configurat ion, the grid

s t ruc tu re and the gridding methods. This conf igurat ion was chosen to i l lus t ra te the prob-

lems tha t arise from computer contouring. As far as s tabi l i ty problems are involved,

they will be discussed below.

A minor problem in gridding is the special s t ruc tu re of the regular grid (Fig. 2.35).

In general , the grid (or net} is chosen in such a way tha t the d is tances be tween the

net points are constant ; at least they are cons tan t for every coordinate direct ion. There-

fore, the resul t ing contours will depend on the init ial decision about the grid s t ruc tu re

(Fig. 2.35}. If the grid is too rough, data points will be lost, or they are not cor rec t ly

recorded. On the o ther hand, if the grid is fine enough to provide locally the required

54

Fig. 2.36: Two isoline representa t ions of foraminiferal diversity (entropy) in Todos Santos Bay, California. a) Shepardts method; b) minimal polygons. Adapted from ALTHEIMER et al., 1982; data from WALTON, 1955.

resolution, the net may be too fine in other areas where the data points are sparsely

sca t t e red (cf. Fig. 2.35 c). For a computer program the la t te r case may be more impor-

tant than lost data points because a very fine grid can cause an extensive need of

memory and unreasonably long computat ion times.

Fur thermore, the original data points may not be recorded correct ly, even on a

very fine grid if they have in termedia te position between the grid points. This can be-

come important if the es t imat ion procedure has a smoothing e f f ec t like it is the case

with Shepardts local method (Fig. 2.36 a, cf. sect ion 2.3.3). This problem can be solved,

to some ex ten t , if it is possible to choose the grid points in such a way that every

data point becomes a grid point and that the grid is still a rectangular one {Fig. 2.35

c; ALTHEIMER et al., 1982). A really satisfying solution of this problem would require

that the grid is formed over the data points. The classical way to do this is the triangu-

lation method, but a good triangulation net is hard to establish within the computer

{SCHUMAKER, 1976; CAVENDISH, 1974).

2.4.2 Global and Local Extrapolat ions

Another problem is to bound the es t imated surface values to a reasonable area

in order to avoid extrapolat ions into areas where no data points are available. Most

55

gridding techniques allow such extrapolations, and most computer programs in use do

not tes t this simple problem. As can easily be derived from the classical tr iangulation

method (ALTHEIMER et al., 1982), the most extensive boundary giving reasonable est i -

mates is the convex polygon that surrounds all data points, though it does not ensure

a reasonable solution, as the Shepard es t imat ion in Fig. 2.33 il lustrates. Somet imes

a be t t e r res t r ic t ion of the solution space can be forced by additional const ra ints like

a limit dis tance between data points and grid points. But such restr ic t ions a f fec t all

es t imat ions, not only the boundary -- and a typical result are 'holes ' within the working

area.

There are several possible s t ra tegies to establish a convex boundary for the data

points (cf. ALTHEIMER et al., 1982). However, we shall see in the next section that

a very simple method results from 'minimal polygons', which can be used to el iminate

the interior points of the bounding polygon.

Indeed, the problem to find a reasonable global boundary for the working area

has its local equivalent. One has to ensure that a grid point, for which a surface value

is evaluated, is located within a closed polygon of data points and that these points

are used to e s t ima te the surface value. Otherwise, local extrapolat ions may occur, which

cause maxima or minima, which are not represented in the data or 'p la t forms ' result

if the local es t imat ion uses weighting functions.

The sectorial search method is an approach to solve this problem (HARBAUGH

et al., 1977)o The problem, of course, does not occur with the classical triangulation

method because, in this special case, the whole area in question is densely covered

by triangles or by convex polygons with the data points at the corners. Again, most

of the gridding techniques in use do not necessari ly tes t these conditions, especially

the weighting average methods do not. For the usually complex geological data the

(1

•

/ / '/

/ b

• / / /

/ ' / / ~

\

Fig. 2.37: a) the oc tant search method. A data point must be found in every octant for the es t imat ion of a grid point value. This avoids local extrapolations, b) Surface reconstruct ion by an octand search method genera tes sinusoidal contours even from a simple cylindrical surface.

56

problem has been well recognized (HARBAUGH et al., 1977), and s t ra teg ies like quadrant

and oc t an t search pa t t e rns have been developed to overcome the problem (Fig. 2.37).

These methods require t ha t for every grid point a se t of da ta points must be found

in a ce r t a in number of radial sec tors (Fig. 2.37). Although the procedure secures tha t

the grid point is surrounded by a - - n o t necessar i ly c o n v e x - - polygon of data points,

it may produce o ther defec t s . A local gap will be produced if one sec tor is empty,

even if t he re exis ts a convex polygon which incIudes the grid point and which would

secure a useful local es t imat ion . From this viewpoint, the method is not f lexible enough

because it requires locally a specif ic data conf igurat ion. On the o ther hand, the process

tends to i r regular contours because the sur face values e s t ima ted from the moving sec to r

system do not change smoothly, but they change r a the r suddenly when a data point

en te r s or leaves the system (Fig. 2.37), at least , if the required number of data per

sec to r is small . Conversely, a higher number of required da ta causes an increasing

number of gaps within the solution area.

2.4°3 Linear In terpola t ion by Minimal Convex Polygons

The discussed problems caused us in 1980 to develop in the 'Sonderforschungsbere ich

53, PalOkologie' (ALTHEIMER et al., 1982) a gridding technique tha t sa t i s f ies all the

so far discussed condi t ions in a very simple w a y - - the grid values are e s t ima ted from

the minimal bounding polygon of a grid point, by a t r iangle. The method works r a t h e r

well up to the point tha t somet imes s t range local ex t r ema appear in areas where no

da ta points are available, i.e. the ex t r ema cannot be explained by the da ta s t ruc ture .

A l a t e r s tab i l i ty analysis provided some remarkab le resul t s which also holds for o the r

gridding techniques, especial ly for the discussed sec to r search methods and for the

classical t r iangula t ion method. It will turn out tha t the problems are mainly geomet r i ca l

ones and tha t our gridding technique is not a real ly good way to e s t ima te surface points,

but t ha t it i l luminates very c lear ly the problems of contour ing from s c a t t e r e d data .

As was pointed out above, the locally s tab le e s t ima t ion of grid point values requires

tha t the grid point is loca ted within a polygon spanned by a subset of the original da ta

points. For a plane mapping problem the smal les t possible polygon is a t r iangle with

da ta points at i ts corners . As is well known from polygon theory and l inear opt imiza t ion

(COLLATZ & WETTERLING, 1971), this minimal polygon has the proper ty t ha t the sur face

values can be simply e s t ima ted by a l inear in terpola t ion for any point within the t r iangle

and along its boundaries (cf. ZIENKIEWlTZ, 1975; SCHUMAKER, 1976). Geomet r ica l ly

the th ree points es tabl ish a plane sur face e lement . Fur the rmore , the principle of ' n ea r e s t

neighborhood' can be easi ly sat isf ied. 'Neares t neighborhood' means tha t a grid point

value should be e s t i m a t e d from closest da ta points (as far as this does not cause local

57

extrapolation). If there exists a polygon at all that encloses the grid point in question,

then there exists a convex polygon, especially a triangle, which both includes the grid

point and has the point of nearest neighborhood at one corner. The proof of this s t a t e -

ment can he outlined in the following way:

If the point of neares t neighborhood is connected with every point on the convex

boundary of a l l data points, then the whole area inside the global convex boundary

is densely covered by non-overlapping triangles, and the grid point must be e i ther

an interior point or a boundary point of one of these triangles.

This secures that we find a bounding triangle. The minimal triangle without an interior

point is found with the following strategy:

Start at the point of nearest neighborhood and find the second and third nearest

points which form, together with the point of nearest neighborhood, a bounding

tr iangle for the grid point.

We can call this triangle the minimal convex polygon -- minimal because these are points

of minimal dis tance with respect to the convexity condition. One should expect that

such a triangle is a locally optimal form for the approximation of the grid point value.

The approximation turns into a local interpolation. In addition, the local propert ies of

the tr iangles project onto the global problem to find the bounding polygon of all data

points. The local triangulations are bounded to the interior of the convex polygon bound-

ary. This gridding technique has, therefore , the additional advantage that it can be

implemented in a very compressed way on a computer (ALTHEIMER et al., 1982) and

that it provides automatical ly control over the boundaries of the working area. On the

other hand, one can use the method to el iminate all points inside the global convex

boundary of the data set. The points to be el iminated are simply those for which a

minimal polygon exists which has data points on its c o r n e r s - - or the global convex

polygon consists of the data points which are not interior points of a triangle with data

points at its corners.

2.4.4 Stabil i ty Problems with Minimal Convex Polygons

Having done all these analyses of local and global properties, it was surprising

for us that the implemented process became unstable with cer ta in data configurations

(cf. Fig. 2.33). The main anomalies are local ex t rema which are not justified by the

data. They occur mostly within relat ively large areas without data points, and they are

especially strong if the surface values of the nearby data points are very irregular.

/ / /

Fig. 2.38,: The subdivision of a r ec tang le by the local t r iangula t ion method {minimal polygons). The blank areas belong to the same t r iangula t ion (as the shadowed do).

58

In addition, the anomal ies are very sens i t ive to small changes of the grid s t ruc ture .

Some of these pa t t e rn s resemble very closely the problems which may ar ise from sec to-

rial search methods, To explain these anomal ies it is necessary to study the local s t ruc-

ture of the polygons, and to see how t r iangles may come into compe t i t ion during the

gridding process,

Four points form the minimal polygon, which can be fur ther divided into t r iangles.

For any grid point located within this polygon two possible t r iangula t ions exist {Fig.

2.a8). Which t r iangle will be chosen, depends on the d is tance funct ions be tween the grid

points and the polygon c o r n e r s - - i.e. on the nea res t neighborhood rule. The minimizing

condit ion for the d i s tance funct ion divides the polygon into four regions (Fig. 2.38).

The opposite r ec tang les belong, thereby, to the same t r iangula t ion, and, there fore , have

s tab le and near ly smooth solutions in the i r inter ior . But the two possible t r iangula t ions

give d i f fe ren t solutions which are not connec ted in a smooth way (Fig. 2,39). If the

grid point moves over the t r iangula t ion boundaries, then the solution jumps from one

in te rpola t ion sur face onto the a l t e rna t ive solution. The instabi l i t ies , the re fore , occur

Fig. 2,39: A ruled sur face over a rec tangu la r grid e l emen t and its approximation by the two possible t r iangula t ions which yield d i f fe ren t solutions.

59

. . . . . . . . . . [ . ! " ~:~. ~,

-" .-" 5 , : : . . . . . . . : "

Fig. 2.40: The bifurcation surface which corresponds to the two possible triangulations of Fig. 2,a9 (cf. Fig. 2.38). The instability is independent of the density of the grid, it results only from the local triangulation method. On a suff icient ly fine grid the contours re f lec t the bifurcation of the surface (b).

in two ways:

1) Closely related grid points may deviate strongly in the es t imated surface values

because they belong to di f ferent local triangulations.

2) Even small changes in the s t ructure of the regular grid can locally cause ra ther

dramatic changes in the es t imated surface v a l u e s - - e.g. for two independent con-

touring processes over the identical data set.

These e f f ec t s are especially strong within relat ively large areas without data points,

i.e. whenever the approximation involves data points which are far apart.

Fig. 2.40 i l lustrates in more detail how the local solution over a rectangle bifur-

ca tes into two disconnected surfaces with discontinuous contours. From which interpola-

tion surface the es t imated value will be taken, depends, as discussed, only on the position

of the grid point(s). The situation becomes even worse if one considers higher polygons,

i.e, a larger number of data points in competi t ion. The possible number of local triangula-

tions increases rapidly with the number of polygon corners {F{g. 2.41). The approximation

process will become more and more instable as the number of data points in compet i t ion

i n c r e a s e s - - a situation favored by large empty areas within the data space. A curious

situation is that in such cases a ref inement of the grid increases the instability in the

way that the resulting surface approximates the local discontinuities of the triangulation

ra ther than a continuous surface (cf. Fig. 2.40 h). The situation is nearly the same with

the octant search method where the triangles are replaced by open angular sectors.

60

Z

~

Fig. 2.41: The various possibilities for the tr iangulation and linear surface approximation of a f ive-point polygon (1 to 5: the numbers z=l ... indicate the height of the corner points; r: the cen te red i n t e r p o l a t i o n - - the central point is the a r i thmet ic mean of the corner points.

One could argue that the discussed instabil i t ies are only problems of the local

approximation method. But if we t ransfer the results to the classical tr iangulation meth-

od, which covers the data space densely with triangles, then it i s not hard to see that

we have, in principle, the same problems. Let several people draw a map from the iden-

tical data set with a f ree choice of triangulation, then the results can diverge to a large

ex ten t . The d i f ferent solutions for a f ive-point polygon in Fig. 2.41 can be taken as

an example. What we find is that the instabil i t ies appear now exclusively between di f fer -

ent maps.

We can go a s tep further . The same problems encounter us again when we draw

contour lines from the e s t ima ted regular grid. A simple way to do this is again a triangu-

lation of the g r i d - the contour lines are uniquely de termined on a triangle. However,

there are several possibilities for this triangulation, some of them are i l lustrated in

Fig. 2.42. Every possible triangulation gives another solution, and we can only hope that

these soIutions deviate not too widely because the surface is simple enough. Besides this

triangulation method other s t ra teg ies are in use which draw the contour lines direct ly

from the rectangles , e.g. such a s t ra tegy is used in the program package SURFACE

II (SAMPSON, 1975). Within this s t ra tegy problems arise when two opposite corners of

a cell are higher and the two others are lower than the value for the contour line en te r -

61

/ / / / / / / / / / / / / / / / o

\ \ \ \ \ \ \ \ o o \ \ \ \ \ \ \ \

\ / \ / \ / \ / \ / ~ / \ /

\ / \ / ~ o / \ / \

N/'•/ /\/\

Fig. 2.42: Four a l ternat ive triangulations of a regular grid,

ing the cell. Fig, 2.43 il lustrates this situation, and it becomes immediately clear that

the cases (b) and (c) are just the two a l ternat ive triangulations of the grid cell, the

decision problem is the same as discussed earlier. Case (a) is slightly different , it re-

presents a cen te red grid e lement where the centra l value could have been es t imated

as the a r i thmet ic mean of the corners (cf. Fig. 2.41). This approximation will be dis-

+ - - 4 - - 4 -

/ /

a b + - + -

,> "> c

4-

Fig. 2.43: Possible paths of contour lines through a grid c e l l - - (+) corners higher and (-) corners lower than average of corner values. Modified from SAMPSON (1975).

cussed in the next section. In the SURFACE II program a decision is made be tween

the solutions (b) and (c) in Fig. 2.43: (b) is chosen if the average of corner values is

higher than the entering contour line while (c) is chosen if the average is lower than

the value of the contour line. This choice is arbitrary, however, it ensures that contour

lines do not in tersec t within the grid e lement {Fig. 2.43 a) -- this switch causes a jump

from the lower to the upper surfaces in Fig. 2.40 when the average height of the corner

points is passed,

62

2.4.5 Continuation of a Local Approximation

It turned out that the method of minimal polygons or of a lbcal or global triangu-

lation is instable with respec t to small changes of the initial conditions. In the case

of the 'hand method ' , the initial condition is the choice of the triangulation, in the

' computer method ' , it is the choice of the grid. The same problem extends to o ther

local gridding techniques, to the seetorial search methods and even to the approximations

by weighting functions. They all are very sensit ive to small changes of the initial condi-

tions and to changes of the pa ramete r set t ing. A major problem arises if there are

large areas without data points. In this case, the interpolat ion process can be somewhat

stabil ized if one does not use the minimal convex polygons but t r ies to find the locally

maximal convex polygon. However, compet i t ion be tween polygons can be only avoided

if the en t i re interior of a locally bounded polygon is t r ea ted as a local continuum,

and if all grid points inside the polygon are es t imated from its corner points by some

smooth process. The compet i t ion during the formation of local polygons can be avoided

if the local solution projects continuously into the neighborhood, i.e. no overlapping

polygons are allowed until they have the same solution inside the in tersect ing areas

and on the common boundaries. The problem has a formal analogy in the analytic cont in-

uation of a function in the complex plane. This analogy suggests that one could s tar t

from a local solution, a local contour line, and then const ruct its continuation through

the data space by use of some convergence cr i ter ia . The convergence circle of the

analytic problem could thereby be replaced by convex polygons over the finite data

set.

The previous remarks lead to a geometr ica l problem, which is hard to solve in

the case of randomly sca t t e r ed data. Nevertheless , it seems useful to discuss finally

how the linear interpolat ion over tr iangles can be general ized for any convex polygon

and how a local solution over a regular grid e lement can be extended throughout the

global data space,

A) A Local Continuous Approximation

In the case of a rectangle , the simplest approach toward a stable continuous surface

approximation is to cons t ruc t a bilinear function over the corner points (SCHUMAKER,

1976)

f(x,y) = a I + a2x + a3Y + a4xY ' (2.57)

The corner values of the grid e lement have to be used to de te rmine the coef f ic ien ts .

Now, any rec tangle can be s tandardized to a square of unit area by the map

68

/" C_-- - - -

/ . ._-- - - - -"< / x

Fig. 2.44: Surface interpolat ion over a rectangle by use of a bilinear function (above); for details see text. The bilinear interpolat ion can be approximated by a linear interpolat ion if an additional centra l point is used, which can be computed as the a r i thmet ic mean of the corner points.

x ---- (Xi+ I - x)/(Xi+ 1 - Xi)

Y ---- (Yi+l - Y)/(Yi+I - Yi )'

where X. and Y. are the coordinates points, i i

(2.58)

of the corner

64

Besides s tandard iza t ion , the map (2.58) t rans forms the global grid coordina tes into local

oneS. Using the new local coordinates , the l inear in terpola t ion along the boundaries

of the r ec tang le can be expressed as

f(x) = w2F(x=O ) + WlF(X=l), (2.59)

where the F-values are the surface height at the corner points and the w i are weight ing

functions: Wl=X, w2=l-x , x in local coordina tes {for the y d i rec t ion x has to be replaced

by y).

The approach by a bi l inear funct ion implies to cons t ruc t a two-dimensional weight-

ing funct ion from the product w(x,y)=w(x)w(y) (e.g. PFALTZ, 1975; DeBOOR, 1978).

tf the weight ing funct ions for the boundaries are inser ted, one finds

and

w1(x=O,y=O) = (1-x)(1-y)

w2(x=O,y=l ) = (l-x)y

W3(x=l,y=l ) = xy

w4(x=l,y=O) = x(l-y)

w(x,y)=xy,

= w(l-x,l-y)

= w(l-x, y )

= w( x ,l-y)

= w( x ,i-y)

(2.60)

a very s imple pa t t e r n of pe rmuta t ions of the coordinates , which easily can be pro--

g r amme& It is easy to prove t ha t ~ wi=l, and t ha t z(x,y)= ~ Fiwi(x,y) is just the

ear l ie r no t iced bi l inear funct ion which provides a cont inuous sur face approximat ion over

the grid e l ement . The weight ing funct ions have the proper ty tha t

Fiwi(0.5,0.5 ) = ~Fi/4 , (2.61)

i.e. t he re exis ts one point on the sur face which is simply the a r i t h m e t i c mean of the

corner points. This observat ion allows a f irst order approximat ion of the bi l inear surface

over a r ec t ang le by a s imple t r iangula t ion. If one adds the cent ro id of the corner points

to the data points, then t h e r e exis ts locally a unique t r iangula t ion of the grid e l emen t

which is given by the connec t ions of the cen t r a l point with the corner points. The

sur face e s t i m a t e d from this t r iangula t ion is a l inear approximat ion of the sur face , which

was def ined by the bi l inear equat ion (2.57). Figs. 2.41 and 2.44 provide examples for

this approximat ion. It is easy to see t ha t a unique t r iangula t ion and, there fore , a unique

local sur face approximat ion can be cons t ruc ted for any convex polygon with n corners .

The addi t ional cen t r a l point is given by

(Xc,YcZc) = (l/n) [ (Xi,Yi,Zi). (2,62)

i t may be useful to in t roduce a meaning for this in terpola t ion scheme. The bi t inear

model is a harmonic function, and this allows a physical in te rp re ta t ion . If a shee t of

65

rubber is s t r e t ched over the rec tangula r boundary, the resul t ing surface equals the sur-

face described by the bi l inear equation. For the general ized convex polygon with n

corners one can cons t ruc t such a surface in the following way (BETZ, t948): The convex

polygon is mapped onto the unit c i rc le by means of the Schwarz-Chr is tophel formula.

The boundary values are then evaluated in t e rms of a Fourier series. The required

harmonic funct ion on the unit c i rc le is finally expressed by the equat ion

f(r,~)=ao/2+ Z rn(anC°S(n~) + bnsin(nq0)), and a first approximation on the original polygon

is given again by the cen te red t r iangulat ion.

Fig. 2.45: A ten t s t ruc tu re provides an example of a continuous surface with discont inui t ies at the poles.

B) Cont inuat ion of a Local Surface Approximation

The cen te red grid e lement , as defined above, leads in a r a the r na tura l way to

a cont inuous solution over the global regular grid s t ruc ture . If we add the computed

cent ra l grid points to the grid, we have simply a r e f inemen t of the grid, and we can

repea t this process infinitely. In the second i tera t ion, addit ional grid points and values

are computed at the boundaries be tween the original grid e l ement s and provide a con-

tinuous approximation be tween grid e lements . In general terms, a regular approximat ion

within grid e l emen t s occurs at every odd in terpola t ion s tep while at even s teps over lap-

ping grid e l ement s are eontinuous!y connected. What we find, is a surface which every-

where sa t is f ies the Laplace equat ion

= O, (2.63) Uxx + Uyy

a surface, which is everywhere smooth, only at the grid points local discont inui t ies

A

\/\/

I •

66

Fig. 2.46: I t e ra t ive r e f inemen t of the grid s t ruc tu r e by recurs ive averaging. Only a single pathway is i l lus t ra ted , which asymptot ica l ly approaches a corner of the cen t r a l grid e lement . Any o the r point, the original grid points and the averaged ones, cause s imilar cascades: The original cen t ro id grid e l emen t is subdivided into smal ler and smal ler r ec tang les which, in the l imit, cover the area densely, however wi thout being cont inuous in a d i f fe ren t i ab le sense. Left : a regular or thogonal grid, r ight: a cen te red regular grid which actual ly consis ts of two overlapping grids as indicated.

appear (Fig. 2.45). The re la t ion be tween the recurs ive averaging process and the Laplace

equa t ion can easily be shown if the Laptace equat ion is approximated on a f ini te grid.

In t e rms of a f ini te grid, equat ion (2.6a) reads

(Ui_l,j_2Ui,j+Ui+l,j) + (Ui,j_l-2Ui,j+Ui,j+l) = 0 (2.64)

providing a f ini te approximat ion, which can be r ewr i t t en as

Ui,j= (I/4)((Ui_l,j+Ui+l,j) + (Ui,j_l+Ui,j+ 1 ),

and this is simply the average discussed above. Thus, our cont inuat ion

analogue to the analy t ic cont inua t ion in the complex plane.

(2.65)

process is a f ini te

The cont inua t ion process by recurs ive averaging is, in addition, opt imal in the

sense tha t it is s tab le under small d i s turbances of the grid p a t t e rn and is opt imal in

t e rms of computa t ion costs, To see, why the l a t t e r remark holds, let change the view-

point again to a single grid e lement . We want to find a local approximat ion which

is continuously connec ted with the neighborhood. To find such a local approximat ion

67

we need in to ta l i ty 16 grid points like in Fig. 2.46a, and this e lementary grid allows

re f inement to any level within the central grid e lement . An al ternat ive would be to

use initially a cen te red grid (Fig. 2.46b) which, of course, provides an initial triangula-

tion. However, if we request a continuous connection with neighboring elements , we

need 21 grid points. The increased number of necessary grid points can be related to

the fact that the cen te red grid is not unique, i.e. that there exist two a l ternat ive grid

s t ructures as indicated in Fig. 2.46b. A continuous solution requires that these a l ternat ive

grids are superimposed, and this causes the higher number of required grid points.

However, the continuation problem can be solved in a quite d i f ferent way: We

can request that the local surface e lement has continuous derivat ives along its bounda-

ries and, thus, can continuously be connected with the neighboring elements . Such an

approximation requires at least cubic splines, and first we consider the case that the

first derivat ive vanishes along the boundaries of the grid e lement . A useful approxima-

tion is given by the weighting function

and w(y) = y2(3-2y).

w ( x , y ) = w ( x ) w ( y

w ( x ) = x 2 ( 3 - 2 x ) ;

(2.66)

The height of a surface point can be expressed as weighted average of the height of

corner points

z(x,y) = ZlW(X,y ) + Z2w(1-x,y ) + Z3w(l-x,l-y ) + Z4(x,l-y). (2.67)

If we use equations (2.66), we can rewri te equation (2.67) as

z(x,y) = ((ZI+Z4)--(Z2+Z3)Xx2(3-2x)Xy2(3-2y)) + (Z3-Z4Xx2(3-2x)) + (z2-z4)(y2(3-2y)), (2.68)

an equation which looks ra ther complicated. However, if we introduce the abbreviations

2 2( u = x ( 3 - 2 x ) ; v = y 3 - 2 y ) , ( 2 . 6 9 )

equation (2.68) turns into a simple bilinear equation

z ( x , y ) = auv + bu + cv (2.70)

with obvious parameter identif ications for 'a t, 'b t, and 'c ' . Thus, we are still dealing

with equation (2.57), with the only d i f ference that the coordinates (x,y) are replaced

by functions of these coordinates. Equations (2.70) and (2.69) provide a system of equa-

tions consisting of two parts: The interpolat ion equation, which is simply a bilinear

equation, and a map {x,y) ~ (u,v), which defines a deformation of the original c o o r d i -

68

es, thus , t h a t t h e y s a t i s f y c e r t a i n c o n d i t i o n s a t t h e b o u n d a r i e s o f t h e gr id e l e m e n t .

Our a p p r o x i m a t i o n p r o b l e m t u rn s in to t h e p rob l em to f ind a p rope r m a p (x,y) -~ (u,v)

wh ich s a t i s f i e s t he r e q u i r e d cond i t ions : T he m a p

3 2 u = alx 3 + blX + ClX + d 1

v a2Y + b2 y2 + c2Y + d 2 (2.71)

for i n s t a n c e a l lows to ad jus t t h e f i r s t d e r i v a t i v e a long t he b o u n d a r i e s o f t he gr id

e l e m e n t ; h o w e v e r , it is no t t he m o s t g e n e r a l c a s e (for a d i s cus s ion of sp l ines , s e e

DeBOOR, 1978). A n y w a y , e v e n an a p p r o x i m a t i o n by e q u a t i o n 2.71 is r a t h e r s e n s i t i v e

to s m a l l c h a n g e s in t h e gr id s t r u c t u r e . T h e d i s cus s ion o f t h e cub i c sp l ine in s e c t i o n

2 .a .3 c an e a s i l y be e x t e n d e d to t h e t w o - d i m e n s i o n a l c a s e - - a s m a l l d i s t u r b a n c e o f

t he e s t i m a t e d s lope a t t he gr id b o u n d a r i e s c a n t o t a l l y c h a n g e t h e e s t i m a t e d s u r f a c e

p a t t e r n , wh ich m a y s w i t c h f rom a r idge to a va l l ey or v i ce ve r sa . As eas i ly c an be

seen , t he s t a b i l i t y o f our a p p r o x i m a t i o n p r o b l e m d e p e n d s on ly on t he m a p {2.69), and

th i s r e l a t e s it to s i n g u l a r i t y t h e o r y . We c a n t r a n s f o r m the m a p {2.69) to a m o r e c o n v e n -

i en t f o r m s by a s i m p l e

dislocation of the origin x --- x + 1/2; y -- y + 1/2

and u -- u - 1/2, v --- v - 1/2, which yields

u= 2x 3 - (3/2)x; v = 2y 3 - (3/2)y,

a rotation x -- x + y; y -- x - y yields

3 3 x 2 y ~ y 2 3 u = x + + x + y 3 3 v = x - 3 x 2 y + 3 x y 2 - y

and a final rotation in the (u,v)-space u --~ u+v, v--- u-v

transforms our original map into

u = 2 x 3 + 6 x y 2 ; v = 2y 3 + 6 x 2 y , ( 2 . 7 2 )

a m a p w h i c h r e p r e s e n t s a spec i a l fo rm of t he double cu sp c a t a s t r o p h e . In c a t a s t r o p h e

t h e o r y such a m a p is e m b e d d e d in a p o t e n t i a l , in th i s c a s e t h e p o t e n t i a l would be

V = x4/2 + y4/2 + 3x2y 2 -ux -vy, (2.73)

and t he m a p r e s u l t s f rom t h e cond i t i on t h a t t h e pa r t i a l d e r i v a t i v e s van ish , i .e. f r om

t h e e q u a t i o n s

V = 0 = 2x 3 + 6xy 2 - u x

V = 0 = 2 y 3 + 6 x 2 y - v . Y

69

If we now re turn to the more general case {the der iva t ives are de te rmined from the

da ta points), we need again addit ional pa ramete r s . Ca tas t rophe theory implies tha t the

potent ia l (2.73} has general unfolding

V = x 4 + y4 + a x 2 y 2 + b x 2 y + c y 2 x + dx 2 + e x y + f y 2 _ ux - v y , (2.74)

an expression which provides us with 6 f ree pa rame te r s to adjust the boundary con-

ditions. This expression, however, is in local coordinates~ in global coordinates we would

have to unfold the map (2.72} and to consider all possible pa rame te r s inclusively the

cons tan t ones.

The Double Cusp is ex t remely unstable, the s table regions are ex t remely narrow,

and even small d is turbances cause switching solutions, in this special case switches

from ridges and hills to valleys and depressions. This gives us a d i rec t re lat ionship

to ca t a s t rophe theory; however, the problems encounte red through these sect ions are

connec ted with ca tas t rophe theory in a much wider sense: The approximation problem

in surface recons t ruc t ion is usually associa ted by some opt imizing problem, i.e. to es t i -

ma te the grid point from the neares t data points. A common problem with such opt imiz-

ing s t ra teg ies is tha t during a smooth change of the dis tance function ' the opt imum

solution changes with a jump, t ransfer r ing from one compet ing maximum to the o the r '

(ARNOLD, 1984). That, of course, is what we observed throughout the discussion of

surface reconst ruct ion . The connect ion of the observed instabi l i t ies with ca t a s t rophe

theory may not necessar i ly be obvious because we usually think about d i sc re te and non--

d i f fe ren t iab le systems in approximation processes. However, d iscre te are only the grid

points, which turned out to be pa rame te r s of a smooth interpolat ion surface. By a smooth

change of these pa rame te r s the approximat ion may reac t with sudden jumps in the

geomet r ica l solution. A change of the grid point values, however, is equivalent to a

change of the boundary conditions, and these clear ly a f fec t the opt imizing function.

in t e rms of var iable boundary condit ions we can, therefore , apply ca t a s t rophe theory

to the surface approximation problem. A discussion of more general opt imizing prob-

lems is given in ARNOLD (1984).

one parameter

F(y) = I Y l

Table 2.1 Singulari t ies in opt imiza t ion problems

Normal forms of a maxima funct ion F {ARNOLD, 1984):

two parameters

IyT or

F(y) = max(Yl' Y2' YI+Y2 ) o r

2 mxaX(-X4 + y l x + Y2 x)

3. N E A R L Y C H A O T I C B E H A V I O R

O N F I N I T E P O I N T S E T S

Chaos implies to ta l ly and apparen t ly i r remediable lack of organizat ion. In physics,

a classical example for chaos is turbulence. In a turbulent system, the pathway of a

par t ic le cannot be predic ted at all, and two par t ic les , which are init ial ly close toge ther ,

may depar t in a shor t t ime interval . The t rans i t ion from a de te rmin i s t i c {laminar) behav-

ior to chaos ( turbulence) can be usually described by a b i furca t ion t ree (Fig. 3.1). "Af te r

the f i rs t b i fu rca t ion the flow becomes periodic, a f t e r the second b i furca t ion the flow

is quasi periodic with two periods, and so on" (RICHTMYER, 1981). Af t e r a suff ic ient ly

high number of b i furca t ions the chaot ic aspect of the flow is so highly developed tha t

s t a t i s t i ca l methods are the proper way to study i ts behavior . It is c lea r t ha t the behav-

ior of such sys tems during the course of t ime depends very sensi t ively on the init ial

condi t ions {HAKEN, 1981), and t ha t the b i furca t ions are not a dynamical fea ture , but

appear in the s t a t e space of the system, i.e. they are a topological proper ty of the

system.

During the last decade, another way to study chaos has a t t r a c t e d much a t ten t ion :

the behavior of d i f fe rence equat ions in ca lcu la tors (MAY, 1974; ROSSLER, 1979; THOMP-

SON, 1982). In this case, the dynamical system is rep laced by an i t e ra t ed map describing

the ou tcomes in f ini te t ime intervals . MAY's (1974) favor i te example was the s tandard ized

form of the logist ic d i f f e rence equat ion. A s h o r t review of the behavior of this equat ion

will be given in the f irst sect ion to in t roduce the concepts of b i furca t ions and of chaos

more precisely. The explici t numer ica l approximat ion of a par t ia l d i f fe ren t ia l equat ion

then e luc ida tes once more the concep t of bi furcat ion, and the concept of i t e r a t ed maps

is used to study inf ini te sequences of caust ics in r e f r ac t ion seismics.

Fig. 3.1: A b i furca t ion cascade or a genera l ized ca t a s t rophe (THOM, 1975), as it resul ts e.g. from the logist ic d i f fe rence equation.

71

Fig. 3.2: Two versions of Galton's m a c h i n e - - a small and a large o n e - - which produce the binomial distribution.

As was noticed above, s ta t is t ica l methods are the usual way to study chaot ic

systems. The 'Galton machine ' (Fig. 3.2) i l lustrates the relationship between the chaot ic

t ra jec tor ies of particles, which cannot be predicted, and the well predictable outcome

if enough part icles are considered. In this case, our impression of chaot ic motion within

the machine will not at least depend on its size (Fig. 3.2). In addition, the form, the

internal geometry of the machine, a f fec t s the type of the s ta t is t ical outcome. The bifur-

cation t ree of Fig. 3.1 can be taken as another machine of this type. It will produce

a uniform distribution. An interest ing case occurs if the internal configuration of such

machines depends on some parameters , or if the initial conditions can a f fec t the outcome

of the machine.

The first example is a brief review of the logistic d i f fe rence equation. A more

interest ing example, from the geological viewpoint, is the instability of the explicit

approximation of a partial different ia l equation. The bifurcation, which is caused by

a smooth change of a parameter , can be nicely visualized by the uncoupling of the grid

into two independent substructures. The concept of i te ra ted maps is finally applied to

series of caust ics in ref rac t ion seismology.

72

The concept of bifurcations and chaos is then applied to several computer methods.

The problem is that chaos in such cases is not obvious. In most exampies, a small change

of parameters will strongly influence the outcome, but with a computer procedure this

sensi t ivi ty will normally not be de t ec t ed because the data are only processed with a

cer ta in pa ramete r set t ing. The first of these examples is the usual Chi2-test ing of direc-

tional data. The tes t is commonly performed against a uniform distribution, and it is

unstable with respect to an arbi t rary choice of the sectorial pa t te rn on which the com-

putation of the tes t s ta t i s t ic is evaluated. The striking point is that the stabil i ty of

the tes t decreases with increasing sample size.

In the third sect ion, problems with sampling s t ra tegies in sedimentology are dis-

cussed. One goal of the s ta t is t ica l analysis of profiles is to de tec t periodicity pat terns .

Two methods are in use, the analysis versus transit ional probabilit ies and the classical

t ime-ser ies analysis. In both cases it is a typical s t ra tegy to take samples at equal dis-

tances. In this case, the transit ion matrix becomes dominated by singular loops, and

the so -ca l l ed ' t ransi t ional probabil i t ies ' are not fur ther free of dimensions. In the case

of a t ime series anaIysis, the identical approach can cause art i f icial pa t te rn formation.

The example is closely related to the generici ty problem of maps, an aspect which is

briefly mentioned. The main result will be that geometr ical and geological reasoning

cannot be replaced by a formal, pseudo-object ive sampling s t ra tegy.

Then, we shall deal with various aspects of classical centroid cluster s t ra tegies .

Again a si tuation is encountered where an increase of the sample size does destabil ize

a ' s ta t i s t ica l ' pa t t e rn recognition process, and it will turn out that these methods provide

excel lent examples of chaot ic behavior on finite point s e t s - - they show the discussed

proper t ies of chaos, especially the ex t remely high sensi t ivi ty to small changes in the

initial data.

Finally, the bifurcation of t ree- l ike bodies is analyzed. The basic model is ent irely

determinis t ic ; never theless the bifurcation pa t te rns gene ra ted are rather chaotic. From

this chaot ic pat tern , however, a well de termined shape a r i s e s - - an analogy found in

the shape of t rees , which is typical on the species level. The analysis is based on a

modificat ion of HONDA's (1971) computer model and takes up the geometr ical analysis,

which roots in D'Arcy Thompson's and even Leonardo da Vinci's work.

3.1 ITERATED MAPS

Classically, stabil i ty is the most important concept for the numerical solution of

different ia l equations. The typical way to solve dif ferent ia l and partial different ia l equa-

tions numerically is to t ransform them into an ' i t e r a t ed map' by use of Taylor's theorem,

73

It is well known tha t there are somet imes several choices for the t rans format ion , and

tha t the various possible approximations behave d i f ferent ly with respect to the quali ty

of the approximation, to the convergence and to o ther s tabi l i ty problems. Here some

aspects of i t e r a t ed maps are brief ly discussed under topologicaI aspects because this

approach may give some insight not only in those problems, which occur with d i f fe rence

equations, but aIso in the concepts of b i furca t ions and chaos.

3.1.1 The Logist ic Di f fe rence Equation

The logistic growth function plays some role in biology and in paleontology. The

d i f fe rence formulat ion of this equat ion was MAYas (1975) favored example for b i furca t ions

and chaot ic behavior. In the meant ime, it became an impor tan t example for b i furca t ion

cascades and chaos in various fields (e.g. HAKEN, ed. 1982). The d i f fe ren t ia l equat ion

of the logistic equat ion is given by

y' = ay(b-y), (3.1)

which has a wei1 known explici t solution. A simple s t ra ight forward d i f fe rence approxi-

mat ion is given by

Yi+l = Yi + dtaYi(b-Yi)" (3.2)

For a special p a r a m e t e r se t t ing of 'a ' and 'b ' , the solution of this d i f fe rence equat ion

depends only on the p a r a m e t e r At, which represen t s a f ini te t ime interval . As Fig. a.a

shows, the upper boundary is only approached for small values of a t . As this p a r a m e t e r

increases, one finds tha t the solution f luc tua tes around the sa tura t ion level. For larger

values of the discre te t ime intervals , the long t ime output of the d i f fe rence system

resembles much more the Lotka-Vol tera model (LOTKA, 1956) of a preda tor -prey system

than the original logistic growth model.

By some e l emen ta ry coordinate t r ans format ions (e.g. ROSSLER, 1979) the logistic

d i f fe rence equat ion can be s tandardized to the form

Yi+l = rYi(l-Yi)' (3.3

which allows to analyze tlle behavior of this model in a general way. The relat ionship

be tween the Yi+l and the Yi values can be p lo t ted as the graph of a funct ion for which

the Yi values are the values of the independent variable. For the f irst i tera t ion, the

graph of this funct ion is a parabola (Fig. 3.4, it I). The sa tura t ion value is exact ly

reached if Yi+l = Yi' and this defines a s t ra ight line in the (yi,Yi+l) coordinates . In the

74

Fig. 3.3: Numeric solutions of the logist ic d i f fe ren t ia l equation for various d i sc re te t ime intervals .

N

At

. . . . . . . . . . t

graph of funct ion (3.3) the sa tura t ion point is given by the in te rsec t ion of this line with

a specif ic parabola, which is de te rmined by the p a r a m e t e r r. Fig. 3.4 (I) shows how

one can use these proper t ies to analyze which values of r allow for a s table solution.

The equivalent a lgebraic expression would be

Yi = rYi(l-Yi)' (3.4)

which can be solved for Yi" Higher i te ra t ions are capable to produce periodic solutions.

i t I Yi÷1 i t I I

Yi+t

Y,. Fig. 3.4: Fi rs t (I) and second (II) i t e ra t ion system of the logistic equat ion in s tandardized form. The curves correspond to d i f fe ren t values of the p a r a m e t e r ' r ' ; the i r in te r sec t ions with the s t ra igh t line are the equilibrium values.

75

The first one occurs for Yi+2 = Yi' i.e. every second i terat ion takes the same value.

Again one can find a graphic representat ion as well as an algebraic one. If one rewri tes

equation {3.4) in te rms of Yi+2 and of Yi' one finds a quartic polynomial

Yi+2 = rYi+l(l-Yi+l)

= (rYi(l-Yi))(l-rYi(l-Yi)).

(3.5)

The stable points are found in the same way as before {Fig. 3.4, it II) by set t ing Yi+2=Yi ,

and we find up to four equilibrium points, but not all of them are stable. As the para-

meter r varies, one finds up to three intersect ions between the polynomial and the equi-

librium line (except the trivial solution Yi = Yi+2 = 0). In the same way we find an in-

creasing number of periodic solutions for every relationship Yi+k = Yi' or, as k increases,

we get an infinite number of periodic solutions or a bifurcation cascade like in Fig. 3.1.

This type of chaot ic behavior was analyzed by MAY (1974), who showed that the logistic

equation has an infinite number of possible periodic t ra jec tor ies and is of chaot ic

behavior.

However, the logistic equation is only the special ce lebra ted example. OSTER &

GUGGENHEIMER (1976) showed that any convex function can replace the parabola in

equation (a.al and drew connections to the Hopf bifurcation. Even a linear spline

approximation causes such bifurcations and periodic solutions {Fig. 3.5a). Probably models

based on exponential functions are more biological than the finite logistic model because

they have no sharp upper limit. In the case of the finite logistic equation, there is a

limit for the 'height ' of the parabola: It cannot exceed its 'width ' , otherwise the process

escapes into negative values without bounds, i.e. the i terat ion simply breaks down. There-

l V I I

Fig. 3.5: Any convex function can be used to define an i te ra ted map, which possesses periodic solutions. Dashed lines indicate pathways which te rminate in a cyclic motion.

76

fore, we are not free in choosing the parameter r which is bound to values 0 <r <4

(y(0.5)=r/4 ---> rmax=4), and a second limit is given when the parabola is so shallow that

it does not in tersect with the line Yi+l=Yi . Some possible a l ternat ive models are (OSTER

& GUGGENHEIMER, t976):

Yi+ 1 = Yi exp(r( 1-ay i)

and

Yi+ 1 = Yi/{ I +exp(-b(1-aYi)).

For r << 1 the last equation can be approximated by equation (3.3} (see OSTER & GUG-

GENHEIMER, 1976). In general, the logistic equation provides a 'prototype ' of chaotic

behavior of i tera ted maps, which easily can be analyzed, and, therefore, it is the most

ce lebrated example.

3.1.2 The Numerical Approximation of a Part ial

Different ia l Equation

In geology the partial differential equation u t = Uxx plays some role as a ' t ransport

equation' . It describes e.g. the flow in porous media and the compaction of sediments

(TERZAGHI, 1943; DESAI & CHRISTIAN, 1977}. A straight forward approximation by

differences leads to the explici t scheme

(Ux,t+l-Ux,t)/At = (Ux_l, t - 2Ux+l, t + Ux+l,t)/Ax2. (3.6)

This equation can be rewri t ten as an i tera ted map

Ux,t+ 1 = (l-2(At/Sx2)Ux,t + (Ux_l, t + Ux+l,t)At/Ax 2. (3.7)

It is well known from numerical mathemat ics (MARSAL, 1976) that the stability of this

approximation requires that

1 - 2(At/Ax 2) ~ O. (3.8)

Fig. 3.6: Stabili ty region of equation (3.8).

Ax

" i l At=Ax2/2

4 f "

if1 table cr i t ical

1

The first r ight hand te rm of equat ion (3.7) needs to be positive, and this de te rmines

the s tabi l i ty condition. If the left hand side of equat ion (3.8) is set to zero, the equat ion

describes a parabola (Fig. 3.6) in the control space (~t,kx), and the in teres t ing point

is what happens in this case with equat ion (3.7). If the control pa r ame te r (3.8) is zero,

the local solution of the map (3.7) does not fur ther depend on the Ux, t values, and

the grid separa tes into two disconnected subs t ruc tures (Fig. 3.7}. The solution then does

77

Fig. 3.7: Two represen ta t ions of the grid b i furca t ion for the explici t d i f fe rence scheme of the t ranspor t equat ion u t = Uxx. The bi furcat ion occurs if the control

p a r a m e t e r is zero (I - 2(kt/Ax 2} = 0).

not fu r ther descr ibe the original ' t r anspor t equat ion ' , but two independent solutions of

this type arise. In consequence, the solution depends strongly on the init ial da ta config-

uration, a fac t which is somet imes not recognized (MARSAL, 1976}. To see how the

solution depends on the init ial conditions, the degenera ted version of equat ion (3.7) can

be wr i t t en as

Ux,t+l = (Ux_l, t + Ux+l,t)/2 (3.9)

Now, we take the example of a subs tance spreading from a source of cons tan t

in tensi ty into an empty medium, and we compute this by use of equat ion (3.9):

t 0 0 0 0 0 0 0

t i l I 0.5 0 0 0 0 0 t I 0.5 0.25 0 0 0 0

t I 0.625 0.25 0.125 0 0 0

78

The numerical approximation looks ra ther well, and, as can be shown (MARSAL, 1976),

it really approximates the different ial equation. Next, we take the same boundary condi-

tions but d i f ferent initial conditions:

t = O

t = I

t = 2

t = 3

t = 4

1 0 1 0 1 0

1 1 0 1 0 1

1 0 .5 1 0 1 0

1 1 0.25 I 0 I

1 0.625 1 0.125 1 0

O W g

b O ~

This time, the solution is nei ther numerically nor physically reasonable, we get fluctua-

tions which cannot approximate the t ransport equation. What happens, becomes clear

if one sketches how the successive values are connected:

..'% ><....'>< 0 I 1 . . . j I ~ . . . . j ~ . . . 0 . . . j I

" " " " 7 ""0 1 U , ; ~ 1 st] ~ •

J" "~ s • f ..'% 5< 5 < ' > 5 I x _ I . . 0 . 2 5 _I . 0 . I -><. ->< I ~ I O. 1 0

Clearly, the bifurcation of the grid into two independent subsets causes two independent

solutions -- one is constant because the initial conditions are constant on this subset

(the diagonal series of ones}, the other one resembles the solution of the first example,

i.e. a spreading process from the left boundary into an empty medium. It is not hard

to see that the stable solution of the first example is not really stable, as somet imes

is assumed in texts on numerical methods (MARSAL, 1976), but that it also consists

of two independent solutions, which under the special conditions become identical.

In case the cri t ical pa ramete r (3.8) takes values less than zero, the result f luctuates

and assumes negative values on one of the bifurcation grids. In this case, the grids are

again connected, but the negat ive control pa ramete r causes a l ternat ing signs of the

U values so tha t one, in pr inc ip le , has still two di f ferent solutions of the discussed x,t

type. In addition, we observe a close relationship to the discussion of regular and cen te red

grids in sect ions 2.4.4-5. Indeed, as the cri t ical value of the parameter (3.8) is ap-

proached, the stable regular grid {Fig. 3.6-1) evolves into two grids which resemble the

cen te red grids of the previous discussion. These grids are disconnected and provide two

79

independent solutions. The stability problem, therefore, has a strong topological

component. Another analogy provides the discussion of linear systems in section 2.2.1,

where we observed a similar parabolic stability boundary. Of course, the partial d i f feren-

tial equation ut+Uxx=0 can be approximated by a set of differential equations

Y'I = allYl + a12Y2 + "'" + alnYn

Y'2 = a21Yl + a22Y2 + "'" + a2nYn etc.,

(3.1o)

which provide a discontinuous spatial but continuous temporal approximation.

To general ize this result, we can briefly analyze the difference approximation of

first derivatives. There are three approximations in use (e.g. DESAI & CHRISTIAN, 1977),

the

forward difference

backward difference

central difference

(Ui+l, j-ui, j) /Ax + O(Ax)

(ui, ]-Ui_l, j)/Ax + O(bx)

(Ui+l, j-Ui_l, j)/(gkx) + O((Ax)2).

Under numerical aspects the central difference should be the best one to approximate

a first partial derivat ive because its discretization error is only of order (Ax) 2. But nearly

all approximations using the central difference are instable (MARSAL, 1976). The previous

discussion has shown that this is not a numerical problem but a topological one. The

central di f ference causes a grid bifurcation as in the previous example, i.e. one computes

two independent solutions, and, therefore, the approximation can only be used for very

special initial conditions. Actually, the problems, which arise here, are very close to

those discussed in the last chapter, especially the surface approximations from scat tered

data.

3.1.3 Infinite Series of Caustics

In refract ion seismology one is sometimes interested in the so-called 'higher arrivals '

and in the caustic formed by the rays. The caustic is the envelope of rays, which, in

its totali ty, can be writ ten as

F(x,y,p) = O. (3.11)

The equation of the envelope of this family of rays is obtained by eliminating the ray

80

p a r a m e t e r p f rom e q u a t i o n (3.11) and f rom i ts pa r t i a l d e r i v a t i v e

(3.12) 3 F(x,y,p) = 0

3p

(e.g. BEN-MENAHEM & SINGH, 1981). For a general overview one can choose special

conditions, i.e. the situation where the source is located at the reflector. For a medium

with linear velocity increase, the rays are circles (e.g. OFFICER, 1974). Because the

aim here is only to demonstrate the use of iterated maps, a still more simple model

will be used , pa r abo l i c r ays . If t h e c u r v a t u r e o f t h e r a y s u n d e r c o n s i d e r a t i o n is l a rge ,

t h e p a r a b o l i c r a y s a p p r o x i m a t e t h e c i r c u l a r ones to s o m e e x t e n t . If all pa rabo l i c r ay s

h a v e a c o m m o n s o u r c e point , e q u a t i o n (3.11) b e c o m e s

y - ( x 2 - b x ) = 0. (3.13)

A s p e c i f i c r ay wi th r ay p a r a m e t e r b p a s s e s t h r o u g h t h e po in t s x = 0 and x = b if y = 0.

A t x = b t h e r ay is r e f l e c t e d , and b e c a u s e t he s o u r c e is l o c a t e d a t t h e r e f l e c t o r , it

r e a c h e s t he r e f l e c t o r a s e c o n d t i m e a t x = 2b. This g ives t h e g e n e r a l i t e r a t e d m a p

for t h e r e f l e c t i o n po i n t s

or

x i = xi_ I + b

x. = xo + ib. 1

(3.14)

T h e r ay i t s e l f is d e f i n e d in t he i n t e rva l (0,b) and m a p s i t e r a t i v e l y in to t he i n t e r v a l s

(b,2b), (2b,3b), . . . . Or if a c e r t a i n i n t e r v a l is g iven , t h e e q u a t i o n for t h e r ays {3.13)

t a k e s t h e fo rm (by u se o f e q u a t i o n (3.14)

y - ( ( x i - i b ) 2 - b ( x i - i b ) ) = 0 . ( 3 . 1 5 )

Fig. 3.8: Caus t i c ( s ) o f a s i m p l e p a r a b o l i c r ay s y s t e m w i t h r e f l e c t i o n a t t h e s u r f a c e . T h e s o u r c e is l o c a t e d a t t he r e f l e c t o r .

81

By use of equations (3.11) and (3.12) one finds the equation of the caust ics by eliminating

the ray paramete r b:

y - xi2(1-(l+2i)2/(4i(i+l))) = 0

or (3.16)

y + xi/(4i(l+i)) = O.

Thus, the caust ics are an infinite series of parabola with increasing curvature, which

all pass through the point {0,0). Fig. 3.8 gives the first i terat ion as an example. More

complicated ray systems like circular rays or arbitrary positions of the source can be

t rea ted in the same way.

Of some interest is the case of a source located below the ref lec tor . This situation

causes a bifurcation, which can be qualitatively described in the following way. The

source is now located at depth 'a ' and the ref lec tor , as before, at depth zero. Then

the ray equation (3.16) becomes

( y + a ) - ( x 2 - b x ) = 0, ( 3 . 1 7 )

i.e. all rays are passing through the source point (-a,0). Now, the rays are re f l ec ted

at y = 0, and their horizontal position is then

2 - x + bx +a = 0 . ( 3 . 1 8 )

To find the ref lect ion points, one has to solve the quadratic equation (3.18), and, in

general, this will give two ref lect ion points because the ray propagates into the positive

and into the negative x-direct ion from the source point (o,-a). Because of symmetry

Fig. 3.9: Caustic of parabolic rays. The source is located below the ref lec tor .

82

reasons, we have to consider the absolute values of the ref lec t ion point; both solutions

propagate into the positive and into the negative direction -- we have to fold the solution

space. Only in this case, the whole semiinfini te solution space is covered with rays.

The consequence is that the ray parameter 'b' is not further uniquely defined; it describes

two rays, and both ray systems are capable to produce i tera t ively caustics. The two

caust ics are connected at a cusp point, i.e. one finds cuspoid caust ics (POSTON &

STEWART, 1978; Fig. 3.9). The bifurcation of the solution arises also with other ray

systems, such as circular rays. It is not a property of a special ray system, but it only

depends on the position of the source; it is a s t ructural ly stable topological proper ty

of ray systems.

3.2 CHI2-TESTING OF DIRECTIONAL DATA

Within the analysis of or ientat ion data one has to prove whether the observed

distribution has pronounced ext rema. The usual way is to tes t the contrary, whether

the data differ significantly from a uniform distribution. The propagated method in text -

books {e.g. MARSAL, 1979) is the Chi2-test . BALLENTYNE & CORNISH {1979) observed

tha t the Chi2-value obtained from such a tes t depends on the arbi t rary select ion of

the of f se t point for the sector system, in which the direct ional data are grouped. By

an extensive numerical analysis they showed that the Chi2-values f luctuate as the sector

pa t te rn is ro ta t ed over the data. In their analysis, the Chi2-values passed thereby several

t imes the s ignif icance leveh Therefore they concluded:

"The hitherto widespread use of the test in this way cannot, therefore, be

considered a valid mode of analysis, and the results of previous studies

employin E this methodolgy must be treated with extreme caution. "

While the numerical analysis of BALLENTYNE & CORNISH (1979) shows that this

is a problemat ic tes t , it does not e lucidate why the uniform distribution as a zero-hypoth-

esis behaves in such an unpredicted way. Ballentyne & Cornish not iced that the expec ted

frequency E is a constant for all sec tors under the special condition of a uniform distri-

bution:

E = N / k; N: number of data (3.19)

k: number of sectors.

The equation for the Chi2-value can, therefore , be wr i t ten as

1 k k X2= ~ i~=l ((Oi-E)2 = i=l ~ (Oi-Ei)2/Ei

if E. = constant; I

0.: number of observed data in the sector i. i

(3.20)

83

But this simplified version contains still a constant value inside the sum. Further evalua-

tion yields

X2= 1 }]0i 2 _ 2 }]0 i + kE. ( 3 . 2 1 )

The indices of the sums are the same as in equation (3.20), they will not be repea ted

in the following equations. By use of equation (3.19) one has the relations

O = N and kE = N , 1

and this gives finally

N 2 - N. (3 .22 ) X = = ~- [ 0 i

Thus the obtained Chi2-value depends only on the sum of squares of the observed values

in the sector system. Because N and k are constants for a cer tain data set and a given

sec tor pat tern , one can define a modified tes t s t a t i s t i c , which simplifies the fur ther

analysis:

k O. 2 (3.23) ~x%1= X • •

The important point is that the tes t s ta t i s t ic does not really depend on the expecta t ion

values Ei; the only remaining variables are the Oi's , and the Oi's can be al tered by

a dislocation of the sector system or by another spacing of the sectors . The same situa-

tion arises in normal histograms and two-dimensional data if they are t es ted against

the uniform distribution. The derived folmulae also hold in these cases. To study the

behavior of the tes t s ta t i s t ic it will be sufficient to al ter the offse t point of the sector

system.

Rotat ion of the sector pa t tern causes jumps of data points from one sector into

the neighboring one when the sector boundary passes a data point. Such a jump modifies

sum ~Oi 2 locally: the

2 + 2 (0i-I)2 + (0i+I)2 = 0 i Oi+ 1 + 2(Oi+l-Oi) + 2. (3.24)

For an arbitrary number of jumps between two classes one finds

(OI-J)2 + (02+J)2 = 012 + 022 + 2(02-01 ) + 2J 2 (3.25)

J: number of jumps.

Therefore, the tes t s ta t i s t ics is changed by a value

84

2(J(02_01) + j2). (3.26}

The general equation for a dislocation of the sector pa t te rn is found by summing up

all local changes, and the modified tes t s ta t i s t ic (3.21) can be wri t ten as

k Xa 2 + 1 = I 0 i + 2 10i(Ji-Ji+l) + 2 ~Ji 2 - 2 [JiJi+l , (3.27)

where the sec tor k + 1 is identical with the sec tor i. The equation shows that any

jump of a data point from one sector into another, under rotat ion of the sec tor pat tern ,

will a l ter the tes t value, as was numerically found by Ballentyne & Cornish. But equation

(3.27) allows a more detai led analysis. To have no change of ~ Oi 2 under a rotat ion

of the sec tors requires

o = X o i ( a i - J i + 1) + XJi 2 - X J i a i + l or alternatively

2 0 = XJi(Oi+l - Oi) + ~ Ji - XmiJi+l"

( 3 . 2 8 )

These conditions can only be sat isf ied if J l = J2 . . . . = Jk' or if Ji = 0 for atl i. In

all o ther cases, the original sum is al tered, and one gets d i f fe rent tes t values. Now,

the condition to have an equal number of jumps for all sectors is mainly a geometr ica l

problem.

The const ra in ts given by equation (3.28) require that the data are equally spaced

on the circle and that they have unique frequency. The unique frequency within every

sec tor is what one suspects to be proved by the tes t method, but now spacing appears

as a new parameter , which a f f ec t s the tes t value. In addition, on a suff icient ly fine

scale the unique distribution does not play any role fur thermore . The data are measured

on a scale of real numbers, and, therefore , any local c lus ter is due to the roughness

of the measurement ; as the scale becomes finer and finer, the cluster will be divided

into single points. Therefore , on a suff ic ient ly fine scale the data are single points on

the circle {Fig. 3.10). The only remaining variable, then, is the spacing of the data

points. One can in terpre t any theore t ica l distribution in the way that the density over

Fig. 3.10: Equally spaced and uniform distr ibuted points on a circle.

85

a cer ta in interval gives a measurement of the (infinitesimal} spacing of points on the

real line. In the case of the uniform distribution, the density describes an (infinitesimal}

uniform spacing. But what we expect , are not equally spaced data. Our opinion is that

the data result from a random process, which selects the data with equal probability

from a cer ta in interval of the real numbers. Therefore, one cannot expect to find equally

spaced data in a finite size s a m p l e - - one has, of course, the combinational problem

to arrange N data on M points on the real line where M is given by the roughness of

the measurement .

Now, one may ask how strong the a l terat ions of the test value for d i f ferent distri-

bution pa t te rns are. The most simple system are two sectors , and 100 data give a likely

sample size. In a f irst case, we may have nearly equal spacing, then every sec tor con-

tains approximately 50 data points. The rotat ion of the sector system causes only jumps

of a few data points at once, say, in the order 1 to 10. We find that ~ 0 i takes values

like (equation (3.24)):

502 + 502 5000

492 + 512 5002

452 + 552 = 5050

402 + 602 = 5200.

The change of the test value is not very impressive. Changes of this magnitude are

found if random numbers a r e , u s e d to provide a numerical test . Now, assume in the

second case that the distribution has a single well pronounced maximum so that it

is possible to locate all data in one of the two sectors. On the other hand, there exists

a rotat ion of the sector system, which divides the data into two sets with nearly equal

frequency; therefore , under the rotat ion one will find sums in the range

02 + 1002 = 10 000

5() 2 + 502 = 5 000,

and the ex t rema diverge quite clearly. For this data configuration the behavior of the

system becomes ra ther chaotic with respect to an arbitrary location of the sec tor system.

In addition, the possible outcomes diverge rapidly with increasing sample size. Thus,

we have the striking situation that , in contradict ion to the general s ta t is t ical opinion,

an increase of sample size does not stabil ize the result, but that the contrary is true:

The uncertainty about the result of the test increases as the

deviation of the sample from the uniform distribution approaches

certainty.

86

These observat ions allow a final discussion of the causes for the instabili ty of

the tes t . Take a t w o - s e c t o r sample from the normal distribution over the circle (e.g.

MARDIA, 1972)~ then there are two ex t r eme cases: In the first, one sec tor contains

nearly all data, in the second, both sec tors contain an equal number of data. This is

again the si tuation considered above. But now, with the normal distribution, the expecta-

tion values are ne i ther independent of the data s t r u c t u r e - - they depend on the mean

and variance of the d a t a - - nor are they independent of the choice of the of f se t point

of the sec tor system. As the sec tor ro ta tes , the observed frequency within a sec tor

is a l tered in the same way as the expecta t ion values for this sec tor and vice versa.

To use a term from synerget ics (HAKEN, 1977), both values are ' s laved' by the position

of the sec tor system. In the case of a uniform distribution, the expecta t ion values break

out of this 'slaving' , and this allows the system to f luctuate f ree as described above.

The ' revolt of the slaved paramete rs ' becomes especially strong when the observed fre-

quencies are strongly dependent on the position of the sector system, i.e. when the

distribution has a well pronounced maximum.

3.3 PROBLEMS WITH SAMPLING STRATEGIES IN SEDIMENTOLOGY

The analysis of pseudo-t ime series (strat igraphic thickness against some variable)

by means of classical methods like polynomial curve fit t ing, moving averages, cross

correla t ion e tc . is well known (FOX, 1975; SCHWARZACHER, 1974). Besides these

methods, random models have been used, and here especially the concept of transitional

probabili t ies and of Markov chains (KRUMBE1N, 1975}. In sedimentology and s t rat igraphy

the general problem with these techniques is that the ' t ime series ' or the ' sequence

of signals' is usually too short because the profiles are of l imited length. Other problems

result from special, propagated sampling techniques like equal dis tance sampling. These

problems are of special in teres t in the present context because they cause problems

of convergence, and they are capable to genera te ar t i f icial pat terns: They are, in some

way, the inverse problem of i t e ra ted maps. It will turn out that the problems are again

geometr ica l ones, and that it is not advisable to replace geological (morphological} reason-

ing by a sampling formalism.

3.3.1 Markov Chains in Sedimentology

There arise special problems if transit ional probabil i t ies are used to study periodici ty

pa t te rns of profiles. These problems are mainly of a classif icatory nature; they resemble

closely the problem to define the geometry of a bed or facies unit. Fur thermore , they

are re la ted to the process of sedimentat ion and from this to the type of ' s i g n a l s ' - -

87

a b c

T ° 7 m e

Fig. 3.11: Graphic representa t ion of d i f ferent scales for definition of probabili- t ies on profiles, a) The profile in classical representat ion, b) the probability to find a cer ta in lithology measured in terms of bed thickness, c) the probability of a lithotogy to be deposited during a t ime interval.

whether the sedimentat ion process consists of dist inct events or ref lec ts ,a continuously

changing environment .

Transitional probabilities are just one possible definition of a large set of probability

measurements to be defined on. profiles. Some other types of probability measurements

on profiles may be briefly reviewed as a base for the later discussion of transitional

probabilities. From a sedimentologicat viewpoint, we have the total probability of a

cer tain facies type, the likelihood to take a sample from the profile and to find a sand-

stone, carbonates , a claystone e tc . In order to define the probability measurement , the

profile has to be classified into dist inct facies units or beds, and the probabilit ies result

from bed-thicknesses. Still the same probability s t ructure but with other probability

values results if the depth scale of the profile is t ransformed into a t ime scale. So,

any smooth deformation of the scale will give new probability values, but it will not

disturb the special probability algebra (Fig. 3.11). Another type of probabilities with

somewhat d i f ferent algebraic rules (RENYI, 1977; FISZ, 1976) results if two objects

are analyzed simultaneously. In this case, one works with conditional probabilities, e.g,

the probability to find a cer ta in fossil in a specif ied lithology. If one can define some

ordering relation like before and af ter , then a series of d iscre te signals can be analyzed

in terms of the conditional probabilities: to find a cer ta in signal before or a f te r another

one. If the positional relationship is reduced to ' just before ' or 'just a f te r ' , then the

conditional probabilit ies become the classical transitional probabilit ies of a Markov chain.

The s t ructure of the conditional probability space depends not only on the scale used

for the measurements but also on the definition of the relationship between the two

88

sets of objects.

In sedimentological prac t ice the main problem is to define dist inct signals, i.e.

dist inct sedimentological units. In a very narrow sense, this is only possible if the sedi-

mentat ion process is not continuous. But long series of sedimentat ion by events usually

are r es t r i c t ed to turbidites, in which the sedimentological s t ruc ture is very homogene-

o u s - - commonly without transit ions be tween a larger number of lithologies. In all o ther

cases, the transit ions be tween beds or facies units are more or less continuous. But,

as sharp boundaries be tween the 'signals ' disappear, the definition of a probability space

becomes more and more subjective, and this cont radic ts the aim of an objective s ta t i s t i -

cal analysis. For this reason, a modified sampling technique is frequently used to establish

the empirical t ransit ion probabili t ies of a ' sedimentological ' Markov chain (MIALL, 1973).

Instead of d iscre te signals, which have to be defined subjectively, one takes small samples

in some regular distance. The reason is that the samples can be more easily classified.

It seems worthwhile to analyze how the d i f fe rent sampling methods may influence the

probability s t ructure .

A) Discre te Signals

First some additional features of the classical approach of Markov chains m a y

be discussed with respect to the sedimentological questions. A sedimentary unit charac te r -

ized by its lithological, sedimentological , paleontological etc . content needs to be defined

as a signal, which is an event e i ther a p r i o r i - - separa ted from the events ' just below '

and 'just above' by dist inct b o u n d a r i e s - - or which can become a 'd is t inct event ' by

a useful definition of its boundaries. Because the event is defined by its s t ructure , the

transit ion matrix is independent of bed thickness and profile depth. The transit ion matrix

is of the form

(3.29)

nij: number of transit ions from signal Sj to Si;

Nj : total number of occurrences of the signal Sj.

Repet i t ions of identical units such as a sequence sandstone - - - sandstone are

usual events in classical Markov chains. In sedimentology they are only possible if there

exists some boundary which can be recognized, a fea ture which usually is re la ted to

banking. Now, the boundary between two beds, no ma t t e r how small, means some change

in sedimentat ion, e.g. a short interval of lowered sedimentat ion ra te that caused the

physical boundary. Thus, if we do not identify the boundary between beds as a separa te

89

Fig. 3.12: A sequence of three sedimentary units a, b, c and different possible transit ion graphs. Above: transitions if the sedimentary units are defined as 'dis- t inct events t, middle: one possible transition graph for equal interval sampling; the number of singular loops depends on the spacing of the samples. Below: occurrence of repetitions, i.e. singular loops, due to lumping of lithologies; tat and 'b' are not distinguished.

event, this may be due to a high threshold that has disturbed the record. This view

can be formalized in terms of a map from the transition matrix of degree n onto a

transition matrix of degree n - k, e.g.

l 0 P12 Pl3N~ 0 / $3 - - ~ $2 Q P l l P 1 2 ) P21 0 P 2 3 ] . . . . . . . . . . . . . . . . . . . . . . . - ~

P21 P22 " P31 P32

(3.30)

A more instructive representation of this map can be given by a transition graph

{Fig. 3.12) which shows how singular loops, like a sandstone --- sandstone sequence,

arise due to ignorance or to lumping of intermediate signals (e.g. sandy shales). As will

be discussed below, the same structure with singular loops arises if the samples are

taken in equal intervals. Within sedimentological problems, the occurrence of singular

loops usually indicates that a transition state (such as non-deposition) is missing.

B) Equal Interval Sampling

If we now go on to the second sampling method, the sampling in discrete intervals,

90

we have to prove whe the r it real ly t e rmina t e s into a Markov chain. MIALL (1973) wri tes

tha t this

"method can give rise to a much more accurate measure of the

relative frequencies of the lithotypes present, but at the

expense of accuracy in measuring step-by-step depositional

changes".

This remark shows t ha t one has to take care tha t the probabi l i t ies are well defined,

i.e. t ha t one does not produce a mix ture out of condi t ional and to ta l probabi l i t ies (see

above). Miall fu r the r emphas izes a sampling in terva l slightly less than the average bed

thickness. This addit ional r ecommenda t ion will be studied in detai l in the next sect ion.

In order to analyze the in te rva l - sample s t r a t egy one may assume to have a ser ies of

well def ined even t s wi th t rans i t iona l probabi l i t ies

0 P j i . . . n j / N j

P i j 0 = n i j / N i

o . , . . . . . . . . 0

( 3 . 3 I )

The samples are t aken in equal in tervals from the identicaI universe. As long as

the sampling d i s tance is larger than the average bed thickness, one will miss qui te a

lot of t rans i t ions . As the sampling dis tance becomes less than the smal les t bed, the

counts of t rans i t ions s tabi l ize , but the sampling d is tance can be fu r ther reduced, in

the e x t r e m e case down to an inf in i tes imal small size. From the point, where the t rans i -

t ion counts (not yet probabil i t ies) become s table , one will find new t rans i t ions only along

the diagonal of the count ing matr ix . The counts along the diagonal will increase with

decreas ing sampling d is tance unti l the values equal the to ta l thickness of every l i thotype.

The sum of the diagonal e lements , the re fore , gives the length of the profile:

With respect to the content of the diagonal elements, therefore,

every interval-sampling process generates nothing but a measurement

of the total thickness of the different lithotypes within a profile --

measured as meter-intervals, cm-intervals or~ generally, in units

of the sampling distance.

The e l emen t s of the diagonal line, divided by the i r sum, provide the to ta l probabi l i ty

to find a l i thotype within the profile. If one computes t rans i t ion probabi l i t ies from this

count ing matr ix , then the probabi l i ty ma t r ix reads

91

(:: . . . . mi i / (mi i +Ni

. , o

n j i / ( m j j + N j ) (3 .32)

m..: th ickness of the l i t ho type 1; 11

n..: counts of t r ans i t ions f rom l i tho type j to i; jl N.: to ta l number of t r ans i t ions f rom l i tho type j to ano the r s t a t e .

l

This p robab i l i ty ma t r i x has a suspicious s t ruc tu r e . F i rs t , the p robab i l i t i e s a re not

d imens ion less numbers because the n . . ' s a re coun t s while the m . . ' s have the d imens ion 1J II

of a length measurement, and this is independent of the roughness of the intervals--

but, per definition, probabilities are dimensionless numbers. On the other hand, if we

take a small sampling distance on a profile, where the bed thickness is not small relative

to profile length, the diagonal elements approach one, and the transition graph (Fig. 3.12)

is dominated by singular loops.

The suspicious s t r u c t u r e of this p robab i l i ty ma t r i x b e c o m e s c l ea r if one s e p a r a t e s

t h e coun t ing ma t r ix into i ts i ndependen t pa r t s

O O

(3.33)

. . . . . . . . ° ' ° ° i] nil nj. i = j . . 0 . . . . + mii 0

. . . . . . I. 0 0 . . .

The t. . a re the real t r ans i t ions b e t w e e n l i tho types , the m.. m e a s u r e t he th i ckness of U 1l

the individual l i tho types .

F rom the m a t r i c e s on the r ight s ide one finds two independen t p robab i l i ty s y s t e m s - -

the t r ans i t iona l p robabi l i t i es f rom the t. . and the probabi l i ty or r e l a t i ve f r equency of li

a l i t ho type mi i / ~mii. It s eems , t h e r e f o r e , r a t h e r dangerous to t r a n s f e r the geologica l

or s e d i m e n t o l o g i c a l dec is ion ' w h a t can be def ined as a d i s c r e t e l i thological s ignal ' to

a p s e u d o - o b j e c t i v e sampl ing s t r a t e g y . What is the mean ing of the ' p robab i l i ty m a t r i x '

n.. when the th ickness of t he beds var ies s t rongly? In the case tha t t he beds have near ly 1j t he s a m e th ickness , t he addi t ional rule t ha t the sampl ing d i s t a n c e should be t aken near

the ave r a ge bed th ickness secures tha t the e s t i m a t e d ma t r i x a p p r o x i m a t e s t he t r ans i t ion

ma t r i x to some e x t e n t . In t he ca se of highly va r iab le bed th ickness , the ma t r i x n.. will q

no t make much sense . At leas t , if one can c o m p u t e a mean bed th ickness , one has some

idea wha t a bed looks like in the prof i le under s tudy, and why should one then go this

doubtfu l way?

92

3.3.2 Art i f ic ia l Pa t t e rn Formation in Strat igraphie

Pseudo-Time Series

It is known for a long t ime that several as t ronomic cycles in the order of 20 000

to 400 000 years may cause c l imat ic changes, and there has been a large number of

a t t empt s to re la te geological phenomena to these astronomic cycles, e.go there are several

good arguments in the case of marl - l imestone rhythms, coming from the proposed cl imat ic

changes (EINSELE & SEILACHER, eds. 1982). However, it is very hard to give a s ta t is t ica l

proof of the correlat ion between bedding phenomena and astronomic cycles. To do this

would require to have true t ime series of carbonate production and of clay influx under

control led conditions. But already the relation between t ime and sediment accumulation

is not known down to suff ic ient ly small intervals in any profile. Therefore, this problem

is usually ignored and a fairly constant ra te of sedimentat ion assumed {SCHWARZACHER

& FISCHER, 1982). An additional handicap is that most computer procedures for t ime- -

series analysis require equal interval samples. SCHWARZACHER & FISCHER (t982)

recommend equal interval samples at a distance which is not smaller than the thinnest

beds encountered with some frequency. Surely, the bed is the {possible} unit for an analysis

of cycles. It is the smallest visible f luctuation within the profile. If one has enough in-

formation about the cycles, one will get a good picture of the periodic process if one

chooses the sampling distance exact ly as half or as one wavelength. If, in addition, the

sampling points are located at the ex t rema of the periods, one gets a very simple linear-

ized approximation. For the s t ra t igraphic problem this would require to locate the sampling

points at the ex t rema of a l imes tone- sha le sequence at the boundaries and the centers

of beds, in order to describe the f luctuations of the carbonate content properly. But,

if the data have to be analyzed with a computer algorithm, equal distant sampling points

are necessary while bed thickness usually f luctuates . So, the computer obtrudes a cer ta in

s t ra tegy, whe ther we like it or not. The question is what we have to do in order to

get a physically or sedimentologically reasonable result and not only a meaningless product

of the computer . To elucidate some problems, two d i f ferent aspects will be discussed,

the sampling of periodic functions and the analysis of bed thicknesses. In the first case,

it will turn out that the propagated sampling distances can cause art if icial pa t tern forma-

tion, in the second exampl% we will see that a not properly defined s ta t is t ica l hypothesis

causes in terpre ta t ional problems.

A) Sampling of Periodic Functions

SCHWARZACHER & FISCHER (1982) recommend equal interval samples at a distance

which is not smaller than the thinnest beds encountered with some frequency, If we assume

the bed to represent approximately one wavelength of the smallest periods, we can study

the influence of the sampling distance for an idealized periodic process. A simple cosine

93

J

..V. TV_.V w,--->v v .-.

Fig. 3.13: Art i f ic ia l pa t t e rn format ion due to intervaI sampling (intervaI width near ~r) on a cosine signal. The cosine signal and the sampling pa t te rns , which resul t from intervals of exact ly the width ~r (but with d i f fe ren t s t a r t ing point), are drawn enlarged.

funct ion provides a model for, e.g., the f luc tua t ion of the ca rbona te con ten t (Fig. 3.13).

Sampling at d is tances of exact ly ~r will be successful if the s ta r t ing point of the samples

does not coincide with the inf lect ion point of the cosine function. In this special case,

no per iodici ty will be d e t e c t e d - - anyway, if the sampling dis tance is chosen as 2 ~ ,

then the periodici ty will vanish for every s ta r t ing point. For all o ther s ta r t ing points,

one finds ampl i tude f luctuat ions which cor rec t ly represen t the wavelength, but the t rue

ampli tude is only found if the sampl ing-po in t s coincide with the maxima of the cosine

function.

Now, it is unlikely tha t the sampling intervals have exact ly the length w ( o r 2Tr)

94

even in a computer simulation. Therefore , we disturb the sampling distance slightly, i.e.

we take dis tances Jr +~ {or 2 Jr+ ~); What happens, demonst ra tes Fig. 3.13. The original

cosine function is modulated, and new periodic pa t te rns of higher order occur which only

depend on the disturbance pa ramete r E . In the ex t r eme case, a new simple periodic

curve appears with several t imes the wavelength of the original signal. The error term

causes the sampling points to Vmove' along the periodic function, and depending on

the error pa ramete r this shift ing process adds an amplitude modulation to the cosine

function. Already for small values of the error term we get an infinite number of possible

higher periods and of pure chaot ic b e h a v i o r - - this depends on the quotient njr / {~+E );

if there exists a ~n' such that the quotient is a rational number, then we have a period

of n ~ ; otherwise, we have chaot ic behavior. Now, in a numerical sense, the sampling

distance near Jr is much too large to approximate the periodic function; an analysis

by equally spaced samples would require sampling distances of only a fract ion of J r

As the sampling distance becomes small in relation to phase length, we can safely use

equal dis tance sampling as well as the mathemat ica l 'machiner ies v of t ime-ser ies analysis

and control theory. In geology, however, we have to consider the sampling scale, and

this re la tes the problem to i te ra ted maps, where similar problems with distances occur --

like in the example of the finite logistic equation. The relation to i t e ra ted maps can

be i l lustrated in more detail.

The cosine function is a well defined map of a rotat ing radius vector of unit length

onto a Cartesian coordinate system and vice versa. The rotat ing radius vec tor can be

identif ied with a mass point, which moves with constant velocity on the circle. Equal

dis tance sampling, then, is equiwatent to a uniform motion of the mass point inside the

circle with regular e last ic collisions with the circular wall. The points of collision are

the sample points. This is a classical example of chaot ic motion (GRENANDER, 1978}:

If the first angle of collision is denoted by e {measured from the radius vector of the

circle), then the arc length between successive impacts is Jr -2 e (from planimetry). The

Fig. 3.14: Elastic motions in a circle.

95

resul t ing p a t t e r n depends on the ra t io 2~r /(~T-2@ }. If 2 @ and 7r are incommensurable ,

the system tends to long t e rm chaot ic behavior {Fig. 3.14); however, the system depends

also on the magni tude of @ . If @ is large, we have a high frequency of collisions, and

the pathway of the mass point approximates the circle. If @ is small, a qui te d i f fe ren t

pa t t e rn appears, like in Fig. 3.11b, where the pathway consis ts of quasi tr iangles, which

slowly ro ta te . The lines of motion envelop again a circle, and in t e rms of the approxima-

tion problem the re la t ion of the radii of the outer and inner c i rc le provides a measure-

ment for the quali ty of approximation (the outer c i rc le is the function which should be

recorded).

B} The Analysis of 'Bed Thickness ' by Equal Dis tance Samples

In the previous example we had two variables, profi le length and a second independ-

ent variable, for ins tance ca rbona te conten t . If rhythms are studied, usually the only

var iable is bed t h i c k n e s s - - the sum of this var iable being profi le length. Therefore ,

it is classically assumed tha t the beds have been deposi ted in equal t ime intervals , i.e.

the number of beds is t aken as a re la t ive measu remen t of t ime. Then bed thickness

is a measu remen t of the varying sed imenta t ion ra te , and these two var iables form an

independent f rame. An a l t e rna t ive sedimentological model is (SCHWARZACHER &

FISCHER, 1982) tha t the sed imenta t ion ra t e was fair ly cons tant , and tha t the bedding

phenomenon is due to f luctuat ions in the l i thological composit ion, as discussed in the

last example. To analyze this model in t e rms of bed thickness by compute r algori thms,

Schwarzacher & Fischer used a special sampling technique. They posit ioned the i r equally

spaced sampling points along the profi le and then measured the thickness of the bed

which was hidden by a sampling point (Fig. 3.15a,b). Therefore , some beds are lost while

o thers are recorded several t imes. The method has the e f f e c t tha t i t appears more simply

to decide which of several possible bedding planes has to be taken as the boundary of

the bed {there can be secondary fea tures due to solution). In addition, the method allows

to hope t ha t e r rors will be smoothed out by the ob jec t ive sampling procedure. In the

next step, they re la ted bed thickness to profi le length. The resul t ing graph (Fig. 3.15c)

can well be analyzed by methods like au tocor re la t ion and spect ra . Now, one can ask

again what happens if the sampling dis tance becomes very small. For an ' inf in i tes imal ly '

small sampling dis tance, the graph resembles a step function (Fig. 3.15d), and 'bed thick-

ness ' appears twice in this graph, i.e. the beds are represen ted as squares along the

profile. It is hard to ident ify this r ep resen ta t ion with a useful in te rp re ta t ion . The only

s igni f icant pa t t e rns are the s teps at the boundaries of the bed. Using only these in te r rup-

t ions {Fig. 3.1~e} one gets a ser ies of phase modulated signals along the profile. These

signals - - the bedding planes -- could be recorded di rect ly along the profi le by sed imen-

tological reasoning, and then, of course, with higher precision than by an 'equal in terval

method ' , where the dis tances are chosen by a rule like ' t ake the th innest beds which

96

Fig. 3.15: The analysis of 'bed thickness ' along a profi le {a). The points, at which bed thickness is measured, are taken in equal in tervals (b) and p lo t ted against prof i le length (c). The t rans i t ion to inf in i tes imal ly small sampling in tervals gives a sequence of ' squares ' (d) - - the remaining informat ion are the in te r rupts be tween beds (e). The assumption of fair ly cons t an t sed imen ta t ion ra tes allows to t ransform the phase modula ted signals (e) into ampl i tude modulated ones (f).

are encoun te red by some f requency ' . Anyway, the decision is necessary, what is and

what is not a bedding plane.

The only point, which is con t ra ry to the geological method ( the def ini t ion of bedding

planes), is tha t the signals are not equally spaced, and tha t they, there fore , cannot be

analyzed with s tandard compute r programs for t ime series. On the o ther hand, t ha t

can be easi ly done if the equal d i s tance sampling method is u s e d - - but can this be

a reason to use the less sui ted method, are we real ly ' s laved ' by the compute r? By

a s imple manipulat ion, the phase modulated system of bedding planes (Fig. 3.15e) can

be brought into a form which allows us to analyze the data by an 'equal s tep ' au to-

cor re la t ion program. The approach, so far, was tha t the sed imen ta t ion r a t e was considered

fair ly cons tan t , i.e. the d i s tance be tween the bedding planes provides a measu remen t

of the durat ion t ime of a sedimentological ' s ignal ' - - of a bed. Therefore , one can give

a plot ' number of the ini t ial signal ' or ' number of the in te r rup t ' against the 'dura t ion

97

t ime of the signal' (Fig. 3.15f). This t ransforms the phase modulated signal of bedding

planes into an amplitude modulated one with equal dis tances between the data points.

But now, it turns out that the two di f ferent sedimentological models become identical.

The assumption that the sedimentat ion ra te is fairly constant cannot be distinguished

from the model that the beds are deposited as events in equal t ime intervals because

the duration t ime, as defined, is proportional to bed thickness. This, of course, holds

only for the ' s ta t i s t ica l approach'; by geological reasoning it is usually possible to dis-

tinguish these two cases (EINSELE & SEILACHER, eds. 1982). Thus, one comes out with

the result that the connection of the depositional models with a s ta t is t ica l formalism

can cause a worse defined s ta t is t ical problem, in the sense that the s ta t i s t ics cannot

prove which of the sedimentological models is the cor rec t one while a successful s ta t i s t i -

cal tes t always seems to prove the a priori sedimentological model. The problem whether

an initial model is consis tent with the s tat is t ical analysis is not a trivial one; in topology

the analogous problem is genericity.

"To many scientists the 'genericity' problem has always been

interesting. What we are looking for is the following: given

a mapping f:U --~ R m, where U is an open set in R n, how can we

perturb f slightly to obtain a nicer and simpler mapping?" (LU, 1976).

The idea to analyze the sedimentological problem in terms of a mapping leads

to the following diagram

~ s 2 It'

The set of observations (location and thickness of beds: P) maps under the sedimentological

hypothesis s 1 (event sedimentation) onto the discrete space D 1 (bed number and bed thick-

ness). Under the hypothesis s 2 (constant sedimentat ion rate) the same set maps onto an

a l ternat ive d iscre te space. As turned out during the earl ier discussion, there exists a

map I which t ransforms D 2 into Dt, but only if D 2 is cons t ructed from infinitesimally

small sampling intervals, i.e. if D 2 contains all bedding planes. Otherwise, if the sampling

distance becomes suff icient ly large, D 1 cannot be const ructed in all details from D 2.

On the other hand, from D 1 we can construct all possible outcomes of the model and

sampling s t ra tegy s2(n) (n for the number of sampling points). The reason is that P can

be recons t ruc ted in all details from D 1 but not from D 2. Thus, diagrammatical ly

98

s 1 p "., D I

Sl -1

D2(o~)

and it turns out t ha t only D I is a gener ic r ep re sen ta t ion of the data , i.e. m a t h e m a t i c a l

analysis jus t i f ies the way via geological reasoning. In re t rospec t ion , the problems concern ing

Markov chains a re the same -- geologically and mathemat ica l ly .

3.4 CENTROID CLUSTER STRATEGIES -- CHAOS ON

FINITE POINT SETS

A wide field of geological and paleontological research focuses on c lass i f ica t ion

problems. A set of o b j e c t s - - samples, specimens e t c . - - has to be classif ied in such

a way tha t the e l ement s of a c lus te r show a maximum of ' s imi la r i ty ' while d i f fe ren t

c lus ters have a maximal dissimilari ty. As the numbers of objec ts and var iables become

large, it is convenien t to use compute r procedures to solve the c lus te r p a t t e rn recogni t ion

problems. The class ical approach into this d i rec t ion b e c a m e known as 'numer ica l t axono-

my' {SOKAL & SNEATH, 1964). Nowadays, the re exis ts a large number of a lgor i thms

which t ry to solve the p a t t e r n recogni t ion problem in a s t ra igh t forward way by use

of various s imi lar i ty and d i s tance measu remen t s {e.g. HARTIGAN, 1975; STEINHAUSEN

& LANGER, 1977; VOGEL, 1975). Clus ter ing s t r a t eg ies produce, in general , no unique

solution; an a l t e rna t ive dis tance measu remen t or even a d i f fe ren t input sequence of

the data can change the local and global s t ruc tu re of the c lus ters (VOGEL, 1975). In

addit ion, our opinion about the image of the s t a t i s t i ca l universe can fair ly diverge from

the s imi lar i ty s t ruc tu re which is gene ra t ed by a c lus te r ing s t ra tegy . There a re t h r ee

points to be discussed: the r ep re sen ta t ion of c lus te r s in binary t rees , image concepts ,

and s tabi l i ty problems with d u s t e r s t ra teg ies . The binary t rees imply t ha t they give

a c lass i f icat ion, but the comparison with c lass i f ica t ion t rees shows tha t this is not the

case. The image concept of most c lus te r s t r a t eg ies is far from our geomet r i ca l intui t ion.

99

It will turn out that this discrepancy explains much of the instable b e h a v i o r - - because

the clusters are not geometr ical objects, they cannot be used for a true classification,

and any additional e lement destroys the local s t ructure . Besides the instability between

clusters, there appears an instability within clusters, which is more striking because

it does not depend on some metadefini t ion of a cluster and leads to true chaotic behavior

on finite point sets.

3.4. I Binary Trees

The c luster s t ructure found by some algorithm is usually represented as a binary

tree, which i l lustrates the similari t ies or dis tances between the hierarchies of e lements

and clusters. The binary t rees in cluster analysis are just a picture of one possible simi-

larity s t ructure (VOGEL, 1975); ~hey are not at all a classif icat ion by some ordering

sequence. Thus, they do not allow to insert an additional object without changing the

s t ructure , at least locaiIy, nor do they allow to search an e lement by some decision

rule, as it is possible with binary t rees which represent a relation or ordering between

data. Binary t rees are commonly used as classif icat ion s t ructures that allow to search

and to insert e lements very quickly in sophist icated computer programs (WIRTH, 1975;

DENERT & FRANK, 1977), for a mathemat ica l discussion see e.g. SCHREIDER (1975).

The d i f ference between binary t rees and cluster t rees can be i l lustrated in the following

way:

classif icat ion trees:

(I)

B F t N

1\ /" /" C E G I K

(II)

(L 0/\i j , o'\ , / \ , / \ , I ) ) I I I ~ I

000 001 010 OII IO0 I0I IIO III

100

clus ter t ree:

!

{ A,B,C,D,E,F,G,H,I}

I

I {A,B,C,D}

{E 'F 'G} I

{A,B} { C,D} {E,F} r~ r-%

A B C D E F

{ A,B,C,D,E,F,G,H,I, J, K,L,M,N,O}

! { J,K,L,M,N,O}

{ E,F,G,H,I } t l 'i

[ { J,K,L} { M,N,O } i

,

{H,I} { {

G H I J K L M N O

The classif icat ion t rees represent an ordering relat ionship like 'before ' in t ree (I}:

A < B < C < ... < O. Therefore, it provides an optimal s t ra tegy to search e lements .

In the same way, objects can be classif ied by a sequence of binary decisions ' to have

or to have not a cer ta in property ' . Tree (II) gives an example for such a classif icat ion

of the binary numbers. This t ree allows to find e lements by the decision rules at the

nodes, and it allows to insert new e lements at its proper position, e.g. it is no problem

to insert a number with four digits. Tree (I) provides a minimal classif icat ion while

t ree (II) is an optimal dynamic classif icat ion because it allows to insert new e lements

without any a l te ra t ion of the present t ree s t ructure .

In contrary, the c lus ter t ree does not describe how to arrive at any one of the

e lements if one s ta r t s at the highest hierarchical level. As far as some distance meas-

urement exists be tween A, B, ..., 0, the cluster t ree simply s ta tes that A is more similar

to B than to C. But because the binary t ree cannot describe a higher dimensional variable

space, it does also not ensure that the dis tance be tween A and t:3 may not be identical

with the d is tance be tween A and C. This proper ty will la ter turn out to be the main

source for instable clustering within clusters. The cluster t ree only gives a possible struc-

ture of similari t ies and a possible grouping into s u b s e t s - - and not at aI1 a rule how

to distinguish the subsets.

3.4.2 Image Concepts

"Cluster analysis is one of the Pa t t e rn Recognition techniques and should be appreci-

a ted as such" DIDAY & SIMON (1980) write, and

"the general idea of all these methods is to build an identifica-

tion function (clustering) from two types of information.

101

0 O

O

Q O

O

O

O O

Fig. 3.16: A point pa t tern in the plane (left} and the intuitive geometr ical inter- pre ta t ion as a bounded object (right).

on one hand, the experimental result,

on the other, the a priori representation of a class or

cluster."

Although the concepts of clustering analysis can be well described in a formal mathemat i -

cal notat ion (DIDAY & SIMON, 1980; HARTIGAN, 1975), these concepts have some 'weak w

points. As Diday & Simon summarize:

"In fact these (the cluster) representations rely heavily on

the concept of potential or inertial functions. The 'classifier

specialist' selects this representation from the knowledge of

the properties of his experimental universe."

Indeed, to work properly with such techniques does not only require to understand the

mathemat ica l formulae, but also to understand the quali tat ive behavior of a cluster

s t ra tegy and the concept of 'c lus ters ' underlying a cer tain clustering process.

Given a point set in the plane (Fig. 3.16), our intention will be, in general, that

these points are the image of a two-dimensional object with a well defined boundary.

In a more general sense, one expects that the points are a sample from a n-dimensional

density distribution which is continuously defined in R n, and, therefore , one suspects

102

Fig. 3.17: Al te rna t ive pa t te rn concepts for the point set of Fig. 3 . t 6 - - a line {left) and the binary fusion s t ruc ture of a weighted centroid method {right},

that an increasing sample size gives a s ta t is t ica l universe which can be bounded by

smooth probabilistic surfaces. A consequence is that one identif ies automatical ly ' c lus ter

recognit ion ' with classif ication. Indeed, if one has found a cluster and has bounded it

by a closed surface, then any additional data point is accepted as a member of the

point set if it is located inside the bounding hypersurface. A point outside of this 'ob jec t '

will be associa ted with it if its inclusion does not disturb our concept of the image

too much. Thus, the geometr ica l objects, of our opinion provide classif icat ion rules, and

these are possibie because the geometr ical objects are stable.

Besides this intuitive concept , there are other possibilities to define 'c lusters ' .

The points of Fig. 3.16 can be connected by a line; the result ing image approximates

a two-dimensional curve (Fig. 3.17). The resulting 'objec t ' can be accepted, but it is

not the kind of things one expects in a two-dimensional space. The binary fusion pa t te rn

of a centroid cluster method (Fig. 3.17) diverges still fur ther from our opinion. That

these pa t te rn concepts appear somewhat strange, may have its reason in the feeling

that they are not stable, and, in fact , they are not stable in a s tructural sense. Any

additional point al ters these pat terns , at least locally, in an unpredictable way. Fur ther-

more, these objects genera te no classif icat ion for additional points, there is no relation

like ' inside' or 'outside ' , like 'belonging to ' e tc . Classif icatory objects for points in R n

103

require tha t they are bounded by a surface. Therefore , an object in R 3 must be t h r ee - -

dimensional i tself . Points, lines and open surfaces in R 3 are degenera ted objec ts which

can exist ne i the r physically nor s ta t i s t ica l ly , they are objects which cannot ' con ta in '

something. The defini t ion of a c lass i f ica tory object in a d iscre te n-dimensional var iable

space requires , the re fore , at leas t n+l data points or samples. These n+t points then

descr ibe just one object , a t r iangle in R 2, a t e t r ahedron in R 3, a hyper te t rahedron in

R n, A c lus te r and c lass i f ica t ion concept requires tha t the number of e l emen t s to be

classif ied exceeds c lear ly the number of var iables defining the position of the e l ement s

in a hyperspace. There should not be more than (n DIV m) o b j e c t s - - n: number of

e lements , m: number of variables, DIV: division of integers .

In the appl icat ions of c lus te r analysis it is commonly the case t ha t the number

of var iables equals, or even exceeds the number of e l ement s or samples. The only s t ruc-

ture one can find under such condit ions is an ordering by some s imilar i ty rule, but one

cannot expect tha t this provides a classif icat ion, as it is usually done. In a s ta t i s t i ca l

sense, ' c lass i f ica tory objects ' are closely re la ted to the convex hull of a f ini te point

se t {EFRON, 1965). In a topological sense, they are just h igher-dimensional analogues

of a smooth closed surface, a manifold. Because the point sets are finite, the cont inuous

d i f fe ren t i ab le s t ruc tu res are replaced by simplices and the i r unions, polygons and polyhe-

dra.

3.4.3 Stabi l i ty Problems with Cent ro id Cluster ing

Within the various types of c lus ter s t ra tegies , h ierarchica l methods are the ones

most commonly used, and within this subgroup the cent ro id methods are most popular.

The t e rm ' cen t ro id ' is here used in a very general sense. It simply expresses tha t two

fused e l emen t s are fu r ther represen ted by a single e lement , the eent ro id of the original

points. It is known from a respec tab le number of c lus te r s t ra teg ies tha t the cons t ruc ted

d is tance t r ee depends, to some exten t , on the init ial ordering of the input data {VOGEL,

1975). As Vogel found by extens ive tes t ing with one special da ta set, the cen t ro id

methods become par t icu lar ly sens i t ive to the ordering of the da ta if an en t ropy d is tance

measu remen t is used. Clus ter s t r a t eg ies with o ther d is tance measu remen t s showed this

sens i t iv i ty only on the lower h ierarchica l levels. In palecology one has very nice homoge-

neous data s t r u c t u r e s - - the f requencies of species within a sample. In this special case,

the usual me t r i c d is tance measurement s make not much sense while the entropy measure-

men t provides a ' na tu ra l ' d i s tance measurement for these f requency data of species

(BAYER, 1982). This d is tance measu remen t has, in addition, the proper ty tha t it approxi-

ma tes the Chi2- tes t for homogenei ty of the samples (DIDAY & SIMON, I980); homogenei -

ty of the samples being of special in te res t in palecological studies. Therefore , a program

104

~ t

-%.

Fig. 3.18: Class i f ica t ion and dis t r ibut ion of the foramini fe ra fauna of Todos Santos Bay. Left : Class i f ica t ion by hand (WALTON, 1955); right: Ordering by a c lus te r program {KAESSLER, 1966).

was implemented tha t al lowed to analyze faunal da ta by the ' en t ropy method ' (BAYER,

1982). The da ta of a *hand made ' ecological analysis (WALTON, 1955), which had la te r

been reeva lua ted by c lus te r analysis {KAESSLER, 1966), were r e - r eeva lua ted again to

see how the s t r a t egy works. Fig. 3.18 gives the spat ia l faunal dis t r ibut ion pa t t e rn of

foramini fera , which had been found by the previous workers. Already the first few runs

wi th the ' en t ropy analysis ' showed a s t rong dependence of the resul t on the ordering

of the input data . The nice s t a t i s t i ca l proper t ies of the en t ropy d is tance measu remen t ,

the re fore , were useless. But the use of o ther d is tance measu remen t s with r ep resen ta t ion

of the c lus te rs by the i r cent roids did not e i the r s tabi l ize the pa t te rn , in con t ras t to

VOGEL's (1975) claim. In fact , dependenc e from the init ial ordering of the data was

near ly the same. Fig. 3.19 gives some examples of various t ree s t ruc tures , which resul t

from sl ightly d i f fe ren t ini t ia l condi t ions and p a r a m e t e r se t t ings . The observat ion t ha t

it is not the choice of a special d i s tance measurement , t ha t causes the ins tabi l i t ies ,

was the s t a r t ing point for a more de ta i led analysis of the s tab i l i ty proper t ies of the

cen t ro id c lus te r s t ra teg ies . This s tab i l i ty analysis will now be outl ined.

The widespread use of cen t ro id c lus te r s t r a t eg ies is, in par t , due to the i r proper ty

to produce nice binary h ie ra rch ica l s t ruc tu res from every da ta set . The points found

to belong to a c lus ter are replaced by the i r centroid , which can be e i the r weighted

or unweighted. 'Weighted ' means tha t the cent ro id is computed from all da ta points

within a c lus te r while 'unweighted ' means tha t a computed cent ro id is fur ther t r e a t e d

like a da ta point, or tha t it has equal weight as a da ta point. In both eases, the progress

of c lus te r ing causes a concen t r a t i on of the original da ta points into fewer and fewer

r ep resen ta t ives , the centroids . Thus, every single fusion of two points evacua tes locally

the point space. As this concen t r a t i on of the points proceeds in a ce r t a in area, it forces

105

c / / °

t !

a

F

Fig, 3.19: Various classif ications with centroid cluster s t ra tegies of WALTONIs (1955) foraminifera data. Lower left and upper left: Two of a large number of possible cluster t rees which result only of an a l tered input sequence of the data. Upper right: a l tered metr ic , Lower right: d i f fe rent clustering algorithm,

106

the process to branch into o ther areas which are still more densely covered by data

points, and the same process is repeated. This coupled process of local evacuation and

subsequent branching into another area of the data space is the reason why every data

set is mapped onto a nice binary tree. The t ree s t ruc ture is due to the clustering proc-

ess, and it is not a property of the data set. The same process causes, in addition,

that the dis tances be tween the clusters increase monotonically, a ce lebra ted proper ty

of these s t ra teg ies (STEINHAUSER & LANGER, 1977) - - but an a l ternat ive viewpoint

is that these s t ra teg ies impose a binary t ree pa t te rn onto every data set .

Instabil i t ies be tween clusters are re la ted to the problem which images or which

point configurat ions can be accep ted as a cluster, i.e. one needs a metalanguage definit ion

of a cluster . Although the proper t ies of clusters and part i t ions, their homogeneity, are

usually defined in te rms of the metr ic , the idea of a 'good' c luster is mainly a geomet-

rical one. The question 'what is a good cluster ' is commonly reduced to the problem

how distant two clusters must be to be separable and whether concave clusters are

allowed or not. In clustering analysis the terms convex and concave are replaced by

homogenei ty and chain-homogenei ty (DIDAY & SIMON, 1980). At the moment , we may

use the ear l ier discussed geometr ical image concept to define 'good' clusters. Then,

concavi ty means simply that every line connect ing two points of a cluster is bound to

the interior of an hyperpolyhedron, that forms the boundary of the cluster. Consequently

the centroid is not necessari ly bound to the interior of a concave cluster (Fig. 3.2O).

And because every fusion of two points increases locally the distances, the process can

reach a s t a t e where the image breaks into disconnected pieces, and where these pieces

are connec ted with data points which form separa te clusters under the geometr ical image

concept (Fig. 3.20). The striking point is that in such cases the centroid method does

not follow the 'rule of neares t neighborhood'; not the clusters, which appear closest

because their boundaries are closest , are fused, but the more distant clusters are connec t -

ed (Fig. 3.20). This behavior elucidates again that such a pa t te rn representa t ion cannot

work as a classif icat ion, at least not as a 'natural c lassif icat ion ' ; in some respects ,

this pa t te rn concept is even cont rad ic tory to our understanding of similarity.

So far, a metadescr ip t ion of clusters was necessary. One can also simply accept

that a c lus ter formed by a centroid process has nothing in common with our geometr ica l

imagination. If it can be uniquely cons t ruc ted by an algorithm, it may be a totally ab-

s t r ac t s t r u c t u r e - - but why should it not be as real as geometry? That clustering does

not produce such a well defined abs t rac t pat tern, wiil be shown by a discussion of insta-

bil i t ies within clusters . To avoid any assumption about the s t ruc ture of clusters, one

has to res t r i c t the a t ten t ion to local s t ruc tures as they are defined by the clustering

process i tself . Then one can analyze how a local disturbance propagates into the es tab-

lished hierarchy. Fig. 3.21 gives two cluster pa t te rns over a given point set (compare

107

E3

121

Q . . . . . . . . i.

Fig. 3.20: Branching between clusters. The smaller c luster will be fused with the ' rectangular points ' because the formation of centroids changed the local topology of the point space. Consequently the larger cluster will be fused with the ~triangular points ' in a later step.

Figs. 3.17- 3.18). This time, an unweighted centroid method was used to find the similari-

ty s t ructure . This method causes a local decision problem because there are two choices

for the same point to be fused, two neighbors have equal distance. Dependent on this

choice, two di f ferent local pa t te rns are generated {Fig. 3.21). These local a l ternat ives

project onto the global t ree s t ructure changing the distances or similari t ies all the way

through the cluster t ree (Fig. 3.21). The di f ference is strong enough to change the overall

t ree pat tern . Thus, one of the two t rees appears more compact , and, therefore , it will

be more easily accepted as a larger cluster if the t rees are embedded in a more complex

cluster t ree. On the lower cluster level the substructure appears also more compact

in the right t ree of Fig. 3.21 while in the lef t one two clusters can still be distinguished

on the lower level. The decision, which one of the two clusters will be produced, depends

only on the ordering of the data. The computer algorithm has ei ther to take the first

pair of data points with smallest distance or the last one for f u s i o n - - a consequence

of the binary t ree representat ion. Once made the decision, the local topology of the

point pa t te rn has changed, and, therefore , the local decision projects onto all la ter fusions

contact ing this area. Thus, a true bifurcation of the solution occurs, and it depends

108

/ %. %.

Fig. 3.21: A single local decision problem (arrows) between two equally distant points a l ters the local t r ee s t ruc ture and the dis tances within the ent i re hierarchy.

only on the initial conditions (ordering of the data) which branch of the solution will

be taken.

The next question is whether there exist s i tuations that cause cascades of bifurca-

tions. In this case, the clustering process would become chaotic, as far as this is possible

on a finite point set. Such a sequence of bifurcations can be easily const ructed by just

109

a

C

Fig. 3.22: Centroid clustering as a decision game. The figure gives two possible cluster solutions for the identical, equally spaced point set. a) Every fusion causes the bifurcation into two identical subsystems; b) the associated fusion tree; c) the cluster trees.

choosing the most degenerated case of input data -- equally spaced data points. It does

not ma t t e r whether these points are arranged on a straight line, on a circle or on a

regular (hyper-)grid. Equally spaced data points on a straight line provide the most simple

case; Fig. 3.22 i l lustrates what happens in this special case from various viewpoints.

First , the clustering process has a dynamical aspect - - at the same time, only two points

can be fused, and the fusion of these two points al ters the local dis tance s t ructure

between the points. In the case of Fig. 3.22, the initial dis tance of izhe equally spaced

points, ax, is a l tered to (3ax)/2 between the centroid and its neighbors. Therefore, the

data space is divided into two subsets of points which have still the original s t ructure .

But the two substructures are now independent; they can be t r ea ted as parallel processes

of identical behavior. This bifurcation process proceeds until there are no data points

lef t with the original spacing. Within the limit of a finite point space, any fusion on

one of the subsets genera tes new identical subsets, every bifurcation of the dynamical

system genera tes two dynamical systems of the identical type. In addition, any fusion

is a random choice be tween several possibilities, and the decision, which pair of data

points is taken, depends only on the initial data configuration. One has a per fec t ly chaot ic

110

system on a finite point set . Because the point set is finite, the process te rmina tes

finally. There are, for every subsystem, two main possible outcomes which depend on

some initial choices of the fusion process {Fig. 3.22). One possibility is that during t ime

all points are fused into pairs. Then the resulting centroids are again equally spaced

(Fig. 3.22 left}, only with a l tered distances. Thus, the process is cyclic. The a l ternat ive

is that already the initial fusions determine, to some extent , the further progress because

there remain some points of smallest dis tance on every level which dominate the process

{Fig. 3.22 right}. The centroid clustering techniques, therefore , provide excel lent examples

of chaot ic behavior on finite point sets. In the case of equally spaced data, every slight

change of the input data will cause another cluster t ree, i.e. the system is ex t remely

sensit ive to the initial conditions. Because the number of data points is finite, the number

of possible outcomes is f inite as well, it is the number of disjunct permutat ions of the

data. Therefore, the chaot ic aspect of the clustering process will increase with the num-

ber of (equally spaced} data, i.e. one has the same si tuation as with the Chi2- tes t against

a uniform distribution or the t ra jec tor ies of part icles in Galton's machine.

Now, a prac t i t ioner could argue that the chaot ic behavior of the centroid c lus ter

s t ra teg ies requires a very special and degenera ted data s t ructure . On the other hand,

it is well known that most c luster s t ra teg ies depend to some ex ten t on the input sequence

of the data {VOGEL, 1975}, and it seems likely that this phenomenon is related to the

discussed instability. The remaining question, therefore , is how this instability can arise

in a s ta t i s t ica l sample. There are two s t ruc tures which can enforce the chaot ic behavior

with any kind of data. The first one is that the data are measured with a cer ta in preci-

sion. Thus, with increasing number of data one will get an increasing number of equally

dis tant data. Because multiple values at the same point do not change the local topology

if they are fused under the centroid condition, the data pa t tern tends, at least locally,

towards equally spaced data. And once more we have the si tuation that an increase

of data does not stabil ize the process, as one should expect from s ta t is t ica l reasoning,

but that it destabi l izes the clustering process. In addition, once more one finds that

a uniform or an equally spaced distribution leads to instability. This s t ruc ture probably

is the main reason why the palecological data cause particularly instable clustering.

The frequencies are natural numbers which enforce equal spacing locally. The second

s t ructure , which can enforce chaot ic behavior, is a numerical one. Every computat ion

with a f inite number of digits causes numerical rounding or truncation errors. An excel- 10000

lent example due to WIRTH (1972) is the computat ion of the sum i_~l/i , which takes

d i f fe rent values if it is computed forward and b a c k w a r d - - the numerical error a f fec t s ,

e.g., 3 of 12 valid digits of a 48-bit machine. The same problem occurs for the compu-

tat ion of dis tances between data points (and centroids). If the number of variables is

high enough, one has to expect that the dis tance be tween two points is not symmetr ic ,

e.g. in the case of the Euclidian distance one has

111

}~ (Xli-X2i)2 ~ ~ (X2i-Xli)2 .

Therefore , this e r ror will also depend on the input sequence of the data , and this t ime

the error increases with the number of var iables and with the complexi ty of the d is tance

measurement . This can explain why the ' en t ropy dis tance measu remen t ' is especial ly

instable. Logar i thms are necessary to compute this d is tance measurement , and this en-

forces numerica l errors .

Thus, the centroid c lus ter methods turn out to be highly instable due to the i r pat-

t e rn recogni t ion concept . They behave in a chaot ic manner with respec t to sl ight changes

of the ini t ial conditions, namely the ordering of the input data . The chaot ic behavior

is t r iggered by the tendency of measured data to form, at least locally, regular spacings,

and by the numer ic error of the dis tance measurement , mainly by i ts a s y m m e t r y - -

whereby the computa t ion order depends again on the input sequence of the data . in

to ta l i ty , it turned out during the last two chapte rs tha t most operat ions on f ini te point

sets with the compute r have to be handled with special care. The local methods, which

domina te this field, are usually ex t r eme ly ins table due to small changes of the boundary

condit ions and of the init ial conditions. What we, in general , would need, are r a the r

more rigid topological methods than the highly sophis t ica ted numerical procedures.

(3C

Fig. 3.23: Class i f ica t ion by probabi l is t ic neighborhoods. Points with overlapping

neighborhoods belong to a c lus te r with a well defined boundary (right).

112

Such a - - p r o b a b l y m o r e - - r i g i d method was proposed by GRENANDER (1981}. The

idea is similar to the single linkage methods; however, it has a more geometr ic back-

ground, and this returns to the discussion in sect ion 3.4.2. A probabilistic neighborhood

is associated with every point: e.g., circles in the plane, spheres, hyperspheres or a l terna-

t ively polyhedrons (Fig. 3.23t. Every point has a cer ta in probability to be found elsewhere

in this neighborhood, and a possible assumption about the probabili ty is ' tha t the curvature

at the boundary is proportional to the density of the probability measure, ~ GRENANDER

(1981), or the probabili ty decreases , as the d iameter of the probabilistic neighborhood

increases. Two points are grouped in a similari ty c lus ter if their probabilistic neighbor-

hoods overlap. If the radius e of the neighborhood is fixed to a cer ta in value, then a

unique part i t ion of the data set arises, and we get part i t ions of d i f ferent roughness

for d i f fe rent values of e. The resulting s t ructure is not a t ree because two or more

e lements may fuse at once for a cer ta in value of e. However, the resulting 'c lus ters '

have now well defined boundaries as i l lustrated in Fig. 3.23. The point set of a ' c lus ter '

is bounded by a ' tubular neighborhood ~, and 'c lus ters ~ are d i f ferent on a cer ta in e - l eve l

if their boundaries do not overlap. Thus, given such a classification, we can later add

samples and question whether they belong to a ' c lus ter ~ for a cer ta in e -va lue . Clearly,

the classif icat ion is not s table in the s e n s e that an additional object may cause two

or more ' c lus ters ' to fuse without changing the e-level. However, that is a quite more d i f ferent

instabili ty than those discussed above. The classif icat ion cer ta inly stabil izes as the number

of data points increases and approaches the universe. This approach returns to the exam-

ples of the second chapter and is closely re la ted to the continuation problem discussed

there .

3.5 TREE PATTERNS BETWEEN CHAOS AND ORDER

The previous presentat ion focussed at pract ical problems which arise from unstable

algorithms. Here, the viewpoint is much more theoret ical , and the questions are mainly

conceptual . The sect ion serves as a kind of summary of the previous chapters and con-

nec ts them with the following one. The morphology of t ree- l ike s t ructures wilt be the

topic of this section, especially the connection between branching pa t te rn and overall

shape, which may arise under cer ta in conditions: If one closely studies the crown of

a t ree, the branching pa t te rn appears ra ther irregular while an ent i re view of the crown,

from some distance, is charac te r i s t i c on the spec i e s - l eve l (Fig. 3.24). On this basis

HONDA (1971) a t t empted to describe the multifarious form of t rees by a few parameters

of the branching pat tern . This a t t empt , however, was not new: Already D'Arcy Thompson

discussed the cymose inf lorescence of the botanists and noticed that these botanical

pa t te rns are 'analogous in a curious and instruct ive way to the equiangular spiral'

(DVARCY THOMPSON, t952); and, as G.L. Steucek informed me, branching pat terns

of t rees were already studied by Leonardo da Vinci.

113

Fig. 3.24~" Tree images.

Rather similar branching pat terns occur in geomorphology, the network of drainage

systems (Fig. 3.25). Trees and networks are equivalent on a cer ta in level, e.g. a t ree

genera tes a network if two branches~ which come into contact , fuse ra ther than over-

l a p - - a situation which necessarily occurs if the branching process is bounded to a

Fig. 3.25: The simplified drainage nets of the Amazon (left} and the Ganges del ta

(right).

114

surface or is res t r i c ted to a nearly two-dimensional object . The network in leaves

provides an example for the la t te r case. As a concession to the l i terature, the t e r m

' t r ee s ' will be equivocally used as the term 'open networks ' .

The botanical a t t emp t s toward a description of branching pa t te rns in t rees are

mainly de terminis t ic and have, therefore , be cr i t ic ized. Thus NIKLAS (1982) wri tes in

a study on plant branching simulations:

"Unless the extent of apparent order that can arise from random

processes can be determined, there is no valid basis for assert-

ing that a particular structure implies deterministic causes."

The meaning of ' random' , however, is ra ther vague. If we a t t r ibu te a probability to

a growing system to branch or to branch not during a growth step, we may al ternat ively

formulate a de terminis t ic system which branches at the beginning of every growth step,

but may be disturbed by some external cause which inhibits branching, and such external

events may be truly s tochast ic . This aspect was clearly expressed by OSTER & GUCKEN-

HEIMER (1976):

"If a series of . .. cerises are collected . .. , and they appear

chaotic, exhibiting no perceivable regularities, then we conclude

one of three things:

(a) the system is truly stochastic--dominated by random influ-

ences

(b) experimental error is of such a magnitude that all regulari-

ties are obscured~

(c) a very simple deterministic mechanism is operating, but

is obscured " (by the phenomenon described here in section

3.1.1).

This is a viewpoint which now seems to be common in physics (e.g. HAKEN, 1981;

THOMPSON, 1982). Examples of points (b) and (c) have been given in the previous

sections, and experimental ly it seems more appropriate to exclude points (b) and (c)

than to prove point (a); the failure of the Chi2- tes t in connection with a uniform distri-

bution i l lustrates this point: There is still enough to be de tec ted on the de terminis t ic

level in which s tochas t ic e lements may en te r in te rms of disturbances.

Branching pa t te rns of t ree- l ike bodies are of some in teres t because they are not

r e s t r i c t ed to trees, they occur commonly in biology and paleontology, on the level of

organisms and on the level of organs. Besides this, they are even interest ing objects

in geology, geomorphology, and physics. The discussion of met r ic t rees is, to some extent ,

based on a study by Bayer & McGhee (An Analytic Approach to Branching Systems,

115

prepr int 1984), a study which was ini t ia l ized by a paper given by G.L. Steucek at Tfi-

bingen in 1984o

3.5.1 Topological Proper t ies of Open Network P a t t e r n s

In geomorphological studies open networks and t ree pa t t e rns natura l ly or ig inate

at the sources of s t reams, a viewpoint which is opposite to the bo tan i s t ' s one where

the branches usually or ig inate at the trunk. However, in t e rms of morphomet ry botanis t s

t ake a viewpoint ident ical to the geomorphological one. The s t r eams or b ranches are

ordered by the descr ip t ive "s t ream number", which is due to S t rah le r {e.g. BARKER

et al., 1973; GRENANDER, 1976):

¢,./ , =",A ,-. / k

Fig. a.26: Trees with branches ordered by S t rah le r numbers. All t rees have equal numbers of end branches.

Defini t ion 1: The S t r a h I e r n u m b e r ' s ' , or the order of a

single branch, is given by the following rules:

(1) end branches have order s=l

(2) two branches of order s produce a branch of order s+l

(3) two branches of d i f fe ren t order (Smax,Smi n) produce a branch of order Sma x,

The highest S t rah le r number S involved in an open network is called the

o r d e r o f t h e n e t w o r k .

The St rahler number {Fig. 3.26) of branching sys tems has special proper t ies (e.g. BARKER

et al., 1973; GRENANDER, 1976). In par t they are based on empir ical observat ions,

in par t they are deducible from the definit ion. Thus, if the numbers of b ranches of

each order are counted, then the logar i thm of the number of branches in each order

p lo t ted against the St rahler number gives a l inear plot: i.e. log n s = a+bs; where Us:

the number of branches of order s {BARKER et al., 1973); or, an observat ion from

116

g e o m o r p h o l o g y (HORTON, 1945), t h e b i f u r c a t i o n r a t i o s a r e a p p r o x i m a t e l y c o n s t a n t . Le t

aga in n s be t h e n u m b e r o f b r a n c h e s o f o r d e r s, t h e n t h e b i f u r c a t i o n

r a t i o

n s _ l / n s = R s (3.34)

is a p p r o x i m a t e l y c o n s t a n t , i .e t h e n u m b e r s ns_ 1 and n s a r e in a g e o m e t r i c a l r e l a t i o n -

ship. T h e s e ' e m p i r i c a l l aws ' , h o w e v e r , a r e no t u n e x p e c t e d , t h e y r e s u l t f r om t h e de f i n i t i on

o f t h e S t r a h l e r n u m b e r . By de f i n i t i on e v e r y n e t w o r k o f o r d e r s m u s t c o n t a i n a t l e a s t

two b r a n c h e s o f o rde r s - I , and t h e s a m e holds for any s u b s t r u c t u r e in t he t r e e w i th

s > 1. T h e r e f o r e ,

n s -2ns_ I > 0 or ns_i/n s > I/2. (3.35)

In the case ns_ l = I/2, the bifurcation tree is a perfect symmetric tree. The 'empirical

laws' are expected whenever the deviation from the binary tree is random, i.e. whenever

branches are randomly suppressed and added with a zero-mean.

A n o t h e r m e a s u r e m e n t , wh i ch is s o m e t i m e s use fu l , is t h e n u m b e r o f end b r a n c h e s

o f a t r e e or a b r a n c h ( G R E N A N D E R , 1976):

1 1 \1 1 /

1 1 1 1 1 1 1 2 1 2 1

, 7 2 1

3 1 8 8 1

Fig. 3.27: The t r e e s o f Fig. 3.26 wi th b r a n c h e s e n u m e r a t e d by ' n u m b e r s o f c o m - p l e x i t y ' .

D e f i n i t i o n 2: The n u m b e r o f c o m p 1 e x i t y

o f o rde r s is t h e n u m b e r o f i t s end b r a n c h e s .

N of a b r a n c h c

Fig. 3.27 i l l u s t r a t e s t h e c h a r a c t e r i z a t i o n of trees by t he i r n u m b e r s o f c o m p l e x i t y . The

n u m b e r o f end b r a n c h e s and t h e o r d e r o f t h e n e t w o r k a r e not i n d e p e n d e n t . T h e r e l a t i o n -

sh ip b e t w e e n t h e n u m b e r o f c o m p l e x i t y a n d t h e S t r a h l e r n u m b e r is s imp ly

N > 2 S - l , (3.36) c

I I7

and the equal i ty holds only if the t r ee is a pe r fec t symmet r ic t ree . To prove this rela-

t ionship it is helpful to introduce

Defini t ion 3: The s k e 1 e t o n of an open network is i ts largest symmet r i c

subst ructure .

The skeleton is cons t ruc ted by e l iminat ing the branches with S t rah le r number Smi n

at nodes where SL,i_ 1 ~ SR,i_ 1,

The remaining t ree conta ins only nodes where the S t rah le r number increases and, the re -

fore, is pe r fec t ly symmetr ic . Along a pathway from an end branch to the trunk the

Nc 4 s ' 3 2 1

SKELETONS S=1,...,5

, ,r - V -

Fig. 3.28: Trees of complexi ty Nc = 3,4, and 5 (s:Strahler numbers) with dis t inct pe rmuta t ions of branch pa t te rns . Note tha t these pe rmuta t ions are not all topologically d is t inct as indicated by brackets . Lower left : e l emen ta ry skeletons of S t rahler numbers 1 to 5.

118

St rah le r number simply counts the number of b i furcat ions . If we inver t this pathway,

the number of branches of equal S t rah le r number increases monotonously like 20 , 21 , . . . . . . . 2 S-1 2 2 , , 2 S-s, Arr iving a t an end branch the number of end branches is Nc= .

Rela t ion (a.a6) now is easi ly proved: If the t r ee is symmetr ic , then the equal i ty holds

as was shown, o therwise branches have been e l imina ted during the cons t ruc t ion of the

skeleton, and, the re fore , end branches have been e l iminated , i.e. t he inequal i ty in equat ion

(3.36) holds. This discussion provides us fur ther with a topological i n t e rp re t a t ion of the

S t rah le r number: It uniquely def ines the e 1 e m e n t a r y s k e 1 e t o n of

a t r ee or i ts subs t ruc tures (Fig. 3.28), where ' e l e m e n t a r y ' means tha t the length of

b ranches is not considered (empty nodes of the skeleton).

Another quest ion is how many d i f fe ren t t rees with ident ical number of end branches

do exist , or general ly: Given a ce r t a in number of end branches , how many d is t inc t ne t -

works can be genera ted? GRENANDER (1976) gives the following solution, which is due

to SHREVE (1966):

The number of open networks N(n) for any branching sys tem with n end branchings

is

N(n) = (2n-2)! (n! (n-l)!) -1, (3.37)

and the number N(nl ,n2, . . . ,nS_l , l} of topological dis t inct networks of order S with

nS_l , t b ranches of order 1 .. S is n 1

S-1 N(nl,n 2 .... nS_l,l ) = 11 2(ns-2ns+l) ((ns-2)! ((ns'2ns+l)! (2ns+l-2)!)-l. (3.38)

s=l

These two equat ions provide us with a probabi l i ty measu remen t of finding an open

network with S t rah le r numbers nl ,n2, ... ,ns_l,1; when the number of end branches

is given, the d i sc re te probabi l i t ies are

P(nl ,n2, . . . ,ns_l , l ) = N(nl ,n 2 ..... ns_l , l ) / N(n). (3.39)

Fig. 3.28 i l lus t ra tes the networks which correspond to N(3)=2, N(4)=3, N(5)=14; i.e. the

various combina t ions of branches . However, these t rees are not all topologically dis t inct ,

some of them are simple symmet r i c inf lect ions and cannot be distinguished: Let the

t r ees be the two-dimensional projec t ion of a th ree-d imens iona l image, then the inflec-

t ions mean simply tha t an observer walked around the object by 180 ° and then classif ied

it d i f fe ren t ly because the symmet r i e s have changed. The number of topological d is t inct

t rees is only

t19

N T = N{n) div 2 + N(n) mod 2. {3.40)

The d i f fe rence of the two concepts is not tr ivial , e.g. the probabi l i t ies will be d i f fe rent

under the two hypotheses: The probabil i t ies for a t r ee with four end branches {Fig. 3.28)

are p(3,2,1) = N(3,2,1)/N(4) = 1/3 and p(2,1) = 2/3. Based on topological similari ty, how-

ever , we find the probabi l i t ies PT{3,2,1) = PT(2,1) = 1/2, and the geomet r ica l hypothesis

inf luences the conclusions we may draw from the probabil i t ies .

Finally it seems worthwhile to emphasize tha t the probabi l is t ic aspect , ment ioned

briefly, is based on the comparison of an observed pa t t e rn with a to ta l ly de te rmin i s t i c

pattern~ the symmet r i c t r e e - - a possible question would be: How strong do we have

to disturb a binary t ree to ar r ive at the observed t r ee pa t te rn . The concept of skeletons

can be used to emphas ize this point. If the symmet r ic t rees wi thout repe t i t ions of empty

nodes -- the e l emen ta ry s k e l e t o n s - - are defined as pr imi t ives of our t r e e s t ruc tures ,

then any t r ee can be defined as 'p roduc t ' of e l emen ta ry skeletons: A node is the product

{Si,S]}, where the f irst t e rm means left , the second r ight branching {for symmet ry reasons

this no ta t ion is arbi t rary) and Si defines an e l emen ta ry skeleton of order (Strahler

number) s=i. The t rees of Fig. 3.26 can be described as l ists (Fig. 3.29)

",/ N/ x / V v v V-

"9' <L V,,/_v V , , / ..... /

Fig. 3.29: The t rees of Fig. 3.26 decomposed into e l emen ta ry skeletons.

(a) S3: (($2: ($2,0), O, ($2: (0, $3: ((S1,$2), O, O, $2), O)

or (($2,0), 0 , (0, (S1,$2), 2*0, S2), O)

(b) S3: ((S2, S2, S2: (0, S3: ((Sl,S2), O, O, S2)), O)

or (2 '$2, (0, (SI,$2), 2 ' 0 , $2), O)

(c) $4: (0, O, O, O, S2: (0,S2), $2, $2, $2)

or (4*0, (0,$2), 3"$2)

where the b racke t s conta in descr ipt ions of the end branches clockwise; the t e rm

I20

~O ~ indica tes t e rmina t ion of the branch, and t.~ means repet i t ions . The descr ipt ion is

recurs ive and, of course, could be reduced to binary decisions or s-expressions which

form the basic s t ruc tu re of the programming language LISP. E lementa ry LISP and the

lambda-ca lcu lus provide ano ther conceptua l f ramework to handle such s t ruc tu res (e.g.

DENERT & FRANK, 1977).

3.5,2 P a t t e r n Gene ra to r s for Open Networks

The theme of the previous sect ion was the descr ipt ion of t r ee pa t te rns . A common

problem, however, is the s imulat ion of t r e e pa t t e rn s and the quest ion how many dif fer-

ent pa t t e rn s can be gene ra t ed from some simple rules .Essent ial ly , the re are two ap-

proaches: t r ans fo rma t ions of qua l i ta t ive proper t ies and me t r i c models. Two examples

will be discussed here. In both cases, the process envolves some pr imi t ive e l emen t s

and de te rmin i s t i c product ion rules. However, one can adapt probabi l i t ies to the bi furca-

t ion events and then s imula te randomly dis turbed pa t te rns ; such sys tems were eog.

used by Raup to ana lyze whe the r phylogenet ic t rees are random s t ruc tu res (RAUP

et al., 1973).

A) Algebra ic Models -- P ro to types of Branching P a t t e r n s

One class of models, which genera tes regular s t ruc tu res inclusively t r ee pa t te rns ,

is based on the t r ans fo rma t ion of ~alphabetsV: A s t r ing of symbols like (a,b~c,...) charac-

t e r i zes qua l i t a t ive proper t ies of an object , and a se t of t r ans fo rmat ion rules like

(a ~ a d , b ~ c, ...) def ines the evolut ion of these propert ies , for ins tance how they

t r ans fo rm during t ime. Such sys tems have been extens ively applied to deve lopmenta l

sys tems by LINDENMAYER (1975). The theory of such t r ans fo rma t ion groups is much

older and closely r e l a t ed to the theory of permuta t ions ; for a de ta i led discussion see

ASHBY (1956, 1974). A typical sys tem, which produces t ree pa t te rns , is (LINDEN-

MAYER, 1975):

the a lphabe t or se t of pr imit ives: (a,b,c,d,e)

the product ion rules: (a --~db, b ~ c, c - ~ d , d ~ e , e --- a),

s t a r t ing with the e l emen t ~a ~ gene ra t e s the t r ee

121

a

A i I e c

I I a d

I I I e c a

ll \ b a d d

I I t I d e e c

I n s p e c t i n g t h e t r e e one f inds t h a t it c o n s i s t s o f t he r e g u l a r r e p e t i t i o n s of s t r i n g s (d,e,a)

and (b,c). The r e d e f i n i t i o n A = (d,e,a) , B = (b,c) and t h e mod i f i ed p roduc t i on ru l e s

(A - ~ AB, B - - - A ) s i m p l i f y t he t r e e p a t t e r n

A

A A B

/ \ \ A B A /! f \ \

A A B

I 1 \ t",,,,\ A A B A [3 A

which , h o w e v e r , c o n t a i n s t h e s a m e i n f o r m a t i o n as t h e or ig ina l t r e e b e c a u s e we can

r e s t o r e it by t he i n v e r s e s u b s t i t u t i o n . The two t r e e s a r e e q u i v a l e n t , and we can p r o d u c e

m o r e e q u i v a l e n t t r e e s by s u b s t i t u t i o n s o f t h e t y p e A = ( a l , a 2 . . . . . an) , B={bl ,b 2 . . . . . bn} ,

w h e r e t he s e t s o f p r i m i t i v e s in A and B a r e d i s junc t , i .e. do not c o n t a i n i den t i c a l

e l e m e n t s . Two t r e e s c a n now be de f i ned as e q u i v a l e n t if it is poss ib le to r e d u c e t h e m

to t h e s a m e p r i m i t i v e fo rm; t h e r e d u c t i o n s , wh ich a r e a l lowed, a r e t h o s e wh ich do

not a l t e r t h e s i ngu l a r p a t t e r n o f t h e p roduc t i on ru les , i .e. wh ich p r e s e r v e

t e r m i n a t i o n po i n t s l ike A ~ A,

b i f u r c a t i o n po i n t s A - - B C ,

and loops A -*- B ~ A.

122

The problems, which arise if o ther reductions are used, were briefly mentioned during

the discussion of Markov chains in sedimentology.

The reduct ion process may be i l lustrated by a somewhat more compl ica ted

system:

the alphabet: (a,b,c,d,e, f,g,h,i,k)

the production rules: (a - - b c , b - - kd, c -,- Ik, d --~ gb, e --- cf, f - - ih,

g - - hi, h - - de, i - - k, k --- k),

which are ra ther compl ica ted and genera te an evenly compl ica ted pa t te rn

a

bc

kdek

kgbcfk

khikdekihk

kdek kgbcfk kdek

kgbefk khikdekihk kgbcfk

khikdekihk kdek kgbcfk kdek khikdekihk

The system was used by LINDENMAYER {1975) to describe the s t ruc ture of compound

leaves. The strings of symbols were in te rpre ted as ceils along the margin of the leaf

which posses cer ta in morphological propert ies . We try to reduce the complicated produc-

tion rules and observe first that the e lement 'a v occurs only once; indeed, it serves

only as an ent ry point. Regular repet i t ions of substrings in the t ree occur from the

third production downward, these are A=(kdek), B={kgbcfk), C=(khikdekihk), and the

production rules can be simplified {A -~ B, B -~- C, C -- ABA)

B

J C

/IN A B A

/ I \ B C B

/ / t \ \ C A B A C

A B A B C B A 13 A

The reduced system produces a nice t ree with triple points at C, a s t ruc ture which

123

was not obvious from the original production rules, but, clearly, the reduced system

is equivalent to the original o n e - - if we subst i tute the original values for A,B, and

C, we have the original productions.

Inspecting the e lements A,B,C it turns out that they all are of the form (k ..o

k), i.e. the ' k ' - e iements act as brackets of the original strings. Only in the e lement

C, k's occur within the string. If we want that this fine s t ruc ture is not lost in the

reduced version, we can introduce the radicals H=(k,h,i) and HT=(i,h,k) replacing the

e lement C, which now reads C=HAH T, and the productions are

A

J B

I HAH T

/ 1 \ A B A

/ \ B B / J \ \

A B A HAH 'F

which again comprise the s t ructure of the original rules.

This sect ion a t t empts to i l lustrate that the reduction of complicated systems

to equivalent simple ones is an essential step in pa t tern recognition problems. However,

such reductions can only involve those t ransformat ions and redefinit ions which do not

a f fec t the singular s t ructure of the original system. Using the concept of equivalence

and proper reduction processes may also save much investigation t ime because a ra ther

complicated system may turn into a well known simple one, and the classif icat ion of

such s t ruc tures reduces to a (not necessari ly finite) catalogue of prototypes.

B) A Metric Model - - the Honda Tree

HONDA's (1971) approach was to formulate a pa t t e rn -genera to r which allows to

study the form of t rees by means of the computer, based on a minimal set of parameters

{Fig. 3.30). Here only his 'geometr ica l assumptions ' are of importance, they are repeated

in a slightly modified form:

I The branches are always straight, and their girth is not considered;

II the mother branch produces two doughter branches with each branching event;

124

Fig. 3.30: Two-dimensional images of Honda t r ees in the (x,y}-plane. The r ight f igure has in par t been comple ted to approximate tr ipling at branching points. Still more rea l i s t ic b i fu rca t ion pa t t e rns for plants can be cons t ruc ted if the branches fork off under a ' d ivergence angle ' which follows a Fibonacci ser ies (cf. HONDA, 197t).

III the length of each successive doughter branch is shor tened in a ce r t a in ra t io

wi th respec t to the mothe r branch;

IV the doughter branches form cons tan t angles with the i r mother branch through-

out the t ree; however, lef t and r ight branching angles may be d i f ferent ;

V the mother branch forks off into doughter branches in the plane which con-

ta ins the mothe r branch and whose s teepes t gradient line coincides with the

d i rec t ion of the mother branch.

From these assumptions Honda derived a formula which allows to compute the

successive coord ina tes of branching points and, thus, can be p rogrammed for graphical

compu te r output . Point (V), which is not a sepa ra te point on Honda's list, is responsible

for a r a t h e r compl ica ted s t ruc tu r e of Honda 's formula, which has to be applied separa te ly

for l e f t and r ight b ranches (d i f ferent p a r a m e t e r s R and {3 ):

x = x B + R(u cos0 - (Lv sin@)/(u2+v2) I/2 (3.41)

Y = YB + R(v cos8 + (Lu sine)/(u2+v2) I/2

z = z B + R w cose

where R: ratio between subsequent branches, @ : branching angle

and u = XB-XA, v = yB-YA, w = ZB-Z A, L = (u2+v2+w2) I/2

125

Fig. 3.31: l londa 's

bifurcat ion.

nota t ion of

XA, x B etc . are coordinates of the branching points (P) as indicated in Fig. 3.31. Honda 's

formula is easily p rogrammed and evaluated wi th a computer ; however, equat ion (3.41)

is not of the proper form for an analyt ic analysis; it takes a much more convenien t

form if one does not work in global coordinates but considers the dislocations of branch-

ing points which are

[ Z - Z b J ~ 0 c o s 0 LZB-ZAJ

where b = L(u2+v2) I/2 = (i + w2/(u2+v2)i/2.

(3.42)

The matrix, which appears on the r ight side, describes a ro ta t ion and a simple compres-

sion (or di latat ion) . We can separa te the two components by dividing all ma t r ix e l emen t s

by a fac to r

R I = (cos~ + b2sin28 )I/2= (I + w2(u2+v2)I/2sin28)I/2),

and the dislocat ions can be wr i t t en

I II = RIR k sin@O co~@ 0 itlJ 0 c

where tan = b(sin 8)/(cos e)

and c = (cos0)/(cos 2 @+ b2sin20 )1/2.

(3.43)

The fac to r R 1 defines an addit ional shortening ra t io which depends like the mat r ix

e l ement 'c ' on the ro ta t ion angle and the length of the mother branch. However, only

the length of the mothe r branch is var iable (cf. assumption IV). The re levan t coef f ic ien t ,

Fig. 3.32:

bifurcation.

Vector notation

126

of a

r L

P r L

which depends on this parameter , is w2/(u2+v2), Cfo equations (3.42) and {3.43). In case

this quotient is constant , equation (3,42) is a simple linear system. We shall not discuss

Honda's original model in more detail; however, condition (II) can only be sat isf ied if

w2/(u2+v2)=constant. Only in this case, the number R defines the shortening ratio of

branches, and Honda~s model can be replaced by equat ion (3.431.

Honda~s model provides a framework which captures essential aspects of branching

pa t te rns in t ree- l ike bodies although already Honda needed some additional assumption for

th ree -d imens iona l t rees. However, it can be simplified leading to a description by two

sets of vectors: r, the dislocations of branching points, and P , the radius vector of

the branching points in global coordiantes {Fig. 3.32):

The h o d o g r a p h of a H o n d a t r e e is defined by the pair

of i te ra t ive equations

rR, i+1 = aRARri (3.44)

rL, i+ 1 = aLALri ,

and the space coordinates of the Honda tree are given by

PR, i+I = P i + rR,i+l (3.45)

PR,i+I = P i + rL,i+l;

r, p will denote vectors throughout this section; R,L are the indices for left and

right branching; 'A t is an orthogonal matr ix (cf. equation (a.4a)); 'a t is a scalar

and denotes the ratio be tween subsequent branches.

While in a Honda t ree s. str. 0 < a R , aL < 1, we call a t ree with arbitrary values

of aR, a L a g e n e r a t i z e d H o n d a t r e e , and, where not necessary,

the a t t r ibu te ~generalized ~ is dropped.

A Honda tree, thus, is a binary tree, and if we superimpose some probability that

127

branches may be lost, i t genera tes an open network, as discussed in the last section,

with all i ts propert ies . The Honda t ree, developed for botanica l models, possesses a

proper ty which is a classical observat ion in geomorphology: the cons tan t ra t io of branch-

es. The s t r eam length sa t i s f ies the approximate re la t ion (GRENANDER, 1976)

ls_l/1 s = RL, s = 2,3,...,S (3.46)

1: length of branches, s: S t rahler number, RL:length rat io,

an empir ica l observat ion, which coincides with Honda's assumption (tI). We shall derive

some proper t ies of Honda t rees , which are only based on this assumption, and, therefore ,

can be applied to s t r eam pat terns .

3.5.3 Morphology of Branches in Honda Trees

"DVArcy Thompson's Problem"

Honda t rees , as defined in the previous sect ion, possess special propert ies: con-

s t an t branching rat ios and branching angles. These proper t ies allow to compute the length

and morphology of ce r t a in b r a n c h e s - - under special c i r cums tances even for inf ini te

numbers of branching events . Fur thermore , the d i sc re te model will turn out to be iso-

morphic with a d i f fe ren t i ab le system.

A) Length of Branches

By defini t ion the branches of a general ized Honda t r ee are in a geomet r ica l rela-

t ionship, i.e. the successive length of branches is given by the i t e ra t ed map

I r i ]= a [ri_l [ (3.47)

where 'a t may be e i the r the branching ra t io for lef t or for r ight turns, or b e t t e r a m a x

or amin, a no ta t ion used now because ief t and r ight are of l i t t l e meaning (cf. sect ion

3.5.1). If one considers a continuous sequence of turns in one direct ion, the length of

the b ranch is given by

]r i[= Jr0] ~ a 1, (3.48)

a geomet r i ca l ser ies with the well known sum

L =1 r 0 1 (1-an+l)( t -a) -1. (3.49)

In case of a convergen t Honda t r ee , the branching rat ios have to sa t is fy

0 < a . < a <l, and we can compute the length of such a branch even for n - 7 ~ : mln max

128

lim] ro} (1-an+l)(1-a) -1 = ]to] (l-a) -1, n ~ co

(3.50)

From this observa t ions we es tabl ish the

Proposi t ion: In a Honda t r ee the sums

Lma x = I ro l (1-amaxn+l}(1-amax )-1 and

Lmi n = ]ro](1-aminn+l)(1-amin )-1

provide upper and lower bounds for the length of any pathway through the t ree

with fixed n (n = 1,2 . . . . . k) and ini t ial length r 0 . In case amax=amin , all possible

pathways are of equal length. In case 0 <ami n <amax < 1, this holds even for

n - ~ o o .

Consider any o ther pathway, it necessar i ly includes at least one turn opposite to the

cont inuous path, i.e. an e l emen t a lmaxalmin . However, for any combinat ion of i and

j we have

a i+j <= a i a j < a i+j (3.51) max max min min"

The sum over the length of e l ement s along an a rb i t ra ry pa thway with init ial length

r 0 conta ins only products of this type, and the length is bounded for fixed n because

every e l emen t is bounded. The equal i ty in the equat ions holds if amax=amin , and independ-

en t of the pathway, i ts length is cons tan t for fixed n.

These resul ts can immedia te ly be applied. Consider a d is t r ibut ive system which

has to be op t imized in the way tha t the lengths of pathways with common origin are

equal. This op t imiza t ion problem is solved by any t r ee with amax=amin , independent

of i ts morphology, e.g. branching angles.

If one considers genera l ized Honda t rees , one cannot expect tha t the length of

b ranches converges for n - - ~ , however, t he re may be special pathways with f ini te length.

To get an idea what branches converge we consider pathways consist ing of regular repe t i -

t ions of 'm ~ lef t and ~n' r ight turns, for which we wri te

( m ' R , n 'L) , ( m ' R , n 'L ) . . . . ; m,n ~ (0,I,2 ..... ).

The sum of such a ser ies is

(L-] r d )= 2 m

a + a + . . . + a

+ am{b + b 2 + ... + b n)

+ , . . +

+ akmb(k ' l )n (b + b 2 + ... +b n)

; the f irst sequence of r ight turns

; the f irst sequence of lef t turns

; the k th and final repet i t ion .

129

This sum can be r e w r i t t e n as

L - I roI = (~ ai + am ~ bi)(l + ambn + (ambn)2

or

L lrol = (g ai + an~ bi)(ambn)k / (1 - ambn),

and b ranch leng th c o n v e r g e s if

0 < arab n < 1.

+ ... + (ambn)k (3.52)

(3.53)

Given a c e r t a i n area x and ami n we have to find numbers m,n so tha t equa t ion (3.53)

holds; if they exis t , t he b ranch is of f in i t e length . In s o m e sense , t h e s e spec ia l pa t hways

provide us again wi th upper l imits: If ( m ' R , n ' L ) c o n v e r g e s and a R > aL, then all p a t h -

ways ( m ' R , (n+i)*L) c o n v e r g e even if ' i ' is some random var iab le but s a t i s f i e s i _-< 0.

B) Branching Angles - - S imi la r i ty and Se l f -S imi la r i ty

The second var iab le in Honda ' s model is the b ranch ing angle . However , this var ia -

ble is no t a p roper m e a s u r e m e n t because t h e t r e e p a t t e r n depends s t rong ly on the b ranch

ra t ios . If one wan t s to c o m p a r e two t r ees , a m e a s u r e m e n t is neces sa ry which t akes

t he b ranch ing angles and the b ranch ra t ios in to cons ide ra t ion . This leads to t he fol lowing

Def in i t ion : The s i m i 1 a r i t y i n d i c e s o f a Honda t r e e a r e t h e

two number s S m a x l m i n = (log amax lmin ) /0 ; 0 in radians . Two t r e e s a r e abso-

lu te ly s imi la r if bo th ind ices a r e iden t ica l , t h e y a re pa r t i a l ly s imi la r i f they a g r e e

in one s imi la r i ty index.

To m o t i v a t e this def in i t ion we cons ider t he hodograph of the Honda t r ee , i .e . the v e c t o r s

r i = aAr i_ l , and we cons ide r again a con t inuous pa thway which cons i s t s e n t i r e l y o f l e f t

or r ight turns . Then the v e c t o r s r i evolve like r0, aAr0, (aA)2r0, ... (aA)nr0 . Because =anA n A is an o r thogona l ma t r i x (cf. equa t ion 3.43), t he powers r n can be w r i t t e n as

The s imple r ede f in i t i on ~ = n (? t r a n s f o r m s this equa t ion into

130

r n =

I cos t9 =sin ~ 0 e c '~ sin ~ cos tp 0

0 0 c n r 0 (3.55)

c = (log a)/0

that is, the vectors r are located on a logarithmic spiral in the (x,y)-plane or on a n

trochospiral in three dimensions, whereby the term 'spiral ' includes circles and straight

lines. To see this more clearly consider ro=(XoYoZo) in its most general form, and equation

(3.55) can be rewr i t t en as

r = E!o yo o I Eco: 1 o : % 0 e c $ sin

0 Z o C n

(3.56)

The term in brackets is clearly a logari thmic spiral if we allow ~ to vary continuously.

Equations (3.54-56) provide a continuous approximation for the location of dislocation

vectors in a monotonous sequence of left or right turns, and the similari ty indices are

based on the similari ty of these spirals taking into account that a change of the magni-

rude of the branching ratio can be compensated by the branching angle so that the

vectors are still located on the identical spiral. Another aspect is that a Honda t ree

is everywhere selfsimilar in te rms of its hodograph (F~ig. 3;33). Equation (3.56) shows

that we can build the hodograph ent irely of sequences of continuous left and right turns,

i.e. of spirals in the (x,y)-plane. Along such spirals the branching points are defined

by ~=0n , and they give rise to a spiral in opposite direction. In case the Honda t ree

is absolutely convergent , the two initial spirals bound the hodograph, i.e. all o ther disloca-

tion vec tors are located inside these leading spirals. The Honda model thus describes

an ideal self similar system with possible infinite repet i t ions.

F_~ig. 3.33: The hodograph of a Honda t ree and its continuous approximation.

i

/ / /

I j

%

J • •

, s l a • • / # J ~ l

i s ~ " I

131

C) Branches and Bi furca t ions - - a Quas i -Cont inuous Approx ima t ion

So far , we g a t h e r e d i n fo rma t ion about the branching p a t t e r n wi thout r ega rd to

the form of b ranches ; however , it will turn out t ha t mos t work was a l r eady done. We

have seen tha t t he hodograph of a Honda t r e e cons i s t s o f s imi la r spirals which o r ig ina te

a t a ' l ead ing ' spiral , and in the ca se of a c o n v e r g e n t Honda t r e e they all approach

the s a m e coi l ing c en t e r :

Def in i t ion: A 1 e a d i n g b r a n c h is a con t inuous s equence of e i t he r

l e f t or r ight turns , and it is t he cont inuous app rox ima t ion o f a leading branch .

The t e r m 's p i r a 1' inc ludes t rochosp i ra l s and c i r c l e s and s t r a igh t lines.

The b r a nc he s and the b i fu rca t ion p a t t e r n a re found if it is poss ib le to ' i n t e g r a t e '

t he hodograph, i .e. one has to sum the d i s loca t ion v e c t o r s along e v e r y leading spiral:

P i = (i-~0aiAi)r0 ' (3.57)

tha t is again a g e o m e t r i c se r ies , but this t ime it involves a mat r ix . However , if one

uses t he inverse of a m a t r i x - - deno t ed by M -1 - - this se r ies can be summed like

an ord inary power se r ies (e.g. ZURMOHL, 1964), and the sum takes the form

Pi = (I - aA)- l ( I + an+lAn+l) ro

w h e r e I: the i den t i t y mat r ix ,

(3.58)

or in a m o r e e x t e n s i v e form

P i = ( I -aA)- l ro . ( i_aA)- l (an+lAn+l) ro .

This equa t ion desc r ibes the success ion of b i fu rca t ion poin ts along our leading spiral

if we use 'n ' as var iab le . It t u rned a l ready out t ha t a t e r m (aA)nro desc r ibes a s e q u e n c e

of v e c t o r s (cf. equa t ion 3.54) with the v e r t i c e s l oca t ed on a spiral , and r e tu rn ing to

t h e s e a r g u m e n t s again i t b e c o m e s c l ea r t ha t t he va r iab le t e r m in equa t ion (3.58) de-

s c r ibes again tile spiral of the hodograph. The t e rm (I-aA) -1 is cons t an t and can be

d e c o m p o s e d into a ro t a t i on and an e longat ion , as i l l u s t r a t ed in s ec t ion 2.5.2 B. The

leading spi ra ls of t he hodograph, t h e r e f o r e , map on to s imi la r spirals , t he b ranches .

The r e p r e s e n t a t i o n of a b ranch by a s equence of spira ls is i l l u s t r a t ed in Fig. 3.34.

The form of b ranches , or more prec ise ly , the loca t ion of b i fu rca t ion points , thus is

well de f ined in a Honda t r e e and r e s t r i c t e d to a s ingle fami ly of func t ions . The spi ra ls

a re t he s a m e as in t h e hodograph up to s imi l a r i ty t r a n s f o r m a t i o n s .

The b i fu r ca t i on p a t t e r n thus can be a p p r o x i m a t e d by spira ls which o r ig ina te a t

132

Fig. 3.34: A Honda t ree as a sequence of spirals.

a leading spiral and give rise to further spirals and so on. The Honda t ree provides

a model for

"D'Arcy Thompson's" theorem: A branch system with continuous branching angles

and geomet r ic relationships between mother and doughter segments of branches

is isomorphic with a system of leading spirals: The first generation of branch-

es is a t tached to the leading spiral in regular, usually geometr ical ly increasing

or decreasing distances measured by arc length and give rise to a second genera-

tion, and so on.

This pa t tern has been i l lustrated above, here we give some more propert ies which will

be needed later, Thereby we res t r ic t ourself to the two-dimensional image of Honda

t rees in the (x,y)-plane (of. equation 3.44):

Proposi t ions concerning the form of branches in Honda trees:

(i) There are at most two d i f ferent spirals in a bifurcat ion t ree of Honda type.

All others are similar to these two spirals. Similar spirals have identical coiling

133

directions.

(2) All branches of a ce r t a in generat ion are d i rec ted to the same side of the leading

s p i r a l

(3) The length of a spiral arc be tween two branching points is proport ional to the

length of the d iscre te branch segment be tween these points,

(4) The lengths of any two spirals with equal numbers of branching points are in

an a l lomet r ic relat ionship. If the two spirals belong to the same family of leading

spirals, the i r lengths are simply p ropor t iona l

Proposition (I) follows from the fact that a Honda tree contains only bifurcations with constant branching angles and branch ratios. The two spirals are defined by equations (3.54-56). Proposition

(2) follows from the definition of a leading spiral which is a continuous pathway of only left or right

turns, and from the constancy of branching angles. To prove proposition (3) we choose the coiling center of a leading spiral as origin of the coordinate system and assume the bifurcation points are at regular

angular distances @=constant. The succession of these points is given by p i=exp(-i¢ ). The length of a spiral segment is

L a e-l (c~+l) l /~(tPi l - IPI_IL? (3.5g) c -I (c2+i)I/2(I _eC)e -el.

The length of the discrete branch segment connecting the same bifurcation points is

Lb = lP i - Pi-~ I= ri (3.60) (e-2Ci+e-2C(i-1)_2e2Ci+C((cosCi)(cos~(i_1 )) + (s:'w~i)(sin(b(i-1)

(I +e2C-2eCcos~)e -ci.

The only variable is ti~, all other terms are constant. Therefore, we carl express the length of discrete branch segments in terms ef the length of the spiral arcs

L b = a Ls. (3.61)

The length of a branch for a given ~i Y is the sum over subsequent segments. The constant factors, however, need not to be summed, the sum involves only terms axp(-ic) which are identical in beth expres-

sions: Equation (3~I~ therefore, holds for any fixed number of branch segments Ti~. In both equations

(3.59+60) the ratio between subsequent segments is Lb,i+I/Lb,i = Ls,i+I/Ls,z" = exp(-c), that is

the geometrical relationship in D'Arcy Thcmpsons theorem. To prove proposition (4) we start with the ailometric relationship between two logarithmic spirals:

c 1 ¢ e 2 ¢ P 1 = ale ; P 2 = a2e '

c 2 can be expressed i n terms of o 1 by an equat ion c2=bc 1 or p2 = a2exp(bc 1 ), and, the re fo re ,

P 2 = ( a 2 P l / a l ) b ' (3,62)

i.e. the radii are in an allometric relationship. The length of a logarithmic spiral is proportional

to the length of its radii (equation 3.59), and the allomatric relationship holds alse for the length of spiral arcs for any fixed angular interval i C. If the two spirals belong to the same family, the coefficient b=1.

As long as we consider branches of f inite length, the resul ts hold for any values

of branching ratios. There are some special cases, e.g. if one branching ra t io equals

134

Fig. 3.35: The plane image of a Honda t r ee if amax=l ,

0 <ami n < 1 are regular poly-

hedrons.

lm

....

/

one. In this case, the images of b ranches in the (x,y)-plane are circles as i l lus t ra ted

in Fig. 3.3~ for amax=l , ami n <1. If such a special p a r a m e t e r se t t ing is chosen, the

net pa t t e rn depends strongly on the b i furca t ion angles, e,g. if amin=0, then the system

resembles the ' e l a s t i c collision in a c i rc le ' as discussed in sect ion 3.3.2 (Fig. 3.14). If

both branching ra t ios equal one and the branching angles are 2~/3, w / 2 . . . . , the Honda

t r ee degene ra t e s into regular t r iangular , r ec tangu la r and hexagonal grids which cover

the en t i r e (x,y)-plane. Usually, however, we will find sequences of logar i thmic spirals

which, in the case of convergent Honda trees, are bounded in length. This causes the

chao t ic .appearance in the two (and three-) dimensional images of the t rees , the end

branches c lus te r and overlap prohibi t ing the recogni t ion of regular s t ructures .

3.5.4 Evolut ion of Shape

'Shape ~ or ~outline ~ means tha t an object is bounded by some kind of surface .

Simple b i furca t ion diagrams, e.g. the skeletons S1-$4 in Fig. 3.28, do not possess a

shape in this sense. However, as the number of b ranches increases, an out l ine evolves,

at least subject ively. From previous work we know tha t a convergen t Honda t r ee needs

to have a maximum out l ine or l imit ing shape for a suff ic ient ly high number of b i furca-

tions. This l imit ing shape exis ts and cannot exceed a ce r t a in boundary because all

b ranches are of f ini te length. On the o ther hand, the number of end branches increases

with the number of b i furca t ion even ts like n=2 s (s=Strahler number because the t r ee

is symmetr ic ) and c lear ly goes to inf ini ty if the number of b i furca t ion even ts is not

135

bounded. BARKER et al. (1973) es t imated the mean number of buds arising from the

highest order branch in a birch t ree to be >7000.

As the number of end branches increases, we expect increasing density of end

points and a t rend towards a cont inuous outline. Fig. 3.36 elucidates this point where

only the end points of the t rees of Fig. 3.30 are marked. Clearly, the end points clus-

ter in cer ta in areas and give the illusion of continuous lines which together give the

Io i

\

Fig. 3.36: Distribution of branch endings in the t rees of Fig. 3.30.

illusion of a continuous outline. One question to be discussed is whether these points

may define a continuous curve or even fill some space, as the number of bifurcation

points goes to infinity. The o ther question is if there exists some defined envelope

or hull of a convergent Honda t ree which can be considered as its limiting shape.

A) Trees, Peano and Jordan Curves

To i l lustrate the concept of shape in more detail we consider a quite d i f ferent

system, continuous curves which are nowhere different iable . A special example, which

is useful in this context , is the Koch curve {cf, MANGOLDT-KNOPP, 1968). The closed

version of a Koch curve can be const ructed in the following way (Fig. 3.37): A regular

tr iangle (equal angles) is inf lected and superposed on i tself so that the sides are divided

into three segments of equal length, the resulting pa t te rn is a regular star . The corners

of this s tar are again regular triangles, similar to the original ones, and with each

t36

vS J ig d, 13tsaTrelC~nstTt~Otr:: ~hatteKrOCh cu

of these triangles the process is repeated, The Koch curve is the outer boundary of

the resulting pat tern as i l lustrated in Fig, 3.37. If the process is repeated infinitely,

the triangles shrink to points and give rise to a continuous, non-differentiable curve.

If we now connect the centers of successive triangles by straight lines, these form

a t ree pat tern with a triple point at each branching event. The branching angles and

the shortening ratios are constant, i.e. if we dele te one of the branch directions, the

resulting binary t ree is a Honda tree.

.Fig, 3.38: Construction of a binary t ree as sequence of similar triangles.

137

a /

b

I 1 I

#

C

~ -If '4 '

J(÷ Ji

Fig. 3.39: Triangulation of a rectangle by i tera t ive averaging (a; cf. Fig. 2.46) reconsidered as a space filling Peano curve (b). Connection of subsequent centroids generates t ree pat terns, c: bifurcation and tripling with ' l imit shape'; d: space filling t ree pat tern (see text).

The cen te r of every triangle along the boundary is the terminat ion point of a

end branch. As the i terat ion process goes to infinity, the triangle shrinks to its centroid

which, by definition, is the terminat ion point of an end branch, and we expect a con-

tinuous curve as limit shape for the tree. In detail, this assumption is not quite correct .

If we only consider branches as indicated in Fig. 3.37, then there are no branches

terminat ing in the concave intervals of the Koch curve while the original construct ion

produces triangles in these areas. In the t ree pat tern an empty interval remains between

terminal points, and new ones are genera ted with every i terat ion. The points belonging

to the t ree are only a subset of the Koch curve. If the points, however, are replaced

by objects of f inite area, the boundary becomes again densely covered. Numerically

this has been studied by HONDA & FISHER (1978, 1979) in terms of the most equitable

distribution of leaf clusters. Fig. 3.3a shows a similar construct ion of a binary bifurca-

tion pat tern with terminal triangular ' leaves ' which tend toward a continuous outline,

but again there are areas which are not covered, and in addition the leaves overlap.

The relationship to the discussion of cluster t rees is obvious; however, let us

return for a moment to a problem of the first chapter , the reconst ruct ion of a continu-

138

ous surface by i te ra t ive

e i ther way i l lustrated in

centroids as unique limit

MANGOLD-KNOPP, 1968;

we have a space filling

averaging (section 2.4.5). If we draw the successive nets in

Fig. a.ag, we get a nested sequence of squares with their

point. The sequence of centroids can be paramet r ic ized (e.g.

GUGGENHEIMER, 1977}, and, as the process goes to infinity,

curve or a Peano curve. Again we can connect a sequence

of centroids in the way indicated in Fig. 3.39 generat ing another special t ree pa t te rn

with its terminat ion points a subset of the Peano curve. As the process goes to infinity,

the terminat ion points of a binary t ree cluster; however, they are always well separated.

The t ree becomes more dense if we allow triple points (Fig. 3.39), and finally we can

turn the branching process into a space filling process by the condition that branches

of any generat ion sprout into areas of least density (with maximal dis tance from neigh-

boring branches).

B) The Outline of Honda Trees

The brief excursus to such s trange s t ruc tures as Jordan and Peano curves provides

us with two contras t ing aspects: It encourages the previous view that there are t r ee

s t ruc tures which genera te an outline, and it discourages the a t t emp t to search a descrip-

tion of this outline by inspecting the distribution of branch terminat ions. A conclusion

could be that we now need s ta t i s t ica l methods to study the distribution of branch t e rmi -

nations more deeply; the si tuation is quite similar to the problem to find a closed bound-

ary for a f inite point set as discussed in te rms of cluster s t ra tegies . However, in the

special case of Honda t rees we have continuous di f ferent iable functions which approximate

the branches, and, what we ar looking for, is another continuous approximation of the

most extended outline. Therefore , we replace the d iscre te model by a continuous approxi-

mation hoping to replace the multivarious outputs of a computer program by a theorem

about the shapes, which possibly can be cons t ruc ted from the model under consideration,

similar to the model 's original a t t emp t to describe the essent ial s t ructures of complex

natural pa t te rns by a few parameters .

Let us consider a leading branch and replace it by its spiral approximation. The

spirals of the second generat ion are then most economically described in local coordinates

of the leading spiral, i.e. in te rms of its local moving frame or Frenet trihedron. If

we consider s traight branches of second order, we can e.g. express them in terms of

the tangent vec tor of the leading spiral and ro ta t e this vec tor into the proper position.

The local descript ion of the d iscre te system is analogous to equation (3.57), only the

vec tor ro is replaced by the ro ta t ed tangent vec tor bBt:

Pi = (i~=O aiAi)(bBt)" (3.63)

139

If we move this spiral of second order along the heading spiral, the branching angle

is everywhere preserved. Consider a convergent Honda t ree with infinite number of

branching points, then by proposition {4) of the previous section the length of the second

order spiral is always in an al lometr ic relationship with the length of the leading spiral,

and the same holds for the radii of the two spirals. We take the coiling center of the

leading spiral as the origin of the global coordinate system, and the global coordinates

of a point fixed on the second generation branching pat tern takes the form:

PB =Ps + a lps Ib (bBt)

where the index s denotes the leading spiral.

(3.64}

Before discussing this equation in more detail, we observe that b=l if we res t r ic t ourself

to leading spirals of the same system because these are all similar. Furthermore, if

we consider any fixed point on a spiral of higher generation, this point is described

by a vector which originates at the leading spiral. This holds for any point, even for

the most distant one. Although it is cumbersome to evaluate the most distant point,

it will turn out that we can give a quali tat ive answer about the outline.

Remaining in a system of leading spirals equation (3.64) is simplified and can

be wri t ten in a more extensive form as

I: :1] I or (3.65)

l p B = I + bB sin~ -cos~j Lsin~,

and once more it turns out that this equation describes the original spiral which is ro-

ta ted around its coiling center . Every family of leading spirals, therefore , is bounded

by a similar s p i r a l - - because a t ree consists of two systems of leading spirals, the

shape results from superposition of the two systems. On the other hand, within every

system of leading spirals there exists an infinity of identical subsystems which differ

only in size, and we can consider the maximal outline as result of the superposition

of smaller subsystems as i l lustrated in Fig. 3.40. Concerning the density of terminat ion

points we note that the distance of branching points decreases regularly along the leading

spiral, the density of branching points is proportional to the curvature of the leading

140

Fig. 3.40: A Honda t ree is se l f -s imi lar on all levels. Superposit ion of images on a ce r t a in level genera tes the ident ical but enlarged image.

spiral, and the same holds for any secondary leading spiral, etc. . Thus, the densi ty in-

c reases towards the coiling cen te rs , and the densi ty of coiling cen te r s is proport ional

to the cu rva tu re of the leading spiral. Densi ty dis t r ibut ions of this type were a l ready

encoun te red with the ' c lus t e r s t r a teg ies ' .

Finally we consider b ranches of f ini te length. The proposit ions of the previous

sec t ion sti l l hold, the only d i f f e rence is tha t the leading branches do not coil to infinity.

Equat ion 3,64, there fore , takes the form

PB =Ps + a(JPsJ -IP 0 ])b(bBt), (3.66)

and the b ranches t e r m i n a t e for Jpsl=JOOJ. Far from this s ingular point and for b=l

the shape is qui te close to the spiral of the leading branch. The system, there fore , is

real ly se l f - s imi la r on each level.

141

C) Chance and Dete rmin i sm

The previous discussions showed tha t the Honda model genera tes ideal se l f -s imilar

and de te rmin i s t i c s t ructures : The morphology of b ranches is (infinitely) repea ted and

impresses i ts pa t t e r n onto the shape of the en t i r e s t ruc ture , and the shape is again

repea ted on the level of every genera t ion of b ranches differ ing only in size, With these

proper t ies it is unlikely tha t the Honda model re f lec t s r e a l i t y - - it is a s trongly ideal-

ized model, and in a la te r study HONDA & FISHER (1979) int roduced addit ional sources

of var ia t ion to approximate real t rees more closely. However, se l f - s imi lar i ty and f rac ta l

sys tems are cu r ren t fields of in te res t (MANDELBROT, 1977), and the Honda t r ee provides

a simple l inear model al though the basic model is r a t h e r unreal is t ic . However, t he re

is some possibili ty tha t models "nearby" approximate real i ty, i.e. t ha t dis turbed Honda

t rees provide b e t t e r approximations. There are several ways to disturb the original model,

which involve s tochas t ic and de te rmin i s t i c aspects .

A simple approach is ' chance ' , i.e. some ex te rna l and perhaps in ternal noise is

added which a l t e r s branching ratios, branching angles, and may even inhibit branching.

However, ' chance ' involves also events as ' to evolve at a ce r ta in place ' , to 'evolve

within a ce r t a in t ime interval ' , or to 'evolve under ce r t a in and for the individual s t ruc tu re

not pred ic tab le env i ronmenta l condit ions ' , even if they are not random but exhibi t only

a complex spa t io - tempora l dis tr ibut ion. The random aspect "if the re would not have

been t ha t par t icu lar thunders torm at t ha t par t icu lar t ime, then those branches would

not have been broken off and those buds would not have been inhibited" is only a very

special viewpoint.

Other aspects are ex te rna l and internal cons t ra in ts , the system has to fulfill cer -

tain boundary condit ions and cannot grow freely. A possible resul t could be tha t under

ce r t a in cons t ra in t s only one or few of the topologically dis t inct t rees discussed in sect ion

3.5.1 are s table. Or, consider the drainage systems of Fig. 3.25; al though they are ir-

regular, they are not random; the geology de te rmines much of the drainage system:

So the inf luence of the Andes obviously de te rmines s t r eam direct ions at the wes te rn

margin of the Amazon drainage system, and sea- level f luctuat ions through glacia t ion

periods de te rmined much of the dra inage pa t t e rn of the Ganges del ta . If we consider

such systems as random or par t ia l ly formed by chance, the reason is simply tha t we

cannot r econs t ruc t the his tor ical evolution of boundary condit ions to a suff ic ient pre-

cision.

A third group of factors, which may be of in teres t , are in ternal const ra ints , or

in ternal regulat ion systems, The Honda model is a special case of the family of l inear

maps

Xi+l = a lx i + blYi Yi+l = a2xi + b2Yi (3,67)

142

which is equivalent to the d i sc re t iza t ion of a pair of coupled l inear d i f fe ren t ia l equat ions

dx/dt = a l l x + a22Y; dy/dt = a21 x ÷ a22Y (3.68)

which gene ra t e spiral pa t t e rns in the phase space for p a r a m e t e r se t t ing equivalent to

Honda 's model (cf. sect ion 2.2.1). Another possible extension is tha t the local evolu-

tion depends not only on the mother branch (a "Markovian" s i tuat ion) but on o the r branch-

es nearby, e.g. t ha t the inhibi t ion of a branch inf luences growth in some neighborhood.

If the resul t ing regula t ion is l inear, then the map (3.67) comes close to a t r anspor t

equat ion as discussed in sec t ion 3.1.2. Any local d is turbance then would a l t e r the system

in some neighborhood. Adding diffusion t e rms to equat ion (3.68) and assuming tha t the

coef f i c ien t s aij may depend on (x,y) or I (x,y)], leads to the (kto)-models which play

some role in spiral chemica l waves (e.g. DUFFY et al., 1980; VIDAL & PACAULT, 1982).

Still more compl ica ted sys tems ar ise if non- l inear feedback mechanisms or au toca ta ly t i c

sys tems are in t roduced (cf. MEINHARD, 1984; NICOLIS & PRIGOGINE, 1977). Such a

sys tem is e.g. the H~non map (cf. THOMPSON, 1982)

Xi+l = Yi + l -ax i2 ' Yi+l = bxi (3.69)

which exhibi ts a s t range a t t r a c t o r and chaot ic behavior . THOMPSON (1982) concluded

from the random response of such a de te rmin i s t i c modeh

"Strange attractors may thus have a profound effect on our

modellin E of seemingly random behaviour, since it is now seen

that a stochastic modellin E may no longer be essential in all

cases. For simple deterministic mechanical systems, they mean

that computer results of their non-linear dynamics must be in-

spected with care (as must any result of a conventional Krylov--

Bogoliubov averaging technique), since one feature of a strange

attractor is that a sudden leap in response may occur after

a lon E period of apparent quiescence. "

Another point is also e luc ida ted by this sect ion: Somet imes it is worthwhile to

study compu te r models analy t ica l ly to de t ec t the i r in ternal s t ruc ture . We usually think

the o the r way, we fo rmula te a problem analyt ica l ly and then use the compute r to solve

it. Compute r modelling and s imulat ion thus b e c a m e decoupled from classical ana ly t ic

m a t h e m a t i c s to some ex ten t , providing us usually wi th an enormous or inf ini te number

of possible solutions. In this respect , na tu re is r a the r closely approximated, the da ta

a re not summarized, but we can es tab l i sh an inf in i te ca ta logue of forms wi thout the

d isadvantage to go to the field.

143

A major point throughout this section was self-s imilar i ty which leads to the aspect

that it is somet imes worthwhile to decompose complex s t ructures into smaller units

and to ex t rac t the 'pr imit ives ' which allow to describe and to analyze the complex global

pat tern. Fig. 3.41 i l lustrates this in the case of ammonite sutures which are close to

t ree pat terns , and which are self-similar to some exten t . By turning a global problem

into a local one the analysis of a s t ruc ture commonly is s i m p l i f i e d - - however, the

inverse procedure is not always possible as was elucidated by various examples.

The analytic approach, however, usually condenses the possible pat terns and allows

to discover unexpected s t ructures which may evolve under cer ta in conditions. Thus, the

previous discussion of shape was incomplete, as new pa t te rns may arise if the second

order branches are d i rec ted to the concave side of the leading spiral. Straight branches

e.g. then overlap and are bounded by their evolute, and this pa t te rn does not require

the convergence of the Honda tree. Such p a t t e r n s - - evolutes and caust ics - - will be

the topic of the next chapter .

Fig. 3.41: Ammoni te suture lines are comparable to t ree pat terns . Commonly they are also self-s imilar (including reflections) on various levels. The enlarged 'pr imit ives ' can repeatedly be found within the complex suture lines.

4. S T R U C T U R A L S T A B L E P A T T E R N S A N D

E L E M E N T A R Y C A T A S T R O P H E S

In the preceding discussion it became clear that most of the observed instabili t ies

are due to the fact that the pa t te rn recognition process lacks an inherent s tabi l i ty prop-

erty. As LU (1976) states:

"In any branch of science, it is always a challenge to try to classify the

objects under study. Unfortunately, it is often extremely difficult to carry

out this classification. It becomes much easier if one tries to classify only

the stable objects .... stable objects have boundaries where discontinuities

appear. We all know that mathematics used in almost all sciences so far is

based on the differential calculus, which presupposes continuity. There is

a @rest demand, therefore, for a mathematical theory to explain and predict

(if possible) the occurrence of discontinuous phenomena."

A very instruct ive example of s t ructural s tabil i ty and singularit ies we owe to CAL-

LAHAN (1974): Take a piece of fabric at one corner and put it onto a f lat surface.

What forms can occur locally on the sheet {Fig. 4.1)? There are three possibilities:

a) the sheet lies flatly and smoothly, these points are called the regular ones;

b) a fold line appears on the sheet;

c) a pleat forms at the end of a fold line.

' ~ L o c a l forms on a folded sheet (after CALLAHAN, 1974}; (a) a regular sheet lies flat; (b) a singular point on a fold line; (c) a singular

point where the fold turns into a pleat; (d) a singular point which is not s tructural ly stable.

The points (b) and (c) are cal led the singular points because of their special nature;

in part icular , they are s tructural ly stable singular points. To see this, take a point such

as (d) in Fig. 4.1. This is an additional type of singular points, but it is not a s tructural ly

145

s tab le one because any dis turbance, slight as it may be, turns this point into one of

type (a), (b} or (c). On the o ther hand, points of types (a), (b) or (c) cannot be made

to disappear by a small per turbat ion . One can dis locate a fold or pleat by a small per tur-

bat ion, but one cannot a f fec t i ts presence. The discussed s ingular i t ies yield in addit ion

a s t ruc tu ra l informat ion. Put a s t i f f shee t of paper beside the fabr ic in the same way:

It will only consis t of regular and perhaps fold points. It is dist inguished from the piece

of fabr ic by the singular points which may appear under this exper iment . It is usually

the set of s tab le s ingulari t ies which allows us to classify objects .

It was THOM (1975) who pushed forward these concepts in ma thema t i c s and thei r

applicat ions. He has shown tha t the concept of s t ruc tura l s tabi l i ty , i.e. the insensi t ivi ty

to small per turbat ions , is re la ted in one contex t to s table singulari t ies . These s table

s ingular i t ies have then been classif ied by Thorn in his "seven e l emen ta ry ca tas t rophes" .

Here, examples are given which demons t r a t e tha t these concepts are useful to under-

s tand pa t t e rns and to de t e rmine the geomet r i c s ingular i t ies tha t an unknown surface

gener ical ly impresses on a sensing wavefield (DANGELMAYR & GOTTINGER, 1982).

The examples are derived from Huygens' cons t ruc t ion of wave fronts and envelopes. A

deta i led discussion of the m a t hem a t i ca l machinery and of mainly physical appl icat ions

can be found in LU (t976}, POSTON & STEWART (1978), GOTTINGER & EIKEMEIER

(eds. 1979), STEWART (1981, 1982).

Fig. 4.1 repea t s Fig. 2.1 with the addit ion of the typical singular points which

uniquely de t e rmine the local s t ruc tu re of the s u r f a c e - - i.e. the se t of singular points

on surfaces provides a skeleton of essent ia l s t ruc tu ra l informat ion. In the previous chap-

te r s such sets of singular points were viewed as instabi l i t ies , in this chap te r the inverse

problem will be studied, the c lass i f ica t ion of "form" by the intr insic set of singular

points.

In the f irst sect ion singulari t ies are discussed, which occur as discont inui t ies in

the two-dimensional images of three-dimensional objects . The typical s ingulari t ies provide

useful in format ion in pic ture processing. The concept of skeletons then re la tes the analy-

sis of two-dimensional images to the previous discussion of op t imiza t ion problems on

one hand~ and to D'Arcy Thompson's classical Vtransformations of form ~ on the o ther .

In the second sect ion the theory of e l emen ta ry ca tas t rophes is applied to the l inear

ray model in re f l ec t ion seismology. Most of the discussion remains r e s t r i c t ed to the two--

dimensional t rack line problem, and the concepts are der ived from simple assumptions

with basic ma themat i c s . Instead of dealing with waves direct ly, the l inear ray pa t t e rns

and the i r caust ics are studied. The wave fronts then are analyzed in t e rms of a cont inu-

ous plane map. Finally, the t r a v e l t i m e record is analyzed as a map which t r ans fo rms

the th ree-d imens iona l spa t io - tempora l system into the t r a v e l t i m e record. It will turn

146

out tha t the t r ave l t ime record is locally equivalent to {oblique) sect ions through a swal-

lowtail c a t a s t rophe which is loca ted on an oblique line in the d e p t h / t i m e coordinates ,

In the third sect ion some aspects of folds and faul ts are discussed in t e rms of

'para l le l sur faces ' . This discussion takes up the ' p r e - compu te r ' analysis of large scale

deformat ions in t ec ton ics and reviews this ' ea r ly ' k ine t ic approach in t e rms of r ecen t

developments in s ingular i ty theory, The la te ra l cont inua t ion and the depth l imit of folds

will be discussed in d i f fe ren t ways and cont inues in some respec t the problems of chap-

t e r 2.

4.1 IMAGE RECOGNITION OF THREE-DIMENSIONAL OBJECTS

A common problem in pa t t e rn recogni t ion is the r econs t ruc t ion and c lass i f ica t ion

of th ree-d imens iona l ob jec ts from two-dimensional p ic tures . Typically, this problem arises

in t ransmiss ion microscopy. Fig.4.2 shows the larva of a medusa in t r an s mi t t i n g light.

Locally t r iangular pa t t e rns of increased densi ty appear, which can be r e l a t ed to a bound-

Fig. 4.2: The larva of a medusa in t r ansmi t t i ng light. A swallowtail s ingulari ty occurs in the two-dimensional image. The ident ical p a t t e rn is found in the two-dimensional image of a t r anspa ren t canal surface {modified a f t e r WUN- DERLICH, 1966).

147

ary e f fec t . An ident ical pa t t e rn is well known from two-dimensional perspec t ive views

of locally convex surfaces , especial ly from canal surfaces in cons t ruc t iona l geomet ry

(WUNDERLtCH, 1966).

Another field, where similar pa t t e rns play some role, are tec tonica l ly f l a t t ened

and deformed images. The proper recons t ruc t ion of such images yields useful in format ion

for the paleontologis t as well as for the de te rmina t ion of the local t ec ton ica l s t ress

field. The normal approach in such cases are af f ine t ransformat ions . These describe

the deformat ions suff ic ient iy if the dimension of the object , i.e. of i ts image, is not

a l tered: if one has maps R 2 ~ R 2 or R3 --~ R 3. In the case of a map R3 ~ R 2,

new pa t t e rns occur, d iscont inui t ies at the boundary lines of the image which are closely

r e l a t ed to the local surface s t ruc ture .

Fig. 4.3: Tectonical ly deformed and f l a t t ened bivalves show surface discontinui- t ies s imilar to those in Fig. 4.2: In this case, the objects are not t ransparent , and, therefore , only par ts of the surface discont inui t ies are visible {modified a f t e r ROLLIER, 1918}.

4.1.1 The Two-Dimensional Image of Three-Dimensional Objec ts

Locally hyperbolic surfaces are capable to project onto swallowtai l - l ike images

in two dimensions (Fig. 4.4). These images are closely re la ted to Thom's swallowtail

ca t a s t rophe (see e.g. POSTON & STEWART, 1978). This re la t ionship secures t ha t the

pa t t e rn is s t ruc tura l ly s table , and, therefore , it can be used to de t ec t locally concave

surface e l emen t s or holes in the two-dimensional images of three-d imensional objects .

148

• . ° °

°,°• ...:.. " . ' . . . . . . . . . . . . . . : . .

::.'..::...-.:..'.......:...:.. : , .. : ; . : : ' . . . . . . ' : : . . : , . .

• .- • o • •

. • . . . . . : . . ' . . : . , : : . • . . . . . • . . , , • . . : . . . ' . ~ .

. . . . .

.:.:';:!".':"

Fig• 4.4: Two projec t ions of a hyperbolic sur face e lement• An oblique project ion causes the occur rence of a surface d iscont inui ty in the two-dimensional image space. The th ree-d imens iona l surface e l emen t can be ident i f ied wi th the ca t a s - t rophe manifold of the swallowtai l ca tas t rophe• The discont inui ty is the two-- dimensional image of the b i furca t ion se t of the swallowtail (modified a f t e r POSTON & STEWART, 1978).

How such sur face discont inui t ies develop in the two-dimensional image space, can

be analyzed most simply by use of canal sur faces (WUNDERLICH, 1966)• A canal sur face

is the envelope of a (one-parameter ) family of spheres of cons tan t radius (GUGGEN-

HEIMER, 1977). The canal surface, there fore , can be genera ted by moving the c e n t e r

of a sphere along a ( three-dimensional) curve; the envelope sur face of the spheres then

defines the canal surface• The most s imple case of such a canal surface is the torus

where the leading curve is a circle• Under an oblique project ion i n t o the plane the

c i rcular leading curve t rans forms into an ellipse• This de format ion of the leading

curve can be descr ibed by an af f ine t r ans fo rma t ion which takes the c i rc le into an ellipse•

But the boundary lines of the torus in the two-dimensionaI pro jec t ive space behave dif-

ferent ly . The genera t ing spheres project onto R 2 as c i rc les independent of a rigid ro ta t ion

of the torus• The resul t is tha t the apparen t boundaries of the t r anspa ren t torus develop

into a curve with a se l f - in te r sec t ion and with two cuspoid edges, the earI ier not iced

' t r i ang le ' (Fig• 4•2)• Fig• 4•5 gives th ree d i f fe ren t views of the torus and of the local

sur face discont inui t ies in the p ro jec t ive plane.

The ident ical pa t t e rn appears if one keeps the leading curve cons tan t and a l t e r s

the d i a m e t e r of the canal sur face (Fig. 4.6)• In this case, classical cons t ruc t iona l geome-

t ry (WUNDERLICH, 1966) tel ls us tha t the locally discontinuous image appears when

the c i rcu la r project ions of the spheres e n t e r the evolu te of the leading curve, along

which the cen te r s of the spheres are located• A family of such surfaces with var iab le

149

I>cI

Fig. 4.5: Three perspect ive views of a (transparent) torus and of its apparent surface discontinuities. The la t ter develop at the interior hyperbolic surface points, as the projection becomes oblique.

canal d iameter maps, therefore , onto a family of involutes or of parallel curves, which

unfold inside the evolute of the leading curve into swallowtaiI-like curves {Fig. 4o6}.

The la t te r observations allow to relate the evolution of the image boundaries to

Huygens ~ principle (Fig, 4.7). If the image of the canal surface is studied in two dimen-

sions~ then the envelope of the spheres reduces to the envelope of circles with a constant

radius. A change of the d iameter of the canal surface corresponds to a change of the

radii of these circles. In two dimensions, this is identical with a continuous transport

of the boundary lines along the normals of the leading curve. The image changes

locally when the area is reached where the normals in tersec t {Fig. 4.7). This construct ion-

al principle allows to study all possible images of canal surfaces that can occur in two

, : : % : ..:., " ~ ~,:~!..'....:~"

Fig. 4.6: Canal surfaces with identical generating curve but of d i f ferent diameter . The boundaries of the surface map onto a family of parallel carves in the two-dimensional image space.

150

Fi~. 4.7: Construct ion of the two-dimensional images of canal surfaces by use of Huygens v principle. The rays indicate the dislocation lines for the surface boundary in the project ive plane.

dimensions (Fig. 4.8). The only discontinuities, which appear, are the swallowtail-l ike singu-

lar curves. The same holds for the perspect ive views of the torus; the only d i f fe rence

is that , in this case, not the d iameter of the canal surface is a l tered but the local

curvature o f the leading curve. The result is the same: As the curvature decreases ,

the surface ~moves into the evolute v of the leading curve, and the boundary lines

deform in the identical way discussed above.

The cri t ical points of a canal surface are the hyperbolic points. They behave under

the project ion onto a two-dimensional image in the discussed way. The occurrence of

the ~swallowtait' is typical for locally convex s tructures , as will be shown in sect ion

4.2. The relation to hyperbolic surface e lements allows for a simple construct ion of

\ \

Fig. 4.8: A family of boundary lines for canal surfaces which have a common parabolic generat ing curve.

151

Fig. 4.9: Cons t ruc t ion of the swallowtail s ingulari ty as a ruled surface over a folded polygon.

the swallowtail . A hyperbolic surface, which locally takes the form z = xy, can be gener-

a ted as a ruled surface. If these rulings are pro jec ted into two dimensions (Fig. 4.9),

then the envelope of the s t ra igh t lines gives again the various possible images of the

surface in two dimensions including the swallowtail discontinuity.

The concept of parallel curves, which are genera ted by a t ranspor t along the i r

normals, can be re la ted to the concept of potent ia l s and to e l emen ta ry ca tas t rophes .

tn the l a t t e r case, the original three-d imensional obiec t resembles the ca t a s t rophe mani-

fold. If the leading curve is continuous, it can be locally described as an implici t

funct ion f(x,y) = 0. In addition, one can suspect t ha t the family of paral lel curves, which

is gene ra t ed by the t ranspor t along the normals , can be given in implici t form as

c = fix,y). But this l a t t e r formulat ion can be in t e rp re t ed as a potent ia l . The di rect ional

der ivat ives , the gradient of this local potent ia l , def ine the dislocat ion field. Ca ta s t rophe

theory then gives a c lass i f ica t ion of the s tab le s ingular i t ies of such ' loca l ' potent ia ls .

One of the ca tas t rophes , which appear under the map R3 ~ R 2, is the swallowtail .

In the present con tex t it is a s table discontinuity, which allows to ident i fy local hyper-

bolic sur face e l emen t s in the two-dimensional image. A more deta i led discussion of this

s ingular i ty will follow in the next sect ions.

4.1.2 The Skeleton of P lane Figures

In P ic tu re Recogni t ion the problem arises to represen t objec ts economical ly in

the computer , and to classify objects as equivalent even if they are disturbed to some

ex ten t . Especial ly the l a t t e r problem leads to the concept of skeletons or 'media l axes '

(BLUM, 1973; BLUM & NAGEL, 1977). BOOKSTEIN (1978) gives the following defini t ion

152

of the skeleton of a plane figure:

The skeleton of a plane figure is a ce r t a in graph inside the figure, toge the r with

a funct ion on the graph. The skeleton is the locus of all points which do not

have a unique neares t boundary point upon the shape; the function is the d is tance

to any of the set of equally d is tant neares t points.

Thus, the skele ton is lust the set of s ingular points discussed in sect ion 2.4.5, which

arose from the problem to find the neares t boundary point. The most na tura l way of

finding the skeleton of a given object is by shrinking it until it reduces to i ts skeleton

(ROSENFELD & WESZKA, 1980). This shrinking c a n - - t h e o r e t i c a l l y - - be done by a

wave front , which s t a r t s ins tan taneous ly at every point along the boundary and moves

with cons tan t v e l o c i t y - - i.e. by Huygens' principle, as discussed in the previous sect ion

(for t echn ica l deta i ls of computa t ion see e.g. ROSENFELD & WESZKA, 1980). Still more

i l lus t ra t ive is the "grassf ire" model (BOOKSTEIN, 1978):

"We imagine a shape boundary to be 'drawn' on the dry grass of a prairie,

and fired (simultaneously). The fire will burn evenly in all directions from

its starting locus until it encounters points at which it arrives simultane-

ously from two directions, whereupon it quenches itself~ as grass cannot

burn twice. Such loci comprise the skeleton, and the function we seek is

the time it takes the fire to arrive there and Eo out. "

The c r i t i ca l set of singular points we encounte red in the opt imiza t ion problem

{section 2.4.5) has now a to ta l ly d i f fe ren t quali ty: Toge ther wi th a vec tor valued funct ion

(which defines the d i rec t ion to the neares t boundary points) and a d is tance measu remen t

{which def ines the locat ion of the original boundary line) the se t of s ingular points or

the Vmedial axis ' gener ical ly defines our object , Fig. 4.10 shows various examples of

plane f igures and the i r 'media l axes ' .

Fig. 4.10: Skele tons (medial axes) of various two-dimensional objects ,

153

4.1.3 Theore t ica l Morphology of Worm-Like Objects

The concept of skele tons becomes especia l ly simple in the case of cylindrical ob-

jec ts when par ts of the boundary line and the skeleton are parallel curves. Fig, 4.11

i l lus t ra tes the case of a cylinder with spherical caps t e rmina t ing its ends. We may think

about such a cylindrical object in t e rms of a wiggly worm or a chromosome if we allow

qg7 Fig. 4.11: Morphology of a wiggly worm, which preserves length, width, c i r cum- ference, and area when it bends (a,b). The cu rva tu re of coiling, however, is l imited by the width of the worm (c).

X- and Y-like pa t te rns . Now, if our wiggly object s t r e t c he s or changes its form in some

way, "our intui t ion expects tha t the width of its form will s tay pract ica l ly the same

and we expec t in addition its length to be invariant" (BOOKSTEIN, 1978). What we then

need are precise defini t ions of width and length and a map which preserves these proper-

t ies if the 'worm' bends.

Let the cylindrical object be s t ra ight , then we always can find a coordinate sys tem

so tha t the boundaries are described by the map

x = s if I S l < a y = X (4.1a)

and

x = -+(a + cos(s)) if a< Is I <a+k y = ~sin(s) (4. Ib)

In this case, the medial axis is the line y=0. If the medial line is bended to some arbi-

t ra ry form, we can describe this deformat ion by a map

154

x ---> ( F ( x ) , G ( x ) )

or (4.2)

u = f(s)

v = g(s) ,

whereby we have to secure t ha t the length of the medial line does not change (even

locally). This requires tha t the dis turbed image of the medial line is pa rame t r i c i zed

by arc length, what will be assumed for the following discussion. The next condi t ion

to be sa t i s f ied is cons tan t width. In the s t ra igh t object (equation 4.1) width is defined

perpendicular to the medial axis. If we t r ans fe r this def ini t ion to the wiggly worm,

then width is measured along the normals to the medial axis, i.e. the de format ion (4.2)

def ines the de format ion of the en t i re object which is given by the map

(x,y) ---> (F(x),G(x)) -+ X~(F(x),G(x))

or (4.3)

u = f ( s ) - X g ( s )

v = g ( s ) + x ~ ( s ) ,

where N is the normal to the medial line. The spherical caps (4.1b) are still loca ted

at the endpoints of the medial line and are not deformed {cf. Fig. 4.11).

Finally, we are i n t e r e s t ed how the c i r cumfe rence of the object and its area are

a l tered. From equat ion (4.3) we find the length inc remen t ds of the paral lel pieces of

the boundary to be

• 2 ds B = (I - Xk(s)) - "- -~/(f(s) 2 + g(s) ) ds ,

where k(s) is the curvature of the medial line and • 2 /(~(s) 2 + g(s) ) = I because the medial line is parametri-

cized by arc length ,

and the length of the boundary is

LB= f(l-Xk)ds + f(l +%k)ds + 2Lspherica I caps

(4.4)

(4.5) = 2 fds + 2Lspherical caps ; k: curvature of medial axis.

That is, the length of the boundary is simply twice the length of the medial axis, inde-

pendent of the deformation of the medial line! By definition, the length of the medial

line does not change under arbitrary deformations, and the length of the circumference

is an invariant property of the map (4.3). The area of the object is given by

155

A = ff(l-Xk)dsdl + ff(l +Xk)dsdl

= If dsdX + 2A spherical caps,

+ 2A spherical caps

and again the deformat ion does not a f fec t the area which, there fore , is another invari-

ant for any fixed value of the p a r a m e t e r )~. The map (4.2) describes ideal de forma-

t i o n s - - bending and c o i l i n g - - of worm-like objects , which preserve the length of the

medial axis, the length (surface) of the boundary, and the area (volume) of the objec t - -

proper t ies which -- by intui t ion -- can be expected for biological objects .

However, if we re turn to the discussion of the preceding section, it is c lear tha t

there are l imits for the deformat ion of such worm-like objects . As the curva ture or

the width of the object increases (Fig. 4.6), the surface would develop se l f - in tersec t ions ,

which cannot be real ized. These singulari t ies, which occur at the convex areas of the

wiggly worm (Fig. 4.11), are ' s tandard ca tas t rophes ' , which will be discussed in more

detai l in the next sect ions. For our worm-like object , however, we find a s trong corre la-

tion be tween width and the curva tu re of coiling.

4.1.4 Continuous Trans format ions of Form

D'Arcy Thompson introduced the concept of shape t r ans fo rmat ions to descr ibe the

morphological evolut ion in phylogeny and ontogeny (THOMPSON, 1952). BOOKSTEIN

(1978) summarizes:

"The formal theme of D'Arcy Thompson's method is this: to represent a change

of one shape into another by the single mathematical object which is the

map of one shape onto the other, and then to visualize this mathematical

object. "

This program has been followed with much emphasis, and a common goal was to

find the ' ideal family of t r ans format ions ' , which solves the biological problem, as the

ideal hydrodynamic problem is solved by the conformal mappings. Many published

a t t empts , of course, r e la te D 'Arcy Thompson's problems to classical physical m e t h o d s - -

conformal mappings (Fig. 4.12; RICHARDS & KAVANAGH, 1947), biorthogonal grids

(BOOKSTEIN, t978), and to the Navier -Stoke equat ion as a general solution (GRENAN-

DER, 1976).

The methods, based on D'Arcy Thompson's program, compare d i f fe rent objects

and morphological s ta tes . They describe a map from one s t a t e to the o ther but not

a continuous deformat ion as it is usually obvious in ontogeny and appears likely for

156

A

, - - - V - - ' '. ', ~ - - : - - - I

F- - " ; " ; - "-'/ t J , t L 4 - ~ _ _ , j s

. - - . J , - - - ~

.... 4--- ,>'-;

" i B

Fig. 4.12: Transformation of form in a bivalve (A) and a brachiopod (B}. Both mappings are produced by the conformaI map w=a(z + l /z) + b log z. (Adapted from BAYER, 1978).

phylogeny if one does not believe in macro-mutat ions . Further, the use of 'Car tes ian

grids' implies a continuous, a topological deformation. However, in the real systems

we find somet imes ra ther sudden changes although the deformation is continuous.

Equation (4.3) provides a 'p ro to type ' of such continuous deformations, which are

capable of sudden morphological changes as soon a s the surface enters the 'caust ic '

of normals. Of course, it is not hard to argue that real biological t ransformat ions are

much more complicated than the map (4.3). However, if we do not a t t empt to describe

the ent i re morphology at once but res t r ic t the study to interest ing local pat terns , then

Fig. 4.13: The early ontogeny of Sepia (after NAEF, 1928 and BLIND, 1976) and the c icatr ix of Nautilus (after BLIND, 1976) compared with a swallowtail surface (after THOM, 1970).

157

the map (4°3) provides a f irst order approximation for the evolution of a locally cylindri-

cal surface e l e m e n t - - what may happen, is the evolution of folds on the surface (cf.

Figs. 4.1--4.9). Fig. 4.13 i l lus t ra tes folding processes in the ear ly ontogeny of cephalopods,

which can be re la ted to the evolution of a wave f r o n t - - the s u r f a c e - - which folds

and genera tes singular lines.

The map (4.3) describes a simple case of a wave front and is equivalent to THOM's

(1970,1975} approach to morphogenesis, as soon as not the morphology i tsel f but the

singular se ts are emphasized. In the simple example of equat ions (4.3) the in teres t ing

s ingular i t ies are the cusp and swallowtail ca tas t rophes . How these e l emen ta ry ca tas t rophes

are re la ted to the map (4.3), will be discussed in the next sect ion.

4.2 SURFACE INVERSIONS IN THE SEISMIC RECORD --

THE CUSP AND SWALLOWTAIL CATASTROPHES

In a wide field of applicat ions geologists are concerned with the problem to in te r -

pre t the records of re f lec t ion seismology in a qua l i ta t ive way. The morphological {geometri-

cal} i n t e rp re t a t i on is here usually much more impor tan t than the recons t ruc t ion of t rue

depth r e l a t i o n s h i p s - - a major problem for the seismologist . To solve the in t e rp re t a t ion

problem quan t i t a t ive ly requires to solve the full dynamics of the wave equation, which

cap tures the process of r emote sensing. In the most general sense, waves are spreading

processes which sa t is fy a to ta l hyperbolic d i f fe ren t ia l equat ion (COURANT & HILBERT,

1968). The t rouble is tha t one does not only need the init ial condit ions but also the

f

Fig. 4.14: Sketches of t r ave l t ime records, which have been in te rp re ted as sal t domes with se l f - in te rsec t ing re f lec tors (above: a f t e r DRIVER & PARDO, 1974; below: a f t e r BIJU-DUVAL et al., 1974).

158

ent i re boundary conditions to solve the general equation. Instead of solving the total

hyperbolic equation one can pick a visible s t ruc ture from the wave field, say a cres t

or trough line or, in a general notation, a wave front (WRIGHT, 1979)o To solve the

spreading process of the wave front one can use Huygens ~ principle (COURANT & HIL-

BERT, 1968; OFFICER, 1974). If a wave front is known at a cer ta in t ime, one studies

the evolution of the wavelets that spread from every point of the generat ing wave front.

The successive wave fronts are the envelopes of the wavelets . In the three-dimensional

case the wavelets are spheres, and the successive wave fronts are surfaces F(x,y,z) =

constant . If the propagation of the wavelets is fairly constant along the generat ing wave

front, the successive wave fronts can be approximated by a transport of the original

surface along its no,reals with constant velocity. In two dimensions the spherical wavelets

reduce to circular ones, and the problem is moderate ly simplified. In a mathemat ica l

sense this construct ion of the propagating wave fronts can be described as a continuous

map, and, at least locally, one can suspect that this map can be summarized in a poten-

tial equation V = w(x,y,z).

The in teres t ing s t ruc tures of these potent ia ls are their s table singularities. The

geometr ica l singularit ies can welt be studied by topological methods. DANGELMAYER

& GUTTINGER (1982) discussed the physical aspects of remote s e n s i n g - - Fresnel-zone

topographies, d i f f ract ion pa t te rns e tc . - - careful ly in te rms of ca tas t rophe theory. They

showed that the topological approach yields reasonable results for the inverse sca t te r ing

problem as well as for the on-s i te survey:

"Since, in practice, the analytic analysis is of little interest to the seis-

mologist -- because he needs an overall picture, a 'Gestalt' point of view

to classify his forms-- tackling the inverse problem at its topological

roots comes much closer to the interpreter's intuitive geometric9 i.e. quali-

tative, approach" (DANGELMAYER & GUTTINGEB, 1982).

This "quali tat ive approach" is, indeed, still more important for the geologist than for

the seismologist . Interpret ing seismograms from areas with salt domes it may be a

reasonable question whether in tersect ing re f l ec to r s are real is t ic or singular pa t te rns

within the seismogram. Shadow zones, double ref lec t ions and 'hyperbolic re f lec t ions '

are well known pa t t e rns of the record. Figs. 4.14 and 4.15 give some examples how

such 'anomalies ' show up in the t rave l t ime record. Much more impressive s t ruc tures

of these types have been described from 3.5 to 12 kHz records (echograms) {JOHNSON

& DAMUTH, 1979; DAMUTH, 1980;FLOOD, 1980; EMBLEY, t980). The in terpre ta t ion

of echograms and of pa t te rns like 'hyperbolic ref lec t ions ' became of interest during

the last decade, when sedimentologists s ta r ted to in terpret the deep sea morphology

for their purposes.

In the case of high frequency echograms, the wave front evolution can be well

159

Fi, g. 4.15: Trave l t ime records with in te rsec t ing ref lect ions , ' extensions of sed imenta ry layers into the basement s ' and high energy zones within an isotropic sed iment cover,

approximated by l inear rays (FLOOD, 1980). tn the case tha t the model is reduced to

the 1seismic t rack ' , the problem is fur ther simplified. The re la t ion be tween a (cylindrical)

sur face and the t r ave l t ime record along the t rack line can be viewed as a map R 2 . . . .

R 2 or ( x , y ) ~ (u,2t), where (x,y) are the spat ia l coordinates of a sect ion through the

sur face and (u,2t) are the space - t r ave l t ime coord ina tes of the record. In this notat ion,

the in t e rp re t a t ion of the seismogram becomes identical with the problem to de te rmine

the map tsL This approach was recent ly s t ressed by FLOOD (1980), who analyzed periodic

wavefields. He found tha t 'hyperbol ic re f lec t ions ' depend on the wavelength of sinusoidal

surfaces and wate r depth. But the approach by any global sur face approximation is much

160

too general : There is no real chance to es tabl ish a suff ic ient c a t a l o g u e of global morphol-

ogies. One problem, the re fore , is whe the r one can classify sur face points in such a way

t ha t the i r image on the seismic record can be uniquely ident if ied. This, of course, is

a topological problem. It is the purpose of this sec t ion to der ive a c lass i f ica t ion of

t r a v e l t i m e images for the two-dimensional (seismic track} problem. This c lass i f ica t ion

will be one which re l a t e s the t r a v e l t i m e pa t t e rn s to local topological proper t ies of the

re f lec tor . There are two ways to discuss the re la t ion be tween sur face proper t ies and

the t r a v e l t i m e image, versus the wave fronts or versus the plane map approach. Both

methods will be discussed to demons t r a t e d i f fe ren t aspec ts of ca t a s t rophe theory.

4.2.1 Compute r Simulat ions of Rays, Wave Fron ts and

Trave l t ime Records

Ray theory becomes dras t ical ly s implif ied if one assumes tha t the rays are s t ra igh t

lines, and t ha t the source and the rece iver are located in a single point. This s i tua t ion

is near ly rea l ized for deep sea echograms (FLOOD, 1980), but it can also be used as

a f irst approximat ion for o ther re f lec t ion seismograms. Under these special condit ions,

the s u r f a c e of the r e f l ec to r can be viewed as the envelope of the wavelets , it forms

a 'wave f ront ' , and the normals of the r e f l ec to r are the rays along which the r e f l ec t ed

wave f ront propagates (e.g. GRANT & WEST, 1965). If the sec t ion through the r e f l ec to r

along the t rack line is given analyt ical ly , one can immedia te ly wr i te down the (linear)

ray equat ions.

tf the re f l ec t ing sur face is given in expl ic i t form as y = f(x), then the l inear rays

a re given in pa rame t r i c i zed form as

u = Xo - Tf'(xo)

w = f(xo) + T

(4.6)

where (Xo, f(Xo)) def ines a point on the re f l ec to r , (u,w) are the spat ia l coordina tes of

the ray which passes through the point (Xo, f(Xo)), and T is a p a r a m e t e r which genera tes

the ray. It turns out t ha t the pa t te rns , which can be found, are not a proper ty of the

r ays , but t h a t they need to be formed by the morphology of the r e f l ec to r (WRIGHT,

1979). The r e f l ec to r impresses its pa t t e rn gener ica l ly onto the sensing wave system

(DANGELMAYER & GOTTINGER, t982), in this case, onto the family of rays.

Equat ion (4.6) allows to draw the l inear ray p a t t e rn for any d i f fe ren t iab le function.

This was done for a sinusoidal funct ion in Fig. 4.16, and it becomes c lear tha t the seis-

mic record will be dis turbed in the areas of ray overlap. In these areas one finds double

and t r iple ref lec t ions . The two p ic tures of Fig. 4.16 show the farf ie ld p a t t e rn and, en-

161

Fig. 4.16: Computer simulations of the linear ray system of a sinusoidal re f lector. Left: farfield pat tern, right: nearfield pat tern.

larged, the nearfield pat tern (close to the surface). Fig. 4.17 i l lustrates that the ra ther

complicated farfield pat tern results from superpositions of various nearfield pat terns .

If the rays, which pass through the surface e lement of a wave front, are known,

then the evolution of the wave front along the ray family can be simulated, a f ter the

ray equations have been normalized by arc length, tn the case of linear rays one finds

the successive wave fronts from the continuous map

u = x - Tf'(x)//(l+f'(x) 2 ) (4.7)

w = f ( x ) + " r / / ( l + f ' ( x ) 2 ) .

(u,w} are the new spatial coordinates of a point {x,f(x)) on the original wave front, and

T is the distance between these two points { T = vt, v: velocity, t: time). The equations

(4.7)) define a continuous map from the plane into the plane

{(u,~) ~(x,y)]S(~):(~,v)}, (4.8)

where (Xo,Y o) are the points on the (cylindrical) ref lector . Fig. 4.17 i l lustrates how the

wave fronts evolve from sinusoidal surfaces, and how they are folded within the areas

in which the rays in tersect . As was noted earlier, the ref lec tor impresses its s t ructure

onto the family of rays and onto the wave fronts. This is i l lustrated in Fig. 4.18 where

the re f l ec to r was simulated by a cycloid. When the general ized cycloid develops a cusp,

a local singular point, then the ray and the wave front pa t te rn change dramatical ly.

Now, it is not hard to see that this is not a s t ructural ly stable situation. First one

162

. . . . . - _ ~ = - = - = - = - = - :

• ~ ~ ¢ ~ , ~ , ~ ~

Fig. 4.17: Co mpu te r s imula t ions of the evolut ion of rays and wave f ronts from sinusoidaI re f lec tors .

Fig. 4.18: Linear ray sys t em and wave f ronts of a cycloid. The degenera ted case occurs when the cycloid develops a cusp.

163

can argue that the cusp is morphologically not stable. Any small disturbance turns it

into a pa t te rn like Fig. 4.17. In addition, in this degenera ted case, the ray approach

does not fit Huygens' principle. Fig. 4.19 i l lustrates how the envelope of the wavelets

evolves near the singular surface point. It turns out that the successive wave fronts

' ignore ' the singularity of the ref lector . They are again continuous functions, which can

be approximated by a surface model like the general ized cycloid of Fig. 4.18. This obser-

vation allows to ignore such singularities for most of the following discussion.

Fig. 4.19: Huygens' wave front construction by wavelets near a singular surface point.

The last point to be discussed is how the surface maps onto the t ravel t ime record.

To find the map (Xo,Y o) -{u,vt /2) one has to section the ray pa t te rn in a cer tain

distance of the ref lector , i.e. along the survey track line. The modified horizontal coor-

dinate 'u' (Fig. 4.20) is a function of the inclination of the rays. If the track line is

taken as the zero level, y = 0, then the horizontal dislocation is

U X 0

Fig. 4.20: The parameter system for the linear ray model: (x,y,): surface coordinates of the ref lector , u: horizontal coordinate of source and receiver at y = 0, r: distance between source (receiver) and surface.

164

AX = (x - u) = -f'(x)f(x). (4.9)

The horizontal dislocations can be easily derived from equations (4.6}} and the condition

y = w = 0 if the pa ramete r T is el iminated. Now, the t ravel t ime is proportional to the

distance be tween the shot point and the re f lec tor point {Fig. 4.20) or proportional to

r = ¢ ' ( ( u - x ) 2 + f ( x ) 2 ) • (4.10)

Thus, if all variables are expressed in te rms of x and f(x), one finds the map (FLOOD,

1980) {x,f(x)) ~ {u,w) as

u = x - f'(x)f(x)

r = f(x)/(l-f'(x) 2) = vt/2.

{4.11)

The equations allow to simulate t rave l t ime record numerically for any di f ferent iable

surface t race . Fig. 4.21 gives some examples of such simulations. The figures include

the ray systems and the simulated t rave l t ime records. The surface models are all convex,

and this gives 'hyperbolic ref lec t ions ' on the t ravel t ime record. The mapping equations,

which have been discussed so far, allow to simulate the linear ray pa t te rn , the evolution

of wave fronts and the t rave l t ime record for any d i f ferent iable surface e lement . Indeed,

Fig. 4.21: Computer simulations of rays and of the t rave l t ime record for various s u r f a c e models {fourth order polynomials}. The saddle points of the t rave l t ime record have been set to zero depth.

165

.- ..... ~?.:.'.' . . . . . ".::,a . . . . . . :~c."..:';:iI'~':'.':..~.-.x.

L.'- . . . . . . t " ' "~'bx" .- . ' :'-,::,>

S, 7 • | - , , :

Fig. 4.22: Simulations of the t ravel t ime record like in Fig. 4,21 but as point plots for ' layered media' .

these approximations can be extended to three-dimensional surface s t ructures . But they

are still much too general. There is an infinite number of possible functions~ which can

be used to approximate a re f lec tor surface, and these functions may depend on a large

number of parameters so that we are not able to catalogue the t ravel t ime pat te rns

within the pa ramete r space. The major aim of the next sect ion is, therefore , to analyze

the local propert ies of ref lectors , and to show how these local propert ies impress their

s t ruc ture generically on a sensing wavefield and, therefore , on the t ravel t ime record.

The computer simulation has the advantage that more complicated systems can

be simulated. Thus, one can t ransform the geometr ica l simulation into a point plot

(Fig. 4.22) which resembles the received energies to some extent , i.e. the observed travel-

t ime record. The comparison of such a plot for a mult i layer-system (Fig. 4.22) with

t rave l t ime records {e.g. Fig. 4.15) shows that not only 'hyperbolic re f lec tors ' may arise

from local concave surface elements . High energy zones, which are somet imes found

in otherwise nearly isotropic areas, may well be re la ted to the surface morphology ra ther

than to a property of the sediment cover.

4,2.2 Local Surface Approximation

tn the preceding discussion it became clear that the ref lect ing surface impresses

its s t ruc ture onto the rays, onto the wave fronts and, therefore , finally onto the t ravel-

t ime record. It is this generic situation which makes remote sensing a s tructural ly stable

p r o c e s s - - and s t ructural s tabil i ty alone secures that one has a chance to recons t ruc t

the surface propert ies. On the other hand, it turned out that it is unreasonable to work

with global surface s t ructures . Therefore, at first one needs a classif icat ion of the cri t i-

166

cal po in t s on t h e r e f l e c t i n g s u r f a c e , This c l a s s i f i c a t i o n was done by D A N G E L M A Y E R

& G U T T I N G E R {1980, 1982) in de t a i l for t h r e e - d i m e n s i o n a l p rob l ems . Here t h e d i s cus s ion

will be r e s t r i c t e d to p l ane c u r v e s , i .e . to a f i r s t a p p r o x i m a t i o n o f cy l ind r i ca l r e f l e c t o r s

a long t h e t r a c k l ine. In th i s c a se , t he usua l s i t u a t i o n will be t h a t t he func t i on , wh ich

d e s c r i b e s t h e s u r f a c e , is o f bounded v a r i a t i o n (GUGGENHEIMER, 1977). For m o s t rea l

s u r f a c e s we c a n e v e n suppose t h a t a s u r f a c e l ine is o f sma l l v a r i a t i o n , and, t h e r e f o r e ,

t h e usua l s i t u a t i o n will be t h a t t h e c u r v e can be loca l ly a p p r o x i m a t e d by a Tay lo r s e r i e s

f(x) = a 0 + alx + a2x2 + ... + higher terms. (4.12)

Fig. 4.23: R a y s y s t e m s (x,z) and ' t r a v e i t i m e r eco rd ' (x,t) o f l inea r s u r f a c e e l e m e n t s . The s t r a i g h t l ines m a p on to s t r a i g h t l ines, bu t t he h o r i z o n t a l d i s loea - t ion c a n c a u s e on lapp ing f e a t u r e s and shadow zones .

Now, t h e r e a r e two i n t e r e s t i n g cases : If t he c o e f f i c i e n t s a 2 = 0 and a 1 ~ 0, t he r e f l e c t o r

equa l s loca l ly (near x = 0) a s t r a i g h t l ine, This s t r a i g h t r e f l e c t o r e l e m e n t m a p s on to

a s t r a i g h t l ine on t h e t r a v e l t i m e r e c o r d (Fig. 4.23), as c an be p roved by u se of e q u a t i o n

(4.11). The only s p e c t a c u l a r p a t t e r n s a r e s u m m a r i z e d in Fig. 4.23. An inc l ined s t r a i g h t

l ine is d i s l o c a t e d a long t h e h o r i z o n t a l c o o r d i n a t e . Th i s h o r i z o n t a l d i s l oca t i on can c a u s e

shadow zones and on lapp ing p a t t e r n s in t h e t r a v e l t i m e r eco rd , E x a m p l e s for th i s d i s t u r b -

a n c e o f t he r e c o r d a r e t he i n t e r s e c t i o n s b e t w e e n s e d i m e n t c o v e r and b a s e m e n t in

Fig. 4.15. On t he o t h e r hand , t h e g e o m e t r i c a l p r o p e r t y ' t o be a s t r a i g h t l ine ' is p r e s e r v e d

u n d e r t h e map , i .e . t h e r e is no s p e c t a c u l a r d e f o r m a t i o n on t he t r a v e l t i m e r eco rd .

T h i n g s b e c o m e m o r e i n t e r e s t i n g if t h e p a r a m e t e r a 2 is n o n - z e r o . In th i s c a s e , t h e

Tay lo r e x p a n s i o n ca n be s i m p l i f i e d in t he fo l lowing way: One l o c a t e s a new c o o r d i n a t e

s y s t e m at x = a ° by use o f t h e m a p x - - ~ x - a o. T h e n one r o t a t e s t h e new c o o r d i n a t e

s y s t e m in such a w a y t h a t t h e t r a n s f o r m e d x - a x i s c o i n c i d e s wi th t h e t a n g e n t a t x = 0,

and t h a t t h e y - a x i s c o i n c i d e s w i t h t h e n o n - o r i e n t e d n o r m a l a t t h i s po in t . T h e t r a n s f o r m a -

t ion r e d u c e s t h e T a y l o r s e r i e s to t he fo rm

1 f(x) = ~ kx 2 + ... + higher terms.

The p a r a m e t e r k is t h e local c u r v a t u r e o f t h e c u r v e a t t h e po in t x = 0

(4.13)

167

k = f " ( 0 ) / ( 1 - f ' ( 0 ) 2 ) 3 / 2 • (4.14)

Dependent on the sign of k the point is e i ther a maximum or a minimum in the local

coordinates. The discussed t ransformation re la tes the local s t ructure of the re f lec tor

to its curvature at the cr i t ical point, a well known procedure from different ial geometry

(GUGGENHEIMER, 1977; DO CARMO, 1976). If the Taylor series s ta r t s with higher 2

te rms than x , one has a degenerated situation, and one will find pa t te rns like in the

case of the cycloid of Fig. 4.18. Such situations will be avoided during most of the

following discussion.

In the case 0 <k< co the only spectacular points are local minima, i.e. a locally

concave ref lec tor . Under these constraints there exists an area where the rays in tersect ,

and one will receive ref lect ions from several surface points (Fig. 4.16). Therefore, the

stable spatial pa t te rns of rays will be analyzed in the next section.

4.2.3 Linear Rays, Caustics and the Cusp Catas t rophe

The interest ing s t ructures in ray geometry are caustics (e.g. BEN-MENAHEM &

SINGH, 1981) -- the envelopes of the rays or the boundary line of the area where rays

overlap (Fig. 4.16). In geometrical optics a caust ic appears as a line of high intensity

(e.g. NYE, 1979), in ref lect ion seismics it separates those areas, which are covered by

a single family of rays, from those areas with two or more intersect ing ray systems.

Within the linear ray model the caust ic is found as the locus of the radii of curvature

of the re f l ec to r line. If the ref lect ing curve is given explicitly, one has the classical

formulae (e.g. GUGGENHEIMER, 1977)

u = x - f'(x)(l+f'(x)2)/f"(x)

w = f ( x ) + ( l + f ' ( x ) 2 ) / f " ( x ) (4.15)

or by use of the curvature k = 1/R

u = x - R f ' ( x ) / ( l + f ' ( x ) 2) (4.16)

w = f ( x ) + R / ( l + f ' ( x ) 2

The second set of equations re la tes the caustics to the continuous map (4.7) for the wave

front evolution. From the relation T (t) = R(x) one finds the t ime t, at which a cer tain

point of the wave front arrives at the caustic.

Now, if the re f lec tor line is locally approximated by a parabola y = bx 2, the equa-

tions for the caust ic take the form

168

or i m p l i c i t l y

u = 4b2x 3

I w = 3bx 2 +-~

(4.17)

= 1 3 27u 2 16b(w - -2--~) .

This is a s e m i c u b i c pa r abo l a h a n g i n g ove r t he g e n e r a t i n g c o n v e x r e f l e c t o r l ine. I ts on ly

p a r a m e t e r is t h e local c u r v a t u r e o f t h e r e f l e c t o r (b = k/2). T h e r e f o r e , t h e c a u s t i c is

a s t a b l e s p a t i a l p a t t e r n wh i ch is un ique ly d e t e r m i n e d by t he local p r o p e r t y o f t h e r e f l e c -

tor .

The s e m i c u b i c pa rabo la , wh i ch a p p e a r s h e r e as t h e c a u s t i c , is wel l known as t he

c r i t i c a l s e t o f T H O M ' s (1975) cu sp c a t a s t r o p h e . Indeed, for t he r ay p a t t e r n s d i s c u s s e d

ea r l i e r t h i s c u r v e bounds t h e a r e a w h e r e t h e i n t e r s e c t i o n o f r ay s c a u s e s a b n o r m a l r e f l e c -

t ion p a t t e r n s . To s e e how c a t a s t r o p h e t h e o r y is invo lved we c h a n g e t he v i e w p o i n t s l igh t ly .

In t he s e i s m i c r e c o r d one p i cks up t h e r e f l e c t i o n s a long a l ine wh ich is l o c a t e d in a

c e r t a i n h e i g h t above t h e r e f l e c t o r . T h e h o r i z o n t a l d i s l o c a t i o n o f a r e f l e c t o r po in t on

t h e r e c o r d c a n be found f r o m t h e e q u a t i o n s o f t h e n o r m a l s (4.6) by e l i m i n a t i o n o f t h e

p a r a m e t e r • , and one f inds t h e n e w h o r i z o n t a l c o o r d i n a t e on a t r a c k l ine o f h e i g h t

w to be

u = x - f'(x)(w - f(x)). (4.18)

By s e t t i n g w = c o n s t a n t ( the t r a c k line) one ha s a r e l a t i on b e t w e e n t he s t a r t p o i n t o f

t he r ays and t h e poin t w h e r e t h e y hi t t he t r a c k l ine. Now, one can i n se r t t h e e q u a t i o n

for a local p a r a b o l i c r e f l e c t o r a p p r o x i m a t i o n , y = bx 2, and f rom (4.18) one f inds t he

po in t w h e r e t h e r ay i n t e r s e c t s t h e t r a c k line:

u = x - 2 b x ( w - b x 2 )

or (4.19)

u = (1 - 2 b w ) x - 2 6 2 x 3 .

B e c a u s e t h e c u r v a t u r e shou ld no t v a n i s h a t t h e s p e c t a c u l a r po in t , one c a n s t a n d a r d i z e

th i s e q u a t i o n :

u _ (i - 2bW)x - x 3

2b 2b 2

or (4.20) 3

U = x - sx

wi th obvious p a r a m e t e r i d e n t i f i c a t i o n s .

169

The new p a r a m e t e r ' s ' is a c o m p o s i t e s t r u c t u r e of t he local c u r v a t u r e o f t he r e f l e c -

tor (b) and o f t he h e i g h t of t he t r a c k line (w) above t h e s p e c t a c u l a r point ; thus , a va r i a -

t ion of Vs' c o r r e s p o n d s e i t h e r to a c h a n g e o f t he h e i g h t o f t he t r a c k line o r / a n d to

a c h a n g e o f t h e local c u r v a t u r e . The r e s u l t i n g cub ic e q u a t i o n (4.20) c an be s imp ly

ana lyzed . It a l lows for t h r e e poss ib le s i t u a t i o n s . If s > 0, t hen t h e cub ic is m o n o t o n o u s l y

i n c r e a s i n g (or dec r e a s i ng ) , and t he m a p x ~ u is one to one , i.e. t h e r e e x i s t s only one

f a m i l y of n o n - i n t e r s e c t i n g rays . If s <0 , t h e n t h e cub ic has a m a x i m u m and a m i n i m u m ,

and t h e r e e x i s t s an a r e a w h e r e t he m a p x ~ u is no t un ique ly d e t e r m i n e d , i .e. t h e r ay s

i n t e r s e c t . The c a s e s = 0 d e f i n e s t he t r a n s i t i o n s t a t e b e t w e e n t h e s e two poss ib i l i t i e s ,

it d e f i n e s t h e loci w h e r e t h e c a u s t i c i n t e r s e c t s t he t r a c k line.

Now, t he c r i t i c a l a r e a in t he (x , s}-space is g iven by t he e x t r e m a of t he cub ic

e q u a t i o n . T h e s e e x t r e m a de f i ne t he loci on a t r a c k line, w = c o n s t a n t , w h e r e t h e c a u s t i c

i n t e r s e c t s t h e t r a c k line. T he e x t r e m a a re g iven by

U = 0 = s + 3 x 2 X

or (4.21)

s = -3x 2 •

F r o m e q u a t i o n (4.20) and (4.21) one f inds t h e c r i t i c a l l ine in t he (u , s ) - space by e l i m i n a t i o n

o f t he v a r i a b l e x:

U2/4 = -s3/27 (4.22)

i.e. up to a proper parameter setting the same equation as before. If one relates a

change of the parameter 's' to a change of track height (b = constant), then the semi-

cubic parabola describes just the earlier discussed caustic. On the other hand, we have

a l r e a d y s e e n t h a t t h e m e a n i n g now is m u c h m o r e g e n e r a l b e c a u s e t h e p a r a m e t e r ' s '

i nc ludes a lso c h a n g e s o f t he local c u r v a t u r e of t he r e f l e c t o r l ine. Thus , e q u a t i o n (4.20)

c a p t u r e s t he poss ib le spa t i a l p a t t e r n s in the i r m o s t g e n e r a l s e n s e by a m i n i m a l s e t o f

p a r a m e t e r s . One can use e q u a t i o n (4.20) to d raw a p i c t u r e o f t he c r i t i c a l s u r f a c e in

t h e t h r e e - d i m e n s i o n a l s p a c e {u,x,s). This fo lded s u r f a c e is shown in Fig. 4.24. Eve ry

s e c t i o n s = c o n s t a n t t h r o u g h th i s fo lded s u r f a c e d e s c r i b e s t he d i s loca t ion of t h e h o r i z o n t a l

r e f l e c t o r c o o r d i n a t e a long a poss ib le t r a c k l ine (for a f ixed local c u r v a t u r e ) .

To a r r ive a t t he f inal c a t a s t r o p h e r e p r e s e n t a t i o n , t he v i ewpo in t ha s to be c h a n g e d

once more . The cub ic e q u a t i o n (4.20) c an be d i s c u s s e d in t e r m s of i t s d i s c r i m i n a n t or

in t e r m s of t h e n u m b e r o f i t s roots . This g ives the add i t iona l i n f o r m a t i o n how m a n y

r ays m a y i n t e r s e c t a t a poin t in t he spa t i a l c o o r d i n a t e s . The d i s c r i m i n a n t t a k e s t he

fo rm

D = u2/4 - s3/27. {4.23)

170

K

Fig, 4.24: The ca t a s t rophe set of the cusp ca tas t rophe ,

For D = 0 one has the points which sepa ra te the p a r a m e t e r se t t ings leading to a single

root (D > 0) from those tha t cause t r iple roots (D < 0). Now, the quest ion how many

roots a cubic equat ion has is ident ical with the quest ion how many ex t r ema a quar t ic

equat ion may have. The cubic can, the re fore , be embedded into the ca t a s t rophe potent ia l

V = x4/4 + ux2/2 + sx, (4.24)

which has been published as the cusp ca t a s t rophe (Rieman-Hugoniot ca t a s t rophe in THOM,

1975). This c a t a s t ro phe po ten t ia l cap tu res the discussed two-dimensional ray pa t t e rn s

in the i r most general topological behavior ,

The previous discussion of the cusp ca t a s t rophe allows to classify the t r a v e l t i m e

record in t e rms of depth and local cu rva tu re of the r e f l ec to r line. To do this explici t ly

one can wr i te the local parabol ic approximat ion as y = - a + bx 2, where the p a r a m e t e r

' a ~ indica tes depth, The t rack line is then located at depth zero. From the equat ions

(4.11) one finds the local t r a v e l t i m e image:

u = (l-2ab)x + 2b2x 3

r = (-a+bx2)(l-4b2x 2) 1/2

(4.25)

As turned out from the analysis of the cusp ca tas t rophe , the c r i t i ca l set is given by

s = (l-2ab)/(2b 2) = O. (4.26)

The p a r a m e t e r iden t i f ica t ion a = - w re la tes this r ep resen ta t ion to equat ion (4.20). Now

the cr i t ica l se t can be r ewr i t t en in t e rms of the pa rame te r s (a,b) as

171

/k

O - - K ~

i

O _ _

b

A w - W - ~?

e-% 7- " T Y

m •

Fig. 4.25: The morphology of local elements on the ref lec tor line (upper graph) and their image on the t ravel t ime record. The surface elements and their images are located in the parameter system depth (a) and local curvature of the ref lec tor line (b). Inside the hyperbolic boundary the ref lec tor image is inverted, i.e. it is the domain of hyperbolic reflections.

172

1 - 2ab = 0 . {4.27)

Equation {4.27) descr ibes a hyperbolic boundary line in the (a,b)-space, and equation (4.25)

allows to compute the image of various parabolas dependent on the choice of the parame-

ters 'a ' and tbL Fig. 4.25 i l lustrates the relationship between the local morphology of

the re f l ec to r and its image on the t rave l t ime record within the pa ramete r space (a,b).

From the discussion of the cusp ca tas t rophe we know that the pa ramete r 's ' a f f ec t s

the in tersec t ion of the caust ic with the track line. In analogy to the pa ramete r 's ' one

can vary equation {4.27):

c - 2ab = 0 . (4.28)

This defines a family of hyperbolae in the (a,b)-space of identical t ravet t ime image

(different depth location), which result from di f ferent conditions (Fig. 4.25).

Thus, the previous discussion provides us with some pract ical results, at least for

the in terpre ta t ion of echograms. The analysis of the t rave l t ime record, in te rms of local

propert ies of the re f lec tor line and of caustics, allows to classify the t rave l t ime images

by a minimal set of parameters , and it becomes clear that these p a r a m e t e r s - - depth

and local c u r v a t u r e - - are not independent with respect to the e f f e c t s they produce

on the t rave l t ime record. In addition, it becomes ctear that the ex t rema of a ref lec t ing

surface are s tabte points. In this case, one can approximate the re f l ec to r line locally

by a parabola without any rotat ion of the local coordinate system, and the point (0,f(x))

is recorded at its cor rec t ly horizontal position as well as with the cor rec t t rave t t ime.

This allows to e s t ima te the wavelength of sand waves and similar s t ructures along the

track line from the original t rave l t ime record. Fur thermore, the amplitude can he esti-

mated as well in te rms of t ravel t ime, and the relation to the cusp ca tas t rophe allows

to draw charts , from which the local curvature can be es t imated .

4.2.4 Wave Fronts and the Swallowtail Ca tas t rophe

The next point of in teres t is how the wave fronts evolve near a cuspoid caust ic .

Within the linear ray model, a wave front is given as a set of points {on the family

of rays} which have equal dis tance from the r e f l ec to r

{(u,w) E (x,y) 1 ((x-u) 2 + (Y-W)2) I/2 = r = const.}. (4.29)

This two-dimensional relationship can be extended to the three-dimensional case

(DANGELMAYR & GOTTINGER, 1982). Here it is more appropriate to re turn to the

continuous map for the wave fronts, as it was derived in equation (4.7). From these

173

equations one finds a wave front by se t t ing • = cons tant . The only cr i t ica l set for the

rays is the cuspoid caust ic . Therefore , as WRIGHT (1979) points out, we should "expect

the c r i t i ca l value graph of the cusp ca tas t rophe , which is the b i furca t ion se t of the

swallowtail". Indeed, if one draws the wave fronts for several values of the p a r a m e t e r

T to s imula te thei r evolut ion from a locally parabolic ref lec tor , then they take the

form of sect ions through the swallowtail ca t a s t rophe (Fig. 4.26) as far as they are located

inside the caust ic . To see in detai l how the swallowtail is r e la ted to wave fronts one

/~il'l//llll/l/ltlllllllltlllltllfllll'llllllllllllll~ IIIIIIIIIIItltlIItlfltt'tIII'tlIIItlIIHIIlItt~ / /ltfft/////llltlltflliltttt![II!{llllIlllllil;

Fig. 4.26: Evolution of parabolic wave fronts into swallowtails. The unfolding of the wave fronts is caused by the folded ray system, which is due to the cusp ca tas t rophe . The numbers indicate values for the p a r a m e t e r of evolution -

= vt.

can develop the equat ions (4.7) in Taylor series. If the local parabol ic surface approxima-

t ion formula is inser ted into equat ions (4.7), then the Taylor expansion of these equat ions

up to order 4 gives the approximat ions

u = (l-2bT)x + 4b3~x 3

w = T + b(l-2bI)x 2 + 6b4x 4.

(4.30)

If b ~ 0 and

where

T~ O, then these equat ions can be s tandard ized to the form

W = sx 2 + 3x 4

U = 2sx + 4x 3

W = (w-T)/(2b4T), U = u/(b3T),

(4.31)

174

s = (l-2bT)/(b3T).

On the o ther hand, the ca tas t rophe potent ial of the swallowtail is defined as

V = x 5 / 5 + a x 3 / 3 + b x 2 / 2 + c x . {4.32)

Its cr i t ical value graph in the pa ramete r space (a,b,c) is defined by its der ivat ives

(THOM, t975}

V xxx

V = x 4 + ax 2 + bx + c = 0 x

V = 4x 3 + 2ax + b = 0 xx

= 12x 2 + 2a = 0.

(4.33)

If one uses the first two derivat ives to solve for the parameters b and c in te rms of

a and x, one finds the map

b = -4x 3 - 2ax

c = 3x 4 + ax.

(4.34)

By a proper choice of the signs (take x - - -x) this map becomes equivalent to the

local Taylor expansion {4.31) if one takes the foIlowing pa ramete r ident if icat ions

IJ - c, W ~- b, a - s.

At least locally (by an approximation up to order 4), one finds that the wave fronts

are equivalent to sections a = s = constant through the catastrophe set of the swallow-

tail. Again, one finds that a standard catastrophe on THOM's (1975) list gives a good

approximation to the ray model. In this case, the swallowtail catastrophe describes rather

pretty the evolution of wave fronts.

4.2.5 Wave Front Evolution and the Travel t ime Record

An examinat ion of the original Taylor approximations for the wave fronts (equations

(4.30)) shows that the sect ions s = const , through the swallowtail are located on a line

w = T. The project ions onto the {u,w)-plane { {u,w)~(x,y) ) of these sect ions through

the modified swallowtail (4.30) give the typical evolution pa t te rn for the wave fronts

{Fig. 4.26}. The sect ions through the swallowtail are si t t ing one behind the other in

the caust ic . Now, the pa ramete r T can be wr i t ten as ~r = vt (v: velocity, t: time}, and

we find that the way, in which the swallowtail is si t t ing above the cuspoid caustic,

depends on the sonic velocity of the medium or on the velocity of the wave front dislo-

cation. On the other hand, the velocity cannot a f fec t the spatial pa t te rn -- the caust ic --

as turned out during the discussion of the cusp catas t rophe. The caust ic is a s tructural ly

175

s t a b l e spa t i a l p a t t e r n , wh ich only depends on t h e local s u r f a c e s t r u c t u r e , i .e. t h e local

c u r v a t u r e .

Now, t h e ana lys i s of w a v e f ron t s adds a th i rd d imens ion , t i m e , to t h e t w o - d i m e n -

s iona l s p a t i a l c o o r d i n a t e s . T he m a p f rom t he r e f l e c t o r to t he t r a v e l t i m e r eco rd , t h e r e -

)f x

Fig. 4,27: Two v iews of t he m o d i f i e d swa l lowta i I c a t a s t r o p h e . The swa l lowta i l s i t s on a l ine y = vt , T he s e c t i o n s y = c o n s t a n t t h r o u g h th i s c a t a s t r o p h e s e t a r e t he r e c o r d e d r e f l e c t i o n s n e a r a c o n c a v e s u r f a c e e l e m e n t .

176

fore, turns out to be a map R 3 ~ R 2 or ( x , y , t ) ~ (u,t). From the caus t ic we know

tha t it is a s table pa t t e rn in the (x,y)-plane. In addition, we know tha t the images of

the sect ions through the swallowtail need to have s tab le posit ions inside the caus t ics

(Fig. 4.27}. A change of the sonic velocity, the re fore , cannot a l t e r these spat ia l pa t te rns ,

i t can only a f f ec t the recorded t r ave l t ime , i.e. the spreading veloci ty of the wave front.

For the t r a v e l t i m e record this means tha t the t ime-axis is s t r e t ched or compressed.

In the space - t ime coordina tes the sonic veloci ty can only a f f ec t the t i m e - a x i s . The

only allowed t r ans fo rma t ion of the ca t a s t rophe se t {Fig. 4.27) by a change of the ve loc i ty

is, the re fore , pure shear in the (y, t)-plane with equat ion t = ay. This t r ans fo rma t ion

does not a l t e r the spat ia l coordina tes of the sect ions through the modified swallowtail

(4.30)), i.e. the i r projec t ions onto the spat ia l (x,y)-plane.

How does the local surface pa t t e r n map onto the t r ave l t ime record? To study

this, one has to sect ion the modified swallowtail (4.30) by a plane w = y = cons tan t

(in the equat ions (4.30) 'w' means depth). This gives the image of the local surface

in the space - t ime coordinates , in the (u,vt /2)-plane. Fig. 4.27 gives two views of this

modified swallowtail , which have been sect ioned by a plane w = cons tan t . Again the

ca t a s t rophe approach summar izes the pa t te rns , which can ar ise from a local concave

r e f l ec to r a rea in a very condensed way. By comparison of the observed record with

plane sec t ions through the th ree-d imens iona l ca t a s t rophe set , one can get reasonable

qua l i t a t ive in format ion about the local surface s t ruc ture . Especially the 'hyperbol ic r e f l ec -

t ions ' turn out to represen t local surface inversions, which are re la ted to the wave

fronts , which have en t e red the local caus t ic .

4.2.6 The Traveltime Record as a Plane Map

A second approach to analyze the re la t ion be tween the local r e f l ec to r geomet ry

and the t r a v e l t i m e record is versus the plane map (x,y} -~ (u ,v t /2 ) , which has been defined

by equat ions (4.11}. This method is very close to FLOOD's (1980) study of 'hyperbol ic

r e f l ec t ions ' in deep sea echograms. Again the ca t a s t rophe approach versus local proper t ies

of the r e f l ec to r will provide general results .

Firs t , one has to specify the mapping equat ions (4.11). To int roduce depth explici t ly,

the r e f l e c t o r line is locally approximated by a parabola f(x) = a + bx 2 like in equat ions

{4.25). The Taylor expansion of ' r ' (equation (4.25)) up to order 4 gives the local map

u = (l+2ab)x + 2b2x 3

r = a + b(l+2ab)x 2 + 2b3(l-ab) x4. (4.35)

177

Although this map is very similar to the evolution equation of the wave fronts

(4,30), it is not possible to t ransform it into the standard form of the swallowtail (4,34)

by means of simple transformations, which preserve the topological s t ructure , Indeed,

as follows from the previous discussion, we should expect arbi trary sect ions through

the swallowtail ra ther than its standard form,

Now, instead of r we can use r 2 = v2t2/4 as the dis tance measurement between 2 the source and the ref lect ion point. The square r ms a monotonous function of r because

r > 0 (Fig. 4.20). This t ransformat ion is not unusual to a seismologist (e.g. KERTZ, 1969},

and it allows to formulate the dis tance r as

2 (f(x)2 )2 r = + (n-x

2 x 2 2 = a + (l-2ab) + u - 2ux + b2x 4.

(4.36)

This equation can be rewri t ten as a ' ca tas t rophe potent ia l ' if b ¢ 0,

V -- (r 2 - a2)/b 2

4 (l-2ab)x2 U 2 = x + - 2Ux + b 2

or

V = x 4 + 2vx 2 - 2Ux + U 2

with obvious parameter identifications.

(4.37)

The first derivat ive of this ' ca tas t rophe potential ' defines U:

V x = 0 = 4x 3 + 4vx - 2U,

i.e. the original cusp catastrophe (eq. 4.20).

(4.38)

This ca tas t rophe potent ial does not appear in Thorn's list of e lementary ca tas t ro-

phes, but he discusses it as a selfreproducing singularity or as the stopping potent ial

of the cusp ca tas t rophe (THOM, 1975), In terms of ca tas t rophe theory this potent ial

is the universal unfolding of the cusp catas t rophe, and we can embed it into a local

potent ial

V 1 = x5/5 + v x 3 / 3 + u x 2 / 2 + u2x (4.39)

by a proper choice of the parameters . This is a swallowtail with a degenera ted paramete r

space. The parameters ' c ' and 'b' from equations (4.33) are now rela ted by b = c 2. The

cr i t ical set appears in the (V,U,v)-space (Fig. 4.28), and the t rave l t ime record is re la ted

to sect ions v = constant through the cri t ical set . Thereby one has to keep in mind that 2

V means r , not r. The sect ions v = constant have locally a swallowtail-l ike appearance,

but, in addition, they have two maxima where the curves bend down again (in the repre-

sentat ion of Fig. 4.28).

i V

v

U

178

Fig. 4.28: The stopping potent ial of the cusp ca tas t rophe (a} in the pa ramete r space (V,U,v). The positive V-axis is drawn downward for the convenience in comparing it with the standard swallowtail (b) and the hyperbolic ref lec t ion of the t rave l t ime record.

The appearance of the two additional maxima above the point of se l f in tersec t ion

in Fig. 4.28 needs an explanation because we cannot expect this pa t tern from the simple

parabolic approximation of the ref lec tor . Similar pa t te rns can be found in the simulated

record of Fig. 4.21, but it will turn out that these pa t te rns are of a very d i f ferent

type because they are really re la ted to the surface s t ructure . What happens with the

stopping potential , i l lustrates Fig. 4.29. There, the parabolic re f lec tor line extends over

the track line (S). Now, one can const ruct the image of this abs t rac t surface on the

t rave l t ime record in a very simple way. One has just to project the length of the rays,

which connect the rece iver with the ref lec t ion points s t ra ight downward from the point

where they in tersec t the track line. This gives the curve (r), i.e. the image on the t ravel -

t ime record. This construct ion can also be done for those parts of the re f l ec to r line

which extend above the track line. Because t rave l t ime is measured without a directional

component , i.e. it can only assume positive values, the curve (r) bends down again, as

one moves away from the in tersect ion point of (S} and (r). Therefore, one has to choose

s s A

Fig. 4.29: The abs t rac t si tuation that the t rack line (S) in te rsec ts the re f lec tor line. In this case, the t rave l t ime record (r) reaches the track line at the in tersect ion point and bends then down again because r can assume only positive, values.

179

carefully the correc t interval if the stopping potential is used as a model for the t ravel-

t ime record. The cor rec t interval is, in any case, located between the two maxima of

the sect ions v = constant of Fig. 4.28.

If one analyzes the cr i t ical surface of Fig. 4.28 with the noted res t r ic t ions in

mind, then it turns out that the typical 'hyperbolic ref lec t ions ' with a swallowtail-like

appearance are res t r ic ted to a limited range of the parameter v. If v is positive, one

has a convex ref lector , which in a topological sense is recorded correct ly . As v assumes

suff icient ly large negative values, the 'hyperbolic ref lect ions ' turn smoothly into a more

parabolic appearance, which, like the 'hyperbolic ref lect ions ' , is an inversion of the local

re f lec tor topology -- a concave surface e lement turns into a convex image. Those 'para-

bolic ref lec t ions ' are also well known from echograms (FLOOD, 1980), but, more common-

ly, they are found within basement ref lect ions (Figs. 4.14, 4.15).

As was shown in the last section, the approach versus wave fronts provides another

f rame to summarize the images on the t ravel t ime record. The advantage of the plane

map approach is that the 'stopping potential ' represents the images in a still more con-

densed way.

4.2.7 Singularities on the Ref lector Line

So far, a very simple re f lec tor model was used. In the case of faults, folds and

flexures the situation may become more complicated although a local parabolic approxi-

mation with rotat ion of the coordinate axes may be still possible. The most simple case,

where one can find such a cri t ical situation, are flexures and folds. A first impression

ii f );

Fig. 4.30: First order approximation of ray systems and wave fronts near flexures.

180

of what may happen near a fault gives the simple linear model of sect ion 4.2.2

(Fig. 4.23). What is actually new in this linear approximation, is the appearance of a 3

shadow zone. A flexure can be simulated by a cubic equation x = y + ay which also

includes simple folds. Fig. 4.30 gives a rough approximation of rays and wave fronts

which arise from the cubic re f l ec to r line model with a > 0, a = 0 and a < 0. For a < 0,

the caust ic pa t t e rns can be approximated by a parabolic approximation at the ex t rema

of the cubic equation, but only parts of the wave fronts sca t t e r back to the t rack line,

i.e. only one branch of the caust ic in te rsec t s with the t rack line. Fig. 4.31 trys to

capture the behavior of the caust ic over a family of cubic re f lec tor lines. For compari-

Fig. 4.31: The caust ics of a family of cubic re f lec tor lines. Left: The family of caust ics of only one ex t remum {the lower one). Right: Separation of the caust ics into their relevant parts, i.e. the branches which reach the track line.

son, the family of caust ics for only one ext remum is also shown. These graphical methods

only give a very rough idea of what happens near such s t ructures , but it is not the

scope here to analyze these problems in detail . In this context it becomes at least neces-

sary to study di f f ract ion pat terns . For this approach see DANGELMAYR & GUTTtNGER

(1982).

Similar problems arise if the re f lec tor has singular points like

Fig. 4.18 in sect ion 4.2.1. The cycloid can be described by the map

x = t - sin(t)

y = I - cos(t).

By taking a Taylor expansion near the cusp point, one finds

the cycloid of

(4.40)

181

x = t 3

Y -- t 2 (4.41)

w h e r e t h e c o n s t a n t s h a v e been abso rbed in x and y for c o n v e n i e n c e . If t he p a r a m e t e r 3 2

t is e l i m i n a t e d , one f inds t h a t t h e cusp po in t e q u a l s t h e s e m i c u b i c pa r abo l a y = x .

The ma in point , h o w e v e r , is no t t h a t we h a v e a cusp , bu t t h a t t h e s i n g u l a r i t y is an

i so l a t ed po in t of t he r e f l e c t o r l ine a t wh ich dx / d t = 0 and d y / d t = 0. The c a u s t i c n e a r

t h e c r i t i c a l po in t is g iven as t he loci of t he radi i of c u r v a t u r e on t he n o r m a l s o f t he

s e m i e u b i c pa rabo la :

x -- 4t 3 + o4--t (4.42) C .3

Yc --~ ~ t4 - t 2 .

Thus, not even too close to the isolated singular point (t = 0) the caustic behaves like

the map

u = t 2

(4.43) V = t ,

i .e. i t is a fold c a t a s t r o p h e (Fig. 4.32; LU, 1976). The t e r m 'no t too c l o s e ' m e a n s t h a t

t 3 is m u c h s m a l l e r t han t and t h a t t 4 is m u c h s m a l l e r t han t 2. A t a fold c a u s t i c t he

r ays a r e only l o c a t e d on one s ide o f the c a u s t i c s and cause , t h e r e f o r e , local ly a shadow

zone. A d e t a i l e d ana lys i s o f such s ingu la r po in t s on t h e r e f l e c t o r l ine would r e q u i r e

a t opo log i ca l c l a s s i f i c a t i o n , and i t would be n e c e s s a r y to s t u d y t h e w a v e f i e l d r a t h e r

t h a n t h e r ay s y s t e m .

m m

Fig . 4.32: The fold c a t a s t r o p h e (caus t ic ) n e a r a s ingu la r poin t on t he r e f l e c t o r c a u s e s a s h a d o w zone .

182

Table 4-1: Summary of the Ray Model

The t r ave l , l ine record in i ts most c r i t i ca l case corresponds to sect ions y = cons tan t (y:depth) through a swallowtai l c a t a s t r ophe which is located on a l ine y = vt in the th ree -d imens iona l space {x,y,t). The various types of specia l ized deformat ions depend on t he locaI cu rva tu re of the r e f l ec to r line, on the d i s tance be tween the t r ack line and the c r i t i ca l point on the r e f l ec to r line, and on the sonic ve loc i ty of the medium. In the p a r a m e t e r space depth of the c r i t i ca l point (a) and local cu rva tu re (b), the c r i t i - cal boundary line for an image inversion, i.e. for the occur rence of ~hyperbolic re f lec - t ions ' , is given by the hyperbola 1 - 2ab = 0. This hyperbol ic equat ion simply compares the local cu rva tu re of the r e f l ec to r with a c i r cu la r wave f ront a t depth ' a ' . In detai l , one finds t h a t these p a r a m e t e r s a f f e c t the t r a v e l t i m e record in the following way:

I) Spatial pa t te rns , the cusp ca t a s t rophe

1) The local cu rva tu re of the r e f l ec to r line:

Only convex a reas of the r e f l ec to r line are spec tacu la r (cause t rouble within the record) because a cuspoid caus t ic evolves. Two special s i tuat ions occur:

a) The local approximat ion of the r e f l ec to r line requires a ro ta t ion of the local coordina te sys tem wi th respec t to the global one. The sect ions through the ca t a s t rophe se t becomes oblique. This p a t t e r n can be de t ec t ed on the t r a v e l t i m e record because the 'hyperbol ic r e f l ec t ions ' are asymmetr ic .

b) Di f fe ren t local cu rva tu res (b = k/2) or the r e f l ec to r cause a dis locat ion and s t r e t ch ing (compression) of the caus t i c in the spat ia l coordinates . This de format ion can only be dist inguished from (2) if the t rue dep th posit ion of the spec tacu la r point on the r e f l ec to r l ine is known.

2) The hei_~.h_t_of ,_he t r ack l ine above the r e f l ec to r line:

Because t he re f l ec t ion p a t t e r n depends on the re la t ion be tween the cu rva tu re of the inc ident wave f ront and the cu rva tu re a t the spec tacu la r point on the r e f l ec to r line, this case canno t be dis t inguished from a change of the local cu rva tu re of the r e f l ec to r wi thout addi t ional in format ion (e.g. a m eas u r emen t of t rue depth). This p a r a m e t e r chooses a special line through the ca t a s t r ophe se t of the cusp which is s tably located in the space coordinates . Because the cusp ca t a s t rophe is the b i fu rca t ion se t for the swallowtai l and the discussed s topping potent ia l , th is p a r a m e t e r also appears in the o the r ca ta s t rophes .

3) Ex t r ema of curva ture :

In case the r e f l ec to r has a local minimum of curva ture , it can be approximated by a parabola , and the discussion of sect ions 4.2.1-7 holds: Typical pa t t e rn s inside the caus t ic are 'hyperbol ic re f lec t ions ' . However~ if the r e f l ec to r has a local maximum of curva ture , the caus t ic p a t t e r n is inversed, as discussed in sec t ion 4.2.8. Anyway, the previous discussion remains valid if the propagat ion of wave f ronts is inversed. A f t e r ~ the wave f ronts have passed through the caus t ic , a parabol ic re f lec t ion pa t t e rn r e s u l t s which allows to dist inguish this case from the ' s t anda rd s i tua t ion ' .

I!) S p a c e - t i m e pa t t e rns : the swallowtai l c a t a s t rophe

The sonic ve loc i ty of the medium does only a f f e c t the t r ave l t ime . This p a r a m e t e r can, the re fo re , cause only those t r ans fo rma t ions which le t t he space p a t t e r n i n v a r i a n t - - pure shear in the ( y , t ) - - plane. The ca t a s t r ophe set , which descr ibes the evolut ion of the wave fronts , is a modif ied swallowtai l which is loca ted on a l ine y = vt . The t r ave l , l ine images a re plane sec t ions through this ca t a s t rophe set . Al te rna t ive ly , the t r a v e l t i m e image can be descr ibed by the unfolding of the cusp ca tas t rophe , i.e. by i ts s topping potent ia l . The l a t t e r approach gives a descr ipt ion in the coordina tes

(x,y,v2t2).

183

Table 4-2: Summary of s t ra teg ies in the analysis of t r ave l t ime records

"wave f ront approach" "plane mapping method"

Cons t ruc t ion of the ray syste m (normals of the local r e f l ec to r e lement )

The caus t ic or the envelope of the rays cen te r s of curvature)

Evolution of the wave f ronts along the rays, the swallowtail ca t a s t rophe

t r a v e l t i m e sec t ions through the ca t a s t rophe set of the wave f ron t s - - t he modif ied swallowtail

The ca tas t rophe map along the t rack line, the cusp ca t a s t rophe

Unfolding of the cusp ca tas t rophe , the ' s topping po ten t i a l '

i¢ The local image of the t r a v e l t i m e record

4.2.8 Genera l ized Re f l ec to r P a t t e r n s in Two and Three Dimensions

In case the r e f l ec to r can be described by an explici t funct ion y=f(x), the previous

discussion provides a f ini te c lass i f ica t ion of r e f l ec to r pa t t e rns as long as a l inear ray

modeI is suf f ic ient and ca t a s t rophe theory provides a f rame for this c lass i f icat ion, as

summar ized in tables 4-1 and 4-2. However, the appl icat ion of ca t a s t rophe theory requires

local coordinate changes, which somet imes may be assumed inadequate for the problem.

tn the previous discussion it turned out tha t the t r ave l t ime record depends on a p a r a m e t e r

s= 1-2ab which appears in alI equat ions -- for the caust ic , the wave fronts and the t r ave l -

t ime record. The p a r a m e t e r 'a ' is equivalent to the depth of the re f lec tor , and '2b=k'

is i ts local curva tu re (cf. equat ion 4.13). The p a r a m e t e r 's ' , the re fore , provides a simple

in te rp re ta t ion , i t measures the re la t ion be tween an incident wave f ront wi th radius ' a '

(depth} and the radius of cu rva tu re of the re f lec tor . Image inversion occurs for a > l/(2b),

i.e. if the radius of the incident 'wave f ront ' is larger than the radius of curvature , mul-

t iple re f lec t ions arise locally because the curva tu re of the r e f l ec to r increases, as one

depar ts from the c r i t i ca l minimum. Fig. 4.33 i I lus t ra tes this viewpoint .

A) The Deformed Circle and the Dual Cusps

A na tura l quest ion is what happens if the r e f l ec to r has a d i f fe ren t s t ruc ture , i.e.

184

~ J

t I •

~ t

Fig. 4.33: The contac t be tween the incident wave front and the circle of curvature de te rmines the possible number of received ref lect ions: In the case of a parabolic ref lec tor , multiple ref lect ions result only if the curvature of the incident wave front is larger than the local curvature of the ref lec tor , i.e. if the shotpoint is located inside the ' caus t ic ' of normals. The usual si tuation is a fold point on the caust ic (b); a cusp point appears only at a local ex t remum of curvature .

if the curvature decreases , as one depar ts from the minimum. This causes a d i f fe rent

type of con tac t be tween the circle of curvature and the ref lector : The ref lec tor is total ly

bound to the convex side of the circle of curvature, a si tuation which cannot arise in

the case of a locally 'parabolic re f lec to r ' . An appropriate way to study both si tuations

simultaneously is to consider a per fec t circular arc, and to t ransform it by a simple

aff ine t ransformat ion

{ ,)= r[10 co ,1

which takes the circle into an ellipse. Fig. 4.34 i l lustrates how the re f lec tor e lement ,

its contac t with the circle of curvature and the caust ic are a l tered by a smooth change

of the pa ramete r 'e ' :

In the case 0 < e < 1 the ellipse has a local minimum of curvature. The circle of

curvature is bounded to the concave side of the ref lec tor , which, therefore , can

be approximated by a parabola, and the previous discussion can be applied.

For e=0, the r e f l ec to r is a pe r fec t circular arc. All rays pass through a single

p o i n t - - a singularity with indefinite codimensions. This si tuation is s tructural ly

unstable, as any small disturbance t ransforms the singular point into a caustic.

If e >1, the circle of curvature is located on the convex side of the ref lector ,

and a new pa t te rn arises. However, the caust ic is again cuspoid, similar to the

caust ic of a cycloid (Fig. 4.18); but the cusp points into the opposite direct ion

than in the case of a parabolic ref lec tor .

I85

Fig. 4.34: R a y s and w a v e f r o n t s f r om an e l l i p t i c r e f l e c t o r , a: 0 < e <t , b: e=0, c: e > 1. See t e x t for d i scuss ion .

The t ype o f c a u s t i c t hus d e p e n d s on t he t ype o f c o n t a c t b e t w e e n t he c i r c l e of c u r v a t u r e

and t h e r e f l e c t o r . T he e l l ipse s t i l l p rov ides a r a t h e r spec ia l e x a m p l e . A m o r e g e n e r a l

v i e w p o i n t and c l a s s i f i c a t i o n can be de r i ved if t he a r g u m e n t s o f s e c t i o n 4.2.2 a re appl ied

to m o r e g e n e r a l c u r v e s .

Loca l ly , t he c i r c l e o f c u r v a t u r e p rov ides a r a t h e r good a p p r o x i m a t i o n o f a t w o - -

d i m e n s i o n a l c u r v e . C h o o s i n g i ts c e n t e r as t he or ig in o f a po la r c o o r d i n a t e s y s t e m we

c a n d e s c r i b e t h e r e f l e c t o r by an e q u a t i o n

r = R + f ( e ) , (4.45}

w h e r e R is t h e local r ad ius of c u r v a t u r e and f(0} d e s c r i b e s t he d e v i a t i o n o f t he c u r v e

f rom the p e r f e c t c i r c u l a r a rc (of. Fig. 4.35). The q u e s t i o n is w h a t we can in fe r abou t

t h e f u n c t i o n f(0). The r ad i u s o f c u r v a t u r e in polar c o o r d i n a t e s is g iven by

186

d Fig. 4.35: The three possible con tac t s be tween a two-dimensional r e f l ec to r and its c i rc le of curvature.

R = ( r 2 + r ' 2 ) 3 / 2 2 r + 2 r ' 2 - r r " (4.46)

At O =0 the re f l ec to r has curvature R, and this is the case if f{0) sat isf ies the three

conditions f(0)=0, f'(0)=0, and f"(0)=0 as can easily be verif ied from the standard equation

(4.46). If we use a power series to approximate fie), then this series cannot involve

powers less than three, i.e. we need at least a function frO)= fla+...+higher terms. Such

functions, of course, are really flat at the origin, their curvature vanishes at 8 =0.

However, f(fl)= ( 3 is an odd function, and if we insert it into equation (4.45), it

becomes clear that the circle of curvature in te rsec ts the re f lec tor in some neighborhood

of e =0; the local re f lec tor model is a 'spiral arc ' with monotonously increasing (decrea-

sing) curvature in a suff ic ient ly small neighborhood of e =0 (Fig. 4.35a). A sign change

of the leading term (f(e)=-+e a) simply re f l ec t s the intersect ion pa t te rn at the ray fl =0;

the pat tern , however, does not change.

The situation becomes d i f fe ren t if the power series s ta r t s with a fourth order

term. Then the re f lec tor deviates symmetr ical ly from the circle of curvature , and a

sign change of the leading fourth order term changes the type of contact : For +8 4 the

c i rc le of curvature is ent i re ly on the concave side of the re f l ec to r while for -0 4 it

is on the convex side (Fig. 4.35).

The two a l te rna t ive power series with leading te rms of order three or four are

really dist inct and exclude one another, as now will be shown. A local re f lec tor approxi-

mation involving both te rms could always be brought to the form

f(8) = O 3 + ae 4 + ... + higher terms. (4.47)

187

[ However, by a redefinit ion of the zero angle ( 8 - 8 - ~-a) ,equat ion (4.47) can be t rans-

formed into

3 2 04/(4a) - ~0 + 2a20 + (a4-a3). (4.48)

tn equation (4.48) the radius of curvature is given by (R+c), and f~) has again to sat isfy

f(0)=f'(0)=f"(0)=0, i.e.

1 3 a 0 - 302 + 2a 2 = 0

3 e 2 - 6(? -. 0 . (4.49) a

These two equations, however, are usually not zero, and the function f(O) is dominated

by the first and second order te rms with non-vanishing first and second order derivat ives

and, therefore , does not sat isfy the requested approximation.

Therefore, our problem is, locally, strongly equivalent to a power series which

s ta r t s e i ther with a third or a fourth order term, and ca tas t rophe theory implies a fold

or cusp catas t rophe. The cr i t ical point in our problem is the point r=R, the cen te r of

the circle of curvature which, of course, is a point on the evolute of the rays, i.e.

a point on the caustic. Sufficiently close to 0 =0, the radii of our polar coordinate system

coincide with the rays. The t ransformat ion p =r-R maps the re f lec tor {the wave front}

to the cr i t ical point. Near this point, we take the re f lec tor as f{0)=04 or more conven-

iently, we use the unfolding

0 = -+04/4 + u02/2 + v@. (4.50)

We cannot choose u and v

to the set of equations

freely because f(O) has to satisfy f'(O)=f"(O)=O. This leads

v = ¥0 3 - uO

and (4.51)

u = $3@ 2

If we solve for u and v in terms of fl and insert this in equation (4.50), this equation

simplifies to a fourth order term as required. However, if we use u and v as local or tho-

gonal coordinates, then we can el iminate fl and find one of the dual cusps

u 3 v 2 (g) = ~(~) (4.52)

188

i.e. the caus t i c we expect . In a spatial in te rpre ta t ion , u is the (negative) first, v the

second der iva t ive of the funct ion p . In te rpre ted as vec tors , they provide a local o r tho-

gonal f r ame and cap tu re qua l i ta t ive ly the dislocat ion of rays close to the cr i t ica l point

r=R. Similar a r g u m e n t s can be applied to the case f ( 9 ) = 0 3 , the cr i t ica l points are

fold points.

If we re s t r i c t our a t t en t ion to local s t ruc tu res , there is not more than fold and

cusp points on a caus t ic . Their occur rence is a funct ion of the c on t a c t be tween the

re f l ec to r and its local c i rc le of cu rva tu re , as i l lus t ra ted in Fig. 4.35. In t e rms of r e f l ec -

t ion pa t t e rns , however, ' local t is r a the r re la t ive . In this con tex t , a fold point is a point

where two rays in te rsec t ; however, this is only the case on the caus t i c i tself . In the

inter ior of a caus t i c (cf. Fig. 4.34), which is not re la ted to a s ingular point on the re-

f lec tor {e.g. Fig. 4.32), we find tha t th ree rays i n t e r sec t at every point. Thus, fold points

are not such impor tan t f rom a less tocal viewpoint . Impor tant , however , is the d i f f e rence

be tween the dual cusps because they provide an essent ia l source for the se i smic in ter -

pre ta t ion .

In one sense the dual cusps are not d i f fe ren t , they are s imply dual re f l ec t ions

at the x-axis which resul t from a sign change of the leading power t e rms , and thus

are all r e la ted pa t t e rns . This is obvious because any wave front can be considered as

a r e f l ec to r and vice v e r s a - - here a wave front is a map of the r e f l ec to r along the

rays preserv ing angles. The wave front pa t t e rn s of the two dual cusps, therefore , are

ident ical , only the di rect ion of propagat ion is inversed. This is a nice resul t because

it shows tha t the previous discussion holds also for the dual cusp if the di rect ion of

wave propagat ion is inversed; and the ear l ier discussion provides really a ca t a logue of

the essen t ia l re f lec t ion pa t t e rns as far as a l inear approach is suf f ic ien t .

On the o ther band, the re r emains a d i f f e rence be tween the dual cusps. In the case

of a locally parabolic ref lector , the wave f ronts are sec t ions through the swallowtail

wi th its cusps and se l f in t e r sec t ions , and the t r ave l t ime records in the cr i t ica l case are

F i ~ 4.36: Wave f ronts of the dual cusps.

189

'hyperbol ic re f lec t ions ' , again with cusps and se l f in tersec t ions . In the case of the dual

cusp, the re f l ec to r is an e l l ip t ic arc which is bounded to the in ter ior of the cuspoid

caust ic; as soon as the image passes through the cusp point, the wave fronts have a

'parabol ic ' appearance , and thus has the t r ave l t ime record; cusp points and se l f in te r sec -

t ions then are missing. A typical pa t t e rn , which commonly arises, is a ser ies of parabolae

which a l t e rna t ive ly correspond to synclines and ant icl ines, and which in te r sec t on the

t r a v e l t i m e record, but wi thout cusp points. In the case the t rack line sect ions the

caust ic , swallowtail pa t t e rns may arise, but they are inver ted with respect to the pa t t e rns

arising from a 'parabol ic r e f l ec to r ' (Fig. 4.36).

In summary, the various re f lec t ion pa t te rns , which may arise, are well c lass i f iable

in t e rms of the con tac t be tween the re f lec tor , i ts c i rc le of cu rva tu re and the incident

wave front (dis tance from the source). Usually there should be enough informat ion

avai lable for a qual i ta t ive ly cor rec t in terpre ta ion . The l inear ray model, of course, is

only a f irst approximation, but the principal re lat ionships remain s table even if the sonic

veloci ty of the medium is not a cons tan t .

B) Three-Dimensional Pa t t e rn s -- The Double Cusp

At least, a few remarks shall be made in what respec t the simplif ied model of

l inear rays and especial ly of a two-dimensional r e f l ec to r line gives insight into a larger

class of images which may resul t from compl ica ted r e f l ec to r topologies. The two-dimen-

sional approach extends wi thout di f f icul t ies to cylindrical surface e l ement s or, more

generally, to parabolic surface points. Fig. 4.37 gives two e x a m p l e s - - a cyl indrical

and a conical surface -- tha t show how the caus t ic and a single wave f ront are located

over the surface . In such cases, the t r a v e l t i m e record will depend on the re la t ion be-

tween the axis of the syncline and the t rack line -- one may find 'hyperbol ic re f lec t ions ' ,

onlapping pa t te rns , doubted or t r ipled re f lec t ions (Fig. 4.38). Thus, an i r regular topo-

graphy, which impresses i ts s t ruc tu re onto the wavefield, can cause nice mult iple re f lec-

t ion pa t t e rns which look like pe r fec t ly s t ra t i f i ed sediments; and, therefore , one may

ask how much onlapping fea tures in Fig. 4.15 are real, and which ones are due to the

rough topography of the basement . The complexi ty of these e f f ec t s increases if one

considers far f ie ld e f f ec t s or more compl ica ted sur face e l ement s like hyperbolic and

el l ip t ic surface points. Represent ing the surface near (Xo,Yo,Zo) by z=f(x,y) the evolut ion

equat ion for the wave fronts becomes

{(U,V,w) E (x,y,z) I ((x-u)2 + (Y-V)2 + (w-f(x'y))2) I/2 = r = const. }. (4.53)

In the case of a parabol ic or hyperbolic surface point, the family of rays is given by

the (vector) equat ion

190

Fig. 4.37: The caust ic and a single wave front over a cylindrical (above) and conical (below) surface.

r = (x, y, x 2 ± ay 2) + k(2x, +2ay, -i), (4.54)

and a point on the track line may be given as (Xo,Yo,Zo). To see, which surface points

map onto the track line, one has to solve the equation

(Xo' YO' Zo) = (x, y, x 2 + ay 2) + ),(2x, +2ay, -i). (4.55)

Let the track line be located at Zo, then by elimination of the parameter ), , one finds

the relationship 2 = x 2 -+ ay - z 0

x 0 = (1-2Zo)X + 2x 3 + 2ay2x

YO = (1 -~- 2z 0 + 2ay 3 +- 2x2y, (4.56)

191

Fig. 4.38: Sketch of the t ravel t ime record of a cylindric syncline with track line sect ions parallel and oblique to the syncline axis.

a map which is a special degenera ted case of the double cusp ca tas t rophe including

the standard umbilic catas t rophes . The caust ic pat terns , which result from the double

cusp, are ra ther complicated. A full discussion of three-dimensional phenomena is above

the scope of this discussion; however, a detailed study in terms of standard ca tas t rophes

was given by DANGELMAYER & GUTTINGER (1983).

4.2.9 Distr ibuted Receivers

Seismic shooting rarely resembles the idealized situation that source and receiver

are at the same place. However, as turned out during the previous discussion, the results

found from ra ther idealized assumptions hold for a much wider class of 'dis turbed'

problems. It will be shown here that the principal results still hold if source and re-

ceiver are at d i f ferent places, or if a chain of receivers is used. In the la t ter case,

not a single ref lect ion but the re f lec ted and deformed wavelet is recorded. What we

shall do here is, therefore , to study how the re f lec ted wavelet deforms.

The ref lect ion of a wavelet is governed by Snell 's law of equal angles, i.e. the

angle an incident ray forms with the normal of the ref lect ing surface is the same as

192

the angle the re f l ec ted ray forms with the same normal. A convenient way, therefore ,

is to view the incident and the r e f l ec t ed rays in te rms of the ref lec tor . Let the ref lec-

tot be given in te rms of its local curvature, i.e. with the cen te r of the global coordi-

nate system at the cen te r of its local circle of curvature (cf. equ. 4.45): Locally the

re f l ec to r can be wr i t ten

and the normal rays are

Yn = r (sin 0 + X -r Lsin + r ( cos

(4.57)

(4.58)

Now, in a plane problem we can express the incident rays in local coordinates by means

of the tangent (t) and normal (n) vectors at the surface:

r . = r + X ( - a n + g t ) , (4.59) 1

and the re f l ec ted rays are simply the ref lec t ions of incident rays at the normals

r = r + X(-ccn - B t ) . (4.60) r

The coef f ic ien t s ' a ' and 'b ' can be de termined to sat isfy special conditions of the

source, e.g. in the case of a point source, equation (4.59) leads to a pair of linear

equations from which the coef f ic ien t s can be determined. A very simple system arises

if the re f l ec to r is locally a pe r fec t circular arc. The equations for the incident and

re f l ec ted rays then simplify to the pair of equations

cos [-sin

[~] -- [ c°s Ab [ -sin o001. (4.61)

The condition that the incident rays originate from a point source requires that these

rays pass through the source point for some value of I . Without loss of generali ty,

we can choose the value k=l , and the values for the parameters 'a ' and 'b ' can be

de termined from the linear equations

(r-a)cose - bsinO = x 0

(r-a)sin@ + bcose = YO

to be

193

a = r - (YosinO + xncosO) b YoCOSO - x~sinO (4.62)

,xi] and the r e f l ec t ed rays are

Because of the symmet ry of the c i rcular arc a s imple ro ta t ion allows to locate the

source formal ly at (Xo,0) so t ha t the previous equat ions simplify fur ther . If one inser ts

' a ' and 'b ' from equat ion {4.62) into equat ions {4.61), one finds a s implif ied equat ion

for the incident rays

= r ( 1 - X + )` (4.63} sin

(Xrl [ fc°81 II )`x r cos2e + (4.64)

= r ( 1 - ) , ) 0 s i n 2 0 Yr [sin @

As previously, the caus t ic of the r e f l ec ted ray system is of special in teres t . If we

consider equat ion (4.61) as a map, the caus t ic is equivalent to i ts singular set , which

can be de t e rmined from the condit ion tha t the Jacobian of the map vanishes, i.e. tha t

I xe xk I J = = xey ~ - xky ~ = O.

YO YX

From this condit ion and equat ions (4.61) and (4.64) we de te rmine the c r i t i ca l se t in

t e rms of )`:

= a I - xocose =

2(a2+b2)_a l+2x2_3xocos 8 if yo=O. (4.65)

If we inser t these values for X into equat ion (4.64), we find an equat ion for the caus t ic

= r 2 x° ....... - x o c o s e xo - XoCOSe2 (4.66)

Yr l + 2 x 2 - 3 x c o s e { s i n e J l + 2 x 2 - a x o c o s O ~ . s i n 2 e j J

which looks r a the r compl ica ted . However, a simple observat ion is impor tant . Let us

compute the values ( l - X ) and )`Xoat e=0:

2 2 (I-~) -- 2 Xo - xo Xo - x~

i+2xo2 ; kxo = - I +2x 2

194

F_ig. 4.39: The cardioid caust ics of a circular r e f l ec to r and their relat ion to point sources.

Locally, near 0=0, we have the simple relationship 2(1-%)= kxo, and this means that

near this special point the caust ic behaves like a cardioid independent of the complexi-

ty of our original equation. The cardioid, however, has a cusp point at O =0, and this

is a s tandard cusp point, as can easily be shown by developing the equations

x = r(2cos8 - cos2e);

in Taylor series near the critical point e =0:

x n, 1 + 82; y ~ nl----~0e3

y = r(2sin9 - sin2e)

- - - > ( x _ l ) 3 = ( . ~ . y ) 2

What we now can do with the source point, is to dis locate it along the x-coordinate

{Fig. 4.39). Clearly, a cr i t ical si tuation arises if the source is located at {0,0), the

cen te r of the c i rc le of curvature. In this case, all rays pass through the origin, the

caust ic degenera tes to a singular point, and we would not rece ive any ref lec t ions at

points besides this degenera ted singularity.

If 0 <lXoi> R, we find that the caust ic has formally two cusp points if we consid-

er not simply a circular arc but a full circle. These cusp points are given by 0=0

and 8 =~r, tn addition, we observe that these cusp points are simple inversions of the

corresponding source locations x o ~ - x o . A somewhat striking point is that we always

have the same type of a cusp (what we called the dual cusp of the ref lec t ion prob-

lem) independent of the radius of the incident wavelet . The caust ic pat tern , therefore ,

does not depend on the con tac t between the re f l ec to r and the (circular) incident wave-

let.

Another special si tuation occurs if tXoi =R. In this case, we have only one cusp

point, the other one degenera tes into a fold point with its tangent coincident with

195

the t angent of the c i rcular r e f l e c t o r - - the caus t ic becomes a pe r fec t cardioid, in the

case the source point is located outside the circle, there remains only one cusp point,

but in addition we find two cr i t ical fold points where the deformed cardioid has tangen-

tial co n t ac t with the c i rcular re f lec tor . In some sense, the s i tua t ion Xo=l def ines a

b i furca t ion point. However, if Xo~m, we find again a s y m m e t r i c solution, the caus t i c

is now a nephroid (cf. POSTON & STEWART, 1978} whereby the s y m m e t r y re fe r s to

the two sources Xo=+~.

The caus t ic phenomena associa ted with a point source and a per fec t c i rcular ref lec-

tor, therefore , can be summar i zed as cont inuous de format ions of a cardioid. The s table

pa t t e rn is the cusp point of the cardioid, which locally remains the identical cusp caus-

t ic independent of the location of the source. Now, the ci rcular r e f l ec to r is unstable ,

and the quest ion ar ises what happens if it is deformed. Before going in detai ls , we

Fig. 4.40: The vir tual sources of a planar and circular ref lec tor . The wave f ronts provide vir tual re f lec tors .

f irst observe tha t there exis ts a vir tual surface , for which the re f l ec ted rays of the

wavele t are normals . In the case of a plane ref lec tor , this vir tual sur face is again

a point source, a s tandard example in se ismology (Fig. 4.40). tn a more genera l sense ,

every wave front is a potent ia l ly virtuaI r e f l ec to r su r f ace because the wave f ronts in ter-

sec t the rays orthogonatty. In the case of linear rays, the wave fronts are found from

the normal ized equat ion (4.61), i.e. from

196

[Xr} IcosO [afc°sO r s,n011 Yr = r [sinej (a2+b2) I/2 [sinOj -b [ cosOJ. (4.67)

The t r av e l t im e is equiva lent to the sum of the length of the incident and re f l ec t ed

rays. If the rece ive rs are on the s ame x-level as the source , , t he t r a ve l t ime is given

by

2t = (a2+b2)I/2(l x__~o _- r cos 8 - acos ~ bsinO )' (4.68)

and the identical t r ave l t ime record would be rece ived in a sys t em where source and

rece iver coincide, e i ther from a vir tual source or a vir tual r e f l ec to r which, of course,

is s imply a wave front (Fig. 4.40).Now, we can use the discussion of the last sect ion.

A cr i t ica l s i tua t ion ar ises if the vir tual re f lec t ing su r face becomes a circle. Any small

de fo rmat ion then de fo rms i t , and the s ingular caus t i c point evolves in e i ther one of

the dual cusps. We consider this degene ra t ed s i tua t ion and dis turb the re f l ec ted rays

(the normals) by a not necessar i ly cons t an t ro ta t ion

[XrJ = riO@sO} - It°s01 + Yr [ainOJ [sinSJ [ coseJ

(4.69)

where

f COSe (0) -sins (8)] A = [sins(e) cose(O)J. (4.70)

The cr i t ica l se t can again be found from the Jacobian to be

-X = (p2+p'2)e°se

{l+c~, } p2 {2+e,) p, 2_pp,, (4.70)

The deformat ion of the original ray sys tem, therefore , cons is t s of a ro ta t ion as defined

by the m a t r i x 'A ' and a dislocat ion along the rays which is proportional to cos ~ . In

the case cos e vanishes , the re f lec tor becomes identical with its caust ic , and the image

is inversed, as cos e a s sumes nega t ive values. However, this would require ra the r s t rong

de format ions . We conclude, the re fore , t ha t the c a us t i c pa t t e rn formed by the normals

r emains s t ab le as long as the de fo rma t ions are of reasonable order.

We note finally tha t we can r e fo rmu la t e cos c~ A as

1 [l+cos2e -sin2e 1

(cose)A = ~_ [sin2e l+cos2~J .

The caustic formed by the rotated norrnals can now be written

(4.71)

197

Fig. 4.41: Normal and re f l ec ted ray sys tem and caus t i cs at a parabolic and hyperbolic re f lec tor . A point source does not change the caus t ic pa t te rn .

Fi G . 4.42: Ro ta t ed normals of a circular ref lec tor . The s ingular point is t r ans - formed into a caus t ic . The ro ta ted rays can be cons t ruc ted as average of the normals and rays ro ta ted twice the original angle (but which still have the length of the normals).

198

r r = r - f ( r , r ' ) ~- [ s i n 2 u c o s 2 ~ j n (4.72)

The resul t ing p a t t e r n is the average of two vec tor fields which di f fer by a ro ta t ion .

Fig. 4.42 e luc idates this point and i l lus t ra tes once more the ins tabi l i ty of a c i rcular

re f lec tor . Even a cons tan t ro ta t ion of the normals deforms the singular point of the

c i rc le into a c i rcular fold line. Fig. 4.42 e luc idates in addit ion tha t only a ro ta t ion

with angles larger than ~r/2 can real ly change the caus t ic pa t t e rn , as can also be infer red

from equat ion {4.73). Thus, we can finally conclude tha t the caus t ic pa t t e rn of the

normals of a sur face remains s tab le even if r e f l ec t ed wavele t s are rece ived because

the ro ta t ion of the normal at a c i rcu la r r e f l ec to r is equivalent to a deformat ion of

this re f lec tor . This point is especial ly e lucidated by equat ion (4.73), which s t a t e s tha t

a monotonous de format ion of t h e re f lec t ion angle within reasonable bounds cannot real ly

change the original caus t ic and the re la ted pa t te rns . Therefore , the caus t i c formed

by the normal rays must be accep t ed as a s t ruc tura l ly s tab le pa t t e rn , even under reason-

able d is turbances .

4.3 "PARALLEL SYSTEMS" IN GEOLOGY

Structural geology describes and analyzes the "geometry" of deformed rocks. The

procedure is mainly geometrical, and the relations to the physical processes are estab-

lished by "classification procedures" (GZOVSKY et al., 1973). The base for these relation-

ships is developed from various physical, exper imen ta l and numer ica l methods for which

a wide var ie ty of m a t h e m a t i c a l methods has been used (BAYLY, 1974; MATTHEWS et

al., 1971; JOHNSON & POLLARD, 1973; BEHZADI & DUBEY, 1980; COBBOLD et al.,

1971; DIETRICH, 1970; FLETCHER, 1979; SMITH, 1975 to give a few examples). Most

of the geological s t ruc tu res are the resul t of complex s t ra in fields. These s t ra in fields,

in general , are not the resul t of s imilar complex global fields of forces, but the complex

and inhomogeneous s t ra in field resul ts from the var iab le e las t ic , plast ic , and viscous

behavior of rocks, i.e. from the i r pr imary inhomogenei t ies . The deformat ions of rocks

can be very large, and then they are outs ide of the scope of classical d i f fe ren t ia l calcu-

lus. This is especial ly t rue if t rans i t ions from e las t ic to plast ic and viscous behavior

occur, if the boundary condi t ions are not k n o w n - - in general they are n o t - - and if

dis locat ions of mate r ia l by solution and recrys ta l l i za t ion play an impor tan t role during

the de format ion process (STEPHANSON, 1974; TRURNIT & AMSTUTZ, 1979). A classical

approach to s tudy deformat ions of rocks, the re fore , is the geomet r i ca l analysis. A common

way is to apply the methods of f ini te s t ra in analysis {RAMSAY, 1967; JAEGER, 1969;

HOBBS, 1971) to regions for which a near ly homogeneous s t ra in can be assumed. Basically,

this type of analysis is the study of some special mappings, and some of them will be

br ief ly discussed here.

199

4.3.1 Some Examples of Para l le l Systems

Much of the previous discussion focussed on systems of quasi-paral lel layers, which

posses a formal geomet r ica l s imilar i ty with deformed s t ruc tu res in geology. Considering

a three-d imensional space such a paral lel system can be wr i t t en

F(u,v;t) = x(u,v) + tN(u,v)

where N(u,v) = (Nx,Ny,Nz) , the unit normal vec tor at x(u,v).

(4.73)

If F(u,v;0) is everywhere d i f ferent iable , then the Jakobian de t e rminan t of such a system

is nowhere zero (DoCARMO, 1976):

det J(F)= = I ( F u) (F v) (F t) l = ]XuA x v ] ~ 0 (4.74)

where F u e tc . are column vectors of the Jacobian mat r ix (see sect ion 4.3.4 for details).

Equation (4.74) shows tha t there exists a tubular neighborhood to the surface x(u,v)

which is uniquely defined. Given a solution for a surface x(u,v) under ce r t a in conditions,

we can extend this solution into a small but f inite neighborhood of x(u,v). In a conceptual

sense this secures tha t the solutions can be applied to a real physical system where

a surface is always of f ini te thickness. Assume equat ion (4.73) is applicable as a l inear

first order approximation, then we immedia te ly get an e s t ima te of the maximal local

ex ten t of the tubular neighborhood, i.e. the area into which we may extend the solution.

This area is hounded by the ' focal sur faces '

Xl(U,V) = x(u,v) + p "IN(U,V)

x2(u,v) = x(u,v) + 92N(u,v)

where 0 1' P2 are the principal

1976; GUGGENHEIMER, 1977).

(4.75)

curva tu res of the sur face (cf. DoCARMO,

The assumption of parallel layers has a long t radi t ion in the recons t ruc t ion of

folds in t ec ton ics (e.g. HILLS, 1963 for an overview). The recons t ruc t ion of folds from

surface measurement s is i l lus t ra ted in Fig. 4.43 a f t e r an example of GILL (1953). A

point, obvious from Fig. 4.43, is tha t the fold cannot be extended continuously into

depth~ as several segments vanish along the ~caustic of the normal rays ' as discussed

in the previous sect ion; and it has been assumed (e.g. BUSK, 1956) tha t t h e s e ' l i n e s (or

sufaces) evolve into faults. Of course, equat ion (4.74) is only a f irst order approximat ionl

however, r a t h e r s imilar a rguments hold for the 'normal var ia t ion ' of a sur face x{u,v)

which can be wr i t t en (DoCARMO, 1976)

200

~ Recons t ruc t ion of paral lel folds from sur face data. Modified a f t e r GILL

F(u,v;t) = x(u,v) + th(u,v)N(u,v)

where h(u,v) is some sca lar var iable .

(4.76)

Such a formula t ion provides us with the possibil i ty to adapt some condi t ions which have

to be sa t i s f ied by h(u,v), and equat ion (4.76) can be considered as a var ia t ional problem,

or we may consider equat ion (4.76) as the dis turbed l inear problem descr ibed by equat ion

(4.73)•

Para l le l sys tems are encoun te red in various areas. With respec t to geology~ an impor-

t an t one is the concep t of sl ip-l ines in the theory of pe r fec t plas t ic i ty , which is closely

r e l a t ed to evolu tes and involutes as s t a t ed by Hencky~s and Prandt l t s theorems (LING,

1973):

HENCKY's theorem: The angle formed by the t angen ts of two fixed shear l ines

of one family at the i r points of in te r sec t ion wi th a shear line of the second family

does not depend on the choice of the in te r sec t ing shear line of the second family.

PRANDTL~s theorem: Along a fixed shear line of one family, the cen te r s of curva-

ture of the shear l ines of the o the r family form an involufe of the fixed shear

line.

Given one non- l inear shear line, the t l inear system t of normals and involutes provides

a f i rs t approximat ion for the sl ip-l ine field. The most s imple cases are or thogonal ly

in t e r sec t ing s t r a igh t lines and t cen te red fans ' of c i rcu la r arcs~ which provide reasonable

f irst order approximat ions of p las t ic de fo rmat ion (e.g. LING, 1973). Fig• 4.44 i l lus t ra tes

P rand tPs solut ion for sl ip-l ines below a s t r ip load. A more general soIution consis ts of

20I

Fig. 4.44: Prandt l ' s solution for slip-lines below a s tr ip load above a homogeneous hal fspace.

cen te red arcs of logar i thmic spirals: Consider x(u,v) a genera l ized logar i thmic spiral,

as discussed in sect ion 3.5.3, and h(u,v) to be proport ional to the curva tu re of the leading

spiral x(u,v), then equat ion (4.76} describes a family of possible solutions, from which

we have to choose the locally valid one which then can be ex tended to neighboring areas

by connect ing local solutions along the s t ra ight charac te r i s t i c s .

ODE (1960) applied the slip-line theory to the format ion of faul ts in sand and

clay under the condi t ions of plane s t rain. By a s imilar a t t emp t , also more geometr ica l ly ,

FREUND (1974) s tudied the t e rmina t ion of t r anscu r ren t faul ts by splaying; from his

analysis the curva' ture of t r anscu r ren t faul ts can be re la ted to the format ion of an

evolute of a fan of faults. Evolutes, as lines {or surfaces) of discontinuity, occur fu r ther

under unidirect ional glide in solid crys ta ls (e.g. KLEMANN, 1983).

4.3.2 Similar and Paral le l Folds

Concerning geologically 'shallow' deformat ions (without phase transitions} HOEPPE-

NER (1978) found from exper iments tha t most folds can be t raced back to the following

types:

1) similar folds

2) paral lel folds

a) concen t r i c folds

b) box folds.

Para l le l folds occur usually near the f ree surface or near shear planes while else-

where the more energy consuming s imilar folds develop. The d i f fe rences be tween the

202

=

A Fig. 4.45: A) Ideal parallel folds (kinks or box folds) and B) ideal similar folds (chevron folds},

two types of folds are schemat ical ly i l lustrated in Fig. 4.45. Parallel folds are of finite

depth range, i.e. they resemble the parallel wave fields discussed in the last section.

Similar folds in contrary continue (ideally) infinitely. The strain in folded layered systems

has extensively be studied by HOBBS (1971), here we consider only volume preserving

systems. Similar folds with constant divergence are described by maps of the form

X = ax + f(y,z) {4.77)

Y = by + h(z)

Z = cz

where a,b,c: constants; f,g,h: arbi t rary functions.

The Jacobian de te rminant

3 fi Net J= axTI =abc

is constant and by choosing a,b,c in ratios such that abc=l, the deformation described

by equation (4.77) is volume conserving, locally and globally. If we consider cylindrical

folds, equation (4.77) reduces to a two-dimensional system and describes a two-dimension-

al dislocation field as i l lustrated in Fig. 4.46. A special proper ty of this case is that

the principal strains are identical along every s t ra ight ' shear line' of the dislocation

field. As these parallel dislocation lines never in tersect , the fold extends ideatly into

infinite depth, and laterally the local fold pa t te rn can easily be continued if we connect

local solutions along a straight dislocation line (cf. Fig. 4.46c). Of course, along such

lines the solution is discontinuous with respect to the curvature, a discontinuity which

occurs in sinusoidal systems at the inflection points. JOHNSON & ELLEN (1973) pointed

out that such lines of discontinuit ies may be of some value in the analysis of folds

203

a b

I

1

Fig. 4.46: Deformat ion of a homogeneous hal f -space (a) into s imilar folds (b,c). A local solution (c) can be extended la teral ly by cont inuat ion along a slip-line.

and compared them with ' cha rac t e r i s t i c s ' as they occur in the slip-line theory of plast i -

city.

The possibili ty to cont inue local solutions la tera l ly is common for both fold types.

In the case of paral lei cylindrical folds, a local solution can be cont inued along any

*normal ray v as i l lus t ra ted by Fig. '4.47. However, for a paral lel system the re exists

no solution with cons tan t Jacobian, and thus i t cannot descr ibe a deformat ion which

preserves volume locally. On the o ther hand, we have already seen in sect ion 4.1.3 tha t

it is possible to connect deformed pieces in such a way tha t the en t i r e volume of the

systems is not a l te red (Fig. 4.47}. Concerning global volume changes, s imilar and parallel

folds provide comparable solutions. Paral le l folds are best considered as lamina ted systems

which allow the laminae to glide one above the o ther as i l lus t ra ted in Fig. 4.47. Within

Fig. 4.47: Ideal (concentr ic) folds lateralIy cont inued along ' rays ' or ' l ines of discontinuity ' . Heavy lines indica te intervals of equal length for the layers, Black grid e lements : deformed 'volume e lements ~ of originally rec tangula r grid e lements .

204

Fig, 4.48: Buckling of a card deck under la tera l s t ress conf i rmed by ver t ica l plates. Right: deta i ls of kink format ion.

205

laminae the ideal model allows only for membrane stresses, a s i tuat ion somet imes appli-

cable to deformat ions in liquid crys ta ls {KLEMAN, 1983). The deformat ions be tween

subsequent layers then are proport ional to the change of surface e l ement s {rather than

volume elements) :

let F(u;t) = x(u) + zN(u)~ (4,78)

then ds/dt = (1-zk)

where k: the local curva ture of the, leading curve,

and the deformat ion is simply proport ional to the curva tu re of the surface e lement ,

We find tha t the sur face e l ement s vanish along the evolute of the normals or a t

the focal surface, which w% therefore , can expect to be par t of the shear surface sepa ra t -

ing successive paral lel folds.

The two fold types are both idealized systems, and exper imenta l ly t ransi t ions occur

be tween the two types. JOHNSON & HONEA {1975) concluded from the i r mul t i layer

exper iments tha t the common assumption tha t one can e s t ima te depth of folding by

the ' ray method ' is of l imited value. Fig. 4.48 i l lus t ra tes some phases of mul t i layer

folding under two-axial stress. The t rans i t ion from paral lel to s imilar folds is again a

cont inua t ion problem. Assume tha t the solution is known a t the f ree surface of a half

space and tha t this solution is give n by a box fold: The range of the parallel fold solution

is of l imited depth, and it is bounded by a cuspoid focal line (Fig. 4.49). We are in t e re s t -

ed to ex tend the d is turbance into depth and require tha t the fold lines are cont inuous

along the focal line. To cont inue the d is turbance we project the focal line into depth,

i.e. assume the focal line is given by a function g(x,y) = 0, then we consider the family

Fig . 4.49: Cont inuat ion of paral lel folds into s imilar folds by propaga t - ing the cusp discont inui ty into depth.

206

of functions

g(x,y) = c. (4.79)

The fold lines of the parallel system in tersec t the focal line perpendicular as was dis-

cussed in te rms of wave fronts. The extended fold lines, therefore , have also to in tersec t

the original focal line by right angles. A possible continuation, therefore , are the or tho-

gonal t ra jec tor ies of the family g(x,y) = c which are found by solving the di f ferent ia l

equation

+ gxy' = O. --gy

2 In the case of a cuspoid focal line, x

t ra jec tor ies are the family of functions

y = (8/9)x 4/3 +c

(4,80)

(y-c) 3 = 0 or y - x 2/3 = c, the orthogonal

(4.8t)

which clearly provide a set of similar folds (Fig. 4.49). Usually the solution will be bound-

ed to a str ip of f inite length, however, the str ip can be continued laterally as discussed

previously, and if we consider a layer of f inite thickness, this continuation can be adjust-

ed to preserve volume globally. Clearly, the discussed models are only first order approxi-

mations which, however, allow graphical analysis of even complicated large scale deforma-

tions and which capture some essential quali tat ive proper t ies of exper iments .

4.3.3 Bending at Fold Hinges

The previous discussion focussed on systems composed of layers of vanishing thick-

ness, or of negligible thickness with respect to the ent i re system. Concerning a compact

layer of finite thickness one has to consider the deformations near the fold hinge, as

schematicaIly i l lustrated in Fig. 4.50. The previously discussed linear approach of paraI-

Fig. 4.50: Idealized parallel folds (kinks) composed of layers of d i f ferent f inite thickness.

207

lel layer reveals Bernoulli 's theorem, i.e. undeformed cross-sect ions in the deformed

s ta te . A more realist ic model provides St. Vernant~s solution for bending of a bar by

couples. The deformed s ta te is described by the map

c X = x(1 + ~ z) (4.82)

Y = y(1 - ~ z)

c Z = z + ~ ( o ( y 2 - z 2) - x 2)

where c: s t rength of couples; E: Young~s modulus, o : Poisson~s ratio and

c /E = R-I; R: radius of curvature (see e.g. BUDO, 1974; LOVE, 1944).

In engineering the usual procedure is to study the deformation of an object under

specif ic s tress configuration. In geology we usually know li t t le about the original s t ress

field. Therefore, it is worthwhile to work with models, and the question is not mainly

how the object deforms within a cer tain s t ress field but how far the model is applicable.

One question, which can be pushed forward by mathemat ica l analysis, is how the various

parameters in teract and whether the solution is bounded to some region, i.e. concerning

the bending model we are in teres ted if the thickness of the bar is unlimited.

The limits of the solution are given by the condition that the Jacobian of St. Ver-

nant ' s map vanishes; however, in this case we can simplify the analysis by reducing

the map to a standard ca tas t rophe on Thomas list. If we slide the bar along the line

y=0, i.e. by a vert ical plane along the long axis (z), equation (4.82) simplifies to

-Z = ~ E ( o z 2 + x 2) + z (4.83)

c X = E x z + x,

and by means of the t ransformat ion oz 2 ~ z 2 this equation simplifies to the standard

form

* 2 2 2E Z = z + x +c'--o¢ z (4.84)

* 2E X = 2xy + - - x . c

The only assumptions involved are that the couples ~c' do not vanish {we consider only

deformed states) and that o ¢0. The singular set of this map is i l lustrated in Fig. 4.51

by use of the standard form of the hyperbolic umbilic. Fig. 4.52 i l lustrates a single

sect ion (E,c, =constant), and it becomes clear how the solution space is limited by a

cusp and a fold line, i.e. even for small deformations of this type the bar cannot exceed

a cer ta in thickness. Fig. 4.52a ° i l lustrates the shape of the undeformed area, i.e. the

boundaries defined by J=0 {J: Jacobian determinant) . By set t ing J=c one finds lines of

208

. .. ;i !. • . , ' £ •

Fig. 4.51: The s tandard form of Thomas hyperbolic umbilic. Isolines inside c i rc les ind ica te the local tpotential~.

a ~" b i ......... "

Fig. 4.52: a) The non-local sec t ion through a bent bar along i ts long axis (see text) . The c r i t i ca l se t ( se l f - in te rsec t ions of parabolas) corresponds wi th the sect ion through the hyperbol ic umbilic. Parabolas indicate lines which are paral lel in the undeformed s ta te , a °) assoc ia ted undeformed s ta te : Only the blank a rea can be deformed to the image indica ted in (a). b) The same bar with lines of cons tan t values of the Jacobian de te rminan t , b °) the associa ted undeformed image.

209

Fig. 4.53: The 'hyperbolic umbilic' as a sheet of paper folded in its plane.

210

"equal volume change" in the undeformed s t a t e (Fig. 4.52b °) and by means of the

mapping (4.83 or 4.84) the i r image in the deformed s t a t e (Fig. 4.52b). Fur ther proper t ies

will be analyzed in a more general sense in the next sect ions. The reduct ion of the

original map to a two-dimensional problem, clearly, gives only an idea how the en t i r e

sys tem reacts~ however, the solution is co r r ec t for the plane se lected, and it allows

to r e Ia t e the de format ion to a r a t h e r simple exper imen t {Fig. 4.53): A shee t of paper

'bended ' in i ts plane i l lus t ra tes in a r a t h e r s imple way how the l imit ing fold line evolves.

To connec t this sect ion with the fu r ther analysis of paral lel sys tems we observe

t ha t equat ion (4.83) can a l t e rna t ive ly be wr i t t en (using vec tor notation):

= y + ~z -(~y - 2--E z

e 2 2 (oY -x E/c

(4.85)

where the f irst t e rm is just the descr ipt ion of the "neut ra l surface" and the second

t e rm is the non-normal ized normal of the surface e lements . Thus, the f irst two t e rms

on the r ight side descr ibe the 'normal va r ia t ion ' of the surface, i.e. h(u,v) = I Xu A Xv{

in equat ion (4.76). The final t e rm on the r ight side can be taken as a non- l inear dis turb-

ance of the quasi-paral le l sys tem. This non- l inear t e rm depends only on z such tha t

the bending equat ion is properly approximated by the quasi-paral lel system if z is suff i-

c ient ly small.

4.3.4 Nota t ion of S t ra in

Fll X 2 =

3

deformed

Whenever e las t ic or p las t ic de format ions are considered, the problem is usually

fo rmula ted in t e r m s of s t resses and s t ra ins . The procedure is to solve a given problem in

t e r m s of dis locat ions {e.g. LOVE, 1944). The deformed s t a t e then is given in the form

discussed with s imilar folds, i.e. by a map

• = f ( x l , x 2 x2 + ' ~i 'x3)

undeformed dislocations

The elements of strain are related to the Jacobian matrix of the dislocations (e.g. LOVE,

1944).

211

J(E) =

"agl agl. ag~- axl ax 2 ax 3

ag= aga a~a ax 1 ax 2 ~x 3

a~a ~ 3 3,~3 ax 1 ax 2 a x 3 a

where (Ex) etc. are the column vec tors of the Jacobian matrix.

= [<, <, <,] (4.87)

Now, the re exists a simple relat ionship be tween the Jacobian mat r ix of the map (4.86) I

which will be denoted by J(X)~ and the Jacobian mat r ix of the dislocat ions

(4.88) J(X) = J(E} + I

where I: the ident i ty matrix.

If we now consider the more general map

[iil I: X 2 = V(Xl,X2,X3) [ ,

(Xl'X2'X3)]

and the Jacobian matrix of the dislocations is

J(E) = J(X) - I.

The mat r ix of s t ra in e lements in the l inear theory of e las t ic i ty then is given by (e.g.

LOVE, 1944; MEANS, 1976)

(4.90)

i~['¢ SXi~ + (_~)] _ I}. ( eij,] = ~ Lt-3-£-x. J ~ . , , d

l l

In the nonl inear theory of f inite s train, however, the mat r ix of s t ra in e lements can

be wr i t t en

= l { ~ 3 X i , faXj )~ - I } (4.91) ( ) L , a x i

g

where < , > denotes the scalar product of the column vectors of the Jacobian matr ix .

We shall need these nota t ions because they simplify the following work.

4.3.5 Genera l ized Plane Strain in Layered Media

Geological problems are commonly solved under the assumption of plane strain.

In this case, the three-d imensional problem simplifies to a two-dimensional one, which

212

causes less computa t iona l problems even in the computer . Here we consider the case

of genera l ized plane s t ra in and define i t by the condit ion tha t e =e =e =0. If we zz xz yz

apply this to the equat ions of s imilar folds (equation 4.77), we find tha t the s t a t e of

plane s t ra in is achieved by a map

X = ax + f{y)

Y = by + g(x)

Z = z

(4.92)

where not only the s t ra in componen ts involving the z-d i rec t ion vanish but also the asso-

c ia ted dislocations, Special cases of this type have been discussed by HOBBS (1971)

in detail , A more in te res t ing s i tuat ion arises if one considers paral lel folds. We take

the discussion of the bending of bars as mot iva t ion and consider the quasi-paral lel system

X = ~ +z (4.93)

(x,y

o r x = x(u,v) + z]x u A x v I N

and app!y the l inear theory of s t rain. The s t ra in e l emen t s are found from equat ion (4.90)

exx=-Z fxx; eyy=-Z fyy; exy= Zfxy; (4.94)

ezz=exz=eyz=0;

i.e. the quasi-paral le l system descr ibes a s t a t e of genera l ized plane s t rain. Next we

consider the pe r f ec t paral lel system

X = x(u,v) + zN(u,v) (4.95)

and apply the non- l inear theory of f ini te s t rain. The Jacobian ma t r ix of this map can

be wr i t t en

J(X) = ((x u + ZNu), (% + ZNv), (N)). (4.96)

The f ini te s t ra ins are defined by the sca lar products of the column vectors of J (equ.

4.91) which simplify i f one applies the or thogonal i ty re la t ions <Xu, Nu > =0;< Xu,N> =0

etc . . The f ini te s t ra ins are

2~< (Xu+ ZNu),(Xu + ZNu) > 1 z2< Nu,N Cxx = -1) = ~e-(<x ,x >+ 2z<N x > + u >) (4.97) 2 u u u u

@<(Xv+ZNv)'(Xv+ZNv )> -I)= <Xv,Xv>+ 2z<N x > + z2< -I) E YY v v Nv'Nv>

213

1 ~ z2<Nu,Nv>) xy = ~{2<(Xu+ZNu)'(Xv+Nv)> -1) = <Xu,Xv> + ZkNu, Xv> + <NvXu> ) +

z z = ~xz = ~ z = O .

The two equations, thus, are isomorphic with respect to the applied theory of strain,

and because the linear theory is an approximation of the non-linear one, equation (4.93)

provides an approximation of equation (4o95). Indeed, if we consider only the "neutral

surface" X = x(u,v) which is everywhere different iable , then we can approximate it locally

by an explicit function z=f(x,y) in te rms of its moving frame. There is another impor-

tant aspect: If we consider the case x(u,v)=f{u,v), then the focal surfaces, which bound

the solution space, are identical for equation (4.93) and equation (4.95) because we can

define them as the evolute (surface) of the normal rays, and the only d i f ference be tween

these equations is the magnitude of dislocation along the normals. Thus, even the solu-

tions deviate to some ex ten t in the linear and non-linear model, the cr i t ical set of focal

domains remains identical.

Both equations studied here describe families of surfaces, and this fact has its

expression in the equations for the strains, which can be expressed in te rms of the

fundamental forms of the "neutral surface". Following DoCARMO (1976) we note that

E = < Xu~X u> ; G = < Xv, Xv> ; F = < Xu~X V>

-e = <Nu,Xu > ; -g =<NvlXv> ; -2f = <Nu,Xv>+< Nv,Xu> ,

and the e lements of strain can be wri t ten

2 xx = E - 2ze + z2<Nu,Nu>

Cyy G 2zg + z < Nv,Nv>

Cxy = F - 2zf + z 2 <Nu,Nv>

(4.98)

providing the base for a potential further analysis.

The study of parallel system thus provides us with ra ther general models for gener-

alized plane strain and with s t ra tegies to find more general solutions in te rms of finite

strain than usually: If we are able to find a solution in te rms of the linear theory, the

close relat ions discussed here allow to t ransfer it to the non-linear theory as far as

quasi-parallel systems are concerned. The equations governing bending deviate from the

quasi-parallel linear model only by an additional non-linear term. In the finite strain

model a ra ther similar s t ruc ture is achieved if the variable 'z ~ is replaced by a function

h(z), e.g. h(z} = h(1-z).

214

The quasi-paral lel sys tems provide a family of funct ions of po ten t ia l value for

the analysis of large scale deformat ions , c lass i f iable by the s t ruc tu re of the i r Jacobian

mat r ix (or the re la ted s t r a i n s ) - - an aspect which links this study with ca tas t rophe

theory. The problems, which have been discussed here, are those which can be solved

by 'hand methods ' , In layered media the physical proper t ies usually a l t e rna t e (or change

gradually). Replacing the scalar funct ion h(x,y) in equat ion (3.76) by a mat r ix H(x,y;z)

allows to study more compl ica ted systems in t e rms of i t e ra t ed maps.

4.4 SUMMARY

The discussed examples are manifold, cover ing various geological and "applied

m a t h e m a t i c a l " methods. This calls for a sys t ema t i za t ion of the various forms of ins table

behavior and p a t t e r n format ion. There are two aspects: First , we have th ree major

m a t h e m a t i c a l o b j e c t s - - surfaces , t ra jec tor ies , and d is tance f u n c t i o n s - - which, under

ce r t a in maps, develop discont inui t ies or become instable in some sense. Secondly, we

have the ins tabi l i t ies themse lves which cause branching solutions.

A first group of objects , which occured repea ted ly throughout the examples, are

surfaces which can locally be described as F(x 1 . . . . , xi)=0. The in te res t ing deformat ions

occur under the t r ans fo rma t ion F=F(Xl, ... , x i) + sN(Xl, . . . . xi). The surface t rans forms

like a wave front , a t leas t locally, which is cons t ruc t ed by Huygens' principle. This

implies t ha t the family of sur faces can locally be descr ibed as F(Xl, ... , xi; s)=0, and

the concep t of s t ruc tura l s tab i l i ty can be used to study the geomet r i ca i s ingulari t ies .

The evolut ion of wave f ronts is, of course, the typical example al though the discont inu-

i t ies, which ar ise in the two-dimensional project ion of three-d imens ional objects , provide

still more geomet r i ca l examples. In addition, we can loca te here the probabi l is t ic hull

of the on togene t i c 'morphospace ' of chap te r 2, and if we allow for more general disloca-

tions, then the whole example of the 0n togene t ie 'morphospace ' belongs to this class

of problems. Clearly, the "pre -compute r" analysis of paral lel folds and the slip-line theory

are closely r e l a t ed in geomet r i ca l t e rms al though the physical p a r a m e t e r s are quite

d i f fe rent .

The second group of ob jec ts were t r a j e c t o r i e s - - on togene t i c t r aces in chap te r 2,

rays and sl ip-l ines in chap te r 4. Along these t r a j ec to r i e s a dynamics can be es tabl ished --

the spreading of a wave front , the on togene t i c deveIopment e tc . - - by an equat ion like

ds/dt=f(s) (s:arc length). But the t r a j ec to r i e s or rays themse lves are s t a t i c ob jec ts like

the previously discussed famil ies of surfaces , and they are descr ibed by the identical

equat ions up to a normal iza t ion fac to r of the evolut ion pa rame te r . If one has especial ly

a gradient system, then e l emen ta ry ca t a s t rophe theory provides a c lass i f ica t ion of the

in te res t ing geomet r i ca l s ingulari t ies . However, the gradient r es t r i c t ion is not so essent ia l

215

because e lementary ca tas t rophe theory is a local theory, i.e. it is suff ic ient if we can

t ransform such a system locally, near a cri t ical point, into a gradient system. The

interest ing singularities are usually d i f ferent from the ' involutes of rays'; however, they

are re la ted as discussed in terms of seismic reflect ion.

A further group of examples can be col lected under the term "distance functions",

and this is the most comprehensive group. Again, we can easily return to e lementary

ca tas t rophe theory. The wave fronts on a given set of rays have to sat isfy the dis tance

function r=I(x,y)-(Xo,Yo) I. Similar formulations are possible for the slip-line theory and

e las t ic i ty theory using the concept of strain and s t ress potentials. It is commonly the

approach via a dis tance or energy function, which allows to study problems more deeply

in topological terms. In principle, the problems encountered in surface reconstruct ion

and the convex hulls of point sets (chapter 2, Honda trees) belong to this group.

If we now take the example of cluster analysis, the situation is d i f ferent . What

we are doing in this case is essentially that we map a n-dimensional space onto a one--

dimensional one, the space of distances. This allows to represent clustering results from

a n-dimensional space as a binary tree. The binary t ree is nothing than the graphical

representa t ion of the pair (X,d) where 'X' is a one-dimensional point set and 'd' is a

relation be tween the points which defines an order. In the case of c luster t rees, the

problem is that the dis tance 'd' does not imply an ordering sequence and that the map

R n - - R can be multi-valued. This returns the problem to singularity theory as encoun-

tered with the two-dimensional images of three-dimensional objects.

What does hold together all examples, is that we can describe them all as maps

and that the observed instabili t ies are related to singularities of these maps whereby

the identical geometr ical singularity may have di f ferent physical meaning concerning

d i f ferent objects and may be di f ferent with respec t to any chosen subspace, i.e. in spatial

and spat io- temporal coordinates. Maps in their most general sense are not very specific.

Insofar, the previous discussion provides examples of more specific maps, which are

of some in teres t in geological applications.

Concerning the bifurcation of solutions the preceding discussion remains within

kinemat ic models. A bifurcation point is defined as the value of an evolution paramete r

at which the local topology changes. Thus, the cusp ca tas t rophe is the bifurcation set

of the swallowtail, it separa tes the area with continuous wave fronts from that one

with se l f in tersec t ing wave fronts which, of course, are of d i f ferent topological type,

and in a similar sense the transit ion from parallel to similar folds can be interpreted,

etc . . The ca tas t rophe approach "implies that a non-bifurcation point is one at which

the topology does not change, i.e. a point at which the system is s tructural ly stable.

... bifurcat ion is seen as a loss of (structural) stabil i ty of the system, ra ther than the

fo ld

loss of s t a b i l i t y o f a p a r t i c u l a r so lu t ion" (STEWART, 1982). To avoid n o m e n c l a t u r e

c o n f l i c t s , t h e s e t opo log ica l b i f u r c a t i o n s a re c a l l ed c a t a s t r o p h e s . B e c a u s e of t he i r wide

m e a n i n g and t h e i r t opo log ica l o b j e c t i v e s t h e y a p p e a r wel l s u i t e d for new a p p r o a c h e s

o f g e o m e t r i c a l r e a s o n i n g in geo logy .

E L E M E N T A R Y C A T A S T R O P H E S

P O T E N T I A L S A N D C R I T I C A L SETS

c u s p

v ( x ) = ½~3 + ux ~I

vlxl -- ~x4, ~x~ + V X

f / \

s w a l l o w t a i l

v~xl~ ~s+ ~3+ ~x~+ ~

216

h y p e r b o l i c u m b l l i c

V(x,y) = x 3 + y3 + w x y - ux -

e l l i p t i c u m b i l i c vY u

V(x,y) = x 3 - 3xy 2 + w{x 2 +y2) _ x -

T h e s e a r e on ly t h o s e e l e m e n t a r y c a t a s t r o p h e s wh ich a r e e a s i l y g r aphed , For a d e t a i l e d

l is t s e e one o f t h e t e x t b o o k s .

REFERENCES

Adler, R.E. 1970: Elektronische Datenverarbeitung in der modernen Tektonik. Clausth.

Tekt. Hefte, ]0:25-47.

Adler, R.E. eta]. 1968: Elektronische Datenverarbeitung in der Tektonik. Clausth. Tekt. Hefte, 8.

Alagi6, S. & Arbib, M.A. 1978: The design of well-structured and correct programs. (Springer), New York.

Altheimer, E., Bayer, U., Ott, R. 1982: The mapping package PEMAP. In: 8eilacher, A. et al.eds.:Paleocology. N. Jb. Geol. Pal., Abh., 164:17-23.

Andronov, A.A., Vit, A.A., Khaikin, C.E. 1966: Theory of oscillators. (Pergamon) Oxford.

Arnold, V.I. 1984: Catastrophe theory. (Springer) Heidelberg.

Ashby, W.R. 1956: An introduction to cybernetics. -- 1974: Einf~hrung in die Kybernetik. Frankfurt.

Ballentyne, C.K. & Cornish, R. 1979: Use of the Chi-square test for the analysis of orientation data. Geol. Mag.

Barker, S.B., Cumming, G.,~ Horsfield, K. 1973: Quantitative morphometry of the branching structure of trees. J. theor. Biol., 40:33-43.

Bayer, U. 1969: Die Gattung Hyperlioceras BUCKMAN (Ammoaoidea, Graphoceratidae) aus dem Unter-Bsjocium (discites-Schichten), insbesondere vom Wutachtal (SHdbaden).

Jber. u. Mitt. oberrhein, geol. Vet., N.F. 51:31-70.

-- 1970:Anomalien bei Ammoniten des Aaleniums und Bajociums und ihre Beziehung zur Lebensweise. N. Jb. Geol. Palfiont. Abh., 135:19-41.

-- 1972: Zur Ontogenie und Variabilitgt des jurassischen Ammoniten Leioceras opalinum. N. Jb. Geol. Palgont. Abh., 140:306-327.

-- 1977: Cephalopoden-Septen. Teil I: Konstruktionsmorphologie des Ammoniten- Septums. N. Jb. Geol. Palfiont. Abh., 154:290-364.

-- 1977: Cephalopoden Septen. Tell 2: Regelmechanismen im Gehfiuse- und Septenbau. N. Jb. GeoI. Palgont. Abh., 155:162-215.

-- 1978: Morphogenetic programs, instabilities and evolution -- a theoretical study. N. Jb. Geol. Palgont. Abh., 156:226-261.

-- 1978: Models :in morphogenesis. N. Jb. Geol. Paleont. Abh., 157:57-70.

-- 1978: Modelle instabiler Kompaktionsverlgufe unter Sedimentation. Geolog. Rundschau, 67:980-990.

-- 1978: Finite computations in compaction theory. In: F. Westphal & A. Seilacher eds.: PalSkologie. N. Jb. Geol. Pal~ont. Abh., 157:176-185.

1982: Data storage and data processing in paleoecology. In: A. Seilacher et al. eds.: Studies in Palecology. N. Jb. Geol. Palgont. Abh., 164:23-25.

1982: Cluster- and classification-strategies in paleoecology. In: Seilacher et al. eds.: Studies in palecology, N. Jb. Geol. Palgont. Abh., 164:12-17.

1982:Zur Auswertung yon Richtungsdaten. In: Seilacher et al. eds.: Studies in Palecology, N. Jb. Geol. Palgont. Abh., 164:23-25.

1982: WahrscheinlichkeitsmaBe auf Profilen. In: Seilacher et al. eds.: Studies in Palecology, N. Jb. Geol. Palgont. Abh., 164:26-29.

1983: "Physical properties" in: Ludwig, W.J., Krasheninikov, V.A., et al.: Initial Rep. Deep Sea Drilling Project, Vol. LXXI, (U.S. Gov. Print. Off.) Washington.

218

Bayer, U. 1983: The influence of sediment composition on physical properties inter- relationships. In: Ludwig, W.J., Krasheninikov, V.A. et sl.: Initial Rep. Deep Sea Drilling Project, Vol. LXXI (U.S. Gov. Print. Office) Wahington.

Bayer, U. & McGhee, G.R. 1984: Iterative evolution of middle Jurassic ammonite faunas. Lethaia, ]7:1-16.

Bayer, U. & Seilacher, A., eds.: Sedimentary and evolutionary cycles. (Springer) Lect. Notes Earth Sci.,1. 1985.

Bayly, M.B. 1974: An energy calculation concerning the roundness of folds. Tectono- physics, 24:291-316.

Behzadi, H. & Dubey, A.K. 1980: Variation of interlayer slip in space and time during flexural folding. Journ. struct. Geol., 2:453-457.

Ben-Menahem, A. & Singh, S.J. 1981: Seismic waves and sources. (Springer) New York.

Betz, A. 1948: Konforme Abbildungen. Berlin.

Biju-Duval, B. et al. 1974: Geology of the Mediterranean Sea basins. In: Burk, C.A. & Drake, C.L. eds.: The geology of continental margins. (Springer) New York, 695-721.

Blackith, R.E. & Reyment, R.A. 1971: Multivariate morphometrics. London.

Blind, W. 1976: Die ontogenetiscbe Entwicklung yon Nautilus pompilius (LINNE). Paleontographica, A, 153:117-160.

Blum, H. 1973: Biological shape and visual science. J. Theor. Biol. 38:205.

Blum If. & Nagel, R.W. 1977: Shape description using weighted symmetric axis features Proc. I.E.E.E. Comp. Soc. Conf. Pattern Recogn. Image Proc.

Bonner, J.T. 1968: Size changes in development and evolution. J. Paleont. 42:1- 15.

Bonyum, D. & Stevens G. 1971: A general purpose computer program to produce geological stereonet diagrams. In: Cutbill, J.L. ed.: Data processing in biology and geology. (Acad. Press) New York, 165-188.

Bookstein, F.L. 1978: The Measurement of biological shape and shape change. Lect. Notes Biomath. 24 (Springer) Berlin.

Boyce, R.E. 1977: Deep Sea Drilling procedures for shear strength measurements of cIayey sediments using modified Wykeham Farrance Laboratory Vane Apparatus. In: Barker, Dalzill et al.: Init. Rep. DSDP, 36 (U.S. Gov. Print. Office) Washington ,837-847.

Buckman, S.S. 1887-1907: Monograph of the ammonites of the Inferior Oolite series. Paleontogr. Soc. London.

Bud6, A. 1974: Theoretische Mechanik. (VEB) Berlin.

Burk, C.A. & Drake, C.L. 1974: The geology of continental margins. (Springer) New York.

Busk, H.G. 1956: Earth Flexures. Cambridge.

Callahan, J.J. 1974: Singularities and plane maps. Amer. Math. Monthly, 81:211-240.

Cavendish, J.C. 1974: Automatic triangulation of arbitrary planar domains for the finite element method. Int. J. Numer. Method. in Engineering, 8:679-696.

Chilingarian, G.V. & Wolf, K.H. eds. 1975-1976: Compaction of coarse grained sediments I,II. Developments in Sedimentology, 18A,B, (Elsevier) New York.

Cobbold, P.R., Cosgrove, J.W. Summers, J.M. 1971: Development of internal structures in deformed anisotropic rocks. Tectonophysics, 12:23-53.

Collatz, L. & Wetterling, W. 1971: Optimierungsaufgaben. (Springer) Berlin.

Cope, J.C.W. et al. 1980: A correlation of Jurassic rocks in the British Isles. Part I & II. Geol. Soc. London Spec. Rep. 14/15.

219

Courant, R. & Hilbert, D. 1968: Methoden der mathematischen Physik. (Springer) Berlin.

Damuth, J. E. 1980: Use of High-Frequency (3.5-12kHz) Echograms in the study of Near-Bottom Sedimentation Processes in the Deep-Sea: A Review. Marine Geology, 38:51-75.

Dangelmayr, G. & Armbruster, D. 1983: Singularities in phonon focusing. Preprint Inst. f. Informationsverarb. THbingen.

Dangelmayr, G. & G~ttinger, W. 1980: Remote sensing in terms of singularity theory. Mskr. Inst. f. Informationsverarbeitung T~bingen.

Dangelmayr, G. & GNttinger, W. 1982: Topological approach to remote sensing. Geo- phys. J. R. Astr. Soc., in press.

Davis, J.C. 1973: Statistics and data analysis in geology. (J.Wiley) New York.

Davis, J.G. & McCullagh, M.J. eds. 1975: Analysis of spatial data. (J. Wiley) London

DeBoor, C. ]978: A practical guide to splines. (Springer) Appl. math. Sci. 27, New York.

Denert, E. & Frank, R. 1977: Datenstrukturen. (BI) ZHrich.

Desai, Ch.S. & Christian J.T. 1977: Numerical methods in geotechnical engineering. (McGraw-Hill) New York.

Diday, E. & Simon, J.C. 1980: Clustering analysis. In: Fu, K.S. ed.: Digital pattern recognition. 2nd ed. (Springer) Berlin, 47-94.

Dietrich, J.H. 1970: Computer experiments on mechanics of finite amplitude folds. Canadian J. Earth Sei., 7:467-476.

DoCarmo, M.P. 1976: Differential geometry of curves and surfaces. (Prentice-Hall) Engelwood Cliffs.

Donath, F.A. 1969: Experimental study of kink-band development in the Martinsburg slate. Geol. Surv. Canada, 68-52:255-288.

Driver, E.S. & Pardo, G. 1974: Seismic traverse across the Gabon continental margin. In: Burk, C.A. & Drake C.L. eds.: The geology of continental margins.(Springer) New York, 293-311.

Duffy, M.R., Britton, N.F., Murray, J.D. 1980: Spiral wave solutions of practical reaction diffusion systems. SIAM J. Appl. Math., 39:8-13.

Efron, B. 1965: The convex hullof a random set of points. Biometrika, 52:331-343.

Einsele, G. 1977: Range, velocity, and material flux of compaction flow in growing sedimentary sequences. Sedimentology, 24:639-655.

Einsele, G. 1985: Response of sediments to sea-level changes in differing subsiding storm-dominated marginal and epeiric basins. In: Bayer & Seilacher eds.: Sedi- mentary and evolutionary cycles. (Springer) Heidelberg.

Einsele, G. & Seilacher, A. eds. 1982: Cyclic and event stratification. (Springer) Berlin.

Eldredge, N. & Gould, S.J. 1972: Punctuated equilibria: An alternative to phyletic gradualism. In: Schopf, T.J.M. ed.: Models in paleobiology. (Freeman), San Francisco, 82-11.

Embley, R.W. 1980: The role of mass transport in the distribution and character of deep-ocean sediments with special reference to the North Atlantic. Marine Geology, 38:23-50.

Fisher, J.B. & Honda, H. 1977: Computer simulation of branching pattern and geometry in Terminalia (Combretaceae), a tropical tree. Bot. Gaz. ]38, 377-384.

Fisz, M. 1976: Wahrscheinlichkeitsrechnung und mathematische Statistik. (VEB) Berlin

220

Fletcher, R.C. 1979: The shape of single-layer folds at small but finite amplitude. Tectonophysics, 60:77-87.

Flood, R.D. 1980: Deep-Sea sedimentary morphology, modelling and interpretation of echo-sounding profiles. Marine Geology, 38:77-92.

Fox, T. 1975: Some practical aspects of time series analysis. In: McCammon R.B. ed.: Concepts in Geostatistics° (Springer), Berlin, 70-89.

Freeman, H. & Pieroni, G.G. eds. 1980: Map data processing. (Acad. Press) New York.

Freund, R. 1974: Kinematics of transform and transcurrent faults. Tectonophysics, 21:93-134.

Fu, K.S. ed. 1980: Digital pattern recognition. (Springer) Berlin.

Gebelein, H. 1951: Anwendung gleitender Durchschnitte zur Herausarbeitung yon Trend- linien und H~ufigkeitsverteilungen. Mitt. BI. math. Statistik, 3:45-68.

Gill, W.D. 1953: Construction of geological sections of folds with steep-limb atten- uation. Bull. Amer. Assoc. Petrol. Geol., 37:2389-2406.

Gould, S.J. 1966: Allometry and size in ontogeny and phylogeny. Biol. Rev., 41:587- 640.

-- 1971: Geometric similarity in allometric growth: A contribution to the problem of scaling Jn the evolution of size. Amer. Naturalist, 105:113.

-- 1977: Ontogeny and phylogeny. (Belknap) Cambridge.

Grant, F.S. & West, G. F. 1965: Interpretation theory in applied Geophysics. (McGraw Hill) New York.

Grenander, U. 1976: Pattern Synthesis. (Springer) Appl. math Sci. 18, New York.

-- 1978: Pattern Analysis. (Springer) Appl. math. Sci. 24, New York.

-- 1981: Regular structures. (Springer) Appl. Math. Sci. 33, New York.

GHttinger, W. 1979: Catastrophe geometry in physics: a perspective. In: GHttinger & Eikemeier eds.: Structural Stability in Physics. (Springer) Berlin, 23-30.

GHttinger, W., Eikemeier, H. eds. 1979: Structural stability in physics. (Springer) Berlin.

Guggenheimer, H.W. 1977: Differential geometry. (Dover) New York.

Gzovsky, M.V~ et al. 1973: Problems of the tectonophysical characteristics of stress deformations, fractures and deformation mechanisms of the earth's crust. Tecto- nophysics, 18:157-205.

Hadeler, K.P. 1974: Mathematik fHr Biologen. (Springer) Berlin.

Haken, H. 1977: Synergetics. An Introduction. (Springer) Berlin.

-- 1979: Synergetics and a new approach to bifurcation theory. In: G~ttinger & Eikemeier: Structural stability in physics. (Springer) Berlin, 31-41.

-- ed. 1981: Chaos and order in nature. (Springer) Derlin. -- ed. 1982: Evolution of Order and Chaos. (Springer) Berlin.

Hamilton, E.L. 1976: Variations of density and porosity with depth in deep-sea sediments. J. Sediment. Petrol., 46:280-300.

Harbaugh, J.W., Doveton~ J.H., Davis, J.C. 1977: Probability methods in oil explora- tion. (J. Wiley), New York.

Harland, W.B. et al. 1982: A geological time scale. (Cambridge Univ. Pr.) Cambridge. Hartigan, J.A. 1975: Clustering algorithms. (Wiley) New York.

Hattori, I. 1973: Mathematical analysis to discriminate two types of sandstone- shale alternations. Sedimentology, 20:331-346.

Hills, E.S. 1966: Elements of structural geology. (Methuen) London.

221

Hobbs, B.E. 1971: The analysis of strain in folded layers. Tectonophysics, 11:329- 375.

Hochstadt, H. 1964: Differential equations: a modern approach. (Dover Publ.) New York, 1975.

Hoeppener, R. 1978: Grenzen quantitativer Untersuchungen bei tektonischen Experimen- ten. Geol. Rdsch., 67:858-879.

Honda, H. 1971: Description of the form of trees by the parameters of the tree- like body: Effect of the branching angle and the branch length on the shape of the tree-like body. J. theor. Biol., 31:331-338.

Honda, H. & Fisher, J.B. 1978: Tree branch angle: maximizing effective leaf area. Science, 199:888-890.

-- 1979: Ratio of tree branch length: the equitable distribution of leaf clusters on branches. Proe. Natl. Acad. Sci. USA, 76:4875-3879.

Horton, R.E. 1945: Erosional development of streams and their drainage basins: hydrophysical approach to quantitative morphology. Geol. Soc. Am. Bull., 156: 275-370.

Hoschek, J. 1969: Mathematische Grundlagen der Karthographie.(BI) Mannheim.

Hujbreyts, C.J. 1975: Regionalized variables and quantitative anaiysis of spatial data. In: Davis & McCu]lagh eds.: Display and analysis of spatial data. (J. Wiley), London , 38-53.

Huxley, J. 1932: Problems of relative growth. (Methuen) London.

Jacobs, O.L.R. 1974: Introduction to control theory. (Clarenton) Oxford.

Jaeger, J.C. 1969: Elasticity, fracture and flow. (Methuen) London, 3rd ed.

Jaeger, J.C. & Cook, N.G.W. 1971: Fundamentals of rock mechanics. (Chapman & Hall) London.

J~nich, K. 1980: Topologie. (Springer) Berlin.

Johnson, D.A. & Damuth, J.E. 1979: Deep thermohaline flow and current-controlled sedimentation in the Amirante Passage: Western Indian Ocean. Marine Geolo-

gy, 33:1-44.

Johnson, A.M. & Ellen, S.D. 1974: A theory of concentric, kink and sinusoidal folding and of monoclinal flexuring of compressible, elastic multilayers. Teetono- physics, 21:301-339.

Johnson, A.M. & Honea, E. 1975: A theory of concentric, kink, and sinusoidal folding and of monoclinal flexuring of compressible elastic multilayers. Tectonophysics 25:26]-280.

Johnson, A.M. & Page, B.M. 1976: A theory of concentric, kink and sinusoidal folding and of monoclinal flexuring of compressible, elastic multilayers, VII. Tectono- physics, 33:97-143.

Johnson, A.M. & Pollard, D.D. 1973: Mechanics of growth of some laccolithic intru- sions in the Henry Mountains, Utah. I,II: Tectonophysics, 18:261-309,311-354.

Journel, A.G. & Huijbregts, Ch.J. 1978: Mining Geostatistics. (Acad. Press) London.

Kaesler, R.L. 1966: Quantitative Re-evaluation of ecology and distribution of recent Foraminifera and Ostracoda of Todos Santos Bay, Baja California, Mexico. Univ. Kansas Paleont. Contr. I0.

Kant, R. & Kullmann, J. 1980: Umstellung im Geh[usebau jungpal~ozoischer Ammonoideen N. Jb. Geol. Pal~ont. Mh., 673-685.

Kertz, W. 1969: Einf~hrung in die Geophysik I. (BI) Mannheim, 216-221.

Kl6man, M. 1983: Points, Lines and Walls. (J. Wiley) Chichester.

222

Krause, H.F. 1970: Die yon W.B. Kamb vorgeschlagene Berechnung der Ausz~hlfl~che fur GefHgediagramme, eine kritische Diskussion. Clausth. Tekt. Hefte, 10:175- 190.

Krumbein, W.C. 1975: Markov models in the Earth Sciences. In. McCammon, R.B. ed.: Concepts in Geostatistics. (Springer) Berlin, 90-105.

Krumbein, W.C. & Graybill, E.A. 1965: An introduction to statistical models in geology. (McGraw-Hill) New York.

Lindenmayer, A. 1975: Developmental systems and languages in their biological contact. In: Herman, G.T. & Rosenberg, G. eds.: Developmental systems and languages. (North Holland Publ.) Amsterdam.

Ling, F.F. 1973: Surface mechanics. (Wiley) New York.

Lister, G.S., Paterson, M.S., Hobbs, B.E. 1978: The simulation of fabric development in plastic deformation and its application to quarzite: the model. Tecto- nophysics, 45:107-158.

Lotka, A.J. 1956: Elements of mathematical biology. (Dover) New York.

Love, A.E.H. 1944: A treatise on the mathematical theory of elasticity. (Dover) New York.

Lu, Y.-Ch. 1976: Singularity theory and an introduction to catastrophe theory. (Springer) New York.

Magara, K. 1968: Compaction and migration of fluids in miocene mudstones, Nagavka Plain, Japan. Am. Ass. Petrol. Geol. Bull, 52:2466-2501.

Mandelbrot, B. 1977: Fractals: Form, chance, and dimension. (Freeman), San Francisco

Mangold, H.v., Knopp, K. 1968: EinfHhrung in die HShere Mathematik. Teil 2. (Hizel Verl.) Stuttgart.

Mardia, K.K. 1972: Statistics of directional data. (Acad. press) New York.

Marsal, D. 1970: Ein Monte Carlo-Test zum Signifikanz-Vergleich beliebiger Geffige- diagramme im Schmidt'schen Netz. In: Kr~ckeberg, F. et al. eds.: Computer Einsatz in der Geologie, Clausthaler Tekt. H., i0.

-- 1976: Die numerische LSsung partieller Differentialgleichungen. (BI) Mannheim.

-- 1979: Statistische Methoden fHr Erdwissenschaftler. (E. Schweizerbart), 2rid ed., Stuttgart.

Marsal, D. & Philipp, W. 1970: Compaction of sediments. A simple mathematical model for calculating the gravitational porosity-depth equilibrium curve of shales. Bull. geol. Inst. Univ. Uppsala, NS, 2:59-66.

Marsden, J.E. & McCracken, M. eds. 1976: The Hopf bifurcation and its applications. (Springer) Appl. math. Sc. 19.

Matthews, P.E., Bond, R.A.B., Van den Berg, J.J. 1971: Analysis and structural implications of a kinematic model of similar folding. Tectonopbysics, 12:129- 154.

May, R.M. 1974: Biological populations: Stable points, limit cycles, and chaos. Science, 186:645-647.

McGhee, G.R. & Bayer, U. 1985: The local signature of sea-level changes. In: Bayer & Seilacher eds.: Sedimentary and evolutionary cycles. (Springer).

Means, W.D. 1976: Sh-f~ss and strain -- basic concepts of continuum mechanics for geologists. (~pringer) New York.

Meinhard, H. 1984: Models of biological pattern formation. (Academic Pr.) London.

Miall, A.D. 1973: Markov chain analyses applied to an ancient alluvial plain succession. Sedimentology, 20:347-364.

Naef, A. 1928: Die Cephalopoden. Fauna und Flora des Golfes yon Neapel. Zool. Stat. Neapel, Mon. 35.

223

Nicolis, G. & Prigogine I. 1977: Self-organization in nonequilibrium systems. (Wiley) New York.

Niklas, K.J. 1982: Computer simulations of early land plant branching morphologies: canalization of patterns during evolution? Paleobiology, 8:196-210.

Nye, J.F. 1979: Optical caustics and diffraction catastrophes. In: GHttinger & Eikemeier eds.: Structural stability in physics. (Springer) Berlin, 54-60.

-- 1979: Structural stability in evolving flow fields. In: GHttinger & Eikemeier eds.: Structural stability in physics. (Springer) Berlin: 134-140.

Od~, H. 1960: Faulting as a velocity discontinuity in plastic deformation. Geol. Soc. Am. Mem. 79:293-319.

Officer, C.B. 1974: Introduction to theoretical geophysics. (Springer) New York.

Osten, G. & Guckenheimer, J. 1976: Bifurcation phenomena in population models. In Marsden & McCracken eds.: The Hopf bifurcation and its applications. (Springer) Appl. math. Sci. 19:327-353.

Pfaltz, J.L. 1975: Representation of geographic surfaces within a computer. In: Davis & McCullagh eds.: Analysis of spatial data. (Willey) London, 2]0-230.

Poston, T. & Stewart, I. 1978: Catastrophe theory and its applications. (Pitman) London.

Ramsay, J.G. 1967: Folding and fracturing of rocks. (McGraw-Hill) New York. Raup, D.M. 1966: Geometric analysis of shell coiling: genera] problems. J. Paleont.,

40:1178-1190.

Raup, D.M. & Michelson, A. 1965: Theoretical morphology of the coiled shell. Science 147:1294-1295.

Raup, D.M. et al. 1973: Stochastic models of phylogeny and the evolution of diver- sityoJ. Geol. 81:525-542.

R~nyi, A. 1977: Wahrscheinlichkeitsrechnung. (VEB) Berlin.

Richards, O.W. & Kavanagh, A.J. 1947: The analysis of growing form. In: Essays on growth and form presented to d'Arcy Wentworth Thompson. Oxford, 188-230.

Richtmyer, R.D. 1981: Principles of advanced mathematical physics II. (Springer) New York.

Rollier, L. 1911-1918: Fossiles nouveaux ou peu connus des terrains secondaires. M~m. Soc. pal~ont. Suisse, XXXVII-XLII.

RSssler, O.E. 1979: Chaos. In: GHttinger & Eikemeier eds.: Structural stability in physics. (Springer) Berlin.

Rosenfeld, A. & Weszka, J.S. 1980: Picture Recognition. In: Fu ed.: Digital pattern recognition. Communication and cybernetic i0 (Springer).

Sampson, R.J. 1975: The SURFACE If graphics system. In: Davis & McCullagh eds.: Analysis of spatial data. (J. Wiley) London.

Schaeffler, D. 1981: General introduction to steady state bifurcation. Lect. Notes Mathematics, 898:13-47, (Springer) New York.

8cheidegger, A.E. ]982: Principles of Geodynamics. (Springer) Berlin, 2nd ed.

Schreider, J.A. 1975: Equality, resemblance, and order. (Mir Publ.) Moscow.

Schumaker, L.L. 1976: Fitting surfaces to scattered data. In: Lorentz, G.G. et al. eds.: Approximation theory II. (Acad. Press) New York, 203-268.

Schwarzacher, W. 1974: Sedimentation models and quantitative stratigraphy. Developm. in sedimentology, 19 (Elsevier) Amsterdam.

Schwarzacher, W. & Fischer, A.G. 1982: Limestone-shale bedding and perturbations of the earth's orbit. In: Einsele & Seilacher eds.: Cyclic and event stratification (Springer) Berlin, 72-95.

224

Shreve, R.L. 1966: Statistical law of stream numbers. J. Geology, 74:17-37.

Simpson, G.G. 1951: Horses. (Oxford Univ. Pr.) New York.

Smith, R.B. 1975: Unified theory of the onset of folding, boudinage and mullion structure. Geol. Soc. Am. Bull., 86:1601-1609.

Sokal, R.R. & Sheath, P.H.A. 1964: Principles of numerical taxonomy. San Francisco.

Spencer, A.B. & Clabaugh, P.S. 1967: Computer programs for fabric diagrams. Am. J. Sci., 265:166-172.

Stanley, S.M. 1975: A theory of evolution above the species level. Proc. Natl. Acad. Sci., 72:646-650.

-- 1979: Macroevolution: Pattern and Process. (Freeman) San Francisco.

8teinhauser, D. & Langer, K. 1977: Clusteranalysis. (de Gruyter) Berlin.

Stephansson, 0. 1974: Stress-induced diffusion during folding. Tectonophysics, 22:233-251

Stewart, I. 1981: Applications of catastrophe theory to the physical sciences. Physica, 2D:245-305.

-- 1982: Catastrophe theory in physics. Rep. Prog. Phys., 45:185-221.

Swift, S.A. 1977: Holocene rates of sediment accumulation in the Panama Basin, eastern equatorial Pacific: pelagic sedimentation and lateral transport. Journ. Geol. 85:301-319.

Terzaghi, K. 1943: Theoretical soil mechanics. (Wiley) New York.

Thom, R. 1970: Topological models in biology. In: Towards a theoretical biology. 3. draft, ed. C.H. Waddington. (Aldine) Chicago, 89-116.

-- 1975: Structural stability and morphogenesis. (Benjamin) Reading.

-- 1979: Towards a revival of natural philosophy. In: GHttinger & Eikemeier eds.: Structura] Stability in Physics. (Springer) Berlin. ~

Thompson, D'Arcy W. 1942: On Growth and Form. (Cambridge Univ. Pr.) Cambridge.

Thompson, J.M.T. 1982: Instabilities and catastrophes in science and engineering. (Wiley).

Trurnit, P. & Amstutz, G. Ch. 1979: Die Bedeutung des RHckstandes yon Druck-LS- sungsvorg~ngen ffir stratigraphische Abfolgen, Wechsellagerung und Lagerst~tten- bildung. Geolog. Rundschau, 68:1107-1124.

Van Hinte, J.E. 1976: A Jurassic time scale. Amer. Ass. Petrol. Geol. Bull., 60:489- 497.

Vail, P.R. et al. 1977: Seismic stratigraphy and global changes of sea level. In: Payton C.E. ed.: Seismic stratigraphy -- applications to hydrocarbon explora- tion. Am. Assoc. Petr. Geol. Mem., 26:49-212.

Vidal, C. & Pacault, A. ]982: Spatial chemical structures, chemical waves: a review. In: Haken ed.: Evolution of order and chaos. (Springer) Berlin.

Vistelius, A.B. 1976: Mathematical geology and the progress of geological sciences. Journ. Geology, 84:629-651.

Vogel, F. 1975: Probleme und Verfahren der numerischen Klassifikation. (Vanderhoeck& Ruprecht) GSttingen.

Walton, W.R. 1955: Ecology of living benthonic foraminifera, Todos Santos BayiCali- fornia. J. Paleont., 29:952-1018.

Westermann, G.E.G. 1966: Covariation and taxonomy of the Jurassic ammonite Sonninia adicra (Waagen). N. Jb. Geol. Pal~ont. Abh., 124:289-312.

Wirth, N. 1972: Systematisches Programmieren. (Teubner) Stuttgart.

-- 1975: Algorithmen und Datenstrukturen. (Teubner) Stuttgart.

225

Wright, F.J. 1979: Wavefront dislocations and their analysis using catastrophe theory. In: G~ttinger & Eikemeier eds.: Structural stability in physics. (Springer) Berlin.

Wunderlich, W. 1966: Darstellende Geometrie. (BI) Mannheim, 85-104.

Young, J.F. 1975: EinfNhrung in die Informationstheorie. (Oldenbourg) MHnchen.

Zienkiwicz, O.C. 1975: Methode der finiten Elemente. (Hauser) Wien.

ZurmHhl, R. 1964: Matrizen. (Springer) Berlin.

pattern recognition problems in geology and paleontology

Documents