ma4257

102
MA4257: Financial Mathematics II Min Dai Dept of Math, National University of Singapore, Singapore

Upload: stephen-baubau

Post on 12-Dec-2015

15 views

Category:

Documents


0 download

DESCRIPTION

MA4257 module

TRANSCRIPT

Page 1: MA4257

MA4257: Financial Mathematics II

Min DaiDept of Math, National University of Singapore, Singapore

Page 2: MA4257

2

Page 3: MA4257

Contents

1 Preliminary 11.1 Basic Financial Derivatives: Forward contracts and Options . 1

1.1.1 Forward Contracts . . . . . . . . . . . . . . . . . . . . 11.1.2 Options . . . . . . . . . . . . . . . . . . . . . . . . . . 2

1.2 No Arbitrage Principle . . . . . . . . . . . . . . . . . . . . . . 21.2.1 Pricing Forward Contracts (on traded assets) . . . . . 31.2.2 Pricing foward contracts on a non-traded underlying . 41.2.3 Properties of Option Prices . . . . . . . . . . . . . . . 5

1.3 Brownian Motion, Ito Integral and Ito’s Lemma . . . . . . . . 51.3.1 Brownian Motion . . . . . . . . . . . . . . . . . . . . . 61.3.2 Ito Process and Ito Integral . . . . . . . . . . . . . . . 61.3.3 Ito’s Lemma . . . . . . . . . . . . . . . . . . . . . . . 6

2 Option Pricing Models for European Options 92.1 Continuous-time Model: Black-Scholes Model . . . . . . . . . 9

2.1.1 Black-Scholes Assumptions . . . . . . . . . . . . . . . 92.1.2 Derivation of the Black-Scholes Model . . . . . . . . . 92.1.3 Risk-Neutral Pricing and Theoretical Basis of Monte-

Carlo Simulation . . . . . . . . . . . . . . . . . . . . . 112.2 Discrete-time Model: Cox-Ross-Rubinstein Binomial Model . 12

2.2.1 Single-Period Model . . . . . . . . . . . . . . . . . . . 122.2.2 Multi-Period Model . . . . . . . . . . . . . . . . . . . 13

2.3 Consistency of Binomial Model and Continuous-Time Model. 142.3.1 Consistency . . . . . . . . . . . . . . . . . . . . . . . . 142.3.2 *Equivalence of BTM and an explicit difference scheme 15

2.4 Continuous-dividend and Discrete-dividend Payments . . . . 162.4.1 Continuous-dividend Payment . . . . . . . . . . . . . . 162.4.2 Discrete-dividend Payment . . . . . . . . . . . . . . . 17

2.5 No Arbitrage Pricing: A General Framework . . . . . . . . . 18

3

Page 4: MA4257

4 CONTENTS

2.6 *Option Pricing: Replication . . . . . . . . . . . . . . . . . . 20

3 American options and early exercise 233.1 Pricing models . . . . . . . . . . . . . . . . . . . . . . . . . . 23

3.1.1 Continuous-time model for American options . . . . . 233.1.2 Continuous-dividend payment case . . . . . . . . . . . 253.1.3 Binomial model . . . . . . . . . . . . . . . . . . . . . . 26

3.2 Free boundary problems . . . . . . . . . . . . . . . . . . . . . 263.2.1 *Optimal exercise boundaries . . . . . . . . . . . . . . 263.2.2 Formulation as a free boundary problem . . . . . . . . 283.2.3 Perpetual American options . . . . . . . . . . . . . . . 283.2.4 Put-call symmetry relations . . . . . . . . . . . . . . . 30

3.3 Bermudan options . . . . . . . . . . . . . . . . . . . . . . . . 32

4 Multi-asset options 354.1 Pricing model . . . . . . . . . . . . . . . . . . . . . . . . . . . 35

4.1.1 Two-asset options . . . . . . . . . . . . . . . . . . . . 354.1.2 American feature: . . . . . . . . . . . . . . . . . . . . 374.1.3 Exchange option: similarity reduction . . . . . . . . . 374.1.4 Options on many underlyings . . . . . . . . . . . . . . 39

4.2 Quantos . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 404.3 Numerical Methods . . . . . . . . . . . . . . . . . . . . . . . . 42

4.3.1 *Binomial tree methods . . . . . . . . . . . . . . . . . 424.3.2 Monte-Carlo simulation . . . . . . . . . . . . . . . . . 43

4.4 *Suggestions for further reading and Appendix . . . . . . . . 444.4.1 Further reading . . . . . . . . . . . . . . . . . . . . . . 444.4.2 Appendix: . . . . . . . . . . . . . . . . . . . . . . . . . 45

5 Path-dependent options 475.1 Barrier options . . . . . . . . . . . . . . . . . . . . . . . . . . 47

5.1.1 Different types of barrier options . . . . . . . . . . . . 475.1.2 In-Out parity . . . . . . . . . . . . . . . . . . . . . . . 485.1.3 Pricing by Monte-Carlo simulation . . . . . . . . . . . 485.1.4 Pricing in the PDE framework . . . . . . . . . . . . . 495.1.5 American early exercise . . . . . . . . . . . . . . . . . 505.1.6 BTM . . . . . . . . . . . . . . . . . . . . . . . . . . . 515.1.7 Hedging . . . . . . . . . . . . . . . . . . . . . . . . . . 525.1.8 Other features . . . . . . . . . . . . . . . . . . . . . . 52

5.2 Asian options . . . . . . . . . . . . . . . . . . . . . . . . . . . 525.2.1 Payoff types . . . . . . . . . . . . . . . . . . . . . . . . 52

Page 5: MA4257

CONTENTS 5

5.2.2 Types of averaging . . . . . . . . . . . . . . . . . . . . 535.2.3 Extending the Black-Scholes equation . . . . . . . . . 535.2.4 Early exercise . . . . . . . . . . . . . . . . . . . . . . . 555.2.5 Reductions in dimensionality . . . . . . . . . . . . . . 565.2.6 Parity relation . . . . . . . . . . . . . . . . . . . . . . 575.2.7 Model-dependent and model-independent results . . . 575.2.8 Binomial tree method . . . . . . . . . . . . . . . . . . 58

5.3 Lookback options . . . . . . . . . . . . . . . . . . . . . . . . . 585.3.1 Types of Payoff . . . . . . . . . . . . . . . . . . . . . . 585.3.2 Extending the Black-Scholes equations . . . . . . . . . 595.3.3 BTM . . . . . . . . . . . . . . . . . . . . . . . . . . . 605.3.4 Consistency of the BTM and the continuous-time model: 605.3.5 Similarity reduction . . . . . . . . . . . . . . . . . . . 625.3.6 Russian options . . . . . . . . . . . . . . . . . . . . . . 62

5.4 Miscellaneous exotics . . . . . . . . . . . . . . . . . . . . . . . 635.4.1 Forward start options . . . . . . . . . . . . . . . . . . 635.4.2 Shout options . . . . . . . . . . . . . . . . . . . . . . . 645.4.3 Compound options . . . . . . . . . . . . . . . . . . . . 65

6 Beyond the Black-Scholes world 676.1 Volatility simile phenomena and defects in the Black-Scholes

model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 676.1.1 Implied volatility and volatility similes . . . . . . . . . 676.1.2 Improved models . . . . . . . . . . . . . . . . . . . . . 68

6.2 Local volatility model . . . . . . . . . . . . . . . . . . . . . . 696.3 Stochastic volatility model . . . . . . . . . . . . . . . . . . . . 70

6.3.1 Random volatility . . . . . . . . . . . . . . . . . . . . 706.3.2 The pricing equation . . . . . . . . . . . . . . . . . . . 716.3.3 Named models . . . . . . . . . . . . . . . . . . . . . . 73

6.4 Jump diffusion model . . . . . . . . . . . . . . . . . . . . . . 736.4.1 Jump-diffusion processes . . . . . . . . . . . . . . . . . 736.4.2 Hedging when there are jumps . . . . . . . . . . . . . 746.4.3 Merton’s model (1976) . . . . . . . . . . . . . . . . . . 746.4.4 Wilmott et al.’s model . . . . . . . . . . . . . . . . . . 766.4.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . 77

7 Interest Rate Derivatives 797.1 Short-term interest rate modeling . . . . . . . . . . . . . . . . 79

7.1.1 The simplest bonds: zero-coupon/coupon-bearing bonds 797.1.2 Short-term interest rate . . . . . . . . . . . . . . . . . 80

Page 6: MA4257

6 CONTENTS

7.1.3 The bond pricing equation . . . . . . . . . . . . . . . 817.1.4 Tractable models . . . . . . . . . . . . . . . . . . . . . 827.1.5 Yield Curve Fitting . . . . . . . . . . . . . . . . . . . 857.1.6 Empirical behavior of the short rate and other models 867.1.7 Coupon-bearing bond pricing . . . . . . . . . . . . . . 87

7.2 HJM Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . 877.2.1 The forward rate . . . . . . . . . . . . . . . . . . . . . 887.2.2 HJM model . . . . . . . . . . . . . . . . . . . . . . . . 887.2.3 The non-Markov nature of HJM . . . . . . . . . . . . 907.2.4 Pricing derivatives . . . . . . . . . . . . . . . . . . . . 907.2.5 A special case of HJM model: Ho & Lee . . . . . . . . 917.2.6 Concluding remarks . . . . . . . . . . . . . . . . . . . 92

Page 7: MA4257

CONTENTS i

Syllabus

Lecturer: Dr. Dai MinOffice hours: By appointment via emailOffice: S14-04-16, the best way to contact me is via email: [email protected]

Recommended background reading:Hull, J. (1993, 1997, 1999 or 2003) Options, Futures and other Deriva-

tives. Prentice Hall.

Recommended texts: (Except for the last two, these books have beenset as RBR books, available at Science Library)

• Wilmott, P. (2000) Paul Wilmott on Quantitative Finance. John Wi-ley & Sons

• Wilmott, P. (1998) Derivatives: The Theory and Practice of FinancialEngineering. John Wiley & Sons.

• Wilmott, P., Dewynne, J., and Howison, S. (1995) Option Pricing:Mathematical Models and Computation. Oxford Financial Press, Ox-ford.

• Kwok Y.K. (1998) Mathematical Models of Financial Derivatives, Springer.

• Shreve, S.E. (2004), Stochastic Calculus for Finance: The BinomialAsset Pricing Model (Vol I); Continuous-Time Models (Vol II), SpringerVerlag, New York

• Oksendal, B. (2003), Stochastic Differential Equations, Springer (forbasic theory of stochastic calculus).

An introduction to mathematical finance (based on the preface ofI. Karatzas and S.E. Shreve (1998): Methods of Mathematical Finance)

We are now able to talk about “mathematical finance”, “financial en-gineering” or “modern finance” only because of two revolutions that havetaken place on Wall Street in the latter half of the twentieth century. Thefirst revolution in finance began with the 1952 publication of “Portfolio Se-lection”, an early version of the doctoral dissertation of Harry Markowitz,where he employed the so-called “mean-variance analysis” to understandand quantify the trade-off between risk and return inherent in a portfolio

Page 8: MA4257

ii CONTENTS

of stocks. The implementation of Markowitz’s idea was aided tremendouslyby William Sharp who developed the Capital Asset Pricing Model. Fortheir pioneering work, Markowitz and Sharp shared with Merton Millier the1990 Nobel Prize in economics, the first ever awarded in finance. Markowitzand Sharp’s portfolio selection work is for one-period models. Thanks toRobert Merton and Paul Samuelson, one-period models were replaced bycontinuous-time (Brownian-motion-driven models), and the quadratic util-ity function implicit in mean-variance optimization was replaced by moregeneral increasing, concave utility functions. Model-based mutual fundshave taken a permanent seat at the table of investment opportunities of-fered to the public.

The second revolution in finance is regarding what we are going to ad-dress in this course, the option pricing theory, founded by Fisher Black,Myron Scholes, and Robert Merton in the early 1970s. This leads to anexplosion in the market for derivatives securities. Scholes and Merton wonthe 1997 Nobel Prize in economics. Black had unfortunately died in 1995.

Preliminary knowledge: MA3245 (Financial Mathematics I) or at theleast

• Basic concepts: derivatives, options, futures, forward contracts, hedg-ing, Greeks and so on.

• Elementary stochastic calculus: Brownian motion, Ito integral and Itolemma.

• Derivation of continuous time model (Black-Scholes) and discrete model(Cox-Ross-Rubinstein): delta hedging and no arbitrage.

Even if you know little about the preliminary knowledge, don’t worrytoo much because I am going to give a review in the first two classes.

Course Gradefinal exam (80%), mid-term exam (15%), one assignment with tutorials

(5%).

Contents:

• Preliminary

• Option pricing models for European options (arbitrage pricing canbe moved between “beyong Black-Scholes world” and “interest rate

Page 9: MA4257

CONTENTS iii

derivatives”; also, we don’t speak of the no-short selling;) One weekfor chapter 1 and 2.

• American options and early exercise; two weeks;

• Multi-asset options; one week

• Path-dependent options; three weeks;

• Beyond Black-Scholes world; two weeks

• Interest rate derivatives; two weeks

Our philosophy

• We value these derivative products using partial differential equations(PDEs), or equivalently, using binomial tree methods. Probabilisticapproach will be discussed in MA4265 (Stochastic Analysis in Finan-cial Mathematics).

• Explicit solutions to the PDE models, if available, will be given. Butwe don’t care how to get these solutions.

• Since explicit solutions are so rare that fast accurate numerical meth-ods are essential. We shall mainly focus on the binomial tree methodwhich can be regarded as a discrete model and is easy to implement.Other numerical methods are discussed in MA5247 (ComputationalMethods in Finance).

• Knowledge of Matlab coding is preferable, but not necessary.

Page 10: MA4257

iv CONTENTS

Page 11: MA4257

Chapter 1

Preliminary

1.1 Basic Financial Derivatives: Forward contractsand Options

A derivative is a financial instrument whose value depends on the valuesof other, more basic underlying variables such as stocks, indices, interestrate and so on. Typical examples of derivatives include forward contracts,futures, options, swaps, interest rate derivatives and so on. Futures andstandard options are traded actively on many exchanges. Forward contracts,swaps, many different types of options are regularly traded by financialinstitutions, fund managers, and corporations in the over-the-counter market(OTC market).

In this section we introduce two kinds of basic derivative products: for-ward contracts and options.

1.1.1 Forward Contracts

A forward contract is an agreement between two parties to buy or sell anasset at a certain future time (called the expiry date or maturity) for a cer-tain price (called delivery price). It can be contrasted with a spot contract,which is an agreement to buy or sell an asset today.

One of the parties to the forward contract assumes a long position andagrees to buy the underlying asset at expiry for the delivery price. The otherparty assumes a short position and agrees to sell the asset at expiry for thedelivery price. The payoff from a long position in a forward contract on oneunit of an asset is

ST −K,

1

Page 12: MA4257

2 CHAPTER 1. PRELIMINARY

where K is the delivery price and ST is the spot price of the asset at maturityof the contract. Similarly the payoff from a short position in a forwardcontract is K − ST . Observe that the payoff is linear with ST .

At the time the contract is entered into, it costs nothing to take eithera long or a short position. This means that on the starting date the valueof the forward contract to both sides is zero. A natural question:

how to choose the delivery price such that the value of (1.1)the forward contract is zero when opening the contract?

1.1.2 Options

The simplest financial option, a European call or put option, is a contractthat gives its holder the right to buy or sell the underlying at a certainfuture time (expiry date) for a predetermined price (known as strike price).For the holder of the option, the contract is a right and not an obligation.The other party to the contract, who is known as the writer, does have apotential obligation.

The payoff of a European call option is

(ST −K)+ ,

where K is the strike price and ST is the spot price of the asset at maturityof the option. Similarly, the payoff a European put option is (K − ST )+.Note that the payoff of an option is nonlinear with ST .

Since the option confers on its holder a right without obligation it musthave some value at the time of opening the contract. Conversely, the writerof the option must be compensated for the obligation he has assumed. So,there is a question:

how much would one pay to win the option? (1.2)

1.2 No Arbitrage Principle

One of the fundamental concepts in derivatives pricing is the no-arbitrageprinciple, which can be loosely stated as ‘there is no such thing as a freelunch’. More formally, in financial term, there are never any opportunitiesto make an instantaneous risk-free profit. In fact, such opportunities mayexist in a real market. But, they cannot last for a significant length of timebefore prices move to eliminate them because of the existence of arbitraguerin the market. Throughout this notes, we always admit the no-arbitrage

Page 13: MA4257

1.2. NO ARBITRAGE PRINCIPLE 3

principle whose application will lead to some elegant modeling. In addition,the market is assumed to be frictionless, i.e., no transaction costs and notaxes.

We often make use of two conclusions below derived from the no-arbitrageprinciple:

1) Let Π1(t) and Π2(t) be the value of two portfolios at time t, respec-tively.

If Π1(T ) ≤ Π2(T ) a.s., then Π1(t) ≤ Π2(t) for t < T. (1.3)

Especially,

if Π1(T ) = Π2(T ) a.s., then Π1(t) = Π2(t) for t < T. (1.4)

2) All risk-free portfolios must earn the same return, i.e. riskless interestrate. Suppose Π is the value of a riskfree portfolio, and dΠ is its priceincrement during a small period of time dt. Then

dΠΠ

= rdt, (1.5)

where r is the riskless interest rate.

Remark 1 When applying the no-arbitrage principle (for example, provingthe above two conclusions), the assumption of short-selling is needed. Exceptfor special claim, we suppose that short selling is allowed for any assetsinvolved.

In what follows we attempt to derive the price of a forward contract byusing the no-arbitrage principle.

1.2.1 Pricing Forward Contracts (on traded assets)

Consider a forward contract whose delivery price is K. Let St and V (St, t)be the prices of the underlying asset and the long forward contract at timet. The riskless interest rate r is a constant. In addition, we assume that theunderlying asset has no storage costs and produces no income.

At time t we construct two portfolios:Portfolio A: a long forward contract + cash Ke−r(T−t);Portfolio B: one share of underlying asset: St.At expiry date, both have the value of ST . At time t, portfolio A and B

have the values of V (St, t) + Ke−r(T−t) and St, respectively.We emphasize that the underlying asset discussed here is an investment

asset (stock or gold, for example) for which short selling is allowed. Then

Page 14: MA4257

4 CHAPTER 1. PRELIMINARY

we infer from the no-arbitrage principle that the two must have the samevalue at time t, i.e. (1.4) (otherwise an arbitrage would be caused). So

V (St, t) + Ke−r(T−t) = St

orV (St, t) = St −Ke−r(T−t). (1.6)

Recall that the delivery price is chosen such that at the time when thecontract is opened, the value of the contract to both long and short sides iszero. Let t = 0 be the time of opening the contract. Then we have

S0 −Ke−rT = 0,

namely,K = S0e

rT .

This answers Question 1.1.Exercise: Distinguish between the forward price and the delivery price.

How to determine the forward price of a forward contract?

1.2.2 Pricing foward contracts on a non-traded underlying

Lack of short selling of underlying assets leads to a different pricing model(we always assume that short selling of derivatives is permitted). Let uslook at one example.

If the underlying is not held for investment purposes, we should be carefulwhen using the no-arbitrage principle. For example, assume the underlyingto be a consumption commodity: oil for which short selling is not allowed.As in last section, we construct two portfolios A and B in the same way.Due to no-short selling constraint of the underlying, we cannot short sellportfolio B, but we can still short sell portfolio A (a derivative). We claimthat at time t the value of portfolio A is not greater than that of portfolioB, that is

V (St, t) + Ke−r(T−t) ≤ St. (1.7)

Indeed, suppose that instead of equation (1.7), we have

V (St, t) + Ke−r(T−t) > St. (1.8)

Then one could short sell portfolio A and buy portfolio B at time t. Then thestrategy is certain to lead to a riskless positive profit of er(T−t)(V (St, t) +Ke−r(T−t) − St) at expiry T . Therefore, we conclude from the no-arbitrageprinciple that equation (1.8) cannot hold (for any significant length of time).

Page 15: MA4257

1.3. BROWNIAN MOTION, ITO INTEGRAL AND ITO’S LEMMA 5

If short selling is allowed for the underlying (gold, for example), we areable to similarly deduce that V (St, t) + Ke−r(T−t) < St cannot hold, andthus we are certain to have equation (1.6). However, all we can assert forthe forward contract on a consumption commodity is only equation (1.7),or equivalently,

V (St, t) ≤ St −Ke−r(T−t).

Corresponding, the delivery price K ≤ S0erT .

1.2.3 Properties of Option Prices

Forward contract can be valued by the no-arbitrage principle. Unfortu-nately, because of the nonlinearity of the payoff of options, arbitrage argu-ments are not enough to obtain the price function of options. In fact, moreassumptions are required to value options. We shall discuss this in Chapter2.

No-arbitrage principle can only result in some relationships between op-tion prices and the underlying asset price, including (suppose the underlyingpays no dividend):

(1) CE(St, t) = CA(St, t). In other words, it is never optimal to exercisean American call option on a non-dividend-paying underlying asset beforethe expiration date.

(2) Put-call Parity (European Options):

CE(St, t)− PE(St, t) = St −Ke−r(T−t)

(3) Upper and Lower Bound of Option Prices:

(St −Ke−r(T−t))+ ≤ CE(St, t) = CA(St, t) ≤ St

(Ke−r(T−t)−St)+ ≤ PE(St, t) ≤ Ke−r(T−t), and (Kt−St)+ ≤ PA(St, t) ≤ K

Here CE− European call; PE− European put, CA− American call, PA−American put.

For details, we refer to Hull (2003).

1.3 Brownian Motion, Ito Integral and Ito’s Lemma

In most cases, we assume that the underlying asset price follows an Itoprocess,

dSt = a(St, t)dt + b(St, t)dWt, (1.9)

where a and b are deterministic functions, and Wt is a Brownian motion.

Page 16: MA4257

6 CHAPTER 1. PRELIMINARY

1.3.1 Brownian Motion

Formally, W is a Brownian motion if it has the following two properties:(1) The change ∆W during a small period of time ∆t is a random vari-

able, drawn from a normal distribution with zero mean and variance ∆t,i.e.

∆W = φ√

∆t.

where φ is a random variable drawn from a standardized normal distributionwhich has zero mean, unit variance and a density function given by

1√2π

e−x2

2 , x ∈ (−∞,∞).

(2) The values of ∆W for any two different short intervals of time ∆tare independent.

1.3.2 Ito Process and Ito Integral

Let us go back to (1.9). Thanks to the properties of Brownian motion, weare able to simulate the sample path of St in a given period [0, T ] by thefollowing procedure: Let ∆t = T

N , tn = n∆t, Sn = Stn , n = 0, 1, ..., N,

Sn = Sn−1 + a(Sn−1, tn)∆t + b(Sn−1, tn)ε√

∆t. (1.10)

Here ε should be taken independently for each time interval [tn−1, tn].A precise expression of (1.9) is

St = S0 +∫ t

0a(Sτ , τ)dτ +

∫ t

0b(Sτ , τ)dWτ ,

where the first integral is the Riemann integral, and the second is the Itointegral. For a rigorous definition of Ito integral, see Oksendal (2003).

1.3.3 Ito’s Lemma

Ito’s Lemma is essentially the differential chain rule of a function involvingrandom variable.

First let us recall the standard differential chain rule of a function ofdeterministic variables. Let V (S(t), t) be a function of two variables S andt, where

dS = a(S, t)dt.

Page 17: MA4257

1.3. BROWNIAN MOTION, ITO INTEGRAL AND ITO’S LEMMA 7

Then by Taylor series expansion,

dV (S(t), t) =∂V

∂tdt +

∂V

∂SdS

=∂V

∂tdt +

∂V

∂Sa(S, t)dt

=[∂V

∂t+ a(S, t)

∂V

∂S

]dt

Now let us come back to the stochastic process (1.9). Keep in mind thatdW = φ

√dt and E(dW 2) = dt. So, formally we have

(dSt)2 = (adt + bdW )2

= a2dt2 + 2abdtdW + b2 (dW )2

= b2dt + · · ·.

As a result, when applying the Taylor series expansion to V (St, t), we needto retain the second order term of dS. Thus,

dV (St, t) =∂V

∂tdt +

∂V

∂SdSt +

12

∂2V

∂S2(dSt)

2

=∂V

∂tdt +

∂V

∂SdSt +

12

∂2V

∂S2b2(St, t)dt

=

[∂V

∂t+

12b2(St, t)

∂2V

∂S2

]dt +

∂V

∂SdSt (1.11)

=

[∂V

∂t+

12b2(St, t)

∂2V

∂S2

]dt +

∂V

∂S[a(St, t)dt + b(St, t)dW ]

=

[∂V

∂t+ a(St, t)

∂V

∂S+

12b2(St, t)

∂2V

∂S2

]dt + b(St, t)

∂V

∂SdW.

This is the Ito formula, the chain rule of stochastic calculus. Note that itis not a rigorous proof. We refer interested readers to Oksendal (2003) forrigorous proof of Ito’s formula.

A question: now that dW ≈ O(dt1/2), why don’t we omit the first orderterm of the right hand side in Eq. (1.11)?

Page 18: MA4257

8 CHAPTER 1. PRELIMINARY

Page 19: MA4257

Chapter 2

Option Pricing Models forEuropean Options

2.1 Continuous-time Model: Black-Scholes Model

2.1.1 Black-Scholes Assumptions

We list the assumptions that we make for most of this notes.1. The underlying asset price follows a geometric Brownian motion

dSt

St= µdt + σdWt

where µ and σ are the expected return rate and volatility of the underlyingasset, Wt is the Brownian motion.

2. There are no arbitrage opportunities. The absence of arbitrage op-portunities means that all risk-free portfolios must earn the same return.

3. The underlying asset pay no dividends during the life of the option.4. The risk-free interest rate r and the asset volatility σ are known

constants over the life of the option.5. Trading is done continuously. Short selling is permitted and the assets

are divisible.6. There are no transaction costs associated with hedging a position.

Also no taxes.

2.1.2 Derivation of the Black-Scholes Model

Let V = V (S, t) be the value of an European option. To derive the model,we construct a portfolio of one long option position and a short position in

9

Page 20: MA4257

10CHAPTER 2. OPTION PRICING MODELS FOR EUROPEAN OPTIONS

some quantity ∆, of the underlying.

Π = V −∆S.

The increment of the value of the portfolio in one time-step is

dΠ = dV −∆dSt

= (∂V

∂t+

12σ2S2 ∂2V

∂S2)dt +

∂V

∂SdSt −∆dSt.

To eliminate the risk, we take

∆ =∂V

∂S

and then

dΠ = (∂V

∂t+

12σ2S2 ∂2V

∂S2)dt.

Since there is no random term, the portfolio is riskless. By the no-arbitrageprinciple, a riskless portfolio must earn a risk free return (i.e. 1.5). So, wehave

dΠ = rΠdt = r(V − S∂V

∂S)dt.

From the above two equalities, we obtain an equation

∂V

∂t+

12σ2S2 ∂2V

∂S2+ rS

∂V

∂S− rV = 0. (2.1)

This is the well-known Black-Scholes equation. The solution domain is D ={(S, t) : S > 0, t ∈ [0, T )}. At expiry, we have

V (S, T ) =

{(S −X)+, for call option,(X − S)+, for put option.

(2.2)

There is a unique solution to the model (2.1-2.2):

V (S, t) =

{SN(d1)−Xe−r(T−t)N (d2) for call option

Xe−r(T−t)N(−d2)− SN(−d1) for put option

where

N(x) =1√2π

∫ x

−∞e−

y2

2 dy, d1 =log S

X + (r + σ2

2 )(T − t)σ√

T − t, d2 = d1−σ

√T − t

Remark 2 The Black-Scholes equation is valid for any derivative that pro-vides a payoff depending only on the underlying asset price at one particulartime (European style).

Exercise: Use the Black-Scholes equation to price a long forward con-tract, and digital options (binary options).

Page 21: MA4257

2.1. CONTINUOUS-TIME MODEL: BLACK-SCHOLES MODEL 11

2.1.3 Risk-Neutral Pricing and Theoretical Basis of Monte-Carlo Simulation

The expected return rate µ of the underlying asset, clearly depending on riskpreference, does not appear in the equation. All of the variables appearingin the Black-Scholes equation are independent of risk preference. So, riskreferences do not affect the solution to the Black-Scholes equation. Thismeans that any set of risk preferences can be used when evaluating options(or any other derivatives). In particular, we may carry out the evaluation ina risk-neutral world.

In a risk-neutral world, all investors are risk-neutral, namely, the ex-pected return on all securities is the risk-free rate of interest r. Thus, thepresent value of any cash flow in the world can be obtained by discount-ing its expected value at the risk-free rate. Then the price of an option (aEuropean call, for example) can be represented by

V (S, t) = E[e−r(T−t)(ST −X)+|St = S

]. (2.3)

Here E denotes the expected value in a risk-neutral world under which theunderlying asset price St follows

dSt

St= rdt + σdWt. (2.4)

Note that in this situation the expected return rate of the underlying isriskless rate of interest r (suppose the underlying pays no income).

Mathematically, we can provide a rigorous proof for the equivalence of(2.1-2.2) and (2.3). In fact, this is just a corollary of Feynman-Kac formula.We refer interested readers to Oksendal (2003).

Eq (2.3) is the theoretical basis of Monte-Carlo simulation for derivativepricing. The simulation can be carried out by the following procedure:

(1) Simulate the price movement of the underlying asset in a risk-neutralworld according to (2.4) (see the discrete scheme (1.10));

(2) Calculate the expected terminal payoff of the derivative.(3) Discount the expected payoff at the risk-free interest rate.

Remark 3 It is important to emphasize that risk-neutral valuation (or theassumption that all investors are risk-neutral) is merely an artifical devicefor obtaining solutions to the Black-Scholes equation. The solutions thatare obtained are valid in all worlds, not just those where investors are riskneutral.

Page 22: MA4257

12CHAPTER 2. OPTION PRICING MODELS FOR EUROPEAN OPTIONS

2.2 Discrete-time Model: Cox-Ross-Rubinstein Bi-nomial Model

2.2.1 Single-Period Model

Consider an option whose value, denoted by V0 at current time t = 0,depends on the underlying asset price S0. Let the expiration date of theoption be T. Assume that during the life of the option the underlying assetprice S0 can either move up to S0u with probability p′, or down to S0d withprobability 1− p′ (u > 1 > d, 0 < p′ < 1). Correspondingly, the payoff fromthe option will become either Vu (for up-movement in the underlying assetprice) or Vd (for down-movement).

The following argument is similar to that of continuous time case. Weconstruct a portfolio that consists of a long position in the option and ashort position in ∆ shares. At time t = 0, the portfolio has the value

V −∆S0

If there is an up movement in the underlying asset price, the value of theportfolio at t = T is

Vu −∆S0u.

If there is a down movement in the underlying asset price, the value becomes

Vd −∆S0d.

To make the portfolio riskfree, we let the two be equal, that is,

Vu −∆S0u = Vd −∆S0d

or∆ =

Vu − Vd

S0 (u− d). (2.5)

Again, by the no-arbitrage principle, a risk-free portfolio must earn the risk-free interest rate. As a result

Vu −∆S0u = erT (V −∆S).

Substituting (2.5) into the above formula, we get

V = e−rT [pVu + (1− p)Vd] ,

where

p =erT − d

u− d.

Page 23: MA4257

2.2. DISCRETE-TIME MODEL: COX-ROSS-RUBINSTEIN BINOMIAL MODEL13

This is the single-period binomial model. Here p is called the risk-neutralprobability. Note that the objective probability p′ does not appear in thebinomial model, which is consistent with the risk-neutral pricing principleof the continuous time model.

2.2.2 Multi-Period Model

Let T be expiration date, [0, T ] be the lifetime of the option. If N is thenumber of discrete time points, we have time points n∆t, n = 0, 1, ..., N,with ∆t = T

N . At time t = 0, the underlying asset price is known, denotedby S0. At time ∆t, there are two possible underlying asset prices, S0u andS0d. Without loss of generality, we assume

ud = 1.

At time 2∆t, there are three possible underlying asset prices, S0u2, S0, and

S0d2 = S0u

−2; and so on. In general, at time n∆t, n + 1 underlying assetprices are considered. These are S0u

−n, S0u−n+2, ..., S0u

n. A complete treeis then constructed. Let V n

j be the option price at time point n∆t withunderlying asset price Sj = S0u

j . Note that Sj will jump either up to Sj+1

or down to Sj−1 at time (n+1)∆t, and the value of the option at (n+1)∆twill become either V n+1

j+1 or V n+1j−1 . Since the length of time period is ∆t, the

discounting factor is e−r∆t. Then, similar to the arguments in the single-period case, we have

V nj = e−r∆t

[pV n+1

j+1 + (1− p)Vj−1

], j = −n,−n+2, ..., n, n = 0, 1, ..., N−1

where

p =er∆t − d

u− d.

At expiry,

V Nj =

{ (S0u

j −K)+ for call,

(K − S0uj)+ for put,

j = −N,−N + 2, ..., N.

This is the multi-period binomial model.To make the binomial process of the underlying asset price match the

geometric Brownian motion, we need to choose u, d such that

p′u + (1− p′)d = eµ∆t (2.6)

p′u2 + (1− p′)d2 − e2µ∆t = σ2∆t. (2.7)

Page 24: MA4257

14CHAPTER 2. OPTION PRICING MODELS FOR EUROPEAN OPTIONS

There are three unknowns u, d and p′. Without loss of generality, we addone condition

ud = 1. (2.8)

By neglecting the high order of ∆t, we can solve the system of equations(2.6-2.8) to get

u = eσ√

∆t, d = e−σ√

∆t.

2.3 Consistency of Binomial Model and Continuous-Time Model.

2.3.1 Consistency

The binomial tree method can be rewritten as

V (S, t−∆t) = e−r∆t[pV (Su, t) + (1− p)V (Sd, t)].

Here, for the convenience of presentation, we take the current time to bet−∆t. Assuming sufficient smoothness of the V (S, t), we perform the Taylorseries expansion of the binomial scheme at (S, t) as follows

0 = −V (S, t−∆t) + e−r∆t[pV (Su, t) + (1− p)V (Sd, t)]

= −V (S, t) +∂V

∂t∆t + O(∆t2)

+e−r∆tV (S, t) +∂V

∂SSe−r∆t[p(u− 1) + (1− p)(d− 1)]

+12

∂2V

∂S2S2e−r∆t[p(u− 1)2 + (1− p)(d− 1)2]

+16

∂3V

∂S3S3e−r∆t[p(u− 1)3 + (1− p)(d− 1)3] + O(∆t2)

Observe that

e−r∆t[p(u− 1) + (1− p)(d− 1)] = r∆t + O(∆t2).

e−r∆t[p(u− 1)2 + (1− p)(d− 1)2] = σ2∆t + O(∆t2)

e−r∆t[p(u− 1)3 + (1− p)(d− 1)3] = O(∆t2).

We then get

0 = −V (S, t−∆t) + e−r∆t[pV (Su, t) + (1− p)V (Sd, t)]

= [−rV (S, t) +∂V

∂t+ rS

∂V

∂S+

12σ2S2 ∂2V

∂S2]∆t + O(∆t2)

Page 25: MA4257

2.3. CONSISTENCY OF BINOMIAL MODEL AND CONTINUOUS-TIME MODEL.15

or

−rV (S, t) +∂V

∂t+ rS

∂V

∂S+

12σ2S2 ∂2V

∂S2= O(∆t).

This implies the consistency of two models.

2.3.2 *Equivalence of BTM and an explicit difference scheme

We claim the BTM is equivalent to an explicit difference scheme for thecontinuous-time model.

Using the transformations u(x, t) = V (S, t), S = ex, (2.1-2.2) becomethe following constant-coefficient PDE problem

∂u∂t + σ2

2∂2u∂x2 + (r−σ2

2 )∂u∂x − ru = 0

x ∈ (−∞,∞), t ∈ [0, T )u(T, x) = ϕ(x)+ in (−∞,∞),

(2.9)

where ϕ (x) = ex −X (call option) or ϕ (x) = X − ex (put option).We now present the explicit difference scheme for (2.9). Given mesh

size ∆x,∆t > 0, N∆t = T, let Q = {(j∆x, n∆t) : 0 ≤ n ≤ N, j ∈ Z}stand for the lattice. Un

j represents the value of numerical approximation at(j∆x, n∆t) and ϕj = ϕ (j∆x) . Taking the explicit difference for time andthe conventional difference discretization for space, we have

Un+1j − Un

j

∆t+

σ2

2Un+1

j+1 − 2Un+1j + Un+1

j−1

∆x2+(r− σ2

2)Un+1

j+1 − Un+1j−1

2∆x− rUn

j = 0

or

Unj =

11 + r∆t

((1− σ2∆t

∆x2)Un+1

j +σ2∆t

∆x2(12

+ (r − σ2

2)∆x

2σ2)Un+1

j+1

+σ2∆t

∆x2(12− (r − σ2

2)∆x

2σ2)Un+1

j−1

),

which is denoted by

Unj =

11 + r∆t

[(1− α)Un+1

j + α(aUn+1j+1 + (1− a)Un+1

j−1 )], (2.10)

where

α = σ2 ∆t

∆x2, a =

12

+ (r − σ2

2)∆x

2σ2.

By putting α = 1 in (2.10), namely σ2∆t/∆x2 = 1, we get

Unj =

11 + r∆t

[aUn+1

j+1 + (1− a)Un+1j−1

]. (2.11)

Page 26: MA4257

16CHAPTER 2. OPTION PRICING MODELS FOR EUROPEAN OPTIONS

The final values are given as follows:

UNj = ϕ+

j , j ∈ Z.

Recall the binomial tree method can be described as follows by adoptingthe same lattice:

V nj =

[pV n+1

j+1 + (1− p)V n+1j−1

], j = n, n− 2, · · ·,−n, (2.12)

V Nj = ϕ+

j , j = N, N − 2, · · ·,−N (2.13)

In view ofρ = 1 + r∆t + O

(∆t2

)

and

p =12(1 +

√∆t

σ(r − σ2

2)) + O(∆t3/2),

Recall the binomial tree method can be described as follows by adopting thesame lattice:

V nj =

[pV n+1

j+1 + (1− p)V n+1j−1

], j = n, n− 2, · · ·,−n, (2.14)

V Nj = ϕ+

j , j = N, N − 2, · · ·,−N (2.15)

In view ofρ = 1 + r∆t + O

(∆t2

)

and

p =12(1 +

√∆t

σ(r − σ2

2)) + O(∆t3/2),

we deduce that the binomial tree method is equivalent to explicit differencescheme (2.11) in the sense of neglecting a higher order of ∆t.

Question: what’s a trinomial tree method? what about the relationbetween the trinomial tree method and finite difference schemes?

2.4 Continuous-dividend and Discrete-dividend Pay-ments

2.4.1 Continuous-dividend Payment

Let q be the continuous dividend yield. This means that in a time period dt,the underlying asset pays a dividend qStdt. Following a similar argument as

Page 27: MA4257

2.4. CONTINUOUS-DIVIDEND AND DISCRETE-DIVIDEND PAYMENTS17

in the case no dividend payment, it is not hard to derive the pricing equation.

∂V

∂t+

12σ2S2 ∂2V

∂S2+ (r − q)S

∂V

∂S− rV = 0.

For the binomial model, the risk neutral probability is adjusted as

p =e(r−q)∆t − d

u− d.

We omit the details to readers.

2.4.2 Discrete-dividend Payment

Without loss of generality, suppose that the asset pays dividend just onceduring the lifetime of the option, at time td ∈ (0, T ), with the known divi-dend yield dy. Thus, at time td, the holder of the asset receives a paymentdyS(t−d ), where S(t−d ) is the asset price just before the dividend is paid. Topreclude arbitrage opportunities, the asset price must fall by exactly theamount of the dividend payment,

S(t+d ) = S(t−d )− dyS(t−d ) = S(t−d )(1− dy).

This means that a discrete dividend payment leads to a jump in the valueof the underlying asset across the dividend date.

One important observation for option pricing model is that the valueof the option must be continuous as a function of time across the dividenddate because the holder of the option does not receive the dividend. So, thevalue of the option is the same immediately before the dividend date as itis immediately after the date, that is,

V (S(t−d ), t−d ) = V (S(t+d ), t+d )= V (S(t−d )(1− dy), t+d ).

Let S replace S(t−d ). We then have the so-called jump condition:

V (S, t−d ) = V (S(1− dy), t+d ).

For t 6= td, there is no dividend payment and thus V (S, t) still satisfies theequations without dividend payments. Therefore, we have for Europeanoptions,

∂V

∂t+

12σ2S2 ∂2V

∂S2+ rS

∂V

∂S− rV = 0, S > 0, t ∈ (0, td), t ∈ (td, T )

V (S, t−d ) = V (S(1− dy), t+d )V (S, T ) = ϕ+.

Page 28: MA4257

18CHAPTER 2. OPTION PRICING MODELS FOR EUROPEAN OPTIONS

2.5 No Arbitrage Pricing: A General Framework

We consider the derivatives on a single underlying variable, θ, which follows

θ= µ(θ, t)dt + σ(θ, t)dW.

Here the variable θ need not be the price of an investment asset. For ex-ample, it might be the interest rate, and corresponding derivative productscan be bonds or some interest rate derivatives. In this case the shortingselling for the underlying is not permitted and thus we cannot replicate thederivation process of the Black-Scholes equation where the underlying assetis used to hedge the derivative.

Suppose that f1 and f2 are the prices of two derivatives dependent onlyon θ and t. These could be options or other instruments that provide a payoffequal to some function of θ at some future time. We assume that during thetime period under consideration f1 and f2 provide no income.

Suppose that the processes followed by f1 and f2 are

df1 = a1dt + b1dW

anddf2 = a2dt + b2dW,

where a1, a2, b1 and b2 are functions of θ and t. The W is the same Brownianmotion as in the process of θ, because this is the only source of the uncer-tainty in their prices. To eliminate the uncertainty, we can form a portfolioconsisting of b2 of the first derivative and −b1 of the second derivative. LetΠ be the value of the portfolio,

Π = b2f1 − b1f2.

Then

dΠ = b2df1 − b1df2

= (a1b2 − a2b1)dt.

Because the portfolio is instantaneously riskless, it must earn the risk-freerate. Hence

dΠ = rΠdt = r(b2f1 − b1f2)dt

Therefore,a1b2 − a2b1 = r(b2f1 − b1f2)

Page 29: MA4257

2.5. NO ARBITRAGE PRICING: A GENERAL FRAMEWORK 19

ora1 − rf1

b1=

a2 − rf2

b2

Define λ as the value of each side in the equation, so that

a1 − rf1

b1=

a2 − rf2

b2= λ.

Dropping subscripts, we have shown that if f is the price of a derivativedependent only on θ and t with

df = adt + bdW

thena− rf

b= λ. (2.16)

The parameter λ is known as the market price of risk of θ. It may be depen-dent on both θ and t, but it is not dependent on the nature of any derivativef. At any given time, (a − rf)/b must be the same for all derivatives thatare dependent only on θ and t.

The market price of risk of θ measures the trade-offs between risk andreturn that are made for securities dependent on θ. Eq. (2.16) can be written

a− rf = λb.

For an intuitive understanding of this equation, we note that the variableσ can be loosely interpreted as the quantity of θ-risk present in f. On theright-hand side of the equation we are, therefore, multiplying the quantityof θ-risk by the price of θ-risk. The left-hand side is the expected return inexcess of the risk-free interest rate that is required to compensate for thisrisk. This is analogous to the capital asset pricing model, which relates theexpected excess return on a stock to its risk.

Because f is a function of θ and t, the process followed by f can beexpressed in terms of the process followed by θ using Ito’s lemma. Theparameters µ and σ are given by

a =∂f

∂t+

12σ2θ2 ∂2f

∂θ2+ µθ

∂f

∂θ

b = σθ∂f

∂θ.

Substituting these into equation (2.16), we obtain the following differentialequation that must be satisfied by f

∂f

∂t+

12σ2θ2 ∂2f

∂θ2+ (µ− λσ)θ

∂f

∂θ− rf = 0. (2.17)

Page 30: MA4257

20CHAPTER 2. OPTION PRICING MODELS FOR EUROPEAN OPTIONS

This equation is structurally very similar to the Black-Scholes equation.If the variable θ is the price of a traded asset, then the asset itself can

be regarded as a derivative on θ. Hence we can take f = θ and substituteinto Eq. (2.16) to get

µf − rf = λσf

orµ− r = λσ.

Then the equation becomes precisely the Black-Scholes equation:

∂f

∂t+

12σ2θ2 ∂2f

∂θ2+ rθ

∂f

∂θ− rf = 0.

Remark 4 Eq (2.17) implies that the risk-neutral process of θ is

dθ = (µ− λσ)θdt + σθdW.

Remark 5 Applying Ito’s lemma gives

df =

[∂f

∂t+

12σ2θ2 ∂2f

∂θ2+ µθ

∂f

∂θ

]dt + σθ

∂f

∂θdW

Substituting Eq (2.17) into the above expression, we have

df =[rf + λσθ

∂f

∂θ

]dt + σθ

∂f

∂θdW

= rfdt + σθ∂f

∂θ[λdt + dW ] .

That isdf − rfdt = σθ

∂f

∂θ[λdt + dW ] .

Observe that for every unit of risk, represented by dW, there are λ units ofextra return. That is why we call λ the market price of risk.

2.6 *Option Pricing: Replication

Consider a market where only two basic assets are traded. One is a bond,whose price process is

dP = rPdt.

The other asset is a stock whose price process is governed by the geometricBrownian motion:

dSt = µStdt + σStdWt.

Page 31: MA4257

2.6. *OPTION PRICING: REPLICATION 21

Consider a European call option whose payoff is

(ST −K)+ .

The option pricing problem is: what is the fair price of this option at timet = 0?

Let us make some observation on this problem. The price of the optionat t = T is the amount that the holder of the option would obtain as wellas the amount that the writer would lose at that time. Now, suppose thisoption has a price z at t = 0. The writer has to invest this amount of moneyin some way (called replication) in the market (where there are one bondand one stock available) so that at time t = T , his total wealth, denotedby ZT , resulting for the investment of z, should at least compensate hispotential loss (ST −K)+, namely

ZT ≥ (ST −K)+ . (2.18)

It is clear that for the same investment strategy, the larger the initial endow-ment z, the larger the final wealth ZT . Hence the writer of the option wouldlike to set z large enough so that (2.18) can be guaranteed. On the otherhand, if it happens that for some z the resulting final wealth ZT is strictlylarger than the loss (ST −K)+, then the price z of this option at t = 0 isconsidered to be too high. In this case, the buyer of the option, instead ofbuying the option, would make his own investment to get the desired payoff(ST −K)+. As a result, the fair price for the option at time t = 0 should besuch a z that the corresponding optimal investment would result in a wealthprocess ZT satisfying

ZT = (ST −K)+ .

Now, let us denote by Yt the amount that the writer invests in the stock(i.e., the number of shares Yt

St) . The remaining amount Zt − Yt is invested

in the bond. The wealth process Zt is described by

dZt = [rZt + (µ− r) Yt] dt + σYtdW.

The option price is given by Z0 at time 0.Let us assume

Zt = V (t, St).

Applying Ito lemma,

dV =

(∂V

∂t+

12σ2S2

t

∂2V

∂S2+ µSt

∂V

∂S

)dt + σSt

∂V

∂SdW (t).

Page 32: MA4257

22CHAPTER 2. OPTION PRICING MODELS FOR EUROPEAN OPTIONS

Then we get{

Yt = St∂V∂S

∂V∂t + 1

2σ2S2t

∂2V∂S2 + µSt

∂V∂S = rV + (µ− r) Yt.

In the end, we get

∂V

∂t+

12σ2S2 ∂2V

∂S2+ rS

∂V

∂S− rV = 0.

This is the Black-Scholes equation.Especially

Yt

St=

∂V

∂S∆= ∆.

This means that the strategy for replication is holding ∆ shares of stock.Consequently,

dZt = [rZt + (µ− r)∆St] dt + σ∆StdW.

That is

Zt = Z0 +∫ t

0[rZτ + (µ− r)∆S] dτ +

∫ t

0σ∆SτdWτ

orZt = Z0 +

∫ t

0rZτdτ +

∫ t

0∆(dSτ − rSτdτ) .

Page 33: MA4257

Chapter 3

American options and earlyexercise

American options are contracts that may be exercised early, prior to expiry.These options are contrasted with European options for which exercise isonly permitted at expiry. Most traded stock and futures options are Amer-ican style, while most index options are European.

3.1 Pricing models

3.1.1 Continuous-time model for American options

We now consider the pricing model for American options. Here we take intoaccount a put as an example. Let V = V (S, t) be the option value. Atexpiry, we still have

V (S, T ) = (X − S)+. (3.1)

The early exercise feature gives the constraint

V (S, t) ≥ X − S. (3.2)

As before, we construct a portfolio of one long American option positionand a short position in some quantity ∆, of the underlying.

Π = V −∆S.

With the choice ∆ = ∂V∂S , the value of this portfolio changes by the amount

dΠ = (∂V

∂t+

12σ2S2 ∂2V

∂S2)dt

23

Page 34: MA4257

24 CHAPTER 3. AMERICAN OPTIONS AND EARLY EXERCISE

In the Black-Scholes argument for European options, we set this expressionequal to riskless return, in order to preclude arbitrage. However, when theoption in the portfolio is of American style, all we can say is that we canearn no more than the risk-free rate on our portfolio, that is,

dΠ ≤ rΠdt = r(V − S∂V

∂S)dt.

The reason is the holder of the option controls the early exercise feature.If he/she fails to optimally exercise the option, the change of the portfoliovalue would be less than riskless return. Thus we arrive at an inequality

(∂V

∂t+

12σ2S2 ∂2V

∂S2)dt ≤ r(V − S

∂V

∂S)dt

or∂V

∂t+

12σ2S2 ∂2V

∂S2+ rS

∂V

∂S− rV ≤ 0. (3.3)

Remark 6 For American options, the long/short position is asymmetrical.The holder of an American option is given more rights, as well as moreheadaches: when should he exercise? Whereas the writer of the option cando no more than sit back and enjoy the view. The writer of the Americanoption can make more than the risk-free rate if the holder does not exerciseoptimally. A question: what happens if the portfolio is composed of a longposition in some quantity of the underlying and one short American option?

It is clear that (3.1-3.3) are insufficient to form a model because solutionis not unique. We need to exploit more information. Note that if V (S, t) >X−S, which implies that the option should not be exercised at the moment,then the equality holds in the inequality (3.3), namely

∂V

∂t+

12σ2S2 ∂2V

∂S2+ rS

∂V

∂S− rV = 0 if V > X − S.

If V (S, t) = X − S, of course we still have the inequality, that is

∂V

∂t+

12σ2S2 ∂2V

∂S2+ rS

∂V

∂S− rV ≤ 0 if V = X − S.

The above two formulas imply that at least one holds in equality between(3.2-3.3). So we arrive at a complete model:

∂V

∂t+

12σ2S2 ∂2V

∂S2+ rS

∂V

∂S− rV ≤ 0, V ≥ X − S

[∂V

∂t+

12σ2S2 ∂2V

∂S2+ rS

∂V

∂S− rV

][V − (X − S)] = 0, (S, t) ∈ D

V (S, T ) = (X − S)+

Page 35: MA4257

3.1. PRICING MODELS 25

It can be shown that there exists a unique solution to the model.A succinct expression of the above model is

min

{−∂V

∂t− 1

2σ2S2 ∂2V

∂S2− rS

∂V

∂S+ rV, V − (X − S)

}= 0, (S, t) ∈ D

V (S, T ) = (X − S)+

For American call options, we similarly have

min

{−∂V

∂t− 1

2σ2S2 ∂2V

∂S2− rS

∂V

∂S+ rV, V − (S −X)

}= 0, (S, t) ∈ D

V (S, T ) = (S −X)+

We claim the price function of European call option C(S, t) just satisfies theabove model. Indeed, C(S, t) > S−X for t < T and C(S, t) clearly satisfiesthe Black-Scholes equation. So C(S, t) must be the (unique) solution to theAmerican option pricing model. The result C(S, t) > S − X implies thatthe option should never be exercised before expiry.

Remark 7 From the view point of probabilistic approach, we have (for anAmerican put)

V (S, t) = maxt′

E[e−r(t′−t)(X − St′)+|St = S

], (3.4)

where t′ is a stopping time. Intuitively t′(.) can be thought of as a strategy toexercise the option. The option’s value corresponds to the optimal exercisestrategy. Mathematically we can show the equivalence between (3.4) andthe above PDE model.

3.1.2 Continuous-dividend payment case

Let q be the continuous dividend yield. Denote by

ϕ(S) =

{S −XX − S.

Then the American option price function V satisfies

min

{−∂V

∂t− 1

2σ2S2 ∂2V

∂S2− (r − q)S

∂V

∂S+ rV, V − ϕ

}= 0, (S, t) ∈ D

V (S, T ) = ϕ+

Page 36: MA4257

26 CHAPTER 3. AMERICAN OPTIONS AND EARLY EXERCISE

3.1.3 Binomial model

Let T be the expiration date, [0, N ] be the lifetime of the option. If N is thenumber of discrete time points, we have time points n∆t, n = 0, 1, ..., N, with∆t = T/N. Let V n

j be the option price at time point n∆t with underlyingasset price Sj . Suppose the underlying asset price Sj will move either up toSj+1 = Sju or down to Sj−1 = Sjd after the next timestep. Similar to thearguments in the continuous time case, we are able to derive the binomialtree method (BTM)

V nj = max

{1ρ [pV n+1

j+1 + (1− p)V n+1j−1 ], ϕj

},

for j = −n,−n + 2, ..., n and n = 0, 1, ...N − 1V N

j = ϕ+j , for j = −N,−N + 2, ..., N

where ϕj =

{S0u

j −XX − S0u

j

p =e(r−q)∆t − d

u− d,

ρ = er∆t, u = eσ√

∆t, and d = e−σ√

∆t.

A question: what about the relation between continuous and discretemodels for American options?

3.2 Free boundary problems

We still take a put for example. First we give definitions of Stopping RegionE (or, Exercise Region) and Holding Region H (or, Continuous Region) :

E = {(S, t) ∈ D : V (S, t) = X − S}H = D\E = {(S, t) ∈ D : V (S, t) > X − S}

3.2.1 *Optimal exercise boundaries

Lemma 1 If (S1, t) ∈ E, then (S2, t) ∈ E for all S2 ≤ S1.

Proof: It suffices to show that

V (S2, t) + S2 ≤ V (S1, t) + S1, if S2 ≤ S1 (3.5)

Page 37: MA4257

3.2. FREE BOUNDARY PROBLEMS 27

Indeed, (3.5) is equivalent to

V (S2, t)− (X − S2) ≤ V (S1, t)− (X − S1)

Since V (S1, t) − (X − S1) = 0 and V (S2, t) − (X − S2) ≥ 0, we deriveV (S2, t) = X − S2, which implies (S2, t) ∈ E. (3.5) can be proved in termsof the binomial model. We omit the details..

Remark 8 (3.5) can be rewritten as

V (S1, t)− V (S2, t)S1 − S2

≥ −1.

As S1 tends S2, we have∂V

∂S≥ −1.

Proposition 2 (i)There exists a boundary S∗(t), called the optimal exerciseboundary hereafter, such that

E = {(S, t) ∈ D : S ≤ S∗(t)}, and H = {(S, t) ∈ D : S > S∗(t)}

(ii) S∗(t) is monotonically increasing.(iii) S∗(T−) = min(X, r

qX)

Proof: Part i) can be derived from Lemma 1. To show part ii), it isnot hard to prove V (S, t) is monotonically decreasing w.r.t. t by using thebinomial model (financial intuition: the larger the time to expiry, the largerthe option value). Thus if V (S, t1) > X − S, then V (S, t2) ≥ V (S, t1) >X − S. A complete proof of part iii) requires some knowledge about PDEtheory, so we skip it. Numerical experiments can show its validity.

Remark 9 S∗(t) is called optimal exercise boundary because it is optimalto exercise the option exactly on the boundary. If S < S∗(t), then V (S, t) =X − S and

∂V

∂t+

12σ2S2 ∂2V

∂S2+ (r − q)S

∂V

∂S− rV < 0

ordΠ < rΠdt.

Page 38: MA4257

28 CHAPTER 3. AMERICAN OPTIONS AND EARLY EXERCISE

3.2.2 Formulation as a free boundary problem

In the continuation region H = {S > S∗(t)}, the price function of an Amer-ican put satisfies the Black-Scholes equation:

∂V

∂t+

12σ2S2 ∂2V

∂S2+ (r − q)S

∂V

∂S− rV = 0, for S > S∗(t), t ∈ [0, T ) (3.6)

On S = S∗(t) we have

V (S∗(t), t) = X − S∗(t). (3.7)

The finial condition isV (S, T ) = (X − S)+. (3.8)

However, (3.6-3.8) cannot form a complete model because S∗(t) is not knowna prior as a function of time. As a matter of fact, S∗(t) and V (S, t) must besolved simultaneously. Therefore, we need an additional boundary condition

∂V

∂S(S∗(t), t) = −1. (3.9)

This condition means that the hedging ratio ∆ is continuous across theoptimal exercise boundary. (3.6-3.9) form a complete model that is calledthe free boundary problem in PDE theory.

3.2.3 Perpetual American options

Pricing perpetual American options can give us some insights in the un-derstanding of free boundary problems. A perpetual American put can beexercised for a put payoff at any time. There is no expiry; that is why it iscalled a perpetual option. Note that the price function of such a option isindependent of time, denoted by P∞(S). It only depends on the level of theunderlying. Actually P∞(S) can be regarded as the limit of an Americanput price as the time to expiry tends to infinity, i.e.

P∞(S) = lim(T−t)→∞

V (S, t;T ) = limτ→∞ V (S, τ).

where V (S, τ) = V (S, t;T ), τ = T − t. Thanks to (3.6-3.9), V (S, τ) satisfies

∂V

∂τ− 1

2σ2S2 ∂2V

∂S2− (r − q)S

∂V

∂S+ rV = 0, for S > S∗(τ), τ ∈ [0, T )

V (S∗(τ), τ) = X − S∗(τ),∂V

∂S(S∗(τ), τ) = −1

V (S, 0) = (X − S)+

Page 39: MA4257

3.2. FREE BOUNDARY PROBLEMS 29

where S∗(τ) = S∗(T−τ). Due to part ii) of Proposition 2, S∗(τ) is monotone,and we can denote

S∞ = limτ→∞S∗(∞).

Then P∞(S) satisfies

−12σ2S2 ∂2P∞

∂S2− (r − q)S

∂P∞∂S

+ rP∞ = 0, for S > S∞ (3.10)

P∞(S∞) = X − S∞,∂P∞∂S

(S∞) = −1. (3.11)

This is actually a free boundary problem with an ordinary difference equa-tion. We now seek a solution of the form Sα to Eq. (3.11), where α satisfies

−12σ2α(α− 1)− (r − q)α + r = 0

or12σ2α2 + (r − q − σ2

2)α− r = 0.

The two solutions of the above equation are

α± =−(r − q − σ2

2 )±√

(r − q − σ2

2 )2 + 2rσ2

σ2. (3.12)

So the general solution of Eq. (3.11) is

ASα+ + BSα−

where A and B are arbitrary constants.Clearly, for the perpetual American put the coefficient A must be zero;

as S →∞ then value of the option must tend to zero. What about B? Herewe need to take advantage of the condition (3.11)

BSα−∞ = X − S∞α−BSα−−1

∞ = −1

to get

S∞ =α−

α− − 1X

B =X − S∞

Sα−∞

.

Page 40: MA4257

30 CHAPTER 3. AMERICAN OPTIONS AND EARLY EXERCISE

So, we obtain the perpetual American option price function

P∞(S) = (X − S∞)(

S

S∞

)α−

A by-product of the above calculation is

Proposition 3

limτ→∞S∗(τ) =

α−α− − 1

X

3.2.4 Put-call symmetry relations

For European options, we have the well-known put-call parity

CE(S, t)− PE(S, t) = Se−q(T−t) −Xe−r(T−t).

However, such a parity doesn’t hold for American options. Let us explainthe reason from the view point of PDE. For European options, both CE(S, t)and PE(S, t) satisfy the Black-Scholes equations. Due to the linearity of theequation, CE(S, t)− PE(S, t) also satisfies the equation, that is

[∂

∂t+

12σ2S2 ∂2

∂S2+ (r − q)S

∂S− r

](CE − PE) = 0,

together with the terminal condition

(CE − PE) (S, T ) = S −X.

On the other hand, the Black-Scholes equation with the terminal conditionhas the unique solution Se−q(T−t) −Xe−r(T−t). This leads to

CE(S, t)− PE(S, t) = Se−q(T−t) −Xe−r(T−t).

However, for American options, their governing equation is nonlinear so thatwe cannot obtain such a parity relation.

But, the following put-call symmetry relation holds for both Europeanand American options:

C(S, t;X, r, q) = P (X, t;S, q, r) (3.13)

where the underlying price and the strike price in the put formula becomethe strike price and the underlying price in the call formula, respectively.Also, the roles of q and r are interchanged, like S and X.

Page 41: MA4257

3.2. FREE BOUNDARY PROBLEMS 31

The financial explanation is given as follows. We may consider a calloption with the payoff (S −X)+ as providing the right to exchange oneasset X with dividend yield r for another asset S with dividend yield q.Similarly, a put call with payoff (X2−S2)+ is regarded as a right to exchangeone asset S2 with dividend yield q2 for another asset X2 with dividend yieldr2. Suppose

X2 = S, S2 = X, r2 = q and q2 = r.

then the two have the same payoff and thus the same price, namely,

C(S, t;X, r, q) = P (S2, t;X2, r2, q2)= P (X, t;S, q, r).

Next, we would like to establish the put-call symmetry relation for theoptimal exercise prices for American put and call options. Let S∗P (t; r, q) andS∗c (t; r, q) denote the optimal exercise boundaries for American put and calloptions on a continuous dividend payment stock, respectively. We assert

S∗c (t; r, q)S∗p(t; q, r) = X2 (3.14)

Indeed, due to the homogeneity, Eq. (3.13) can be rewritten as

C(S, t;X, r, q) =S

XP (

X2

S, t;X, q, r).

According to the definition of S∗c (t; r, q), we have

S

XP (

X2

S, t;X, q, r) = C(S, t;X, r, q) = S −X, for S ≥ S∗c (t; r, q)

S

XP (

X2

S, t;X, q, r) = C(S, t;X, r, q) > S −X, for S < S∗c (t; r, q).

or, equivalently

P (X2

S, t;X, q, r) = X − X2

S, for S ≥ S∗c (t; r, q)

P (X2

S, t;X, q, r) > X − X2

S, for S < S∗c (t; r, q).

By denoting S1 = X2

S , we then get

P (S1, t;X, q, r) = X − S1, for S1 ≤ X2

S∗c (t; r, q)

P (S1, t;X, q, r) > X − S1, for S1 >X2

S∗c (t; r, q)

Page 42: MA4257

32 CHAPTER 3. AMERICAN OPTIONS AND EARLY EXERCISE

which implies X2

S∗c (t;r,q) is the optimal exercise boundary for American putoption with interest rate q and dividend yield r, namely

S∗p(t; q, r) =X2

S∗c (t; r, q).

Applying Eqs (3.13-3.14) and Proposition (2) leads to

Proposition 4 Let S∗c (t; r, q) be the optimal exercise boundary for Ameri-can call options. Then

(i) S∗c (t) is monotonically decreasing.(ii) S∗c (T−; r, q) = max(X, r

qX)

Remark 10 When q = 0, then S∗c (T−; r, 0) = ∞. Because of the mono-tonicity of S∗c (t), we deduce S∗c (t; r, 0) = ∞ for all t, which means there isno optimal exercise boundary, that is, early exercise should never happen.

In addition, it is not hard to get the closed form price function of theperpetual American option by using the put-call symmetry relation.

Proposition 5 Suppose q > 0. Let C∞(S) be the price function of the per-petual American option. Then

C∞(S) = (S∞,c −X)

(S

S∞,c

)α+

where α is given by (3.12), and

S∞,c =α+

α+ − 1X.

3.3 Bermudan options

In a standard American option, exercise can take place at any time duringthe life of the option and the exercise price is always the same. In practice,the American options that are traded in the over-the-counter market do notalways have these standard features.

One type of nonstandard American option is known as a Bermudanoption. In this early exercise is restricted to certain dates t1 < t2 < ... < tnduring the life of the option (ti ∈ [0, T ), i = 1, 2, ..., n). Note that at any

Page 43: MA4257

3.3. BERMUDAN OPTIONS 33

interval (ti, ti+1), i = 0, 1, ..., n (let t0 = 0, tn+1 = T ), there is no earlyexercise right. So

∂V

∂t+

12σ2S2 ∂2V

∂S2+ (r − q)S

∂V

∂S− rV = 0, for t ∈ (ti, ti+1), i = 0, 1, ..., n

At t = ti, due to the early exercise feature, one has

V (S, t−i ) = max(V (S, t+i ), ϕ(S)), for i = 1, 2, ...n

At expiry,V (S, T ) = ϕ+.

These form a complete model. Also, it is easy to implement by the binomialtree method.

A question: if there is no dividend payment during the life of the option,whether is there a chance to exercise a Bermudan call option prior to theexpiry?

Page 44: MA4257

34 CHAPTER 3. AMERICAN OPTIONS AND EARLY EXERCISE

Page 45: MA4257

Chapter 4

Multi-asset options

In this chapter we introduce the idea of higher dimensionality by describingthe Black-Scholes theory for options on more than one underlying asset.This theory is perfectly straightforward; the only new idea is that correlatedrandom walks and the corresponding multifactor version of Ito Lemma.

4.1 Pricing model

4.1.1 Two-asset options

Consider a European option whose payoff, denoted by f(S1, S2), depends ontwo assets S1 and S2. The basic building block for option pricing with oneunderlying is the lognormal random walk

dS

S= µdt + σdW.

This is readily extended to a world containing two assets via models for eachunderlying

dS1

S1= µ1dt + σ1dW1

dS2

S2= µ2dt + σ2dW2

As before, we can think of dWi, i = 1, 2 as a random number drawn from aNormal distribution with mean zero and standard deviation dt1/2 so that

E(dWi) = 0 and E[dW 2i ] = dt

35

Page 46: MA4257

36 CHAPTER 4. MULTI-ASSET OPTIONS

but the random numbers dW1 and dW2 are correlated:

E[dW1dW2] = ρdt

Here ρ is the correlation coefficient between the two random walks..Let V (S1, S2, t) be the option value. Since there are two sources of

uncertainty, we construct a portfolio of one long option position, two shortpositions in some quantities of underlying assets:

Π = V −∆1S1 −∆2S2.

Consider the increment

dΠ = dV −∆1dS2 −∆2dS2

Here we need the Ito Lemma involving two variables.

dV =

[∂V

∂t+

12σ2

1S21

∂2V

∂S21

+ ρσ1σ2S1S2∂2V

∂S1∂S2+

12σ2

2S22

∂2V

∂S22

]dt

+∂V

∂S1dS1 +

∂V

∂S2dS2

Actually the two dimensional Ito Lemma can be derived by using Taylorseries and the rules of thumb: dW 2

i = dt, i = 1, 2, and dW1dW2 = ρdt.Taking ∆1 = ∂V

∂S1and ∆2 = ∂V

∂S2to eliminate risk, we then have

dΠ =

[∂V

∂t+

12σ2

1S21

∂2V

∂S21

+ ρσ1σ2S1S2∂2V

∂S1∂S2+

12σ2

2S22

∂2V

∂S22

]dt.

Then the portfolio is riskless and then earn riskless return, namely

dΠ = rΠ = r(V − ∂V

∂S1S1 − ∂V

∂S2S2)dt.

So we arrive at an equation

∂V

∂t+

12σ2

1S21

∂2V

∂S21

+ρσ1σ2S1S2∂2V

∂S1∂S2+

12σ2

2S22

∂2V

∂S22

+rS1∂V

∂S1+rS2

∂V

∂S2−rV = 0.

(4.1)The solution domain is {S1 > 0, S2 > 0, t ∈ [0, T )} . The final condition is

V (S1, S2, T ) = f(S1, S2) (4.2)

(4.1-4.2) form a complete model. Well-known payoffs are the following:

Page 47: MA4257

4.1. PRICING MODEL 37

f(S1, S2) =

(max(S1, S2)−X)+ , maximum call(X −max(S1, S2))

+ , maximum put(min(S1, S2)−X)+ , minimum call(X −min(S1, S2))

+ , minimum put(S1 − S2 −X)+, spread option

(4.3)

If the assets pay continuous dividends, then (4.1) is replaced by

∂V

∂t+

12σ2

1S21

∂2V

∂S21

+ρσ1σ2S1S2∂2V

∂S1∂S2+

12σ2

2S22

∂2V

∂S22

+(r−q1)S1∂V

∂S1+(r−q2)S2

∂V

∂S2−rV = 0.

(4.4)where q1 and q2 are dividend yields of two assets, respectively.

4.1.2 American feature:

Suppose that the option can be exercised early receiving the payoff. Thenthe pricing model is

min {−LV, V − f(S1, S2)} = 0

V (S, T ) = f(S1, S2)

where L = ∂∂t + 1

2σ21S

21

∂2

∂S21

+ρσ1σ2S1S2∂2

∂S1∂S2+ 1

2σ22S

22

∂2

∂S22

+(r− q1)S1∂

∂S1+

(r − q2)S2∂

∂S2− r

4.1.3 Exchange option: similarity reduction

An exchange option gives the holder the right to exchange one asset foranother. The payoff for this contract at expiry is (S1 − S2)+. So the finalcondition is

V (S1, S2, T ) = (S1 − S2)+.

The governing equation is still (4.1).This contract is special in that there is a similarity reduction. Let us

postulate that the solution takes the form

V (S1, S2, t) = S2H(ξ, t),

where the new variable is

ξ =S1

S2.

Page 48: MA4257

38 CHAPTER 4. MULTI-ASSET OPTIONS

If this is the case, then instead of finding a function V of three variables, weonly need find a function H of two variables, a much easier task.

It follows

∂V

∂S1= S2

∂H

∂ξ

1S2

=∂H

∂ξ,

∂2V

∂S21

=∂2H

∂ξ2

1S2

,

∂V

∂S2= H + S2

∂H

∂ξ

(−S1

S22

)= H − ξ

∂H

∂ξ,

∂2V

∂S22

=∂H

∂ξ

(−S1

S22

)− S1

S22

∂H

∂ξ− ξ

∂2H

∂ξ2

(−S1

S22

)=

1S2

ξ2 ∂2H

∂ξ2

∂2V

∂S1∂S2= −S1

S22

∂2H

∂ξ2,

∂V

∂t= S2

∂H

∂t

The partial differential equation now becomes

∂H

∂t+

[12σ2

1ξ2 − ρσ1σ2ξ

2 +12σ2

2ξ2]

∂2H

∂ξ2+(r − q1) ξ

∂H

∂ξ+(r − q2)

(H − ξ

∂H

∂ξ

)−rH = 0

or∂H

∂t+

12σ′2ξ2 ∂2H

∂ξ2+ (q2 − q1)ξ

∂H

∂ξ− q2H = 0,

where σ′ =√

σ21 − 2ρσ1σ2 + σ2

2. This equation is just the Black-Scholesequation for a single stock with q2 in place of r, q1 in place of the divi-dend yield on the single stock and with a volatility of σ′. Note that the finalcondition is

H(ξ, T ) = (ξ − 1)+

From this it follows that

V (S1, S2, t) = S1e−q1(T−t)N(d′1)− S2e

−q2(T−t)N(d′2).

where

d′1 =log(S1/S2) + (q2 − q1 + 1

2σ′2)(T − t)σ′√

T − t, and d

′2 = d

′1 − σ′

√T − t

Remark 11 An exchange option is a kind of spread option with X = 0. IfX 6= 0, the similarity reduction doesn’t work because the payoff cannot bereduced to a function of ξ and t.

Page 49: MA4257

4.1. PRICING MODEL 39

4.1.4 Options on many underlyings

Options with many underlyings are called basket options, options on basketsor rainbow options. We now extend the two-asset option pricing model toa general case. Suppose

dSi = µiSidt + σiSidWi.

Here Si is the price of the ith asset, i = 1, 2, ..., n, and µi and σi are thedrift and volatility of that asset respectively and dWi is the increment of aBrownian motion. We can still continue to think of dWi as a random numberdrawn from a Normal distribution with mean zero and standard deviationdt1/2 so that

E(dWi) = 0 and E(dX2i ) = dt

and the random numbers dWi and dWj are correlated:

E[dWidWj ] = ρijdt,

here ρij is the correlation coefficient between the ith and jth random walks.The symmetric matrix with ρij as the entry in the ith row and jth columnis called the correlation matrix. For example, if we have seven underlyingsn = 4 and the correlation matrix will look like this:

D =

1 ρ12 ρ13 ρ14

ρ21 1 ρ23 ρ24

ρ31 ρ32 1 ρ34

ρ41 ρ42 ρ43 1

Note that ρii = 1 and ρij = ρji. The correlation matrix is positive definite,so that yT Dy ≥ 0.

To be able to manipulate functions of many random variables we needa multidimensional version of Ito’s lemma. If we have a function of thevariables S1, S2, ..., Sn and t, V (S1, S2, ..., Sn, t), then

dV =

∂V

∂t+

12

n∑

i=1

n∑

j=1

σiσjρijSiSj∂2V

∂Si∂Sj

dt +

n∑

i=1

∂V

∂SidSi.

We can get to this same result by using Taylor series and the rules of thumb:

dW 2i = dt and dWidWj = ρijdt.

The pricing model for basket options is straightforward. Still set up aportfolio consisting of one basket option, and short a number ∆i of each of

Page 50: MA4257

40 CHAPTER 4. MULTI-ASSET OPTIONS

the asset Si, employ the multidimensional Ito’s Lemma, take ∆i = ∂V∂Si

toeliminate the risk, and set the return of the portfolio equal to the risk-freerate. We are able to arrive at

∂V

∂t+

12

n∑

i=1

n∑

j=1

σiσjρijSiSj∂2V

∂Si∂Sj+

n∑

i=1

(r − qi)Si∂V

∂Si− rV = 0.

Here qi is the dividend yield on the ith asset. The final condition is

V (S1, S2, ..., Sn, t) = f(S1, S2, ..., Sn)

The analytic solution to the above model is available, but involves multiplyintegral, as in the case of two-asset options. (See Page 154, Wilmott (1998))

4.2 Quantos

There is one special, and very important type of multi-asset option. This isthe cross-currency contract called a quanto. The quanto has a payoff definedwith respect to an asset or an index (or an interest rate) in one country, butthen the payoff is converted to another currency payment. The general formof its payoff can be expressed as

f (S$, S) .

Here S$ is the exchange rate between the domestic currency and theforeign currency (for example, dollar-yen rate, number of dollars per yen),and S is the level of some foreign asset (for example, the Nikkei Dow index).Note that the quanto contract is measured in domestic currency, but S is inforeign currency. So this contract is exposed to the exchange rate and theasset.We assume

dS$ = µ$S$dt + σ$S$dW$ and dS = µSdt + σSdW

with a correlation coefficient ρ between them.Let V (S$, S, t) be the quanto option value in US dollar. Construct a

portfolio consisting of the quanto, hedged with the foreign currency and theasset:

Π = V (S$, S, t)−∆$S$ −∆SS$.

Note that every term in this equation is measured in domestic currency(dollar). ∆$ is the number of the foreign currency (yen) we hold short, so−∆$S$ is the dollar value of that yen. Similarly, with the term −∆SS$ we

Page 51: MA4257

4.2. QUANTOS 41

have converted the yen-denominated index S into dollars, ∆ is the amountof the index held short.

The change in the value of the portfolio is due to the change in the valueof its components and the interest received on the yen:

dV =

[∂V

∂t+

12σ2

$S2$

∂2V

∂S2$

+ ρσ$σS$S∂2V

∂S$∂S+

12σ2S2 ∂2V

∂S2

]dt

+∂V

∂S$dS$ +

∂V

∂SdS

−∆$dS$ −∆$S$rfdt

−∆S$dS −∆SdS$ − ρσ$σ∆SS$dt

=

[∂V

∂t+

12σ2

$S2$

∂2V

∂S2$

+ ρσ$σS$S∂2V

∂S$∂S+

12σ2S2 ∂2V

∂S2− ρσ$σ∆SS$ −∆$S$rf

]dt

+(

∂V

∂S$−∆$ −∆S

)dS$ +

(∂V

∂S−∆S$

)dS

where the term −∆$S$rfdt is the interest received by the yen holding, and−ρσ$σ∆SS$dt is due to the increment of the product −∆SS$. We nowchoose

∆ =1S$

∂V

∂Sand ∆$ =

∂V

∂S$− S

S$

∂V

∂S

to eliminate the risk in the portfolio. Setting the return on this risklessportfolio equal to the US risk-free rate of interest r, since Π is measuredentirely in dollars, yields

∂V

∂t+

12σ2

$S2$

∂2V

∂S2$

+ρσ$σS$S∂2V

∂S$∂S+

12σ2S2 ∂2V

∂S2dt +(r−rf )S$

∂V

∂S$+(rf−ρσ$σ)S

∂V

∂S−rV = 0.

(4.5)This completes the formulation of the pricing equation. The equation isvalid for any contract with underlying measured in one currency but paidin another. The final conditions on t = T :

V (S$, S, T ) = f(S$, S).

Notice that these parameters correspond to two-asset options with con-tinuous dividend payments (i.e. Eqn (4.4)), where under the risk-neutralworld, the underlying assets follow

dS1

S1= (r − q1)dt + σ1dW1

Page 52: MA4257

42 CHAPTER 4. MULTI-ASSET OPTIONS

dS2

S2= (r − q2)dt + σ2dW2

with ρdt = E(dW1dW2). For quanto options (i.e. Eqn (4.5)), the underlyingsfollow in the risk-neutral world

dS$

S$= (r − rf )dt + σ$dW$

dS

S= (rf − ρσ$σ)dt + σdW

= (r − (r − rf + ρσ$σ))dt + σdW.

with ρdt = E(dW$W ). Therefore, in this case, q1 = rf and q2 = r−ff +ρσ$σ.

4.3 Numerical Methods

4.3.1 *Binomial tree methods

Suppose (S1, S2) will move to (S1u1, S2u2) with probability p1, (S1u1, S2d2)with probability p2, (S1d1, S2u2) with probability p3,and (S1d1, S2d2) withprobability p4 after the next timestep. Then the binomial model for two-asset options is

V (S1, S2, t) = e−r∆t[p1V (S1u1, S2u2, t + ∆t)+p2V (S1u1, S2d2, t + ∆t)+p3V (S1d1, S2d2, t + ∆t)+p4V (S1d1, S2u2, t + ∆t)]

where pi for i = 1, 2, 3, 4, ui, di for i = 1, 2 are chosen to be consistent withthe continuous-time model. One choice for these parameters is given asfollows

ui = eσi

√∆t, di =

1ui

for i = 1, 2.

p1 =14

1 +

r − q1 − σ2

12

σ1+

r − q2 − σ222

σ2

∆t + ρ

p2 =14

1 +

r − q1 − σ2

12

σ1− r − q2 − σ2

22

σ2

∆t− ρ

p3 =14

1 +

−r − q1 − σ2

12

σ1− r − q2 − σ2

22

σ2

∆t + ρ

Page 53: MA4257

4.3. NUMERICAL METHODS 43

p4 =14

1 +

−r − q1 − σ2

12

σ1+

r − q2 − σ222

σ2

∆t− ρ

.

We refer interested students to Kwok (1998) [pp 207-208] for derivation ofthe above parameters. It should be pointed out that we can also use thefinite difference method to determine these parameters.

4.3.2 Monte-Carlo simulation

The amount of computation of BTM grows exponentially with the numberof underlyings. We will have to give up BTM if the number of underly-ings is greater than 3, and instead employ Monte-Carlo simulation which isrelatively more efficient as the number of underlyings increases.

Monte-Carlo simulation is based on the risk-neutral valuation result.The expected payoff in a risk-neutral world is calculated using a samplingprocedure. It is then discounted at the risk-free interest rate.

Suppose in a risk-neutral world

dSi = µiSidt + σiSidWi, (1 ≤ i ≤ n)

As in the single-variable case, the life of the derivative must be divided intoN subintervals of length ∆t. The discrete version of the process for Si isthen

Si(t + ∆t)− Si(t) = µiSi∆t + σiSiεi

√∆t, (4.6)

where εi is a random sample from a standard normal distribution. Thecoefficient of correlation between εi and εj is ρij for 1 ≤ i, j ≤ n. Onesimulation trial involves obtaining N samples of the εi (1 ≤ i ≤ n) from amultivariate standardized normal distribution. These are substituted intoequation (4.6) to produce simulated paths for each Si and enable a samplevalue for the derivative to be calculated.

Note that correlated samples εi (1 ≤ i ≤ n) from standard normal dis-tributions are required. We only give a procedure for n = 2. For n ≥ 3,we refer interested readers to Appendix. Independent samples x1 and x2

from a univariate standardized normal distribution are easily obtained. Therequired samples ε1 and ε2 are then calculated as follows:

ε1 = x1

ε2 = ρx1 +√

1− ρ2x2

where ρ is the coefficient of correlation.

Page 54: MA4257

44 CHAPTER 4. MULTI-ASSET OPTIONS

Remark 12 (1) At each time step, we need to find εi (1 ≤ i ≤ 2).(2) The number of simulation trials M carried out depends on the accu-

racy required. In general, we take M = 5000 or 10000.

Remark 13 The amount of computation of Monto-Carlo simulation growsonly linearly with the number of underlyings. The main drawback of MonteCarlo simulation is that it cannot easily handle situations where there areearly exercise opportunities.

4.4 *Suggestions for further reading and Appendix

4.4.1 Further reading

Most materials in section 1 is extracted from Wilmott (1998) (Chapter 11)where a general integral representation can be found for multi-asset options.

Section 2 is from Hull (2000 4th) (pp 406-409). We refer interestedreaders to the book for some skills improving the accuracy of the Monte-Carlo simulation (pp 411-414).

Kwok (1998) presented more examples of quanto options as well as theirpricing formulas (pp 115-118).

The pricing formulas for the European options on the maximum or min-imum of n risky assets can also be found in Kwok (1998) (P. 123), where then-variate normal distribution function is involved. Especially, we present theformula for the European call option on the minimum of two assets (payoff:(min(S1, S2)−X)+).

Cmin(S1, S2, t) = S1N2

log S1

X + (r + σ212 )(T − t)

σ1

√T − t

,log S2

S1− σ2

122 (T − t)

σ12

√T − t

;ρσ2 − σ1

σ12

+S2N2

log S2

X + (r + σ222 )(T − t)

σ2

√T − t

,log S1

S2− σ2

122 (T − t)

σ12

√T − t

;ρσ1 − σ2

σ12

−Xe−r(T−t)N2

log S1

X + (r − σ212 )(T − t)

σ1

√T − t

,log S2

X + (r − σ222 )(T − t)

σ2

√T − t

; ρ

.

where σ =√

σ21 + σ2

2 − 2ρσ1σ2 and N2(a, b; ρ) is the standardized bivariatenormal distribution function, i.e.

N2(a, b; ρ) =12π

1√1− ρ2

∫ a

−∞

∫ b

−∞exp

(−x2 − 2ρxy + y2

2(1− ρ2)

)dxdy.

Page 55: MA4257

4.4. *SUGGESTIONS FOR FURTHER READING AND APPENDIX 45

A numerical procedure for N2(a, b; ρ) can be found at Appendix 11C, page272, Hull (2000, 4th)

4.4.2 Appendix:

Consider the situation where we require n correlated samples from normaldistributions where the coefficient of correlation between sample i and sam-ple j is ρij . We first sample n independent variables xi (1 ≤ i ≤ n), fromunivariate standardized normal distributions. The required samples are εi

(1 ≤ i ≤ n), where

εi =i∑

k=1

αikxk.

For εi to have the correct variance and the correct correlation with the εj

(1 ≤ j ≤ n), we must havei∑

k=1

α2ik = 1

and, for all j ≤ i,j∑

k=1

αikαjk = ρij .

The first sample, ε1, is set equal to x1. These equations for the α′s can besolved so that ε2 is calculated from x1 and x2; ε3 is calculated from x1, x2

and x3; and so on. The procedure is known as the Cholesky decomposition.For example, when n = 3,

ε1 = x1

ε2 = ρ21x1 +√

1− ρ221x2

ε3 = ρ31x1 +ρ23 − ρ31ρ21√

1− ρ221

x2 +

√1 + 2ρ23ρ21ρ31 − ρ2

21 − ρ231 − ρ2

23

1− ρ221

x3

Page 56: MA4257

46 CHAPTER 4. MULTI-ASSET OPTIONS

Page 57: MA4257

Chapter 5

Path-dependent options

The contracts we have seen so far are the most basic and important deriva-tive products. In this chapter, we shall discuss some complex contracts,including barrier options, Asian options, lookback options and so on.

5.1 Barrier options

Barrier (or knock-in, knock-out) options are triggered by the action of theunderlying hitting a prescribed value at some time before expiry. For ex-ample, as long as the asset remains below a pre-determined barrier priceduring the whole life of the option, the contract will have a call payoff atexpiry. However, should the asset reach this level before expiry, then theoption becomes worthless because it has “knocked out”. Barrier options areclearly path dependent. A barrier option is cheaper than a similar vanillaoption since the former provides the holder with less rights than the latterdoes.

5.1.1 Different types of barrier options

There are two main types of barrier option:1. The out option, that only pays off if a level is not reached. If the

barrier is reached then the option is said to have knocked out2. The in option, that pays off as long as a level is reached before expiry.

If the barrier is reached then the option is said to have knocked in.Then we further characterize the barrier option by the position of the

barrier relative to the initial value of the underlying:1. If the barrier is above the initial asset value, we have an up option.

47

Page 58: MA4257

48 CHAPTER 5. PATH-DEPENDENT OPTIONS

2. If the barrier is below the initial asset value, we have a down option.Finally, the options can also be classified as call and put according to

the payoffs. In addition, if early exercise is permitted, the option is calledAmerican-style barrier option. In the following, barrier options always referto European-style options except for special claim.

5.1.2 In-Out parity

For European style barrier option, the relationship between a ‘in’ barrieroption and an ‘out’ barrier option (with same payoff and same barrier level)is very simple:

in + out = vanilla.

If the ‘in’ barrier is triggered then so is the ‘out’ barrier, so whether or notthe barrier is triggered the portfolio of an ‘in’ and ‘out’ option has the vanillapayoff at expiry.

However, the above in-out parity doesn’t hold for American-style barrieroptions.

5.1.3 Pricing by Monte-Carlo simulation

To illustrate method, we consider an up-out-call option whose terminal pay-off can be written as

(ST −X)+I{St<H, for all t∈[0,T ]}

where H is the barrier level, I is the index function, i.e.

I{St<H, t∈[0,T ]} =

{1, if St < H for all t ∈ [0, T ]

0, otherwise.

According to the risk neutral pricing principle, the option value is

e−rT E[(ST −X)+I{St<H, for all t∈[0,T ]} ]

We then use the Monte-Carlo simulation to get an approximate value of theoption.

It should be pointed out that the simulation can only apply to Europeanstyle options.

Page 59: MA4257

5.1. BARRIER OPTIONS 49

5.1.4 Pricing in the PDE framework

Barrier options are weakly path dependent. We only have to know whetheror not the barrier has been triggered, we do not need any other informationabout the path. This is contrast to some of the contracts we will be seeingshortly, such as the Asian and lookback options, that are strongly pathdependent.

We can use V (S, t) to denote the value of the barrier contract beforethe barrier has been triggered. This value still satisfies the Black-Scholesequation

∂V

∂t+

12σ2S2 ∂2V

∂S2+ (r − q)S

∂V

∂S− rV = 0. (5.1)

The details of the barrier feature come in through the specification of theboundary conditions and solution domains.

‘Out’ Barriers

If the underlying asset reaches the barrier in an ‘out’ barrier option then thecontract becomes worthless. This leads to the boundary condition

V (H, t) = 0 for t ∈ [0, T ].

The final condition is still

V (S, T ) =

{(S −X)+ for call(X − S)+ for put

.

The solution domain is {0 < S < H} × [0, T ) for an up-out option, and{H < S < ∞}× [0, T ) for a down-out option.

‘In’ Barriers

An ‘in’ option only has a payoff if the barrier is triggered. Remember thatV (S, t) stands for the option value before the barrier has been triggered.If the barrier is not triggered during the option’s life, the option expiresworthless so that we have the final condition

V (S, T ) = 0.

Once the barrier is triggered, the barrier-in option becomes a vanillaoption then on the barrier the contract must have the same value as avanilla contract:

V (H, t) =

{CE(H, t), for callPE(H, t), for put

, for t ∈ [0, T ],

Page 60: MA4257

50 CHAPTER 5. PATH-DEPENDENT OPTIONS

where CE(S, t) and PE(S, t) represent the values of the (European-style)vanilla call and put, respectively.

The solution domain is {0 < S < H} × [0, T ) for an up-in option, and{H < S < ∞}× [0, T ) for a down-in option.

*Explicit solutions

Closed-form solutions for European-style barrier options are available. Werefer interested readers to those references for these formulas. Here we onlylist the formula for the down-and-out call option with H ≤ X:

Cdo(S, t) = CE(S, t)−(

S

H

)1−2(r−q)/σ2

CE(H2

S, t), (5.2)

where CE(S, t) is the value of the European vanilla call.Let us confirm that this is indeed the solution. Clearly, on the barrier

Cdo(H, t) = CE(H, t)− CE(H, t) = 0. At expiry

Cdo(S, T ) = (S −X)+ −(

S

H

)1−2(r−q)/σ2

(H2

S−X)+

= (S −X)+, for S > H.

So the remaining is to show Cdo(S, t) satisfies the Black-Scholes equation.The first term on the right-hand side of (5.2) does. The second term doesalso. Actually, if we have any solution VBS , of the Black-Scholes equation itis easy to show that

S1−2(r−q)/σ2VBS(

A

S, t)

is also a solution for any constant A.

5.1.5 American early exercise

For American knock-out options, the pricing model is a free boundary prob-lem:

min

{−∂V

∂t− 1

2σ2S2 ∂2V

∂S2− (r − q)S

∂V

∂S+ rV, V − ϕ

}= 0, (S, t) ∈ D

V (H, t) = 0V (S, T ) = ϕ+

where D = (H,∞) × [0, T ) for a down-out, and D = (0,H) × [0, T ) for anup-out.

Page 61: MA4257

5.1. BARRIER OPTIONS 51

However, an American knock-in option price is governed still by theBlack-Scholes equation:

∂V

∂t+

12σ2S2 ∂2V

∂S2+ (r − q)S

∂V

∂S− rV = 0, (S, t) ∈ D (5.3)

The reason is V (S, t) represents the option value before the barrier is trig-gered. Thus for (S, t) ∈ D, the option cannot be early exercised because ithas not been activated. Actually the option should be called the knock-intoAmerican option. As in the European case, at expiry we have

V (S, T ) = 0 (5.4)

Once the barrier is triggered, the option becomes an American vanilla optionthen on the barrier. So,

V (H, t) =

{CA(H, t), for callPA(H, t), for put

, for t ∈ [0, T ], (5.5)

where CA(S, t) and PA(S, t) represent the values of the American vanillacall and put, respectively. (5.3-5.5) forms a complete model for Americanknock-in options.

5.1.6 BTM

The BTM can be readily extended to cope with barrier options. Recall theBlack-Scholes still holds. So the BTM must hold too, that is

V (S, t) =1ρ[pV (Su, t + ∆t) + (1− p)V (Sd, t + ∆t)]

with appropriate final and boundary conditions. For instance, for a down-out call option,

V (H, t) = 0V (S, T ) = (S −X)+,

and the backward procedure is conducted in the region {S > H} × [0, T ).When implementing the above method, the key point is to set the barrier

to pass those nodes of the tree. Otherwise it will lead to a large error. Toguarantee that the barrier matches the tree, the starting point of tree, ingeneral, cannot begin with (S0, 0). So, the interpolation technique shouldbe employed: the option value at (S0, 0) will be computed by a linear inter-polation of the values of two adjacent tree points.

Also, the method can apply to American barrier options when earlyexercise is considered.

Page 62: MA4257

52 CHAPTER 5. PATH-DEPENDENT OPTIONS

5.1.7 Hedging

Barrier options have discontinuous delta at the barrier. For example, aup-out call option value is continuous, decreasing approximately linearlytowards the barrier then being zero beyond the barrier. This discontinuityin the delta means that the gamma is instantaneously nfinite at the barrier.So, it is very hard to hedge a barrier option. We refer interested studentsto Wilmott (1998) and references therein.

5.1.8 Other features

The barrier option discussed above is the commonest one. Actually, theposition of the barrier in the contract can be time dependent. The levelmay begin at one level and then rise, say. Usually the level is a piecewise-constant function of time.

Another style of barrier option is the double barrier. Here there is bothan upper and a lower barrier, the first above and the second below thecurrent asset price. In a double ‘out’ option the contract becomes worthlessif either of the barriers is reached. In a double ‘in’ option one of the barriersmust be reached before expiry, otherwise the option expires worthless. Otherpossibilities can be imagined: one barrier is an ‘in’ and the other ‘out’, atexpiry the contract could have either and ‘in’ or an ‘out’ payoff.

Sometimes a rebate is paid if the barrier level is reached. This is oftenthe case for ‘out’ barriers in which case the rebate can be thought of ascushioning the blow of losing the rest of the payoff. The rebate may be paidas soon as the barrier is triggered or not until expiry.

5.2 Asian options

Asian options give the holder a payoff that depends on the average price ofthe underlying during the options’ life.

5.2.1 Payoff types

The payoff from a fixed strike Asian call (or, average call) is (AT −X)+, andthat from a fixed strike Asian put (or, average price put) is (X−AT )+, whereAT is the average value of the underlying asset calculated over a predeter-mined average period. Fixed strike Asian options are less expensive thanvanilla options and are more appropriate than vanilla options for meetingsome of the needs of corporate treasurers. Suppose that a U.S. corporate

Page 63: MA4257

5.2. ASIAN OPTIONS 53

treasurer expects to receive a cash flow of 100 million Australian dollarsspread evenly over the next year from the company’s Australian subsidiary.The treasurer is likely to be interested in an option that guarantees that theaverage exchange rate realized during the year is above some level. A fixedstrike Asian put can achieve this more effectively than vanilla put options.

Another type of Asian options is a floating strike option (or, averagestrike option). A floating strike Asian call pays off (ST−AT )+ and a floatingstrike Asian put pays off (AT −ST )+. Floating strike options can guaranteethat the average price paid for an asset in frequent trading over a period oftime is not greater than the final price. Alternatively, it can guarantee thatthe average price received for an asset in frequent trading over a period oftime is not less than the final price.

5.2.2 Types of averaging

The two simplest and obvious types of average are the arithmetic averageand the geometric average. The arithmetic average of the price is the sum ofall the constituent prices, equally weighted, divided by the total number ofprices used. The geometric average is the exponential of the sum of all thelogarithms of the constituent prices, equally weighted, divided by the totalnumber of prices used. Furthermore, the average may be based on discretelysampled prices or on continuously sampled prices. Then, we have

AT =

1n

∑ni=1 Sti , disretely sampled arithmetic

1T

∫ T0 Sτdτ, continuously sampled arithmetic

exp(

1n

∑ni=1 lnSti

)= (St1St2 ...Stn)1/n , discretely sampled geometric

exp(

1T

∫ T0 lnSτdτ

), continuously sampled geometric

5.2.3 Extending the Black-Scholes equation

Monte-Carlo simulation is easy to cope with the pricing of all kinds ofEuropean-style Asian options. However, for American-style Asian options,we have to fall back on PDE framework or BTM. In the following we onlyconsider the continuously sampled Asian options. We refer interested readerto Wilmott et al. (1995), Wilmott (1998) for the PDE formulation of dis-cretely sampled Asian options.

We start by assuming that the underlying asset follows the lognormalrandom walk

dSt = µStdt + σStdWt

Page 64: MA4257

54 CHAPTER 5. PATH-DEPENDENT OPTIONS

The value of an Asian option is not only a function of S and t, but also afunction of the historical average A, that is V = V (St, At, t). Here At willbe a new independent variable. Let us focus on the arithmetic average forwhich

At =1t

∫ t

0Sτdτ.

In anticipation of argument that will use Ito lemma, we need to know thestochastic differential equation satisfied by A. It is not hard to check that

dAt =tSt −

∫ t0 Stdτ

t2dt =

St −At

tdt.

We can see that its stochastic differential equation contains no stochasticterms. So, the Ito lemma for V = V (S,A, t) is

dV =

(∂V

∂t+

12σ2S2 ∂2V

∂S2

)dt +

∂V

∂AdA +

∂V

∂SdS

=

(∂V

∂t+

S −A

t

∂V

∂A+

12σ2S2 ∂2V

∂S2

)dt +

∂V

∂SdS.

To derive the pricing PDE, we set up a portfolio containing one of thepath-dependent option and short a number ∆ of the underlying asset:

Π = V (S,A, t)−∆S

The change in the value of this portfolio is given by

dΠ = dV −∆dS

=

(∂V

∂t+

S −A

t

∂V

∂A+

12σ2S2 ∂2V

∂S2

)dt +

(∂V

∂S−∆

)dS.

Choosing

∆ =∂V

∂S

to hedge the risk, we find that

dΠ =

(∂V

∂t+

S −A

t

∂V

∂A+

12σ2S2 ∂2V

∂S2

)dt.

This change is risk free, thus earns the risk-free rate of interest r, leading tothe pricing equation

∂V

∂t+

S −A

t

∂V

∂A+

12σ2S2 ∂2V

∂S2+ rS

∂V

∂S− rV = 0 (5.6)

Page 65: MA4257

5.2. ASIAN OPTIONS 55

The solution domain is {S > 0, A > 0, t ∈ [0, T )}.This is to be solved subject to

V (S,A, T ) =

(A− S)+, floating put(S −A)+, floating call(A−X)+, fixed call(X −A)+, fixed put

The obvious changes can be made to accommodate dividends on the under-lying. This completes the formulation of the valuation problem.

For geometric average,

dA = d exp(

1t

∫ t

0lnSτdτ

)= exp

(1t

∫ t

0lnSτdτ

)t lnS − ∫ t

0 lnSτdτ

t2dt

=A ln S

A

t.

So the pde satisfied by geometric Asian options is

∂V

∂t+

A ln SA

t

∂V

∂A+

12σ2S2 ∂2V

∂S2+ rS

∂V

∂S− rV = 0.

The solution domain is {S > 0, A > 0, t ∈ [0, T )}.

5.2.4 Early exercise

The only point to mention is that the details of the payoff on early exercisehave to be well defined. The payoff at expiry depends on the value of theaverage up to expiry; this will, of course, not be known until expiry. Typi-cally, on early exercise it is the average to date that is used. For example,in an American floating strike arithmetic put the early payoff would be

(1t

∫ t

0Sτdτ − St

)+

.

In general, we denote the exercise payoff at time t by Λ(St, At), where

Λ(St, At) =

(At − St)+, floating put(St −At)+, floating call(At −X)+, fixed call(X −At)+, fixed put

.

Page 66: MA4257

56 CHAPTER 5. PATH-DEPENDENT OPTIONS

Such representation is consistent with the terminal payoff. Then the pricingmodel is formulated by

min{−∂V

∂t− LV, V − Λ(S,A)

}= 0

V (S,A, T ) = Λ(S,A)

where

L =

{S−A

t∂

∂A + 12σ2S2 ∂2

∂S2 + (r − q)S ∂∂S − r, for arithmetic

A ln(S/A)t

∂∂A + 1

2σ2S2 ∂2

∂S2 + (r − q)S ∂∂S − r, for geometric

Remark 14 All European-style Asian geometric options have explicit priceformulas. However, that is generally not true for Asian arithmetic optionsand all American-style Asian options which must be solved by numericalapproaches.

5.2.5 Reductions in dimensionality

The Asian options value depends on three variables. For some cases, areduction in dimensionality of the problem is permitted. For example, weconsider the a European floating strike geometric Asian option. Adopt thefollowing transformation:

V (S,A, t)S

= W (x, t), x = t lnA

S,

the model reduces to (suppose there are continuous dividend payments)

∂W

∂t+

12σ2t2

∂2W

∂x2+ (q − r − σ2

2)t

∂W

∂x− qW = 0, t ∈ [0, T ), x ∈ (−∞,∞)

W (x, T ) =(1− e

xT

)+.

The problem is relatively easy to solve. Actually it has an explicit solution.

Remark 15 For any floating strike Asian option (arithmetic or geometric,European or American), one can find an appropriate transformation to re-duce the model to a lower dimensional problem. As for fixed strike Asianoptions, it is true only for European-style.

Page 67: MA4257

5.2. ASIAN OPTIONS 57

5.2.6 Parity relation

We will give one example. Consider European-style floating strike arithmeticcall options. The payoff at expiry for a portfolio of one call held long andone put held short is

(S −A)+ − (A− S)+,

where is simplyS −A.

The value of the portfolio then satisfies the equation (??) with the finalcondition

V (S,A, T ) = S −A.

It can be verified that the solution is

V (S,A, t) = S − S

rT(1− e−r(T−t))− t

Te−r(T−t)A.

Therefore, for floating strike arithmetic Asian options, we have the put-callparity relation

Vfc(S,A, t)− Vfp(S,A, t) = S − S

rT(1− e−r(T−t))− t

Te−r(T−t)A.

We have similar results for other European Asian options.

5.2.7 Model-dependent and model-independent results

We need to point out that some results depends on assumptions of the model.For example, the above Asian put-call parity holds for the Black-Scholesmodel, but it might not be true, in general, provided that the geometricBrownian assumption is given up.

Recall some results are model-independent. For example:(1) the put-call parity for vanilla options(2) the in-out parity for barrier options(3) American call options should never be early exercised if there is no

dividend payment.To acquire these results, one only needs no-arbitrage principle.But some are true only under Black-Scholes framework. For instance:(1) the above Asian put-call parity(2) the put-call symmetry relation

Page 68: MA4257

58 CHAPTER 5. PATH-DEPENDENT OPTIONS

5.2.8 Binomial tree method

The binomial tree method for European-style Asian option is given as fol-lows:

V (S,A, t) = e−r∆t [pV (Su, Au, t + ∆t) + (1− p)V (Sd, Ad, t + ∆t)]V (S,A, T ) = Λ(S,A)

where

p =e(r−q)∆t − d

u− d

andAu =

tA + ∆tSu

t + ∆t, and Ad =

tA + ∆tSd

t + ∆t.

Early exercise can be easily incorporated into the above algorithm as beforeto deal with American style options.

5.3 Lookback options

The dream contract has to be one that pays the difference between thehighest and the lowest asset prices realized by an asset over some period.Any speculator is tying to achieve such a trade. The contract that pays thisis an example of a lookback option, an option that pays off some function ofthe realized maximum and/or minimum of the underlying asset over someprescribed period. Since lookback options have such an extreme payoff theytend to be expensive.

5.3.1 Types of Payoff

For the basic lookback contracts, the payoff comes in two varieties, like theAsian option: the fixed strike and the floating strike, respectively. Thesehave payoffs that are the same as vanilla options except that in the floatingstrike option the vanilla exercise price is replaced by the maximum or min-imum. In the fixed strike option it is the asset value in the vanilla optionthat is replaced by the maximum or minimum. That is

Λ(ST ,MT ) =

MT − ST , floating putST −mT , floating call(MT −X)+ , fixed call(X −mT )+, fixed put

Page 69: MA4257

5.3. LOOKBACK OPTIONS 59

HereMT = max

0≤τ≤TSτ and mT = min

0≤τ≤TSτ .

Note that for floating lookback options + can be removed.

5.3.2 Extending the Black-Scholes equations

Let V = V (St,Mt, t) (or. V = V (St,mt, t)) be the lookback option value,where

Mt = max0≤τ≤t

Sτ and mt = min0≤τ≤t

Sτ .

We anticipate that Ito lemma will be used to derive the model. However,max0≤τ≤t Sτ (or. min0≤τ≤t Sτ ) is not differentiable. So we have to introduceanother variable. (let us consider the fixed call or floating put)

Jnt = (1t

∫ t

0Sn

τ dτ)1/n.

It is easy to seelim

n→∞Jnt = max0≤τ≤t

St = Mt

and

dJnt =J1−n

nt

nt(Sn

t − Jnnt)dt.

Next we consider the function V (St, Jnt, t), which can be imagined as aproduct depends on the variable Jnt. Using the ∆−hedging argument andIto lemma, we find that V (St, Jnt, t) satisfies

∂V

∂t+

1nt

Sn − Jnn

Jn−1n

∂V

∂Jn+

12σ2S2 ∂2V

∂S2+ rS

∂V

∂S− rV = 0.

We now take the limit n → ∞. Note that St ≤ max0≤τ≤t Sτ = Mt. WhenS < M,

limn→∞

1nt

Sn − Jnn

Jn−1n

= 0;

Thus we obtain a governing equation for the floating strike lookback putoption

∂V

∂t+

12σ2S2 ∂2V

∂S2+ rS

∂V

∂S− rV = 0 for S < M

At expiry we haveV (S,M, T ) = Λ(S,M)

Page 70: MA4257

60 CHAPTER 5. PATH-DEPENDENT OPTIONS

The solution domain is {S ≤ M} × [0, T ]. Here we require an auxiliarycondition on S = M,

∂V

∂M

∣∣∣∣M=S

= 0.

For a floating call or fixed call, we can establish its governing equationin a similar way. And the solution domain is {S ≥ m} × [0, T ].

5.3.3 BTM

Consider a floating put or fixed call. Similarly we have

V (S,M, t) = e−r∆t [pV (Su, Mu, t + ∆t) + (1− p)V (Sd, Md, t + ∆t)]

where

p =e(r−q)∆t − d

u− d

andMu = max(M, Su), and Md = max(M, Sd).

Due to M ≥ S and d < 1, we have

Md = M.

Then the binomial tree method is given by

V (S,A, t) = e−r∆t [pV (Su, Mu, t + ∆t) + (1− p)V (Sd, M, t + ∆t)] , for S ≤ M

V (S,A, T ) = Λ(S,A)

5.3.4 Consistency of the BTM and the continuous-time model:

Note that M ≥ S. We only need to consider two cases:(1) M ≥ Su : In this case Mu = Md = M. Then the binomial tree

scheme can be rewritten as

V (S,M, t) = e−r∆t [pV (Su, M, t + ∆t) + (1− p)V (Sd, M, t + ∆t)] .

Using the Taylor expansion, we obtain

−rV (S,M, t)+∂V

∂t(S,M, t)+(r−q)S

∂V

∂S(S,M, t)+

12σ2S2 ∂2V

∂S2(S,M, t) = O(∆t).

(2) M = S : Then Mu = Su and Md = S, and the binomial tree schemebecomes

V (S, S, t) = e−r∆t [pV (Su, Su, t + ∆t) + (1− p)V (Sd, S, t + ∆t)] .

Page 71: MA4257

5.3. LOOKBACK OPTIONS 61

By virtue of the Taylor expansion, we have

V (S, S, t)− e−r∆t [pV (Su, Su, t + ∆t) + (1− p)V (Sd, S, t + ∆t)]

= −[∂V

∂t(S, S, t) + (r − q)S

∂V

∂S(S, S, t) +

12σ2S2 ∂2V

∂S2(S, S, t)− rV (S, S, t)]∆t

−e−r∆tp(Su− S)∂V

∂M(S, S, t) + O(∆t)

= ∆t1/2 ∂V

∂M(S, S, t) + O(∆t),

which implies∂V

∂M= 0 at M = S.

Remark 16 For Asian options, the Taylor expansion also gives the consis-tency. Indeed,

V (S,A, t)− e−r∆t [pV (Su, Au, t + ∆t) + (1− p)V (Sd, Ad, t + ∆t)]

= −[∂V

∂t(S,A, t) + (r − q)S

∂V

∂S(S,A, t) +

12σ2S2 ∂2V

∂S2(S,A, t)− rV (S,A, t)]∆t

−e−r∆t[p(Au −A) + (1− p)(Ad −A)]∂V

∂A(S,A, t)

−e−r∆t[p(u− 1)(Au −A) + (1− p)(d− 1)(Ad −A)]S∂2V

∂S∂A(S,A, t)

+O(∆t2) + O((Au −A)∆t) + O((Ad −A)∆t) + O((Au −A)2) + O((Ad −A)2).

For the arithmetic average, since Au−A = Su−At ∆t and Ad−A = Sd−A

t ∆t,we have

e−r∆t[p(Au −A) + (1− p)(Ad −A)] =S −A

t∆t + O(∆t2),

e−r∆t[ p(u− 1)(Au −A) + (1− p)(d− 1)(Ad −A)] = O(∆t2).

Then we get

V (S,A, t)− e−r∆t [pV (Su, Au, t + ∆t) + (1− p)V (Sd, Ad, t + ∆t)]

=

[−∂V

∂t− S −A

t

∂V

∂A− 1

2σ2S2 ∂2V

∂S2− (r − q)S

∂V

∂S+ rV

]∆t + O(∆t2).

It is similar for the geometric average.

Page 72: MA4257

62 CHAPTER 5. PATH-DEPENDENT OPTIONS

5.3.5 Similarity reduction

The similarity reduction also applies to some lookback options. For example,for floating strike lookback put options, it follows from the transformationsV (S,M,t)

S = W (x, t) and x = MS ,

∂W

∂t+

12σ2x2 ∂2W

∂x2+ (q − r)x

∂W

∂x− qW = 0, t ∈ [0, T ), x ∈ (1,∞)

∂W

∂x

∣∣∣∣x=1

= 0

W (x, T ) = x− 1.

The reduction can be extended to the binomial tree model. Taking thesame transformations, we have for the floating strike lookback call

SW (M

S, t) = e−r∆t

[pSuW (

max(M, Su)Su

, t + ∆t) + (1− p)SdW (M

Sd, t + ∆t)

]

SW

(M

S,T

)= M − S, for M ≥ S.

or

W (x, t) = e−r∆t [puW (max(xd, 1), t + ∆t) + (1− p)dW (xu, t + ∆t)]W (x, T ) = x− 1, for x ≥ 1. (5.7)

(5.7) can be rewritten as

W (x, t) = e−r∆t [puW (xd, t + ∆t) + (1− p)dW (xu, t + ∆t)] , for x ≥ u

W (1, t) = e−r∆t [puW (1, t + ∆t) + (1− p)dW (u, t + ∆t)] ,W (x, T ) = x− 1.

5.3.6 Russian options

The similarity reduction can also be applied to the American-style lookbackoption. Here, we consider the Russian option, a special perpetual option,whose payoff is M. Since it is a perpetual option, the option value is inde-pendent of time. Let V = V (S,M) denote the option value. The pricingmodel for the Russian option is given by

min

{−1

2σ2S2 ∂2V

∂S2− (r − q)S

∂V

∂S+ rV, V −M

}= 0, for S < M

∂V

∂M

∣∣∣∣S=M

= 0.

Page 73: MA4257

5.4. MISCELLANEOUS EXOTICS 63

Using the transformations

W (x) =V (S,M)

Sand x =

M

S,

we have

min

{−1

2σ2x2 ∂2W

∂x2− (q − r)x

∂W

∂x+ qW,W − x

}= 0, for x > 1

∂V

∂x

∣∣∣∣x=1

= 0.

For a fixed S, it becomes more attractive to exercise the Russian optionif M is sufficiently large. Then, the above model can be rewritten as

−12σ2x2 ∂2W

∂x2− (q − r)x

∂W

∂x+ qW = 0, for 1 < x < x∗

W (x∗) = x∗

∂W

∂x(x∗) = 1

∂W

∂x(1) = 0.

Solving the equations we can obtain the option value (Assignment). Aquestion: what about the option value if q = 0.

5.4 Miscellaneous exotics

5.4.1 Forward start options

As its name suggests, a forward start option is an option that comes intobeing some time in the future. Let us consider an example: a forward startcall option is bought now, at time t = 0, but with a strike price that is notknown until time T1, when the strike is set at the asset price on that date,say. The option expires later at time T.

The way to price this contract is to ask what happens at time T1. Atthat time we get an at-the-money option with a time T − T1 left to expiry.If the stock price at time T1 is S1 then the value of the contract is simplythe Black-Scholes value with S = S1, X = S1 and with given values for rand σ. For a call option this value, as a function of S1, is S1f(T1), where

f(t) = N(d1,t)− e−r(T−t)N(d2,t), (5.8)

Page 74: MA4257

64 CHAPTER 5. PATH-DEPENDENT OPTIONS

d1,t =r + 1

2σ2

σ

√T − t and d2,t =

r − 12σ2

σ

√T − t.

The value is proportional to S. Thus, at time T1, we will hold an asset worthS1f(T1). Since this is a constant multiplied by the asset price at time T1 thevalue today must be

Sf(T1),

where S is today’s asset price.We can also formulate the problem as a PDE model. Let V = V (S, t)

be the option value. Then V (S, t) must satisfy the Black-Scholes equation

∂V

∂t+

12σ2S2 ∂2V

∂S2+ rS

∂V

∂S− rV = 0

At time t = T1, the option values are known as

V (S, T1) = Sf(T1).

So we will solve the Black-Scholes equation in (S, t) ∈ (0,∞)× [0, T1) withthe above final condition. It is not hard to check that

V (S, t) = Sf(T1)

is just the solution.

5.4.2 Shout options

A shout call option is a vanilla call option but with the extra feature thatthe holder can at any time reset the strike price of the option to the currentlevel of the asset. The action of resetting is called ‘shouting’.

There is clearly an element of optimization in the matter of shouting.One would expect to see a free boundary problem occur quite naturally aswith American options. For a shout call option, if the shouting happens attime t, the holder essentially acquires an at-the-money option whose valueis

Stf(t).

Here f(t) is given by 5.8). Then its pricing model is

min

{−∂V

∂t− 1

2σ2S2 ∂2V

∂S2− rS

∂V

∂S+ rV, V − Sf(t)

}= 0, S > 0, t ∈ [0, T )

V (S, T ) = (S −X)+

This problem has no closed-form solution and must be solved numerically.It is not hard to extend to the binomial tree model and find a numericalsolution.

Page 75: MA4257

5.4. MISCELLANEOUS EXOTICS 65

5.4.3 Compound options

A compound option is simply an option on an option. There are four maintypes of compound options, namely, a call on a call, a call on a put, a puton a call and a put on a put. A compound option has two strike prices andtwo expiration dates. As an illustration, we consider a call on a call whereboth calls are European-style. On the first expiration date T, the holder ofthe compound option has the right to buy the underlying call option for thefirst strike price X. The underlying call option again gives the right to theholder to buy the underlying asset for the second strike price X1 on a laterexpiration date T1.

Let St be the price of the underlying asset of the underlying option.Suppose St satisfies the geometric Brownian motion. Then the value of theunderlying call option C(S, t;X, T ) can be represented by the Black-Scholesformula. The value of the compound option is also a function of S and t,denoted by V (S, t). Hence it satisfies the Black-Scholes equations for S > 0,t ∈ [0, T ), that is,

∂V

∂t+

12σ2S2 ∂2V

∂S2+ (r − q)S

∂V

∂S− rV = 0, for S > 0, t ∈ [0, T ).

At t = T, we have the final condition

V (S, T ) = (C(S, T ;X1, T1)−X)+.

The closed-form solution to the above model can be obtained. We referinterested readers to Hull (1998) or Kwok (1998).

Remark 17 Recall the pricing model for a future call option. At the ex-piration date t = T, the holder of a future call option has the right to getthe cash (FT − X) and a long position of a future contract that expires atlater time T1. Here Ft stands for the future price at time t. Since it is free toenter a long position of a future contract, the payoff of the future option is(FT −X)+. Let St denote the underlying asset price of the future contract.Due to Ft = Ste

(r−q)(T1−t), the payoff of the future option can be rewritten as(ST e(r−q)(T1−T ) −X)+. Then the option value, denoted by U(S, t), satisfies

∂U

∂t+

12σ2S2 ∂2U

∂S2+ (r − q)S

∂U

∂S− rU = 0, for S > 0, t ∈ [0, T )

U(S, T ) = (Se(r−q)(T1−T ) −X)+.

If we consider the option value as a function of the future price, i.e. V =V (F, t), then we can make the transformations V (F, t) = U(S, t) and F =

Page 76: MA4257

66 CHAPTER 5. PATH-DEPENDENT OPTIONS

Se(r−q)(T1−t) to get

∂V

∂t+

12σ2F 2 ∂2V

∂F 2− rV = 0, for F > 0, t ∈ [0, T )

V (F, T ) = (F −X)+.

Page 77: MA4257

Chapter 6

Beyond the Black-Scholesworld

6.1 Volatility simile phenomena and defects in theBlack-Scholes model

Before pointing out some of the flaws in the assumptions of the Black-Scholes world, we must emphasize how well the model has done in practice,how widespread is its use and how much impact it has had on financialmarkets. The model is used by everyone working in derivatives whetherthey are salesmen, traders or quants. It is used confidently in situations forwhich it was not designed, usually successfully. The value of vanilla optionsare often not quoted in monetary terms, but in volatility terms with theunderstanding that the price of a contract is its Black-Scholes value usingthe quoted volatility. The idea of delta hedging and risk-neutral pricinghave taken a formidable grip on the minds of academics and practitionersalike. In many ways, especially with regards to commercial success, theBlack-Scholes model is remarkably robust.

Nevertheless, there is room for improvement.

6.1.1 Implied volatility and volatility similes

The one parameter in the Black-Scholes pricing formulas that cannot bedirectly observed is the volatility of the underlying asset price which is ameasure of our uncertainty about the returns provided by the underlyingasset. Typically values of the volatility of an underlying asset are in therange of 20% to 40% per annum.

67

Page 78: MA4257

68 CHAPTER 6. BEYOND THE BLACK-SCHOLES WORLD

The volatility can be estimated from a history of the underlying assetprice. However, it is more appropriate to mention an alternative approachthat involves what is termed an implied volatility. This is the volatilityimplied by an option price observed in the market.

To illustrate the basic idea, suppose that the market price of a call ona non-dividend-paying underlying is 1.875 when S0 = 21, X = 20, r = 0.1and T = 0.25. The implied volatility is the value of σ, that when substitutedinto the Black-Scholes formula

c = S0N

(ln S0

X + (r + σ2

2 )T

σ√

T

)+ Xe−rT N

(ln S0

X + (r − σ2

2 )T

σ√

T

)

gives c = 1.875. In general, it is not possible to invert the formula so that σis expressed as a function of S0, X, r, T, and c. But, it is not hard to useMatlab to find a numerical solution of σ because

∂c

∂σ> 0.

In this example, the implied volatility is 23.5%.Implied volatilities can be used to monitor the market’s opinion about

the volatility of a particular stock. Analysts often calculate implied volatili-ties from actively traded options on a certain stock and use them to calculatethe price of a less actively traded option on the same stock.

Black-Scholes assumes that volatility is a known constant. If it is true,then the implied volatility should keep invariant w.r.t. different strike prices.However, in reality, the shape of this implied volatility versus strike curveis often like ‘a smile’, instead of a flat line. This is the so-called ‘volatilitysimile’ phenomena. In some markets it shows considerable asymmetry, askew, and sometimes it is upside down in a frown. The general shape tendsto persist for a long time in each underlying.

The volatility simile phenomena implies that there are flaws in the Black-Scholes model.

6.1.2 Improved models

(1) Local volatility model:Black-Scholes assumes that volatility is a known constant. If volatility

is not a simple constant then perhaps it is a more complicated function oftime and the underlying.

(2) Stochastic volatility

Page 79: MA4257

6.2. LOCAL VOLATILITY MODEL 69

The Black-Scholes formulae require the volatility of the underlying to bea constant (or a known deterministic function of time). The Black-Scholesequation requires the volatility to be a known function of time and assetvalue (i.e. the local volatility model). Neither of these is true. All volatilitytime series show volatility to be a highly unstable quantity. It is very variableand unpredictable. It is therefore natural to represent volatility itself as arandom variable. Stochastic volatility models are currently popular for thepricing of contracts that are very sensitive to the behavior of volatility.

(3) Jump diffusion model.Black-Scholes assumes that the underlying asset path is continuous. It

is common experience that markets are discontinuous: from time to timethey ‘jump’, usually downwards. This is not incorporated in the lognormalasset price model, for which all paths are continuous.

(4) Others:Discrete hedging: Black-Scholes assumes the delta-hedging is continuous.

When we derived the Black-Scholes equation we used the continuous-timeIto’s lemma. The delta hedging that was necessary for risk elimination alsohad to take place continuously. If there is a finite time between rehedgesthen there is risk that has not been eliminated.

Transaction costs: Black-Scholes assumes there are no costs in deltahedging. But not only must we worry about hedging discretely, we mustalso worry about how much it costs us to rehedge. The buying and sellingof assets exposes us to bid-offer spreads. In some markets this is insignificant,then we rehedge as often as we can. In other markets, the cost can be sogreat that we cannot afford to hedge as often as we would like.

6.2 Local volatility model

Suppose the volatility of the underlying asset is a deterministic function ofS and t, i.e. σ = σ(S, t). It is not hard to show that the option value stillsatisfies the Black-Scholes equations,

∂V

∂t+

12σ2(S, t)S2 ∂2V

∂S2+ (r − q)S

∂V

∂S− rV = 0, for S > 0, t ∈ [0, T ).

with the final condition

V (S, T ) =

{(S −X)+ for call(X − S) for put

A natural question is how to calibrate the function σ(S, t). In general,there are not closed form solutions to the pricing model provided that volatil-

Page 80: MA4257

70 CHAPTER 6. BEYOND THE BLACK-SCHOLES WORLD

ity is a function of S and t. To identify the volatility function, we need toexploit more information.

Let V (S, t;K, T ) be the option price with strike K and maturity T. Itcan be shown (see Wilmott (1998)) that as a function K and T, the functionV (., .;K, T ) satisfies

−∂V

∂T+

12σ2(K, T )K2 ∂2V

∂K2− (r − q)K

∂V

∂K− qV = 0, for K > 0, T ≥ t∗.

V (S∗, t∗;K, T ) = (S∗ −K)+

where t∗ is current time, S∗ is current asset price. Remember σ(., .) is justa function. We can identify the function σ(., .) by

σ(K,T ) =

√√√√∂V∂T + (r − q)K ∂V

∂K + qV12K2 ∂2V

∂K2

.

In practice, one often calibrates the model from the market prices of vanillaoptions so as to price some exotic options of the OTC market.

6.3 Stochastic volatility model

Volatility doesn’t not behave how the Black-Scholes equation would like itto behave; it is not constant, it is not predictable, it’s not even directly ob-servable. This make it a prime candidate for modeling as a random variable.

6.3.1 Random volatility

We continue to assume that S satisfies

dS = µSdt + σSdW1,

but we further assume that volatility satisfies

dσ = p(S, σ, t)dt + q(S, σ, t)dW2.

The two increments dW1 and dW2 have a correlation of ρ. The choice offunctions p(S, σ, t) and q(S, σ, t) is crucial to the evolution of the volatility,and thus to the pricing of derivatives.

The value of an option with stochastic volatility is a function of threevariables, V (S, σ, t).

Page 81: MA4257

6.3. STOCHASTIC VOLATILITY MODEL 71

6.3.2 The pricing equation

The new stochastic quantity that we are modeling, the volatility, is nota traded asset. Thus, when volatility is stochastic we are faced with theproblem of having a source of randomness that cannot be easily hedgedway. Because we have two sources of randomness we must hedge our optionwith two other contracts, one being the underlying asset as usual, but nowwe also need another option to hedge the volatility risk. We therefore mustset up a portfolio containing one option, with value denoted by V (S, σ, t), aquantity −∆ of the asset and a quantity −∆1 of another option with valueV1(S, σ, t). We have

Π = V −∆S −∆1V1.

The change in this portfolio in a time dt is given by

dΠ =

(∂V

∂t+

12σ2S2 ∂2V

∂S2+ ρσqS

∂2V

∂S∂σ+

12q2 ∂2V

∂σ2

)dt

−∆1

(∂V1

∂t+

12σ2S2 ∂2V1

∂S2+ ρσqS

∂2V1

∂S∂σ+

12q2 ∂2V1

∂σ2

)dt

+(

∂V

∂S−∆1

∂V1

∂S−∆

)dS

+(

∂V

∂σ−∆1

∂V1

∂σ

)dσ

where we have used Ito lemma on functions of S, σ and t.To eliminate all randomness from the portfolio we must choose

∂V

∂S−∆−∆1

∂V1

∂S= 0,

to eliminate dS terms, and

∂V

∂σ−∆1

∂V1

∂σ1= 0,

to eliminate dσ terms. This leaves us with

dΠ =

(∂V

∂t+

12σ2S2 ∂2V

∂S2+ ρσqS

∂2V

∂S∂σ+

12q2 ∂2V

∂σ2

)dt

−∆1

(∂V1

∂t+

12σ2S2 ∂2V1

∂S2+ ρσqS

∂2V1

∂S∂σ+

12q2 ∂2V1

∂σ2

)dt

= rΠdt = r(V −∆S −∆1V1)dt

Page 82: MA4257

72 CHAPTER 6. BEYOND THE BLACK-SCHOLES WORLD

where I have used arbitrage arguments to set the return on the portfolioequal to the risk-free rate.

As it stands, this is one equation in two unknowns, V and V1. Thiscontrasts with the earlier Black-Scholes case with one equation in the oneunknowns.

Collecting all terms on the left-hand side and all V1 terms on the right-hand side we find that

∂V∂t + 1

2σ2S2 ∂2V∂S2 + ρσqS ∂2V

∂S∂σ + 12q2 ∂2V

∂σ2 + rS ∂V∂S − rV

∂V∂σ

=∂V1∂t + 1

2σ2S2 ∂2V1∂S2 + ρσqS ∂2V1

∂S∂σ + 12q2 ∂2V1

∂σ2 + rS ∂V1∂S − rV1

∂V1∂σ

.

We are lucky that the left-hand side is a function of V but not V1 andthe right-hand side is a function of V1 but not V. Since the two options willtypically have different payoffs, strikes or expiries, the only way for this tobe possible is for both sides to be independent of the contract type. Bothsides can only be functions of the independent variables, S, σ and t. Thuswe have

∂V∂t + 1

2σ2S2 ∂2V∂S2 + ρσqS ∂2V

∂S∂σ + 12q2 ∂2V

∂σ2 + rS ∂V∂S − rV

∂V∂σ

= −a(S, σ, t)

for some function a(S, σ, t). Reordering this equation, we have

∂V

∂t+

12σ2S2 ∂2V

∂S2+ρσqS

∂2V

∂S∂σ+

12q2 ∂2V

∂σ2+rS

∂V

∂S+a(S, σ, t)

∂V

∂σ−rV = 0.

The final condition is

V (S, σ, T ) =

{(S −X)+, for call option(X − S)+, for put option

.

The solution domain is {σ > 0, S > 0, t ∈ [0, T )}.Remark 18 In the risk-neutral world, the underlying asset S follows thefollowing process:

dSt = rStdt + σStdWt.

We can similarly get the risk-neutral process of σ as follows

dσ = a(S, σ, t)dt + q(S, σ, t)dWt.

Here a(S, σ, t) is often rewritten as

a(S, σ, t) = p(S, σ, t)− λ(S, σ, t)q(S, σ, t),

where λ(S, σ, t) is called the market price of risk.

Page 83: MA4257

6.4. JUMP DIFFUSION MODEL 73

6.3.3 Named models

1. Hull & White (1987):

d(σ2

)= k(b− σ2)dt + cσ2dW2,

where k, b and c are constant.Using the Ito lemma, we can get

dσ =[−1

8c2σ +

k

2

(b

σ− σ

)]dt +

c

2σdW2.

2. Heston (1993)dσ = −γσdt + δdW2.

Explicit price formulas are available for Heston model. For Hull-Whitemodel, explicit formulas exist when S and σ are uncorrelated.

6.4 Jump diffusion model

There is plenty of evidence that financial quantities do not follow the lognor-mal random walk that has been the foundation of the Black-Scholes model.One of the striking features of real financial markets is that every now andthen there is a sudden unexpected fall or crash. These sudden movements oc-cur far more frequently than would be expected from a Normally-distributedreturn with a reasonable volatility.

6.4.1 Jump-diffusion processes

The basic building block for the random walks we have considered so far iscontinuous Brownian motion based on the Normally-distributed increment.We can think of this as adding to the return from one day to the next a Nor-mally distributed random variable with variance proportional to timestep.The extra building block we need for the jump-diffusion model for an assetprice is the Poisson process. A Poisson process dq is defined

dq =

{0, with probability 1− λdt

1, with probability λdt.

There is therefore a probability λdt of a jump in q in the timestep dt. Theparameter λ is called the intensity of the Poisson process.

Page 84: MA4257

74 CHAPTER 6. BEYOND THE BLACK-SCHOLES WORLD

This Poisson process can be incorporated into a model for an asset inthe following way:

dS

S= µdt + σdW + (J − 1)dq.

This is the jump-diffusion process. We assume that there is no correlationbetween the Brownian motion and the Poisson process. If there is a jump(dq = 1) then S immediately goes to the value JS. We can model a sudden10% fall in the asset price by J = 0.9. We can generalize further by allowingJ to be a random quantity.

A jump-diffusion version of the Ito lemma is

dV (S, t) =

(∂V

∂t+

12σ2S2 ∂2V

∂S2+ µS

∂V

∂S

)dt+σS

∂V

∂SdW+(V (JS, t)− V (S, t)) dq.

The random walk in lnS follows

d lnS =

(µ− σ2

2

)dt + σdW + (ln(JS)− ln(S))dq

=

(µ− σ2

2

)dt + σdW + lnJdq

6.4.2 Hedging when there are jumps

Hold a portfolio of the option and −∆ of the underlying:

Π = V (S, t)−∆S.

The change in the value of this portfolio is

dΠ = dV −∆dS

=

(∂V

∂t+

12σ2S2 ∂2V

∂S2+ µS

∂V

∂S

)dt + σS

∂V

∂SdW + (V (JS, t)− V (S, t)) dq

−∆[µSdt + σSdW + (J − 1)Sdq].

6.4.3 Merton’s model (1976)

If we choose

∆ =∂V

∂S,

Page 85: MA4257

6.4. JUMP DIFFUSION MODEL 75

we are following a Black-Scholes type of strategy, hedging the diffusive move-ments. The change in the portfolio value is then

dΠ =

(∂V

∂t+

12σ2S2 ∂2V

∂S2

)dt +

(V (JS, t)− V (S, t)− (J − 1)S

∂V

∂S

)dq.

The portfolio now evolves in a deterministic fashion, except that every sooften there is a non-deterministic jump in its value. It can be argued (Merton1976) if the jump component of the asset price process is uncorrelated withthe market as a whole, then the risk in the discontinuity should not be pricedin the option. Diversifiable risk should not be rewarded. In other words,we can take expectations of this expression and set that value equal to theriskfree return from the portfolio, namely

E(dΠ) = rΠdt.

This argument is not completely satisfactory, but is a common assumptionwhenever there is a risk that cannot be fully hedged.

Since there is no correlation between dW and dq, and

E(dq) = λdt,

we arrive at

∂V

∂t+

12σ2S2 ∂2V

∂S2+rS

∂V

∂S−rV +λ [V (JS, t)− V (S, t)]−λ [J − 1]S

∂V

∂S= 0.

If the jump size J is a random quantity, we need to take the expectationover the J. It follows

∂V

∂t+

12σ2S2 ∂2V

∂S2+rS

∂V

∂S−rV +λEJ [V (JS, t)− V (S, t)]−λEJ [J − 1]S

∂V

∂S= 0.

There is a simple solution of this equation in the special case that thelogarithm of J is Normally distributed. If the logarithm of J is Normallydistributed with standard deviation σ′ and if we write

k = EJ [J − 1]

then the price of a European non-path-dependent option can be written as

∞∑

n=1

1n!

e−λ′(T−t)(λ′(T − t))nVBS(S, t;σn, rn),

Page 86: MA4257

76 CHAPTER 6. BEYOND THE BLACK-SCHOLES WORLD

where

λ′ = λ(1 + k), σ2n = (σ2 +

nσ′2

T − t) and rn = r − λk +

n ln(1 + k)T − t

,

and VBS is the Black-Scholes formula for the option value in the absenceof jumps. This formula can be interpreted as the sum of individual Black-Scholes values, each of which assumes that there have been n jumps, andthey are weighted according to the probability that there will have been njumps before expiry.

6.4.4 Wilmott et al.’s model

In the above we hedged the diffusive element of the random walk for theunderlying. Another possibility is to hedge both the diffusion and jumps asmuch as we can. For example, we could choose ∆ to minimize the varianceof the hedged portfolio.

The changes in the value of the portfolio with an arbitrary ∆ is

dΠ = (...) dt + σS

(∂V

∂S−∆

)dW + (V (JS, t)− V (S, t)−∆(J − 1)S) dq.

The variance in this change, which is a measure of the risk in the portfolio,is

var [dΠ] = σ2S2(

∂V

∂S−∆

)2

dt+λ (V (JS, t)− V (S, t)−∆(J − 1)S)2 dt+O(dt2)

If J is a random quantity, we take expectation over J to get

var [dΠ] = σ2S2(

∂V

∂S−∆

)2

dt+λEJ[(V (JS, t)− V (S, t)−∆(J − 1)S)2

]dt+O(dt2)

By neglecting O(dt2), this is minimized by the choice

∆ =λEJ [(J − 1) (V (JS, t)− V (S, t))] + σ2S ∂V

∂S

λSEJ [(J − 1)2] + σ2S

If we value the option as a pure discounted real expectation under thisbest-hedge strategy, then we have

E [dΠ] = rΠdt

Page 87: MA4257

6.4. JUMP DIFFUSION MODEL 77

or

∂V

∂t+

12σ2S2 ∂2V

∂S2+

(µ− σ2

d(µ + λk − r)

)∂V

∂S− rV

+ λEJ[(V (JS, t)− V (S, t))

(1− J − 1

d(µ + λk − r)

)]= 0

whered = λEJ

[(J − 1)2

]+ σ2 and k = EJ [J − 1]

When λ = 0 this recovers the Black-Scholes equation.

6.4.5 Summary

Jump diffusion models undoubtedly capture a real phenomenon that is miss-ing from the Black-Scholes model. Yet they are rarely used in practice.There are three main reasons for this:

(1) difficulty in parameter estimation. In order to use any pricing modelone needs to be able to estimate parameters. In the lognormal model thereis just the one parameter to estimate. This is just the right number. Morethan one parameter is too much work. The jump diffusion model in itssimplest form needs an estimate of probability of a jump, measured by λand its size J. This can be made more complicated by having a distributionfor J.

(2) difficulty in solution. The governing equation is no longer a diffusionequation (about the easiest problem to solve numerically), and is harderthan the solution of the basic Black-Scholes equation.

(3) impossibility of perfect hedging. Finally, perfect risk-free hedgingis impossible when there are jumps in the underlying. The use of a jump-diffusion model acknowledges that one’s hedge is less than perfect.

In fact the above remarks also apply to the stochastic volatility model.

Page 88: MA4257

78 CHAPTER 6. BEYOND THE BLACK-SCHOLES WORLD

Page 89: MA4257

Chapter 7

Interest Rate Derivatives

The riskless interest rate has been assumed to be constant in the option pric-ing models discussed in previous chapters. Such an assumption is acceptablewhen the life of an option is typically six to nine months.

This chapter is focused on the interest rate derivatives. Well knownexamples of such derivatives are bonds, bond futures, bonds options, swap,swaptions, and so on. Note that interest rates are used for discounting aswell as defining the payoff for some interest rate derivative products. Thevalues of these derivatives depend sensibly on the level of interest rates.In the construction of valuation models for these securities, it is crucial toincorporate the stochastic movement of interest rates into consideration.

The mathematics of this chapter is no different from what we have seenalready. Again we shall apply the ideas of hedging and no arbitrage.

7.1 Short-term interest rate modeling

7.1.1 The simplest bonds: zero-coupon/coupon-bearing bonds

The zero-coupon bond is a contract paying a known fixed amount, the prin-cipal, at some given date in the future, the maturity date T. For example,the bond pays $100 in 10 years’time. We’re going to scale this payoff, sothat in future all principals will be $1.

This promise of future wealth is worth something now: it cannot havezero or negative value. Furthermore, except in extreme circumstances, theamount we pay initially will be smaller than the amount we receive at ma-turity.

A coupon-bearing bond is similar to the above except that as well aspaying the principal at maturity, it pays smaller quantities, the coupons,

79

Page 90: MA4257

80 CHAPTER 7. INTEREST RATE DERIVATIVES

at intervals up to and including the maturity date. These coupons areusually prespecified fractions of the principal. For example, the bond pays$1 in 10 years and 2%, i.e. 2 cents, every six months. This bond is clearlymore valuable than the bond in the previous example because of the couponpayments. We can think of the coupon-bearing bond as a portfolio of zero-coupon bearing bonds: one zero-coupon bearing bond for each coupon datewith a principal being the same as the original bond’s coupon, and then afinal zero-coupon bond with the same maturity as the original.

7.1.2 Short-term interest rate

The interest rate that we will be modeling is known as a short-term interestrate or spot interest rate r(t). This means that the rate r(t) is to apply attime t : interest is compounded at this rate at each moment in time but thisrate may change with time.

If the short-term interest rate r(t) is a known function of time, then thebond price is also a function of time only: V = V (t;T ). We begin with azero-coupon bond example. Because we receive 1 at time t = T we knowthat V (T ) = 1. I now derive an equation fro the value of the bond at a timebefore maturity, t < T.

Suppose we hold one bond. The change in the value of that bond in atime-step dt (from t to t + dt) is

dV.

Arbitrage considerations again lead us to equate this with the return froma bank deposit receiving interest at a rate r(t) :

dV = r(t)V dt.

The solution of this equation is

V (t;T ) = e−∫ T

tr(τ)dτ .

As a matter of fact, the short rate interest rate can be thing of as theinterest rate from a money market account, which is usually unpredictable.To price interest rate derivatives, we must model the (short-term) interestrate as a stochastic variable. That is, we suppose the short term interestrate is governed by a stochastic different equation of the form

dr = u(r, t)dt + ω(r, t)dW.

The functional forms of u(r, t) and ω(r, t) determine the behavior of theshort-term rate r.

Page 91: MA4257

7.1. SHORT-TERM INTEREST RATE MODELING 81

7.1.3 The bond pricing equation

When interest rates are stochastic a bond has a price of the form V (r, t;T ).Since the interest rate r is not a traded asset, there is no underlying assetwith which to hedge. The only way to construct a hedged portfolio is byhedging one bond with a bond of a different maturity. This is exactly sameas what we have discussed about the option pricing on non-traded assets inSection 2.5.2.

We set up a portfolio containing two bonds with different maturities T1

and T2. The bond with maturity T1 has price V1(r, t;T1) and the bond withmaturity T2 has price V2(r, t;T2). We hold one of the former and a number−∆ of the latter. We have

Π = V1 −∆V2.

The change in this portfolio in a time dt is given by

dΠ = dt +∂V1

∂rdr

−∆

(∂V2

∂tdt +

12ω2 ∂2V2

∂r2dt +

∂V2

∂rdr

),

where we have applied Ito’s lemma to functions of r and t. By the choice

∆ =∂V1∂r∂V2∂r

eliminates all randomness in dΠ. We then have

dΠ =

(∂V1

∂t+

12ω2 ∂2V1

∂r2−

∂V1∂r∂V2∂r

(∂V2

∂t+

12ω2 ∂2V2

∂r2

))dt

= rΠdt = r

(V1 −

∂V1∂r∂V2∂r

V2

)dt,

where we have used arbitrage arguments to set the return on the portfolioequal to the risk-free rate. The risk-free rate is just the short-term rate.

Collecting all V1 terms on the left-hand side and all V2 terms on theright-hand side we find that

∂V1∂t + 1

2ω2 ∂2V1∂r2 − rV1

∂V1∂r

=∂V2∂t + 1

2ω2 ∂2V2∂r2 − rV2

∂V2∂r

.

Page 92: MA4257

82 CHAPTER 7. INTEREST RATE DERIVATIVES

The left-hand side is a function of T1 but not T2 and the right-hand side isa function T2 but not T1. The only way for this to be possible is for bothsides to be independent of the maturity date. Dropping the subscript fromV, we have

∂V1∂t + 1

2ω2 ∂2V1∂r2 − rV1

∂V1∂r

= −a(r, t)

for some function a(r, t). As discussed in Chapter 4, we denote a(r, t) =u(r, t)− λ(r, t)ω(r, t), where λ(r, t) is the market price of risk of r(t).

The bond pricing equation is therefore

∂V

∂t+

12ω2 ∂2V

∂r2+ (u− λω)

∂V

∂r− rV = 0. (7.1)

To find a unique solution we must impose final condition

V (r, T ;T ) = 1. (7.2)

Note that Eq (7.1) holds for any interest rate derivative whose value isonly a function of r and t. It indicates that the risk-neutral process of theshort-term rate is

dr = (u− λω)dt + ωdW (7.3)

7.1.4 Tractable models

We have built up the bond pricing equation for an arbitrary model. Thatis, we have not specified the risk-neutral drift, u − λω, or the volatility, ω.How can we choose these functions to give us a good model? Like in thestochastic volatility model, a simple lognormal random walk would not besuitable for r since it would predict exponentially rising or falling rates. Thisrules out the equity price model as an interest rate model. So we must thinkmore carefully how to choose the drift and volatility.

We hope the random walk (7.3) for r has the following nice properties:(1) Positive interest rates: Except for a few pathological cases, interest

rates are positive.(2) Mean reversion: The interest rate should tend to decrease towards

the mean for large r and tend to increase towards the mean for low r.In addition, we expect the solution of (7.1-7.2) for the zero-coupon bond

is of a simple form, for example

Z(r, t;T ) = eA(t;T )−rB(t;T ).

Some named models are given as follows:

Page 93: MA4257

7.1. SHORT-TERM INTEREST RATE MODELING 83

Vasicek

This is the first meaning reversion model.

dr = (η − γr)dt + cdW,

where η, γ, and c are constants. The value of a zero-coupon bond is givenby

Z(r, t;T ) = eA(t;T )−rB(t;T ),

where

B =1γ

(1− e−γ(T−t)

)

A =1γ2

(B(t;T )− T + t)(

ηγ − 12c2

)− c2

4γB2.

The drawback of this model is that the interest rate can easily go nega-tive.

Cox, Ingersoll & Ross

The risk-neutral process of the short-term interest rate takes the form of

dr = (η − γr)dt +√

αrdW,

where η, γ, and α are constants. The short-term interest rate is meanreverting and if η > α/2 the short-term interest rate stays positive. Explicitsolutions are also available for the zero-coupon bond (see Wilmott (1998) orHull (2003, P. 542-543)).

Ho & Lee

The risk-neutral process of the short-term interest rate is

dr = η(t)dt + cdW,

where η(t) and c are parameters. The value of zero-coupon bonds is givenby

Z(r, t;T ) = eA(t;T )−rB(t;T )

whereB = T − t

Page 94: MA4257

84 CHAPTER 7. INTEREST RATE DERIVATIVES

and

A = −∫ T

tη(s)(T − s)ds +

16c2(T − t)3.

Ho-Lee model is the first ‘no-arbitrage model’. This means that thecareful choice of the function η(t) will result in theoretical zero-coupon bondsprices, output by the model, which are the same as markets prices. Thistechnique is also called yield curve fitting. This careful choice is

η(t) = − ∂2

∂t2log ZM (t∗; t) + c2(t− t∗)

where today is time t = t∗. In this ZM (t∗;T ) is the market price todayof zero-coupon bonds with maturity T. Clearly this assumes that there arebonds of all maturities and that the prices are twice differentiable withrespect to the maturity. We will see why this should give the ‘correct’ priceslater.

The drawback of this model is that the process followed by the interestrate is not mean-reverting.

Hull & White

Hull and White have extended the Vasicek model to incorporate time-dependent parameters.

dr = (η(t)− γr)dt + cdW.

Under this risk-neutral process the value of zero-coupon bonds

Z(r, t;T ) = eA(t;T )−rB(t;T ),

where

A(t;T ) = −∫ T

tη(s)B(s;T )ds+

c2

2γ2

(T − t +

e−γ(T−t) − 12γ

e−2γ(T−t) − 32γ

)

and

B(t;T ) =1γ

(1− e−γ(T−t)

).

Here the time-dependent parameter η(t) can also be identified from thetechnique of yield curve fitting given as follows.

Page 95: MA4257

7.1. SHORT-TERM INTEREST RATE MODELING 85

7.1.5 Yield Curve Fitting

Let us consider the Ho & Lee short term model. It is the simplest that canbe used to fit the yield curve. It will be useful to examine this model indetail to see one way in which fitting is done in practice.

In this model, the solution of bond pricing equation for a zero-couponbond is simply

Z(r, t;T ) = eA(t;T )−r(T−t)

where

A = −∫ T

tη(s)(T − s)ds +

16c2(T − t)3.

If we know η(t) then the above gives us the theoretical value of zero-couponbonds of all maturities. Now turn this relationship around and ask the ques-tion “What functional form must we choose for η(t) to make the theoreticalvalues of the zero-coupon bonds for all maturities equal to the market val-ues” Call this special choice for η, η∗(t). The yield curve is to be fitted today,t = t∗, when the spot interest rate is r∗ and values of the zero-coupon bondsin the market are ZM (t∗;T ). To match the market and theoretical bondprices, we must have

ZM (t∗;T ) = eA(t∗;T )−r∗(T−t∗).

Taking logarithms of this and rearranging slightly we get∫ T

t∗η∗(s)(T − s)ds = − log (ZM (t∗;T ))− r∗(T − t∗) +

16c2(T − t∗)3. (7.4)

Observe that we are carrying around in the notation today’s date t∗. Thisis a constant but we want to emphasize that we are doing the calibration totoday’s yield curve. If we calibrate again tomorrow, the market yield curvewill have changed.

Differentiate Eq (7.4) with respect to T to get∫ T

t∗η∗(s)ds = − ∂

∂Tlog (ZM (t∗;T ))− r∗ +

12c2(T − t∗)2.

Differentiating the above equation again with respect to T, we have

η∗(T ) = − ∂2

∂T 2log (ZM (t∗;T )) + c2(T − t∗).

With this choice for the time-dependent parameter η∗(t), the theoretical andactual market prices of zero-coupon bonds are the same.

The same arguments can be applied to the Hull-White Model.We refer Wilmott (1998) to a discussion about the advantages and dis-

advantages of yield curve fitting (Chapter 34).

Page 96: MA4257

86 CHAPTER 7. INTEREST RATE DERIVATIVES

7.1.6 Empirical behavior of the short rate and other models

Observe that all of above named models take the form

dr = (...)dt + crβdW

and give zero-coupon bond prices of the form

Z(r, t;T ) = eA(t,T )−rB(t,T ).

Examples are Ho & Lee (β = 0), Vasicek (β = 0) and Cox, Ingersoll &Ross (β = 1

2 ). Since the short rate in the risk-neutral world shares the samevolatility as in the real world, we are able to estimate the coefficient β fromhistorical data of the short rate. Chan, Karolyi, Longstaff & Sanders (1992)obtain the estimate β = 1.36 from US data. This agrees with the experiencesof many practitioners, who say that in practice the relative spot rate changedr/r is insensitive to the level of r.

We can think of models with

dr = ... + cdW

as having a Normal volatility structure and those with

dr = ... + crdW

as having a lognormal volatility structure. In reality it seems that the shortrate is closer to the lognormal than to the Normal models. This puts thefollowing BDT model ahead of the options.

In the Black, Derman & Toy (BDT) model, the risk-neutral short ratesatisfies

d (ln r) =(

η(t)− σ′(t)σ(t)

ln r

)dt + σ(t)dW.

The two functions of time σ and η allow both zero-coupon bonds and theirvolatilities to be matched. There are no explicit solutions of the zero-couponbond prices for this model, but the yield fitting can be done by numericalschemes.

An even more general model is the Black & Karasinski model

d (ln r) = (η(t)− a(t) ln r) dt + σ(t)dW.

Page 97: MA4257

7.2. HJM MODEL 87

7.1.7 Coupon-bearing bond pricing

Suppose coupons are paid at time ti, i = 1, 2, ..., n. (Let t0 = 0, tn = T ). LetV = V (r, t) be the value of the bond. Then

∂V

∂t+

12ω2 ∂2V

∂r2+ (u− λω)

∂V

∂r− rV = 0, for ti−1 < t < tn, i = 1, ..., n

V (r, ti−) = V (r, ti+) + ci, for i = 1, ..., n− 1V (r, T ) = 1 + cn

Here ci is the coupon at time ti.Note that the values of zero-coupon bonds with scaled payoff are essen-

tially the discounting factors. If we have known the values of the zero-couponbonds with all maturities, then the value of the coupon-bearing bond canbe represented as

V (r, t;T ) = Z(r, t;T ) +∑

ti≥t

ciZ(r, t; ti).

Actually, we never use the above formula to value coupon-bearing bonds.Conversely, we take advantage of it to find the values of zero-coupon bondsbecause all long term bonds are coupon-bearing. That is the so-called boot-strap method (see Hull (2003), page 96).

7.2 HJM Model

The short-term interest models are widely used for pricing interest rateinstruments when the simpler Black-Scholes models are inappropriate. Onelimitation of the model is that they do not give the user complete freedomin choosing the volatility structure.

The HJM model was a major breakthrough in the pricing of interestrate products. Instead of modeling a short-term interest rate and derivingthe forward rates (or, equivalently, the yield curve) from that model, Heath,Jarrow & Morton boldly start with a model for the whole forward ratecurve. Since the forward rates are known today, the matter of yield-curvefitting is contained naturally within their model, it does not appear as anafterthought. Moreover, it is possible to take real data from the real datafor the random movement of the forward rates and incorporate them intothe derivative-pricing methodology.

Page 98: MA4257

88 CHAPTER 7. INTEREST RATE DERIVATIVES

7.2.1 The forward rate

Let Z(t;T ) be the price of a zero-coupon bond at time t and maturing attime T, when it pays $1. Let Y (t;T ) be the continuously compounded yieldof the zero-coupon bond. Then, by definition,

Z(t;T ) = exp (−Y (t;T )(T − t))

or

Y (t;T ) = − lnZ(t;T )T − t

.

Let f(t;T, T + δ) be the forward rates as seen at time t for the periodbetween time T and time T + δ. By definition, we have

Z(t;T + δ) = Z(t;T ) exp (−f(t;T, T + δ)δ)

or

f(t;T, T + δ) = − lnZ(t;T + δ)− lnZ(t;T )δ

.

The (instantaneous) forward rate curve F (t;T ) at time t is defined by

F (t;T ) = limδ→0

f(t;T, T + δ) = − ∂

∂TlnZ(t;T ). (7.5)

So

Z(t;T ) = exp

(−

∫ T

tF (t; s)ds

).

The spot rate can be simply represented by the forward rate for a ma-turity equal to the current date, i.e.

r(t) = F (t; t).

7.2.2 HJM model

The key concept in the HJM model is that we model the evolution of thewhole forward rate curve, not just the short end.

Let us assume that in a real world all zero-coupon bonds evolve accordingto

dZ(t;T ) = µ(t, T )Z(t;T )dt + σ(t, T )Z(t;T )dW

At this point d· means that time t evolves but the maturity date T is fixed.This is not much of an assumption.

Page 99: MA4257

7.2. HJM MODEL 89

Note that since Z(t; t) = 1, we must have

σ(t, t) = 0.

From (7.5) we have

dF (t;T ) =∂

∂T

(12σ2(t, T )− µ(t, T )

)dt− ∂σ(t, T )

∂TdW.

All of the variables we have introduced above have been real variables.But when we come to pricing derivatives we must do so in the risk-neutralworld. Since a zero-coupon bond is a traded asset, the drift of Z(t;T ) in arisk-neutral world must satisfy

µ(t, T ) = r(t),

where r(t) is the spot rate in a risk-neutral world. In other words, therisk-neutral dynamics for discount factors must be

dZ(t;T ) = r(t)Z(t;T )dt + σ(t, T )Z(t;T )dW.

Note that the spot rate r(t) is itself determined by the Z(t;T ).The risk-neutral process of the forward rate is then given as

dF (t;T ) =∂

∂T

(12σ2(t, T )− r(t)

)dt− ∂σ(t, T )

∂TdW

= σ(t;T )∂σ(t, T )

∂Tdt− ∂σ(t, T )

∂TdW.

Let us denotev(t, T ) = −∂σ(t, T )

∂T.

Then

σ(t, T )∂σ(t, T )

∂T= v(t, T )

∫ T

tv(t, s)ds,

where we have used σ(t, t) = 0.So, in the risk-neutral world, the forward rate curve follows

dF (t;T ) = m(t, T )dt + v(t, T )dW, (7.6)

where

m(t, T ) = v(t, T )∫ T

tv(t, s)ds.

This is known as the HJM model. It indicates that there is a link betweenthe drift of and the standard deviation (volatility) of the risk-neutral forwardrate process

Page 100: MA4257

90 CHAPTER 7. INTEREST RATE DERIVATIVES

7.2.3 The non-Markov nature of HJM

In the short-term interest rate model, we assume the spot rate follows anIto process. Let us look at what process the spot rate follows in the HJMmodel.

Suppose today is t∗ and that we know the whole forward rate curvetoday, F (t∗;T ). We can write the spot rate for any time t in the future as

r(t) = F (t; t) = F (t∗; t) +∫ t

t∗dF (s; t)

= F (t∗; t) +∫ t

t∗m(s, t)ds + v(s, t)dW.

Differentiating this with respect to time t we arrive at the stochastic differ-ential equation for r

dr =(

.F (t∗; t) + m(t, t) +

∫ t

t∗

.m (s, t)ds +

∫ t

t∗

.v (s, t)dW

)dt + v(t, t)dW.

In a Markov process it is only the present state of a variable that deter-mines the possible future state. Note that in the above expression the lastterm in the drift

∫ tt∗

.m (s, t)dW is random, depending on the history of the

stochastic increments dW. Therefore, for a general HJM model it makes themotion of the spot rate non-Markov.

Having a non-Markov model may not matter to us if we can find a smallnumber of extra state variables that contain all the information that we needfor predicting the future. Recall the pricing of an Asian option where thehistoric average is involved. But we can define the average as a new variable,and then derive the pricing PDE equations or the binomial tree methods.Unfortunately, the general HJM model requires an infinite number of suchvariables to define the present state; if we were to write the HJM modelas a PDE we would need an infinite number of independent variables. Ifwe attempt to use a binomial method to implement the HJM model, theamount of computation will grow exponentially.

7.2.4 Pricing derivatives

Pricing derivatives is all about finding the expected present value of all cashflows in a risk-neutral framework with the number of time step. Explicitsolutions of derivatives’ prices are always rare. Because of the non-Markovnature of HJM in general a PDE approach and a BTM are unfeasible. As aresult, we have to adopt the Monte-Carlo simulation.

Page 101: MA4257

7.2. HJM MODEL 91

To price a derivative using a Monte Carlo simulation perform the fol-lowing steps. Assume that we have chosen a model for the forward ratevolatility, v(t, T ) for all T. Today is t∗ when we know the forward rate curveF (t∗;T ).

1. Simulate a realized evolution of the risk-neutral forward rate for thenecessary length of time until T ∗, say. This requires a simulation of Eq (7.6).After this simulation we will have a realization of F (t;T ) for t∗ ≤ t ≤ T andT ≥ t. We will have a realization of the whole forward rate path.

2. Using this forward rate path calculate the value of all the cashflowsthat would have occurred.

3. Using the realized path for the spot interest rate r(t) calculate thepresent value of these cashflows. Note that we discount at the continuouslycompounded risk-free rate, not at any other rate. In the risk-neutral worldall assets have expected return r(t).

4. Return to Step 1 to perform another realization, and continue untilwe have a sufficiently large number of realizations to calculate the expectedpresent value as accurately as required.

The disadvantage of the above algorithm is that such a simulation canbe very slow. In addition, it cannot easily be used for American-style deriva-tives.

7.2.5 A special case of HJM model: Ho & Lee

We make a comparison between the spot rate modelling of the HJM modeland the spot rate model.

In Ho & Lee the risk-neutral spot rate satisfies

dr = η(t)dt + cdW

for a constant c. The value of zero-coupon bonds is

Z(t;T ) = eA(t;T )−rB(t;T )

whereB = T − t

and

A = −∫ T

tη(s)(T − s)ds +

16c2(T − t)3.

So

F (t;T ) = − ∂

∂Tlog Z(t;T ) = − ∂

∂T[A(t;T )− rB(t;T )]

=∫ T

tη(s)ds− 1

2c2(T − t)2 + r.

Page 102: MA4257

92 CHAPTER 7. INTEREST RATE DERIVATIVES

Then

dF (t;T ) =[−η (t) + c2(T − t)

]dt + dr

=[−η (t) + c2(T − t)

]dt + η(t)dt + cdW

= c2(T − t)dt + cdW

=

(c

∫ T

tcds

)dt + cdW.

This confirms that the forward rate in the Ho-Lee model has the struc-ture of HJM model. Hence Ho-Lee model is a special case of HJM model.Actually, most of popular models discussed before have HJM representa-tions.

7.2.6 Concluding remarks

There are some drawbacks of the HJM model: (1) The instantaneous for-ward rates involved are not directly observable in the market; (2) It is noteasy to calibrate it to prices of actively traded instruments such as caps. Analternative model proposed by Brace, Gatarek, and Musiela (BGM) over-comes this problem. It is known as the LIBOR market model (see Hull(1998) or Wilmott (2000)). It can be thought of as a discrete version of theHJM model in the LIBOR market. To understand the model, the knowledgeof stochastic differential equations and martingales is required. As a matterof fact, to have better understanding of interest rate models, we must learnmore about stochastic calculus.