commodity derivatives: modeling and pricing

151
HAL Id: pastel-00712137 https://pastel.archives-ouvertes.fr/pastel-00712137 Submitted on 26 Jun 2012 HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entiļ¬c research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. Lā€™archive ouverte pluridisciplinaire HAL, est destinĆ©e au dĆ©pĆ“t et Ć  la diļ¬€usion de documents scientiļ¬ques de niveau recherche, publiĆ©s ou non, Ć©manant des Ć©tablissements dā€™enseignement et de recherche franƧais ou Ć©trangers, des laboratoires publics ou privĆ©s. Commodity Derivatives: Modeling and Pricing Zaizhi Wang To cite this version: Zaizhi Wang. Commodity Derivatives: Modeling and Pricing. Economics and Finance. Ɖcole Na- tionale SupĆ©rieure des Mines de Paris, 2011. English. NNT : 2011ENMP0088. pastel-00712137

Upload: others

Post on 27-Oct-2021

5 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Commodity Derivatives: Modeling and Pricing

HAL Id: pastel-00712137https://pastel.archives-ouvertes.fr/pastel-00712137

Submitted on 26 Jun 2012

HAL is a multi-disciplinary open accessarchive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come fromteaching and research institutions in France orabroad, or from public or private research centers.

Lā€™archive ouverte pluridisciplinaire HAL, estdestinĆ©e au dĆ©pĆ“t et Ć  la diffusion de documentsscientifiques de niveau recherche, publiĆ©s ou non,Ć©manant des Ć©tablissements dā€™enseignement et derecherche franƧais ou Ć©trangers, des laboratoirespublics ou privĆ©s.

Commodity Derivatives: Modeling and PricingZaizhi Wang

To cite this version:Zaizhi Wang. Commodity Derivatives: Modeling and Pricing. Economics and Finance. Ɖcole Na-tionale SupĆ©rieure des Mines de Paris, 2011. English. NNT : 2011ENMP0088. pastel-00712137

Page 2: Commodity Derivatives: Modeling and Pricing

NĀ°: 2009 ENAM XXXX

MINES ParisTech Centre CERNA

60, Boulevard Saint-Michel 75272 PARIS cedex 06

TT

HH

EE

SS

EE

Ɖcole doctorale nĀ° 396 : Ɖconomie, Organisation, SociĆ©tĆ©

prƩsentƩe et soutenue publiquement par

Zaizhi WANG

le 14/12/2011

Produits DĆ©rivĆ©s des MatiĆØres PremiĆØres:

ModƩlisation et Evaluation

Commodity Derivatives:

Modeling and Pricing

Doctorat ParisTech

T H ƈ S E

pour obtenir le grade de docteur dƩlivrƩ par

lā€™Ć‰cole nationale supĆ©rieure des mines de Paris

SpĆ©cialitĆ© ā€œ Finance Quantitative ā€

Directeur de thĆØse : Alain GALLI

Co-encadrement de la thĆØse : Eric BENHAMOU

T

H

ƈ

S

E

Jury

Mme. Delphine LAUTIER, Professeur, UniversitƩ Paris Dauphine Rapporteur

M. Bernard LAPEYRE, Professeur, Ecole des Ponts ParisTech Rapporteur

M. RenƩ AID, Directeur ExƩcutif, EDF - Finance for Energy Market Research Centre Examinateur

M. Eric BENHAMOU, CEO, Pricing Partners Examinateur

M. Alain GALLI, directeur de recherche, ENSMP Examinateur

M. Michel SCHMITT, ingƩnieur gƩnƩral des Mines, ENSMP Examinateur

Page 3: Commodity Derivatives: Modeling and Pricing
Page 4: Commodity Derivatives: Modeling and Pricing

i

Abstract

Commodity prices have been rising at an unprecedented pace overthe last years making commodity derivatives more and more popular inmany sectors like energy, metals and agricultural products. The quickdevelopment of commodity market as well as commodity derivative marketresults in a continuously uprising demand of accuracy and consistency incommodity derivative modeling and pricing.

The specification of commodity modeling is often reduced to an appro-priate representation of convenience yield, intrinsic seasonality and meanreversion of commodity price. As a matter of fact, convenience yield canbe extracted from forward strip curve and then be added as a drift terminto pricing models such as Black Scholes model, local volatility model andstochastic volatility model. Besides those common models, some specificcommodity models specially emphasize on the importance of convenienceyield, seasonality or mean reversion feature. By giving the stochasticity toconvenience yield, Gibson Schwartz model interprets the term structure ofconvenience yield directly in its model parameters, which makes the modelextremely popular amongst researchers and market practitioners in com-modity pricing. Gabillon model, in the other hand, focuses on the featureof seasonality and mean reversion, adding a stochastic long term price tocorrelate spot price.

In this thesis, we prove that there is mathematical equivalence relationbetween Gibson Schwartz model and Gabillon model. Moreover, inspiredby the idea of Gyƶngy, we show that Gibson Schwartz model and Gabillonmodel can reduce to one-factor model with explicitly calculated marginaldistribution under certain conditions, which contributes to find the analyticformulas for forward and vanilla options. Some of these formulas are newto our knowledge and other formulas confirm with the earlier results ofother researchers.

Indeed convenience yield, seasonality and mean reversion play a veryimportant role, but for accurate pricing, hedging and risk management,it is also critical to have a good modeling of the dynamics of volatility incommodity markets as this market has very fluctuating volatility dynamics.While the formers (seasonality, mean reversion and convenience yield) havebeen highly emphasized in the literature on commodity derivatives pricing,the latter (the dynamics of the volatility) has often been forgotten. The fam-ily of stochastic volatility model is introduced to strengthen the dynamicsof the volatility, capturing the dynamic smile of volatility surface thanks toa stochastic process on volatility itself. It is a very important characteristicfor pricing derivatives of long maturity. Stochastic volatility model alsocorrects the problem of opposite underlying-volatility correlation againstmarket data in many other models by introducing correlation parameterexplicitly. The most popular stochastic volatility models include Hestonmodel, Piterbarg model, SABR model, etc.

As pointed out by Piterbarg, the need of time-dependent parameters instochastic volatility models is real and serious. It is because in one hand

Page 5: Commodity Derivatives: Modeling and Pricing

ii

stochastic volatility models with constant parameters are generally inca-pable of fitting market prices across option expiries, and in the other handexotics do not only depend on the distribution of the underlying at theexpiry, but on its dynamics through all time. This contradiction implies thenecessity of time-dependent parameters. In this thesis, we extend Piter-bargā€™s idea to the whole family of stochastic volatility model, making all thestochastic volatility models having time-dependent parameters and showvarious formulas for vanilla option price by employing various techniquessuch as characteristic function, Fourier transform, small error perturbation,parameter averaging, etc.

Page 6: Commodity Derivatives: Modeling and Pricing

iii

RƩsumƩ

Les prix des matiĆØres premiĆØres ont augmentĆ© Ć  un rythme sans prĆ©cĆ©-dent au cours des derniĆØres annĆ©es rendant les dĆ©rivĆ©s sur matiĆØres pre-miĆØres de plus en plus populaires dans de nombreux secteurs commelā€™Ć©nergie, les mĆ©taux et les produits agricoles. Le dĆ©veloppement rapidedu marchĆ© des produits dĆ©rivĆ©s sur matiĆØres premiĆØres a aussi induit unerecherche vers toujours plus de prĆ©cision et cohĆ©rence dans la modĆ©lisationet lā€™Ć©valuation de produits dĆ©rivĆ©s des matiĆØres premiĆØres.

Les points les plus importants dans la modĆ©lisation des matiĆØres pre-miĆØres sont la bonne reprĆ©sentation du rendement dā€™opportunitĆ© appelĆ©communĆ©ment Ā«convenience yield Ā», la prise en compte de la saisonnalitĆ©et la capture du phĆ©nomĆØne de retour Ć  la moyenne pour les prix des ma-tiĆØres premiĆØres. Il est Ć  noter que le rendement dā€™opportunitĆ© (convenienceyield ) peut ĆŖtre induit du prix des la courbe des forwards et ĆŖtre simplementajoutĆ© au terme dā€™Ć©volution(terme de drift) dans les modĆØles canoniques,comme le modĆØle de Black Scholes, le modĆØle Ć  volatilitĆ© locale et les mo-dĆØles Ć  volatilitĆ© stochastique. An delĆ  de ces modĆØles, dā€™autres modĆØlesont Ć©tĆ© conƧus pour modĆ©liser spĆ©cifiquement lā€™Ć©volution du convenienceyield, la saisonnalitĆ© ou le phĆ©nomĆØne de retour Ć  la moyenne des prix. Ilsā€™agit par exemple du modĆØle de Gibson Schwartz qui suppose que le termede convenience yield est alĆ©atoire. Cette approche prend donc en comptelā€™Ć©volution non dĆ©terministe du convenience yield et lā€™interprĆØte commeun paramĆØtre critique du modĆØle. Ceci explique sa grande popularitĆ© etson adoption important par les praticiens du marchĆ©. Un autre modĆØlefrĆ©quemment utilisĆ© est le modĆØle de Gabillon. Celui se concentre sur lasaisonnalitĆ© des prix et lā€™effet de retour a la moyenne, en modĆ©lisant un prixĆ  long terme stochastique corrĆ©lĆ© aux prix du spot. Dans cette thĆØse, nousprouvons que ces deux approches ne sont en fait quā€™une et quā€™il y a unerelation dā€™Ć©quivalence entre le modĆØle de Gibson Schwartz et le modĆØle deGabillon. Reposant sur le principe de diffusion Ć©quivalente introduite parGyƶngy, nous montrons que le modĆØle de Gibson Schwartz et le modĆØlede Gabillon peuvent se rĆ©duire Ć  un modĆØle Ć  un facteur dont la distribu-tion marginale peut ĆŖtre explicitement calculĆ©e sous certaines conditions.Ceci nous permet en particulier de trouver des formules analytiques pourlā€™ensemble des options vanilles. Certaines de ces formules sont nouvelles Ć notre connaissance et dā€™autres confirment des rĆ©sultats antĆ©rieurs.

Dans un second temps, nous nous intĆ©ressons Ć  la bonne modĆ©lisationde la dynamique de la volatilitĆ© des marchĆ©s des matiĆØres premiĆØres. Eneffet, les marchĆ©s de matiĆØres premiĆØres sont caractĆ©risĆ©s par des volatilitĆ©strĆØs fluctuantes et importante. Alors que les effets sur la saisonnalitĆ©, lamodĆØlisation du convenience yield et lā€™effet de retour Ć  la moyenne des prixont Ć©tĆ© fortement soulignĆ©s dans la littĆ©rature, la bonne modĆ©lisation dela dynamique de la volatilitĆ© a souvent Ć©tĆ© oubliĆ©e. La famille de modĆØle Ć volatilitĆ© stochastique est introduite pour renforcer la dynamique de la vola-tilitĆ©, capturant le phĆ©nomĆØne de smile de la surface de volatilitĆ© grĆ¢ce Ć  unprocessus stochastique pour la volatilitĆ©. Cā€™est une caractĆ©ristique trĆØs im-

Page 7: Commodity Derivatives: Modeling and Pricing

iv

portante pour les dĆ©rivĆ©s Ć  maturitĆ© longue oĆ¹ lā€™effet volatilitĆ© stochastiqueconduit Ć  des rĆ©sultats trĆØs diffĆ©rents de ceux obtenus avec des modĆØlesplus conventionnels. Les modĆØles Ć  volatilitĆ© stochastique permettent ausside prendre en compte le phĆ©nomĆØne de corrĆ©lation nĆ©gative entre le sous-jacent et la volatilitĆ© en introduisant de maniĆØre explicite ce paramĆØtre decorrĆ©lation. Les modĆØles Ć  volatilitĆ© stochastique les plus populaires com-prennent le modĆØle dā€™Heston, le modĆØle de Piterbarg, le modĆØle de SABR,etc.

Dans cette thĆØse, nous Ć©tendons les idĆ©es de Piterbarg Ć  la famille desmodĆØles Ć  volatilitĆ© stochastique en rendant le concept plus gĆ©nĆ©ral. Nousmontrons en particulier comment introduire des paramĆØtres dĆ©pendant dutemps dans les modĆØles Ć  volatilitĆ© stochastique et explicitons diffĆ©rentesformules de calcul explicite du prix dā€™options vanilles, permettant ainsi unecalibration des paramĆØtres du modĆØles extrĆŖmement efficace.

Page 8: Commodity Derivatives: Modeling and Pricing

Contents

Contents v

1 Introduction 1

2 Modeling commodity with convenience yield 72.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82.2 Review of research and background . . . . . . . . . . . . . . . . . 8

2.2.1 Convenience yield . . . . . . . . . . . . . . . . . . . . . . . 82.2.2 Gibson Schwartz model . . . . . . . . . . . . . . . . . . . . 102.2.3 Gabillon model . . . . . . . . . . . . . . . . . . . . . . . . . 112.2.4 Previous research . . . . . . . . . . . . . . . . . . . . . . . . 11

2.3 Equivalence between Gibson Schwartz model and Gabillon model 122.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

3 Model factor reduction technique 173.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183.2 Review of research . . . . . . . . . . . . . . . . . . . . . . . . . . . 183.3 Model factor reduction technique in general form . . . . . . . . . 193.4 Reduced one-factor Gabillon model . . . . . . . . . . . . . . . . . 253.5 Reduced one-factor Gibson Schwartz Model . . . . . . . . . . . . 293.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32

4 Stochastic volatility model 354.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 374.2 Review of research . . . . . . . . . . . . . . . . . . . . . . . . . . . 384.3 Heston model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40

4.3.1 Formula for vanilla option . . . . . . . . . . . . . . . . . . . 414.3.2 Control variate in Heston model . . . . . . . . . . . . . . . 444.3.3 Discussion of numerical stability . . . . . . . . . . . . . . . 494.3.4 Dynamic Heston Model . . . . . . . . . . . . . . . . . . . . 50

4.4 Piterbarg model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53

v

Page 9: Commodity Derivatives: Modeling and Pricing

CONTENTS vi

4.4.1 Diffusion and formula for vanilla option . . . . . . . . . . 544.4.2 Piterbarg Dynamic model . . . . . . . . . . . . . . . . . . . 564.4.3 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64

4.5 SABR model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 644.5.1 Diffusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 644.5.2 Static SABR model . . . . . . . . . . . . . . . . . . . . . . . 664.5.3 Dynamic SABR model . . . . . . . . . . . . . . . . . . . . . 664.5.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72

4.6 Vol of vol expansion . . . . . . . . . . . . . . . . . . . . . . . . . . 734.6.1 Framework . . . . . . . . . . . . . . . . . . . . . . . . . . . 734.6.2 The expansion technique . . . . . . . . . . . . . . . . . . . 744.6.3 Deriving the series . . . . . . . . . . . . . . . . . . . . . . . 754.6.4 Example of vol of vol expansion: Heston model . . . . . . 76

4.7 Calibration of Stochastic Volatility Model . . . . . . . . . . . . . . 784.7.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 784.7.2 Calibration order . . . . . . . . . . . . . . . . . . . . . . . . 834.7.3 Numerical method . . . . . . . . . . . . . . . . . . . . . . . 86

4.8 Numerical analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . 1024.8.1 Market data: crude oil . . . . . . . . . . . . . . . . . . . . . 1024.8.2 Calibration Heston model with constant parameters . . . . 1034.8.3 Calibration Heston model with time-dependent parameters 1044.8.4 Sensibility of parameters . . . . . . . . . . . . . . . . . . . . 110

4.9 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113

5 Conclusion 115

Bibliography 117

A Proof of propositions 123A.1 Proof of proposition 25 . . . . . . . . . . . . . . . . . . . . . . . . . 123A.2 Proof of proposition 26 . . . . . . . . . . . . . . . . . . . . . . . . . 125A.3 Proof of proposition 28 . . . . . . . . . . . . . . . . . . . . . . . . . 128

List of Symbols and Abbreviations 133

List of Figures 135

List of Tables 137

Index 139

Page 10: Commodity Derivatives: Modeling and Pricing

Acknowledgements

This dissertation would not have been possible without the guidance and thehelp of several individuals who in one way or another contributed and extendedtheir valuable assistance in the preparation and completion of this study.

First and foremost, I am heartily thankful to my supervisor, Alain Galli,whose encouragement, guidance and support from the initial to the final levelenabled me to develop an understanding of the subject.

Doctor Eric Benhamou, CEO of Pricing Partners, who is also the enterprisetutor of the CIFFRE thesis project, I would like to thank for his support in theproject with his outstanding insight and useful advices.

Margaret Armstrong, professor of Department of Quantitative Finance atMines ParisTech, for her excellent teaching in financial market class and contin-uous encouragement during the project.

Matthieu Glachant, Director of Cerna, Professor at Mines ParisTech, for hisinteresting seminar and valuable advice in my internal presentation.

Vladimir Piterbarg, Head of Quantitative Analytics at Barclays Capital, forhis quick reply in email to help me understand his paper.

Changyin Huang and Sylvain Le Corff, trainees in Pricing Partners, fortheir help on implementing and testing the algorithms and formulas duringthe project.

Mohammed Miri and Pierre Gauthier, quantitative analysts in Pricing Part-ners, for sharing their insights.

SĆ©saria Ferreira, administrative head of CERNA, Mines ParisTech, for heruntiring effort in encouraging the teaching staff to pursue professional growth.Likewise the staff of the Deanā€™s Office for their relaying every communication.

Last but not the least, I offer my regards and blessings to my family, myfriends and all of those who supported me in any respect during the completionof the project.

vii

Page 11: Commodity Derivatives: Modeling and Pricing
Page 12: Commodity Derivatives: Modeling and Pricing

Chapter 1

Introduction

RƩsumƩ du chapitre

Les matiĆØres premiĆØres sont un sous-jacent important en finance, cette impor-tance sā€™est accrue avec lā€™Ć©mergence du processus de globalisation dans le monde.Les marchĆ©s des matiĆØres premiĆØres telles que le pĆ©trole, les mĆ©taux prĆ©cieux etlā€™agriculture ont une influence trĆØs large sur lā€™Ć©conomie mondiale.

Le premier marchĆ© Ć  terme organisĆ©, moderne a commencĆ© en 1710 Ć  la Boursede Dojima Rice, Ć  Osaka, au Japon. Lā€™Ć©change a Ć©tĆ© utilisĆ© pendant prĆØs de 300ans jusquā€™Ć  la Seconde Guerre mondiale. Il a ensuite Ć©tĆ© dissous entiĆØrement en1939, Ć©tant absorbĆ© par le riz Agence gouvernement.

Au 19ĆØme siĆØcle, un contrat Ć  terme est apparu aux Ɖtats-Unis. Des produitsagricoles comme le maĆÆs et le bĆ©tail, ont Ć©tĆ© Ć©changĆ©s par contrat Ć  terme deChicago et du Midwest. Le Chicago Board of Trade (CBOT) a Ć©tĆ© fondĆ© en 1848.Le premier contrat Ć©tait un contrat Ć  terme sur le maĆÆs, Ć©crit le 13 Mars, 1851. LeChicago Mercantile Exchange (CME) a Ć©tĆ© crĆ©Ć© en 1874, sous le nom de ChicagoProduit Exchange, puis rĆ©organisĆ© en 1919 ou il a pris son nom actuel.

Ce marchĆ© initialement des matiĆØres premiĆØres agricoles sā€™Ć©tend progressi-vement, couvrant dā€™autres produits comme le charbon, le pĆ©trole, les mĆ©tauxprĆ©cieux, etc. Ces produits sont commercialisĆ©s dans diffĆ©rentes bourses partoutdans le monde. En terme de volume, le pĆ©trole brut et ses dĆ©rivĆ©s dĆ©passe untiers du volume traitĆ©.

1

Page 13: Commodity Derivatives: Modeling and Pricing

CHAPTER 1. INTRODUCTION 2

Commodity is an important kind of financial asset among others, especiallyunder the process of globalization all over the world. The markets of commoditysuch as fossil fuels, precious metal and agricultural have a very wide and radicalinfluence on the worldā€™s economy.

The first modern organized futures exchange began in 1710 at the DojimaRice Exchange in Osaka, Japan, according to West [2000]. The exchange wasused for nearly 300 years till the World War II. It was then dissolved entirely in1939, being absorbed into the Government Rice Agency.

In the 19th century, forward contract appeared in the United States. Agri-cultural commodity, such as corn and cattle, was traded by forward contractin Chicago and Midwest. Chicago Board of Trade (CBOT) was founded in1848. The first contract was a forward contract on corn, written on March 13,1851. Chicago Mercantile Exchange (CME) was established in 1874. The originalname was ā€œChicago Produce Exchangeā€ and then reorganized in 1919 with thename Chicago Mercantile Exchange.

The commodity market gradually extends beyond agricultural, coveringother commodities such as coal, petroleum, precious metal, etc. Nowadays,the trading commodity in exchange are shown in table 1.1. These commoditiesare traded in various exchanges all over the world. In term of volume, crude oiland its derivative exceeds one third of total volume in all kinds of commodity.We list hereby some of the major commodity exchanges in the world in table 1.2.The major indices in commodity market are shown in table 1.3.

Table 1.1: The commodity categories

Category DescriptionAgricultural soybean, corn, wheat, cocoa, coffee, sugar, cotton, cit-

rus, orange juice, cattle, hogs, pork bellies, etc.Metal gold, silver, platinum, palladium, copper, tin, zinc,

nickel, aluminum, etc.Oil and oil production crude oil, Liquefied Petroleum Gas (LPS), gasoline,

naphtha, kerosene, diesel, fuel oil, etc.Gas and gas production gas and Liquefied Natural Gas (LNG).ElectricityCoal and CO2 emission

The commodity trading volume has grown to a very large scale. The trad-ing volume of Intercontinental Exchange (ICE), a leading operator of regulatedglobal futures exchanges, clearing houses and over-the-counter (OTC) markets,reported strong futures volume growth in November 2010. Average daily vol-ume (ADV) across ICEā€™s futures exchanges was 1,366,669 contracts, an increaseof 26% from November 2009. Year-to-date through November 30, 2010, ICEā€™s

Page 14: Commodity Derivatives: Modeling and Pricing

CHAPTER 1. INTRODUCTION 3

Table 1.2: Some commodity exchanges

Abuja Securities and Commodities ExchangeAfrica Mercantile ExchangeBhatinda Om & Oil Exchange BathindaBrazilian Mercantile and Futures ExchangeChicago Board of TradeChicago Mercantile ExchangeCommodity Exchange Bratislava, JSCDalian Commodity ExchangeDubai Mercantile ExchangeDubai Gold & Commodities ExchangeEuronext.liffeHong Kong Mercantile ExchangeIndian Commodity ExchangeIntercontinental ExchangeIranian Oil BourseKansas City Board of TradeLondon Metal ExchangeMinneapolis Grain ExchangeMulti Commodity ExchangeNational Commodity and Derivatives ExchangeNational Multi-Commodity Exchange of India LtdNational Food ExchangeNew York Mercantile ExchangeNew York Board of TradeRosario Board of TradeSteelbayWinnipeg Commodity ExchangeNational Spot Exchange

futures ADV was 1.325 million contracts, up 27% compared to the same period of2009. Total futures volume in November 2010 was 28.7 million contracts. ICE fu-tures volume exceeded 300 million contracts through November 30; the previousannual record, established in 2009, was 262 million contracts. ICE also reportedOTC energy average daily commissions (ADC) of $1.33 million US dollars in thefourth quarter of 2010 and a record $1.37 million for the full year.

The derivative market of commodity contributes the majority part in commod-ity trading. Unlike stock market or currency market, most commodities involvephysical transaction and therefore shipping cost or so-called freight cost, whichmake spot trading very rare in commodity market. Instead, forward trading ismuch more common. Consequently, the underlying commodity for the com-

Page 15: Commodity Derivatives: Modeling and Pricing

CHAPTER 1. INTRODUCTION 4

Table 1.3: Some commodity indices

Continuous Commodity Index (CCI)Astmax Commodity Index(AMCI)Commin Commodity IndexDow Jones-AIG Commodity IndexGoldman Sachs Commodity IndexThomson Reuters/Jefferies CRB IndexRogers International Commodity IndexStandard & Poorā€™s Commodity IndexNCDEX Commodity IndexDeutsche Bank Liquid Commodity Index (DBLCI)UBS Bloomberg Constant Maturity Commodity Index (CMCI)Merrill Lynch Commodity index eXtra (MLCX) , etc.

modity option is not the commodity itself, but rather a standardized forwardcontract, or commonly known as futures contract, for that commodity. For ex-ample, a November soybean option will actually be an option for a Novemberdelivery soybean futures contract. In this sense, the options are on futures andnot on the physical commodity. This fact results in a somehow slightly differ-ent form of formula for option price from other asset classes even under classicBlack-Scholes model.

Nevertheless, the carry cost as well as potential benefit of holding a commod-ity, also known as convenience yield, changes the fundamental relation betweenspot price and future price. Besides, commodity market has other distinguishcharacters, among which seasonality and mean reversion patterns play a veryimportant role. Both patterns describe the periodic behavior of commodity pricein both short term and long term. Certain specific designed commodity modelscan explicitly reflect these characters in their model parameters, which presentsa better financial interpretation. In these models, convenience yield or longterm price themselves are assigned to follow a distinguish stochastic processand eventually make the drift term of spot price to be stochastic.

Quite differently, stochastic volatility models introduce the second stochasticprocess on the volatility parameter. The models in this family allows for adifferent and as well complex marginal distribution, which in one way has abetter fit to the forward volatility surface, but in the other way adds difficultieson even the simple vanilla option price. Thus, it is crucial to have the stable andprecise option formula and calibration to enjoy the advantage of the models.

In the first two chapters of this thesis, two most popular specific commoditymodels, namely Gibson-Schwartz model and Gabillon model, will be studied.In chapter two we will prove the mathematical equivalence between the two

Page 16: Commodity Derivatives: Modeling and Pricing

CHAPTER 1. INTRODUCTION 5

models. And in the chapter three we develop a so-called model factor reductiontechnique and apply on both models to achieve the formulas for vanilla optionsbased on spot and future.

In chapter four, we will focus on the stochastic volatility model. We extendthe popular stochastic volatility model such as Heston model by allowing themodel parameter to be time-dependent. This extension gives the more degreeof freedom to stochastic volatility model in term structure, but in the other handintroduces difficulty for formula of option price and process of calibration. Wewill study different methods including parameter averaging, perturbation andvol of vol expansion to get the option price formula in the extended models.Some of calibration algorithm are also listed in the last part of this chapter.

Page 17: Commodity Derivatives: Modeling and Pricing
Page 18: Commodity Derivatives: Modeling and Pricing

Chapter 2

Modeling commodity withconvenience yield

RƩsumƩ du chapitre

Contrairement Ć  dā€™autres actifs classiques, les matiĆØres premiĆØres montrent descaractĆ©ristiques spĆ©cifiques : importance du convenience yield, de la saisonnalitĆ©et du retour Ć  la moyenne. Le concept clĆ© est ici convenience yield, qui estclairement expliquĆ© par exemple dans Lautier [2009]. Celui-ci dicte la relationentre les prix futurs et les prix au comptant. Il ya beaucoup de modĆØles surdes produits de base spĆ©cifiques qui mettent lā€™accent sur ces concepts, citonsles principaux : Gibson and Schwartz [1990], Brennan [1991], Gabillon [1991],Schwartz [1997], Hilliard and Reis [1998], Schwartz and Smith [2000], Casassusand Collin-Dufresne [2005]. A prĆ©sent les modĆØles de Gibson et Schwartz etde Gabillon sont les plus populaires chez les chercheurs et les praticiens dumarchĆ©, non seulement parce que leur interprĆ©tation en terme de rendement decommoditĆ© ou de prix Ć  long terme, mais aussi de part leur simplicitĆ© et leurclartĆ©.

Dans ce chapitre, nous montrerons quā€™il ya Ć©quivalence entre les modĆØlesde Gibson Schwartz et Gabillon. Cette conclusion nā€™est pas surprenante Ć  silā€™on considĆØre que dans le modĆØle de Gibson et Schwartz le convenience yieldapparaĆ®t comme un terme de dĆ©rive stochastique qui nā€™est pas loin dā€™un com-portement Ć  long terme comme dans le modĆØle de Gabillon.

7

Page 19: Commodity Derivatives: Modeling and Pricing

CHAPTER 2. MODELING COMMODITY WITH CONVENIENCE YIELD 8

2.1 Introduction

Unlike other asset classed, commodity has a strong presence of convenienceyield, seasonality and mean reversion. The key concept here is convenienceyield, which is clearly stated in Lautier [2009]. It dictates the relation betweenfuture price and spot price. There are many commodity-specialized modelsemphasize on these concepts, including Gibson and Schwartz [1990], Brennan[1991], Gabillon [1991], Schwartz [1997], Hilliard and Reis [1998], Schwartz andSmith [2000], Casassus and Collin-Dufresne [2005], etc. Among others, GibsonSchwartz model and Gabillon model are the most popular in researchers andmarket practitioners, not only because their appropriate interpretation on con-venience yield or long term price, but also because their simplicity and clarity.

In this chapter, we will point out that there is a mathematical equivalencerelation between Gibson Schwartz model and Gabillon model. This conclusionis not surprising provided the consideration that the former model suggestsconvenience yield as its drift term while the latter uses long term price as itsdrift term, which is an equivalent counterpart of convenience yield. The closerelation between convenience yield and long term price results in the equivalencebetween two models.

2.2 Review of research and background

2.2.1 Convenience yield

Almost all the commodities, apart from the exception of electricity, are normallyconsidered as storable. That is to say, people can hold their commodities in thestock for economic purpose. The following discussion will rule out electricityand focus on the storable commodity. The concept of convenience yield is closelylinked to storage, which can be referred to Lautier [2005] and Lautier [2009].

Storability is essentially a very important feature of commodities. Variousscholars and market participants, such as Kaldor [1939], Working [1949], Brennan[1958] and Telser [1958], have worked on the theory of storage. There are twomajor economic effects based on storage theory:

1. The time value of holding a commodity. It is because the inventory enablesholders to store the commodity when its price is low and sell it on the marketwhen the price goes high. It also avoids the disruption of manufacturing.

2. The holding cost. Holding cost is also called carry cost or cost of storage. Itincludes the land and facility to store the commodity, as well as the operationmanagement cost.

Convenience yield is then defined as the total effect of these two aspects,which generally reflects the economic value of holding a commodity.

Page 20: Commodity Derivatives: Modeling and Pricing

CHAPTER 2. MODELING COMMODITY WITH CONVENIENCE YIELD 9

Of course, this description only helps understand the economic meaning ofconvenience yield. More elaborated and mathematical definition, as shown inLautier [2009] and Geman [2005], will involve market structure, precisely therelation between spot price and future price.

Definition 1 (Convenience yield). Note Ft,T to be the future price of a commodity attime t for maturity T. Note St to be the spot price at time t. Note rt to be the shortrate, which is the (annualized) interest rate at which an entity can borrow money for aninfinitesimally short period of time from time t. Then we define convenience yield yt bysatisfying the arbitrage-free condition:

Ft,T = Steāˆ« T

t (rsāˆ’ys)ds (2.1)

or equivalently,

ys = rsāˆ’dds

(log

Ft,s

St

)(2.2)

with t < s < T.

The definition of convenience yield in equation (2.2) is in continuous form,which presents the average of convenience yield from time t to T. In real worldapplication, since the future values are normally available at some discretiontime points, we can then defined convenience yield yt accordingly in piecewiseconstant form by the following definition:

Definition 2 (Convenience yield in discrete form). Let t = t0 < t1 < t2 < ... < tn = T.Note Ft,ti to be the future price at time t for maturity ti. Note St to be the spot price. NoteD(ti, t j) to be the discount factor between time ti and t j, which stands for the presentvalue at time ti of a future cash flow of 1 unit currency at time t j.

Convenience yield ys is then given by the following equation:

ys =1

ti+1āˆ’ tilog

(Ft,ti

Ft,ti+1

1D(ti, ti+1)

), ti 6 s < ti+1 (2.3)

It is very easy to verify that this definition fulfills the arbitrage-free condition

in equation (2.1) by using D(ti, ti+1) = eāˆ’āˆ« ti+1

tirsds

. This definition of convenienceyield in discrete form is practically used as the formula to calculate convenienceyield from market data.

The notion of discount factor is very useful in following paragraphs. We hereseparately write it as a definition.

Remark For a fixed discount rate, r, continuously compounded discount ratefrom time t to T, then we have the relation D(t,T) = eāˆ’r(Tāˆ’t). Normally discountfactor is provided by interest rate market data and very easy to access. So wewill treat it as a known variable hereafter.

Page 21: Commodity Derivatives: Modeling and Pricing

CHAPTER 2. MODELING COMMODITY WITH CONVENIENCE YIELD 10

Convenience yield v.s. stock dividend

We compare arbitrage-free condition (2.1) to its counterpart in stock market. Wecontinue use the same symbol: Ft,T, St and rt for future, spot and interest raterespectively. Note dt to be the dividend of the stock. Then we have

Ft,T = Steāˆ« T

t (rsāˆ’ds)ds (2.4)

Notice that it is exactly the same form as equation (2.1). This fact encouragesus to migrate the stock market models to commodity. All we need to do is tosimply replace dividend term by convenience yield.

Term structure: backwardation and contango

We copy here the arbitrage-free condition, equation (2.1), to facilitate reading.The relation between future and spot is,

Ft,T = Steāˆ« T

t (rsāˆ’ys)ds (2.5)

The termāˆ« T

t (rs āˆ’ ys)ds can be negative or positive. We call these two differentterm structures backwardation and contango respectively.

Backwardation meansāˆ« T

t (rsāˆ’ ys)ds is negative. In this case, future price Ft,T

is smaller than spot St. It happens when interest rate in low and convenienceyield is high. For example, when the Gulf war broke up, holding oil becameparticularly profitable at that time. And convenience yield of oil became so highthat caused backwardation in the future market.

Contrarily, whenāˆ« T

t (rs āˆ’ ys)ds is positive, we have contango. In this case,future price Ft,T is greater than spot St. The structure of future curve becomes anincreasing function against maturity time.

2.2.2 Gibson Schwartz model

Gibson Schwartz model, first introduced by Gibson and Schwartz [1990], is basedon a two-factor diffusion. The spot price St is assumed to follow a lognormaldiffusion but with a stochastic drift. Gibson and Schwartz [1990] assume thatconvenience yield, Ī“t, is a stochastic process, explicitly following a mean revert-ing normal diffusion, which leads to the following diffusion for the commodityspot price on risk-neutral measure:

dSS

= (rāˆ’Ī“)dt +Ļƒ1dz1 (2.6)

dĪ“ = (k(Ī±āˆ’Ī“)āˆ’Ī»Ļƒ2)dt +Ļƒ2dz2 (2.7)

where r is the risk free rate, z1 and z2 are two correlated Brownian motionswith correlation Ļ.

Page 22: Commodity Derivatives: Modeling and Pricing

CHAPTER 2. MODELING COMMODITY WITH CONVENIENCE YIELD 11

2.2.3 Gabillon model

Gabillon [1991] created this model to account for long term trend. He arguedthat it was beneficial to use a two-factor diffusion to capture the differencebetween short and long term effects. He introduced a long term trend factorthat would influence the convenience yield. This led him to assume that thespot prices, S, follows a log-normal diffusion but with a stochastic drift term.Instead of assuming a stochastic convenience yield like in Gibson and Schwartz[1990] model, Gabillon assumed that the drift term is influenced by a long termfactor that is itself stochastic. This long term factor denoted by L follows also alog-normal diffusion.

In the original paper the model parameters are assumed to be constant. Toallow an easy calibration across time, we will extend the hypothesis of Gabillonand assume that these parameters are time dependent. This setting is very inter-esting as we capture the long term trend stochasticity through the second factorand capture time varying value through time dependent model parameters. Wewill take the traditional assumption in mathematical finance and assume thatthe uncertainty is represented by a probability space (Ī©,F ,P) with a two dimen-sional Brownian motion. The two components of the Brownian motion will bedenoted by z1,z2 and will be assumed to be correlated with a constant correlation.

dz1dz2 =Ļdt

This leads us to assume that the spot price S follow under the historicalprobability,

dSS

= k lnLS

dt +ĻƒSdz1 (2.8)

dLL

= ĀµLdt +ĻƒLdz2 (2.9)

where in the above equations, we have removed any obvious reference totime to simplify the notation, like for instance St,Lt,zt

1,zt2.

2.2.4 Previous research

As two of the most popular models in commodity world, Gibson Schwartzmodel and Gabillon model are actually mathematically equivalent to each other.The previous researches includes Gabillon [1991] and Schwartz and Smith [2000].The former studied and compared the economic interpretation of the parametersof both models. The latter built a Gabillon style model, namely short-term/long-term model, and proved that the model was equivalent to Gibson Schwartzmodel by change of variable. And our result in this thesis goes in the other

Page 23: Commodity Derivatives: Modeling and Pricing

CHAPTER 2. MODELING COMMODITY WITH CONVENIENCE YIELD 12

direction: the equivalence relation between Gabillon model and short-term/long-term model. Our result together with the result of Schwartz and Smith [2000]complete the proof of the equivalence relation between Gibson Schwartz modeland Gabillon model.

2.3 Equivalence between Gibson Schwartz model andGabillon model

We establish the equivalence relation through a third model. The idea is torespectively prove Gibson Schwartz model and Gabillon model are both equiva-lent to a third model. Consequently, they are equivalent to each other. Moreover,we interpret the equivalence relation in term of the economic significance of theparameters in both models.

The significance of this equivalent relation is to establish a close connectionbetween two most popular models in commodity derivative pricing. Althoughthe two models are mathematically equivalent, the market practitioners havetheir preference on choosing between the two. It is because they have goodknowledge about the economic meaning of the parameters of the model, whichguarantees them to better understand and use the model.

In fact, the equivalence between the third model and Gibson Schwartz modelwere already proved in Schwartz and Smith [2000]. The model is called short-term/long-term model in the paper. The definition is as follows.

Definition 3 (Short-term/long-term model). Let St denote the spot price of a com-modity at time t. We define,

log(St) = Ļ‡t +Ī¾t, (2.10)

where Ļ‡t and Ī¾t present short term price and long term price respectively. They followthe stochastic process:

dĻ‡t = āˆ’ĪŗĻ‡tdt +ĻƒĻ‡dzĻ‡ (2.11)

dĪ¾t = ĀµĪ¾dt +ĻƒĪ¾dzĪ¾ (2.12)

Here zĻ‡ and zĪ¾ are two Brownian motion with correlation Ļ. Parameters Īŗ, ĀµĪ¾, ĻƒĻ‡and ĻƒĪ¾ are constant.

The first step is to prove the equivalence relation between short-term/long-term mode and Gibson Schwartz model. It is done in Schwartz and Smith [2000].The key here is the change of variable .

Page 24: Commodity Derivatives: Modeling and Pricing

CHAPTER 2. MODELING COMMODITY WITH CONVENIENCE YIELD 13

Ļ‡t =1Īŗ

(Ī“tāˆ’Ī±) (2.13)

Ī¾t = logSāˆ’1Īŗ

(Ī“tāˆ’Ī±) (2.14)

Then simply inject the stochastic diffusion of Gibson Schwartz model, equa-tion (2.6) and (2.7), into the equations above. We have the derivation as follows.

dSS

= (Āµāˆ’Ī“)dt +Ļƒ1dz1 (2.15)

dĪ“ = (k(Ī±āˆ’Ī“)āˆ’Ī»Ļƒ2)dt +Ļƒ2dz2 (2.16)

Note Xt = logSt, we have

dXt =(Āµāˆ’Ī“tāˆ’

12Ļƒ2

S

)dt +ĻƒSdz1 (2.17)

Using 2.13, we have

Ļ‡t =1Īŗ

(Ī“tāˆ’Ī±) (2.18)

ā‡’ dĪ“t = Īŗ(Ī±āˆ’Ī“t)dt +ĪŗĻƒĻ‡dzĻ‡ (2.19)

Ī¾t = logSāˆ’1Īŗ

(Ī“tāˆ’Ī±) (2.20)

ā‡’ dXt = (ĀµĪ¾+Ī±āˆ’Ī“t)dt +ĻƒĪ¾dzĪ¾+ĻƒĻ‡dzĻ‡ (2.21)

Assume zĻ‡ = ĻzĪ¾+āˆš

1āˆ’Ļ2zāŠ„Ī¾ , where zāŠ„Ī¾ is independent to zĪ¾.

ā‡’ dĻƒĪ¾dzĪ¾+ĻƒĻ‡dzĻ‡ = (ĻƒĪ¾+ĻĻƒĻ‡)dzĪ¾+

āˆš1āˆ’Ļ2ĻƒĻ‡dzāŠ„Ī¾ (2.22)

Then we note

||Ļƒ||2t =

āˆ« t

0

((ĻƒĪ¾+ĻĻƒĻ‡)2 + (1āˆ’Ļ2)Ļƒ2

Ļ‡

)ds (2.23)

So

||Ļƒ||t = Ļƒ1 (2.24)

z1(t) =

āˆ« t

0

ĻƒĪ¾+ĻĻƒĻ‡||Ļƒ||t

dzĪ¾+

āˆš1āˆ’Ļ2ĻƒĻ‡||Ļƒ||t

dzāŠ„Ī¾

(2.25)

Page 25: Commodity Derivatives: Modeling and Pricing

CHAPTER 2. MODELING COMMODITY WITH CONVENIENCE YIELD 14

According to Levy Theorem, z1(t) is Brownian motion. So the two modelsare equivalent.

dĻ‡t =1Īŗ

dĪ“t (2.26)

= (Ī±āˆ’Ī“t)dt +Ļƒ2

Īŗdz2 (2.27)

= āˆ’ĪŗĻ‡tdt +Ļƒ2

Īŗdz2 (2.28)

and

dĪ¾t = dlogSāˆ’1Īŗ

dĪ“t (2.29)

= (Āµāˆ’Ī“tāˆ’12Ļƒ2

1)dt +Ļƒ1dz1āˆ’ (Ī±āˆ’Ī“t)dtāˆ’Ļƒ2

Īŗdz2 (2.30)

= (Āµāˆ’Ī±āˆ’12Ļƒ2

1)dt +Ļƒ1dz1āˆ’Ļƒ2

Īŗdz2 (2.31)

Here we use the risk neutralized version of both short term/long term modeland Gibson Schwartz model to exclude the risk premium parameter. Undernon-arbitrage principle, the drift term of both models should equal to

āˆ« t0 rsds. So

we just need to take care of stochastic term for the spot diffusion. Comparingthese forms with short term/long term model, we can see that the two modelsare equivalent if we relate the parameters of the two models.

In the other hand, the equivalence relation between long term/short termmodel and Gabillon model is also easy to prove. The change of variables here is

Ī¾t = logL (2.32)

Ļ‡t = logSāˆ’ logL (2.33)

Inspired by the above result, we can similarly prove the equivalence relationbetween long term/short term model and Gabillon model. We inject the diffusionof Gabillon model, equation (2.8) and (2.9), in the equations above. We can re-findthe diffusion of long term/short term model.

dĻ‡t = dlogSāˆ’dlogL (2.34)

= āˆ’ĪŗĻ‡tdtāˆ’ĻƒSdW1 +ĻƒLdW2 (2.35)

and

Page 26: Commodity Derivatives: Modeling and Pricing

CHAPTER 2. MODELING COMMODITY WITH CONVENIENCE YIELD 15

dĪ¾t = dlogL (2.36)

=(ĀµLāˆ’

12Ļƒ2

L

)dt +ĻƒLdW2 (2.37)

Therefore, we prove the equivalence relation between long term/short termmodel and Gabillon model. This result is new to our knowledge.

Now we put the two parts together and we can get the equivalence relationbetween Gibson Schwartz model and Gabillon model. It is given by the changeof variables as follows:

logS = Xt (2.38)

logL = Xtāˆ’1Īŗ

(Ī“tāˆ’Ī±) (2.39)

For the equivalence of parameters, the calculation is a bit lengthy but verystraightforward. We just list the result in table 2.1. Given the high similaritybetween Gabillon model and short-term/long-term model, we observe the theresult in table 2.1 is in the same form as the result for the equivalence relation be-tween Gibson Schwartz model and short-term/long-term model, which is shownin Schwartz and Smith [2000].

Table 2.1: The Relationships Between Parameters in Gabillon Model and GibsonSchwartz model

Gabillon model parameterSymbols Description Equivalence in Gibson Schwartz modelĪŗ Short-term mean-reversion rate ĪŗĻƒS Short-term volatility Ļƒ2/ĪŗĀµL Equilibrium drift rate (Āµāˆ’Ī±āˆ’ 1

2Ļƒ21)

ĻƒL Equilibrium volatility (Ļƒ21 +Ļƒ2

2/Īŗ2āˆ’2ĻĻƒ1Ļƒ2/Īŗ)1/2

Ļ Correlation in increments (ĻĻƒ1āˆ’Ļƒ2/Īŗ)(Ļƒ21 +Ļƒ2

2/Īŗ2āˆ’2ĻĻƒ1Ļƒ2/Īŗ)1/2

2.4 Summary

In this chapter, we revisited the concept of convenience yield and the modelsspecialized on commodity, namely Gibson Schwartz model and Gabillon model,which emphasizes on stochastic convenience yield and stochastic long term pricerespectively. As the main result of the chapter, we proved that the two modelsare eventually equivalent in mathematics.

Page 27: Commodity Derivatives: Modeling and Pricing

CHAPTER 2. MODELING COMMODITY WITH CONVENIENCE YIELD 16

The explanation of the equivalence of the two models involves the relationof the concept of long term price and convenience yield. As we know, conve-nience yield is the total benefit and carry cost on holding a commodity, whichconsequently decides the long term price of the commodity. Gibson Schwartzmodel assumes that convenience yield follows a Brownian motion with meanreversion, while Gabillon model suggests that the logarithm of long term priceis also a Brownian motion with mean reversion. Apparently, these assumptionslead to the equivalence of the two models.

Page 28: Commodity Derivatives: Modeling and Pricing

Chapter 3

Model factor reductiontechnique

RƩsumƩ du chapitre

Le rĆ©sultat principal de ce chapitre est lā€™introduction et lā€™application de la tech-nique de rĆ©duction de modĆØle factoriel. Cā€™est une technique qui permet detrouver ā€“sous certaines conditions-un Ć©quivalent Ć  un facteur qui a la mĆŖmedistribution marginale de spot et donc du prix dā€™une option EuropĆ©enne, que lemodĆØle initial Ć  plusieurs facteurs. Nous appliquons ensuite cette la techniqueaux modĆØles de Gibson Schwartz et de Gabillon. En outre, nous allons donner ex-plicitement la distribution marginale des spots et futures de ces deux modĆØles,et en dĆ©duire une formule explicite du prix de lā€™option vanille. La techniquesā€™applique gĆ©nĆ©ralement Ć  tout modĆØle multi-facteurs de prix sous lā€™hypothĆØseque la dĆ©rive est un processus de diffusion gaussien. Cā€™est prĆ©cisĆ©ment le casdes modĆØles de commoditĆ©s que sont les deux modĆØles mentionnĆ©s ci-dessus.La technique peut Ć©galement sā€™appliquer Ć  des modĆØles pour les autres classesdā€™actifs telles que les modĆØles stochastiques avec dividendes pour les marchĆ©sdes actions.

Lā€™organisation de ce chapitre commence en prĆ©sentant notre technique derĆ©duction de modĆØle factoriel sur un modĆØle de tarification Ć  deux facteursgĆ©nĆ©raux. La premiĆØre Ć©tape consiste Ć  obtenir lā€™Ć©quivalent Ć  un facteur puis ladistribution marginale des prix au comptant, ce qui permet de dĆ©river la formulede prix de lā€™option de vanille. Ensuite, nous appliquons le rĆ©sultat gĆ©nĆ©ral auxdeux modĆØles ci-dessus modĆØle.

17

Page 29: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 18

3.1 Introduction

The main result in this chapter is the introduction and application of model factorreduction technique. It is a technique to find an equivalent one-factor model thathas the same marginal distribution of spot and therefore vanilla option pricewith the original two-factor model under certain condition. Interestingly, weapply the technique to Gibson Schwartz model and Gabillon model. We showthat they are equivalent to one-factor model when pricing futures and vanillaoption. In addition, we will explicitly express the marginal distribution of spotand future price of both models, and derive the formula of vanilla option on spotand future thusly.

The equivalent one-factor model admits a weak solution, which has the sameone-dimensional marginal probability distribution and therefore same price forvanilla options. Moreover, this distribution can be explicitly expressed undercertain conditions. The result can consequently induct closed formulas for fu-ture and vanilla option price. The technique generally applies to any multi-factorpricing model under assumption of normally distributed drift, which are pre-cisely, in commodity modeling, the stochastic drift such as convenience yieldin Gibson Schwartz model or long term price in Gabillon model. Obviously,the technique can also apply to models for other asset classes such as stochasticdividend models in stock market.

The organization of this chapter begins with introducing our model factorreduction technique on a general two-factor pricing model. The first step is toget the equivalent one-factor model and then the marginal distribution of spotprice, which can derive the formula for vanilla option price. Next we apply thegeneral result to Gabillon model and Gibson Schwartz model, verifying that bothmodels fit the assumption condition. Once the equivalent reduced one-factormodel and the marginal distribution are obtained, it is straightforward to getthe formulas for vanilla option price by applying the result of general two-factormodel.

3.2 Review of research

The idea of model factor reduction technique is inspired by the paper ofGyƶngy [1986]. In his paper, Gyƶngy considered a general form of a stochas-tic process dx(t) = Ī“(t,Ļ‰) dW(t) + Ī²(t,Ļ‰)dt. He proposed an equivalent process,replacing stochastic parameters by non-random parameters. The new processadmits a weak solution having the same one-dimensional probability distribu-tion.

Proposition 4 (Gyƶngy [1986]). Let Ī¾(t) be a stochastic process starting from 0 with

Page 30: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 19

Ito differential

dĪ¾(t) = Ī“(t,Ļ‰)dW(t) +Ī²(t,Ļ‰)dt, (3.1)

where (W(t),Ft) is a Wiener process, Ī“ and Ī² are bounded Ft-non-anticipative processessuch that Ī“Ī“T is uniformly positive definite. Then it is proved that there exists a stochasticdifferential equation

dx(t) = Ļƒ(t,x(t))dW(t) + b(t,x(t))dt (3.2)

with non-random coefficients which admits a weak solution x(t) having the same one-dimensional probability distribution as Ī¾(t) for every t. The coefficients Ļƒ and b have asimple interpretation:

Ļƒ(t,x(t)) =(E[Ī“Ī“T(t)|Ī¾(t) = x]

) 12 (3.3)

b(t,x) = E[Ī²(t)|Ī¾(t) = x] (3.4)

From this result, we can see the process x(t) is with non-random coefficients,which means the model is reduced to one-factor. However, the condition thatĪ“ and Ī² are bounded limits the application of this result, since two-factor mod-els usually cannot satisfy this constraint. For example, in Gibson Schwartzmodel and Gabillon model, the drift term Ī² is normally distributed, hence notbounded. Follow this line, we make one step further. We will impose nor-mal distribution on Ī²(t,Ļ‰), which holds for a large class of commodity pricingmodels. Although normal distribution isnā€™t bounded, we still can calculatethe non-random coefficients for reduced equivalent one-factor model and ex-press this one-dimensional probability distribution explicitly. Then, we show anequivalent one-factor model that presents the same marginal distribution at anytime t, as shown in the definition 6. Once we know the distribution of the spotprice, it is then possible to compute a closed formula for the future price andvanilla option price, shown and proved in proposition 10, 11 and 12 in followingsections for the general case. These results of general case are then applied toGibson Schwartz model and Gabillon model in the following sections.

3.3 Model factor reduction technique in general form

Assume that St is a stochastic process with the following dynamic:

dSt

St= (r +Ī±t,Ļ‰)dt +Ļƒ(t)dWt (3.5)

where Wt is a (potentially multi-dimensional) Wiener process under filtrationFt and risk-neutral measure Q. To be clear, all the calculation in this section isalways on risk-neutral measure Q. r is risk free rate. Drift term Ī±t,Ļ‰ is stochastic.

Page 31: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 20

And we assume that it follows another diffusion process, which is the case inpratical for models such as Gibson Schwartz model and Gabillon model. Ļ‰

represents the full stochastic information. In particular, the path-wise behaviorof Xt may depend, via the random coefficients Ī±t,Ļ‰ in a complicated way on thepast filtration Ft.

Assume that Xt = logSt. So Xt is a stochastic process with the followingdynamic:

dXt =(rāˆ’

12Ļƒ(t)2 +Ī±t,Ļ‰

)dt +Ļƒ(t)dWt (3.6)

Equation (3.6) depicts a general form for pricing models under the assump-tion of a deterministic volatility Ļƒ(t). Since the parameters Ī±t,Ļ‰ is a randomvariable, the process is multi-factor. Some examples can be found in section 3.4.

Lemma 5. In process dXt = (rāˆ’ 12Ļƒ(t)2 +Ī±t,Ļ‰)dt+Ļƒ(t)dWt, as Ī±t,Ļ‰ is a normal diffusion

process for all t> 0, then Xt = X0 +rtāˆ’ 12

āˆ« t0 Ļƒ(s)2ds+

āˆ« t0 Ī±s,Ļ‰ds+

āˆ« t0 Ļƒ(s)dWs is a normal

variable for any t > 0.

Proof. Since Ī±t,Ļ‰ is normal, soāˆ« t

0 Ī±s,Ļ‰ds is normal. The Ito integralāˆ« t

0 ĻƒsdWs isnormal as the integrand Ļƒs is deterministic. Xt, being the sum of normals, isnormal for all t.

Under the assumption that Ī±t,Ļ‰ is a normal variable, we note ĻƒĪ±(t) as thestandard deviation (square root of variance) of Ī±t,Ļ‰ for future convenience.

Definition 6. For afterward convenience, we define here Āµ(t,T) and Ī¾(t,T). DenoteĀµ(t,T) the expectation of XT āˆ’Xt at time t. Denote Ī¾(t,T) the variance of XT āˆ’Xt attime t. They are simply defined by:

Āµ(t,T) = E[XT āˆ’Xt|Ft] = E

[āˆ« T

tĪ±s,Ļ‰ds

āˆ£āˆ£āˆ£āˆ£āˆ£āˆ£Ft

]+ r(Tāˆ’ t)āˆ’

12

āˆ« T

tĻƒ(s)2ds (3.7)

Ī¾(t,T) = var(XT āˆ’Xt|Ft) = var(āˆ« T

tĪ±s,Ļ‰ds +

āˆ« T

tĻƒ(s)dWs

āˆ£āˆ£āˆ£āˆ£āˆ£āˆ£Ft

)(3.8)

Remark We can write E[XT|Ft] = Xt +Āµ(t,T) and var(XT|Ft) = Ī¾(t,T).

Now we can construct a new model with non-random time-dependent pa-rameter. This one-factor model should have the same distribution.

YT = Xt +Āµ(t,T) +

āˆ« T

t

āˆšĪ¾ā€²(t,s)dWs (3.9)

where Āµ(t,T) and Ī¾(t,T) are given by equation (3.7) and (3.8). And Ī¾ā€²(t,s) =āˆ‚Ī¾(t,s)āˆ‚s .

It is easily to verify that E[YT|Ft] = Xt +Āµ(t,T) and var(YT|Ft) = Ī¾(t,T). Fromlemma 7, we know that Ī¾ā€²(t,s) is positive, so the process YT is well defined.

Page 32: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 21

Lemma 7. Ī¾(t,T) is given by equation (3.8). Assume Ī±t has form dĪ±t = ĀµĪ±(t)dt +

ĻƒĪ±(t)dWĪ±t . ĀµĪ±(t) and ĻƒĪ±(t) are deterministic. Then we have āˆ€t, āˆ‚Ī¾(t,T)

āˆ‚T > 0.

Proof. Note ĀµĪ±(t,T) =āˆ« T

t ĀµĪ±(s)ds.

āˆ« T

tĪ±sds = ĀµĪ±(t,T) +

āˆ« T

tdu

āˆ« T

t1u>sĻƒĪ±(s)dsdWĪ±

s (3.10)

= ĀµĪ±(t,T) +

āˆ« T

t(Tāˆ’ s)ĻƒĪ±(s)dWĪ±

s (3.11)

Therefore,

XT =Xt + r(Tāˆ’ t)āˆ’12

āˆ« T

tĻƒ(s)2ds +ĀµĪ±(t,T)

+

āˆ« T

t(Tāˆ’ s)ĻƒĪ±(s)dWĪ±

s +

āˆ« T

tĻƒ(s)dWs

E(XT|Ft) = Xt + r(Tāˆ’ t)āˆ’12

āˆ« T

tĻƒ(s)2ds +ĀµĪ±(t,T) (3.12)

var(XT|Ft) = E

(āˆ« T

t(Tāˆ’ s)ĻƒĪ±(s)dWĪ±

s +Ļƒ(s)dWs

)2 (3.13)

Note Ļ the correlation between WĪ±t and Wt.

For any t < T, we have

Ī¾(t,T) = var(XT|Ft)

=

āˆ« T

t(Tāˆ’ s)2Ļƒ2

Ī±(s)ds +

āˆ« T

tĻƒ2(s)ds + 2Ļ

āˆ« T

t(Tāˆ’ s)ĻƒĪ±(s)Ļƒ(s)ds

>

āˆ« T

t[(Tāˆ’ s)2Ļƒ2

Ī±(s) +Ļƒ2(s)āˆ’2ĻƒĪ±(s)Ļƒ(s)]ds

=

āˆ« T

t[(Tāˆ’ s)ĻƒĪ±(s)āˆ’Ļƒ(s)]2ds

> 0

So we have,

āˆ‚Ī¾(t,T)āˆ‚T

= limĪ“ā†’0

Ī¾(t,T +Ī“)āˆ’Ī¾(t,T)

= limĪ“ā†’0

Ī¾(T,T +Ī“)

> 0

This proves lemma 7.

Page 33: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 22

The process YT = Xt +Āµ(t,T) +āˆ« T

t

āˆšĪ¾ā€²(t,s)dWs is a one-factor model with non-

random time-dependent parameters. At any time s > t, Ys is a normally dis-tributed variable withE[Yt] = X0 +Āµ(0, t), var(Yt) = Ī¾(0, t). So it has the same one-dimensional marginal distribution as our original process dXt = Ī±t,Ļ‰dt +ĻƒtdWt

at any time t.From the calculation above, we know

Āµ(t,T) = ĀµĪ±(t,T) + r(Tāˆ’ t)āˆ’12

āˆ« T

tĻƒ(s)2ds. (3.14)

Therefore from YT = Xt +Āµ(t,T) +āˆ« T

t

āˆšĪ¾ā€²(t,s)dWs, we can have āˆ€s > t,

dYs =(rāˆ’

12Ļƒ(s)2 +ĀµĪ±(s)

)ds +

āˆšĪ¾ā€²(t,s)dWs (3.15)

The advantage of the one-factor model, YT = Xt +Āµ(t,T) +āˆ« T

t

āˆšĪ¾ā€²(t,s)dWs, is

that there are existing closed formula for future and vanilla option price. Weshow them in proposition 10 and 11. Theoretically, with the normal distributionin definition 6, all kinds of non-path-dependent options can be priced as in theequivalent one-factor model.

In order to use the Black Scholes formula, we list the result of Black Scholeshere for convenience. Black Scholes model is initially introduced by Black andScholes [1973]. The model suggests the marginal distribution of underlyingfollows a log-normal distribution. It is now still popular in the market thanks toits simplicity and stability. Black and Scholes [1973] also presented Black Scholesformula for the price of a vanilla option.

Definition 8 (Black Scholes model with dividend). Assume St to be the spot priceof an underlying (such as commodity) at time t. The diffusion of Black Scholes modelfollows:

dSt

St=

(rāˆ’ q(t)

)dt +Ļƒ(t)dWt (3.16)

where ĀµS(t), q(t) and ĻƒS(t) are deterministic. q(t) is dividend. Note Xt = logSt.Applying Ito lemma, we have

dXt =(rāˆ’

12Ļƒ(t)2

āˆ’ q(t))dt +Ļƒ(t)dWt (3.17)

Proposition 9 (Black Scholes formula - price of a vanilla option). Under BlackScholes model in Definition 8, we can express the price of a vanilla option as follows:

Page 34: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 23

V(t) = StN(d1)eāˆ’āˆ« T

t q(s)dsāˆ’KN(d2)eāˆ’r(Tāˆ’t) (3.18)

where r is interest rate. St is the spot price observed at time t. N(x) = 12Ļ€

āˆ« xāˆ’āˆž

eāˆ’t2/2dtis the cumulative distribution function of standard normal distribution. And d1 =log(St/K)+r(Tāˆ’t)āˆ’

āˆ« Tt q(s)ds

Ļƒāˆš

Tāˆ’t+ 1

2Ļƒāˆš

Tāˆ’ t, d2 = d1āˆ’Ļƒāˆš

Tāˆ’ t.

When ĻƒS(t) is time dependent, d1 and d2 are changed to d1 =ln(St/K)+r(Tāˆ’t)āˆ’

āˆ« Tt q(s)ds

āˆšĪ½

+

12āˆšĪ½, d2 = d1āˆ’

āˆšĪ½, with Ī½ =

āˆ« Tt Ļƒ(s)2ds

Since the distribution St is log-normal and the average and variance of logSt

are known, it needs only straightforward calculation to get Black Scholes formulaby integrating St from K to āˆž. The detailed proof can be found in Black andScholes [1973] and we wonā€™t list here. Example calculation for Black Scholesmodel with dividend can also be found in page 237 - 238 of Shreve [2004].

Proposition 10 (Formula for future). If Xt follows equation dXt = Ī±t,Ļ‰dt +ĻƒtdWt

with Ī±t,Ļ‰ being a normal, assuming that St is a stochastic process defined by St = eXt ,then the price of the future F(t,T), observed at time t, expiring at time T, and given byE[ST|Ft] has the following closed formula:

F(t,T) = E[ST|Ft] = SteĀµ(t,T)+ 12Ī¾(t,T) (3.19)

Here Āµ(t,T) =E[āˆ« T

t Ī±s,Ļ‰dsāˆ£āˆ£āˆ£āˆ£Ft

]and Ī¾(t,T) = var

(āˆ« Tt Ī±s,Ļ‰ds +

āˆ« Tt ĻƒsdWs

āˆ£āˆ£āˆ£āˆ£Ft

)are given

by equation 3.7 and 3.8.

Proof. Since St is a log-normal variable, the proposition is obviously proved byemploying the property of a log-normal variable.

Proposition 11 (Formula of vanilla option). If X follows equation 3.9 under riskneutral measure, with Ī±t,Ļ‰ being a normal, then the price of a vanilla call on the spotSt = eXt with strike K is given by:

V = StN(d1)eāˆ’p(t,T)āˆ’KN(d2)eāˆ’r(Tāˆ’t) (3.20)

where N(Ā·) is standard normal cumulative distribution function;

d1 =log(St/K) + r(Tāˆ’ t)āˆ’p(t,T)āˆš

Ī¾(t,T)+

12

āˆšĪ¾(t,T)

d2 = d1āˆ’āˆšĪ¾(t,T)

p(t,T) = r(Tāˆ’ t)āˆ’12Ī¾(t,T)āˆ’Āµ(t,T)

with Āµ(t,T) and Ī¾(t,T) in equation 3.8 and St = eXt .

Page 35: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 24

Proof. Simply use Black Scholes formula with on the stochastic process 3.15

dYs =(rāˆ’

12Ļƒ(s)2 +ĀµĪ±(s)

)ds +

āˆšĪ¾ā€²(t,s)dWs

We have

Ī½ =

āˆ« T

t

(āˆšĪ¾ā€²(t,s)

)2ds

= Ī¾(t,T)

and āˆ€s > t, the dividend term in Black-Scholes model is

q(s) =(rāˆ’

12

(āˆšĪ¾ā€²(t,s)

)2)āˆ’

(rāˆ’

12Ļƒ(s)2 +ĀµĪ±(s)

)ā‡’ p(t,T) def

=

āˆ« T

tq(s)ds

=12

(āˆ« T

tĻƒ(s)2dsāˆ’Ī¾(t,T)

)āˆ’ĀµĪ±(t,T)

= r(Tāˆ’ t)āˆ’12Ī¾(t,T)āˆ’Āµ(t,T)

Then simply use the result of proposition 9 to get the result of this proposition.

Proposition 12 (Vanilla option on future). If X follows equation 3.6 with Ī±(t,Ļ‰)being a normal and the spot process S = eX is a log normal process. Assume a vanillacall option has maturity at time t. The underlying is future FT

t and strike is K. Then theprice V of the option at time t is given by:

V = eāˆ’r(Tāˆ’t)(SteĀµ(t,T)+ 1

2Ī¾(t,T)N(d1)āˆ’KN(d2))

where N(Ā·) is standard normal cumulative distribution function;

d1 =log

(SteĀµ(t,T)+ 1

2Ī¾(t,T)/K)

āˆšĪ¾(t,T)

+12

āˆšĪ¾(t,T)

d2 = d1āˆ’āˆšĪ¾(t,T)

with Āµ(t,T) and Ī¾(t,T) in equation 3.8 and St = eXt .

Proof. We have to compute the following expectation:

eāˆ’r(Tāˆ’t)E[(

FTt āˆ’K

)+āˆ£āˆ£āˆ£āˆ£Ft

]= eāˆ’r(Tāˆ’t)E

[(E [ST|Ft]āˆ’K)+

āˆ£āˆ£āˆ£Ft]

= eāˆ’r(Tāˆ’t)E[(SteĀµ(t,T)+ 1

2Ī¾(t,T)āˆ’K)+

āˆ£āˆ£āˆ£āˆ£Ft]

Page 36: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 25

where

Āµ(t,T) = E

[āˆ« T

tĪ±s,Ļ‰ds

āˆ£āˆ£āˆ£āˆ£āˆ£āˆ£Ft

]Ī¾(t,T) = var

(āˆ« T

tĪ±s,Ļ‰ds +

āˆ« T

tĻƒsdWs

āˆ£āˆ£āˆ£āˆ£āˆ£āˆ£Ft

)leading to

eāˆ’r(tāˆ’t)E[(

FTt āˆ’K

)+āˆ£āˆ£āˆ£āˆ£Ft

]= eāˆ’r(Tāˆ’t)

(S0eĀµ(t,T)+ 1

2Ī¾(t,T)N(d1)āˆ’KN(d2))

where

d1 =log

(SteĀµ(t,T)+ 1

2Ī¾(t,T)/K)

Ī¾(t, t)+

12

āˆšĪ¾(t,T)

d2 = d1āˆ’āˆšĪ¾(t,T)

which is the result of the proposition.

Remark The propositions assume that:

1. the drift term Ī±t,Ļ‰ in equation 3.6 is normally distributed for any t,

2. the volatility Ļƒt is deterministic.

These condition holds for at least two common models for commodity deriva-tives, which are Gibson Schwartz model and Gabillon model.

3.4 Reduced one-factor Gabillon model

Commonly, the process 3.6, Xt, is the log price of the spot. Now, we show twoapplications of the above propositions to the modeling of commodity deriva-tives, namely the models of Gibson Schwartz and Gabillon. These models arecommonly used in commodity market. And both of them are two factors modelswith a normal stochastic drift.

We take the traditional assumption in mathematical finance and assumethat the uncertainty is represented by a probability space (Ī©,F ,P) with a twodimensional Brownian motion. The two components of the Brownian motionwill be denoted by z1,z2 and will be assumed to be correlated with a constantcorrelation.

dz1dz2 =Ļdt

This leads us to assume that the spot price S follow under the historicaldiffusion

Page 37: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 26

dSS

= k lnLS

dt +ĻƒSdz1 (3.21)

dLL

= ĀµLdt +ĻƒLdz2 (3.22)

where in the above equations, we have removed any obvious reference totime to simplify the notation, like for instance St,Lt,kt,zt

1,zt2, . . ..

Denoting by Ī»S the risk premium price for the spot risk, we can easily derivethe diffusion under the risk neutral probability as follows (see for instance Hulland White [1990] or Gabillon [1991]):

dSS

= (k lnLS

dtāˆ’Ī»SĻƒS) +ĻƒSdz1 (3.23)

dLL

= ĻƒLdz2 (3.24)

where dz1 and dz are two Brownian motion with a correlation dz1dz2 = Ļdt.Now let us change the variables and take X = lnS, Y = lnL. Using Itoā€™s lemma

in (3.21) and (3.22), we have:

dX =(k(Yāˆ’X)āˆ’

12ĻƒS

2āˆ’Ī»SĻƒS

)dt +ĻƒSdz1 (3.25)

dY = āˆ’12ĻƒL

2dt +ĻƒLdz2 (3.26)

It can be rewritten in the following form:

dX = a(t,Ļ‰)dt +ĻƒSdz1 (3.27)

with a(t,Ļ‰) = k(Y0āˆ’

12ĻƒL

2t +āˆ« t

0 ĻƒLdz2āˆ’X)āˆ’

12ĻƒS

2āˆ’Ī»SĻƒS

To apply the Distribution Match Method, we just need to prove that a(t,Ļ‰)is a normal. Then, propositions 10 and 11 provides us closed formula for thefuture and European options. To be clear, we work on risk neutral measure.

Lemma 13. If X follows equation 3.27, then a(t,Ļ‰) is normally distributed. In addition,the term Āµ(0, t) and Ī¾(0, t) defined in equation 3.7 and 3.8 can be computed explicitly asfollows:

Āµ(0, t) =X0

(eāˆ’ktāˆ’1

)+

(1āˆ’ eāˆ’kt

)Y0 +

1k

(āˆ’Ī»SĻƒS +

12Ļƒ2

Lāˆ’12Ļƒ2

Lktāˆ’12Ļƒ2

S

+eāˆ’ktĪ»SĻƒSāˆ’12

eāˆ’ktĻƒ2L +

12

eāˆ’ktĻƒ2S

)(3.28)

Ī¾(0, t) =1k

(1āˆ’2eāˆ’kt + eāˆ’2kt

)ĻƒSĻƒLĻ+

12k

(Ļƒ2

S + 4eāˆ’ktĻƒ2L

āˆ’3Ļƒ2Lāˆ’ eāˆ’2ktĻƒ2

Sāˆ’ eāˆ’2ktĻƒ2L + 2Ļƒ2

Lkt)

(3.29)

Page 38: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 27

Proof. Re-write equation 3.27 here,

dX = a(t,Ļ‰)dt +ĻƒSdz1

with a(t,Ļ‰) = k(Y0āˆ’

12ĻƒL

2t +āˆ« t

0 ĻƒLdz2āˆ’X)āˆ’

12ĻƒS

2āˆ’Ī»SĻƒS

Denote a1 = k(Y0āˆ’

12ĻƒL

2t +āˆ« t

0 ĻƒLdz2

)āˆ’

12ĻƒS

2āˆ’Ī»SĻƒS, it is obvious a1 is normal.

So we can write a(t,Ļ‰) = a1āˆ’ kX. Change the variable, H = ektX,

dH = ektdXāˆ’ kektXdt

= ekta1dt + ektĻƒSdz1 (3.30)

So H = H0 +āˆ« t

0 eksa1dt +āˆ« t

0 eksĻƒSdz1 is normal since a1 is normal. Therefore,X = eāˆ’ktH is normal. a(t,Ļ‰) = a1 āˆ’ kX is normal. This proves that the drift isnormal.

Let us now calculate the terms Āµ(0, t) and Ī¾(0, t). In equation 3.30, we canderive the expectation and variance of H.

E(H) = H0 +E

[āˆ« t

0eksa1ds

]= H0 +

āˆ« t

0eksa2ds

var(H) =

āˆ« t

0e2ksĻƒS

2ds +

āˆ« t

0

(ektāˆ’ eks

)2ĻƒL

2ds + 2Ļāˆ« t

0eks

(ektāˆ’ eks

)ĻƒSĻƒLds

with a2 = kY0āˆ’12 kĻƒL

2tāˆ’ 12ĻƒS

2āˆ’Ī»SĻƒS

Knowing X = eāˆ’ktH, we have

E(X) = X0eāˆ’kt +

āˆ« t

0ek(sāˆ’t)a2ds

var(X) =

āˆ« t

0e2k(sāˆ’t)ĻƒS

2ds +

āˆ« t

0

(1āˆ’ ek(sāˆ’t)

)2ĻƒL

2ds + 2Ļāˆ« t

0ek(sāˆ’t)

(1āˆ’ ek(sāˆ’t)

)ĻƒSĻƒLds

finally we get

Āµ(0, t) =E(X)āˆ’X0

=X0

(eāˆ’ktāˆ’1

)+

āˆ« t

0ek(sāˆ’t)a2ds

=X0

(eāˆ’ktāˆ’1

)+

āˆ« t

0ek(sāˆ’t)

(kY0āˆ’

12

kĻƒL2sāˆ’

12ĻƒS

2āˆ’Ī»SĻƒS

)ds

Ī¾(0, t) =var(X)

=

āˆ« t

0e2k(sāˆ’t)ĻƒS

2ds +

āˆ« t

0

(1āˆ’ ek(sāˆ’t)

)2ĻƒL

2ds

+ 2Ļāˆ« t

0ek(sāˆ’t)

(1āˆ’ ek(sāˆ’t)

)ĻƒSĻƒLds

Page 39: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 28

The calculation of the integrals is straightforward, and we finally get theresult of lemma 13.

The distribution match method summarized by propositions 10 enables usto retrieve the original result of Gabillon for the futures price. The result ofproposition 11 and 12 introduces a new result for the price of European optionsin Gabillon model.

Proposition 14. The future price at time t is

F(0, t) = A(t)S0B(t)L0

1āˆ’B(t)

with, A(t) = expĪ»SĻƒS

(eāˆ’ktāˆ’1

k

)+

v4k

(2eāˆ’kt

āˆ’ eāˆ’2ktāˆ’1

)B(t) = eāˆ’kt

v = ĻƒS2 +ĻƒL

2āˆ’2ĻĻƒSĻƒL

Proof. Application of proposition 10 with equation 3.28 and 3.29. The calculationis straightforward.

Note that the above result gives another proof of the result of the Gabillonmodel which is originally done using PDE arguments.

Proposition 15. The price (at time t) V of a European call option on spot with strike Kand option expiry T is given by:

V = StN(d1)eāˆ’p(t,T)āˆ’KN(d2)eāˆ’r(Tāˆ’t)

where N(Ā·) is standard normal cumulative distribution function;

d1 =log(St/K) + r(Tāˆ’ t)āˆ’p(t,T)āˆš

Ī¾(t,T)+

12

āˆšĪ¾(t,T)

d2 = d1āˆ’āˆšĪ¾(t,T)

p(t,T) = r(Tāˆ’ t)āˆ’12Ī¾(t,T)āˆ’Āµ(t,T)

with Āµ(t,T) and Ī¾(t,T) in equation 3.28 and 3.29. St = eXt .

Proof. Straightfuture application of proposition 11.

Lemma 16. Under the hypothesis of lemma 13. The variance of process X from t to T is

Ī¾(t,T) =2ĻƒL

k

(eāˆ’kTāˆ’ eāˆ’kt

) (ĻƒLāˆ’ĻƒSĻ

)āˆ’

12k

(eāˆ’2kT

āˆ’ eāˆ’2kt)(Ļƒ2

S +Ļƒ2Lāˆ’2ĻĻƒSĻƒL

)+Ļƒ2

L(Tāˆ’ t) (3.31)

Page 40: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 29

Proof. Substitute lemma 13 into definition 3.8.

Proposition 17. Assume a vanilla call option has maturity at time T. The underlyingis future F(t,T) and strike is K. Then the price V of the option at time t is given by:

V = eāˆ’r(Tāˆ’t)(SteĀµ(t,T)+ 1

2Ī¾(t,T)N(d1)āˆ’KN(d2))

where N(Ā·) is standard normal cumulative distribution function;

d1 =log

(SteĀµ(t,T)+ 1

2Ī¾(t,T)/K)

āˆšĪ¾(t,T)

+12

āˆšĪ¾(t,T)

d2 = d1āˆ’āˆšĪ¾(t,T)

with Āµ(t,T) and Ī¾(t,T) in equation 3.28 and 3.31. St = eXt .

Proof. Straightfuture application of proposition 12.

The results in proposition 15 and 17 are new to our knowledge.

3.5 Reduced one-factor Gibson Schwartz Model

Like the Gabillon model, Gibson and Schwartz [1990] model is based on a two-factor diffusion. The spot price S is assumed to follow a lognormal diffusion likein the Black Scholes model but with a stochastic drift. The difference betweenGibson Schwartz model and Gabillon model lies in the second factor assump-tion. Gibson Schwartz Gibson and Schwartz [1990] assume that it is directly theconvenience yield, Ī“, that is stochastic, following a mean reverting log normaldiffusion. This leads to the following diffusion for the commodity spot priceunder measure Q:

dSS

= (rāˆ’Ī“)dt +Ļƒ1dz1 (3.32)

dĪ“ = (k(Ī±āˆ’Ī“)āˆ’Ī»Ļƒ2)dt +Ļƒ2dz2 (3.33)

where r is the risk free rate, z1 and z2 are two correlated Brownian motions.To apply the Distribution Match Method, we will do some change of variables.Let X = lnS. Using Itoā€™s lemma in equation 3.32 and 3.33, we have:

dX = (rāˆ’Ī“āˆ’12Ļƒ1

2)dt +Ļƒ1dz1 (3.34)

dĪ“ = (k(Ī±āˆ’Ī“)āˆ’Ī»Ļƒ2)dt +Ļƒ2dz2 (3.35)

Page 41: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 30

These equations can be rewritten in the following form:

dX = a(t,Ļ‰)dt +Ļƒ1dz1 (3.36)

with a(t,Ļ‰) = rāˆ’ (k(Ī±āˆ’Ī“)āˆ’Ī»Ļƒ2)tāˆ’āˆ« t

0 Ļƒ2dz2āˆ’12Ļƒ1

2.

Lemma 18. If X follows equation 3.36, then (a(t,Ļ‰),Wt) is a normal. In addition, theterm Āµ(0, t) and Ī¾(0, t) can be computed explicitly as follows:

Āµ(0, t) = rtāˆ’1kĪ“0

(1āˆ’ eāˆ’kt

)āˆ’

kĪ±āˆ’Ī»Ļƒ2

kt

+kĪ±āˆ’Ī»Ļƒ2

k

(1āˆ’ eāˆ’kt

)āˆ’

12Ļƒ1

2t (3.37)

Ī¾(0, t) = Ļƒ12t +

Ļƒ22

2k3

(2ktāˆ’ eāˆ’2kt + 4ekt

āˆ’3)

+2ĻĻƒ1Ļƒ2

k2

(āˆ’kt + 1āˆ’ eāˆ’kt

)(3.38)

Proof. We re-write the process of convenience yield Ī“ here:

dĪ“ = (k(Ī±āˆ’Ī“)āˆ’Ī»Ļƒ2)dt +Ļƒ2dz2 (3.39)

Let H = ektĪ“, then

dH = ektdĪ“+ kektĪ“dt

= ekt (kĪ±āˆ’Ī»Ļƒ2)dt + ektĻƒ2dz2

So, H = H0 +āˆ« t

0 eks(kĪ±āˆ’Ī»Ļƒ2)dt +āˆ« t

0 eksĻƒ2dz2. Consequently, we have:

Ī“ = eāˆ’ktH

= eāˆ’ktĪ“0 +

āˆ« t

0ek(sāˆ’t)(kĪ±āˆ’Ī»Ļƒ2)ds +

āˆ« t

0ek(sāˆ’t)Ļƒ2dz2 (3.40)

Ī“(t) = eāˆ’ktĪ“0 +(1āˆ’ eāˆ’kt

) kĪ±āˆ’Ī»Ļƒ2

k+Ļƒ2eāˆ’kt

āˆ« t

0eksdz2 (3.41)

Denote Y(t) =āˆ« t

0 Ī“ds.In the other hand, we can derive from equation 3.39,

Ī“(t)āˆ’Ī“(0) = (kĪ±āˆ’Ī»Ļƒ2)tāˆ’ kY(t) +Ļƒ2

āˆ« t

0dz2 (3.42)

Substitute equation 3.41 into equation 3.42, we can get the expression for Y(t).

Page 42: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 31

Y(t) =kĪ±āˆ’Ī»Ļƒ2

kt +

1kĻƒ2

āˆ« t

0dz2

āˆ’1k

((eāˆ’ktāˆ’1

)Ī“0 +

(1āˆ’ eāˆ’kt

) kĪ±āˆ’Ī»Ļƒ2

k+Ļƒ2eāˆ’kt

āˆ« t

0eksdz2

)Therefore, we can rewrite X as following:

X =

āˆ« t

0

(rāˆ’Ī“āˆ’

12Ļƒ1

2)ds +

āˆ« t

0Ļƒ1dz1

= rtāˆ’Y(t)āˆ’12Ļƒ1

2t +

āˆ« t

0Ļƒ1dz1

It is obvious that the drift is normally distributed since Y(t) is a normal.Now we calculate Āµ(0, t) and Ī¾(0, t).

Āµ(0, t) = E(X)āˆ’X0

= rtāˆ’1kĪ“0

(1āˆ’ eāˆ’kt

)āˆ’

kĪ±āˆ’Ī»Ļƒ2

kt +

kĪ±āˆ’Ī»Ļƒ2

k

(1āˆ’ eāˆ’kt

)āˆ’

12Ļƒ1

2t (3.43)

Ī¾(0, t) = var(X)

= Ļƒ12t +

Ļƒ22

2k3

(2ktāˆ’ eāˆ’2kt + 4ekt

āˆ’3)+

2ĻĻƒ1Ļƒ2

k2

(āˆ’kt + 1āˆ’ eāˆ’kt

)(3.44)

This proves lemma 18.

Now we can use proposition 10, 11 and 12 to derive following result.

Proposition 19. The future price at time t is

F(0, t) = S0eĀµ(0,t)+ 12Ī¾(0,t)

with Āµ(0,T) and Ī¾(0,T) in equation 3.37 and 3.38. S0 = eX0 .

Proof. Straightfuture application of proposition 10.

Proposition 20. The price (at time T) of a vanilla call option with strike K and maturityT is

V = S0N(d1)eāˆ’p(0,T)āˆ’KN(d2)eāˆ’rT (3.45)

Page 43: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 32

where N(Ā·) is standard normal cumulative distribution function;

d1 =log(S0/K) + rTāˆ’p(0,T)āˆš

Ī¾(0,T)+

12

āˆšĪ¾(0,T)

d2 = d1āˆ’āˆšĪ¾(0,T)

p(0,T) = rTāˆ’12Ī¾(0,T)āˆ’Āµ(0,T)

with Āµ(0,T) and Ī¾(0,T) in equation 3.37 and 3.38 and S0 = eX0 .

Proof. Straightfuture application of proposition 11.

Lemma 21. Under the same hypothesis of lemma 18. The variance of process X from tto T is

Ī¾(t,T) =1k

(1āˆ’2eāˆ’k(Tāˆ’t) + eāˆ’2k(Tāˆ’t)

)ĻƒSĻƒLĻ+

12k

(Ļƒ2

S + 4eāˆ’k(Tāˆ’t)Ļƒ2L

āˆ’3Ļƒ2Lāˆ’ eāˆ’2k(Tāˆ’t)Ļƒ2

Sāˆ’ eāˆ’2k(Tāˆ’t)Ļƒ2L + 2Ļƒ2

Lk(Tāˆ’ t))

(3.46)

Proof. Same calculation with lemma 18.

Proposition 22. Assume a vanilla call option has maturity at time T. The underlyingis future F(t,T) and strike is K. Then the price V of the option at time t is given by:

V = eāˆ’r(Tāˆ’t)(SteĀµ(t,T)+ 1

2Ī¾(t,T)N(d1)āˆ’KN(d2))

where N(Ā·) is standard normal cumulative distribution function;

d1 =log(SteĀµ(t,T)+ 1

2Ī¾(t,T)/K)āˆšĪ¾(t,T)

+12

āˆšĪ¾(t,T)

d2 = d1āˆ’āˆšĪ¾(t,T)

with Āµ(t,T) and Ī¾(t,T) in equation 3.37 and 3.46. St = eXt .

Proof. Straightfuture application of proposition 12.

Remark The results in proposition 19 and 20 are equivalent to those of Jamshid-ian and Fein [1990] and Bjerksund [1991]. The result in proposition 22 is new toour knowledge.

3.6 Summary

In this chapter, we introduce model factor reduction method. We show that adiffusion process with deterministic log-normal volatility and a normal stochastic

Page 44: Commodity Derivatives: Modeling and Pricing

CHAPTER 3. MODEL FACTOR REDUCTION TECHNIQUE 33

drift can be approximated by an equivalent one dimensional process with thesame final marginal distribution. This enables us to compute explicitly futures,vanilla call options on the spot and futures allowing for fast calibration. Insection 3.4, we show how to apply the model factor reduction technique onGabillon models and Gibson Schwartz model. This enables us to find a formulafor the Gabillon model for vanilla option allowing fast calibration. For GibsonSchwartz model, the model factor reduction method provides another proof forthe closed form solution for futures and vanilla options. In addition, as modelfactor reduction technique provides explicitly the marginal distribution, we caneasily extend our result to other common vanilla options like digitals. Modelfactor reduction technique in addition, implies that for future and vanilla option,the Gabillon model and Gibson Schwartz model are indeed equivalent to a one-factor model. This contrasts with the common belief that these models wereexplicit two-factor models.

An important consequence of model factor reduction technique is that itshows, in the sense of marginal distribution, the similarity between the originaltwo-factor model and one-factor model. This contrasts slightly with the generalopinion that commodity derivatives should be priced with two-factor models.Our result shows that in the particular case of vanilla options, Gibson Schwartzmodel and Gabillon model are indeed equivalent to one-factor models. Theequivalence holds only for this specific case and not for path dependent options.

One of the limitation of the model factor reduction method is the loss ofinformation on the dynamic of the original process. Therefore, it can not work forpath-dependent products such as Asian option and snowball option. A possiblefuture work could be on the joint distribution at different time 0< t1 < t2 < Ā· Ā· Ā·< tn

that would provide a way to calculate closed forms solutions for path dependentoptions.

Page 45: Commodity Derivatives: Modeling and Pricing
Page 46: Commodity Derivatives: Modeling and Pricing

Chapter 4

Stochastic volatility model

RƩsumƩ du chapitre

Dans le domaine de la finance quantitative pour Ć©valuer des titres dĆ©rivĆ©s,comme les options, les produits sont souvent chers et couvert Ć  lā€™aide du modĆØleBlack-Scholes, dans lequel il ya une relation univoque entre le prix de lā€™option etla volatilitĆ© ĻƒB. En thĆ©orie, la valeur de volatilitĆ© ĻƒB dans le modĆØle de Black estconstante sur la durĆ©e du produit dĆ©rivĆ©, et ne dĆ©pend pas des changements duniveau des prix du sous-jacent. Cependant, en pratique, ce nā€™est pas le cas. DoncmodĆØle Black-Scholes ne peut pas expliquer les caractĆ©ristiques de la surface devolatilitĆ© implicite comme le Ā« smile Ā» de volatilitĆ© et le Ā« skew Ā», qui indiquentque la volatilitĆ© implicite tend Ć  varier par rapport aux niveaux de prix, du strikeet de la distance de lā€™expiration.

Le dĆ©veloppement de modĆØles de volatilitĆ© locale par Dupire et Derman-Kania Ć©tĆ© une avancĆ©e majeure. Ces modĆØles Ć  volatilitĆ© locale sont auto-cohĆ©rents,sans arbitrage, et peuvent ĆŖtre calibrĆ©s pour correspondre exactement aux vola-tilitĆ©s observĆ©es. Ces modĆØles sont trĆØs populaires actuellement. Cependant, lecomportement dynamique des Ā« smiles Ā» et Ā« skew Ā» prĆ©dite par des modĆØlesĆ  volatilitĆ© locale est diffĆ©rent du comportement observĆ© dans le marchĆ©. Sur lemarchĆ©, les prix des actifs et des smiles Ć©voluent dans dans la mĆŖme direction,mais le modĆØle de volatilitĆ© locale prĆ©dit le mouvement inverse. Cette contradic-tion crĆ©e des difficultĆ©s pour la couverture en delta et vega dĆ©rivĆ©es du modĆØlede volatilitĆ© locale. Ceci est prouvĆ© dans lā€™article Ā« Gestion du risque Smile Ā» parHagan et al. [2002].

Les modĆØles citĆ©s auparavant ne permettent pas de respecter une dynamiquerĆ©aliste pour la volatilitĆ©. Les modĆØles Ć  volatilitĆ© stochastiques ainsi que leurcalibration sont bien examinĆ©es dans Javaheri [2005].

Les modĆØles Ć  volatilitĆ© stochastique dĆ©coulent du traitement de la volatilitĆ©

35

Page 47: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 36

du titre sous-jacent comme un processus alĆ©atoire, gouvernĆ© par des variablesdā€™Ć©tat tels que le niveau des prix du sous-jacent, la tendance de la volatilitĆ©Ć  revenir vers une certaine valeur Ć  long terme et la variance du processus devolatilitĆ© elle-mĆŖme, la corrĆ©lation entre le processus alĆ©atoire pour le sous-jacentet la volatilitĆ© du processus alĆ©atoires, et dā€™autres. Ces variables peuvent dĆ©crireprĆ©cisĆ©ment les Ā« smiles Ā» et Ā« skew Ā» du marchĆ©. En supposant que la volatilitĆ©du prix sous-jacent est un processus stochastique plutĆ“t que dā€™une constantequi peut ĆŖtre adaptĆ© Ć  lā€™exigence du marchĆ© en changeant les paramĆØtres etles modĆØles de formulaires, il devient possible de modĆ©liser les dĆ©rivĆ©s plusde prĆ©cision. Les modĆØles tels que ceux de Heston, Piterbarg et SABR sontles modĆØles les plus populaires actuellement pour leurs performances dans lamanipulation des smiles et skew. Dans ce chapitre on les Ć©tend Ć  des fonctionsqui dĆ©pendent du temps, afin de leur donner encore plus de souplesse. Ensuiteon les calibre et on discute leurs performances.

Page 48: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 37

4.1 Introduction

In the field of quantitative finance to evaluate derivative securities, such asoptions, products are often priced and hedged using Black-Scholes model, inwhich there is a one-to-one relation between the price of option and the volatilityĻƒB. In theory, the volatility value ĻƒB in Blackā€™s model is constant over the life ofthe derivative, and unaffected by the changes in the price level of the underlying.However in practice, options with different strikes K require different valatilitiesto match their market prices. So Black-Scholes model cannot explain long-observed features of the implied volatility surface such as volatility smile andskew, which indicate that implied volatility does tend to vary with respect tostrike price and expiration.

The development of local volatility models by Dupire and Derman-Kani wasa major advance in managing smiles and skews. Local volatility models are self-consistent, arbitrage-free, and can be calibrated to match exactly observed marketsmiles and skews. These models are very popular currently in most financialinstitution. However, the dynamic behavior of smiles and skews predicted bylocal volatility models is different from the behavior observed in the marketplace.In the market, asset prices and market smiles move in the same direction, but thelocal volatility model predicts the reverse movement. This contradiction createsdifficulties for delta and vega hedges derived from the local volatility model.This is proved in the article "Managing Smile Risk" by Hagan et al. [2002]).

Commodity specialized model is able to better capture the features such asconvenience yield, seasonality and mean reversion. Various drift terms are sug-gested in these models. These models for commodity derivatives pricing relieson a specific dynamics of some state variables from whom one can derives thecorresponding futures prices. The states variables are often assumed to followa mean reverting process to capture the mean reversion nature of commodityprices. Typical examples include Gibson and Schwartz [1990], Brennan [1991],Gabillon [1991], Schwartz [1997], Hilliard and Reis [1998], Schwartz and Smith[2000], and Casassus and Collin-Dufresne [2005]. But these models also lack arealistic dynamics for the volatility. These models as well as their calibration arewell reviewed in Javaheri [2005].

Stochastic volatility models derive from the modelsā€™ treatment of the un-derlying securityā€™s volatility as a random process, governed by state variablessuch as the price level of the underlying, the tendency of volatility to revert tosome long-run mean value, and the variance of the volatility process itself, thecorrelation between the random process for the underlying and the volatilityrandom process, and others. These variables can describe precisely the marketsmiles and skews. By assuming that the volatility of the underlying price is astochastic process rather than a constant which can be adapted to the marketrequirement by changing the parameters and model forms, it becomes possible

Page 49: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 38

to model derivatives more accurately. The models such as Heston model, Piter-barg model and SABR model are the most popular stochastic volatility modelsstudied currently for their performance in handling the smiles and skews.

4.2 Review of research

Indeed, various authors had already noticed the importance of stochastic volatil-ity for commodity modeling. This includes the work of Eydeland and Geman[1998], Richter and SĆørensen [2002], Nielsen and Schwartz [2004], and Trolleand Schwartz [2008] who explicitly allow for stochastic volatility for the corre-sponding state variables. Their models are often an adapted version of someexisting stochastic volatility models already developped for another asset thancommodity. For instance, Eydeland and Geman [1998] adapt an Heston modelto commodity. Trolle and Schwartz [2008] adapt their corresponding interestrates HJM model.

This suggest first to study the various stochastic models already developedin equity option pricing and see how to adapt them to commodity issue. Thenwe need to benchmark these models in the case of commodity to assess the fitor the consistency of these models for this market. These two questions are themotivations of this work.

Stochastic volatility modeling has been studied for quite some time. Amongthe precursors, Hull and White [1990] and Wiggins [1987] studied lognormaldistributed stochastic volatility. The Hull-White model was developed usinga trinomial lattice, although closed-form solutions for European-style optionsand bond prices are possible. They assumed the correlation between spot priceand volatility is zero. But from the market data, evidence showed that the twoprocesses are strongly correlated.

Wiggins studied the same model with Hull and White, but with the non-zerocorrelation. The advantage of the model is that the volatility is always positiveby assuming geometric Brownian motion.

Scott [1987], and then Stein and Stein [1991] studied an Ornstein Uhlen-beck process (OU process) for volatility. OU process adds mean reversion intovolatility. It is more natural to assume the volatility mean reversion rather thanlognormal. The disadvantage of the model is that, the model cannot guaranteepositiveness. That is to say, if volatility follows an OU process, then there ischance that volatility goes to negative.

Heston [1993] studied a special case of stochastic volatility model. He sup-posed the square of volatility follows an OU process. This assumption resolvedthe problem that volatility goes to negative. Another advantage of Heston modelis that, it allows correlation between spot and volatility. The model has also asemi-analytic formula, which makes it practical to use.

Page 50: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 39

Richter and SĆørensen [2002] and Trolle and Schwartz [2008] studied the un-spanned stochastic volatility model with abundant data analysis. It is an appli-cation of perturbation theory. They expand the formula of price around vol ofvol near zero.

In dynamic stochastic volatility models, parameters depend on the time t.The process for the forward and the volatility become much more complicatethan in constant parameter case. In reality, we only need to price products forgiven dates, so we intend to consider the parameters as piecewise constant ones.With this assumption, it is possible to find out closed pricing formulas whichare used later for the calibration. There are several methods as we noted before:firstly, compute the effective mean value of the parameters so that we can use theclosed formula of the constant stochastic volatility model by replacing the timedependent parameters with their effective mean values. This method consiststo find out the effective medium value of each parameter who depends on time.This method is employed by Piterbarg [2005]. Secondly as another method, byresolving the PDEs, we can compute the option price directly by an integrationof a function which is continuous on time t. Theoretically, we can use both thesetwo methods for each model. But as each model has its different characteristics,one method should be easier than another. For SABR model in the case ofconstant Ī², the first method is easier that the second. But we can use the secondmethod to price an option with SABR model in the case of time dependent Ī².For Heston model and Piterbarg model with correlation Ļ, we use the secondmethod, and for Piterbarg model without correlation, we can use both of them.In the following, we will present one by one the methods used for each model.

Stochastic volatility models with time dependent parameters are derivedfrom stochastic volatility models with constant parameters to be able to priceproducts with long maturities. Different periods have variant market smilesand skews, which need different parameter values to match exactly the curves.In reality, the parameters in the models, especially correlation Ļ between theprocess of underlying and the process of volatility and parameters related to thesmile and skew, depend on time t. As we only need the model values for certaindates, we can consider the parameters as piecewise functions of time t. This isthe approach taken for all the models.

Although jump can relax the need of time-dependent parameter, it cannotfulfill the need completely, especially for the options with long maturity. It isbecause the expectation of jumps tends to flatten in long time. We donā€™t considerjump in this article, interested readers can refer to Andersen and Andreasen[2000] and Benhamou et al. [2008].

However in practical, SV models are usually difficult to use due to difficultiesof calibration. The difficulty is obvious: 1. too many parameters to calibrate:an SV model usually has more than 5 parameters; 2. some of the parameters

Page 51: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 40

cannot be observed from the market. Therefore we cannot perform the sameprocedure of calibration as we do for BS model or LV models. To calibratean SV model, the common way is to minimize the vector distance betweentheoretical price (or implied BS vol) and the market price(or implied BS vol).One of the key points here is how to calculate the theoretical price fast andprecisely. The numerical algorithms such as partial differential equations (PDEs)or Monte Carlo is too time-consuming to use in calibration, even for the simplestEuropean-style options. For this reason, we investigate closed form formula forEuropean options.

A closed form formula is typically available for SV models with constantparameters. A series of references are available on this topic, such as Andersenand Brotherton-Ratcliffe [1998], Hagan et al. [2002], Zhou [2003] and Andersenand Andreasen [2002]. Lewis summarizes the Fourier formed solution for staticmodels in his book Lewis [2000]. There are closed formulas for all the commonSV models including square root model, GARCH model and 3/2 model.

Here we have some clarification on the term ā€œclosed form formulaā€. Thedefinition of the term ā€œclosed form formulaā€ can vary from person to person.A solution is said to be a closed form solution if it is in terms of functions andmathematical operations from a given generally accepted set. However, thechoice of what to call closed form is rather arbitrary. For example, an infinitesum is be called a closed form in certain context. In this paper, we call the Fouriertransform integration a closed form function. This kind of infinite integration isalso called semi-analytical solution in some references as well.

For an SV model with time-dependent parameters, saying dynamic SVmodel, the closed form formula is more difficult to derive. Several differentmethods have been reported, such as asymptotic expansion and character func-tion. References for the former one includes Labordere [2005] and Osajima[2007]. For the latter, readers can refer to Mikhailov and Nogel [2003a]. Somereferences suggest piecewise constant parameters since market data in practicalis only available on certain expiries. It is a natural assumption because there is noinformation about how the parameters behave between two adjacent expiries.This assumption can considerably simplify the closed form formula withoutlosing any generality.

4.3 Heston model

Heston [1993] supposes that spot price St follows a log-normal process and itsvariance Vt also follows a log-normal process.

dSt

St= Āµdt +

āˆšVtdW1 (4.1)

dVt = k(Īøāˆ’Vt)dt +Ī¾āˆš

VtdW2 (4.2)

Page 52: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 41

where W1 and W2 are two standard Brownian motion with correlation Ļ; Āµ is driftterm; Īø is the mean reversion of variance process; k is speed of mean reversion;Ī¾ is vol of vol parameter. Another model parameter is V0, the initial value of thevariance process.

Using Fourier transform and characteristic function, we can get the formulafor vanilla option of Heston model as in Heston [1993] and Mikhailov and Nogel[2003b]. The idea is to write the solution in the same form as Black Scholesformula and assume the characteristic function has the form as eC+DV0+iĻ† f . Bysolving C and D , we can have the final result.

4.3.1 Formula for vanilla option

Case 1: Heston formula This is the method developed by Steven L. Heston in"A Closed-Form Solution for Options with Stochastic Volatility with Applica-tions to Bond and Currency Options" Heston [1993].

Firstly we guess that the option value of a call has the form

C(S,V, t) = SP1āˆ’KP2

Note x = logF(t,T), from the PDEs that C satisfies (that we can get by applyingthe Itoā€™s formula), we get the PDE for P1 and P2:

12

Vāˆ‚2P j

āˆ‚x2 +ĻĪ¾Vāˆ‚2P j

āˆ‚xāˆ‚V+

12Ī¾2V

āˆ‚2P j

āˆ‚V2 + u jVāˆ‚P j

āˆ‚x+ (a jāˆ’b jV)

āˆ‚P j

āˆ‚V+āˆ‚P j

āˆ‚t= 0

for j = 1, 2, where u1 = 1/2, u2 = āˆ’1/2, a = kĪø, b1 = kāˆ’ĻĪ¾, b2 = k with the initialcondition P j(x,V,T; logK) = 1xā‰„logK when t = T.

P j can be considered as the conditional probability of x j that the option expiresin the money:

P j = Proba[x j(T) ā‰„ logK

āˆ£āˆ£āˆ£x j(0) = log f ,v(0) = V(0)]

where x j follows:

dx j(t) = u jv(t)dt + x j(t)āˆš

v(t)dW1

dv(t) = (aāˆ’b j)dt +Ī¾āˆš

v(t)dW2

< dW1,dW2 > = Ļdt

We canā€™t get the solutions for P j immediately. So we introduce the characteristicfunctions f j(x,V, t;Ļ†) of their probability density function, we have by usingLĆ©vyā€™s theorem to get the distribution function from characteristic function:

Page 53: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 42

P j(x,V, t,T; log(K)) = 1āˆ’Proba[x(T) ā‰¤ log(K)

āˆ£āˆ£āˆ£x(t) = x,v(t) = v]

(4.3)

= 1/2 +1Ļ€

āˆ«āˆž

0Re

eāˆ’iĻ†ln(K) f j(x,V, t,T;Ļ†)iĻ†

dĻ† (4.4)

where f j(x,V, t;Ļ†) satisfies:

12

Vāˆ‚2 f j

āˆ‚x2 +ĻĪ¾Vāˆ‚2 f j

āˆ‚xāˆ‚V+

12Ī¾2V

āˆ‚2 f j

āˆ‚V2 + u jVāˆ‚ f j

āˆ‚x+ (a jāˆ’ b jV)

āˆ‚ f j

āˆ‚V+āˆ‚ f j

āˆ‚t= 0 if 0 ā‰¤ t < T

f j(x,V,T; logK) = eiĻ†x if t = T

The solutions for f j(x,V, t,T;Ļ†) can be found by introducing:

f j(x,V, t,T;Ļ†) = eC(Tāˆ’t;Ļ†)+D(Tāˆ’t;Ļ†)V+iĻ†x

where C j(Ļ„;Ļ†) and D j(Ļ„;Ļ†) with Ļ„ = Tāˆ’ t satisfy the PDEs following:dC j(Ļ„;Ļ†)

dĻ„āˆ’ aD(Ļ„,Ļ†) = 0

dD j(Ļ„;Ļ†)dĻ„

āˆ’

Ī¾2D2j (Ļ„;Ļ†)

2+ (b jāˆ’ĻĪ¾Ļ†i)D j(Ļ„,Ļ†)āˆ’u jĻ†i +

12Ļ†2 = 0

And C j(0;Ļ†) = 0, D j(0;Ļ†) = 0 .

The solutions for C j(Ļ„;Ļ†) and D j(Ļ„;Ļ†) are:

C j(Ļ„;Ļ†) =aĪ¾2

(b jāˆ’ĻĪ¾Ļ†i + d j

)Ļ„āˆ’2log

1āˆ’ g jed jĻ„

1āˆ’ g j

D j(Ļ„;Ļ†) =

b jāˆ’ĻĪ¾Ļ†i + d j

Ī¾2 Ā·1āˆ’ ed jĻ„

1āˆ’ g jed jĻ„

g j =b jāˆ’ĻĪ¾Ļ†i + d j

b jāˆ’ĻĪ¾Ļ†iāˆ’d j

d j =āˆš

(ĻĪ¾Ļ†iāˆ’ b j)2āˆ’Ī¾2(2u jĻ†iāˆ’Ļ†2)

Case 2: Lewis formula Another method using Fourier Transform is the methoddeveloped by Lewis [2000]. Instead of adapting to the Black Scholes form asHeston, we do directly Fourier Transform to the PDE of the option value Cfollowing:

āˆ’Ct =12

VF2CFF + (Ļ‰āˆ’ kV)CV +12Ī¾2VCVV +ĻĪ¾VFCFV if 0 ā‰¤ t < T

C = (Sāˆ’K)+ if t = T

Page 54: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 43

Here CFF, CV, CVV and CFV are the partial derivation of option price C withrespect to forward F(t,T) and variance V.

It can be written by noting x = logF(t,T) and Ļ„ = Tāˆ’ t:

CĻ„ =12

VCxxāˆ’12

VCx + (Ļ‰āˆ’ kV)CV +12Ī¾2VCVV +ĻĪ¾VCxV if Ļ„ > 0

C = (S(T)āˆ’K)+ if Ļ„ = 0

Then by computing the Fourier Transform C =āˆ«āˆž

āˆ’āˆžeiĻ†xC(x,V, t)dx:

CĻ„ = āˆ’12

VĻ†2Cāˆ’ i12

VĻ†C + (Ļ‰āˆ’ kV)CV +12Ī¾2VCVV + iĻĪ¾VĻ†CV

=(āˆ’

12

VĻ†2 + i12

VĻ†)C +

(Ļ‰āˆ’ kV + iĻĪ¾V

12Ļ†

)CV +

12Ī¾2VCVV (4.5)

for Ļ„ > 0. And the initial condition for Ļ„ = 0 is:

Financial Claim Payoff Function Payoff Transform Ļ†-plane RestricitonsCall

option max[S(T)āˆ’K,0] āˆ’KiĻ†+1

Ļ†2āˆ’iĻ† Im(Ļ†) > 1

Putoption max[Kāˆ’S(T),0] āˆ’

KiĻ†+1

Ļ†2āˆ’iĻ† Im(Ļ†) < 0

Here we introduce the fundamental transform H(Ļ†,Ļ„,V) who satisfies the sameequation as C but with an initial condition: H(Ļ†,0,V) = 1. Then C can be writtenas : C = H(0,V)*Payoff Transform, and:

C( f ,V,Ļ„) = f āˆ’K2Ļ€

āˆ« iĻ†i+āˆž

iĻ†iāˆ’āˆž

eāˆ’iĻ†X H(Ļ†,Ļ„,V)Ļ†2āˆ’ iĻ†

dĻ† (4.6)

To get the solution for H(Ļ†,Ļ„,V), we guess the form H(Ļ†,Ļ„,V) = exp( f1 + f2V),and we have the PDEs for f1 and f2:

d f1dĻ„

= Ļ‰ f2

d f2dĻ„

= āˆ’12Ļ†2 + i

12Ļ†+ (āˆ’k + iĻĪ¾Ļ†) f2 +

12Ī¾2 f 2

2

with initial conditions f1(0) = 0 and f2(0) = 0. Here Ļ‰ = kĪø.

Page 55: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 44

The solutions for f1 and f2 are:

f1 = Ļ‰

[tgāˆ’ log

(1āˆ’hexp(dt)

1āˆ’h

)]f2 = g

(1āˆ’exp(dt)

1āˆ’hexp(dt)

)d = [Īø2 + 4c]1/2

g =12

(Īø+ d)

h =Īø+ dĪøāˆ’d

where Īø(k) = 2Ī¾2

[(1āˆ’Ī³+ iĻ†)ĻĪ¾+

āˆšk2āˆ’Ī³(1āˆ’Ī³)Ī¾2

], t = 1

2Ī¾2Ļ„, Ļ‰= 2

Ī¾2Ļ‰, and c =Ļ†2āˆ’iĻ†Ī¾2 .

The advantage of Lewis formula is that it has a better numerical convergence.The reason is obvious: it has only one integral instead of two in Mikhailovformula. The following figure shows a test between two formulas. We calculatethe same option and integral interval is set to (0,M). M = 2,3, Ā· Ā· Ā· ,12. For Lewisformula, an interval (0,6) is enough to converge, while Mikhailov formula needsan interval (0,12) to converge.

Figure 4.1: Option price with different formula under Heston model

4.3.2 Control variate in Heston model

This section focuses on the instability of the semi-analytical solution of Hestonmodel. We notice the numerical difficulty on the infinitive integral. When wecut the integral to finite, it will cause a systematic error. We therefore propose amore stable solution to overcome this difficulty. Let us consider a similar model

Page 56: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 45

with deterministic volatility. Instead of integrating the original function, weintegrate the difference between the original function and the function in case ofdeterministic. As the new integrand is smoother, we get a better convergence.Following this idea, we will show more detail in both Mikhailov formula andLewis formula.

Our improvement is supported in the section of empirical result. It solves thefailure in calibration caused by instability, especially with in-the-money options.

Here we consider a modified Heston model with deterministic volatility. InHeston model we take vol of vol Ī¾ = 0 and get the following model:

dS(t)DET

S(t)DET= Āµdt +

āˆšV(t)DETdW1 (4.7)

dV(t)DET = k(Īøāˆ’V(t)DET

)dt (4.8)

The objective is to improve the closed formula solution of the option pricewith Heston Stochastic Model. As the volatility follows a process stochastic,the convergency of the price formula is not very strong. We try to find out avariate that increases the convergency of the pricing formula. The variate that wechoose is the price of an option with Heston model but in which the volatility ofvolatility is 0. This method is used both for Heston formula and Lewis formula.

Case1: Heston Formula The Heston model without volatility of volatility hasthe process for the square of volatility:

dV(t) = k[Īøāˆ’V(t)]dt (4.9)

which is a deterministic process. Noting V the option price of Heston modelwithout volatility of volatility Ī¾, V can be written in two ways:

V = SP1āˆ’KP2

where P j satisfies the same PDEs as P j by setting Ī¾ to 0. And:

V = SN(d1)āˆ’KN(d2) (4.10)

where N is the Gaussian distribution, and by noting ĻƒT the total volatility (def-inition in equation 4.11) of the Heston model without volatility of volatility, wehave

d1 =logS/KĻƒT

+12ĻƒT

d2 =logS/KĻƒT

āˆ’12ĻƒT

Page 57: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 46

To get the total volatility ĻƒT, as we know dV(t) = k[Īøāˆ’V(t)]dt and V(t) is thesquare of volatility, then:

(ĻƒT)2 =

āˆ« T

0V(t)dt

= ĪøT +V0āˆ’Īø

k(1āˆ’ eāˆ’kT)

And so

ĻƒT =

āˆšĪøT +

V0āˆ’Īøk

(1āˆ’ eāˆ’kT) (4.11)

By resolving the PDEs of C j and D j with Ī¾ = 0, let DDETj and CDET

j the solutionsin the case Ī¾ = 0, we have:

DDETj =

u jĻ†iāˆ’1/2Ļ†2

b j(1āˆ’ eāˆ’b jĻ„)

CDETj = a

u jĻ†iāˆ’1/2Ļ†2

b2j

(eāˆ’b jĻ„āˆ’1) +(u jĻ†iāˆ’1/2Ļ†2)Ļ„

b j

And we can have the characteristic functions f DET

j of the density function of P j:

f DETj (x,V,Ļ„;Ļ†) = eCDET

j (Ļ„;Ļ†)+DDETj (Ļ„;Ļ†)V+iĻ†x (4.12)

We have two different way to calculate the option price with vol of vol = zero.One is on closed formula in equation 4.10. The other is on integral form withcharacteristic functions in equation 4.12. Now the option price can be expressedby adding and minus the two form on the original formula.

C( f ,V(0), t) = f ĖœP1āˆ’K ĖœP2 + [SN(d1)āˆ’KN(d2)]āˆ’Sāˆ’K

2(4.13)

where

ĖœP j = 1/2 +1Ļ€

āˆ«āˆž

0Re

eāˆ’iĻ†ln[K]( f j(x,V,T;Ļ†)āˆ’ f DETj )

iĻ†

dĻ†

The control variate method improve much the calibration process. For Hestonstatic model, without control variate, we usually canā€™t success the calibrationprocess. We know the vega : vega = dC

dV is very small when we are in the moneyas we can see in a graph (figure 4.2) for vega with strike = 110.

The Heston formula without control variate calculates the option price in themoney with a difference which results a greater error for the implied volatility.Because we calibrate to the market implied volatility, we canā€™t reach the givencalibration level with this formula since the error is accumulated in the calibrationloop to make the whole process fail. To illustrate the effect of control variate,

Page 58: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 47

Figure 4.2: vega and spot

Figure 4.3: Effect of control variate to implied volatility - without control variate

by taking the same calibrated parameters, here we have the implied volatilitycalculated with the Heston formula without control variate at left and the Hestonformula with control variate in figure 4.3 and 4.4.

We see from the figure above that the Heston formula gives a very big errorfor the case when the spot is very small. The error is bigger if we have evensmaller spots. This difference is enough for that we canā€™t success the calibration.

Page 59: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 48

Figure 4.4: Effect of control variate to implied volatility - with control variate

The error shows that it is more difficult for in-the-money option since vega ismuch smaller in this area.

Case2: Lewis Formula We have the same process for the volatility as in thecase of Heston formula. Note C the option price of Heston model with Ī¾ = 0,then we get: CĻ„ =

12

VCxxāˆ’12

VCx + (Ļ‰āˆ’ kV)CV if Ļ„ > 0

C = ( f (T)āˆ’K)+ if Ļ„ = 0

where Ļ‰ = kĪø, by doing Fourier Transform of C: Ė†C =āˆ«āˆž

āˆ’āˆžeiĻ†xC(x,V, t)dx, we get:

Ė†CĻ„ = āˆ’12

Vk2 Ė†C + i12

Vk Ė†C + (Ļ‰āˆ’ kV) Ė†CV

for Ļ„ > 0 and the initial condition for Ļ„ = 0 is the same as in the case withĪ¾ , 0.Suppose Ė†C = H(Ļ†,Ļ„,V)āˆ—Payoff Transform, H(Ļ†,Ļ„,V) = exp( f1 + f2V) and H(Ļ†,0,V) =

1 as initial value, we get easily:

Page 60: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 49

d f1dĻ„

= Ļ‰ f2

d f2dĻ„

= āˆ’12Ļ†2 + i

12Ļ†āˆ’ k f2

with Ļ‰ = kĪø And their solutions:

f1 =Ļ†iāˆ’Ļ†2

2k(1āˆ’ eāˆ’kĻ„)

f2 = Ļ‰

(Ļ†iāˆ’Ļ†2

2k2 (eāˆ’kĻ„āˆ’1) +

(Ļ†iāˆ’Ļ†2)Ļ„2k

)The same as for Heston Formula, we can get the option price by computing

the total volatility and use Blackā€™s formula:

C = fN(d1)āˆ’KN(d2)

Finally, we have the option price formula:

C( f ,V,Ļ„) = āˆ’K2Ļ€

āˆ« iĻ†i+āˆž

iĻ†iāˆ’āˆž

eāˆ’iĻ†X HĻ†2āˆ’ iĻ†

dĻ†+ [ fN(d1)āˆ’KN(d2)]

where:H = H(Ļ†,Ļ„,V)āˆ’ H(Ļ†,Ļ„,V)

and H(Ļ†,Ļ„,V) is the fundamental function that we calculated for Lewis for-mula(See the section 3.2.2).

The figure 4.5 shows the numerical result of using control variate. We canspot the zig-zag effect of the price for the formula without control variate. Thecontrol variate makes the price curve smoother and more quickly convergence.

4.3.3 Discussion of numerical stability

As function log(Ā·) is not continuous in C++ for complex numbers, we have topay attention during the implementation by using simple a method to track theangle in order to keep it continuous. For doing this, we can use the methodexplained in Kahl and Jackel [2005].

The formulas we have here can be written in another way by changing d toāˆ’d.This is because d is the square root of a complex number and the principal squareroot value is returned for d. The principal square root value makes f pass acrossthe negative real axis when increasing Ļ† and hence leads to a discontinuousfunction causing numerical trouble. In contrast, when we change d to āˆ’d, wecan go around this difficulty. The proof is given in Albrecher et al. [2007].

The numerical problem of solution (4.4) and (4.6) is that it is a semi-analyticsolution. In practical use, we must replace the infinitive integral to an integral

Page 61: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 50

Figure 4.5: Option price with different formula under Heston model

from 0 by a large number M. It causes a systematic error. The structure of Vegaalso amply the pricing error, especially for the case of in the money options. Theempirical result is also given in Albrecher et al. [2007].

4.3.4 Dynamic Heston Model

For Heston stochastic volatility model with piecewise parameters, we use thesame method as in the case of Heston static stochastic volatility model byMikhailov and Nogel [2003b]. The model is presented as:

dF(t) = F(t)āˆš

V(t)dW1

dV(t) = k(t)(Īø(t)āˆ’V(t))dt +Ī¾(t)āˆš

V(t)dW2 (4.14)

< dW1,dW2 > = Ļ(t)dt

Suppose that we have N periods with time intervals [T0,T1], [T1,T2], [T2,T3],...., [TNāˆ’1,TN] where TN = T, and T0 = t, and in each subinterval of time the pa-rameters are constant. We note the parameters: k1, k1, k1, ..., k1, Īø1, Īø2, ...., ĪøN, Ļ1 ,Ļ2, ..., ĻN, and Ī¾1, Ī¾2, ..., Ī¾N. As we use the inverse time Ļ„ = Tāˆ’ t, the subintervalsfor Ļ„ become [0,TN āˆ’TNāˆ’1], [TN āˆ’TNāˆ’1,TN āˆ’TNāˆ’2], ..., [TN āˆ’T1,TN āˆ’ t].

Return to the Heston static model, the option price V = f P1āˆ’KP2, to calculateP1 and P2, we introduce their characteristic functions f1 and f2, and f j ( j = 1 or2) is presented as f j = eC j(Tāˆ’t;Ļ†)+D j(Tāˆ’t;Ļ†)V+iĻ†x. Our final task is to resolve theequations of C j(Ļ„,Ļ†) and D j(Ļ„,Ļ†):

Page 62: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 51

dC j(Ļ„;Ļ†)

dĻ„āˆ’ a(Tāˆ’Ļ„)D(Ļ„,Ļ†) = 0

dD j(Ļ„;Ļ†)dĻ„

āˆ’

Ī¾2D2j (Ļ„;Ļ†)

2+

(b j(Tāˆ’Ļ„)āˆ’Ļ(Tāˆ’Ļ„)Ī¾(Tāˆ’Ļ„)Ļ†i

)D j(Ļ„,Ļ†)āˆ’u jĻ†i +

12Ļ†2 = 0

where a(T āˆ’ Ļ„) = k(T āˆ’ Ļ„)Īø(T āˆ’ Ļ„), b1(T āˆ’ Ļ„) = k(T āˆ’ Ļ„)āˆ’ Ļ(T āˆ’ Ļ„)Ī¾(T āˆ’ Ļ„) andb2(Tāˆ’Ļ„) = k(Tāˆ’Ļ„).

As the parameters are constant in each subinterval of Ļ„, note i the ith subin-terval of Ļ„, 1 ā‰¤ i ā‰¤N, we rewrite the equations for C j and D j in the ith subinterval[TN āˆ’TN+1āˆ’i,TN āˆ’TNāˆ’i] of Ļ„ where we note Ci

j and Dij and Ļ„ = Ļ„āˆ’ (TN āˆ’TN+1āˆ’i):

dCi

j(Ļ„;Ļ†)

dĻ„āˆ’ aN+1āˆ’iDi

j(Ļ„,Ļ†) = 0

dDij(Ļ„;Ļ†)

dĻ„āˆ’

Ī¾2(Dij(Ļ„;Ļ†))2

2+ (bN+1āˆ’i

j āˆ’ĻN+1āˆ’iĪ¾N+1āˆ’iĻ†i)Dij(Ļ„,Ļ†)āˆ’u jĻ†i +

12Ļ†2 = 0

where bN+1āˆ’i1 = kN+1āˆ’iāˆ’ĻN+1āˆ’iĪ¾N+1āˆ’i, bN+1āˆ’i

2 = kN+1āˆ’i and 0ā‰¤ Ļ„ā‰¤ TN+1āˆ’iāˆ’TNāˆ’i.

The initial conditions are Cij(0), Di

j(0) are to be determined later.

The solution for the previous equations are:

Cij(Ļ„;Ļ†) =

aN+1āˆ’i

Ī¾2N+1āˆ’i

(bN+1āˆ’ij āˆ’ĻN+1āˆ’iĪ¾N+1āˆ’iĻ†i + d j

)Ļ„āˆ’2log

1āˆ’ g jed jĻ„

1āˆ’ g j

+ Ci

j(0)

Dij(Ļ„;Ļ†) =

bN+1āˆ’ij āˆ’ĻN+1āˆ’iĪ¾N+1āˆ’iĻ†i + d jāˆ’ (bN+1āˆ’i

j āˆ’ĻN+1āˆ’iĪ¾N+1āˆ’iĻ†iāˆ’d j)g jed jĻ„

(1āˆ’ g jed jĻ„)Ī¾2N+1āˆ’i

g j =bN+1āˆ’i

j āˆ’ĻN+1āˆ’iĪ¾N+1āˆ’iĻ†i + d jāˆ’Dij(0)Ī¾2

N+1āˆ’i

bN+1āˆ’ij āˆ’ĻN+1āˆ’iĪ¾N+1āˆ’iĻ†iāˆ’d jāˆ’Di

j(0)Ī¾2N+1āˆ’i

d j =āˆš

(ĻN+1āˆ’iĪ¾N+1āˆ’iĻ†iāˆ’bN+1āˆ’ij )2āˆ’Ī¾2

N+1āˆ’i(2u jĻ†iāˆ’Ļ†2)

For the first interval i = 1, the same as in the constant case, the initial conditionsare:

C1j (0) = 0,

D1j (0) = 0.

C j and D j are continuous on Ļ„, so for the second subinterval of Ļ„, we take thefinal value of the first subinterval as initial value. We do the same procedure for

Page 63: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 52

each subinterval of Ļ„, every time we take the final value of previous interval asinitial value, this means for i > 2:

Cij(0) = Ciāˆ’1

j (TN+2āˆ’iāˆ’TN+1āˆ’i;Ļ†),

Dij(0) = Diāˆ’1

j (TN+2āˆ’iāˆ’TN+1āˆ’i;Ļ†).

The final values of the last subinterval of Ļ„: CNj (T1 āˆ’ t;Ļ†) and DN

j (T1 āˆ’ t;Ļ†)equal to C j(Tāˆ’ t;Ļ†) and D j(Tāˆ’ t;Ļ†) that we searched for.

Control variate on Heston Dynamic Model

In the Heston static model, we used control variate to improve the closed formulasolution, and we used some methods to resolve the discontinuity problemsduring the implementation.

The discontinuity problems exist also in Heston dynamic model, and we usethe same methods as we cited in section 3.2.4 to handle them.

The control variate that we use here is an option which follows the Hestondynamic model without the volatility of volatility, which means Ī¾(t) = 0. So thevolatility part is a deterministic part instead of a volatility process:

dV(t) = k(t)(Īø(t)āˆ’V(t))dt

In this part, we use the same notation as for the Heston static model. As pa-rameters are piecewise constant, we compute CDET

j and DDETj in each subinterval

of Ļ„. The only difference is that the initial values for each subinterval (DDETj (0),

CDETj (0)) are not 0 except for the first subinterval. So we have the solutions for

each subinterval of Ļ„ as following:

DDETj =

u jĻ†iāˆ’1/2Ļ†2

b j

(1āˆ’ eāˆ’b jĻ„

)+ DDET

j (0)eāˆ’b jĻ„

CDETj = a

u jĻ†iāˆ’1/2Ļ†2

b jāˆ’DDET

j (0)

1b j

(eāˆ’b jĻ„āˆ’1

)+

(u jĻ†iāˆ’1/2Ļ†2)Ļ„b j

+ CDETj (0)

And the final values of the last subinterval are the values of CDETj (Tāˆ’ t;Ļ†) and

DDETj (Tāˆ’ t;Ļ†) that we need to compute f DET

j .The other way to calculate the option value by using the total volatility ĻƒT is

quite the same as in the Heston static model. We can write:

Page 64: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 53

(ĻƒT)2 =

āˆ« T

0V(t)dt

=

Nāˆ‘i=1

Īøi(Tiāˆ’Tiāˆ’1) + (V0i āˆ’Īøi)

(āˆ’

1ki

)(eāˆ’kiTi āˆ’ eāˆ’kiTiāˆ’1

)where

V0i = V(Tiāˆ’1) = Īøiāˆ’1 + (V0

iāˆ’1āˆ’Īøiāˆ’1)eāˆ’kiāˆ’1(Tiāˆ’1āˆ’Tiāˆ’2)

and V01 = V0.

The same way as in section 3.2.3, the option price can be expressed by:

C( f ,V(0), t) = f ĖœP1āˆ’K ĖœP2 + [ fN(d1)āˆ’KN(d2)]āˆ’f āˆ’K

2(4.15)

where

ĖœP j = 1/2 +1Ļ€

āˆ«āˆž

0Re

eāˆ’iĻ† logK

(f j(x,V,T;Ļ†)āˆ’ f DET

j

)iĻ†

dĻ†

4.4 Piterbarg model

In this section we extend the closed form formula for square root stochasticvolatility model with time-dependent parameters. An earlier result by Piterbarghas two constraints on parameters: 1. the correlation between the stochasticprocesses of underlying and volatility must be zero; 2. mean reversion parameteris constant. By employing our new calculation, we are able to remove the twoconstraints. The formula can facilitate the calibration of the model.

In the second part, we show an application. We use the result that we haveobtained and extend the closed form formula of another stochastic volatilitymodel, SABR model, by removing the constraint on the parameter ā€œskew".

One of the recent results on square root model is the method of ā€œeffective"parameters, introduced by Piterbarg [2005] Piterbarg [2003]. The author suggestsreplacing the time-dependent parameters by their average so that the model canbe converted into a static one. The kernel calculation shows how to get theaverages from the piecewise parameters in order to approximate the originalmodel as closely as possible. The method is lightweight comparing to asymptoticexpansion method and still have good performance.

However, there are two constraints on parameters in the calculation: 1. thecorrelation between the stochastic processes of underlying and volatility must

Page 65: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 54

be zero; 2. mean reversion parameter is constant. To remove these constraints,we employ Malliavin calculus and perform integration by parts on randomvariables.

The main result of this paper includes two different closed form formula forsquare root model. In the first formula, we follow the same line with Piterbarg,and find all the effective parameters. The other way is to transform the model toHeston model, of which the closed form formula is already known. Interestingly,when changing the parameters, we have a constraint: to keep the continuity ofthe underlying process, the parameter ā€œskewā€ must be constant, and to averagethe skew we once more find the need of zero correlation. In a word, it turns outto be the same question with the previous formula. Therefore, the solution forthe first formula can apply to the second.

This article is organized in the following way. In chapter 2 we present squareroot SV model. In chapter 3 we see the transform to Heston model. In chapter 4we review Piterbargā€™s result. Chapter 5 is the main result: we show the formulawithout the previous constraint. In the last chapter, we show one application onSABR model.

4.4.1 Diffusion and formula for vanilla option

In this section, we present the square root stochastic volatility model. Supposethe price of an underlying S(t) follows a shifted log-normal diffusion.

dS(t) = Ļƒ(Ī²S(t) + (1āˆ’Ī²)S(0))āˆš

z(t)dU(t) (4.16)

dz(t) = Īø(z(0)āˆ’ z(t))dt +Ī³āˆš

z(t)dV(t) (4.17)

where d怈U,V怉 = Ļdt.

In this model, z(t) is the variance of S(t); it follows a CIR process. z(0) is theinitial value of variance z(t). Ļƒ is the deterministic part of the volatility. Ī² is calledskew parameter; usually, 0 6 Ī² 6 1; when Ī² = 1, the process is log-normal; whenĪ² = 0, the process is normal. Īø is the mean reversion of the volatility process. Ī³is the volatility of volatility. U(t) and V(t) are two Brownian motions and theircorrelation is Ļ.

We can extend the model (4.16), (4.17) with time-dependent parameters. Todistinguish the extended model from the original one, we call model (4.16), (4.17)static model and call extended model dynamic model. The dynamic model isthe primary target in this article.

dS(t) = Ļƒ(t)(Ī²(t)S(t) + (1āˆ’Ī²(t))S(0))āˆš

z(t)dU(t) (4.18)

dz(t) = Īø(t)(z(0)āˆ’ z(t))dt +Ī³(t)āˆš

z(t)dV(t) (4.19)

where d怈U,V怉 = Ļ(t)dt.

Page 66: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 55

The parameters of the model are continuous. But for practical use, we prefer tosupposing the parameters to be piecewise constant. It is because in calibration wefit the price of options on certain maturities. There is no information about howthe parameters evolve between the maturities. So the assumption of piecewiseconstant parameters wonā€™t damage the generality of the model.

More precisely and mathematically, we can present the assumption of piece-wise constant parameters in the following way. Suppose that we have n dif-ferent maturities t1, t2, ..., tn. Therefore we divide the parameters in n periods:0 = t0 < t1 < t2 < Ā· Ā· Ā· < tn = T. When tiāˆ’1 < t 6 ti, the parameters are constant on theinterval, we note Ļƒ(t) = Ļƒi, Ī³(t) = Ī³i, Ļ(t) = Ļi, Īø(t) = Īøi ,Ī²(t) = Ī²i.

Now we show the closed formulas of European options on our model. Firstwe give the formula for the static model, then for the dynamic model. In bothcases, we suppose that the option has strike K and maturity T.

Closed formula for static model

We list the closed formula for static model in this subsection. It will be refer-enced in following sections.

We rewrite the static model (4.16), (4.17) here.

dS(t) = Ļƒ(Ī²S(t) + (1āˆ’Ī²)S(0))āˆš

z(t)dU(t)

dz(t) = Īø(z(0)āˆ’ z(t))dt +Ī³āˆš

z(t)dV(t)

where d怈U,V怉 = Ļdt.

This model can be regarded as a shifted Heston model. The closed formulaof Heston model can refer the result of Lewis [2000] and Mikhailov and Nogel[2003a] in last section. So all we need to do is to change the variables to convertthe model to Heston model.

Let S(t) = S(t) +1āˆ’Ī²Ī² S(0). z(t) = Ļƒ2Ī²2z(t). Ī³ = ĻƒĪ²Ī³. Then we get a Heston model.

dS(t) = S(t)āˆš

z(t)dU(t) (4.20)

dz(t) = Īø(z(0)āˆ’ z(t))dt + Ī³āˆš

z(t)dV(t) (4.21)

where d怈U,V怉 = Ļdt, with S(0) = 1Ī²S(0).

Here we list the closed formula for Heston model. It bases on the Fouriertransform of characteristic function.

Page 67: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 56

Proposition 23. Suppose we have a Heston model.

dS(t) = S(t)āˆš

V(t)dW1

dV(t) = k(Īøāˆ’V(t))dt +Ī¾āˆš

V(t)dW2 (4.22)

d怈W1,W2怉t = Ļdt

Then the price of a European call option with maturity T and strike K is,

Call = S(0)P1(0)āˆ’KP2(0)

where

P j(t) = 1/2 +1Ļ€

āˆ«āˆž

0Re

eāˆ’iĻ† log(K) f j(x,V, t,T;Ļ†)iĻ†

dĻ†

f j(x,V, t,T;Ļ†) = eC j(Tāˆ’t;Ļ†)+D j(Tāˆ’t;Ļ†)V+iĻ†x

C j(Ļ„;Ļ†) =aĪ¾2

(b jāˆ’ĻĪ¾Ļ†i + d j)Ļ„āˆ’2log

1āˆ’ g jed jĻ„

1āˆ’ g j

D j(Ļ„;Ļ†) =b jāˆ’ĻĪ¾Ļ†i + d j

Ī¾2 Ā·1āˆ’ ed jĻ„

1āˆ’ g jed jĻ„

g j =b jāˆ’ĻĪ¾Ļ†i + d j

b jāˆ’ĻĪ¾Ļ†iāˆ’d j

d j = āˆ’āˆš

(ĻĪ¾Ļ†iāˆ’b j)2āˆ’Ī¾2(2u jĻ†iāˆ’Ļ†2)

with j = 1,2, u1 = 1/2, u2 = āˆ’1/2, a = kĪø, b1 = kāˆ’ĻĪ¾, b2 = k.

4.4.2 Piterbarg Dynamic model

Dynamic model with zero correlation

In this subsection we show the result of Piterbarg [2005]. His idea is to find astatic model to mimic the original dynamic model, in the objective of having thesame price for a European options. This technique is called parameter averaging.

Here two constraints are added on the dynamic model: mean reversion Īø isconstant and the correlationĻ is zero. The latter assumption is strong because thatcorrelation controls the form of volatility smile. How to release this constraint isthe main purpose of this article and will be presented in the following sections.

dS(t) = Ļƒ(t)(Ī²(t)S(t) + (1āˆ’Ī²(t))S(0))āˆš

z(t)dU(t) (4.23)

dz(t) = Īø(z(0)āˆ’ z(t))dt +Ī³(t)āˆš

z(t)dV(t) (4.24)

where d怈U,V怉 = 0.

Page 68: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 57

Piterbarg gave three approximate formula to convert Ī³(t), Ī²(t) and Ļƒ(t) to threeconstants, namely Ī·, b and Ī». After applying these three formulas, we have astatic model. The closed formula of a static model have already solved in section(4.4.1).

dS(t) = Ī»(bS(t) + (1āˆ’b)S(0))āˆš

z(t)dU(t) (4.25)

dz(t) = Īø(z(0)āˆ’ z(t))dt +Ī·āˆš

z(t)dV(t) (4.26)

where d怈U,V怉 = 0.

Parameter averaging formula To approximate the same price for European op-tions with maturity T, the parameters in model (4.25) (4.26) should satisfy thefollowing equations. Basically, we use Ī·, b and Ī» in the static model (4.25) (4.26)to replace Ī³(t), Ī²(t) and Ļƒ(t) in model (4.23) (4.24).

Proposition 24. (Piterbarg [2005]) For parameter Ī·,

Ī·2 =

āˆ« T0 Ī³

2(t)p(t)dtāˆ« T0 p(t)dt

with p(t) =āˆ« T

r dsāˆ« T

s dtĻƒ2(s)eāˆ’Īø(tāˆ’s)eāˆ’2Īø(sāˆ’r)

Proof. Ļƒ(t) is the determinist part of the variance; z(t)is the volatility part of thevariance on the other hand. It can change the curvature of the implied volatilitysmile. The curvature is

āˆ« T0 Ļƒ

2(t)z(t)dtāˆ’Ļƒ2(t)z(t); as the termĻƒ2(t)z(t) is constant,

therefore,āˆ« T

0 Ļƒ2(t)z(t)dt should be the same in both two models. This gives the

following equation:

E

(āˆ« T

0Ļƒ2(t)z(t)dt

)2

= E

(āˆ« T

0Ļƒ2(t)z(t)dt

)2

(4.27)

Calculation can be detailed in the following way. Fubiniā€™s theorem is employed.

Page 69: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 58

E

(āˆ« T

0Ļƒ(t)2z(t)dt

)2=E

[(āˆ« T

0Ļƒ(t)2z(t)dt

)(āˆ« T

0Ļƒ(s)2z(s)ds

)]=E

[āˆ« T

0

āˆ« T

0Ļƒ(t)2Ļƒ(s)2z(t)z(s)dsdt

]=2

āˆ« T

0

āˆ« t

0Ļƒ(t)2Ļƒ(s)2E [z(t)z(s)]dsdt (4.28)

=2āˆ« T

0

āˆ« t

0Ļƒ(t)2Ļƒ(s)2

(z0

āˆ« s

0eĪø(2rāˆ’sāˆ’t)Ī³(r)2dr + z2

0

)dsdt

=2z0

āˆ« T

0

āˆ« T

r

āˆ« T

sĻƒ(t)2Ļƒ(s)2eĪø(2rāˆ’sāˆ’t)Ī³(r)2dtdsdr + 2z2

0

āˆ« T

0

āˆ« t

0Ļƒ(t)2Ļƒ(s)2dsdt

=2z0

āˆ« T

0Ī³(r)2Ļ(r)dr + z2

0

āˆ« T

0

āˆ« T

0Ļƒ(t)2Ļƒ(s)2dsdt

Where, Ļ(r) =āˆ« T

r dsāˆ« T

s dtĻƒ(t)2Ļƒ(s)2eāˆ’Īø(tāˆ’s)eāˆ’2Īø(sāˆ’r)

We substitute this result to (4.27), and get 2z0āˆ« T

0 Ī³(r)2Ļ(r)dr = 2z0āˆ« T

0 Ī·2Ļ(r)dr

This gives that, Ī·2 =

āˆ« T0 Ī³(r)2Ļ(r)drāˆ« T

0 Ļ(r)dr

In the above equation we use the covariance of the z(s), which can be derivedin the following way.

To calculate the covariance of z(s), this follows:

dz(s) = Īø(z0āˆ’ z(s))ds +Ī³(s)āˆš

z(s)dV(s) , z(0) = z0

Let x(s) = eĪøs(z(s)āˆ’ z0).

So we can derive that x(0) = 0 ,E[x(s)] = 0 and z(s) = x(s)eāˆ’Īøs + z0.

Therefore,

dx(s) = ĪøeĪøs(z(s)āˆ’ z0)ds + eĪøsdz(s)

= eĪøsĪ³(s)āˆš

x(s)eāˆ’Īøs + z0dV(s)

So the covariance of E(x(s)x(t)), s 6 t is

Page 70: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 59

E(x(s)x(t)) = E

[āˆ« s

0eĪørĪ³(r)

āˆšx(r)eāˆ’Īør + z0dV(r)

āˆ« t

0eĪørĪ³(r)

āˆšx(r)eāˆ’Īør + z0dV(r)

]= E

[āˆ« s

0e2ĪørĪ³(r)2

(x(r)eāˆ’Īør + z0

)dr

]= z0

āˆ« s

0e2ĪørĪ³(r)2dr

Now we substitute x(s) = eĪøs(z(s)āˆ’ z0)into the left-hand side of the formulaabove.

E[(

eĪøs(z(s)āˆ’ z0))(

eĪøt(z(t)āˆ’ z0))]

= z0

āˆ« s

0e2ĪørĪ³(r)2dr

E [(z(s)āˆ’ z0)(z(t)āˆ’ z0)] = z0eāˆ’Īø(s+t)āˆ« s

0e2ĪørĪ³(r)2dr

We note that,E[z(s)] = z0

So

E[z(s)z(t)] = z20 + z0eāˆ’Īø(s+t)

āˆ« s

0e2ĪørĪ³(r)2dr (4.29)

which proves the result.

Proposition 25. For parameter b,

b =

āˆ« T

0Ī²(t)w(t)dt

with w(t) =v2(t)Ļƒ2(t)āˆ« T

0 v2(t)Ļƒ2(t)dt

v2(t) = z20

āˆ« t

0Ļƒ2(s)ds + z0Ī·

2eāˆ’Īøtāˆ« t

0Ļƒ2(s)

eĪøsāˆ’ eāˆ’Īøs

2Īøds

Proposition 26. For parameter Ī», we need to solving an equation numerically.

E

[g(āˆ« T

0Ļƒ2(t)z(t)dt

)]= E

[g(Ī»2

āˆ« T

0z(t)dt

)]with g(x) = S0

b

(2N

(12 bāˆš

x)āˆ’1

)N(Ā·) is standard normal cumulative distribution function.

Proof. The proof of proposition 25 and 26 can be found in appendix A.1 andA.2.

Page 71: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 60

Remark An interesting fact is that the zero correlation condition is only usedin the proof of proposition 25. In another word, proposition 24 and 26 can workwith a non-zero correlation model, such as model (4.18) (4.19). So we can target toimprove this calculation by relaxing the need of the zero correlation assumption.

Remark 2 The formulas in Proposition 24, 25 and 26 can apply to both contin-uous parameters and piecewise constant parameters.

Dynamic model with constant skew parameter In this part, we follow the ideain section 4.4.1, and regard the square root SV model as a shifted Heston model.Since we know the closed formula for dynamic Heston model, all we have to dois to find a change of variables to connect these two models.

Suppose the skew parameter is constant; that is to say Ī²(t) ā‰” b in model (4.18)(4.19). We re-write square root SV model here.

dS(t) = Ļƒ(t)(bS(t) + (1āˆ’ b)S(0))āˆš

z(t)dU(t) (4.30)

dz(t) = Īø(t)(z(0)āˆ’ z(t))dt +Ī³(t)āˆš

z(t)dV(t) (4.31)

where d怈U,V怉 = Ļ(t)dt.

Then we take the change of variables in section 4.4.1, saying S(t) = S(t)+ 1āˆ’bb S(0).

The model can change to a Heston model.

dS(t) = Ļƒ(t)S(t)bāˆš

z(t)dU(t) (4.32)

dz(t) = Īø(t)(z(0)āˆ’ z(t))dt +Ī³(t)āˆš

z(t)dV(t) (4.33)

where d怈U,V怉 = Ļ(t)dt. S(0) =S(0)

b .

The hypothesis of constant Ī²(t) is necessary because it keeps the continuity ofS(t). Otherwise we cannot use the result of Heston model on S(t). The followingproposition shows a closed formula for Heston model.

Proposition 27. The price for a European call option with maturity T, strike K undermodel (4.32) (4.33) is given by the following equation.

V = S(0)P1(0)āˆ’ KP2(0)

with K = K + 1āˆ’bb S(0)

Page 72: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 61

Where, for j = 1,2, note u1 = 1/2, u2 = āˆ’1/2, a = Īø(t)z(0), b1 = Īø(t)āˆ’Ļ(t)Ī³(t)Ļƒ(t)b,b2 = Īø(t)

P j(t) = 1/2 +1Ļ€

āˆ«āˆž

0Re

eāˆ’iĻ†ln(K) f j(x,V, t,T;Ļ†)iĻ†

dĻ†

f j(x,V, t,T;Ļ†) = eC j(Tāˆ’t;Ļ†)+D j(Tāˆ’t;Ļ†)V+iĻ†x

With

C j(Ļ„;Ļ†) =a

Ī³2(Tāˆ’Ļ„)

(b jāˆ’Ļ(Tāˆ’Ļ„)Ī³(Tāˆ’Ļ„)Ļƒ(Tāˆ’Ļ„)bĻ†i + d j)Ļ„āˆ’2log

1āˆ’ g jed jĻ„

1āˆ’ g j

D j(Ļ„;Ļ†) =b jāˆ’Ļ(Tāˆ’Ļ„)Ī³(Tāˆ’Ļ„)Ļƒ(Tāˆ’Ļ„)bĻ†i + d j

Ī³2(Tāˆ’Ļ„)Ā·

1āˆ’ ed jĻ„

1āˆ’ g jed jĻ„

g j =b jāˆ’Ļ(Tāˆ’Ļ„)Ī³(Tāˆ’Ļ„)Ļƒ(Tāˆ’Ļ„)bĻ†i + d j

b jāˆ’ĻĪ³Ļ†Ļƒ(Tāˆ’Ļ„)Ī²iāˆ’d j

d j = āˆ’

āˆš(ĻĪ³Ļƒ(Tāˆ’Ļ„)bĻ†iāˆ’b j

)2āˆ’Ī³2

(2u jĻƒ2(Tāˆ’Ļ„)b2Ļ†iāˆ’Ļƒ2(Tāˆ’Ļ„)b2Ļ†2

)Note that i =

āˆšāˆ’1 is the unit of pure imaginary number.

Proof. The proof is very similar to the proof in Mikhailov and Nogel [2003a] ondynamic Heston model.

Remark The proposition can apply to both continuous parameters and piece-wise constant parameters.

Dynamic model without constraint

We rewrite the model here.

dS(t) = Ļƒ(t)(Ī²(t)S(t) + (1āˆ’Ī²(t))S(0))āˆš

z(t)dU(t) (4.34)

dz(t) = Īø(t)(z(0)āˆ’ z(t))dt +Ī³(t)āˆš

z(t)dV(t) (4.35)

where d怈U,V怉 = Ļ(t)dt.

First we analyze why we introduce the constraints in previous sections. Insection 4.4.2, we use the condition of zero correlation in the proof of proposition24. More precisely, in equation (A.6) of the proof of proposition 25 we use thecondition of zero correlation to calculate v2(t). The definition of v2(t) is in (A.4).

Page 73: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 62

For the closed formula in section 4.4.2, we have a constraint of constant skewĪ²(t). If we first use parameter averaging technique on parameter Ī²(t), we canremove this constraint. However it introduces the constraint of zero correlation.We find the same problem as in the previous case.

In conclusion, both problems need a new way to average the parameter Ī²(t).More precisely, we need to calculate v2(t) in case of non-zero correlation. Themethod we propose here is to use the formula of integration by parts.

Proposition 28. Suppose all the parameters of the model are piecewise constant. Sup-pose we have n periods: 0 = t0 < t1 < t2 < Ā· Ā· Ā· < tn = T. The parameters are piecewisetime-dependent. When tiāˆ’1 < t < ti, we note Ļƒ(t) = Ļƒi, Ī³(t) = Ī³i, Ļ(t) = Ļi, Īø(t) = Īøi

,Ī²(t) = Ī²i, Bi = 4Ī³iĻƒiĻiāˆ’Īøi.Then the value of v2(t) in equation (A.4) with tiāˆ’1 < t 6 ti is given by:

v2(t) =v2(tiāˆ’1)e(t+1)Bi +(e(tāˆ’tiāˆ’1)Bi āˆ’1

)Ļƒ2

i z01Bi

(z0 +

12Īøi

Ī³2i

)+ z2

0Ļƒ2i Īøi

1Bi

(e(tāˆ’tiāˆ’1)BiBitiāˆ’1 + e(tāˆ’tiāˆ’1)Bi āˆ’Bitāˆ’1

)āˆ’

12Īøi(Bi + 2Īøi)

Ļƒ2i z0Ī³

2i

(e(tāˆ’tiāˆ’1)Biāˆ’2Īøitiāˆ’1 āˆ’ eāˆ’2Īøit

)with v2(t0) = 0

Proof. See the proof in appendix A.3.

Remark Obviously, the proposition can only apply to the case of piecewiseconstant parameters.

Piterbarg Dynamic Model with Ļ = 0 and constant k

In this case, we use effective medium values of the parameters and we changethe problem into the constant case.

The effective values for each parameter are computed in Piterbarg [2005] andPiterbarg [2003]. Note Ī¾ the effective volatility of volatility, Ī² the effective skew,and Ļƒ the effective volatility, the final constant model that replaces the dynamicmodel is:

dS(t) = Ļƒ[Ī²S(t) + (1āˆ’ Ī²)F(0)

] āˆšV(t)dW1

dV(t) = k(V(0)āˆ’V(t))dt + Ī¾āˆš

V(t)dW2

< dW1,dW2 > = 0

Page 74: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 63

The effective volatility of volatility Ī¾:

(Ī¾)2 =

āˆ« T0 Ī¾

2(s)g(s)dsāˆ« T0 g(s)ds

where g(.) satisfies:

g(t) =

āˆ« T

tds

āˆ« T

sdrĻƒ2(r)Ļƒ2(s)eāˆ’Īø(rāˆ’s)eāˆ’2Īø(sāˆ’t)

The effective skew Ī²:

Ī² =

āˆ« T

0Ī²(s)w(s)ds

where the weights w(.) is given by:

w(t) =Ī½2(t)Ļƒ2(t)āˆ« T

0 Ī½2(t)Ļƒ2(t)dt

Ī½2(t) = V20

āˆ« t

0Ļƒ2(s)ds + V0(Ī¾)2eāˆ’Īøt

āˆ« t

0Ļƒ2(s)

eĪøsāˆ’ eāˆ’Īøs

2Īøds

The effective volatility Ļƒ is the solution of:

Ļˆ0

(āˆ’

gā€(Ī¶)gā€²(Ī¶)

Ļƒ2)

= Ļˆ

(āˆ’

gā€(Ī¶)gā€²(Ī¶)

)where

Ī¶ = V0

āˆ« T

0Ļƒ2(t)dt

Ļˆ(Āµ) = E[exp

(āˆ’ĀµV(T)

)]Ļˆ0(Āµ) = E

[exp

(āˆ’Āµ

āˆ« T

0V(t)dt

)]

Piterbarg Dynamic Model with constant Ī²

In this case, by doing the following transformations:

L(t) = F(t) +1āˆ’Ī²Ī²

F(0)

Z(t) = Ļƒ2(t)Ī²2V(t)

we have:

Page 75: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 64

dL(t) = L(t)āˆš

Z(t)dW1

dZ(t) = k(t)(Īø(t)āˆ’Z(t))dt + Ī¾(t)āˆš

Z(t)dW2

< dW1,dW2 > = Ļ(t)dt

L(0) =F(0)Ī²

Z(0) = Ļƒ2(0)Ī²2V(0)

where:

Īø(t) = Ļƒ2(t)Ī²2V(0)

Ī¾(t) = Ī¾(t)Ļƒ(t)Ī²

Now as the parameters are piecewise constant, we use the method that weintroduced for Heston dynamic model to get the option price.

Note: In the case when Ī² is piecewise constant too, the tranformation: L(t) =

F(t) +1āˆ’Ī²Ī² F(0) introduces jumps for the forward at the end of each constant pa-

rameter period. So L(t) is not continuous. We canā€™t use exactly the same methodas we used in the Heston model.

4.4.3 Summary

Finally, let us summarize the result here. We showed two paths to find a closedformulas without constraints on parameters.

First path, we use proposition 24, 25 and 26 with replacing the calculationof v2(t) by proposition 28. As we convert the dynamic model to a static model,we can use proposition 23 to get a closed formula. In this case, the correlationis piecewise constant. This constraint may be relaxed by averaging a time-dependent correlation to a constant correlation. That is left for future research.

Second path, we use the calculation of Ī²(t) in proposition 23 to convert Ī²(t)to b, where the calculation of v2(t) is replaced by proposition 28. Then we useproposition 27 to get a closed formula. In this case, correlation and skew areboth time-dependent.

As we can see, proposition 28 enables us to avoid the constraint of zerocorrelation in both paths.

4.5 SABR model

4.5.1 Diffusion

SABR model is a stochastic volatility model which name stands for ā€œStochastic-Ī±-Ī²-Ļ modelā€. The model and its dynamic variation are first introduced by Hagan

Page 76: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 65

et al. [2002]. It is mainly used for forward interest rate and other asset classesincluding commodity. The diffusion of SABR model is defined as follows.

Definition 29 (SABR model). Denote Ft = F(t,T) the forward of underlying (i.e.commodity) at time T, observing at time t, and denote Ī±t the volatility of underlying atdate t, then SABR model is given by:

dFt = Ī±tFtĪ²dW1 (4.36)

dĪ±t = Ī³Ī±tdW2 (4.37)

Where W1 and W2 are two standard Brownian motion with correlation Ļ. Skewparameter Ī² and vol of vol parameter Ī³ are constant.

The parameters in this model are

1. Ī±(0) - the initial value of volatility at time t = 0,

2. Ī² - skew parameter. When Ī² = 0, the model becomes to normal; when Ī² = 1,the model is log-normal model,

3. Ī³ - vol of vol parameter. When Ī³ = 0, the model reduces to a CEV model.

4. Ļ - the correlation between the two Brownian processes W1 and W2.

Remark By using forward Ft instead of spot price at time t, the model alreadyincludes the information of convenience yield and discounting term in forward.Therefore, there is no drift term in the stochastic diffusion. Moreover, we haveFT = F(T,T) = ST the forward at time T observed at time T, which is obviouslythe spot price ST at time T.

If we let parameters depend on time, namely Ī² to Ī²(t), Ī³ to Ī³(t) and Ļ to Ļ(t),then we extend SABR model to dynamic SABR model.

Definition 30 (Dynamic SABR model). Denote Ft = F(t,T) the forward of underlying(i.e. commodity) at time T, observing at time t, and denote Ī±t the volatility of underlyingat date t, then dynamic SABR model is given by:

dFt = Ī±tFtĪ²(t)dW1 (4.38)

dĪ±t = Ī³(t)Ī±tdW2 (4.39)

Where W1 and W2 are two standard Brownian motion with correlation Ļ(t). Skewparameter Ī²(t) and vol of vol parameter Ī³(t) are time-dependent.

Page 77: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 66

4.5.2 Static SABR model

Proposition 31 (Vanilla option price under SABR model, Hagan et al. [2002]).Under SABR model, the price (at time t = 0) V of a vanilla call option with strike Kand maturity T is defined as V = D(0,T)E[(ST āˆ’K)+

|F0]. Here D(0,T) is the discountfactor, which equals to the value of 1 unit bond at time 0 with maturity at time T.Forward Ft follows the same definition in SABR model. F0 is the filtration of probabilityspace at time 0.

Then we can have the formula for V in form of Black Scholes formula:

V = D(0,T) (F0N(d1)āˆ’KN(d2)) (4.40)

where F0 = F(0,T) is the forward observed at time 0 with maturity T. N(x) =1

2Ļ€

āˆ« xāˆ’āˆž

eāˆ’t2/2dt is the cumulative distribution function of standard normal distribution.

d1,2 = log(F0/K)Ā± 12ĻƒBS

2TĻƒBSāˆš

T. Implied Black Scholes volatility ĻƒBS is given by

ĻƒBS =Ī±

(F0K)(1āˆ’Ī²)/2Ā·

1

1 +(1āˆ’Ī²)2

24 ln2 F0/K +(1āˆ’Ī²)4

1920 ln4 F0/K + ...Ā·Ī¶

x(Ī¶)

Ā·

1 +

[(1āˆ’Ī²)2Ī±2

24(F0K)1āˆ’Ī²+

ĻĪ±Ī³Ī²

4(F0K)(1āˆ’Ī²)/2+

2āˆ’3Ļ2

24Ī³2

]T + ...

(4.41)

where

Ī¶ =Ī³

Ī±

āˆšF0K log(F0/K) (4.42)

x(Ī¶) = log

āˆš

1āˆ’2ĻĪ¶+Ī¶2āˆ’Ļ+Ī¶

1āˆ’Ļ

(4.43)

Remark The pricing of SABR model with constant parameters is well presentedby Hagan et al. [2002]. In practise, Benhamou and Croissant [2007] use Taylorseries to get a more precise implied volatility when we are at the money. Closedformula for static SABR model is given in Hagan et al. [2002].

4.5.3 Dynamic SABR model

For dynamic model, Hagan et al. [2002] showed a closed formula with constantskew Ī²(t) ā‰” Ī². In Wang [2009], the parameter averaging technique is employed;it shows another closed formula with zero correlation Ļ(t) ā‰” 0.

We follow the same line with Wang [2009], applying the parameter averagingtechnique. We loosen the constraint of zero correlation Ļ(t) ā‰” 0 to the conditionof Ļ(t) ā‰” Ļ being constant. We target to convert the dynamic model in equation4.38 and 4.39 to a static model in the following form.

Page 78: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 67

dĪ±(t) = Ī½Ī±(t)dV(t)

dS(t) = Ī±(t)S(t)bdU(t)

d怈U,V怉t = Ļdt

For parameters Ī³(t) of the dynamic model, the calculation doesnā€™t need thezero correlation condition in Wang [2009]; therefore we take directly the resultfrom the paper and we donā€™t list the result in this thesis. Here we focus on thecalculation of Ī²(t).

Proposition 32. Suppose all the parameters of the model are piecewise constant. Sup-pose we have n periods: 0 = t0 < t1 < t2 < Ā· Ā· Ā· < tn = T. The parameters are piecewisetime-dependent. When tiāˆ’1 < t < ti, we note Ī³(t) = Ī³i, Ļ(t) = Ļi, Ī²(t) = Ī²i, Bi = 4Ī³iĻƒiĻi.For parameter b,

b =

āˆ« T

0Ī²(t)w(t)dt

with w(t) =v2(t)Ļƒ2(t)āˆ« T

0 v2(t)Ļƒ2(t)dt

v2(t) = v2(tiāˆ’1)e(t+1)Bi +z0

4Ļi

(1

4Ļi+ĻƒiĪ³it +

Ļƒiz0

Ī³i

)(eBi(tāˆ’tiāˆ’1)

āˆ’1)

Proof. We use the same calculation with the proof of proposition 28. The onlydifference is that the volatility process (equation 4.37) of SABR has no drift.Therefore, we take the limit of the result in proposition 28 when drift term Īø(t)tends to zero. Then we get the result of proposition 32.

Remark With proposition 32, we convert the dynamic model to a static SABRmodel. Then we can use the closed formula for a static SABR model.

In proposition 32, correlation is actually piecewise constant. It means thatthe constraint of correlation being constant can be removed if we can find a wayto averaging a piecewise constant correlation. That is left for future research.

Vol of vol expansion for SABR with constant Ī²

For a SABR model with constant Ī², we can use the effective medium parametermethod. Firstly, the model is:

dF(t) = Ī±(t)F(t)Ī²dW1

dĪ±(t) = Ī³(t)Ī±(t)dW2

< dW1,dW2 >= Ļ(t)dt

as in the constant case, we will rewrite Ī±āˆ’ā†’ ĪµĪ± and Ī³āˆ’ā†’ ĪµĪ³, with Īµ 1 which isthe distinguished limit. And we get the answers in terms of the original variables

Page 79: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 68

by replacing ĪµĪ± āˆ’ā†’ Ī± and ĪµĪ³ āˆ’ā†’ Ī³ after obtaining the results.

Let V(t, f ,Ī±) be the value of a European call option at date t, with the economyin state F(t) = f , Ī±(t) = Ī±. Let also T be the optionā€™s expiry, and K be its strike.

As in the constant parameter case, by defining the probability density p(t, f ,Ī±;T,F,A):

p(t, f ,Ī±;T,F,A)dFdA = probF < F(T) < F + dF,A < Ī±(T) < A + dA | F(t) = f ,Ī±(t) = Ī±

And

P(t, f ,Ī±;T,K) =

āˆ«āˆž

āˆ’āˆž

A2p(t, f ,Ī±;T,F,A)dA

we have:

V(t, f ,Ī±) = [ f āˆ’K]+ +12Īµ2K2Ī²

āˆ« T

tP(t, f ,Ī±;sā€²,K)dsā€²

where P(t, f ,Ī±;T,K) satisfies the forward problem:

Pt +12Īµ2

Ī±2C2( f )P f f + 2Ī·(t)Ī±2C( f )P fĪ±+ v2(t)Ī±2PĪ±Ī±

= 0 for t < T

P = Ī±2Ī“( f āˆ’K) for t = T

where

Ī·(t) = Ļ(t)Ī³(t)

v(t) = Ī³(t)

We solve this problem by using an effective media strategy. Our objectiveis to determine constant values Ī· and v yield the same option price as the timedependent coefficients Ī·(t) and v(t). Then the problem is reduced to the non-dynamic SABR model solved before.

Noting C( f ) = f Ī², and C(0) = KĪ². By defining the variate z:

z =1ĪµĪ±

āˆ« f

K

d f ā€²

C( f ā€²)

we have the probability density P is Gaussian in z.

We carry out this strategy to resolve the near-Hamiltonian system by applyingthe same series of time-independent transformations L that was used to solvethe non-dynamic SABR model before. The transformations here are in terms ofthe constants Ī· and v to be determined later.

We name L all the transformations used:

Page 80: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 69

1. z = 1ĪµĪ±

āˆ« fK

d f ā€²

C( f ā€²) ;

2. P = Ī±ĪµKāˆ’Ī²P;

3. P =

āˆšf Ī²

KĪ²H;

4. H = eĪµ2ĻĪ³b1z2/4H;

5. x = 1ĪµĪ³

āˆ« ĪµĪ³z0

dĪ¾āˆš

1āˆ’2ĻĪ¾+Ī¾2= 1ĪµĪ³ log

( āˆš1āˆ’2ĪµĻĪ³z+Īµ2Ī³2z2āˆ’Ļ1+ĪµĪ³z

1āˆ’Ļ

);

6. H =(1āˆ’2ĪµĻĪ³z +Īµ2Ī³2z2

)1/4Q;

Then, the value V(t, f ,Ī±) can be written as:

V = [ f āˆ’K]+ +12ĪµĪ±

āˆšKĪ² f Ī²I1/2(Īµvz)e1/4Īµ2Ī±b1Ī“z2

āˆ« Tāˆ’t

0Q(Ļ„,x)dĻ„ (4.44)

where I(Ī¶) =āˆš

1āˆ’2ĻĪ¶+Ī¶2, fav =āˆš

f K, b1 =Ī²fav

, b2 =Ī²(Ī²āˆ’1)

f 2av

.Q satisfies:

Qt +12

Qxx = Īµ[(Ī·āˆ’ Ī·)xQxx] +Īµ2[12

(v2āˆ’ v2āˆ’3Ī·(Ī·āˆ’ Ī·)

)x2Qxx

+12Ī±b1(Ī·āˆ’Ī“)(xQxāˆ’Q)āˆ’

34Ī±b1Ī“Qāˆ’ v2

(14

Iā€Iāˆ’18

Iā€²I)Qāˆ’Ī±2

(14

b2āˆ’38

b21

)Q]

for t < T

Q = Ī“(x) for t = T

Using perturbation expansion, we expand Q(t,x,T) as:

Q(t,x,T) = Q(0)(t,x,T) +ĪµQ(1)(t,x,T) +Īµ2Q(2)(t,x,T) + ...

Then we get the equations that satisfy Q0, Q1 and Q2:Q(0)t +

12

Q(0)xx = 0 for t < T

Q(0) = Ī“(x) for t = TQ(1)t +

12

Q(1)xx = (Ī·āˆ’ Ī·)xQ(0)

xx for t < T

Q(1) = 0 for t = T

Q(2)

t +12

Q(2)xx =

12

(v2āˆ’ v2āˆ’3Ī·(Ī·āˆ’ Ī·)

)x2Q(0)

xx +12Ī±b1(Ī·āˆ’Ī“)

(xQ(0)

x āˆ’Q(0))āˆ’

34Ī±b1Ī“Q(0)

āˆ’ v2(14

Iā€Iāˆ’18

Iā€²I)Q(0)āˆ’Ī±2

(14

b2āˆ’38

b21

)Q(0) + (Ī·āˆ’ Ī·)xQ(1)

xx for t < T

Q(2) = 0 for t = T

Page 81: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 70

By resolving these standard diffusion equations, we set some parts to 0 and findvalues for Ī·, v, Ī“, Ļ„:

Ī· =

āˆ« Tt (Tāˆ’ s)Ī·(s)ds

12 (Tāˆ’ t)2

Ī“ =Ī·

v2 =1

13 (Tāˆ’ t)3

āˆ« T

t(Tāˆ’ s)2v2(s)dsāˆ’3Ī·

āˆ« T

t(Tāˆ’ s)2[Ī·(s)āˆ’ Ī·]ds

āˆ’6āˆ« T

t

āˆ« s1

t(s1āˆ’ s2)[Ī·(s1)āˆ’ Ī·][Ī·(s2)āˆ’ Ī·]ds2ds1

Ļ„ =Tāˆ’ t +

āˆ« Tāˆ’t

0s[v2(s)āˆ’ v2]ds

Then the option price is identical to the static modelā€™s option price by makingthe identifications:

ĻĪ·v , Ī³ v, Ļ„ Ļ„

SABR: Dynamic Ī²

For a SABR model with a non-constant Ī², it is not easy to find out an effectivemedium value for Ī². But as we have seen during the search for the effectivemedium values, we have the formula to calculate the option price in the form ofan integration. The integrand is a function continuous on time. Inspired fromthis, we can compute the option price with Ī² depending on time. The model isgiven as:

dF(t) = Ī±(t)F(t)Ī²(t)dW1

dĪ±(t) = Ī³(t)Ī±(t)dW2

< dW1,dW2 >= Ļ(t)dt

We consider firstly the most simple case where we have just two periods:

Hypotheses: Suppose that all the parameters are piecewise, and we just havetwo periods: from 0 to T1 and from T1 to T2.

In the case of SABR model, we have parameters as

Ī±(t) = Ī±0 if t = 0

Ī²(t) =

Ī²1 if 0 6 t 6 T1

Ī²2 if T1 6 t 6 T2

Page 82: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 71

Ļ(t) =

Ļ1 if 0 6 t 6 T1

Ļ2 if T1 6 t 6 T2

Ī³(t) =

Ī³1 if 0 6 t 6 T1

Ī³2 if T1 6 t 6 T2

We define the probability density p and P as before, we can write the value as:

V(t, f ,Ī±) = [ f āˆ’K]+ +12Īµ2

āˆ« T2

0K2Ī²(T2āˆ’Ļ„)P(Ļ„, f ,Ī±)dĻ„

where P(Ļ„, f ,Ī±) satisfies the forward problem:

PĻ„ =12Īµ2

Ī±2 f 2Ī²(T2āˆ’Ļ„)P f f + 2Ļ(T2āˆ’Ļ„)Ī±2 f Ī²(T2āˆ’Ļ„)P fĪ±+Ī³2(T2āˆ’Ļ„)Ī±2PĪ±Ī±

for Ļ„ > 0

P = Ī±20Ī“( f āˆ’K) for Ļ„ = 0

As the parameters are piecewise constant, we rewrite the equation above asfollowing:

V(t, f ,Ī±) = [ f āˆ’K]+ +12Īµ2

āˆ« T2āˆ’T1

0K2Ī²2P(Ļ„, f ,Ī±)dĻ„+

12Īµ2

āˆ« T2

T2āˆ’T1

K2Ī²1P(Ļ„, f ,Ī±)dĻ„

and

PĻ„ =12Īµ2

Ī±2 f 2Ī²2P f f + 2Ļ2Ī±

2 f Ī²2P fĪ±+Ī³22Ī±

2PĪ±Ī±

for 0 < Ļ„ < T2āˆ’T1

PĻ„ =12Īµ2

Ī±2 f 2Ī²1P f f + 2Ļ1Ī±

2 f Ī²1P fĪ±+Ī³21Ī±

2PĪ±Ī±

for T2āˆ’T1 < Ļ„ < T2

P = Ī±20Ī“( f āˆ’K) for Ļ„ = 0

P is a function continuous on time t. The option price computing problem canbe reduced to:

V(t, f ,Ī±) = V(2) + V(1) (4.45)

V(2) = [ f āˆ’K]+ +12Īµ2

āˆ« T2āˆ’T1

0K2Ī²2P(2)(Ļ„, f ,Ī±)dĻ„

V(1) =12Īµ2

āˆ« T2

T2āˆ’T1

K2Ī²1P(1)(Ļ„, f ,Ī±)dĻ„

where

P(2)Ļ„ =

12Īµ2

Ī±2 f 2Ī²2P(2)

f f + 2Ļ2Ī±2 f Ī²2P(2)

fĪ±+Ī³22Ī±

2P(2)Ī±Ī±

for 0 < Ļ„ < T2āˆ’T1

P(2) = Ī±20Ī“( f āˆ’K) for Ļ„ = 0 (4.46)

and

P(1)Ļ„ =

12Īµ2

Ī±2 f 2Ī²1P(1)

f f + 2Ļ1Ī±2 f Ī²1P(1)

fĪ±+Ī³21Ī±

2P(1)Ī±Ī±

for T2āˆ’T1 < Ļ„ < T2

P(1) = P(2)(T2āˆ’T1) for Ļ„ = T2āˆ’T1 (4.47)

Page 83: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 72

Remark1: V(2) is the value of a portfolio in state F(0) = f , Ī±(0) = Ī±0 at date 0,with parameters of the second period and maturity equal to T2āˆ’T1.

P(2)(T2 āˆ’T1) can be got easily regarding to the constant case. By doing somesimplifications and transformations (see detail in Appendix A), we can writeV(1) in the form of:

V(1) = M āˆ—12ĪµĪ±0

āˆšKĪ²1 f Ī²1I1/2

(1) (ĪµĪ³1z)e1/4Īµ2Ļ1Ī³1Ī±0b(1)1 z2

āˆ« T2

T2āˆ’T1

Q(Ļ„,x)dĻ„

= M āˆ—N1(āˆ« T2āˆ’Ļ„0

0Q(Ļ„,x)dĻ„āˆ’

āˆ« T2āˆ’T1āˆ’Ļ„0

0Q(Ļ„,x)dĻ„

)= M āˆ— (V11

āˆ’V12) (4.48)

where

V11 = [ f āˆ’K]+ + N1āˆ« T2āˆ’Ļ„0

0Q(Ļ„,x)dĻ„ (4.49)

V12 = [ f āˆ’K]+ + N1āˆ« T2āˆ’T1āˆ’Ļ„0

0Q(Ļ„,x)dĻ„ (4.50)

and M is a constant with M =āˆš

2āˆš

2K2(Ī²1āˆ’Ī²2)+Īµ2 J(T2āˆ’T1)KĪ²1āˆ’Ī²2eĪµ

2(k(2)āˆ’k(1))(T2āˆ’T1)eĪµ

2k(1)Ļ„0 ,

N1 = 12ĪµĪ±0

āˆšKĪ²1 f Ī²1I1/2

(1) (ĪµĪ³1z)e1/4Īµ2Ļ1Ī³1Ī±0b(1)1 z2

.

Remark2: V11 is the value of a portfolio in state F(0) = f , Ī±(0) = Ī±0 at date 0,and with parameters of the first period and maturity equal to T2āˆ’Ļ„0.

Remark3: V12 is the value of a portfolio in state F(0) = f , Ī±(0) = Ī±0 at date 0,and with parameters of the first period and maturity equal to T2āˆ’T1āˆ’Ļ„0.

So the value of the option is obtained by:

V(t, f ,Ī±) = V2 + V1

= V2 + M āˆ— (V11āˆ’V12) (4.51)

4.5.4 Summary

In this section we discuss the Fourier formed closed formula for square root SVmodel. We reviewed the existing result on square root SV model with constraintof zero correlation. We figured out another path by changing of variables, andfound a closed formula with constraint of constant skew. The constraints inboth cases can be removed by employing the integration by parts on random

Page 84: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 73

variables. We show this result in proposition 28. The proposition helps twomentioned closed formulas relax the constraint.

An application on another SV model, SABR model, is also shown. It removesthe constraint of constant skew in the original paper of Hagan.

4.6 Vol of vol expansion

In order to calibrate stochastic volatility models, it is convenient to have an accu-rate and fast-computing analytical formula for call options. However, derivingsuch a formula is not always an easy task. For instance, as regards Heston model,the most popular technique involves numerical integration, which is necessarilytime-consuming. This article presents the volatility of volatility series expan-sion, a powerful technique for deriving fast-computing analytical formulae forEuropean options. The main idea is to apply perturbation method to the param-eter vol of vol, calculating the first and second order of the difference between astochastic volatility model and a Black Scholes model. In general case, we canreduce the integration of the exact formula to some simpler integration. For cer-tain models, for example Heston model, the new integrations can be express intoanalytic form. The result is significant: the time of calculation can be only 1% ofexact formula with optimization in integral; and the difference in implied vol isoften smaller than 0.05% in most cases except too short maturity. The method canbe applied successfully to several stochastic volatility models including Hestonmodel, GARCH model, SABR model, etc, to enhance the calibration and pricingroutines.

4.6.1 Framework

Consider the following two-factor stochastic volatility model

dS = (rāˆ’d)Sdt +ĻƒSdBt (4.52)

dV = b(V)dt +ĪµĪ½(V)dW (4.53)

dBdW = Ļ(V)dt (4.54)

where r is the short rate, d the dividend yield, Īµ a constant, and b(V) and Ī½(V) areindependent of Īµ. We assume parameters r and d to be constant for the sake ofsimplicity. The series expansion consists in writing the option price formula asa series in Īµ.

Fourier transform integration tells us that the call option price is given by

C(S,V,T) = Seāˆ’dTāˆ’

Keāˆ’rT

2Ļ€

āˆ« i/2+āˆž

i/2āˆ’āˆžexp(āˆ’iuX)

Ī¦(u,V,T)u(uāˆ’ i)

du, (4.55)

where X = log(S/K) + (rāˆ’d)T.

Page 85: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 74

Figure 4.6: Comparison of series A (expasion on price), series B (expansion onimplied volatility) and exact volatility

4.6.2 The expansion technique

We are looking for solutions which can be written as a power series of Īµ. Thus,we can obtain the power series of the call price using either 4.55 directly

C(u,V,T) = C(0)(u,V,T) +ĪµC(1)(u,V,T) +Īµ2C(2)(u,V,T)

or expanding first the implied volatility

Vimp(u,V,T) = V(0)imp(u,V,T) +ĪµV(1)

imp(u,V,T) +Īµ2V(2)imp(u,V,T)

and then plugging it into the option price with Black-Scholes formula.As a matter of fact, these two methods differ significantly. The former -

denoted by series A in the remainder of this article - gives the call price firstand implied volatility while the latter - denoted by series B - gives the impliedvolatility and call price is obtained afterwards. Nevertheless, in most cases,numerical difference is slight between the two series. However, regarding farout-of-the money options, the two series give different results as shown in figure4.6. Empirical evidence shows that series B is very often better than series A.Even though this is not a general rule, series B should be usually preferred overseries A.

Page 86: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 75

4.6.3 Deriving the series

After we have presented the expansion technique, we will now explicit the twoseries for the following model. In particular, this model encompasses the Hestonmodel for the special case Ļ† = 0.5.

dS = (rāˆ’d)Sdt +ĻƒSdBt (4.56)

dV = (Ļ‰āˆ’ĪøV)dt +ĪµVĻ†dW (4.57)

dBdW = Ļ(V)dt (4.58)

The expansion is first on the fundamental transform of the closed formula,which is presented by Ī¦(u,V,T) in equation 4.55. The idea is that we can expandthis function into simpler form, so that the integration in the equation 4.55 can bereduced to analytic form. We donā€™t jump into the detailed derivation. Interestedreaders can refer to the book of Lewis [2000]. Here we just list the result.

For series A, the expansion on price, is

C(S,V,Ļ„) = c(S,v,Ļ„) +ĪµĻ„āˆ’1J(1)R(1,1)cV(S,v,Ļ„)

+Īµ2Ļ„āˆ’1J2 +Ļ„āˆ’2J(3)R(2,0) +Ļ„āˆ’1J(4)R(1,2) +

Ļ„āˆ’2

2(J1)2R(2,2)

cV(S,v,Ļ„)

+ O(Īµ3)

For series B, the expansion on price, is

Vimp = v(S,v,Ļ„) +ĪµĻ„āˆ’1J(1)R(1,1)

+Īµ2Ļ„āˆ’1J2 +Ļ„āˆ’2J(3)R(2,0) +Ļ„āˆ’1J(4)R(1,2) +

Ļ„āˆ’2

2(J(1))2

[R(2,2)

āˆ’ (R(1,1))2R(2,0)]

+ O(Īµ3)

In the above formulae, term c(S,v,Ļ„) presents corresponding Black Scholesprice. When vol of vol Īµ = 0, the stochastic model reduces to a Black Scholesmodel. v here is the equivalent variance for Black Scholes, which is basically thethe integration of the variance from 0 to Ļ„.

The function R(p,q) and J(s) are the derivative ratios and integration respec-

Page 87: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 76

tively. Here list their expressions for practical use purpose.

R(2,0) = Ļ„

[12

X2

Y2 āˆ’1

2Zāˆ’

18

],

R(1,1) =[āˆ’

XZ

+12

],

R(1,2) =

[X2

Z2 āˆ’XZāˆ’

14Z

(4āˆ’Z)],

R(2,2) = Ļ„

1

2 X4

Z4

āˆ’12

X3

Z3 āˆ’3X2

Z3 +18

XZ2 (12 + Z) +

132

1Z2 (48āˆ’Z2)

with Z = VĻ„

And

J(1)(V,Ļ„) =Ļ

Īø

āˆ« Ļ„

0

(1āˆ’ eāˆ’Īø(Ļ„āˆ’s)

)[Ļ‰Īø

+ eāˆ’Īøs(Vāˆ’Ļ‰Īø

)]Ļ†+ 1

2ds,

J(2)(V,Ļ„) = 0

J(3)(V,Ļ„) =1

2Īø2

āˆ« Ļ„

0

(1āˆ’ eāˆ’Īø(Ļ„āˆ’s)

)2[Ļ‰Īø

+ eāˆ’Īøs(Vāˆ’Ļ‰Īø

)]2Ļ†

ds

J(4)(V,Ļ„) = (Ļ†+12

)āˆ« Ļ„

0

[Ļ‰Īø

+ eāˆ’Īø(Ļ„āˆ’s)(Vāˆ’Ļ‰Īø

)]Ļ†+ 1

2J(6)(V,Ļ„)ds

with J(6)(V,Ļ„)ds =

āˆ« Ļ„

0

(eāˆ’Īø(Ļ„āˆ’s)

āˆ’ eāˆ’Īøs)[Ļ‰Īø

+ eāˆ’Īø(Ļ„āˆ’u)(Vāˆ’Ļ‰Īø

)]Ļ†āˆ’ 1

2du

4.6.4 Example of vol of vol expansion: Heston model

In this section we will present an interesting example to show how the expansionof vol of vol works with Heston model. By the asymptotic expansion, we canfinally obtain an approximate analytic formula for the European call option. Thiswork comes from the result of Benhamou et al. [2009].

Suppose we take a Heston model,

dXt =āˆš

vtdWtāˆ’vt

2dt, X0 = x0, (4.59)

dvt = Īŗ(Īøtāˆ’vt)dt +Ī¾tāˆš

vtdBt, v0, (4.60)

d怈W,B怉t = Ļtdt,

Here we have adjusted to risk neutral probability; as consequence it intro-duces the drift term in spot process by change of probability.

To expand the model, we add Īµ in the model.

dXĪµt =

āˆšvĪµt dWtāˆ’

vĪµt2

dt, XĪµ0 = x0,

dvĪµt = Īŗ(Īøtāˆ’vĪµt )dt +ĪµĪ¾t

āˆšvĪµt dBt, vĪµ0 = v0, (4.61)

Page 88: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 77

Now we will expand the European call option price formula respect to Īµ.Note that, when Īµ = 0, we have a Black Scholes model; while Īµ = 1, we have aHeston model. We have already the closed formula of Black Scholes for Īµ = 0.We expand at Īµ = 0, and let Īµ = 1 to obtain the approximate formula. This canwrite in mathematics language as follows.

PHeston = PBS +E

[āˆ‚PBS

āˆ‚Īµ

]+

12E

[āˆ‚2PBS

āˆ‚Īµ2

]+E.

Another approximation we will take here is to simulate the partial derivativesin the equation above by the linear combination of the Greek letters of BlackScholes. The idea is that using the chain rule in derivative can make āˆ‚PBS

āˆ‚Īµ =āˆ‚PBSāˆ‚S

āˆ‚Sāˆ‚Īµ + āˆ‚PBS

āˆ‚Ļƒāˆ‚Ļƒāˆ‚Īµ . Same idea for the second derivative.

PHeston =PBS(x0,varT) +

2āˆ‘i=1

ai,Tāˆ‚i+1PBS

āˆ‚xiy(x0,varT) +

1āˆ‘i=0

b2i,Tāˆ‚2i+2PBS

āˆ‚x2iy2(x0,varT) +E,

(4.62)

We omit the lengthy derivation but just listing the result here. Readers canrefer to Benhamou et al. [2009] for proofs and intermediate derivation. Theparameters in the formula are:

varT =m0v0 + m1Īø, a1,T =ĻĪ¾(p0v0 + p1Īø),

a2,T =(ĻĪ¾)2(q0v0 + q1Īø), b0,T =Ī¾2(r0v0 + r1Īø).

varT =

āˆ« T

0v0,tdt,

Page 89: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 78

m0 =eāˆ’ĪŗT

(āˆ’1 + eĪŗT

)Īŗ

,

m1 =Tāˆ’eāˆ’ĪŗT

(āˆ’1 + eĪŗT

)Īŗ

,

p0 =eāˆ’ĪŗT

(āˆ’ĪŗT + eĪŗT

āˆ’1)

Īŗ2 ,

p1 =eāˆ’ĪŗT

(ĪŗT + eĪŗT(ĪŗTāˆ’2) + 2

)Īŗ2 ,

q0 =eāˆ’ĪŗT

(āˆ’ĪŗT(ĪŗT + 2) + 2eĪŗT

āˆ’2)

2Īŗ3 ,

q1 =eāˆ’ĪŗT

(2eĪŗT(ĪŗTāˆ’3) +ĪŗT(ĪŗT + 4) + 6

)2Īŗ3 ,

r0 =eāˆ’2ĪŗT

(āˆ’4eĪŗTĪŗT + 2e2ĪŗT

āˆ’2)

4Īŗ3 ,

r1 =eāˆ’2ĪŗT

(4eĪŗT(ĪŗT + 1) + e2ĪŗT(2ĪŗTāˆ’5) + 1

)4Īŗ3 .

The advantage is that there is no integration in the approximate formula. Sothat the calculation speed is much faster than that of the exact formula. We willsee this point in section numerical results.

The error in the approximation is estimated as E = O([Ī¾supāˆš

T]3 āˆš

T).

Numerical results

Benhamou et al. [2009] tested the approximate formula with strikes from 70%to 130% for short maturity, and 10% to 730% for long maturity. Implied Black-Scholes volatilities of the closed formula, of the approximation formula andrelated errors (in bp), expressed as a function of maturities in fractions of yearsand relative strikes. Parameters: Īø= 6%, Īŗ= 3, Ī¾= 30% and Ļ= 0%. Except shortmaturity with very small strike, where we observe the largest difference (18.01bp), the difference is less than 5 bp (1 bp = 0.01%) in almost all other cases. Forthe calculation speed, the approximate formula is about 100 times quicker thanthe exact formula (with the optimization in integral).

4.7 Calibration of Stochastic Volatility Model

4.7.1 Introduction

A calibration procedure is a procedure in which we use market data (usuallyprices) as input to determine the parameters of a financial model. We search

Page 90: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 79

for those parameters which enable the model to fit the best the input marketdata. Calibration can be time consuming, this issue depending on the size of theinput data, on the number of parameters to calibrate and also on the calibrationalgorithm itself.

Because a model parameter can be either time dependent or constant, wehave two categories of calibration methods: (1) the calibration procedure of timedependent parameters is called "bootstrap", while (2) the calibration procedureof the constant parameters is called "optimization".

Usually, the model we use for pricing contains one or several parameters.These parameters give the model some flexibility to better fit the characteristicof the market. Before using a model to price, we need to get the parameters fromthe market data. This procedure is often called calibration.

There are at least two methods to find the parameters. The quick way is toestimate manually the parameters. This method is good when the number ofparameters are small. For example, in Black-Scholes model Black and Scholes[1973], we need volatility. User of this model can estimate the parameter and thenuse the model with his estimation. But this method is inaccurate, and useless tothe models with many parameters. Stochastic volatility models usually have 4or more parameters. So we need to calibrate the models.

DescriptionGenerally, the calibration process is a process using certain algorithm to find

the minimum of the difference of the target function by changing the parameter(or parameter vector if there are more than one parameter).

Target functionUsually we use the option prices listed in the market as the target function. For

each given parameter vector, we need the option price calculated by the model.For a given financial instrument, there are different maturities. So we need

to minimize the āˆ‘all maturities

(pmodelāˆ’pmarket)2

where pmodel and pmarket are respectively the option price calculated by themodel and the option price listed in the market.

AlgorithmThe algorithm are usually the minimum finding algorithm.

Levenberg MarquadtL-BFGS-BNelder and Mead (Simplex)

Page 91: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 80

The calibration of stochastic volatility model is aWe need analytic formula to evaluate the price of options, in order to calibrate

the model.

The Calibration Scheme in General

To calibrate the static stochastic volatility models, we search for ways to com-pute the value of option with the corresponding model. Several methods areintroduced for the calibration of stochastic volatility models according to theirforms. For SABR model, by introducing the probability density, we compute thevalue of a call option by resolving the PDEs. For the square root model, twodifferent ways using Fourier Transform are used, one of which will be appliedin the dynamic case.

The calibration of stochastic volatility models require analytic solution of theoption price or implied volatility. We calibrate the models to implied volatilitythat we get from the market with an optimization algorithm "Levenberg Mar-quardt". Let T be the time to maturity with f wd the corresponding forward. LetK1, K2, ... KM be a set of strikes, and V1, V2, ... VM be the corresponding marketimplied volatiliy. The aim of the calibration is to minimize the least square error:

R =

Māˆ‘i=1

(Viāˆ’Vimi )2

where Vimi is the implied volatility corresponding to the model(calculated

with the Blackā€™s formula after having obtained the option price). So the speedof the calculation of the option price is very important for the speed of thecalibration.

By using methods to increase the convergency of the closed formulas and toresolve the discontinuities during the implementation, we donā€™t dispose muchdifficulty in the case of models with constant parameters. Based on these solu-tions, we try to find out solutions for time dependent models. Several approachesare used in the research. These methods are adapted to their process of spot (orforward) and volatility. The effective medium parameter method is to find outa mean value for each parameter. The effective medium parameter depends ontime and the values of all time dependent parameters in each period. By doingthis, we change a time dependent model into a constant parameter model wherewe can use the analytical formulas that we developed in the constant parame-ter case for the calibration. Another method is to compute the option price byresolving the PDEs and to use the final value of one constant parameter periodas the initial value of the following constant parameter period. According to theform of each model, we will choose the easiest method.

Page 92: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 81

Note: In the following, we work with "forward" instead of "spot" of theunderlying. So we are in the neutral risk environment. This will simplifyour calculus. The option price that we calculate is the expected value of thepayoff, which equals to the real option price divided by the discount factor. Thediscount factor is generally DF = eāˆ’rT where r is the constant interest rate and Tis the expiry.

Bootstrap

The so called "Bootstrap Technique" may be used for parameters which are timedependent, that is, deterministic functions depending on time. Below we explainin which manner the bootstrap technique is performed.

Consider a financial model M, having an unknown function parameter fdepending on time. Suppose we want to calibrate this function f . This meansthat we want to exactly say who f (t) is for any t belonging to [0,T]. As a basehypothesis, suppose that you are also knowing the shape of f ; typically, yourfunction will be piecewise constant, or piecewise linear, or quadratic.

Recall that we have to recover f (t) over the time interval [0,T], and supposethat we get on the market n prices p1, . . . , pn of financial instruments Ļ€1, . . . , Ļ€n:

p1 = Market price(Ļ€1;0, t1),

p2 = Market price(Ļ€2;0, t2),

. . .

pi = Market price(Ļ€i;0, ti),

. . .

pn = Market price(Ļ€n;0, tn),

where each price pi is computed at time t = 0 for the maturity ti, and we have:

0 < t1 < t2 < Ā· Ā· Ā· < tn = T.

Because we know the interpolation type of f , to compute f (t) over the timeinterval [0,T], it suffices to compute f at the points

t1, t2, . . . , tN,

as f is completely defined by the vector of couples:

t1, f (t1)

t2, f (t2)

. . .

ti, f (ti)

. . .

tn, f (tn).

Page 93: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 82

Indeed, if f1 = f (t1), f2 = f (t2), . . . , fn = f (tn) are known, to compute f (t), for t in[0,T], we have only to find the index k, such that : t belongs to [tk, tk+1]; once wehave k, we will compute f (t) interpolating f between tk and tk+1.

We explain now how we can compute f1, f2, . . . , fn. This will be done itera-tively, solving numerically n equations; first we compute f1; then, using that f1is known, we compute f2; then, using that f1 and f2 are known, we compute f3;and so on; at the end, using that f1, f2, . . . , fnāˆ’1 are known, we compute fn.

Step 1 To compute f over the first interval [0, t1], we have to use the price p1;as this information does not suffice to recover f (t), for any t āˆˆ [0, t1], wesuppose in addition that f is constant over this interval: f (t) ā‰” f1, for anyt āˆˆ [0, t1].

Then, on [0, t1], f is defined by the 2-dimensional vector

[0, f1, t1, f1],

which will be known, once f1 will be known. Denote by f (t1, f1) thefunction f , constructed with the vector above. We will compute f1 solvingthe following equation in y:

M(Ļ€1; f (t1, y)) = p1,

whereM(Ļ€1; f (t1, y)) is the price of the financial instrument Ļ€1, computedwith the modelM, which uses the function f (t1, y). The equation above issolved using Newton Raphson algorithm.

Step 2 We will use now that f is already constructed over [0, t1], and we willcompute it over (t1, t2]. Our goal function f is defined by the 3-dimensionalvector

[0, f1, t1, f1, t2, f2 = f (t2)],

where f1 is already know, and we have only to compute f2.

This will be done solving the following equation in y:

M(Ļ€2; f (t2, y)) = p2,

whereM(Ļ€2; f (t2, y)) is the price of the financial instrument Ļ€2, computedwith the modelM, which uses the function f (t2, y). The above equation iswell defined because we know to computeM(Ļ€2; f (t2, y)) using: (1), thatwe have already computed the function on the interval [0, t1], and, (2), thatf has a known interpolation type.

Step 3 to n Are analogous to Step 2.

In this way we iteratively compute f (t1), f (t2), Ā·, f (tn).

Page 94: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 83

Optimization

Consider a financial modelM, depending on m unknown parameters

Ī±1, . . . , Ī±m,

where each Ī±i is a constant real number: Ī±i āˆˆ R. Suppose we want to calibratethese numbers and that we get on the market n prices p1, . . . , pn of financialinstruments Ļ€1, . . . , Ļ€n:

p1 = Market price(Ļ€1),

p2 = Market price(Ļ€2),

. . .

pi = Market price(Ļ€i),

. . .

pn = Market price(Ļ€n).

We consider the following Least Square Problem: find the numbers x1, . . . ,xm

that minimize the following function F:

F(x1, . . . , xm) =

nāˆ‘i=1

[M(Ļ€i;x1, . . . , xm)āˆ’pi

]2 ,

whereM(Ļ€i;x1, . . . , xm) is the price of the financial instrument Ļ€i, computed withthe modelM, which uses the parameters x1, . . . , xm. We suppose that m ā‰¤ n.

To resolve this Least Square Problem, we can employ the following numericalmethods

(1) Levenberg-Marquardt algorithm.

(2) Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm.

Find details on these numerical methods in following subsection.

4.7.2 Calibration order

When we calibrate a model its parameters are calibrated one by one, but thereare two possible calibration procedures:

(1) Successive calibration.

(2) Nested calibration.

These calibration procedures use essentially an optimization algorithm tocompute (xāˆ—1, ...,x

āˆ—m), which is the optimal value for the parameter x = (x1, ...,xm):

(xāˆ—1, ...,xāˆ—

m) = Calibratex = (x1, ...,xm)|(y1, ..., yn),K

using the known values (y1, ..., yn) and performing at most K steps.

Page 95: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 84

Successive calibration

This method calibrates the parameters independently: each parameter is cali-brated independently with respect to the other parameters.

It is the end user who chooses the calibration order in the parameter vector.Suppose that you have to calibrate tree parameters called m, v, w. If you want tocalibrate first v, then m, and then w you have to construct the vector: (v, m, w).Writing (v, w, m) means that v will be calibrated first, then w and then m.

To explain the successive calibration procedure, let us suppose the model tocalibrate has only two parameters: (a1,a2), having the initial value (a0

1,a02). The

successive calibration procedure has only 2 steps and is as follows:

(1) Compute aāˆ—1, that is the optimum value of the first parameter:

Calibrate(a1|a0

2,K).

Calibrate a1 using the value a02 for the second parameter. a0

1 is used as thestarting point of the algorithm.

(2) Compute aāˆ—2, that is the optimum value of the second parameter:

Calibrate(a2|aāˆ—1,K

)calibrate a2 using the value aāˆ—1 for the first parameter. Note that to computeaāˆ—2 we used aāˆ—1.

The maximum total number of steps of the 2-dimensional successive cali-bration procedure is K + K. When calibrating N parameters, the maximum totalnumber of steps is NƗK.

Example

Let consider that we want to calibrate a Hull-White 1 factor model to thefollowing portfolio of swaptions. We denote by f (start,end,Ļƒ,a) the price of anATM swaption with Hull-White model. Ļƒ can be a time-dependent parameter.

Initial volatility (Ļƒ0) and initial mean reversion are both set to (a0) are bothset to 1%. The prices of portfolio 1 are given by:

First, we start by resizing the volatility parameter to the size of the portfolio.The volatility is now a curve model parameter with size 10. Abscissas are equal to14-May-08,...,15-May-17 and ordinates are equal to Ļƒ0

1 = 1%,Ļƒ02 = 1%, ...,Ļƒ0

10 = 1%.

Then we bootstrap the volatility. We look for the value Ļƒāˆ—1 such that

f (14-May-08,14-May-18,Ļƒ1,a0) = 1.16186% (4.63)

Page 96: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 85

where Ļƒ1 is the volatility parameters with ordinates

Ļƒāˆ—1,Ļƒ02 = 1%, ...,Ļƒ0

10 = 1%. (4.64)

Once we have found Ļƒāˆ—1, we look for the value Ļƒāˆ—2. It is the value of Ļƒ2 such that

f (14-May-09,14-May-18,Ļƒ,a0) = 2.06796% (4.65)

where Ļƒ is the volatility parameters with ordinates

Ļƒāˆ—1,Ļƒāˆ—

2,Ļƒ03 = 1%, ...,Ļƒ0

10 = 1%. (4.66)

At the end of this process, we obtain the following volatility parameter Ļƒāˆ— :Once we have finished the bootstrap of volatility, we wonā€™t modify it any-

more. We will optimize the parameter aāˆ—. It is the mean reversion value thatminimizes

āˆ‘10i=1[ f (starti,endi,Ļƒāˆ—,a)āˆ’ f M(starti,endi)]2 where f M is the market price

of a swaption, starti are the start dates of the swaption portfolio and endi are theend dates.

The linked calibration algorithm is now finished and output parameters areĻƒāˆ— and aāˆ—.

Nested calibration

This method nests calibration loops; which means we go through each loop ofparameter 1 by finishing a full calibration on parameter 2 with the present valueof parameter 1. We also call this kind of calibration order as "linked".

To explain the nested calibration procedure, let us suppose the model tocalibrate has only two parameters: (a1,a2), having the initial value (a0

1,a02). The

nested calibration procedure has at most KƗK steps and is as follows:

Calibrate (a2 |Calibrate (a1|a2,K) ,K) ,

that is, compute aāˆ—2, doing at most K steps, and using at each step

(ai1)āˆ— = Calibrate

(ai

1|ai2,K

)the optimal value of ai

1; ai1 corresponds, at the ith step, to ai

2, that is the currentvalue of the second parameter.

The maximum total number of steps of the 2-dimensional nested calibrationprocedure is KƗK. When calibrating N parameters, the maximum total numberof steps is KN.

We take the same example as in the linked calibration.Let consider that wewant to calibrate a Hull-White 1 factor model to the following portfolio of swap-tions. We denote by f (start,end,Ļƒ,a) the price of an ATM swaption with Hull-White model. Ļƒ can be a time-dependent parameter.

Page 97: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 86

Initial volatility (Ļƒ0) and initial mean reversion are both set to (a0) are bothset to 1%. The prices of portfolio 1 are given by :

First, we start by resizing the volatility parameter to the size of the portfolio.The volatility is now a curve model parameter with size 10. Abscissas are equal to14-May-08,...,15-May-17 and ordinates are equal to Ļƒ0

1 = 1%,Ļƒ02 = 1%, ...,Ļƒ0

10 = 1%.

Then we bootstrap the volatility. We look for the value Ļƒāˆ—11 such that

f (14-May-08,14-May-18,Ļƒ1,a0) = 1.16186% (4.67)

where Ļƒ1 is the volatility parameters with ordinates

Ļƒāˆ—1,Ļƒ02 = 1%, ...,Ļƒ0

10 = 1%. (4.68)

Once we have found Ļƒāˆ—1, we look for the value Ļƒāˆ—2. It is the value of Ļƒ2 suchthat

f (14-May-09,14-May-18,Ļƒ,a0) = 2.06796% (4.69)

where Ļƒ is the volatility parameters with ordinates

Ļƒāˆ—1,Ļƒāˆ—

2,Ļƒ03 = 1%, ...,Ļƒ0

10 = 1%. (4.70)

At the end of this process, we obtain the following volatility parameter Ļƒāˆ—1 :Like in the successive calibration scheme, we will now optimize the param-

eter a. It is the mean reversion value that minimizesāˆ‘10

i=1[ f (starti,endi,Ļƒāˆ—,a)āˆ’f M(starti,endi)]2 where f M is the market price of a swaption, starti are the startdates of the swaption portfolio and endi are the end dates. The optimizationalgorithm returns that aāˆ—1 = 0.87%.

Contrary to the successive calibration, the nested calibration is not finishedyet. We will now redo the bootstrap process with a mean reversion equal to aāˆ—1.Thus, we get a bootstrapped volatility Ļƒāˆ—2. We iterate this process until we get agood calibration precision. This nested calibration generally finds better resultsthan the successive calibration but it can be much slower.

4.7.3 Numerical method

Newton Raphson

Newton Raphson is an efficient algorithm for finding approximations to the zeros(or roots) of a real-valued function. As such, it is an example of a root-findingalgorithm. It can also be used to find a minimum or maximum of such a function,by finding a zero in the functionā€™s first derivative.

The idea of the method is as follows: one starts with an initial guess whichis reasonably close to the true root, then the function is approximated by its

Page 98: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 87

tangent line, and one computes the x-intercept of this tangent line (which iseasily done with elementary algebra). This x-intercept will typically be a betterapproximation to the functionā€™s root than the original guess, and the method canbe iterated.

Suppose f : [a,b]ā†’ R is a differentiable function defined on the interval [a,b]with values in the real numbers R. The formula for converging on the root canbe easily derived. Suppose we have some current approximation xn. Then wecan derive the formula for a better approximation xn+1 by referring to the Fig 4.7.We know from the definition of the derivative at a given point that it is the slopeof a tangent at that point.

Figure 4.7: An illustration of one iteration of Newtonā€™s method (the functionf is shown in blue and the tangent line is in red). We see that xn+1 is a betterapproximation than xn for the root x of the function f .

That is

f ā€²(xn) =āˆ†yāˆ†x

=f (xn)āˆ’0xnāˆ’xn+1

=0āˆ’ f (xn)xn+1āˆ’xn

were f ā€² denotes the derivative of the function f. Then by simple algebra we canderive

xn+1 = xnāˆ’f (xn)f ā€²(xn)

.

Page 99: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 88

We start the process off with some arbitrary initial value x0. (The closer to thezero, the better. But, in the absence of any intuition about where the zero mightlie, a "guess and check" method might narrow the possibilities to a reasonablysmall interval by appealing to the intermediate value theorem.) The method willusually converge, provided this initial guess is close enough to the unknownzero, and that f ā€²(x0) , 0. Furthermore, for a zero of multiplicity 1, the conver-gence is at least quadratic in a neighborhood of the zero, which intuitively meansthat the number of correct digits roughly at least doubles in every step.

Levenberg Marquardt

Levenberg-Marquardt (LM) algorithm provides a numerical solution to the prob-lem of minimizing a function, generally nonlinear, over a space of parameters ofthe function.These minimization problems arise especially in least squares curvefitting and nonlinear programming.

Minimization algorithm We reformulate the Least Square Problem: find thevector p = (p1, . . . ,pm) that minimize the following function F:

F(p) =

nāˆ‘i=1

[fi(p)

]2 , (4.71)

where f :Rmā†’Rn, p 7ā†’ f (p) = ( f1(p), . . . , fn(p)) āˆˆRn, and f is of class C1.

Like any other numeric minimization algorithms, the LM algorithm is aniterative procedure. To start a minimization, the user has to provide an initialguess for the vector p. In many cases, an uninformed standard guess likep0 = (1,1, ...,1) will work fine; in other cases, the algorithm converges only if theinitial guess is somewhat close to the final solution.

At each iteration step, the vector p is replaced by a new estimate p + q. Todetermine q, the value f (p + q) is approximated by its linearization:

f (p + q) ā‰ˆ f (p) + Jq.

At a minimum of the sum of squares F in eq. (4.71), the gradient of F is zero.Differentiating F the square of the right hand side of the equation (4.71) andsetting it to zero leads to:

(JTJ)q = JT[yāˆ’ f (p)],

from which q can be obtained by inverting JTJ. The key hallmark of the LMalogorithm is to replace this equation by a ā€œdamped versionā€:

(JTJ +Ī»I)q = JT[yāˆ’ f (p)],

where I is the identity matrix, and

q = (JTJ +Ī»I)āˆ’1JT[yāˆ’ f (p)].

Page 100: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 89

The non-negative damping factor Ī» is adjusted at each iteration. If the reductionof F is rapid, then a smaller value can be used bringing the algorithm close tothe Gauss-Newton alogorithm. On the other hand, if an iteration gives insuf-ficient reduction in the residual, then Ī» can be increased, giving a step close tothe gradient descent direction. A similar damping factor appears in Tikhonovregularization, which is used to solve linear ill-posed problems.

The solution of our minimization problem is found when one of the followingconditions is satisfied:

(1) The calculated step length q falls below a predefined limit.

(2) The reduction of the squares sum from the latest parameter vector p, fallsbelow predefined limits.

In these cases, the iteration is stoped and the last parameter vector p is consideredto be the solution.

Advantages The advantage of this algorithm is that it introduces a dampingparameter Āµ, which influences both the direction and the size of the step. Thestep hāˆ— is defined by following formula:

(JTJ +ĀµI)hāˆ— = āˆ’JT f ,

where f is the objective function and J is the Jaccobian of f .Large damping parameter: When the damping parameter Āµ is large

hāˆ— ā‰ˆ āˆ’1Āµ

JT f ,

the algorithm takes the steepest descent direction. This is advantageous if thecurrent search is far from the solution.

Small damping parameter: When the damping parameter Āµ is small, thealgorithm becomes the Gauss-Newton algorithm, which is very advantageouswhen the current search is close to the solution. The damping parameter isupdated at each loop of the algorithm.

In short, LM algorithm is the combination of the Steepest Descent methodand the Gauss-Newton algorithm. As a consequence it converges more quicklythan these two algorithms.

Quasi-Newton (BFGS)

In mathematics, the Broyden-Fletcher-Goldfarb-Shanno (BFGS) method is amethod to solve an unconstrained nonlinear optimization problem.

The BFGS method is derived from the Newtonā€™s method in optimization, aclass of hill-climbing optimization techniques that seeks the stationary point of

Page 101: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 90

a function, where the gradient is 0. Newtonā€™s method assumes that the functioncan be locally approximated as a quadratic in the region around the optimum,and use the first and second derivatives to find the stationary point.

In Quasi-Newton methods the Hessian matrix of second derivatives of thefunction to be minimized does not need to be computed at any stage. TheHessian is updated by analyzing successive gradient vectors instead. Quasi-Newton methods are a generalization of the secant method to find the root of thefirst derivative for multidimensional problems. In multi-dimensions the secantequation is under-determined, and quasi-Newton methods differ in how theyconstrain the solution.

Rationale The search direction pk at stage k is given by the solution of theanalogue of the Newton equation

Bkpk = āˆ’āˆ‡ f (xk). (4.72)

A line search in the direction pk is then used to find the next point xk+1.Instead of requiring the full Hessian matrix at the point xk+1 to be computed

as Bk+1, the approximate Hessian at stage k is updated by the addition of twomatrices.

Bk+1 = Bk + Uk + Vk (4.73)

Both Uk and Vk are rank-one matrices but have different bases. The rank oneassumption here means that we may write

C = abT (4.74)

So equivalently, Uk and Vk construct a rank-two update matrix which is robustagainst the scale problem often suffered in the gradient descent searching.

(as in Broydenā€™s method, the multidimensional analogue of the secant method).The quasi-Newton condition imposed on this update is

Bk+1(xk+1āˆ’xk) = āˆ‡ f (xk+1)āˆ’āˆ‡ f (xk). (4.75)

Algorithm From an initial guess x0 and an approximate Hessian matrix B0 thefollowing steps are repeated until x converges to the solution.

1. Obtain sk by solving: Bksk = āˆ’āˆ‡ f (xk).

2. Perform a line search to find the optimal k in the direction found in the firststep, then update xk+1 = xk +Ī±ksk.

3. yk = āˆ‡ f (xk+1)āˆ’āˆ‡ f (xk).

4. Bk+1 = Bk +yky>ky>k skāˆ’

Bksk(Bksk)>

s>k Bksk.

Page 102: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 91

f (x) denotes the objective function to be minimized. Convergence can bechecked by observing the norm of the gradient,

āˆ£āˆ£āˆ£āˆ‡ f (xk)āˆ£āˆ£āˆ£. Practically, B0 can

be initialized with B0 = I, so that the first step will be equivalent to a gradientdescent, but further steps are more and more refined by Bk, the approximationto the Hessian.

The first step of the algorithm is carried out using an approximate inverseof the matrix Bk, which is usually obtained efficiently by applying the Sherman-Morrison formula to the fourth line of the algorithm, giving

Bāˆ’1k+1 = Bāˆ’1

k + (sks>k )(s>k yk + y>k Bāˆ’1k yk)/(s>k yk)2

āˆ’ (Bāˆ’1k yks>k + sky>k Bāˆ’1

k )/(s>k yk). (4.76)

Credible intervals or confidence intervals for the solution can be obtainedfrom the inverse of the final Hessian matrix.

Simplex algorithm

The algorithm begins with n+1 randomly sampled vectors, xi i āˆˆ [0,n], when wewant to calibrate n parameters in our model. Each component xi, j j āˆˆ [0,nāˆ’ 1]represents a parameter of our model. The aim is to find the best parametersto compute implied volatilities with our model. We minimize the followingfunction

f (x) =āˆ‘

(T,K)āˆˆĪ“

(Ļƒ(x,T,K)āˆ’ĻƒMarket(T,K))2

where Ī“ is the set of all the strikes and maturities in our market data and Ļƒ(x,T,K)is the implied volatility computed when we consider a model whose parametersare the components of x. When itā€™s possible, Ļƒ(x,T,K) is directly computed witha closed formula, otherwise we can get it with a closed formula for the price.

The Nelder-Mead algorithm (also known as the simplex algorithm) is usuallyused for multidimensional unconstrained optimization. It remains a famoussolution for at least two reasons : itā€™s one of the methods which do not requirederivatives and itā€™s considered as reliable when it comes to deal with noisyfunctions. If one has to find an optimal solution to a problem in n dimensions,this algorithm creates and modifies a polytope of n + 1 vertices. Then, one hasjust to compute objective function values on these vertices to know the nextmodification of the simplex : reflexion, expansion, contraction, shrinkage. Themain idea of these distorsions is to progress toward areas where the objectivefunction takes the best values.

First, we need to find n + 1 vertices to create a simplex. We use a first set ofparameters given by the user and the others are choosen randomly. Algorithmā€™send occures when one of these condition is satisfied :

Page 103: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 92

ā€¢ A vertex giving the good precision, x0, is found :

| f (x0)| < Īµ1

ā€¢ New vertices donā€™t modify objective function values anymore :āˆšāˆ‘ni=0

( f (xi)āˆ’Āµ)2

n+1 < Īµ2 where Āµ =āˆ‘n

i=0 f (xi)n+1

ā€¢ Time given to the algorithm has passed.

ā€¢ The execution has rised above iteration limit.

We usually choose :Ī± = 1 (reflexion parameter)Ī² = 2 (expansion parameter)Ī³ = 1

2 (contraction parameter)Ī“ = 1

2 (shrink parameter)This algorithm consists in the following pseudo code algorithm 1.

Modified simplex algorithm

Previously, we generated n + 1 vertices to create a simplex. However, itā€™s notenough to find the minimum weā€™re looking for and we compute more objectivefunction values to increase our hope to reach a global solution. Thatā€™s why webegin with the computation of 4n values so as to keep n interesting vertices.This is the first modification of the initial Nelder-Mead algorithm. Then, we alsohave to look for the minimum in every direction : the initial vertices mustnā€™tbe linearly dependent, this would prevent us from transforming the simplex ina part of the space (and from computing objective function values in this part).This step is also called degeneration test. After every initialization a test checks ifthe simplex is degenerated and resets it if need. The modified simplex algorithmconsists in algorithm 2.

Advantages / Drawbacks The simplex algorithm is really famous for manyreasons. First of all, it does not need derivatives and we can use it for noisy func-tions. Moreover, it does not compute many objective function values (indeed,one by iteration in general) so itā€™s not a time-consuming algorithm.

The biggest difficulty with this algorithm appears when we have to minimizemulti-modal functions. Itā€™s difficult to prevent the algorithm from finding a ā€™badā€™local minimum and giving it as a solution of the problem.

Differential evolution algorithm

The algorithm begins with p randomly sampled vectors to initialize the firstgeneration : xi,0 i āˆˆ [0,pāˆ’ 1]. Each component xi,0, j j āˆˆ [0,nāˆ’ 1] represents a

Page 104: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 93

if User hasnā€™t provided initial value of x1,x2, . . . ,xn+1 thenRandomly generate n + 1 vertices in n dimension: x1,x2, . . . ,xn+1 āˆˆR

N;endĪ± = 1 // reflexion parameterĪ² = 2 // expansion parameterĪ³ = 1

2 // contraction parameter

Ī“ = 12 // shrink parameter

while condition function is true doSort x1,x2, . . . ,xn+1 so that f (x1) < f (x2) < Ā· Ā· Ā· < f (xn+1);xcenterā†

1nāˆ‘n

k=1 xk;xrā† xcenter +Ī±(xcenterāˆ’xn+1);if f (xr) < f (x1) then

xeā† xcenter +Ī²(xrāˆ’xcenter);if f (xe) < f (xr) then

xn+1ā† xe;else

xn+1ā† xr;end

else if f (x1) ā‰¤ f (xr) < f (xn) thenxn+1ā† xr;

else if f (xn) ā‰¤ f (xr) < f (xn+1) thenxcoutā† xcenter +Ī³(xrāˆ’xcenter);if f (xcout) < f (xr) then

xn+1ā† xcout;else

xn+1ā† xr;Call shrink function;

endelse if f (xn+1) ā‰¤ f (xr) then

xcinā† xcenter +Ī³(xn+1āˆ’xcenter);if f (xcin) < f (xn+1) then

xn+1ā† xcin;else

Call shrink function;end

endend

Algorithm 1: Simplex

Page 105: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 94

begin condition functionif time limit reached or iteration limit reached then

return false;endif | f (x1)| < Īµ1 then

return false;endĀµā† 1

nāˆ‘n+1

i=1 f (xi);

ifāˆšāˆ‘n+1

i=1( f (xi)āˆ’Āµ)2

n+1 < Īµ2 thenreturn false;

endreturn true;

endFunction for simplex

begin shrink functionfor i = 2 to n + 1 do

xiā† x1 +Ī“(xiāˆ’x1);compute f (xi);

endend

Function for simplex

parameter of our model (we want to calibrate n parameters). One iteration startswith a generation xi,g i āˆˆ [0,pāˆ’1] and creates a new set of vectors xi,g+1 i āˆˆ [0,pāˆ’1].The aim is to find the best parameters to compute implied volatilities with ourmodel. We minimize the following function

f (x) =āˆ‘

(T,K)āˆˆĪ“

(Ļƒ(x,T,K)āˆ’ĻƒMarket(T,K))2

where Ī“ is the set of all the strikes and maturities in our market data and Ļƒ(x,T,K)is the implied volatility computed when we consider a model whose parametersare the components of x. When itā€™s possible, Ļƒ(x,T,K) is directly computed witha closed formula, otherwise we can get it with a closed formula for the price.

The differential evolution algorithm is a population-based optimizer. An initialpopulation is sampled in the domain and objective function values are computedto start the optimization. Then it produces a new generation of vectors withperturbations of the existing points. To generate a new population the scaleddifference of two randomly chosen vectors is added to a third vector. The nextstep consists in a recombination between this new population and the previous

Page 106: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 95

repeatif User hasnā€™t provided initial value of x1,x2, . . . ,xn+1 then

Randomly generate n + 1 vertices in n dimension:x1,x2, . . . ,xn+1 āˆˆR

N;endmā† 4n;Generate xn+2,xn+3, . . . ,xm āˆˆRN;Sort x1,x2, . . . ,xm so that f (x1) < f (x2) < Ā· Ā· Ā· < f (xm);

until The volume of polyhedron (x1,x2, . . . ,xn+1) > Īµ;Ī± = 1; // reflexion parameterĪ² = 2; // expansion parameterĪ³ = 1

2 ; // contraction parameterĪ“ = 1

2 ; // shrink parameterwhile condition function is true do

Sort x1,x2, . . . ,xn+1 so that f (x1) < f (x2) < Ā· Ā· Ā· < f (xn+1);xcenterā†

1nāˆ‘n

k=1 xk;xrā† xcenter +Ī±(xcenterāˆ’xn+1);if f (xr) < f (x1) then

xeā† xcenter +Ī²(xrāˆ’xcenter);if f (xe) < f (xr) then

xn+1ā† xe;else

xn+1ā† xr;end

else if f (x1) ā‰¤ f (xr) < f (xn) thenxn+1ā† xr;

else if f (xn) ā‰¤ f (xr) < f (xn+1) thenxcoutā† xcenter +Ī³(xrāˆ’xcenter);if f (xcout) < f (xr) then

xn+1ā† xcout;else

xn+1ā† xr;Call shrink function;

endelse if f (xn+1) ā‰¤ f (xr) then

xcinā† xcenter +Ī³(xn+1āˆ’xcenter);if f (xcin) < f (xn+1) then

xn+1ā† xcin;else

Call shrink function;end

endend

Algorithm 2: Modified simplex

Page 107: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 96

one. To achieve the algorithm, we compare objective function values betweenthe two sets of vectors to determine which vectors are part of the new generation.

Algorithmā€™s end occures when one of these condition is satisfied :

ā€¢ A vector giving the good precision, x0,g, is found :

| f (x0,g)| < Īµ1

ā€¢ New generations donā€™t modify objective function values anymore :āˆšāˆ‘ni=0

( f (xi,g)āˆ’Āµ)2

n+1 < Īµ2 where Āµ =āˆ‘n

i=0 f (xi,g)n+1

ā€¢ Time given to the algorithm has passed.

ā€¢ The execution has rised above iteration limit.

We use these parameters throughout the evolution of the optimization :

F = 0.5 (scale factor used in the mutation step)C = 0.5 (criterion used in the crossover step)The index g refers to the gth generation.

This algorithm consists in the following algorithm 3.

Advantages / Drawbacks An interesting advantage of this algorithm is theuse of difference vectors. The difference distribution adapts the algotithm tothe objective function landscape. When we have to cope with a multi-modalfunction, this distribution is also multi-modal and gives us the opportunity tolook for the minimum in each region and to move vectors from a region toanother. We have a better chance to find a globalm minimum (in comparisonwith the simplex algorithm for example).

However, we are not sure to be in a position to afford this opportunity. Themain difficulty is the time consumed in each iteration because many objectivefunction values have to be computed for each generation.

MGB algorithm

Call price approximation This algorithm uses an approximation of call pricesunder the Heston model. We can compute them in the constant case and with a

Page 108: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 97

if User hasnā€™t provided initial value of x1,x2, . . . ,xn thenRandomly generate n vertices in n dimension: x1,x2, . . . ,xn āˆˆRN;

endmā† 5n;Generate xn+1,xn+2, . . . ,xm āˆˆRN;gā† 0;for i = 1 to m do

xi,gā† xiendFā† 0.5; Cā† 0.5;while condition function is true do// Mutationfor i = 1 to m do

Randomly choose r0,r1,r2 from 1 to m with r0 , r1 , r2;ui,gā† xr0,g + F(xr1,gāˆ’xr2,g)

end// Crossoverfor i = 1 to m do

Randomly choose jr from 1 to nāˆ’1;for j = 0 to n do

randomly choose u āˆˆ [0,1];if u < C or j == jr then

j-th ordinate of vi,gā† j-th ordinate of ui,g, jelse

j-th ordinate of vi,gā† j-th ordinate of xi,g, jend

endend// Selectionfor i = 1 to m do

if f (vi,g) < f (xi,g) thenxi,g+1ā† vi,g

endxi,g+1ā† xi,g

endgā† g + 1

endAlgorithm 3: Differential evolution

Page 109: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 98

begin condition functionif time limit reached or iteration limit reached then

return false;endif | f (x1)| < Īµ1 then

return false;endĀµā† 1

māˆ‘m

i=1 f (xi);

ifāˆšāˆ‘m

i=1( f (xi)āˆ’Āµ)2

m < Īµ2 thenreturn false;

endreturn true;

endFunction for differential evolution

time dependent model. From now on we use the following model :

dS(t) = rtS(t)dt +āˆš

V(t)S(t)dW1,

dV(t) = Īŗ(Īøtāˆ’V(t))dt +Ī¾āˆš

V(t)dW2,

x0 = ln(S0),

v0 = V0.

Ļt is the correlation coefficient between W1 and W2. These formula are veryinteresting since they give us the opportunity to write a call price as an easilycomputable function of Ļ and Ī¾. This will be really useful for a new calibrationprocedure. In this subsection we will just focus on these analytical formula.

In the constant case, we can derive the following expression:

varT =m0v0 + m1Īø, a1,T =(p0v0 + p1Īø)āˆ‚2PBS

āˆ‚xy(x0,varT),

a2,T =(q0v0 + q1Īø)āˆ‚3PBS

āˆ‚x2y(x0,varT), b0,T =(r0v0 + r1Īø)

āˆ‚2PBS

āˆ‚y2 (x0,varT),

b2,T =(p0v0 + p1Īø)2

2āˆ‚4PBS

āˆ‚x2y2 (x0,varT).

Page 110: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 99

where

m0 =eāˆ’ĪŗT

(āˆ’1 + eĪŗT

)Īŗ

, m1 = Tāˆ’eāˆ’ĪŗT

(āˆ’1 + eĪŗT

)Īŗ

,

p0 =eāˆ’ĪŗT

(āˆ’ĪŗT + eĪŗT

āˆ’1)

Īŗ2 , p1 =eāˆ’ĪŗT

(ĪŗT + eĪŗT(ĪŗTāˆ’2) + 2

)Īŗ2 ,

q0 =eāˆ’ĪŗT

(āˆ’ĪŗT(ĪŗT + 2) + 2eĪŗT

āˆ’2)

2Īŗ3 , q1 =eāˆ’ĪŗT

(2eĪŗT(ĪŗTāˆ’3) +ĪŗT(ĪŗT + 4) + 6

)2Īŗ3 ,

r0 =eāˆ’2ĪŗT

(āˆ’4eĪŗTĪŗT + 2e2ĪŗT

āˆ’2)

4Īŗ3 , r1 =eāˆ’2ĪŗT

(4eĪŗT(ĪŗT + 1) + e2ĪŗT(2ĪŗTāˆ’5) + 1

)4Īŗ3 .

and PBS(x0,varT) is a call price under a Black-Scholes model with volatilityāˆš

varTT .

Then, the MGB formula is :

PMGB(x0,T) = PBS(x0,varT,T) + a1,TĻĪ¾+ (a2,T + b2,T)Ļ2Ī¾2 + b0,TĪ¾2

If we consider piecewise constant parameters, we write T0 = 0ā‰¤T1 ā‰¤ Ā· Ā· Ā· ā‰¤Tn =

T such thatĪø,Ļ,Ī¾ are constant on each interval ]Ti,Ti+1[ and are equal respectivelyto ĪøTi+1 ,ĻTi+1 ,Ī¾Ti+1 . The new coefficients used to derive our analytical formula are

a1,Ti+1 = a1,Ti + Ļ‰āˆ’ĪŗTi,Ti+1

Ļ‰1,Ti +ĻTi+1Ī¾Ti+1 f 1Īŗ,v0,Ti

(ĪøTi+1 ,Ti,Ti+1),

a2,Ti+1 = a2,Ti + Ļ‰āˆ’ĪŗTi,Ti+1

Ī±Ti +ĻTi+1Ī¾Ti+1Ļ‰0,āˆ’ĪŗTi,Ti+1

Ļ‰1,Ti + (ĻTi+1Ī¾Ti+1)2 f 2Īŗ,v0,Ti

(ĪøTi+1 ,Ti,Ti+1),

b0,Ti+1 = b0,Ti + Ļ‰āˆ’ĪŗTi,Ti+1

Ī²Ti + Ļ‰āˆ’Īŗ,āˆ’ĪŗTi,Ti+1

Ļ‰2,Ti +Ī¾2Ti+1

f 0Īŗ,v0,Ti

(ĪøTi+1 ,Ti,Ti+1),

Ī±Ti+1 = Ī±Ti +ĻTi+1Ī¾Ti+1(Ti+1āˆ’Ti)Ļ‰1,Ti +Ļ2Ti+1Ī¾2

Ti+1g1Īŗ,v0,Ti

(ĪøTi+1 ,Ti,Ti+1),

Ī²Ti+1 = Ī²Ti + Ļ‰āˆ’ĪŗTi,Ti+1

Ļ‰2,Ti +Ī¾2Ti+1

g2Īŗ,v0,Ti

(ĪøTi+1 ,Ti,Ti+1),

Ļ‰1,Ti+1 = Ļ‰1,Ti +ĻTi+1Ī¾Ti+1h1Īŗ,v0,Ti

(ĪøTi+1 ,Ti,Ti+1),

Ļ‰2,Ti+1 = Ļ‰2,Ti +Ī¾2Ti+1

h2Īŗ,v0,Ti

(ĪøTi+1 ,Ti,Ti+1),

v0,Ti+1 = eāˆ’Īŗ(Ti+1āˆ’Ti)(v0,Ti āˆ’ĪøTi+1) +ĪøTi+1 ,

wheref 0Īŗ,v0

(Īø, t,T) =eāˆ’2ĪŗT(e2Īŗt(Īøāˆ’2v0)+e2ĪŗT((āˆ’2Īŗt+2ĪŗTāˆ’5)Īø+2v0)+4eĪŗ(t+T)((āˆ’Īŗt+ĪŗT+1)Īø+Īŗ(tāˆ’T)v0))

4Īŗ3 ,

f 1Īŗ,v0

(Īø, t,T) =eāˆ’ĪŗT(eĪŗT((āˆ’Īŗt+ĪŗTāˆ’2)Īø+v0)āˆ’eĪŗt((Īŗtāˆ’ĪŗTāˆ’2)Īøāˆ’Īŗtv0+ĪŗTv0+v0))

Īŗ2 ,

f 2Īŗ,v0

(Īø, t,T) =eāˆ’Īŗ(t+3T)(2eĪŗ(t+3T)((Īŗ(Tāˆ’t)āˆ’3)Īø+v0)+e2Īŗ(t+T)((Īŗ(Īŗ(tāˆ’T)āˆ’4)(tāˆ’T)+6)Īøāˆ’(Īŗ(Īŗ(tāˆ’T)āˆ’2)(tāˆ’T)+2)v0))

2Īŗ3 ,

g1Īŗ,v0

(Īø, t,T) =2eĪŗTĪø+eĪŗt(Īŗ2(tāˆ’T)2v0āˆ’(Īŗ(Īŗ(tāˆ’T)āˆ’2)(tāˆ’T)+2)Īø)

2Īŗ2 ,

g2Īŗ,v0

(Īø, t,T) =eāˆ’ĪŗT(e2ĪŗTĪøāˆ’e2Īŗt(Īøāˆ’2v0)+2eĪŗ(t+T)(Īŗ(tāˆ’T)(Īøāˆ’v0)āˆ’v0))

2Īŗ2 ,

h1Īŗ,v0

(Īø, t,T) =eĪŗTĪø+eĪŗt((Īŗtāˆ’ĪŗTāˆ’1)Īø+Īŗ(Tāˆ’t)v0)

Īŗ ,

h2Īŗ,v0

(Īø, t,T) =(eĪŗtāˆ’eĪŗT)(eĪŗt(Īøāˆ’2v0)āˆ’eĪŗTĪø)

2Īŗ ,

and Ļ‰ut (T) = āˆ’etu+eTu

u , Ļ‰0,ut (T) =

eTu(āˆ’tu+Tuāˆ’1)+etu

u2 , Ļ‰u,ut (T) =

(etuāˆ’eTu)2

2u2 .

Page 111: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 100

Thanks to these expressions we can write PMGB(x0,Ti+1) as a function of ĻTi+1

and Ī¾Ti+1 with :

aā€²

1,Ti+1= f 1

Īŗ,v0,Ti(ĪøTi+1 ,Ti,Ti+1)

āˆ‚2PBS

āˆ‚xy(x0,varTi+1) + Ļ‰0,āˆ’Īŗ

Ti,Ti+1Ļ‰1,Ti

āˆ‚3PBS

āˆ‚x2y(x0,varTi+1)

+ (a1,Ti + Ļ‰āˆ’ĪŗTi,Ti+1

Ļ‰1,Ti)āˆ‚4PBS

āˆ‚x2y2 (x0,varTi+1),

aā€²

2,Ti+1= f 2

Īŗ,v0,Ti(ĪøTi+1 ,Ti,Ti+1)

āˆ‚3PBS

āˆ‚x2y(x0,varTi+1),

bā€²

0,Ti+1= f 0

Īŗ,v0,Ti(ĪøTi+1 ,Ti,Ti+1)

āˆ‚2PBS

āˆ‚y2 (x0,varTi+1),

bā€²

2,Ti+1=

12āˆ‚4PBS

āˆ‚x2y2 (x0,varTi+1),

aā€²

0,Ti+1=(a1,Ti + Ļ‰

āˆ’ĪŗTi,Ti+1

Ļ‰1,Ti)āˆ‚2PBS

āˆ‚xy(x0,varTi+1)

+ (a2,Ti + Ļ‰āˆ’ĪŗTi,Ti+1

Ī±Ti)āˆ‚3PBS

āˆ‚x2y(x0,varTi+1)

+ (b0,Ti + Ļ‰āˆ’ĪŗTi,Ti+1

Ī²Ti + Ļ‰āˆ’Īŗ,āˆ’ĪŗTi,Ti+1

Ļ‰2,Ti)āˆ‚2PBS

āˆ‚y2 (x0,varTi+1)

+(a1,Ti + Ļ‰

āˆ’ĪŗTi,Ti+1

Ļ‰1,Ti)2

2āˆ‚4PBS

āˆ‚x2y2 (x0,varTi+1).

PMGB(x0,Ti+1) =aā€²

0,Ti+1+ PBS(x0,varT,T) + a

ā€²

1,Ti+1ĻTi+1Ī¾Ti+1 + (a

ā€²

2,Ti+1+ b

ā€²

2,Ti+1)ĻTi+1

2Ī¾Ti+12

+ bā€²

0,Ti+1Ī¾Ti+1

2

From now on, for each maturity T, this formula gives us the opportunity towrite a call price as an easily computable function of ĻT and Ī¾T.

Calibration This algorithm calibrates the Heston model using the new formulato compute prices. We consider a time dependent Heston model with constantv0 (chosen accordingly to our market data) and Īŗ. With several maturities, Ī˜

represents the values of the longterm vol on [0,T1], ..., [TNāˆ’1,TN] and Ī“ and Īž thevalues of the correlation and the volatility of volatility on [0,T1], ..., [TNāˆ’2,TNāˆ’1].If ĻN and Ī¾N are the corresponding values on [TNāˆ’1,TN], the call price for a strikeK is :

PMGB(TN,K) = a(Īŗ,Ī˜,Ī“,Īž) + b(Īŗ,Ī˜,Ī“,Īž)ĻNĪ¾N + c(Īŗ,Ī˜,Ī“,Īž)Ī¾2N + d(Īŗ,Ī˜,Ī“,Īž)Ļ2

NĪ¾2N

It seems really interesting to use this approximation in a calibration proceduresince two market prices are enough to solve the system and get the volatility ofvolatility and the correlation. However, numerical solutions for Ī¾N and ĻN can

Page 112: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 101

be useless for our model (for instance if we have to deal with complex numbersor huge Ī¾N). Finding a good value for ĪøN in order to get interesting Ī¾N and ĻN

is the aim of this calibration.

The algorithm starts with given values for Īŗ, v0 and Īø1 for the first maturity.Then, we use a simplex algorithm to find a good Īø1. For each value of Īø1, thesimplex solves the following system (using Mathemartica):

P1(T1) = a(Īŗ,Īø1) + b(Īŗ,Īø1)Ļ1Ī¾1 + c(Īŗ,Īø1)Ī¾2

1 + d(Īŗ,Īø1)Ļ21Ī¾

21

P2(T1) = a(Īŗ,Īø1) + b(Īŗ,Īø1)Ļ1Ī¾1 + c(Īŗ,Īø1)Ī¾21 + d(Īŗ,Īø1)Ļ2

1Ī¾21

where P1(T1) and P2(T1) are two market prices (at the money and the first strikehigher than the spot) and the coefficients a, b, c and d are given in the previoussubsection. This system gives four solutions but two of them lead to negativeĪ¾. We decided to remove one of the two remaining solutions since it did notgive suitable results with our pricing formula. Then we have to realize ouroptimization with one of these solution : (Ļ1,Ī¾1). The simplex computes (Ī¾ is theupper bound chosen for Ī¾)

F(Ļ1,Ī¾1) = max(0, |Ļ1| āˆ’1) + |Im(Ļ1)|+ |Im(Ī¾1)|+ |min(0,Re(Ī¾1))|

+ max(0,Re(Ī¾1)2āˆ’2ĪŗĪø1) + max(0,Re(Ī¾1)āˆ’Ī¾)

If F(Ļ1,Ī¾1) = 0 this is a good solution and the simplex returns the current value ofĪø1 and save Ļ1 and Ī¾1. Otherwise, it tries to minimize F. To end the calibrationon the first maturity, we use these solutions as an initial vector for a LevenbergMarquardt optimization.

When the model is calibrated on several maturities, T0,...,TN, we write Ī“, Ī˜

and Īž the calibrated values of Ļ, Īø and Ī¾ from 0 to TN and the algorithm beginswith a given value ĪøN+1 for TN+1. For each value of ĪøN+1, the simplex solves thefollowing system :

P1(TN+1) = a(Īŗ,Ī“,Īž,Ī˜,ĪøN+1) + b(Īŗ,Ī“,Īž,Ī˜,ĪøN+1)ĻN+1Ī¾N+1

+c(Īŗ,Ī“,Īž,Ī˜,ĪøN+1)Ī¾2N+1 + d(Īŗ,Ī“,Īž,Ī˜,ĪøN+1)Ļ2

N+1Ī¾2N+1

P2(TN+1) = a(Īŗ,Ī“,Īž,Ī˜,ĪøN+1) + b(Īŗ,Ī“,Īž,Ī˜,ĪøN+1)ĻN+1Ī¾N+1

+c(Īŗ,Ī“,Īž,Ī˜,ĪøN+1)Ī¾2N+1 + d(Īŗ,Ī“,Īž,Ī˜,ĪøN+1)Ļ2

N+1Ī¾2N+1

where P1(TN+1) and P2(TN+1) are two market prices (at the money and the firststrike higher than the spot). Then, the simplex tries to find a ĪøN+1 which givesgood values for ĻN+1 and Ī¾N+1. If it fails, this part is replaced by a classicalsimplex optimizing every parameter. The solution of this subsection is thenused as an initial vector for a Levenberg Marquardt optimization.

Page 113: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 102

Table 4.1: Market data of crude oil future

Maturity Jun09 Jul09 Aug09 Sep09 Oct09 Nov09Future Value 66.12 67.09 68.02 68.86 69.66 70.46

Dec09 Jan10 Feb10 Mar10 Apr10 May1071.11 71.63 72.11 72.59 73.06 73.52Jun10 Jul10 Aug10 Sep10 Oct10 Nov1073.97 74.3 74.53 74.79 75.05 75.3Dec10 Jan11 Feb11 Mar11 Apr11 May1175.56 75.84 76.13 76.42 76.71 77

4.8 Numerical analysis

4.8.1 Market data: crude oil

We take a market data as of the 3rd June 2009. The spot value is 66.12. Thefuture and volatility market data are shown in figure 4.8 , table 4.1 and figure 4.9respectively.

Figure 4.8: Market data of crude oil future

Now we use the spot, futures and volatility surface market data to calibratea Heston model. All the numerical tests are carried on under financial softwarePrice-itĀ®Excel.

We recall Heston model here for convenience.

dSt

St= rdt +

āˆšVtdW1 (4.77)

dVt = k(Īøāˆ’Vt)dt +Ī¾āˆš

VtdW2 (4.78)

There are five parameters to calibrate:

1. V0, the initial value of Vt, is initial volatility.

Page 114: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 103

Figure 4.9: Market data of crude oil volatility

2. Īø is long term volatility.

3. k is mean reversion.

4. Ī¾ is vol of vol.

5. Ļ is the correlation between W1 and W2

.

4.8.2 Calibration Heston model with constant parameters

In the first test, we assume all the parameters of Heston model are constant.And we use only one year maturity on the volatility surface. The optimization

Page 115: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 104

Table 4.2: Calibration result

InitialVol LongTermVol MeanReversion VolOfVol Correlation0.3981 0.0614 1.0922 0.6451 -0.0853

algorithm is set to be Levenberg-Marquardt. The result of calibration is listedin table 4.2. We use this set of parameter in Heston model and re-calculate theoptions we use to calibrate. In this way, we can compare the smile from themodel and the smile of market data. It is shown in figure 4.10.

Figure 4.10: Smile of Market data and Heston model

From figure 4.10 we observe the largest difference between two curves atstrike 95%. In market data, the smile presents a slight ā€œZā€ form instead ofnormal ā€œUā€ form, especially at point strike 95%. Heston model cannot matchexactly at this point. Therefore it causes a relative larger difference than otherpoints. The difference is shown in figure 4.11.

4.8.3 Calibration Heston model with time-dependent parameters

In this test, we continue to use the same market data in figue 4.8 , table 4.1 andfigure 4.9. We let parameters in Heston model be time-dependent, more preciselypiecewise constant. Among the five Heston model parameters, we allow three

Page 116: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 105

Figure 4.11: Difference on smile

of them to be piecewise constant: long term volatility, vol of vol and correlation.Parameter initial volatility and Mean reversion parameter are set to be constantover different maturities. The reason of setting mean reversion to be constant isto decrease the degree of freedom in calibration, in order to stabilize the resultof calibration. We will discuss the stability of calibration in more details in thenext subsection.

Here we take maturity 1 year, 18 months and 2 years. So there are 21 inputvolatility in total, seven strikes for each maturity. The calibration targets tominimize the sum of the square of the difference on all the 21 points. Thecalibrated parameters are listed in table 4.3 and the difference between marketdata and Heston model is shown in figure 4.13 and 4.12.

Compare figure 4.11 and 4.12, we can see they present same form. Thereason is always that market data presents irregularity at strike 95%. Anotherobservation from table 4.3 is that the correlation parameter changes sign in thesecond maturity, though there is little difference in the shape of the volatilitycurves in figure 4.13.

With the observation on correlation parameter, we now set correlation asconstant over different maturities. The calibrated parameters are listed in table4.4 and the difference between market data and Heston model is shown in figure4.14 and 4.15. We can observe that all the parameters are smoother without bigchange over different maturities in table 4.4. And the calibration error in figure4.14 remains as little as that in the case of time-dependent correlation.

Page 117: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 106

Table 4.3: Calibration result for time-dependent Heston

InitialVol LongTermVol MeanReversion VolOfVol Correlation1Y 47.96% 8.68% 408.47% 142.19% -9.13%1Y+6M 47.96% 4.72% 408.47% 186.55% 30.82%2Y 47.96% 7.49% 408.47% 112.43% -1.90%

Figure 4.12: Difference on Smile of Market data and time-dependent Hestonmodel

From the comparison, we find that we donā€™t need to allow every parametersto be time-dependent. Two time-dependent parameters can achieve a calibrationresult as good as the case of three time-dependent parameters.

Page 118: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 107

Figure 4.13: Smile of Market data and Heston model for time dependent case

Table 4.4: Calibration result with constant correlation

InitialVol LongTermVol MeanReversion VolOfVol Correlation1Y 59.77% 6.32% 495.80% 171.95% -5.01%1Y+6M 59.77% 6.06% 495.80% 171.98% -5.01%2Y 59.77% 6.49% 495.80% 169.17% -5.01%

Page 119: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 108

Figure 4.14: Difference on Smile of Market data and time-dependent Hestonmodel with constant correlation

Page 120: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 109

Figure 4.15: Smile of Market data and Heston model for constant correlationcase

Page 121: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 110

4.8.4 Sensibility of parameters

In this subsection, we add 0.02% on the volatility surface of market data. Thenwe use this shifted market data to calibrate Heston model. And we compareagainst the previous result to see if the parameters largely change or just havesmall change.

The spot value remains at 66.12. And the futures are still as in table 4.1. Butthe volatility surface is shifted by 0.02% as shown in figure 4.16.

Figure 4.16: Shifted market data of crude oil volatility

The calibrated parameters are listed in table 4.5 and the difference betweenmarket data and Heston model is shown in figure 4.17 and 4.18.

The result in table 4.4 and 4.5 shows that the parameters of Heston modeldonā€™t change a lot. They are relatively stable. And the calibration result in figure

Page 122: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 111

Table 4.5: Calibration result with constant correlation

InitialVol LongTermVol MeanReversion VolOfVol Correlation1Y 64.48% 5.41% 494.33% 157.08% -19.32%1Y+6M 64.48% 6.36% 494.33% 159.48% -19.32%2Y 64.48% 6.79% 494.33% 155.21% -19.32%

4.17 stays at a good level as the case before shifting in figure 4.14.

Figure 4.17: Difference on Smile of Market data and time-dependent Hestonmodel with shifted market data

Page 123: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 112

Figure 4.18: Smile of Market data and Heston model with shifted market data

Page 124: Commodity Derivatives: Modeling and Pricing

CHAPTER 4. STOCHASTIC VOLATILITY MODEL 113

4.9 Summary

In this chapter, we have mainly discussed stochastic volatility model in twoparts. In the first part (section 4.1 to 4.5), we replace the constant model param-eters to time dependent, which extends the original stochastic volatility modelto dynamic stochastic volatility model. It allows more degree of freedom onthe implied volatility surface generated by these models. We study the closedformula for vanilla option price for three typical stochastic volatility models,namely Heston model, Piterbarg model and SABR model, by employing controlvariate, parameter averaging technique and perturbation theory. Instead of thethe formulas for the option price, we also have the formula for implied volatility.In section 4.6, we discuss the result of vol of vol expansion. The advantage of themethod is that it can get directly the implied volatility which is very convenientfor calibration, but the precision is often not good enough for calibration.

In the second part (section 4.7), we discuss the calibration for stochasticvolatility models. Substantially, calibration is the process to numerically reversethe closed formula of option price in n-dimension space (n is the number of theparameters to calibrate). As the closed formulas for option price have readyachieved in the first part, we focus on different numerical algorithm, includ-ing bootstrap, Newton Raphson, simplex, differential evolution. We list theirpseudo-code and comment on their advantage and limitation.

Page 125: Commodity Derivatives: Modeling and Pricing
Page 126: Commodity Derivatives: Modeling and Pricing

Chapter 5

Conclusion

The modeling of commodity and commodity derivatives consists in three steps:

1. choose the dynamic model of the spot price,

2. calibrate the model,

3. price the products with the calibrated model.

All these three steps are necessarily needed for a complete pricing process andthey are related to each other. Step three needs the model parameters and soit depends on the result of step two, the calibration. Step two needs to usethe formula of vanilla option price to calibrate the model to the market impliedvolatility surface, which depends the choice of the dynamic model. Interestingly,the choice of the dynamic model sometimes depends on the final product wewant to price. For example, if the product is sensitive to the volatility smile,then the stochastic volatility model family is a better choice than Black-Scholesmodel.

In this thesis, we focus on the different choices of dynamic model and theirformula for vanilla option price, as well as some calibration algorithms. Usingthe model to price the products by employing numerical method, such as MonteCarlo simulation or PDE grid is beyond the scope of the thesis.

There are basically two different families of dynamic models in our discus-sion, namely stochastic convenience yield model and stochastic volatility model.We pick up the most popular models, Gibson-Schwartz model and Gabillonmodel, from the family of stochastic convenience yield model. In chapter 2, weproved the mathematical equivalence relation between Gibson-Schwartz modeland Gabillon model. It is not surprising because both models share the similareconomy concept of stochastic convenience yield. And in financial mathematics

115

Page 127: Commodity Derivatives: Modeling and Pricing

CHAPTER 5. CONCLUSION 116

equation, it shows the stochastic drift term. In chapter 3, we presented that bothGibson-Schwartz model and Gabillon model were equivalent to a one-factormodel by introducing model factor reduction technique. Of course, this conclu-sion is true only when the product is not path dependent because the equivalenceis established on the marginal distribution of the stochastic process. Based onthe equivalent one-factor model, we give the formula of vanilla option price withthe underlying being spot as well as forward.

The family of stochastic volatility model is studied in chapter 4. The mainidea is to extend the original version of the models to dynamic models by al-lowing their parameter depending on time. To make it more practical, theparameters are not set to be continuous function depending on time. We choosepiecewise constant parameters. The reason to choose piecewise constant param-eters is based on the fact that market data is normally given on a few maturities.Therefore, we simply donā€™t have the information of the parameters between twomaturities of market data from the point of view of calibration. These piecewiseconstant parameters represent for the average value of the parameters betweentwo maturities in term of calibration result. Based on this assumption, we ex-tended Heston model, SABR model and Piterbarg model into dynamic Hestonmodel, dynamic SABR model and dynamic Piterbarg model. To set up theformula of vanilla price for these extended models we employ different tech-niques including parameter averaging techniques, perturbation theory and volof vol expansion. These formulas are approximation but with a good numericalprecision.

In the last part of chapter 4, we list some calibration algorithms. Thesealgorithms are global optimization in high dimension space. We gave someadjustment to increase the probability of convergence. Although the algorithmsare model-free, they can be more helpful for the family of stochastic volatilitymodel, since these models have more parameters.

Page 128: Commodity Derivatives: Modeling and Pricing

Bibliography

Hansjorg Albrecher, Philipp Mayer, Wim Schoutens, and Jurgen Tistaert. Thelittle heston trap. Wilmott magazine, 1:83ā€“92, 2007. [cited at p. 49, 50]

L. Andersen and J. Andreasen. Jump diffusion process: Volatility smile fittingand numerical methods for pricing. Review of Operations Research, 4:231ā€“262,2000. [cited at p. 39]

L. Andersen and J. Andreasen. Volatile volatilities. Risk, pages 163ā€“168, Decem-ber 2002. [cited at p. 40]

L. B. G. Andersen and R. Brotherton-Ratcliffe. The equity option volatility smile:an implicit finite-difference approach. Journal Computational Finance, 1:5ā€“37,1998. [cited at p. 40]

E. Benhamou and O. Croissant. Local time for the sabr model: Connection withthe ā€™complexā€™ black scholes and application to cms and spread options. SSRN,2007. [cited at p. 66]

Eric Benhamou, Emmanuel Gobet, and Mohammed Miri. Smart expansionand fast calibration for jump diffusion. September 2008. Available at SSRN:http://ssrn.com/abstract=1079627. [cited at p. 39]

Eric Benhamou, Emmanuel Gobet, and Mohammed Miri. Time dependent hes-ton model. SSRN eLibrary, 2009. [cited at p. 76, 77, 78]

P. Bjerksund. Contingent claims evaluation when the convenience yield isstochastic: analytical results. Working paper, Norwegian School of Economicsand Business Administration, 1991. [cited at p. 32]

F. Black and M. Scholes. The pricing of options and corporate liabilities. Journalof Political Economy, 81(3):637ā€“654, 1973. [cited at p. 22, 23, 79]

Michael Brennan. The price of convenience and the valuation of commoditycontingent claims. Diderik Lund and Bernt Oksendal, eds.: Stochastic Models andOption Values, North-Holland, Amsterdam, 1991. [cited at p. 7, 8, 37]

117

Page 129: Commodity Derivatives: Modeling and Pricing

BIBLIOGRAPHY 118

M.J. Brennan. The supply of storage. American Economic Review, 48:50ā€“72, 1958.[cited at p. 8]

J. Casassus and P. Collin-Dufresne. Stochastic convenience yield implied fromcommodity futures and interest rates. Journal of Finance, (60):2283ā€“2331, 2005.[cited at p. 7, 8, 37]

A. Eydeland and H. Geman. Pricing power derivatives. Risk, September 1998.[cited at p. 38]

J. Gabillon. The term structure of oil futures prices. Oxford Institute for EnergyStudies, Working Paper, (17), 1991. [cited at p. 7, 8, 11, 26, 37]

Helyette Geman. Commodities and Commodity Derivatives Modelling and Pricingfor Agriculturals Metals and Energy. Wiley Finance, 2005. [cited at p. 9]

R. Gibson and E. S. Schwartz. Stochastic convenience yield and the pricing of oilcontingent claims. Journal of Finance, XLV(3):959ā€“976, 1990. [cited at p. 7, 8, 10, 11,

29, 37]

I. Gyƶngy. Mimicking the one-dimensional marginal distributions of processeshaving an ito differential. Probability Theory and Related Fields, 71:501ā€“516, 1986.[cited at p. 18]

P. Hagan, D. Kumar, A. Lesniewski, and D. Woodward. Managing smile risk.Wilmott Magazine, pages 84ā€“108, September 2002. [cited at p. 35, 37, 40, 64, 66]

S.L. Heston. A closed-form solution for options with stochastic volatility withapplications to bond and currency options. Review of Financial Studies, 6(2):327ā€“343, 1993. [cited at p. 38, 40, 41]

Jimmy Hilliard and Jorge Reis. Valuation of commodity futures and optionsunder stochastic convenience yields, interest rates, and jump diffusions on thespot. Journal of Financial and Quantitative Analysis, 33(1):61ā€“86, 1998. [cited at p. 7,

8, 37]

John Hull and Alan White. Pricing interest-rate derivative securities. The Reviewof Financial Studies, 3(4):573ā€“592, 1990. [cited at p. 26, 38]

F. Jamshidian and M. Fein. Closed-form solutions for oil futures and europeanoptions in the gibson-schwartz model: A note. 1990. Working paper, MerrillLynch Capital Markets. [cited at p. 32]

Alireza Javaheri. Inside volatility arbitrage : The secrets of skewness. September2005. Wiley Finance. [cited at p. 35, 37]

Page 130: Commodity Derivatives: Modeling and Pricing

BIBLIOGRAPHY 119

C. Kahl and P. Jackel. Not-so-complex logarithms in the heston model. Wilmottmagazine, pages 94ā€“103, September 2005. [cited at p. 49]

N. Kaldor. Speculation and economic stability. The Review of Economic Studies, 7:1ā€“27, 1939. [cited at p. 8]

P. H. Labordere. General asymptotic implied volatility for sv models. 2005. Work-ing paper; available at http://arxiv.org/abs/cond-mat/0504317v2. [cited at p. 40]

Delphine Lautier. Term structure models of commodity prices: a review. TheJournal of Alternative Investments, pages 42ā€“64, Summer 2005. [cited at p. 8]

Delphine Lautier. Convenience yield and commodity markets. Bankers, Markets& Investors, (102):59ā€“66, September - October 2009. [cited at p. 7, 8, 9]

Alan L. Lewis. Option Valuation Under Stochastic Volatility: With Mathematica Code.Finance Press, February 2000. [cited at p. 40, 42, 55, 75]

S. Mikhailov and U. Nogel. Hestonā€™s stochastic volatility model. implementation,calibration and some extensions. Wilmott magazine, pages 74ā€“79, July 2003a.[cited at p. 40, 55, 61]

Sergei Mikhailov and Ulrich Nogel. Hestonā€™s stochastic volatility model imple-mentation, calibration and some extensions. Wilmott magazine, pages 74ā€“79,July 2003b. [cited at p. 41, 50]

M. J. Nielsen and E. S. Schwartz. Theory of storage and the pricing of commodityclaims. Review of Derivatives Research, 7:5ā€“24, 2004. [cited at p. 38]

Yasufumi Osajima. The asymptotic expansion formula of implied volatilityfor dynamic sabr model and fx hybrid model. February 2007. Available athttp://ssrn.com/abstract=965265. [cited at p. 40]

V. Piterbarg. A stochastic volatility forward libor model with a term structure ofvolatility smiles. SSRN, 2003. [cited at p. 53, 62]

V. Piterbarg. Time to smile. Risk, 19(5), May 2005. [cited at p. 39, 53, 56, 57, 62]

M. Richter and C. SĆørensen. Stochastic volatility and seasonality in commodityfutures and options: The case of soybeans. Copenhagen Business School, Workingpaper, 2002. [cited at p. 38]

E. S. Schwartz and J. E. Smith. Short-term variations and long-term dynamics ofcommodity prices. Management Science, 46:893ā€“911, 2000. [cited at p. 7, 8, 11, 12, 15,

37]

Page 131: Commodity Derivatives: Modeling and Pricing

BIBLIOGRAPHY 120

Eduardo S. Schwartz. The stochastic behavior of commodity prices: Implicationsfor valuation and hedging. Journal of Finance, 52(3):923, July 1997. [cited at p. 7, 8,

37]

L. Scott. Option pricing when the variance changes randomly: Theory, estimationand an application. Journal of Financial and Quantitative Analysis, 22(4):419ā€“438,1987. [cited at p. 38]

Steven Shreve. Stochastic calculus for finance ii: Continuous-time models.Springer Finance, 2004. [cited at p. 23]

Stein and Stein. Stock price distributions with stochastic volatility: an analyticalapproach. Review Finance Study, 4:727ā€“752, 1991. [cited at p. 38]

L.G. Telser. Futures trading and the storage of cotton and wheat. Journal ofPolitical Economy, 66:233ā€“255, 1958. [cited at p. 8]

Anders B. Trolle and Eduardo S. Schwartz. Unspanned stochastic volatility andthe pricing of commodity derivatives. SSRN eLibrary, January 2008. [cited at p. 38,

39]

Zaizhi Wang. Effective parameters for stochastic volatility models. Journal ofFinancial Transformation, pages 108ā€“115, September 2009. Paris December 2007Finance International Meeting AFFI-EUROFIDAI Paper; Available at SSRN:http://ssrn.com/abstract=1071768. [cited at p. 66, 67]

Mark D. West. Private ordering at the worldā€™s first futures exchange. (Dojimarice exchange in osaka, japan). Michigan Law Review, 98(8):2574ā€“2615, August2000. Symposium: Empirical Research in Commercial Transactions (Aug.,2000); Stable URL: http://www.jstor.org/stable/1290356. [cited at p. 2]

J. B. Wiggins. Option values under stochastic volatilities. Journal of FinancialEconomics, 19:351ā€“372, 1987. [cited at p. 38]

H. Working. Theory of inverse carrying charge in futures markets. Journal FarmEconomics, 30:1ā€“28, 1949. [cited at p. 8]

F. Zhou. Black smirks. Risk, pages 87ā€“91, May 2003. [cited at p. 40]

Page 132: Commodity Derivatives: Modeling and Pricing

Appendices

121

Page 133: Commodity Derivatives: Modeling and Pricing
Page 134: Commodity Derivatives: Modeling and Pricing

Appendix A

Proof of propositions

A.1 Proof of proposition 25

To porve the proposition, we consider two diffusions more generally,

dX(t) = f (t,X(t))āˆš

z(t)Ļƒ(t)dW(t), X(0) = x0

dY(t) = f (Y(t))āˆš

z(t)Ļƒ(t)dW(t), Y(0) = x0

Without loss of generality:

f (t,x0) ā‰” f (x0) = 1

The objective is to find the best function f (Ā·), which minimizes the "distance"between two diffusion X(t) and Y(t).

We note the objective function to minimize as:

p(X,Y) = E[(X(T)āˆ’Y(T))2

]Use Itoā€™s lemma and then expand to the first order,

p(X,Y) =

āˆ« T

0E

[z(t)( f (t,X(t))āˆ’ f (Y(t)))2

]Ļƒ2(t)dt

=

āˆ« T

0E

z(t)(āˆ‚āˆ‚x

f (t,x0)(X(t)āˆ’x0)āˆ’āˆ‚āˆ‚x

f (x0) (Y(t)āˆ’x0))2Ļƒ2(t)dt

Further approximation:

X(t)āˆ’x0 = X0(t)āˆ’x0

Y(t)āˆ’x0 = X0(t)āˆ’x0

123

Page 135: Commodity Derivatives: Modeling and Pricing

APPENDIX A. PROOF OF PROPOSITIONS 124

Where X0(t) is given by,

dX0(t) =āˆš

z(t)Ļƒ(t)dW(t), X0(0) = x0 (A.1)

Finally we get,

p(X,Y) =

āˆ« T

0

(āˆ‚āˆ‚x

f (t,x0)āˆ’āˆ‚āˆ‚x

f (x0))2

E[z(t)(X0(t)āˆ’x0)2

]Ļƒ2(t)dt

Minimizing p(X,Y) leads to,

āˆ‚ f (x0)āˆ‚x

=

āˆ« T

0

āˆ‚ f (t,x0)āˆ‚x

w(t)dt (A.2)

where the weights w(t) can calculate by,

w(t) =v2(t)Ļƒ2(t)āˆ« T

0 v2(t)Ļƒ2(t)dt(A.3)

v2(t) = E[z(t)(X0(t)āˆ’x0)2

](A.4)

Now we apply the general result (A.2) to our diffusion,

dS(t) = Ļƒ(t)(Ī²(t)S(t) + (1āˆ’Ī²(t))S(0))āˆš

z(t)dU(t)

And we can get,

b =

āˆ« T

0Ī²(t)w(t)dt

w(t) =v2(t)Ļƒ2(t)āˆ« T

0 v2(t)Ļƒ2(t)dt

v2(t) = E[z(t)(X0(t)āˆ’x0)2

]Remark In this part of proof, we donā€™t use the zero correlation condition.Therefore, the result (A.2) can also apply to a model with non-zero correlation.

Now we calculate v2(t). It needs the assumption of zero correlation.

Using the independence between X0 and z we obtain,

v2(t) = E[z(t)(X0(t)āˆ’x0)2

](A.5)

= E[z(t)E

[(X0(t)āˆ’x0)2

āˆ£āˆ£āˆ£z(Ā·)]]

(A.6)

= E

[z(t)

āˆ« T

0z(s)Ļƒ2(s)ds

](A.7)

=

āˆ« T

0Ļƒ2(s)E[z(t)z(s)]ds (A.8)

Page 136: Commodity Derivatives: Modeling and Pricing

APPENDIX A. PROOF OF PROPOSITIONS 125

Calculate the covariance of z(t) and substitute in equation (A.8), we can get,

v2(t) = z20

āˆ« t

0Ļƒ2(s)ds + z0Ī·

2eāˆ’Īøtāˆ« t

0Ļƒ2(s)

eĪøsāˆ’ eāˆ’Īøs

2Īøds

This proves proposition 25.

A.2 Proof of proposition 26

We can first remark that the model given by:

dS(t) = Ļƒ(t)(bS(t) + (1āˆ’b)S(0))āˆš

z(t)dU(t)

where the stochastic volatility z(t)follows

dz(t) = Īø(z0āˆ’ z(t))dt +Ī·āˆš

z(t)dV(t),z(0) = z0

has closed formula. We can use Fourier transform as in Lewis (2000), with thenon-constant sigma directly. In this case, instead of solving a Riccati systemby Runge-Kutta method, we have to solve a Riccati system in the fundamentaltransform to construct the closed formula.

Consider the European style at-the-money call option.

E (S(T)āˆ’S0)+ = E[E

[(S(T)āˆ’S0)+]āˆ£āˆ£āˆ£z(Ā·)

](A.9)

Where (S(T)āˆ’S0)+ =

S(T)āˆ’S0, when S(T) ā‰„ S0

0, when S(T) < S0

Thatā€™s to say, we can express(S(T)āˆ’S0)+ as a function ofZ(T), saying, in formof g(Z(T)). And

Z(T) =

āˆ« T

0Ļƒ2(t)z(t)dt (A.10)

Now we derive the explicit form of function g(Ā·)

Page 137: Commodity Derivatives: Modeling and Pricing

APPENDIX A. PROOF OF PROPOSITIONS 126

The process S(t)is a shifted lognormal.

dS(t) = Ļƒ(t)(bS(t) + (1āˆ’b)S0)āˆš

z(t)dU(t),S(0) = S0

dS(t) = Ļƒ(t)b(S(t) +

(1āˆ’b)b

S0

) āˆšz(t)dU(t)

d(S(t) +

1āˆ’ bb

S0

)= Ļƒ(t)b

(S(t) +

(1āˆ’b)b

S0

) āˆšz(t)dU(t)

S(t) +1āˆ’b

bS0 = exp

āˆ« t

0Ļƒ(t)b

āˆšz(t)dU(t) + log

(S0 +

1āˆ’bb

S0

)=

S0

bexp

āˆ« t

0Ļƒ(t)b

āˆšz(t)dU(t)

S(T)āˆ’S0 =S0

b

(exp

āˆ« t

0Ļƒ(t)b

āˆšz(t)dU(t)āˆ’1

)(S(T)āˆ’S0)+ =

S0

b

(exp

āˆ« t

0Ļƒ(t)b

āˆšz(t)dU(t)āˆ’1

)+

The right-hand side of the equation (expāˆ« t

0 Ļƒ(t)bāˆš

z(t)dU(t)āˆ’1)+can be regardedas Black-Scholes model, with the spot price being 1, strike price being 1 andinterest rate being 0.

Therefore,

(S(T)āˆ’S0)+ =S0

b(N(d1)āˆ’N(d2))

With

d1 =log(S0/K) + (r + (vol)2/2)T

Ļƒ Ā·āˆš

T=Ļƒ Ā·āˆš

T2

=12

bāˆš

Z(T) (A.11)

d2 = d1āˆ’Ļƒāˆš

T = āˆ’12

bāˆš

Z(T)

N(y) = 1āˆš

2Ļ€

āˆ« yāˆ’āˆž

eāˆ’x2/2dx is cumulative function of standard normal distribu-tion. Z(T)is defined in A.10.

c(S(T)āˆ’S0)+ =S0

b

(N

(12

bāˆš

Z(T))āˆ’N

(āˆ’

12

bāˆš

Z(T)))

=S0

b

(2N

(12

bāˆš

Z(T))āˆ’1

)We note g(x) = S0

b

(2N

(12 bāˆš

x)āˆ’1

)So we get an equation for Ī»:

Eg(āˆ« T

0Ļƒ2(t)z(t)dt

)= Eg

(Ī»2

āˆ« T

0z(t)dt

)(A.12)

Now we make an approximation for g(x)

Page 138: Commodity Derivatives: Modeling and Pricing

APPENDIX A. PROOF OF PROPOSITIONS 127

We have obtained g(x) = S0b

(2N

(bāˆš

x2

)āˆ’1

)Here, N(y) = 1

āˆš2Ļ€

āˆ« yāˆ’āˆž

eāˆ’x2/2dx is cumulative function of standard normal dis-tribution

Note that, N(

bāˆš

x2

)= 1āˆš

2Ļ€

āˆ« bāˆš

x2

āˆ’āˆžeāˆ’t2/2dt

The derivative respect to x is:(N

(bāˆš

x2

))ā€²=

1āˆš

2Ļ€eāˆ’

b2x8

So, we can approximate N(

bāˆš

x2

)with p + qeāˆ’rx

Thatā€™s to say,g(x) ā‰ˆ a + beāˆ’cx (A.13)

a,b and care constants, which are to be determined.Substitute (A.13) into (A.12), we get:

a + bEexp(āˆ’c

āˆ« T

0Ļƒ2(t)z(t)dt

)= a + bEexp

(āˆ’cĪ»2

āˆ« T

0z(t)dt

)

Eexp(āˆ’c

āˆ« T

0Ļƒ2(t)z(t)dt

)= Eexp

(āˆ’cĪ»2

āˆ« T

0z(t)dt

)(A.14)

And the constant ccan be given by:

c = āˆ’gā€²ā€²(Ī¶)gā€²(Ī¶)

,

Ī¶ = EZ(t) =

āˆ« T

0Ļƒ2(t)Ez(t)dt = z0

āˆ« T

0Ļƒ2(t)dt

We note the Laplace transform of Z(T) as:

Ļ•(Āµ) = Eexp(āˆ’ĀµZ(T))

And the Laplace transform of z(t) as:

Ļ•0(Āµ) = Eexp(āˆ’Āµ

āˆ« T

0z(t)dt

)Substitute the transforms above into (A.14), we get the equation, from which wecan solve the parameterĪ».

Ļ•0

(āˆ’

gā€²ā€²(Ī¶)gā€²(Ī¶)

Ī»2)

= Ļ•

(āˆ’

gā€²ā€²(Ī¶)gā€²(Ī¶)

)(A.15)

The Laplace transform can be presented as:

Ļ•(Āµ) = exp(A(0,T)āˆ’ z0B(0,T))

Page 139: Commodity Derivatives: Modeling and Pricing

APPENDIX A. PROOF OF PROPOSITIONS 128

A(t,T) and B(t,T) suffice the Riccati system of ordinary differential equations:

Aā€²(t,T)āˆ’Īøz0B(t,T) = 0 (A.16)

Bā€²(t)āˆ’ĪøB(t,T)āˆ’12Ī·2B2(t,T) +ĀµĻƒ2(t) = 0 (A.17)

A(T,T) = B(T,T) = 0

WhenĻƒ(t) ā‰” 1, we arrive from Ļ•(Āµ) to Ļ•0(Āµ). This case can be solved explicitly.

Ļ•0(Āµ) = exp(A0(0,T)āˆ’ z0B0(0,T))

B0(0,T) =2Āµ(1āˆ’ eāˆ’Ī³T)

(Īø+Ī³)(1āˆ’ eāˆ’Ī³T) + 2Ī³eāˆ’Ī³T

A0(0,T) =2Īøz0

Ī·2 log(

2Ī³Īø+Ī³(1āˆ’ eāˆ’Ī³T) + 2Ī³eāˆ’Ī³T

)āˆ’2Īø

Āµ

Īø+Ī³T

Here,Ī³ =āˆšĪø2 + 2Ī·2Āµ

To solve (A.15), we use Newtonā€™s root searching method. We use recursionto get the solution of f (x) = 0.

xn+1 = xnāˆ’f (xn)f ā€²(xn)

Because the scope of Ī»2 is relatively small, saying [0,2] typically, Newtonā€™smethod works very fast.

(A.16) and (A.17) are called Riccati system. When Ļƒ(t) is not constant, Riccatisystem doesnā€™t have explicit solution. In this case, we employ Runge-Kuttaā€™smethod to solve it numerically which concludes the result.

A.3 Proof of proposition 28

According to the definition in equation (A.4)

v2(t) = E

z(t)(āˆ« t

0

āˆšz(u)Ļƒ(u)dW(u)

)2We note Y(t) =

āˆ« t0

āˆšz(u)Ļƒ(u)dW(u) and N(t) = Y(t)2.

Apparently, N(0) = Y(0) = 0.

E[Y(t)] = 0

E[N(t)] = z0

āˆ« t

0Ļƒ(s)2ds

dN(t) = 2Y(t)dY(t) + d怈Y怉t

Page 140: Commodity Derivatives: Modeling and Pricing

APPENDIX A. PROOF OF PROPOSITIONS 129

dz(t) = Īø(t)(z0āˆ’ z(s))ds +Ī³(s)āˆš

z(s)dV(s)

with z(0) = z0

E[z(s)2

]= z2

0 + z0eāˆ’2Īø(s)sāˆ« s

0e2Īø(u)uĪ³(u)2du

Here we use the integration by parts (XtYt = X0Y0 +āˆ« t

0 Xsāˆ’dYs +āˆ« t

0 Ysāˆ’dXs +

怈X,Y怉t)

v2(t) = E[z(t)N(t)] (A.18)

= E

[āˆ« t

0z(s)dN(s) +

āˆ« t

0N(s)dz(s) + 怈z,N怉t

](A.19)

The first term of equation A.19:

E

[āˆ« t

0z(s)dN(s)

]= E

[āˆ« t

0z(s) Ā·2Y(s)dY(s) +

āˆ« t

0z(s)d怈Y怉s

]= E

[āˆ« t

0z(s)d怈Y怉s

]= E

[āˆ« t

0z(s)2Ļƒ(s)2ds

]=

āˆ« t

0E

[z(s)2

]Ļƒ(s)2ds

The second term of equation A.19:

E

[āˆ« t

0N(s)dz(s)

]= E

[āˆ« t

0N(s)Īø(s) (z(0)āˆ’ z(s))ds +

āˆ« t

0N(s)Ī³(s)

āˆšz(s)dV(s)

]= z0

āˆ« t

0E[N(s)]Īø(s)dsāˆ’

āˆ« t

0E[N(s)z(s)]Īø(s)ds

The third term of equation A.19:

E[怈z,N怉t] = E

[āˆ« t

0Ī³(s)z(s) Ā·4Y(s)2Ļƒ(s)Ļ(s)ds

]= 4

āˆ« t

0Ī³(s)Ļƒ(s)Ļ(s)E [z(s)N(s)]ds

Finally we get,

E[z(t)N(t)] = A(t) +

āˆ« t

0B(s)E[z(s)N(s)]ds (A.20)

Page 141: Commodity Derivatives: Modeling and Pricing

APPENDIX A. PROOF OF PROPOSITIONS 130

Or,

v2(t) = A(t) +

āˆ« t

0B(s)v2(s)ds

dv2(t)dt

= Aā€²(t) + B(t)v2(t) (A.21)

with

A(t) =

āˆ« t

0E

[z(s)2

]Ļƒ(s)2ds + z0

āˆ« t

0E[N(s)]Īø(s)ds

B(s) = 4Ī³(s)Ļƒ(s)Ļ(s)āˆ’Īø(s)

v2(0) = 0

Note that the ordinary differential equation (ODE) A.21 is easy to solve if allthe parameters are constant. The solution in this case is,

v2(t) = eāˆ«

B(t)dt(āˆ«

Aā€²(t)eāˆ’āˆ«

B(t)dtdt + K)

(A.22)

K is the constant of integration,

K = āˆ’

āˆ«Aā€²(t)eāˆ’

āˆ«B(t)dtdt

āˆ£āˆ£āˆ£āˆ£āˆ£t=0

Now we consider all the parameters of the model is piecewise constant. TheODE A.21 can be solved in each interval. And the initial value for each intervalis the last value for the last interval.

Suppose we have n periods: 0 = t0 < t1 < t2 < Ā· Ā· Ā· < tn = T.When tiāˆ’1 < t < ti, we note Ļƒ(t) = Ļƒi, Ī³(t) = Ī³i, Ļ(t) = Ļi, Īø(t) = Īøi ,Ī²(t) = Ī²i,

B(t) = Bi = 4Ī³iĻƒiĻiāˆ’Īøi.

Now solve ODE A.21 on interval [tiāˆ’1, ti], with the initial value v2iāˆ’1 = v2(tiāˆ’1).

We obtain the solution for t āˆˆ (tiāˆ’1, ti],

v2(t) = eāˆ«

B(t)dt(āˆ«

Aā€²(t)eāˆ’āˆ«

B(t)dtdt + K)

(A.23)

K = v2iāˆ’1eāˆ’

āˆ«B(t)dt

āˆ’

āˆ«Aā€²(t)eāˆ’

āˆ«B(t)dtdt

āˆ£āˆ£āˆ£āˆ£āˆ£t=tiāˆ’1

(A.24)

For the first period, initial value v20 = v2(0) = 0.

Page 142: Commodity Derivatives: Modeling and Pricing

APPENDIX A. PROOF OF PROPOSITIONS 131

Now we substitute the initial value and develop equation A.23. It yields,

v2(t) =v2iāˆ’1eBi(t+1) +

(eBi(tāˆ’tiāˆ’1)

āˆ’1)Ļƒ2

i z01Bi

(z0 +

12Īøi

Ī³2i

)+ z2

0Ļƒ2i Īøi

1Bi

(eBi(tāˆ’tiāˆ’1)Bitiāˆ’1 + eBi(tāˆ’tiāˆ’1)

āˆ’Bitāˆ’1)

āˆ’1

2Īøi(Bi + 2Īøi)Ļƒ2

i z0Ī³2i

(eBi(tāˆ’tiāˆ’1)āˆ’2Īøitiāˆ’1 āˆ’ eāˆ’2Īøit

)with Bi = 4Ī³iĻƒiĻiāˆ’Īøi.

It proves proposition 28.

Page 143: Commodity Derivatives: Modeling and Pricing
Page 144: Commodity Derivatives: Modeling and Pricing

List of Symbolsand Abbreviations

Abbreviation Description Definition

ADC average daily commissionsADV average daily volumeBS model Black Scholes modelBS implied vol implied Black Scholes volatilityD(t,T) discount factor It equals to the value of 1 unit

of risk free bond at time t withmaturity at time T.

Ft,T forward price The forward price at time t withmaturity T.

ICE Intercontinental ExchangeLV model local volatility modelOTC over-the-counterOU process Ornstein Uhlenbeck processSt spot price Spot price at time t.SV model stochastic volatility modelvol of vol volatility of volatility

133

Page 145: Commodity Derivatives: Modeling and Pricing
Page 146: Commodity Derivatives: Modeling and Pricing

List of Figures

4.1 Option price with different formula under Heston model . . . . . . . 444.2 vega and spot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 474.3 Effect of control variate to implied volatility - without control variate 474.4 Effect of control variate to implied volatility - with control variate . . 484.5 Option price with different formula under Heston model . . . . . . . 504.6 Comparison of series A (expasion on price), series B (expansion on

implied volatility) and exact volatility . . . . . . . . . . . . . . . . . . 744.7 An illustration of one iteration of Newtonā€™s method (the function f

is shown in blue and the tangent line is in red). We see that xn+1 is abetter approximation than xn for the root x of the function f . . . . . . 87

4.8 Market data of crude oil future . . . . . . . . . . . . . . . . . . . . . . 1024.9 Market data of crude oil volatility . . . . . . . . . . . . . . . . . . . . . 1034.10 Smile of Market data and Heston model . . . . . . . . . . . . . . . . . 1044.11 Difference on smile . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1054.12 Difference on Smile of Market data and time-dependent Heston model 1064.13 Smile of Market data and Heston model for time dependent case . . 1074.14 Difference on Smile of Market data and time-dependent Heston model

with constant correlation . . . . . . . . . . . . . . . . . . . . . . . . . . 1084.15 Smile of Market data and Heston model for constant correlation case 1094.16 Shifted market data of crude oil volatility . . . . . . . . . . . . . . . . 1104.17 Difference on Smile of Market data and time-dependent Heston model

with shifted market data . . . . . . . . . . . . . . . . . . . . . . . . . . 1114.18 Smile of Market data and Heston model with shifted market data . . 112

135

Page 147: Commodity Derivatives: Modeling and Pricing
Page 148: Commodity Derivatives: Modeling and Pricing

List of Tables

1.1 The commodity categories . . . . . . . . . . . . . . . . . . . . . . . . . 21.2 Some commodity exchanges . . . . . . . . . . . . . . . . . . . . . . . . 31.3 Some commodity indices . . . . . . . . . . . . . . . . . . . . . . . . . . 4

2.1 The Relationships Between Parameters in Gabillon Model and GibsonSchwartz model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

4.1 Market data of crude oil future . . . . . . . . . . . . . . . . . . . . . . 1024.2 Calibration result . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1044.3 Calibration result for time-dependent Heston . . . . . . . . . . . . . . 1064.4 Calibration result with constant correlation . . . . . . . . . . . . . . . 1074.5 Calibration result with constant correlation . . . . . . . . . . . . . . . 111

137

Page 149: Commodity Derivatives: Modeling and Pricing
Page 150: Commodity Derivatives: Modeling and Pricing

Index

backwardation, 10Black Scholes mdoel, 22Bootstrap, 81

CalibrationBootstrap, 81Calibration order, 83Differential evolution algorithm, 92Levenberg Marquardt, 88MGB algorithm, 96Modified simplex algorithm, 92Nested calibration, 85Newton Raphson, 86Numerical method, 86Optimization, 83Simplex algorithm, 91Successive calibration, 84

Calibration order, 83contango, 10convenience yield, 8

backwardation, 10concept, 8contango, 10definition, 9formula, 9

Differential evolution algorithm, 92dynamic SABR model, 65

Heston, 56

Levenberg Marquardt, 88

MGB algorithm, 96model factor reduction, 17Modified simplex algorithm, 92

Nested calibration, 85Newton Raphson, 86

Numerical method, 86

Optimization, 83

SABR model, 64definition, 65

short-term/long-term model, 12Simplex algorithm, 91Successive calibration, 84

139

Page 151: Commodity Derivatives: Modeling and Pricing

Produits DĆ©rivĆ©s des MatiĆØres PremiĆØres:

ModƩlisation et Evaluation

RESUME : Les prix des matiĆØres premiĆØres ont augmentĆ© Ć  un rythme sans prĆ©cĆ©dent au

cours des derniĆØres annĆ©es rendant les dĆ©rivĆ©s sur matiĆØres premiĆØres de plus en plus

populaires dans de nombreux secteurs comme lā€™Ć©nergie, les mĆ©taux et les produits agricoles.

Le dĆ©veloppement rapide du marchĆ© des produits dĆ©rivĆ©s sur matiĆØres premiĆØres a aussi induit

une recherche vers toujours plus de prĆ©cision et cohĆ©rence dans la modĆ©lisation et lā€™Ć©valuation

de produits dĆ©rivĆ©s des matiĆØres premiĆØres.

Reposant sur le principe de diffusion Ʃquivalente introduite par Gyƶngy, nous montrons que le

modĆØle de Gibson Schwartz et le modĆØle de Gabillon peuvent se rĆ©duire Ć  un modĆØle Ć  un

facteur dont la distribution marginale peut ĆŖtre explicitement calculĆ©e sous certaines conditions.

Ceci nous permet en particulier de trouver des formules analytiques pour lā€™ensemble des

options vanilles. Certaines de ces formules sont nouvelles Ć  notre connaissance et dā€™autres

confirment des rƩsultats antƩrieurs.

Dans cette thĆØse, nous Ć©tendons les idĆ©es de Piterbarg Ć  la famille des modĆØles Ć  volatilitĆ©

stochastique en rendant le concept plus gƩnƩral. Nous montrons en particulier comment

introduire des paramĆØtres dĆ©pendant du temps dans les modĆØles Ć  volatilitĆ© stochastique et

explicitons diffĆ©rentes formules de calcul explicite du prix dā€™options vanilles, permettant ainsi

une calibration des paramĆØtres du modĆØles extrĆŖmement efficace.

Mots clĆ©s : Produits DĆ©rivĆ©s des MatiĆØres PremiĆØres, modĆØle de Gibson Schwartz, modĆØle de

Gabillon, ThĆ©orĆØme de Gyƶngy, modĆØle Ć  volatilitĆ© stochastique, parametres dĆ©pend du temp.

Commodity Derivatives:

Modeling and Pricing

ABSTRACT : Commodity prices have been rising at an unprecedented pace over the last years

making commodity derivatives more and more popular in many sectors like energy, metals and

agricultural products. The quick development of commodity market as well as commodity

derivative market results in a continuously uprising demand of accuracy and consistency in

commodity derivative modeling and pricing.

In this thesis, we prove that there is mathematical equivalence relation between Gibson

Schwartz model and Gabillon model. Moreover, inspired by the idea of Gyƶngy, we show that

Gibson Schwartz model and Gabillon model can reduce to one-factor model with explicitly

calculated marginal distribution under certain conditions, which contributes to find the analytic

formulas for forward and vanilla options. Some of these formulas are new to our knowledge and

other formulas confirm with the earlier results of other researchers.

We extend Piterbarg's idea to the whole family of stochastic volatility model, making all the

stochastic volatility models having time-dependent parameters and show various formulas for

vanilla option price by employing various techniques such as characteristic function, Fourier

transform, small error perturbation, parameter averaging, etc.

Keywords : Commodity derivative, Gibson-Schwartz model, Gabillon model, theory of

Gyƶngy, Stochastic volatility model, time-dependent parameter.