probability elicitation and calibration in a research & development portfolio a 13-year case...

Probability Elicitation and Calibrationin a Research & Development Portfolio

A 13-Year Case Study

• Dimensions of value for R&D projects• Probability of technical success as a metric• Assessing probabilities• A review of thirteen years of data

Jay AndersenEli Lilly and Company

June, 2012

R&D Portfolio Model Structure

ProjectValue

TechnicalFeasibility

MarketOpportunity

Cost Timing

PreclinicalTech. Feas.

Phase 1Tech. Feas.

Phase 3+Tech. Feas.

Phase 2Tech. Feas.

CapitalCosts

ClinicalCosts

DevelopmentCosts Regulatory

Review

PreclinicalStudies

ClinicalStudies

Patients Competition

All CostsTechnicalFeasibility

PreclinicalTech. Feas.

Phase 1Tech. Feas.

Phase 3+Tech. Feas.

Phase 2Tech. Feas.

The focus of today’s discussion 2

Measuring Technical Feasibility

• Qualitative descriptions of uncertainty suffer from vagueness and lack of collective agreement on useful definitions.

• A subjective probability represents the degree of belief in an event by an individual.

• Quantification of this uncertainty allows other business metrics to be specified (e.g, probabilized NPV, cash flows, expenses).

3

A probability of technical success can be elicited by type of uncertainty

TechnicalSuccess

ToxicologyResults

ClinicalEfficacyResults

ClinicalSafetyResults

RegulatoryResults

4

A probability of technical success can be elicited by stage of development

TechnicalSuccess

PreclinicalSuccess

Phase 1Success

Phase 2Success

Phase 3 &Registration

Success

5

Each stage of development can be broken down by type of uncertainty

Phase 2Success

ToxicologyResults

TumorResponse

Rate

Time toProgression

AdverseEvents

6

Probability ElicitationProcess Alternatives

• Self-assessed by project team • Using a trained facilitator• Independent review board• Independent review board with a trained

facilitator

7

Facilitators are trained to deal with bias

• Anchoring and Adjustment• Availability• Conditioning• Motivational• Representativeness

8

A historical assessment ofprobability projections

• At Lilly, an independent review board (PAG) has been charged with the responsibility of objectively assessing the P(TS) of R&D portfolio projects since early 1997.

• These assessments have been partitioned by stage of development:– P(preclinical success)– P(phase 1 success given preclinical success)– P(phase 2 success given phase 1 success)– P(phase 3 & registration success given phase 2 success)

• Our database has over 730 probability estimates over the past 13 years. We have been able to couple these estimates with actual success and failures to determine the accuracy of probability assessments.

9

0.10 0.20 0.30 0.40 0.50 0.60 0.70 0.80 0.90

0.10

0.20

0.30

0.40

0.50

0.60

0.70

0.80

0.90

PAG probability assessment

Act

ual S

ucce

ss R

ate

146

PAG predicted a probability of 0.80146 times over the last 13 years. 118 of those events were successes, for an observed success rate of 0.81.

PAG Performance

10

Figure 1

0.10 0.20 0.30 0.40 0.50 0.60 0.70 0.80 0.90

0.10

0.20

0.30

0.40

0.50

0.60

0.70

0.80

0.90


Act

ual S

ucce

ss R

ate

Most of the “high-sample size” observations lie within a +/- 0.10 band about the target line

134

146

70

111

1

45

23

3022

2411

8

PAG Performance, all raw data2

2

2 2

4

2

2

2 2 1

12 12 1 1 1 2 1 1

223

1

1

21

11

11 3 110

318

1

11 111

Figure 2

0.10 0.20 0.30 0.40 0.50 0.60 0.70 0.80 0.90

0.10

0.20

0.30

0.40

0.50

0.60

0.70

0.80

0.90


Act

ual S

ucce

ss R

ate

19

139

In this analysis, “nearby” observations were combined together into adjacent intervals centered at the 2½% points.

For example, there were:• 5 observations at 0.48 (4 successes)• 45 observations at 0.50 (24 successes)• 2 observations at 0.51 (2 successes)• 1 observation at 0.52 (0 successes)• 2 observations at 0.53 (0 successes)• 1 observation at 0.55 (0 successes)• 2 observation at 0.57 (0 successes)These combined for:• 58 observations at 0.525 (30 successes)

41

49 56

58

117

96153

5

Collecting Nearby Observations into Buckets, view 1

12

Figure 3

0.10 0.20 0.30 0.40 0.50 0.60 0.70 0.80 0.90

0.10

0.20

0.30

0.40

0.50

0.60

0.70

0.80

0.90


Act

ual S

ucce

ss R

ate

16

22

52

55

78

119

75

173

140

3

For example, there were:• 3 observations at 0.34 (1 success)• 18 observations at 0.35 (6 successes)• 2 observations at 0.36 (1 success) • 1 observation at 0.37 (0 success)• 30 observations at 0.40 (13 successes)• 1 observation at 0.42 (0 success)These combined for:• 55 observations at 0.375 (21 successes)


13

Figure 4

In this analysis, “nearby” observations were combined together into adjacent intervals centered at the 7½% points.

0.10 0.20 0.30 0.40 0.50 0.60 0.70 0.80 0.90

0.10

0.20

0.30

0.40

0.50

0.60

0.70

0.80

0.90


Act

ual S

ucce

ss R

ate

In this analysis, intervals were centered about the deciles, and observations at 0.05, 0.15, …, 0.95 were split half/half between adjacent intervals.

For example, there were:• 23 observations at 0.45 (7 successes)These were split “evenly” between the

intervals centered at 0.40 & 0.50.


17½

33

49

67

84½

164

2

118½

138½

14

2½

56½

Figure 5

Summary

• The PAG assessments have been remarkably accurate – no matter how the data are grouped the actual results are usually within 5% of the predictions, and never more than 10% away.

• Our experience has shown that a well-planned process for probability assessment can provide executives with reliable measurements of technical feasibility.– A careful consideration of technical feasibility is key to portfolio

management.– Probability is an excellent language for quantifying this

uncertainty.

15

probability elicitation and calibration in a research & development portfolio a 13-year case...

Documents