pitfalls of historical data use in software estimation · employee number total regular hours...

30
Pitfalls of Historical Data Use in Software Estimation David DeWitt – Director Commercial and International Programs Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Upload: others

Post on 12-Jul-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Pitfalls of Historical Data Use in Software Estimation

David DeWitt – Director Commercial and International Programs

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 2: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Mark Twain Famously Said…

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 3: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

'There are three kinds of lies: lies, damned lies, and statistics'

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 4: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Problem Is – He Never Said it… His Autobiography Quotes Benjamin Disraeli, a

19th century British Prime Minister

But no historical evidence exists that Disraeli made

this statement either.

It was Prime Minister Arthur Balfour Quoting

Professor Joseph Munro!Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 5: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

5

Around the Same Time Mark Twain was Misremembering a Famous Quote

“Let data and facts do the talking”Frederick Taylor: The Principals of Scientific Management 1901.

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 6: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

“In God We Trust… all Others Bring Data”

- W. Edwards Deming6

Can We Trust the Data?Maybe… But

People Can be A Bigger Problem

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 7: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Four Common Fallacies Made in Historical Data Selection

And You Won’t Even See It Coming

Narrative Fallacy Casual Analysis

7

Silent Evidence Ludic Fallacy

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 8: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

“…limited ability to look at sequences

of facts without weaving an

explanation into them, or,

equivalently, forcing a logical link, an

arrow of relationship upon them”

8

Narrative Fallacy

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 9: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Sort Pieces

Build The Frame

Assemble Large Parts

The Narrative Fallacy

A Quick Example

Fill In The Holes

Hunt for Missing Pieces

Assume Some Missing

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 10: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

1. Neckerchiefs

2. Dogs Paws

3. Tower

4. Umbrellas

5. Sunglasses

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 11: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Our Trust in Narratives Can Shape How Data is SelectedPresented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 12: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Our Trust in Narratives Can Shape How Data is SelectedPresented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 13: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

How To Prevent the Narrative Fallacy

13

Nobel Prize winner Daniel Kahneman provided a solution to the impact of an “Inside View” in his book Thinking Fast and Slow

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 14: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

It’s Hard to See the Narrative Coming!Famous Authors Hold Their Heads…

Steven King

Truman Capote

JK Rowling Oscar Wilde Kurt Vonnegut

Virginia Woolf

14 Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 15: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

How Can Causal Errors Slip In?

SAME ENVIRONMENT

We specifically selected only

ERP and Business Mission

Critical Environment data

SAME PROBLEM DOMAIN

We said we were only looking

for any projects that were built

for Financial Transactions.

SIMILAR SCALE

We limited our data to only

completed projects of our

anticipated size in Function

Points

ASSUMED CORRELATION

The Narrative lead us to

believe all these data points

were of equal value for

analysis. And they may be!

15 Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 16: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Correlation Can Make Us Assume Causation

Same Vendor and Same Team!

Schedule Includes Three months of Sequestration

Offshore Team Working uncompensated Hours not captured

Missing Test Phase

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 17: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

How To Prevent Causal Analysis Errors?

17

Again we turn to the Outside View. Need to have someone that understands what the data means – not just where it falls in a historical

trend line.Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 18: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

“In God We Trust… all Others Bring Data”

- W. Edwards Deming18

How Can We Trust the Data?Better Question is… Do We Have All the

Data?

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 19: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Building A Historical Data Archive

ID Completed Projects

We look for programs that

have a sufficient available

data to store in a repository.

Sort Into Domains

We specify what bucket of

data the projects belong and

we create identifiers for easy

extraction.

Grade Data Quality

We provide some indicators as

to the integrity of the data and

any limitations to usage –

typically a quality flag.

Promote the Data

We Notify others of the data

availability and encourage

usage of the repository.

19 Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 20: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

ID Completed Projects

We look for programs that

have a sufficient available

data to store in a repository.

Sort Into Domains

We specify what bucket of

data the projects belong and

we create identifiers for easy

extraction.

Grade Data Quality

We provide some indicators as

to the integrity of the data and

any limitations to usage –

typically a quality flag.

Promote the Data

We Notify others of the data

availability and encourage

usage of the repository.

20

Excluded Failed Projects

Rationalize Buckets When Difficult to Sort

Intentionally Allow Bad Data to Remain Silent

Fail to Mention the Missing Evidence

Uncomfortable Narratives Are Eliminated

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 21: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Transparency Into All the Evidence Forces Difficult Conversations

Descoped

Canceled

Overrun

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 22: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Fundamental Problems With Historical Data

It’s costly to obtain

Difficult to catalog and store

Can be intentionally laced with inaccuracies

Can often be wrong

Fallacy of Silent Evidence

Drives the conversation about data quality

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 23: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

“basing studies of

chance on the narrow

world of games and

dice”

23

Ludic Fallacy – Latin for Games

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 24: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

The Pareto Principle:

80% of Results Come from 20% of the Production

Life is not Fair

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 25: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Has The Historical

Data Been Gamed?

25

“Men follow their sentiments and their self-

interest, but it pleases them to imagine that they follow reason.”

“Randomness has an additional layer of

uncertainty concerning the rules of the game in real life.”

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 26: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Most Common Omissions From Historical Data Ranked In Order Of Significance

Source: Capers Jones, Errors And Omissions In Software Historical Data: Separating Fact From Fiction, August 17, 2009 Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 27: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Typical Results Reviewing Customer Historical Data

Source: Capers Jones, Errors And Omissions In Software Historical Data: Separating Fact From Fiction, August 17, 2009 Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 28: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Incentives For Fraud Have Been Somewhat Mitigated

June 14, 2025

6/1 6/2 6/3 6/4 6/5 6/6 6/7Task Loc WO# Work Description Job Title Title # Sun Mon Tue Wed Thu Fri Sat

1 SD 32984 Coding Optical Scanner CSC#3 Programmer SW 6.00 7.00 7.00 5.00 8.00 33.002 SD 32984 Unit Testing CSC #3 Programmer SW 1.00 2.00 3.003 SD 32985 Bug Fixing CSC #3 Programmer SW 1.00 1.00 2.004 SD 400329 Design Support Programmer SW5 SD 400329 Requirements Review Programmer SW 1.00 1.00 2.00

8.00 8.00 8.00 8.00 8.00 40.00

Joe CoderEmployee Number

Total Regular Hours

Payroll Ending Date 1234674

REGULAR HOURS: Date:

Employee Name

OPERATIONS EMPLOYEE TIMESHEET

TotalWeek 1Regular

Joe – Please Move hours for task 1 across task 4 and 5.

The Boss

Fixed Price

Cost Plus

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 29: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

Overcoming Four Fallacies Made in Historical Data Selection

Narrative Fallacy Casual Analysis

29

Silent Evidence Ludic Fallacy

Deploy An Outside View

Deploy An Outside View

Bring All Your Data

Trust…But Verify!

Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019

Page 30: Pitfalls of Historical Data Use in Software Estimation · Employee Number Total Regular Hours Payroll Ending Date. 1234674. REGULAR HOURS: Date: Employee Name. OPERATIONS EMPLOYEE

THANK YOU

30

Galorath Incorporated

[email protected]

Thank For Your Time Today

All content within this document is the property ofGalorath Incorporated without exception. Do notshare, edit or repurpose without written consent. © Copyright Galorath Incorporated 2019Presented at the 5th Annual ICEAA Canada Workshop, September 17-18, 2019 – iceaa.ca/2019