automating estimation of warm-up length katy hoad, stewart robinson, ruth davies warwick business...

Automating estimation of warm-up length

Katy Hoad, Stewart Robinson, Ruth DaviesWarwick Business School

The AutoSimOA ProjectA 3 year, EPSRC funded project in collaboration with SIMUL8 Corporation.

http://www.wbs.ac.uk/go/autosimoa

Research Aim

• To create an automated system for dealing with the problem of initial bias, for implementation into simulation software.

• Target audience: non- (statistically) expert simulation users.

The Initial Bias Problem

• Model may not start in a “typical” state.

• Can cause initial bias in the output.

• Method used: Deletion of the initial transient data by specifying a warm-up period (or truncation point).

• How do you estimate the length of the warm-up period required?

• Literature search – 44 methods

• Short-listing of methods• Accuracy & robustness

• Ease of automation

• Generality

• Computer running time

• Preliminary Testing – 6 methods

• MSER-5 most accurate and robust method.

MSER-5 warm-up method

0 50 100 150 200 250 300 350 400

Truncation Point

MSER-5 test statistic

Rejection zone

Estimated warm-up period

Estimated truncation point, Lsol

Output data (batched means values)

Further Testing of MSER-5

1. Artificial data – controllable & comparable initial bias functions steady state functions

2. Full factorial design.

3. Set of performance criteria.

Parameters Levels

Data Type Single run

Data averaged over 5 reps

Error type N(1,1), Exp(1)

Auto-correlation

None, AR(1), AR(2), MA(2), AR(4), ARMA(5,5)

Bias Severity 1, 2, 4

Bias Length 0%, 10%, 40%, 100% (of n = 1000)

Bias direction Positive, Negative

Bias shape 7 shapes

1. Artificial Data Parameters

• Mean Shift:

• Linear:

• Quadratic:

• Exponential:

• Oscillating (decreasing):

Quadratic ExponentialLinear

Add Initial Bias to Steady state:

Superpostion: Bias Fn, a(t), added onto end of steady state function:

2. Full factorial design

3048 types of artificial data set

MSER-5 run with each type 100 times

i. Coverage of true mean.

ii. Closeness of estimated truncation point (Lsol) to true truncation point (L).

iii. Percentage bias removed by truncation.

iv. Analysis of the pattern & frequency of rejections of Lsol (i.e. Lsol > n/2).

3. Performance Criteria

MSER-5 Results

Does the true mean fall into the 95% CI for the estimated mean?

Non-truncated data sets

Truncated data sets

% of cases

yes yes 7.7%

no yes 72.5%

no no 19.8%

yes no 0%

i. Coverage of true mean.

0 20 40 60 80 100run

Lsol -

Quadratic bias Mean-shift bias

ii. Closeness of Lsol to L.

• Wide range of Lsol values.

(Positive bias functions, single run data, N(1,1) errors, MA(2) auto-correlation, bias severity value of 2 and true L = 100.)

iii. Percentage bias removed by truncation.

% bias removed

All valid runs

Effect of data parameters on bias removal

No significant effect: Error type Bias direction

Significant effect: Data type Auto-correlation

type Bias shape Bias severity Bias length

% of bias removed

s Single run

Averaged replications

More bias removed by using averaged replications rather than a single run.

% of bias removed

lative

s no a-c AR(1)

AR(2) AR(4)

MA(2) ARMA(5,5)

The stronger the auto-correlation, the less accurate the bias removal.

Effect greatly reduced by using averaged data.

% of bias removed

lative

mean-shift Linear

Quad Exp

OscL OscQ

The more sharply the initial bias declines, the more likely MSER-5 is to underestimate the warm-up period and to remove increasingly less bias.

% of bias removed

As the bias severity increases, MSER-5 removes an increasingly higher percentage of the bias.

% of bias removed

Longer bias removed slightly more efficiently than shorter bias.

Shorter bias - more overestimations - partly due to longer bias overestimations being more likely to be rejected.

<x≤2

<x≤4

<x≤6

<x≤8

<x≤1

x = no. of Lsol rejections

ARMA(5,5)

No auto-correlation

Rejections caused by: high auto-correlation, bias close to n/2, smooth end to data = ‘end point’ rejection.

Averaged data slightly increases probability of getting ‘end point’ rejection but increases probability of more accurate L estimates.

iv. Lsol rejections

1000 1100 1200 1300 1400 1500 1600 1700 1800n

+ meanshift

+ linear

+ quadratic

+ osclinear

+ oscquad

+ oscexp

Giving more data to MSER-5 in an iterative fashion produces a valid Lsol value where previously the Lsol value had been rejected.

e.g. ARMA(5,5)

Lsol values Percentage of cases

Lsol = 0 71%

Lsol ≤ 50 93%

Testing MSER-5 with data that has no initial bias.

Want Lsol = 0

Lsol > 50 mainly due to highest auto-correlated data sets - AR(1) & ARMA(5,5).

Rejected Lsol values: 5.6% of the 2400 Lsol values produced. 93% from the highest auto-correlated data ARMA(5,5).

Testing MSER-5 with data that has 100% bias.

Want 100% rejection rate: Actual rate = 61%

Bias shape

M1 M2 M4

Bias severity

Single data Averaged data

Summary

• MSER-5 most promising method for automation– Not model or data type specific. – No estimation of parameters needed. – Can function without user intervention. – Shown to perform robustly and effectively

for the majority of data sets tested. – Quick to run. – Fairly simple to understand.

Heuristic framework around MSER-5

Run k (= 5) replications of length, n ≥ 100

Create averaged

Batch data into b batches of length m, where number of

batches = bmn and n* =

b×m ≤ n

MSER-5 returns Lsol value

Produce more data to create

batches of no. orig of %10 or a user specified

number.

Dynamic graph of batched data; single reps, or

MSER-5 statistic

Graph of batched data; single reps,

or MSER-5 statistic with valid Lsol value shown.

Input data into MSER-5 algorithm.

Does User wish to keep running with more data? END

Lsol valid.

Lsol invalid.

Is Lsol ≤ (n* - (m × 5))/2

Have there been 10 invalid Lsol

values in a row?

Yes No

Does User wish to keep running with more data?

Produce more data to create

batches of no. orig of %10

Iterative procedure for procuring more data when required.

‘Failsafe’ mechanism - to deal with possibility of data not in steady state; insufficient data provided when highly auto-correlated.

Being implemented in SIMUL8.

ACKNOWLEDGMENTSThis work is part of the Automating Simulation Output

Analysis (AutoSimOA) project (http://www.wbs.ac.uk/go/autosimoa) that is funded by

the UK Engineering and Physical Sciences Research Council (EP/D033640/1). The work is being carried out in

collaboration with SIMUL8 Corporation, who are also providing sponsorship for the project.

Katy Hoad, Stewart Robinson, Ruth DaviesWarwick Business School

automating estimation of warm-up length katy hoad, stewart robinson, ruth davies warwick business...

Documents

epsrc calls template

ai & robotics @ epsrc...dr victoria mico, senior portfolio...

epsrc centre for predictive modelling in healthcare ›...

epsrc final report

1 all epsrc visits epsrc plans and priorities. 2 digital...

· pdf file95 360 ati daf -trucks daf-trucks ... scania ct...

epsrc mathematical sciences programme

tim hoad - creating value from your intangible assets - may...

epsrc thermal management of industrial processes

epsrc national centre for energy system integration (cesi)...

the autosimoa project

uksg conference 2015 - epsrc research data management...

autosimoa : a framework for automated analysis of simulation...

the autosimoa project katy hoad, stewart robinson, ruth...

vickers & hoad auctioneers phone 02 96997887 ... · vickers...

home improvements - home - epsrc website

current epsrc associate college members

introduction to eu and epsrc grants

epsrc delivery plan 2011-2015

epsrc strategic plan