#1 databeersbcn - xavier

20
1 Alea Iacta Est! Understanding historical dynamics using Monte Carlo simulations Xavier Rubio Campillo [email protected] @xrubiocampillo @DataBeersBCN

Upload: databeersbcn

Post on 17-Aug-2015

119 views

Category:

Data & Analytics


1 download

TRANSCRIPT

Page 1: #1 DataBeersBCN - Xavier

1

Alea Iacta Est!

Understanding historical dynamics using Monte Carlo simulations

Xavier Rubio Campillo

[email protected]

@xrubiocampillo

@DataBeersBCN

Page 2: #1 DataBeersBCN - Xavier
Page 3: #1 DataBeersBCN - Xavier

3

Understanding human culture

Gray, R. D.; Atkinson, Q.D. (2003) "Language-tree divergence times support the Anatolian theory of Indo-European origin." Nature 426.6965: 435-439

Page 4: #1 DataBeersBCN - Xavier

What data can we use?

Page 5: #1 DataBeersBCN - Xavier

A Data Science challenge

Page 6: #1 DataBeersBCN - Xavier

Pandora's box

Page 7: #1 DataBeersBCN - Xavier

We can do it!

Evolutionary biologists use data with similar problems

It's another type of historical research

We can use some of their tools

Page 8: #1 DataBeersBCN - Xavier

Siege warfare in XVIIIth century

Page 9: #1 DataBeersBCN - Xavier

Can we test this hypothesis?

"Vauban reversed the dominance of the trace italienne and overturned the pattern of long sieges of early centuries"

Ostwald, J. (2007). Vauban under siege. Engineering efficiency and Martial Vigor in the War of the Spanish Succession, Brill Academic.

Page 10: #1 DataBeersBCN - Xavier

“Data! Data! Data! I can't make bricks without clay!”

Page 11: #1 DataBeersBCN - Xavier

1. Formalise hypotheses

During the period 1702-1714...

H1 – The duration of sieges increased

H2 – The duration of sieges decreased

H3 – The uncertainty of sieges increased

H4 – The uncertainty of sieges decreased

Page 12: #1 DataBeersBCN - Xavier

2. Define a model

For each year between 1702-1714

Sample the number of sieges

For each siege sample its duration

Add to the mean/variance of duration some fixed value

Page 13: #1 DataBeersBCN - Xavier

3. Define the prior knowledge

Page 14: #1 DataBeersBCN - Xavier

4. The dice are cast!

Execute the model with randomly sampled values from your prior:

run 152112:

mean duration modifier = +1.5

variance duration modifier = -0.3

Page 15: #1 DataBeersBCN - Xavier

5. Approximate Bayesian Computation

Execute this algorithm for millions of runs

Store the parameters for the 1k runs with most similar output to historical data

Page 16: #1 DataBeersBCN - Xavier

Results

Page 17: #1 DataBeersBCN - Xavier

Find your posterior

Page 18: #1 DataBeersBCN - Xavier

Some remarks

Long-term perspectives are needed to understand cultural change

Data science is a revolution for understanding our past

Page 19: #1 DataBeersBCN - Xavier
Page 20: #1 DataBeersBCN - Xavier

All text and image content in this document is licensed under the Creative Commons Attribution-Share Alike 3.0 License (unless otherwise specified). "LibreOffice" and "The Document Foundation" are registered trademarks. Their respective logos and icons are subject to international copyright laws. The use of these therefore is subject to the trademark policy.

20

Moltes gràcies!

Xavier Rubio Campillo

[email protected]

@xrubiocampillo