mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf ·...

38
Mathematics, statistics, and political science Andrew Gelman Department of Statistics and Department of Political Science Columbia University 31 July 2011 Andrew Gelman Mathematics, statistics, and political science

Upload: hakhanh

Post on 04-Feb-2018

236 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Mathematics, statistics, and political science

Andrew GelmanDepartment of Statistics and Department of Political Science

Columbia University

31 July 2011

Andrew Gelman Mathematics, statistics, and political science

Page 2: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Themes

I Mathematical models in social science are cool . . .

I But they tend to give qualitative rather than quantitativepredictions

I Statistical modeling as an alternative

Andrew Gelman Mathematics, statistics, and political science

Page 3: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Some stupid stuff

Andrew Gelman Mathematics, statistics, and political science

Page 4: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

A silly network model

“Scientists at Rensselaer Polytechnic Institute have found thatwhen just 10 percent of the population holds an unshakable belief,their belief will always be adopted by the majority of the society.”

$16 million federal grant, published in Physical Review, featured inFreakonomics, . . .

Andrew Gelman Mathematics, statistics, and political science

Page 5: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

More silly math

“The performance of national governments declines with increasingmembership and undergoes a qualitative change in behavior at aparticular group size.”

Published by Santa Fe Institute and Journal of StatisticalMechanics, featured in New Scientist, Physics World, . . .

Andrew Gelman Mathematics, statistics, and political science

Page 6: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Next, some more interesting examples

1. Forecasting elections

2. Political representation

3. Trench warfare

4. Rational voting

5. Moderation and vote-getting

Andrew Gelman Mathematics, statistics, and political science

Page 7: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Part 1: forecasting elections

Andrew Gelman Mathematics, statistics, and political science

Page 8: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

“13 Keys to the White House”

I Forecasting system developed by a historian “in collaborationwith Volodia Keilis-Borok, a world-renowned authority on themathematics of prediction models”

I Correctly predicted every presidential election winner from1860 thorugh 2008.

I Forecast as of May 2011: “The president will securere-election in 2012. Only a historically unprecedented reversalof fortune could alter the verdict of the keys.”

I The problem: trying to make a discrete prediction of thewinner

I 1960, 1968, 1976, 2000 were essentially ties

Andrew Gelman Mathematics, statistics, and political science

Page 9: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Statistical forecasting

Forecasting elections from the economy

Carter vs. Reagan (1980) ●

McCain vs. Obama (2008) ●

Bush, Sr. vs. Clinton (1992) ●

Nixon vs. Kennedy (1960) ●

Clinton vs. Dole (1996) ●

Ford vs. Carter (1976) ●

Bush, Jr. vs. Kerry (2004) ●

Bush, Sr. vs. Dukakis (1988) ●

Gore vs. Bush, Jr. (2000) ●

Stevenson vs. Eisenhower (1952) ●

Eisenhower vs. Stevenson (1956) ●

Humphrey vs. Nixon (1968) ●

Nixon vs. McGovern (1972) ●

Reagan vs. Mondale (1984) ●

Johnson vs. Goldwater (1964) ●

45% 50% 55% 60%

Incumbent party's share of the popular voteIncome growth

more than 4%

3% to 4%

2% to 3%

1% to 2%

0% to 1%

negative

Above matchups are all listed as incumbent party's candidate vs. other party's candidate.Income growth is a weighted measure over the four years preceding the election. Vote share excludes third parties.

Andrew Gelman Mathematics, statistics, and political science

Page 10: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Part 2: political representation

Andrew Gelman Mathematics, statistics, and political science

Page 11: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

What does it mean to be “represented”?

I The U.S. is a representative democracy

I The right to vote; # representatives per voter

I Procedures vs. outcomes: what if 90% of the voters get theCongressmember whom they want?

I How close are actual elections?

Andrew Gelman Mathematics, statistics, and political science

Page 12: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Congressional elections in 1948 and 1988

0.0 0.2 0.4 0.6 0.8 1.0Democratic share of vote for Congress

U.S. Congressional districts in 1948

0.0 0.2 0.4 0.6 0.8 1.0Democratic share of vote for Congress

U.S. Congressional districts in 1988

Andrew Gelman Mathematics, statistics, and political science

Page 13: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Comparing to votes for President

0.0 0.2 0.4 0.6 0.8 1.0

010

3050

Dem share of Congressional vote in 1988

0.0 0.2 0.4 0.6 0.8 1.0

020

4060

Dem share of Congressional vote in 1948

0.0 0.2 0.4 0.6 0.8 1.0

020

4060

80

Dem share of Presidential vote in 1988

Andrew Gelman Mathematics, statistics, and political science

Page 14: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

What does it mean to be “represented”?

I Equal votes, satisfaction with outcomes, having your votepotentially matter

I Are your political views represented?

I Do your representatives look like you? Data from 1989:Proportion of

Proportion of seats in HouseU.S. population of Representatives

Catholic 0.28 0.27Methodist 0.04 0.14Jewish 0.02 0.07Black 0.12 0.09Female 0.51 0.06Under 25 0.37 0

Andrew Gelman Mathematics, statistics, and political science

Page 15: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Seats and votes in a legislature

I Proportional representation in Europe

I No proportional representation in U.S.

I Wasted votes

I Small changes in votes

I No way to mathematically derive the “best” system

I Paradox of voting power and decisive votes

I Paradox of voting for native Australians

Andrew Gelman Mathematics, statistics, and political science

Page 16: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Research problem: unequal representation across the world

I Small states overrepresented in U.S. Senate and electoralcollege

I Small states in U.S. get more than their share of gov’t funding

I Look at other countries: small states/provinces are generallyoverrepresented

I Small states/provinces get more than their share of funds

I Larger consequences?

Andrew Gelman Mathematics, statistics, and political science

Page 17: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Part 3: trench warfare

Andrew Gelman Mathematics, statistics, and political science

Page 18: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Trench warfare: the live-and-let-live system

I Front-line troops in World War I avoided fighting (historianTony Ashworth)

I Informal agreements across no-man’s-land

I How to understand this?

I Prisoner’s dilemma

Andrew Gelman Mathematics, statistics, and political science

Page 19: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Prisoner’s dilemma for trench warfare

I Payoffs in the “game” (political scientist Robert Axelrod)

I No motivation to cooperate in single-play game

I Cooperation in repeated-play game

I Cool mathematical model

Andrew Gelman Mathematics, statistics, and political science

Page 20: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Refuting the prisoner’s dilemma for trench warfare

I Look more carefully at payoffs

I No motivation to fight! Shooting poses a risk, whether or notthe other side shoots

I Commanders manipulate the “game” to get soldiers to fight

I Hidden assumption of conventional roles of soldiers onopposing sides

Andrew Gelman Mathematics, statistics, and political science

Page 21: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Why was the prisoner’s dilemma model appealing?

I “The evolution of cooperation”

I Using game theory to solve the “tragedy of the commons”

I Axelrod’s theory: politically liberal or conservative?

I “The norm of self-interest” (psychologist Dale Miller)

Andrew Gelman Mathematics, statistics, and political science

Page 22: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Toward the future

I How to defuse future conflicts?

I Axelrod’s logic: set up repeated-play structures to motivatelong-term cooperation

I Alternative strategy: set up immediate gains fromcooperations and watch out for outside agents who coulddisrupt the cooperation

Andrew Gelman Mathematics, statistics, and political science

Page 23: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Part 4: rational voting

Andrew Gelman Mathematics, statistics, and political science

Page 24: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Rational model for voting

I Utility of voting = pB − c :I p = probability that a single vote will be decisiveI B = net benefit from your candidate winningI c = met cost of voting (whether or not your candidate wins)

I Paradox of voting: p is very small, so even for large values ofB, there is no “instrumental” benefit to voting

I In presidential elections, p is usually less than 10−7

Andrew Gelman Mathematics, statistics, and political science

Page 25: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Possible explanations for voting

I Utility of voting = pB − cI “Benefit” of voting or “civic duty”

I Does not explain higher turnout in close elections and moreimportant elections

I Poor estimation of pI Estimation would have to be really poor for p to be large

enough

I Is voting irrational?

Andrew Gelman Mathematics, statistics, and political science

Page 26: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Voting to benefit others

I Utility of voting = pB − cI B = Bself + αNBsoc

I Bself = individual benefit from candidate A winningI Bsoc = (your perception of) avg. benefit of others from

candidate A winningI α (probably less than 1) discounts benefits to othersI N = number of persons affected by the election

I It can now be rational to vote!

I Decoupling rationality from selfishness

Andrew Gelman Mathematics, statistics, and political science

Page 27: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Example: a close election

I Each candidate expected to get between 47% and 53% ofvote

I Vote differential in range ±6%I Pr (your vote is decisive) ≈ 1/(0.12n)

I Suppose the selfish benefit to you is $10,000

I If n = 1 million, then expected selfish benefit is less than 10cents

I Now consider a “social voter”I Suppose n/N = 1/3 and suppose that the benefit to others (as

you perceive it) is $10 eachI The effect of your vote on their expected gain is

$10N/(0.12n) = $250I Voting is like making a $250 charitable contribution

Andrew Gelman Mathematics, statistics, and political science

Page 28: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Supporting evidence for the theory

I Small contributions to national campaigns

I Declining response rates in opinion polls

I Turnout is higher, not lower, in large elections

I Turnout is higher in close elections

I Strategic voting

I Voting on issues without direct instrumental benefits(abortion, All-Star game, Academy awards, . . . )

I Ask people why they vote

Andrew Gelman Mathematics, statistics, and political science

Page 29: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Empirical tests

I Are altruistic people more likely to vote?

I Is turnout higher in U.S. Senate elections in small states?

I Is turnout higher in NYC when there is heavy snow in Buffalo?

Andrew Gelman Mathematics, statistics, and political science

Page 30: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Part 5: candidate positioning

Andrew Gelman Mathematics, statistics, and political science

Page 31: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Candidate positioning

The “median voter theorem” (Hotelling, 1928):

Left-wing Right-wingD RM

Andrew Gelman Mathematics, statistics, and political science

Page 32: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Median voters and Newt Gingrich

I In the 1994 election, the Republicans gained about 50 seats inCongress

I The Democrats who lost were mostlymoderate-to-conservative

I The liberal Democratic congressmembers were reelected

I Democrats should be liberal and be proud?

Andrew Gelman Mathematics, statistics, and political science

Page 33: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Looking at the 1994 election more carefully

xx

x

x

x

x

x

xx

xx

xx

x

x

x

xxx

xx

x

xx

xx

x

x

x

x

x

x

x

x

x

x

x

x x

xx

xx x

xx

x x

x x

xx

x

x

x

x

xxx

x

x

x

x

x

xx

x

xx

x

x

x x

x

xx

xxxx

x

x

x

x

x

xx

x

x

x

xx

x

x

x

x

x

xx

x

x

x

x

xx

xx

x

xx

x

x

x

x x

x

x

x

x

x

x

x

x

x

xx x

x

x

x

x x

xxx

x

x

−1.0 −0.8 −0.6 −0.4 −0.2 0.00.

40.

60.

8

Democrats running for reelection in 1994

(liberal) ideology (moderate)

vote

sha

re in

199

4

x

x

x

x

xx

x

xx

xx

xx x

x

x

xxx

x

xx

xx x

x

x

x

x

x

x

xx

x

x

x

xx

x

xx

x

x

x

x

x

xxx

x

x

xxx

x

x

x

x

x

x

x

x

x

x

xx

xx

xx

x

x

x

x

x

x

xx

x

xx

x

x

x

xx

x

x

xx

xx

x

x

x

x

x

xx

x

x

x

x

x

xx

x

x

xx

x

xx

xx

x

x

x

x

x

xx

x

x

xx

xx

xx

x

x

xx

xxx

−1.0 −0.8 −0.6 −0.4 −0.2 0.0−0.1

50.

000.

10

(liberal) ideology (moderate)vote

in 1

994

com

pare

d to

exp

ecte

d

o

oo o

oo

o

o o

o

o

o

o

oo

o

o

oooo

o

o

o

ooo

o

oo

o

oo

o

o

o

oo

o

o

oo

o

o

o

oo ooo

o

o

o

o

o

o

o

o ooo

o

o

o

o

o

o

o

o

o

oo

o

o

o

o

o o

o

o

o

o

oo

o

o

o

o

o

o

0.0 0.2 0.4 0.6 0.8 1.0

0.55

0.65

0.75

Republicans running for reelection in 1994

(moderate) ideology (conservative)

vote

sha

re in

199

4

o

o

oo

ooo

o

o

o

o

o

oo

o

oo

o

o oo

o

o

o

o

oo

o

o

o

o

oo o

oo

oo

o

o

o

o

o

o

o

oo ooo

o

o

o

o

o

o

o

oo

oo

o

o

o

o

o

o

o

oo

oo

o

o

ooo o

o

oo

o

oo

o

o

oo

oo

0.0 0.2 0.4 0.6 0.8 1.0

−0.1

5−0

.05

0.05

(moderate) ideology (conservative)vote

in 1

994

com

pare

d to

exp

ecte

dAndrew Gelman Mathematics, statistics, and political science

Page 34: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Congressmembers’ ideologies and median voters

0.1 0.2 0.3 0.4 0.5 0.6 0.7−1.

0−

0.5

0.0

0.5

1.0

normal vote in district

(libe

ral)

id

eolo

gy s

core

(co

nser

vativ

e)

xx

x

x

x

x

x

x

x

xx

x

x

x

x

xx

x

x

xx

x

xx

x

x

x

x

x

x

x

x

xx

x

x

x

x

x

x

x

x

x

x

x

x

xx

x

x

x

x

xx

xx

x

x

x

xx

xx

x

x

x

x

xx

x

x

x

x

x

x

xx

x

x

xx

x

x

x

x

x

x

xx

x

x

xx

x

x

x

x

x

x

x

xx

x

xx

xxx

xx

xxx

x

xx x

x

x

x

x x

x

x x

x

x

x

x

x

x

x

x xx

xxx

x

x

x

x

x

x x

x x

x

x

x

x

x

x

xx

xx

x

x

x

x

x

xx

x

x

x

x

xx

x

xx

xx

x

x

xx

x

x

x

x

xx

xx

x

x

x

x

x

x

x

x

x

x

xx

x

o

o

o

o

o

o

oo

o

o

o

o oo

o

oo

o

o

o

o

o

o

o

o

oo

o

o

o

oo

oo

ooo

oo

o

o

oo

o

o

o

o

o

oo

oo

o

oo

o

o

o

o

o

o o

o

o

o

o o

o

o

o

o

o

o

o

o

o

o

o

oo

o

o

oo

o

o

o oo

o

o

o

o

o

o

o

o

o

o

oo

o

o

oo

o

o

o

o

o

o

o

oo

o

o

oo

o

o o

o

o

o

o

o

o

o

o

o

oo

o

o

oo

o

Andrew Gelman Mathematics, statistics, and political science

Page 35: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Estimating the electoral benefits of moderation

I Look at districts where Congressmembers are running forreelection

I Predict their vote share given their “ideology score”I Also control for Presidential vote in previous election

I Noisy estimate in any particular year, so plot estimates overtime

I Also look at Nixon, Clinton impeachments

Andrew Gelman Mathematics, statistics, and political science

Page 36: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Estimated effects of moderation for reelection vote

Andrew Gelman Mathematics, statistics, and political science

Page 37: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Return to the median voter theorem

I Is the median voter theorem “true”?

I No, and yes . . .

I Systematic differences between Democrats and Republicans,even in comparable districts

I Moderation is worth about 2% of the vote: some motivationto be in the median, but not a lot

I Bush’s gamble in 2001–2004 (and Truman’s in 1945–1948):how does ideology map to policy?

Andrew Gelman Mathematics, statistics, and political science

Page 38: Mathematics, statistics, and political sciencegelman/presentations/mathstatpshandout.pdf · Mathematics, statistics, and political science Andrew Gelman Department of Statistics and

Summary: mathematics and statistics in political science

I 5 examples where mathematical models gave intiutivelyappealing conclusions:

I You can predict the election based on a few simple factorsI Proportional representation is fairI Cooperation is a good strategy in the repeated prisoner’s

dilemmaI Voting is irrational (unless you find it intrinsically enjoyable)I Politicians want to be at the median

I Each theory had big holes

I Each theory’s predictions were essentially qualitative

I Statistical models take the next step

I Similar ideas hold in psychology, sociology, economics, . . .

Andrew Gelman Mathematics, statistics, and political science