decision theory and bayesian statistics. tests and problem solving petter mostad 2005.11.21
TRANSCRIPT
Decision theory and Bayesian statistics. Tests and problem solving
Petter Mostad
2005.11.21
Overview
• Statistical desicion theory
• Bayesian theory and research in health economics
• Review of tests we have learned about
• From problem to statistical test
Statistical decision theory
• Statistics in this course often focus on estimating parameters and testing hypotheses.
• The real issue is often how to choose between actions, so that the outcome is likely to be as good as possible, in situations with uncertainty
• In such situations, the interpretation of probability as describing uncertain knowledge (i.e., Bayesian probability) is central.
Decision theory: Setup
• The unknown future is classified into H possible states: s1, s2, …, sH.
• We can choose one of K actions: a1, a2, …, aK. • For each combination of action i and state j, we
get a ”payoff” (or opposite: ”loss”) Mij. • To get the (simple) theory to work, all ”payoffs”
must be measured on the same (monetary) scale. • We would like to choose an action so to maximize
the payoff. • Each state si has an associated probability pi.
Desicion theory: Concepts
• If action a1 never can give a worse payoff, but may give a better payoff, than action a2, then a1 dominates a2.
• a2 is then inadmissible
• The maximin criterion
• The minimax regret criterion
• The expected monetary value criterion
Example
No birdflu outbreak
Small birdflu outbreak
Birdflu pandemic
No extra precautions
0 -500 -100000
Some extra precautions
-1 -100 -10000
Vaccination of whole pop.
-1000 -1000 -1000
states
actions
Decision trees
• Contains node (square junction) for each choice of action
• Contains node (circular junction) for each selection of states
• Generally contains several layers of choices and outcomes
• Can be used to illustrate decision theoretic computations
• Computations go from bottom to top of tree
Updating probabilities by aquired information
• To improve the predictions about the true states of the future, new information may be aquired, and used to update the probabilities, using Bayes theorem.
• If the resulting posterior probabilities give a different optimal action than the prior probabilities, then the value of that particular information equals the change in the expected monetary value
• But what is the expected value of new information, before we get it?
Example: Birdflu
• Prior probabilities: P(none)=95%, P(some)=4.5%, P(pandemic)=0.5%.
• Assume the probabilities are based on whether the virus has a low or high mutation rate.
• A scientific study can update the probabilities of the virus mutation rate.
• As a result, the probabilities for no birdflu, some birdflu, or a pandemic, are updated to posterior probabilities: We might get, for example:
( | _ ) 80%
( | _ ) 15%
( . | _ ) 5%
P none high mutation
P some high mutation
P pand high mutation
( | _ ) 99%
( | _ ) 0.9%
( . | _ ) 0.1%
P none low mutation
P some low mutation
P pand low mutation
Expected value of perfect information
• If we know the true (or future) state of nature, it is easy to choose optimal action, it will give a certain payoff
• For each state, find the difference between this payoff and the payoff under the action found using the expected value criterion
• The expectation of this difference, under the prior probabilities, is the expected value of perfect information
Expected value of sample information
• What is the expected value of obtaining updated probabilities using a sample? – Find the probability for each possible sample
– For each possible sample, find the posterior probabilities for the states, the optimal action, and the difference in payoff compared to original optimal action
– Find the expectation of this difference, using the probabilities of obtaining the different samples.
Utility
• When all outcomes are measured in monetary value, computations like those above are easy to implement and use
• Central problem: Translating all ”values” to the same scale
• In health economics: How do we translate different health outcomes, and different costs, to same scale?
• General concept: Utility• Utility may be non-linear function of money value
Risk and (health) insurance
• When utility is rising slower than monetary value, we talk about risk aversion
• When utility is rising faster than monetary value, we talk about risk preference
• If you buy any insurance policy, you should expect to lose money in the long run
• But the negative utility of, say, an accident, more than outweigh the small negative utility of a policy payment.
Desicion theory and Bayesian theory in health economics research
• As health economics is often about making optimal desicions under uncertainty, decision theory is increasingly used.
• The central problem is to translate both costs and health results to the same scale: – All health results are translated into ”quality
adjusted life years”– The ”price” for one ”quality adjusted life year”
is a parameter called ”willingness to pay”.
Curves for probability of cost effectiveness given willingness to pay
• One widely used way of presenting a cost-effectiveness analysis is through the Cost-Effectiveness Acceptability Curve (CEAC)
• Introduced by van Hout et al (1994).
• For each value of the threshold willingness to pay λ, the CEAC plots the probability that one treatment is more cost-effective than another.
Review of tests
• Below is a listing of most of the statistical tests encountered in Newbold.
• It gives a grouping of the tests by application area
• For details, consult the book or previous notes!
One group of normally distributed observations
Goal of test: Test statistic: Distribution:
Testing mean of normal distribution,
variance known
standard normal:
Testing mean of normal distribution, variance unknown
t-fordelingen, n-1
frihetsgrader:
Testing variance of normal population
Chi-kvadrat, n-1 frihetsgrader
0
/
X
n
(0,1)N
0
/x
X
s n
2
20
( 1) xn s
21n
1nt
Comparing two groups of observations: matched pairs
Assuming normal distributions, unknown variance: Compare means
Sign test: Compare only which observations are largest
S = the number of pairs with positive difference. Large samples
(n>20):
Wilcoxon signed rank test: Compare ranks and signs of differences
T=min(T+,T-);
T+ / T- are sum of positive/negative ranks
Wilcoxon signed rank statistic
0
/D
D D
s n
(D1, …, Dn differences)
1nt
( ,0.5)Bin n
(0,1)N
Large samples: * 0.5
0.5
S n
n
Comparing two groups of observations: unmatched data
Diff. between pop. means: Known variances
Standard normal
Diff. between pop. means: Unknown but equal variances
Diff. between pop. means: Unknown and unequal variances
Testing equality of variances for two normal populations
Assuming identical translated distributions: test equal means: Mann Whitney U test
Based on sum of ranks of obs. from one group; all obs. ranked together
Standard normal (n>10)
22
0( ) / yx
x yn nX Y D (0,1)N
2 2
0( ) / p p
x y
s s
n nX Y D 2x yn nt
22
0( ) / yx
x y
ssn nX Y D t
see book for d.f.
2 2/x ys s 1, 1x yn nF
(0,1)N
Comparing more than two groups of data
One-way ANOVA: Testing if all groups are equal (norm.)
Kruskal-Wallis test: Testing if all groups are equal
Based on sums of ranks for each group; all obs. ranked together
Two-way ANOVA: Testing if all groups are equal, when you also have blocking
Two-way ANOVA with interaction: Testing if groups and blocking variable interact
/( 1)
/( )
SSG K
SSW n K
1,K n KF
21K
/( 1)
/(( 1)( 1))
SSG K
SSE K H
1,( 1)( 1)K K HF
/(( 1)( 1))
/( ( 1))
SSI K H
SSE HK L
( 1)( 1), ( 1)K H HK LF
Studying population proportions
Test of population proportion in one group (large samples)
Standard normal
Comparing the population proportions in two groups (large samples)
Standard normal
0
0 0(1 ) /
p
n
(0,1)N
0 0 0 0(1 ) (1 )x y
x y
p p
p p p pn n
(p0 common estimate)
(0,1)N
Regression tests
Test of regression slope: Is it ?
Test on partial regression coefficient: Is it ?
Test on sets of partial regression coefficients: Can they all be set to zero (i.e., removed)?
1
1 *
b
b
s
2nt
*
*
j
j
b
b
s
* 1n Kt
2
( ( ) ) /
e
SSE r SSE r
s
, 1r n K rF
Model tests
Contingency table test: Test if there is an association between the two attributes in a contingency table
Goodness-of-fit test: Counts in K categories, compared to expected counts, under H0
Tests for normality:
•Bowman-Shelton
•Kolmogorov-Smirnov
2
1 1
( )r cij ij
i j ij
O E
E
2
( 1)( 1)r c
2
1
( )Ki i
i i
O E
E
21K
* *
Tests for correlation
Test for zero population correlation (normal distribution)
Test for zero correlation (nonparametric): Spearman rank correlation
Compute ranks of x-values, and of y-values, and compute correlation of these ranks
Special
distribution
2
2
1
r n
r
2nt
Tests for autocorrelation
The Durbin-Watson test (based on normal assumption) testing for autocorrelation in regression data
Special distribution
The runs test (nonparametric), testing for randomness in time
Counting the number of ”runs” above and below the median in the time series
Special distribution, or standard normal
for large samples
21
2
2
1
( )n
t ti
n
ti
e e
e
(0,1)N
From problem to choice of method
• Example: You have the grades of a class of studends from this years statistics course, and from last years statistics course. How to analyze?
• You have measured the blood pressure, working habits, eating habits, and exercise level for 200 middleaged men. How to analyze?
From problem to choice of method
• Example: You have asked 100 married women how long they have been married, and how happy they are (on a specific scale) with their marriage. How to analyze?
• Example: You have data for how satisfied (on some scale) 50 patients are with their primary health care, from each of 5 regions of Norway. How to analyze?