1/54: topic 5.1 – modeling stated preference data microeconometric modeling william greene stern...

54
1/54: Topic 5.1 – Modeling Stated Preference Data Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA 5.1 Modeling Stated Preference Data

Upload: terence-joseph

Post on 19-Dec-2015

227 views

Category:

Documents


1 download

TRANSCRIPT

1/54: Topic 5.1 – Modeling Stated Preference Data

Microeconometric Modeling

William GreeneStern School of BusinessNew York UniversityNew York NY USA

5.1 Modeling Stated Preference Data

2/54: Topic 5.1 – Modeling Stated Preference Data

Concepts

• Revealed Preference• Stated Preference• Attribute Nonattendance• Random Utility• Attribute Space• Experimental Design• Choice Experiment• Environmental Attitude

Models

• Multinomial Logit Model• Latent Class MNL• Nested Logit• Mixed Logit• Error Components Logit

3/54: Topic 5.1 – Modeling Stated Preference Data

Revealed Preference Data

Advantage: Actual observations on actual behavior Market (ex-post, e.g.,

supermarket scanner data)

Individual observations

Disadvantage: Limited range of choice sets and attributes – does not allow analysis of switching behavior.

4/54: Topic 5.1 – Modeling Stated Preference Data

Stated Preference Data

Purely hypothetical – does the subject take it seriously?

No necessary anchor to real market situations

Vast heterogeneity across individualsE.g., contingent valuation

5/54: Topic 5.1 – Modeling Stated Preference Data

Strategy

Repeated choice situations to explore the attribute space

Typically combined RP/SP constructions Mixed data Expanded choice sets

Accommodating “panel data” Multinomial Probit [marginal, impractical] Latent Class Mixed Logit

6/54: Topic 5.1 – Modeling Stated Preference Data

Application: Shoe Brand Choice

Simulated Data: Stated Choice, 400 respondents, 8 choice situations, 3,200 observations

3 choice/attributes + NONE Fashion = High / Low Quality = High / Low Price = 25/50/75,100 coded 1,2,3,4

Heterogeneity: Sex (Male=1), Age (<25, 25-39, 40+)

Underlying data generated by a 3 class latent class process (100, 200, 100 in classes)

7/54: Topic 5.1 – Modeling Stated Preference Data

Stated Choice Experiment: Unlabeled Alternatives, One Observation

t=1

t=2

t=3

t=4

t=5

t=6

t=7

t=8

8/54: Topic 5.1 – Modeling Stated Preference Data

Application: Pregnancy Care Guidelines

9/54: Topic 5.1 – Modeling Stated Preference Data

Application: Travel Mode

Survey sample of 2,688 trips, 2 or 4 choices per situationSample consists of 672 individualsChoice based sample

Revealed/Stated choice experiment: Revealed: Drive,ShortRail,Bus,Train Hypothetical: Drive,ShortRail,Bus,Train,LightRail,ExpressBus

Attributes: Cost –Fuel or fare Transit time Parking cost Access and Egress time

10/54: Topic 5.1 – Modeling Stated Preference Data

Customers’ Choice of Energy Supplier

California, Stated Preference Survey 361 customers presented with 8-12 choice

situations each Supplier attributes:

Fixed price: cents per kWh Length of contract Local utility Well-known company Time-of-day rates (11¢ in day, 5¢ at night) Seasonal rates (10¢ in summer, 8¢ in winter, 6¢ in

spring/fall)

11/54: Topic 5.1 – Modeling Stated Preference Data

Combining RP and SP Data Sets - 1 Enrich the attribute set by replicating choices E.g.:

RP: Bus,Car,Train (actual) SP: Bus(1),Car(1),Train(1)

Bus(2),Car(2),Train(2),… How to combine?

12/54: Topic 5.1 – Modeling Stated Preference Data

Each person makes four choices from a choice set that includes either two or four alternatives.

The first choice is the RP between two of the RP alternatives

The second-fourth are the SP among four of the six SP alternatives.

There are ten alternatives in total.

13/54: Topic 5.1 – Modeling Stated Preference Data

An Underlying Random Utility Model

14/54: Topic 5.1 – Modeling Stated Preference Data

Nested Logit Approach

Car Train Bus SPCar SPTrain SPBus

RP

Mode

Use a two level nested model, and constrain three SP IV parameters to be equal.

15/54: Topic 5.1 – Modeling Stated Preference Data

Enriched Data Set – Vehicle Choice

Choosing between Conventional, Electric and LPG/CNG Vehicles in Single-Vehicle Households

David A. Hensher William H. Greene Institute of Transport Studies Department of Economics School of Business Stern School of Business The University of Sydney New York University NSW 2006 Australia New York USA

Conventional, Electric, Alternative 1,400 Sydney Households Automobile choice survey RP + 3 SP fuel classes Nested logit – 2 level approach – to handle the scaling

issue

16/54: Topic 5.1 – Modeling Stated Preference Data

Attribute Space: Conventional

17/54: Topic 5.1 – Modeling Stated Preference Data

Attribute Space: Electric

18/54: Topic 5.1 – Modeling Stated Preference Data

Attribute Space: Alternative

19/54: Topic 5.1 – Modeling Stated Preference Data

Experimental Design

20/54: Topic 5.1 – Modeling Stated Preference Data

Survey sample of 2,688 trips, 2 or 4 choices per situationSample consists of 672 individualsChoice based sample

Revealed/Stated choice experiment:Revealed: Drive, ShortRail, Bus, TrainHypothetical: Drive, ShortRail, Bus, Train, LightRail, ExpressBus

Attributes:Cost –Fuel or fareTransit timeParking costAccess and Egress time

21/54: Topic 5.1 – Modeling Stated Preference Data

Mixed Logit Approaches Pivot SP choices around an RP outcome. Scaling is handled directly in the model Continuity across choice situations is handled by

random elements of the choice structure that are constant through time Preference weights – coefficients Scaling parameters

Variances of random parameters Overall scaling of utility functions

22/54: Topic 5.1 – Modeling Stated Preference Data

Each person makes four choices from a choice set that includes either 2 or 4 alternatives.

The first choice is the RP between two of the 4 RP alternatives

The second-fourth are the SP among four of the 6 SP alternatives.

There are 10 alternatives in total.

A Stated Choice Experiment with Variable Choice Sets

23/54: Topic 5.1 – Modeling Stated Preference Data

Experimental Design

24/54: Topic 5.1 – Modeling Stated Preference Data

Willingness to Pay for Green Energy

25/54: Topic 5.1 – Modeling Stated Preference Data

26/54: Topic 5.1 – Modeling Stated Preference Data

27/54: Topic 5.1 – Modeling Stated Preference Data

28/54: Topic 5.1 – Modeling Stated Preference Data

29/54: Topic 5.1 – Modeling Stated Preference Data

30/54: Topic 5.1 – Modeling Stated Preference Data

31/54: Topic 5.1 – Modeling Stated Preference Data

Stated Choice Experiment: TravelMode by Sydney Commuters

32/54: Topic 5.1 – Modeling Stated Preference Data

Would You Use a New Mode?

33/54: Topic 5.1 – Modeling Stated Preference Data

Value of Travel Time Saved

34/54: Topic 5.1 – Modeling Stated Preference Data

Hybrid Choice Models

Incorporates latent variables in choice model

Extends development of discrete choice model to incorporate other aspects of preference structure of the chooser

Develops endogeneity of the preference structure.

35/54: Topic 5.1 – Modeling Stated Preference Data

Endogeneity

"Recent Progress on Endogeneity in Choice Modeling" with Jordan Louviere & Kenneth Train & Moshe Ben-Akiva & Chandra Bhat & David Brownstone & Trudy Cameron & Richard Carson & J. Deshazo & Denzil Fiebig & William Greene & David Hensher & Donald Waldman, 2005. Marketing Letters Springer, vol. 16(3), pages 255-265, December.

Narrow view: U(i,j) = b’x(i,j) + (i,j), x(i,j) correlated with (i,j)(Berry, Levinsohn, Pakes, brand choice for cars, endogenous price attribute.) Implications for estimators that assume it is.

Broader view: Sounds like heterogeneity. Preference structure: RUM vs. RRM Heterogeneity in choice strategy – e.g., omitted attribute models Heterogeneity in taste parameters: location and scaling Heterogeneity in functional form: Possibly nonlinear utility functions

36/54: Topic 5.1 – Modeling Stated Preference Data

Heterogeneity

Narrow view: Random variation in marginal utilities and scale RPM, LCM Scaling model Generalized Mixed model

Broader view: Heterogeneity in preference weights RPM and LCM with exogenous variables Scaling models with exogenous variables in

variances Looks like hierarchical models

37/54: Topic 5.1 – Modeling Stated Preference Data

The MNL Model

j ij

J(i)

j ijj=1

exp(α + ' )P[choice j | i] =

exp(α + ' )

β x

β x

38/54: Topic 5.1 – Modeling Stated Preference Data

Observable Heterogeneity in

Preference Weights

ij j i ij j i ijU =α + + +εβx z

i i

i,k k k i

Hierarchical model - Interaction terms

= +

Parameter heterogeneity is observable.

Each parameter β = β +

Prob[choi

β β Φh

φ h

i

j

j ij i

J

j ij ij=1

exp(α + + )ce j | i] =

exp(α + + )

i j

i

β x z

β x z

39/54: Topic 5.1 – Modeling Stated Preference Data

Observable Heterogeneity in Utility Levels

ij j i ij j i ij

j

U =α + + +εβx z

j ij i

J(i)

j ij ij=1

exp(α + + )Prob[choice j | i] =

exp(α + + )

jβ'x z

β'x z

Choice, e.g., among brands of cars

xitj = attributes: price, features

zi = observable characteristics: age, sex, income

40/54: Topic 5.1 – Modeling Stated Preference Data

‘Quantifiable’ Heterogeneity in Scaling

ij j i ij j i ijU =α + + +εβx z

2 2 2ij j i 1

Var[ε ] = σ exp( ), σ = π / 6jδ w

wi = observable characteristics: age, sex, income, etc.

41/54: Topic 5.1 – Modeling Stated Preference Data

Unobserved Heterogeneity in Scaling

i

ij ij i ij

i i

i iji J

i ijj=1

HEV formulation: U = +(1/ σ )ε

Generalized mixed model with = 1 and = [0].

Produces a scaled multinomial logit model with β =σβ

exp( )Prob(choice = j) = ,i=1,...,N, j=1,

exp( )

βx

Γ

βx

βx

i

2i i

2i i i

...,J

The random variation in the scaling is

σ =exp(-τ / 2+τw )

The variation across individuals may also be observed, so that

σ =exp(-τ / 2+τw + )δz

42/54: Topic 5.1 – Modeling Stated Preference Data

A Helpful Way to View Hybrid Choice Models

Adding attitude variables to the choice model

In some formulations, it makes them look like random parameter models

43/54: Topic 5.1 – Modeling Stated Preference Data

Unbservable Heterogeneity in Utility Levels and Other Preference

Indicators

i i

ij j i ij j i ij

Multinomial Choice Model

U =α + + +ε

Indicators (Measurement) Model(s)

Outco

βx z

t

i

j ij i

J (i)

j ij ij=1

exp(α + + )Prob[choice j | i] =

exp(α + + )

j

z b w

β'x z

β'x z

= ( , )m immes f vim iy z

44/54: Topic 5.1 – Modeling Stated Preference Data

45/54: Topic 5.1 – Modeling Stated Preference Data

46/54: Topic 5.1 – Modeling Stated Preference Data

47/54: Topic 5.1 – Modeling Stated Preference Data

*1 1 1 1

*2 2 2 2

*3 3 3 3

*1 1 1 1

*2 2 1 1

* *3 3 1 2 1

*4 4 2 2

*5 5 2 2

*6 6 3 3

*7 7 3 3

( , )

( , )

( , , )

( , )

( , )

( , )

( , )

z u

z u

z u

y g z

y g z

y g z z

y g z

y g z

y g z

y g z

h

h

h

Observed Latent Observed x z* y

48/54: Topic 5.1 – Modeling Stated Preference Data

MIMIC ModelMultiple Causes and Multiple

Indicators

X z* Y1 1 1

2 2 2* *+w z ... ... ...

M M M

y

yz

y

β x

49/54: Topic 5.1 – Modeling Stated Preference Data

should be kl ikx

Note. Alternative i, Individual j.

50/54: Topic 5.1 – Modeling Stated Preference Data

*

*

U =

ij k ik kl ik jl ijk l k

k ik kl jl ik ijk k l

k ik kl jl ik ijk k l

k ik kj ik ijk k

k kj ik ijk

x x

x x

x x

x x

x

This is a mixed logit model. The interesting extension is the source of the individual heterogeneity in the random parameters.

51/54: Topic 5.1 – Modeling Stated Preference Data

“Integrated Model”

Incorporate attitude measures in preference structure

52/54: Topic 5.1 – Modeling Stated Preference Data

53/54: Topic 5.1 – Modeling Stated Preference Data

54/54: Topic 5.1 – Modeling Stated Preference Data

Swait, J., “A Structural Equation Model of Latent Segmentation and Product Choice for Cross Sectional Revealed Preference Choice Data,” Journal of Retailing and Consumer Services, 1994

Bahamonde-Birke and Ortuzar, J., “On the Variabiity of Hybrid Discrete Choice Models, Transportmetrica, 2012

Vij, A. and J. Walker, “Preference Endogeneity in Discrete Choice Models,” TRB, 2013

Sener, I., M. Pendalaya, R., C. Bhat, “Accommodating Spatial Correlation Across Choice Alternatives in Discrete Choice Models: An Application to Modeling Residential Location Choice Behavior,” Journal of Transport Geography, 2011

Palma, D., Ortuzar, J., G. Casaubon, L. Rizzi, Agosin, E., “Measuring Consumer Preferences Using Hybrid Discrete Choice Models,” 2013

Daly, A., Hess, S., Patruni, B., Potoglu, D., Rohr, C., “Using Ordered Attitudinal Indicators in a Latent Variable Choice Model: A Study of the Impact of Security on Rail Travel Behavior”