estimating the costs of standardization: evidence from the ...estimating the costs of...

71
Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui * October 4, 2020 Abstract This paper studies the decentralized adoption of a technology standard when network effects are present. If the new standard is incompatible with the current installed base, adoption may be inefficiently delayed. I quantify the magnitude of “excess inertia” in the switch of the movie distribution and exhibition industries from 35mm film to digital. I specify and estimate a dynamic game of digital hardware adoption by theaters and digital movies supply by distributors. Counterfactual simulations establish that excess inertia reduces surplus by 16% relative to the first-best adop- tion path; network externalities explain 41% of the surplus loss. Keywords: dynamic games, network effects, technology adoption, movie industry JEL Classification: C73, L15, O33, L82 * University of Toronto. Email: [email protected]. I am grateful for the advice and support of John Asker, Simon Board, Martin Hackmann, and Brett Hollenbeck. I thank Heski Bar-Isaac, Allan Collard-Wexler, Ruben Gaetani, Ricard Gil, Renato Giroldo, Hugo Hopenhayn, Keyoung Lee, Matthew Mitchell, Volker Nocke, Rustin Partow, Ksenia Shakhgildyan, and seminar participants at Pennsylvania State, Utah, Duke Fuqua, Michigan, Wharton BEPP, Science Po, Paris School of Economics, IESE Barcelona, Toronto, and CESifo for their helpful comments and suggestions. I thank Michael Karagosian, Jean Mizrahi, David Hancock, Olivier Hillaire, and Frederick Lanoy for answering my questions about the digitalization of the U.S. and French movie industries. All errors remain my own.

Upload: others

Post on 24-Nov-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Estimating the Costs of Standardization:

Evidence from the Movie Industry

El Hadi Caoui∗

October 4, 2020

Abstract

This paper studies the decentralized adoption of a technology standard when networkeffects are present. If the new standard is incompatible with the current installedbase, adoption may be inefficiently delayed. I quantify the magnitude of “excessinertia” in the switch of the movie distribution and exhibition industries from 35mmfilm to digital. I specify and estimate a dynamic game of digital hardware adoptionby theaters and digital movies supply by distributors. Counterfactual simulationsestablish that excess inertia reduces surplus by 16% relative to the first-best adop-tion path; network externalities explain 41% of the surplus loss.

Keywords: dynamic games, network effects, technology adoption, movie industryJEL Classification: C73, L15, O33, L82

∗University of Toronto. Email: [email protected]. I am grateful for the adviceand support of John Asker, Simon Board, Martin Hackmann, and Brett Hollenbeck. I thank HeskiBar-Isaac, Allan Collard-Wexler, Ruben Gaetani, Ricard Gil, Renato Giroldo, Hugo Hopenhayn,Keyoung Lee, Matthew Mitchell, Volker Nocke, Rustin Partow, Ksenia Shakhgildyan, and seminarparticipants at Pennsylvania State, Utah, Duke Fuqua, Michigan, Wharton BEPP, Science Po,Paris School of Economics, IESE Barcelona, Toronto, and CESifo for their helpful comments andsuggestions. I thank Michael Karagosian, Jean Mizrahi, David Hancock, Olivier Hillaire, andFrederick Lanoy for answering my questions about the digitalization of the U.S. and French movieindustries. All errors remain my own.

Page 2: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

1 Introduction

Technology standards play a central role in networked industries.1 When the value of a

technology increases with the number of users, firms have an incentive to coordinate on

a single technology, the standard, to exploit the benefits of a larger network. However,

once coordinated on a standard, the industry may become reluctant to switch to new and

superior technologies if they are incompatible with the installed base. Can standardization

prevent the efficient adoption of new technologies? While the theoretical literature (Farrell

and Saloner (1985, 1986)) has shown that inefficient delays (i.e., excess inertia) can arise,

attempts to empirically assess it remain scarce. This paper studies the diffusion of a new

technology standard in the movie industry and estimates the magnitude of excess inertia.

Two sources of inefficiency can delay the adoption of the new technology. First, network

effects may give rise to adoption externalities: the marginal adopter benefits other adopters

(and may hurt non-adopters) so that the private costs and benefits do not reflect the social

ones.2 Second, incomplete information about other firms’ willingness to adopt induces coor-

dination failure: firms are unwilling to risk switching without being followed, which creates

delays in adoption.3

As an application, I study the conversion of movie distribution and exhibition from

the 35mm film standard to digital cinema between 2005 and 2013 in France, the largest

European market by number of screens and theaters. Digital cinema consists of distributing

motion pictures to theaters over digital supports (internet or hard drives) as opposed to the

historical use of 35mm film reels. To screen digital movies, theaters must equip their screens

with digital video projectors instead of film projectors.

This technological switch is well suited for assessing excess inertia in standard adop-

tion. Indeed, the movie distribution-exhibition industries constitute a hardware-software

system with (indirect) network effects (Katz and Shapiro (1985)). Because film and digi-

tal are incompatible technologies, adoption of digital projectors—the hardware—by theaters

is contingent on the availability and variety of digital movies—the software—supplied by

distributors. Conversely, software variety depends on the hardware installed base.

Market forces may not have provided sufficient incentives for an efficient switch from

1Examples include AC in electric power distribution, FedACH for wire transfers in banking, USB for datatransmission, MP3 and 3.5mm jack for music distribution, NTSC for color TV transmission, and 4G LTEin broadband networks for mobile devices.

2When network effects are direct, adoption externalities always arise. When network effects are indirect,such as in hardware-software systems, Church, Gandal, and Krauske (2008) show that software variety maygive rise to (technological) externalities between hardware adopters.

3Another potential source of inefficiency, not considered in this paper, is market power in technologyprovision.

1

Page 3: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

35mm film to digital. On the exhibition side, the adoption of digital projectors by a the-

ater raises distributors’ incentive to release movies in digital. This increased availability of

digital movies is a positive externality on other theaters equipped with digital projectors,

which is not internalized by the adopter.4 On the distribution side, uncertainty about other

distributors’ willingness to release movies in digital can lead to coordination failure, with

some distributors inefficiently delaying their digital conversion.5 Indeed, it is optimal for a

distributor to release a given movie in digital only if sufficiently many theaters are equipped

to screen it. The fraction of equipped theaters will be high if sufficiently many distributors

are releasing digital movies. This chicken-and-egg problem can make a unilateral switch to

digital suboptimal.6

The paper leverages novel datasets on adoption of digital projectors at the theater-screen

level, theater characteristics, digital conversion costs, digital movie availability, and average

distribution costs (printing, shipping and storage cost per movie print) under the film and

digital technologies.

To test whether the diffusion of digital was inefficiently delayed, I specify a dynamic

structural model of the movie exhibition and distribution sectors. Theaters’ technology-

adoption choices are modeled as a dynamic game played at the level of the French movie

industry, allowing for rich theater and market heterogeneity (e.g., type of programming, chain

affiliation, market size, number of rival’s screens, etc.). Every period, theaters choose the

number of screens to equip with digital projectors, given the adoption cost and availability

of digital movies. In turn, the availability of digital movies depends on the installed base of

digitally equipped screens in the industry.

Because network effects are at the industry level, with hundreds of theaters adopting, this

framework generates a particularly high-dimensional state space. To alleviate the computa-

tional burden, I define a nonstationary oblivious equilibrium (Weintraub, Benkard, Jeziorski,

and Van Roy (2008)) of the game exploiting the fact that a single theater is non-atomic and

plays against the expected distribution of the industry state. The model is estimated using a

conditional choice probability-based method (Hotz and Miller (1993), Hotz, Miller, Sanders,

and Smith (1994)) combined with matrix inversion to obtain choice-specific value functions

(Aguirregabiria and Mira (2007), Pesendorfer and Schmidt-Dengler (2008)). The estimation

approach exploits differences in adoption behavior across theaters (e.g., differences in adop-

4A negative externality on non-adopters (due to decreased availability of film movies) is possible. But inthe case of digital cinema, multi-homing (i.e., dual distribution on film and digital) blunts this externality.

5Distributors’ willingness to switch depends on costs that vary widely across heterogeneous distributors(large U.S. studios vs. small French distributor) and are private information.

6In general, these two sources of inefficiency may arise both for upstream distributors and downstreamtheaters. The focus on externalities among downstream theaters is motivated by the data and the institu-tional framework.

2

Page 4: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

tion times, units of new technology acquired, and adoption costs) to estimate how exogenous

theaters and market characteristics affect the single-period profits from equipping a movie

screen with digital technology.

Using the estimated model, the paper simulates two counterfactuals. First, in the plan-

ner’s benchmark, I solve for the adoption path chosen by a planner maximizing aggregate

theater profits, taking as given upstream distributors’ equilibrium reaction function (fraction

of movies released in digital given the installed base of digital screens). In this counterfactual,

adoption externalities across theaters are internalized, leading to a steeper adoption path.

Differences in theaters’ surplus between the market outcome and the planner’s benchmark

are attributed to adoption externalities across theaters (downstream excess inertia). Second,

in the coordination benchmark, I assume that the planner mandates coordination on digital

distribution upstream starting in 2005 and maximizes aggregate theater profits. Differences

in theater’ surplus between the coordination and the planner’s benchmarks are attributed

to upstream excess inertia.7

The adoption path under the equilibrium market outcome is delayed compared to the

coordination and planner’s benchmarks: by 0.8 to 3 years for the time to 10% adoption, and

0.3 to 0.5 years for the time to 90% adoption. Additionally, inefficient delays in adoption

reduce surplus by 16% relative to the first-best path (coordination benchmark). Adoption

externalities across theaters explain 41% of the surplus loss. From a policy perspective, the

results indicate that coordinating distributors to make their movies available in digital would

be more effective than coordinating theater adoption.

The methodology and empirical findings are relevant to study standardization in other

hardware-software industries with many firms. Recent examples include: the switch from

USB-A/B to USB-C in the consumer electronics industry, where computer and smartphone

(hardware) manufacturers’ adoption depends on the availability of USB-C compatible soft-

ware (i.e., peripherals and accessories such as monitors, webcams, audio system, controllers,

TVs, keyboards, mouses, and printers); or the switch from chip card to contactless technology

in the debit/credit card market, where issuance of contactless cards by financial institutions

(hardware side) is predicated on merchants (software side) acceptance and adoption of con-

tactless point-of-sale terminals.

The rest of the paper is organized as follows. The next section reviews the literature and

highlights the main points of departure from it. Section 3 presents the movie distribution

and movie exhibition industries, describes the technology and highlights the specificities of

the French market. Section 4 describes the data and gives preliminary descriptive statistics.

7In this sense, upstream excess inertia is obtained as a residual and can arise because of coordinationfailure and adoption externalities across distributors.

3

Page 5: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Section 5 develops the dynamic structural model of technology adoption. Section 6 shows the

identification and estimation of the industry model. Section 7 presents the counterfactual

analysis. Section 8 concludes.

Tables and figures are located after the main text.

2 Related Literature

This paper is related to the literature studying the emergence of technology standards, and

in particular, the benefits of standard-setting organizations (SSOs) relative to decentralized

markets. Recent work modelling delays in consensus standard setting includes Farrell and

Simcoe (2012) and Simcoe (2012). Rysman and Simcoe (2008) show that despite the lack of

SSOs’ enforcement power, they can impact subsequent adoption of promoted technologies.

This paper differs from this literature by shifting the focus from delays occurring during

an SSO’s deliberation process to the subsequent delays in the decentralized diffusion of the

standard elected by the SSO. Given that reaching consensus on the technical specifications

of a standard does not guarantee immediate adoption, this paper complements this literature

by emphasizing delays in post-consensus adoption.8

Previous empirical work on technology adoption under network effects has primarily

focused on the identification and estimation of network effects. Identification is, in general,

not straightforward due to the reflection problem (Manski (1993)). Rysman (2019) discusses

this issue in the case of network effects and reviews the approaches taken to address it. In

the context of direct network effects, recent contributions use regional or individual-specific

exogenous shifters of network size to identify network effects: Gowrisankaran and Stavins

(2004) study ACH adoption by banks, Tucker (2008) studies video-messaging adoption by a

bank’s employees, Goolsbee and Klenow (1999) study adoption of home computers.

For technologies with indirect network effects, the literature relies on instruments that

shift adoption on one side of the market (e.g., software market) to identify the degree of

indirect network effects on the other side of the market (e.g., hardware market).9 Recent

examples include: video games platforms (Clements and Ohashi (2005), Corts and Leder-

man (2009), Dube, Hitsch, and Chintagunta (2010)), compact disks titles-players (Gandal,

Kende, and Rob (2000)), DVD titles-players (Karaca-Mandic (2003), Gowrisankaran, Park,

and Rysman (2014)). In addition to the endogeneity problem, the latter paper highlights

8In the case of digital cinema, deliberation by the Society of Motion Picture and Television Engineerstook place between 2000 and 2004. I focus on the diffusion of the standard elected, between 2005 and 2014.

9This type of network effects have been formally modeled by Chou and Shy (1990), and further developedby Church and Gandal (1992). Recent theoretical contributions include Markovich (2008), and Markovichand Moenius (2013). For a review of the literature on network effects, see Farrell and Klemperer (2007).

4

Page 6: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

other econometric issues arising with time series data: hierarchical variation and spurious

correlation. Because the data structure in this paper is likely to give rise to the same econo-

metric issues, I build on their approach in the estimation section.

A few papers within this literature go beyond the estimation of network effects and discuss

welfare implications. Ohashi (2003) studies the war between VHS and Betamax in the VCR

market and quantifies the value of compatibility between the two technologies. Rysman

(2004) studies the Yellow Pages market and analyzes the trade-off between market power

and internalization of network effects. Augereau, Greenstein, and Rysman (2006) discuss the

potential role of ISPs differentiation in the initial failure to coordinate on a standard for 56K

modems. Ackerberg and Gowrisankaran (2006) examine the welfare implications of customer

and bank subsidies in ACH adoption. Ryan and Tucker (2012) study video-calling adoption

within a multi-national firm and simulate counterfactual diffusion paths under alternative

network seeding policies. Lee (2013) studies the welfare impact of exclusivity contracts

between software providers and hardware platforms in the video game industry. The present

paper contributes to this literature by investigating whether equilibrium standard adoption

in network industries would differ from the social optimum, and if so, by identifying the role

of adoption externalities in the surplus loss.

This paper makes a contribution to the literature using dynamic games to study innova-

tion and technology adoption. The recent research on this topic builds dynamic structural

models and simulates the effect of competition on innovation (e.g., Goettler and Gordon

(2011), Igami (2017), Igami and Uetake (2017)) or technology adoption (Schmidt-Dengler

(2006), Macher, Miller, and Osborne (2017)). This paper contributes to this literature in

two ways. First, I apply the dynamic oligopoly framework to tackle a novel question: i.e.,

the efficiency of the technology diffusion path in a network industry. Recent advances in

the estimation of dynamic games allow high-dimensional strategic interdependence (adop-

tion externalities) between firms at the industry level. Second, this paper highlights the

importance of measuring and modeling adoption within the firm (e.g., at the unit of capital

level). Indeed, multi-homing (i.e, the simultaneous use of two technologies across a firm’s

capital stock) is a common feature of a wide array of industries.10

This paper contributes to the empirical literature studying the movie industry. This liter-

ature has considered many facets of the industry: the effect of vertical integration (Gil (2009),

Andrew Hanssen (2010)), seasonality (Einav (2007)), release dates (Einav (2010)), strate-

gic entry and exit, and spatial retail competition (Davis (2006a), Davis (2006b), Takahashi

10Examples from the intra-firm technology adoption literature include: Mansfield (1963), Nabseth and Ray(1974), Romeo (1975), Levin, Levin, and Meisel (1992), Fuentelsaz, Gomez, and Polo (2003), and Battistiand Stoneman (2005).

5

Page 7: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

(2015), Gil, Houde, and Takahashi (2015)). Recent contributions have studied digitization

in the movie industry. Waldfogel (2016) studies the effect of digital movie production, alter-

native distribution channels (streaming), and online film criticism on new product releases.

Rao and Hartmann (2015) study the quality-variety trade-off in screening brought about

by digital projection. Yang, Anderson, and Gordon (2019) evaluate the impact of digital

projection on product variety and supply concentration (i.e., number of screens on which

the top movie is shown). Whereas the last two papers analyze how theater-level screening

choices change post-digital adoption, I focus on theaters’ dynamic decision to adopt, which

is primarily driven by labor costs savings and digital software availability.

3 Industry Background

This section describes the movie-distribution and movie-exhibition industries before and

after the advent of digital technology. It presents the costs and benefits of digital cinema

from the perspective of distributors and exhibitors and discusses the effect of digital cinema

on movie ticket prices and quality. Finally, this section highlights the specificities of the

French distribution and exhibition markets and important stylized facts.

3.1 From 35mm film to digital

For most of the 20th century, movies reached viewers after going through a series of specified

steps in a vertically structured industry. After the movie is shot and edited, distributors

print the movie onto 35mm film reels and ship the reels to movie theaters. At the theater, a

projectionist arranges the reels so they can be fed to a film projector. When the movie’s run

is over, the print is broken back down into shipping reels and either sent to the next theater

or returned to the distributor.

On January 19, 2000, the Society of Motion Picture and Television Engineers, in the U.S.,

initiated the first standards group dedicated to developing digital cinema. The technology

would entail (1) movie distribution on a digital support (via the internet or hard drives),

instead of the historical uses of film reels and (2) movie projection via digital projection

hardware instead of the film-projection technology.

To screen a digital movie, theaters must equip their screens with digital projectors. Four

manufacturers supply digital cinema projectors worldwide: Sony, Barco, Christie, and NEC.

The average list price of a digital projector (in 2010 euros) was e88, 000 in 2005, e50, 000 in

2010, and e40, 000 by 2012. In addition to the digital projector, a digital cinema requires a

dedicated computer, the “server.” A digital movie is supplied to the theater as a digital file

6

Page 8: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

called a Digital Cinema Package (DCP). The DCP is copied onto the internal hard drives of

the server, usually via a USB port.

3.1.1 Supply of digital movies by distributors

Digital distribution of movies drastically cuts printing and shipping costs for movie distrib-

utors. The cost of an 80-minute feature film print is on average between US$1,500 and

$2,500. By contrast, a feature-length movie can be stored on an off-the-shelf 300 GB hard

drive for $50.11 In addition, hard drives can be returned to distributors for reuse. With

several hundred movies distributed every year, the distribution industry saves over $1 billion

annually.

Importantly, the distribution format (digital or film print) is independent of the support

on which the movies is shot (i.e., with a film or digital camera). Indeed, editing and post-

production have been done digitally since the mid-1990s, thanks to the spread of computers

and video editing software—regardless of the shooting format. Before the switch to digital

distribution, all movie were edited digitally and subsequently printed on 35mm.12

3.1.2 Adoption of digital projectors by exhibitors

Digital projection allows exhibitors to cut down on operating costs. Screening film prints

is a technical task, requiring mechanical skills that are increasingly rare and costly.13 By

contrast, digital projection automates all the tasks that were previously performed by the

projectionist. Untrained staff can easily compose a playlist and launch a projection as on

a regular computer. Digital projection also opens up the possibility of using theaters for

“alternative content” such as pop concerts, opera broadcasts, and sports events.

A consequence of the “automated” nature of digital projection is that uncertainty and learn-

ing, which can be important determinants of technology diffusion, are not central to theaters’

adoption decisions.

3.1.3 Multi-homing by movie distributors and theaters

Multi-homing in movie distribution consists of the distribution of a given movie on both film

and digital supports. According to industry professionals, multi-homing was widespread over

the diffusion period studied in this paper (2005–2014). Importantly, the decision to multi-

home involves a trade-off between economies of scale and cost reduction. On the one hand,

11The latter figure of $50 does not include the price of encryption-key generation, transportation, andstorage, which add approximately $200–$300. For French/European movies, these figures are around e950for film print distribution and around e350 for digital print distribution, including shipping, encryption-keygeneration, and storing.

12Although a small fraction of movies is still shot on film, digital cameras have been the main medium forshooting since the early 2000s.

13Film projectionists are commonly represented by powerful unions. See the interview in l’Obs 07/14/2010(in French) “Frederic, projectionniste chez UGC pour 1800 euros par mois” The collective bargaining agree-ments set the minimum monthly salary to e1, 500 over the period of interest.

7

Page 9: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

there are economies of scale in printing, shipping, and storing movie prints (whether in film

or digital). By multi-homing, distributors split their production over two cost functions (film

and digital) and forgo some of these economies of scale. On the other hand, there are cost

reductions from digital relative to film distribution, which induce distributors to multi-home.

Multi-homing in movie exhibition refers to equipping a given screen with both a digital

and film projector. This type of multi-homing was rare for practical reasons (limited space

in screening booth, heavy and sensitive projection equipment), and because theaters laid off

their projectionists following the adoption of digital projection.

3.1.4 The Virtual Print Fee system

A large fraction of the cost savings from digital cinema is realized by distributors. For

this reason, theaters have been reluctant to switch without a cost-sharing arrangement with

movie distributors. An agreement was reached with the Virtual Print Fee (VPF) system.

Under this system, the distributor pays a fee per digital movie to help finance the digital

hardware acquired by the theater. The VPF contract would typically cover 50% of the

hardware adoption cost; the rest has to be paid for by the exhibitor.

3.1.5 Impact on ticket prices and movie quality; and the role of 3D

Excluding 3D movies, the film-digital quality differential was small enough not to warrant

any impact on ticket prices.14 Although 3D movies, and in particular Avatar (released in

the winter 2009, grossing $2.7 billion worldwide), were initially a major selling point for

digital projection, exhibitors quickly realized it was not expanding the audience as promised

(Bordwell (2013)). The vast majority of movies released over the diffusion period were in

2D.

3.1.6 Welfare implications

Although the paper focuses on producer surplus, digital cinema is expected to benefit con-

sumers by reducing the cost of bringing movies to the market. A first consequence of such

reduction in costs are wider releases with increased access for theaters located in small and

rural markets. Second, cost reductions in movie making will lead to new product entry.15

14As noted by Davis (2006a), theaters’ ability to set ticket prices is constrained by distributors’ incentives.Because the conversion to digital distribution affected mainly the cost of a movie print, which is a fixed costfrom the point of view of movie tickets, it did not significantly impact ticket pricing. However, by decreasingdistribution costs, it increased competition for screens between distributors. The latter may have had apositive impact on theaters’ share in box office revenue.

15Aguiar and Waldfogel (2018) show that if product quality is unpredictable at the time of investment(as is typically the case with cultural products such as movies), new product entry can have large welfarebenefits.

8

Page 10: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

3.2 The French distribution and exhibition market

3.2.1 The French exhibition industry

The French exhibition industry is fragmented, with a large fraction of small theaters: half

of theaters are mono-screen, and an additional 15% are two-screen theaters. The largest

theater chains by share of total screens in 2014 (end of the diffusion of digital cinema) are

Gaumont-Pathe (13.6% of screens), CGR (7.8%), and UGC (7.5%). These three chains make

up 50.1% of total box office revenue (Kopp (2016)). In the early phase of the diffusion period

(in 2007), these shares were: Gaumont-Pathe (12.1% of screens), CGR (7.1%), and UGC

(7.0%). Market shares were relatively stable over this period. The French exhibition industry

experienced small entry and exit rates over the diffusion period (around 1.5% per year). As a

result, the majority of digital projectors acquired were replacing old film projectors, enabling

the analysis of the industry’s choice between the old and new technology standard.

3.2.2 The French distribution industry

Between 2005 and 2014, U.S. movies had an average 47% market share (of total box-office

revenue in France), French movies had a 39% market share, and European and other nation-

alities made up 14% of the box office.16 Importantly, U.S. studios distribute their movies

in France via national subsidiaries (e.g., Universal France or Warner Bros. France). Sub-

sidiaries tailor their advertising and distribution campaigns to the national market they

operate in. Therefore, the support—film or digital—over which U.S. movies are distributed

in France depends in part on the installed bases of film and digital projectors in France.

3.2.3 The VPF and government subsidies

The VPF system was initially the result of bilateral negotiations between distributors and

exhibitors. In September 2010, a law was passed making VPF contributions mandatory:

any distributor willing to distribute digital copies of its movie must pay a fixed fee to the

theater booking the digital copy. The VPF contributions would go toward covering 50% of

the digital projector cost, the rest being paid by the exhibitor.

Government and regional subsidies to small theaters were another important feature

of the hardware-acquisition process in France. Many small “continuation” theaters, which

receive movies only two or three weeks after their national release, did not generate enough

VPF to be able to acquire the digital-projection hardware. The government, along with

the regions, stepped in to help these theaters finance their digital conversion. These aids

were allocated to theaters that owned fewer than three screens and were not part of a chain

controlling 50 screens or more.

16Calculations based on the CNC 2014 annual report.

9

Page 11: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

3.2.4 Art house theaters

French theaters can acquire the “art house” label if they screen a minimum share of inde-

pendent and art house movies. This share depends on the theater location (e.g., small rural

area, large city). The label, awarded every year, entitles the theater to government financial

support (in the form of a lump-sum subsidy).

A priori, art house theaters may differ in their adoption behavior from commercial (i.e.,

non-art house) theaters if there are significant differences in operating profits.

4 Data and Descriptive Statistics

This section describes the data and presents descriptive statistics. The data contain infor-

mation on theaters’ digital-adoption decisions, theater characteristics, adoption costs, and

availability of digital movies over time.

The main dataset is a panel describing digital adoptions by theaters. This dataset was

collected from two sources: the European Cinema Yearbooks published by Media Salles, and

an online database maintained by Cinego, a private digital platform.17 Both sources are

public and provide snapshots of the digital-exhibition industry at different periods spanning

June 2005 through March 2013, in France. Thirteen dates are obtained from the Cinema

Yearbooks, and 5 dates from the Cinego database. At each of the 18 observation dates, the

number of digital projectors acquired is known for every active theater. The observation

dates and sources are detailed in Appendix A. Figure 1a represents the 18 observation dates

along with the industry share of screens equipped with digital projection. The panel is

aperiodic (starting in 2008) and stops before the diffusion is complete in 2014. Five periods

are dropped to ensure a relative periodicity in the sample (6 months). Details about this

procedure can be found in Appendix A.

Three auxiliary datasets complement the main adoption panel dataset. The first is ob-

tained from the French National Center of Cinematography (CNC hereafter). The CNC

dataset provides a rich set of information on theaters’ characteristics, local market size, and

the share of movies available in digital, between 2005 and 2015. More precisely, this annual

dataset contains: (1) lists of all active theaters, (2) the number of screens, the number of

seats, the address, the owner’s identity (theater chain, individual), and art house status for

each active theater, (3) market population (categorical) at the urban/rural unit level (de-

fined below), and (4) the annual share of movies released in digital (distributed partially or

17Raw data available at: http://www.mediasalles.it/yearbook.htm andhttps://cinego.net/basedessalles (via the Internet Archive)

10

Page 12: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

entirely in digital) in France.18

The second auxiliary dataset, obtained from the European Audiovisual Observatory, pro-

vides time-series information on digital-projector acquisition costs.19 Namely, the time-series

for the hardware adoption cost is constructed by adding (1) the price of a digital projector

(net of VPF contributions) to (2) ancillary costs. The time-series for digital-projector prices

is based on a survey of projector manufacturers. Actual prices paid by specific theaters

are not public due to nondisclosure agreements between theaters and manufacturers. This

time-series is taken as representative of the “list” price (or manufacturer’s suggested retail

price MSRP) of digital projectors. The analysis accounts for the VPF subsidies, which cover

50% of the projector price.20 Ancillary costs include the price of other equipment (the server

and the digital sound processor), Theater Management software, and labor costs (installa-

tion). Estimates of ancillary costs were collected by the European Audiovisual Observatory,

but are only available for 2010. In the analysis, these ancillary costs are assumed to have

stayed constant over the sample period. This assumption seems reasonable for labor costs.

According to the Observatory, price declines for the server and digital sound processor are

more limited than for the digital projector. The hardware-adoption cost is adjusted to 2010

constant euros.21 The hardware adoption cost is interpolated to obtain estimates at the 13

observation dates. Figure 1b shows the time series for this variable.

Third, data on the number of movies released in digital in the U.S. between 2005 and

2015 is obtained from the Internet Movie Database (IMDb). For each movie release, the

website reports technical specifications such as the type of camera used, aspect ratio and,

importantly, the “printed film format” over which the movie was distributed (e.g., 35mm,

digital). This information is collected for all U.S. movie releases over the period of interest.

The analysis is conducted on the data after the following preparation. Itinerant theaters,

which account for 5% of active theaters, are dropped. Because the focus is on firms’ decision

to convert existing capital from film to digital, theaters that enter during the diffusion period

already equipped with digital projectors are excluded from the model. Their contribution to

the overall installed base of digital screens is, however, accounted for and taken as exogenous.

Firms exiting before conversion to digital are also excluded.22 Rates of entry and exit are,

18This share is constructed as the ratio of the number of digital releases (distributed partially or entirelyin digital) and the total number of releases in France in a given year.

19See ”The European Digital Cinema Report - Understanding digital cinema roll-out” (Council of Europe,2012)

20Although the law mandating VPF subsidies was only enacted in September 2010, it was retroactive.Moreover, anecdotal evidence indicates that pre-2010, the majority of projectors were purchased under VPFagreements. The model will assume digital projector-purchases before 2010 benefited from VPF subsidies.

21The GDP Implicit Price Deflator for France is used.22The inability to convert is, however, not a significant cause for exit, because the CNC and regional

governments subsidized digital adoption for the smallest and less financially sound theaters.

11

Page 13: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

however, low (between 1 and 1.5% of firms enter or exit every year). Theaters in French

overseas territories are excluded. The final sample includes 1, 671 theaters, located in 1, 169

markets (urban or rural units, defined below), and observed over 13 dates between June 2005

and April 2012. The sample covers 87% of all non-itinerant theaters located in Metropolitan

France, which were active in 2005 or entered before 2008 equipped with the old technology.

Table 1 shows a description of the variables used in the analysis.

Local market definition and competitors

Local market demand and competition are defined based on the urban or rural unit in which

the theater is located. An urban unit is defined by the INSEE, the French National Statistics

Office, for the measurement of contiguously built-up areas. It is a “commune” alone or a

grouping of communes forming a single unbroken spread of urban development, with no

distance between habitations greater than 200 meters, and a total population greater than

2,000 inhabitants. Communes not belonging to an urban unit are considered rural.23 In

2010, Metropolitan France contained 2, 243 urban units and about 33, 700 rural units.

For the largest cities (Paris, Lyon, Marseille), the urban unit division is not appropriate,

because the resulting local markets are unreasonably large. In these cases, the relevant

market within each city is the “arrondissement” (equivalent to zipcode in the U.S.).24 In

the rest of the paper, a theater’s competition is measured using the number of competing

screens in the same local market.

Descriptive statistics

The analysis focuses on theaters with at least four screens, due to the prevalence of gov-

ernment and regional subsidies for theaters with three screens or fewer.25 Table 2 and 3

report cross-sectional summary statistics, and highlight the market and firm heterogeneity

captured by the data.

Table 2 shows summary statistics for the 399 theaters with at least four screens. A

significant fraction of these theaters, 33%, are art house theaters. The average theater has

eight screens. Thirty-five percent of theaters are part of the three largest theaters chains:

Gaumont-Pathe, CGR, and UGC. In total, 53.4% of theaters are miniplexes (4-7 screens),

and 46.6% are multiplexes/megaplexes (8 screens or more).

23Communes correspond to civil townships and incorporated municipalities in the U.S.24The subdivision by arrondissement is arbitrary, given that theaters are engaged in spatial competition.

As a robustness check, a theater’s competition can also be measured using distance bands around the theaterlocation. With a distance band of 5 miles, this alternative definition gives similar values for the variable“number of competitors’ screens.”

25These subsidies covered part or all of a theater’s adoption costs. They were allocated according toa government-determined (or regional) timeline. Therefore, subsidized theaters’ adoption behavior doesnot stem from an optimization problem and is not informative about underlying benefits from adoption.Although the model does not formally include subsidized firms, their adoption decisions contribute to theinstalled base of digital projectors but assumed to be exogenous in the model.

12

Page 14: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Table 3 reports summary statistics by market size. Paris and its suburbs are controlled

for separately because attendance rates are significantly higher in the capital compared to

national averages.26 As expected, the stock of screens grows with the market size. A larger

fraction of theaters is art house in rural areas because the CNC’s threshold requirements

to qualify are lower for relatively less dense areas. Theater size increases on average with

market size (except for Paris, where the scarcity of space limits theater size).

Preliminary analysis of the data uncovers two important points. First, theaters tend

to gradually roll-out digital technology over their stock of screens. Figure 2a shows the

number of new digital screens equipped per year. Figure 2b decomposes this number into:

(1) screens installed by new adopters (theaters with no digital screens in t − 1), and (2)

screens installed by theaters with some digital screens by t − 1. (1) is informative about

the degree of adoption at the extensive margin, whereas (2) is informative about the degree

of adoption at the intensive margin (within-theater). Starting in 2008, a large fraction of

screens converted to digital per year belong to theaters that have already adopted at least

one digital screen in previous periods, highlighting the importance of the intensive margin.27

Gradual adoption is consistent with decreasing marginal returns from converting an ad-

ditional screen. If digital movie availability is limited, one would expect theaters to convert

only a fraction of their screen in a given period. To closely match this feature of the data in

the theoretical section, adoption decisions will be modeled at the screen-level. In Appendix

B, I provide reduced-form evidence that the gradual adoption of digital screens is partly

driven by increasing availability of digital movies.

Second, adoption by a theater’s competitors does not significantly affect its likelihood

of adoption. Table 4 shows the result of an ordered probit model in which the fraction of

screens converted by a theater during the past 6 months is regressed on firm and market

characteristics. A polynomial in time is included to capture aggregate time shocks (e.g.,

adoption cost, availability of digital movies, etc.). Once I control for market characteristics

that may create correlation between firms in the same local market, adoption by a theater’s

competitors does not significantly affect its adoption decision. The likelihood-ratio test of

specification (3) against a specification omitting competitors’ digital screens adoption fails

to reject the null that the coefficient on competitors’ adoptions is zero at the 5% confidence

26Moviegoers in Paris visit theaters on average 12 times a year, compared to a national average of 4 − 5times over the diffusion period.

27An alternative way of measuring the contribution of the intensive margin is to decompose the samplevariance in adoption times across all screens in the industry into within-theater and between-theater vari-ances. The former sample variance is 1.28 (corresponding to a standard deviation of 1.13 years), the latteris 1.02 (or a standard deviation of 1.01 years). Therefore, within-theater variance in adoption times explainsabout 56% of total variance across all screens in the industry. This likely underestimates the contributionof the intra-firm margin because the panel stops before the end of the diffusion.

13

Page 15: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

level. To reflect this feature of the data, the industry model will include strategic interactions

only at the industry level (via network effects) but not at the local market level.28

5 Industry Model

This section presents the dynamic structural model. The model will be subsequently used

to guide the estimation and recovery of theaters’ operating profits under the film and digital

technologies. These profits are required to quantify, via simulation of counterfactuals, the

amount of surplus lost due to excess inertia.

Theaters’ technology adoption choices are modeled as a dynamic game played at the level

of the French movie industry. Digital projectors are durables, so the model must incorporate

the fact that theaters can delay their adoption to a future date to benefit from lower prices

and greater availability of complementary goods (i.e., digital movies).

The central part of the model specifies how theaters make their technology adoption

decisions—at the screen-level—as a function of their firm and market-level characteristics,

the adoption cost, and the availability of technology-specific complementary goods (film or

digital movies). Theaters have an incentive to convert to digital projection due to cost-

reductions (primarily labor cost savings). Although the analysis focuses on the exhibition

sector, the model captures via a reduced-form distributors’ per-period decision regarding

on which support to distribute movies (film and/or digital), given the technology-specific

installed bases (screens equipped with film/digital projectors).

Finally, an equilibrium of the distribution-exhibition industries is specified. I start by

characterizing a Markov Perfect equilibrium of the game. Next, and in anticipation of the

estimation and computation sections, I impose simplifying assumptions to deal with the

high-dimensionality of the state space. These assumptions are shown to be consistent with

the (nonstationary) oblivious equilibrium concept of Weintraub, Benkard, Jeziorski, and Van

Roy (2008).

All vectors are denoted in bold.

28The lack of strategic interactions can be explained as follows. Theaters compete on prices and productofferings. Adoption of digital would affect strategic interactions between firms if it translates into a costadvantage for the adopter or allows the adopter to differentiate its product offerings. For most of the diffusionperiod, movies were multi-homed (i.e., distributed on both film and digital) so adopting digital would notgive rise to a differentiation advantage. Moreover, anecdotal evidence indicate that theaters did not reduceticket prices after switching to digital, potentially because distributors prevent theaters from lowering ticketprices (Davis (2006b) presents evidence of this).

14

Page 16: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

5.1 Adoption of digital projectors by theaters

Time is discrete and infinite, t = 1, 2, . . . ,∞. A period corresponds to six months.

Firms: A firm is a movie theater. There are N firms indexed by i ∈ N = {1, ..., N}. This

set is fixed throughout the game: no entry and exit occur.

Firm types: Denote by τ the vector representing a theater’s type, which is fixed throughout

the game. A firm’s type includes the theater size (number of screens), local market char-

acteristics (market size and number of competitors’ screens), art house label, and a chain

identifier (if the theater is horizontally integrated).29 Denote by T the set of possible types.

For type τ ∈ T , let Nτ and Sτ denote the total number and size of theaters of type τ

(satisfying∑τ∈T Nτ = N). The type of theater i is denoted τ (i).

Firm state space: Firm heterogeneity is reflected through firm states. In period t, the

state of theater i ∈ N, publicly observed by all firms, is the number of screens converted to

digital by t, denoted sit ∈ Sτ (i) ≡ {0, 1, ..., Sτ (i)}. The remaining Sτ (i) − sit screens operate

using the film technology.

Let xit = (τ (i), sit). The variables xit and (τ (i), sit) are used interchangeably. The

industry state is the vector of all players’ public state variables in time t and is denoted by

xt = ((τ (1), s1t), . . . , (τ (N), sNt))

Denote by S =∑

i∈N Sτ (i) the total number of screens in the industry, and st =∑

i∈N sit

the total number of digital screens in the industry in period t.

Theaters part of the same chain are assumed to make their adoption decisions indepen-

dently. This assumption is imposed to keep the estimation computationally tractable.30 I

discuss its implications in more length in section 5.3.

Transition dynamics: A theater can increase its number of digital screens, sit, by paying

an adoption cost. If firm i converts ait screens to digital in period t, the firm transitions to

a state si,t+1 given by

si,t+1 = sit + ait for ait ≤ Sτ (i) − sit (1)

There is no uncertainty in state transition. A theater’s state sit is bounded above by its

maximum capacity Sτ (i).

Aggregate adoption cost: The aggregate adoption cost process, {pt}t≥1, includes the digital-

projector price (net of VPF contributions) and ancillary costs. This process is publicly

29Theater size, measured by the number of screens, is stable over time for all theaters.30Each chain’s state should record firms’ states for all theaters part of the chain, resulting in high-

dimensional chain states. For instance, Gaumont-Pathe (70 theaters) has a chain state with dimension1,088,430—assuming all theaters have four screens and ignoring theaters’ types.

15

Page 17: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

observable to all firms and assumed to evolve exogenously and deterministically. The process

reflects technological advances in the manufacturing of digital projectors, as well as learning-

by-doing and scale economies, which exogenously decrease the hardware adoption cost over

time.

Firm-specific adoption cost: The per-screen adoption cost for theater i in period t is the sum

of two components:

pt + εit (2)

where pt is the aggregate adoption cost and εit is a theater-specific shock, drawn from a

distribution with c.d.f F . This theater-specific shock is privately observed at the beginning

of each period and is independent across periods and theaters.

Before defining theaters’ single-period profit function, digital movie availability is first

discussed.

Availability of digital movies: Movies are short-lived and are screened by theaters for one

period. According to industry professionals, multi-homing was widespread over the diffusion

period studied in this paper (2005–2014).31 To match this feature of the distribution market,

I impose the restriction that a given movie is either multi-homed (i.e., distributed on both

film and digital) or released exclusively in film.

For the rest of the analysis, define ht as the share of multi-homed movies. This share is a

function of the industry state xt as well as an exogenous shifter, the share of digital movies

released in the U.S, denoted hUS,t.32 Let

ht = Γ(xt, hUS,t) (3)

denote distributors’ reaction function giving the share of movies multi-homed as a func-

tion of the industry state vector and the state variable hUS,t. The latter variable is publicly

observable to all firms and assumed to evolve exogenously.

Theaters’ Single-Period Profit Function: The single-period profit of theater i with type

τ (i) (net of the adoption cost) in period t, if it adopts ait units of digital hardware, is given

by

Πi(ait,xt, pt, hUS,t, εit) = πτ (i)(sit, ht)− ait(pt + εit) (4)

where ht is determined by equation (3), πτ (i)(sit, ht) are theater i’s operating profits (screen-

ings, concessions, advertisements) in period t, which depends on the firm type and state

31This is consistent with the findings of Yang, Anderson, and Gordon (2019) for the digitization of theKorean movie industry. They find that the vast majority of distributors multi-homed until 2015.

32The dependence of ht on the whole vector of firm states xt is without loss of generality. In practice,however, this vector is high-dimensional and simplifying restriction are imposed in Section 5.2.

16

Page 18: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

xit = (τ (i), sit) and the shares of digital movies ht, and ait(pt + εit) is the total cost of

converting ait screens in period t. There is a positive feedback loop between the hardware

and software sides of the market, if πτ (i)(sit, ht) is increasing in ht. For a theater, the ben-

efit from adopting a digital projector depends on the share of movies available in digital ht

(technology-specific software), which in turn depends on the installed base of digital screens

in the industry (Equation (3)).

Let the difference in single-period profits from choosing action a versus a′ be denoted

∆Πi(a, a′,xt, pt, hUS,t, εit) = Πi(a,xt, pt, hUS,t, εit)− Πi(a

′,xt, pt, hUS,t, εit)

= −(a− a′)(pt + εit)(5)

The single-period profit function satisfies the following “decreasing difference” restriction

∆Πi(a, a′,xt, pt, hUS,t, εi) < ∆Πi(a, a

′,xt, pt, hUS,t, ε′i) for any a > a′ and εi > ε′i (DD)

State space: In a Markov Perfect Equilibrium (MPE), firms use Markov adoption strategies

and condition their adoption decision only on the current vector of state variables yt ≡(xt, pt, hUS,t) and firm-specific shock εit. Although theater i’s single-period profits, Πi(.), do

not depend directly on the industry state vector xt, the firm tracks this variable in order to

form expectations about the future evolution of the (payoff-relevant) share of multi-homed

movies ht, determined by equation (3).

A pure Markov strategy is a mapping from the current state and shock (yt, εit) into an

action ait ∈ Ai (i.e., number of screens to be converted to digital).33 An action profile at

denotes the vector of joint actions in period t, at = (a1t, . . . , aNt) ∈ A = ×Ni=1Ai. The

notation a−i,t = (a1t, . . . , aNt) \ ait ∈ A−i refers to the actions by theaters other than i.

Value function and optimal adoption rule: Let σi(a|y) denote theater i’s conditional

ex-ante (i.e., before the realization of εi) belief that action profile a will be played in state

y. The ex-ante value function (where the subscript t is omitted and next-period variables

are marked with a prime) is given by

33The action space Ai is in fact state-specific, because the space of admissible actions is bounded aboveby Sτ (i) − sit. I omit this dependence in the notation for the sake of expositional clarity.

17

Page 19: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Vi(y;σi) =∑a∈A

σi(a|y)

[πτ (i)(si, h)− aip+ +β

∑y′

G(y′|y, a)Vi(y′;σi)

]

−Sτ(i)−si∑ai=0

σi(ai|y)aiE[εi|ai,y]

(6)

where β is the discount factor, G(y′|y, a) is the probability that state y′ is reached when the

current action profile and state are (a,y), and h is determined by equation (3). The next-

period industry state x′ (in y′ = (x′, p′, h′US)) is a deterministic function of (x, a) because,

for every firm j,

s′j = sj + aj

Uncertainty about y′ is encapsulated in firm i’s beliefs about other firms’ adoption deci-

sions in σi(a|y).

The optimal adoption rule can be expressed as a function of the choice-specific value

functions. Let Wi(ai|y;σi) denote the discounted expected value function of theater i, net

of current period payoff, when converting ai screens in state y with beliefs σi:

Wi(ai|y;σi) = β∑

a−i∈A−i

σi(a−i|y)G(y′|y, ai, a−i)Vi(y′;σi) (7)

Define ∆Wi(a, a′|y;σi) ≡ Wi(a|y;σi) −Wi(a

′|y;σi) for (a, a′) ∈ {0, 1, 2, .., Sτ (i) − sit} as

the difference in the choice-specific value functions of converting a and a′ screens to digital in

state y with beliefs σi. Firm i’s optimal adoption rule is derived by noting that, in deciding

the number of screens to convert to digital technology, the firm compares the choice-specific

value functions net of the adoption cost. The adoption cost, in turn, depends on the current

list price pt, and firm i’s idiosyncratic shock εit.

Player i’s set of best responses in state (y, εi) and under beliefs σi is defined as

BRi(y, εi;σi) = {ai ∈ Ai : Wi(ai|y;σi)− ai(p+ εi) ≥ Wi(a′|y;σi)− a′(p+ εi) for all a′ ∈ Ai}

(8)

where operating profits πτ (i)(si, h) cancel out because they do not depend on the action

taken in the current period. Srisuma (2013) shows that, if the single-period payoffs satisfy

restriction (DD), BRi(y, εi;σi) is a singleton set almost surely. Additionally, for any given

beliefs, each player’s best response is a nonincreasing pure strategy almost surely, that is,

18

Page 20: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

a(y, εi;σi) ≤ a(y, ε′i;σi) for all εi > ε′i

With a discrete action space, the optimal adoption rule, given beliefs σi, takes the form

of a set of cut-offs in εi. Additionally, for any beliefs, the lowest and highest actions (a = 0 or

a = Sτ (i) − sit) are always played with positive probability because the shock εit takes value

on the real line whereas the choice-specific value functions is uniformly bounded above—due

to bounded profits and expected adoption cost (last term in Equation (6)).34

5.2 Market equilibrium

In every period, the sequence of events is as follows: First, given the industry state vector

xt and digital availability shifter hUS,t, the share of movies multi-homed ht is determined

(Equation (3)). Second, each theater receives a private draw εit from the distribution of

hardware costs, and decides whether to convert any screens to digital, given the vector of

current state variables yt and its private adoption cost. Third, theaters receive operating

profits and pay the adoption cost. The state variables evolve as the adoption decisions are

completed and new values of the exogenous variables are realized.

A MPE in pure strategies is first defined. In a MPE, each theater’s adoption decision is

optimal in every state, given its beliefs about future states, and those beliefs are consistent

with the adoption decisions of other theaters. The adoption strategy profile a = (a1, . . . , aN)

and beliefs σ = (σ1, . . . , σN) form a Markov Perfect equilibrium if:

(i) for all i, ai is a best response to a−i given the beliefs σi at all states y (optimality).

(ii) for all i, the beliefs σi are consistent with the strategies a (belief consistency).

Under restriction (DD), Srisuma (2013) provides existence results for a pure strategy

MPE in which equilibrium strategies are monotone in εi. Moreover, defining the correspond-

ing conditional choice probabilities (CCP) as

Fi(a|y) = P (a(y, εi;σi) ≤ a|y)

the latter paper shows that a necessary and sufficient condition for a MPE is that the

vector of CCP {Fi(a|y)}i,y is a fixed point of the best-response probability mapping (an

extension of the equilibrium characterization results of Pesendorfer and Schmidt-Dengler

(2008) (Proposition 1) and Aguirregabiria and Mira (2007) (Representation Lemma) to the

class of ordered choice games).

Estimation and computation of a MPE present computational difficulties due to the high

34This result is immediate by taking the shock εit to plus or minus infinity.

19

Page 21: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

dimensionality of the state space xt. In this particular setting, N = 399 and each theater

has at least 5 possible states (Sτ (i) ≥ 4). In anticipation of these computational difficulties, I

reduce the dimensionality of the state space by imposing behavioral restrictions and defining

an alternative equilibrium concept.

Assumption 1. Theaters ignore the feedback of their own actions on ht (given by Equation

(3)), and the corresponding feedback on other firms’ actions. Theaters take the evolution of

the aggregate state variables (xt, ht, pt, hUS,t) as deterministic.

The equilibrium concept used is the nonstationary oblivious equilibrium (NOE) of Wein-

traub, Benkard, Jeziorski, and Van Roy (2008). This equilibrium concept is used to approx-

imate the short-run dynamics of an industry starting from a given initial industry state.35

If there is a large number of firms and no aggregate shocks, the industry state starting from

an initial state approximately follows a deterministic path.36 Each firm can make close-

to-optimal decisions based on its own type and state xit = (τ (i), sit) and by knowing the

deterministic path followed by the industry state (xt or, equivalently, the payoff-relevant

state ht).

The NOE concept is particularly well suited to model equilibrium outcomes in the digital

conversion of the movie industry: there is a large number of small firms and screens so that

converting a single screen to digital has a negligible impact on the aggregate state of the

industry and the availability of digital movies ht. Moreover, it is reasonable to assume that

theaters do not track the whole industry state vector xt (recording adoption rate of each

theater in the industry) or the number of firm in every possible firm state, but rather, form

beliefs about the evolution of ht. With a large number of firms, the process {ht}t≥1 is close

to deterministic.

Next, I define nonstationary oblivious strategies, beliefs, and value functions.

Nonstationary oblivious strategies. In a NOE, each firm’s decisions depend only on

the firm’s own type and state xit = (τ (i), sit) and the time period. In this sense, strategies

are type-symmetric. A nonstationary oblivious strategy is a sequence ano = {ano1 , ano2 , . . . },where for each period t, theater i takes action anot (xit, εit). This type of strategy differs

from a (stationary) Markov strategy ai(yt, εit) which is a function of the current vector

of state variables yt = (xt, pt, hUS,t) (in particular, the whole industry state xt) but does

not depend on time. Nonstationary oblivious strategies generate decisions made without

complete knowledge of the circumstances (in particular, the true realized state of the industry

xt).

35NOE differs from the stationary oblivious equilibrium of Weintraub, Benkard, and Van Roy (2008),which is suited to approximate the long-run steady state dynamics of an industry.

36The number of firm types (i.e., the cardinality of set T ) is held fixed.

20

Page 22: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Nonstationary oblivious beliefs. Theaters make their adoption decision assuming that

the industry state evolves deterministically: in particular, the industry state at time t is

the expected industry state after t time periods of evolution given the other firms’ strategy

and starting from the initial industry state. Under this assumption, the industry state is a

function of time. For a common nonstationary oblivious strategy ano played by all theaters,

and given the initial industry state x1, I define the deterministic process of digital movie

availability {ht}t≥1 as

ht = Ext [ht|x1] (9)

where the expectation is taken with respect to the industry state xt generated by strategy

ano and, as before, ht is determined by Equation (3). I make the dependence of the process

{ht}t≥1 on the strategy ano explicit by denoting it{ht(a

no)}t≥1.

37

Nonstationary oblivious value functions. For a pair of nonstationary oblivious strate-

gies (ano, ano), I define an ex-ante nonstationary oblivious value function for period t (that

is before εit is realized), if firm i follows strategy ano whereas all other firms use strategy ano,

as

Vτ (i),t(sit|ano, ano) = Eεit[πτ (i)(sit, ht(a

no))− anot (xit, εit)(pt + εit

)+ βVτ (i),t+1(si,t+1|ano, ano)

](10)

where si,t+1 = sit + anot (xit, εit). This value function corresponds to the expected net

present value of a type-τ (i) theater that is at state sit at time t and follows nonstationary

oblivious strategy ano, under the assumption that, for all t ≥ 1, the availability of digital

37Weintraub, Benkard, Jeziorski, and Van Roy (2008) use an alternative representation of the industrystate. The industry state can be expressed as the distribution (or number of firms) in each possible state. Thisrepresentation is without loss of generality when focusing on anonymous and (type-)symmetric equilibriumstrategies because the identity i of the firm does not matter (Doraszelski and Satterthwaite (2010)). Foreach type-state combination (τ , s) with τ ∈ T and s ∈ Sτ , one can alternatively define xt(τ , s) to be thenumber of type-τ theaters (across the industry) with s digital screens by period t. The industry state spaceat time t is defined as{

(xt(τ , 0), xt(τ , 1), . . . , xt(τ , Sτ ))∣∣∣ Sτ∑s=0

xt(τ , s) = Nτ

}τ∈T

⊂ N∑

τ∈T (Sτ+1)

That is, for each type τ , the number of firms with 0, . . . , Sτ digital screens at time t. The authors definext as an element of this set, referred to as the industry state at time t. In a NOE, the expected number offirms in state (τ , s′) at time t+ 1 is given by

xt+1(τ , s′) =∑s≤s′

xt(τ , s)P (anot = s′ − s|τ , s)

The two representations are equivalent under type-symmetric and anonymous equilibrium strategies.

21

Page 23: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

movies is ht(ano) at time t. Note that the nonstationary oblivious value function remains a

function of the competitors’ strategy ano through the expected availability of digital movies

ht(ano), even if the firm’s state trajectory ({sit}t≥1) only depends on the firm’s own strategy

ano. I abuse notation by using Vτ (i),t(sit|ano) ≡ Vτ (i),t(sit|ano, ano) to refer to the nonstation-

ary oblivious value function when firm i follows the same strategy ano as its competitors.

Nonstationary oblivious equilibrium. An nonstationary oblivious equilibrium consists

of a strategy ano that satisfies

supano

Vτ (i),t(sit|ano, ano) = Vτ (i),t(sit|ano) for all i ∈ N, state sit ∈ Sτ (i) and periods t (11)

(by dynamic programming principles, it is sufficient for the equality to hold for t = 1).

The belief consistency requirement in a nonstationary oblivious equilibrium is “weaker”

than for a Markov Perfect equilibrium. In the latter, belief consistency requires that each firm

optimizes given beliefs (about state transitions) generated by its competitiors’ equilibrium

strategies. In the former equilibrium concept, belief consistency only requires that each

firm optimizes given the deterministic path of expected aggregate states generated by its

competitors’ equilibrium strategies. Importantly, firm i ignores: (1) its impact on the process

{ht}t≥1 (given competitors’ equilibrium strategies) and (2) the feedback of its own action on

other competitors. The actual stochastic process {ht}t≥1 will in general not coincide with

the deterministic process {ht}t≥1, but converges to it as the number of firms grows to infinity

(keeping the number of firm types fixed).

Weintraub, Benkard, Jeziorski, and Van Roy (2008) provide existence results for NOE

that converge to a stationary strategy as t goes to infinity. That is, they consider nonsta-

tionary oblivious strategies ano = {ano1 , ano2 , . . . }, such that

limt→∞

anot (xi, εi) = ano (xi, εt) for all (xi, εi)

for some stationary oblivious strategy ano (xi, εi).38 In this particular setting, and for

the purpose of estimation, it is natural to focus on outcomes generated by strategies that

become stationary as t goes to infinity, that is, where the movie industry converges to a sta-

tionary state (i.e., industry-wide adoption of digital, limt→∞

sit = Sτ (i) for all i, or equivalently,

limt→∞

anot (τ (i), si, εi) = 0). I return to this assumption in the estimation section as it allows

me to focus on a finite horizon.

38For this particular setting, the relevant assumptions guaranteeing existence are: the single-period profitsare monotone in the state, bounded, and differentiable, and firm states sit are bounded, as well as standardfull support assumptions on the structural shock εit.

22

Page 24: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

5.3 Remarks

The assumption that theaters make their adoption decisions independently, even within

chains, is violated if theater chains coordinate adoption decisions across theaters. Two

incentives to do so are: (1) to benefit from lower per-unit adoption cost when placing large

orders of projectors and (2) to tip the industry by significantly increasing the share of digital

screens in the industry. To alleviate these concerns, the model controls for chain effects in

the profits from operating (firm type τ (i) include an indicator for the three largest theater

chains). Discussions with chain managers (at Gaumont-Pathe) revealed that theaters were

converted starting from the most profitable ones, which is consistent with theater-level profit

maximization as in the model.

6 Identification and Estimation

This section presents the identification and estimation of the structural model presented in

section 5. The objective is to recover firms’ operating profits πτ (i)(sit, ht). Estimation results

indicate that there is a significant reduction in costs with the conversion to digital. These

cost-reductions are consistent with labor cost savings and the magnitude of projectionnists’

wages. Finally, the analysis uncovers the main dimensions of hetereogeneity in profits from

digital across theaters.

6.1 Estimation approach

Given that the behavioral restrictions and NOE concept are motivated by issues related

to estimation, I start by discussing estimation assuming that the parameter of interest are

identified, and show identification of the model in the following section.

Two issues make the estimation of a MPE in the dynamic game of Section 5.1 complicated.

First is the curse of dimensionality: the large number of firms (due to network effects at the

industry level) and the dimension of firms’ states generate a high-dimensional industry state

space.39 Second, as is common in games of technology adoption under network effects, there

are potentially multiple equilibria.

Although recent approaches for dynamic game estimation (Aguirregabiria and Mira

(2007), Bajari, Benkard, and Levin (2007), Pesendorfer and Schmidt-Dengler (2008), and

Pakes, Ostrovsky, and Berry (2007)) can be used to address the multiplicity issue, the esti-

mation and computation of counterfactuals will still be infeasible due to the dimensionality

39For instance, ignoring firm heterogeneity and assuming all 399 firms are four-screen theaters (so sit ∈{0, 1, 2, 3, 4}), the total number of possible industry states xt is 1,071,993,300.

23

Page 25: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

of the state space. The model is, therefore, estimated under the assumption that the data

is generated by a nonstationary oblivious equilibrum (that becomes stationary as time pro-

gresses) as introduced in Section 5.2. The assumptions on the data generating process are

as follows

Assumption 2. The data is generated by a nonstationary oblivious equilibrium ano in which

strategies are nonincreasing in εi. In this equilibrium, there exists a finite time horizon

of length T by the end of which the industry is in its long-run adoption steady-state with

probability arbitrarily close to one. In the long-run steady state, all theaters have converted

to digital (sit = Sτ (i)), adoption strategies are stationary (in fact constant, anot = 0), all

movies are available in digital (ht = 1), and single-period profits are constant and given by

πτ (i)(Sτ (i), ht = 1).

Assumption 2 allows me to restrict attention to the finite vector of choice-probabilities

corresponding to the NOE played in the data, for t = 1, . . . , T , that is,

{Pt(anoit |xit)} for all t = 1, . . . , T and xit = (τ (i), sit)

In the industry model, the shock εit has full support on the real line and single-period

profits are bounded, therefore, a firm can always choose ait = 0 with non-zero probability.

In practice, T is chosen large enough so that, given the realization of the equilibrium ano

played in the data—in which, by 2015 the industry was entirely converted to digital—the

industry-wide adoption steady state has been reached by T with probability arbitrarily close

to one. In what follows, T is set to 30 periods (i.e., 15 years) and the probability that the

steady state is reached (over 1, 000 simulations of the industry trajectory) is 97%.

The estimation approach follows the CCP-based method proposed in Hotz and Miller

(1993) and Hotz, Miller, Sanders, and Smith (1994) for single-agent dynamic problems, and

Pesendorfer and Schmidt-Dengler (2008) for dynamic games (see Srisuma (2013) for an ex-

tension to games with ordered choices). This estimation approach relies on the necessary

condition for an equilibrium: i.e., choice probabilities are a fixed point of the best-response

probability mapping.40 In a first step, the nonstationary oblivious CCP giving the equilib-

rium policy rule are estimated from the data. Next, using the CCP, I obtain the nonsta-

tionary oblivious choice-specific value functions for each candidate parameter. In this step, I

avoid forward-simulation by using the matrix inversion method suggested by Aguirregabiria

and Mira (2007) and Pesendorfer and Schmidt-Dengler (2008). In the last step, I estimate

40As shown in Pesendorfer and Schmidt-Dengler (2008) (Proposition 1), the necessary condition for anequilibrium is a direct consequence of the definition of an equilibrium (i.e., combining the optimality andbelief consistency conditions).

24

Page 26: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

the parameter of interest by minimizing the distance between the predicted CCP and the

actual CCP (or equivalently, between predicted and actual differences in choice-specific value

functions).

In dynamic games with unordered choices (such as the ones studied in Pesendorfer and

Schmidt-Dengler (2008)), unbounded shocks guarantee that all actions are chosen with posi-

tive probability: the Hotz-Miller inversion approach provides a one-to-one mapping between

choice probabilities and all differences in choice-specific value functions. This is not always

the case in games with ordered choices.41 Under monotonicity of equilibrium strategies, the

inversion approach must account for the fact that choice probabilities can only be mapped to

differences in choice-specific value functions between actions played with positive probability.42

First-step estimation:

Movie theaters’ adoption-policy function. The estimation proceeds by first recovering the

nonstationary oblivious CCP governing theaters’ equilibrium adoption of digital projectors.

The CCP Pt(anoit |xit), which are a function of time, and the type and state of firm i, are

estimated using an ordered probit model, and in what follows are assumed to be known.

Denote Pt(anoit |xit) estimates of the nonstationary oblivious CCP.

As discussed above, some actions may not be played with positive probability in equilib-

rium, therefore, the Hotz-Miller inversion approach does not directly apply to games with

ordered choices. However, monotonicity of equilibrium adoption strategies can be used to

characterize the mapping from CCP to differences in choice-specific value function. Denote

the set of actions played with positive probability in period t and state xit by

Supp(xit, t) = {ai ∈ Ai : Pt(ai|xit) > 0} ⊆ Ai

Let the elements of Supp(xit, t) be ranked in increasing order and indexed by k. Denote

by ak the kth largest element of Supp(xit, t). Invoking the monotonicity of strategies, the

indifference condition between action ak and ak+1 can be expressed as

Wt(ak|xit)− ak(pt + εi,k) = Wt(ak+1|xit)− ak+1(pt + εi,k)

⇔ εi,k =∆Wt(ak+1, ak|xit)

ak+1 − ak− pt

(12)

41In this setting with unbounded error and bounded expected value functions, the lowest and highestactions are always played with positive probability.

42The mapping of CCP to differences in choice-specific value functions gives a closed-form of the best-response mapping and facilitates estimation. This closed-form is available because the support of the CCPis known. However, for computing counterfactual equilibria, one would still need to characterize the set ofactions played with positive probability. An algorithm for finding this set was introducted in Gowrisankaran,Lucarelli, Schmidt-Dengler, and Town (2010).

25

Page 27: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

where ∆Wt(ak+1, ak|xit) is the difference in the oblivious choice-specific value functions

of converting ak+1 and ak screens to digital in state (t,xit). Equation (12) expresses the

cut-offs characterizing the equilibrium adoption strategy as a function of differences in the

choice-specific value function. These cut-offs can be used to map CCP to differences in the

choice-specific value function.

Pt(anoit ≤ ak|xit) = 1− F (εi,k) = 1− F

(∆Wt(ak+1, ak|xit)

ak+1 − ak− pt

)(13)

Differences in choice-specific value functions can be obtained from knowledge of the CCP,

by inverting equation (13):

∆Wt(ak+1, ak|xit)ak+1 − ak

− pt = F−1 (1− Pt(anoit ≤ ak|xit)) (14)

Using equation (14), I can construct estimates of ∆Wt(ak+1, ak|xit) from knowledge of the

CCP Pt(anoit ≤ ak|xit) estimates for all ak ∈ Supp(xit, t).

Value functions. Given a candidate parameter vector and knowledge of the CCP, the ex-

pected oblivious value function solves a system of linear equations.43 For each firm type and

state xit = (τ (i), sit) and period t = 1, . . . , T ,

Vτ (i),t(sit|ano) =

Sτ(i)−sit∑ait=0

Pt(anoit |xit)

[πτ (i)(sit, ht(a

no))− anoit (pt + E[εi|anoit ,xit])]

+ β

Sτ(i)−sit∑ait=0

Pt(anoit |xit)Vτ (i),t+1(si,t+1|ano)

(15)

where si,t+1 = sit + anoit .

In period t = T , the industry enters its long-run steady state where ht = 1 and sit = Sτ (i)

for all i and t > T . All firms that have not fully converted yet are restricted to do so, that

is, PT (Sτ (i) − sit|xi,T ) = 1 for all xi,T . (as discussed above, T is chosen large enough so

that firms have completed their conversion by this period with probability close to one). For

t > T , the value function is constant and equal to:

Vτ (i),T+1(Sτ (i)|ano) = πτ (i)(Sτ (i), h = 1) + βVτ (i),T+1(Sτ (i)|ano) =∞∑l=0

βlπτ (i)(Sτ (i), h = 1)

43This is noted by Aguirregabiria and Mira (2007) and Pesendorfer and Schmidt-Dengler (2008) for sta-tionary games.

26

Page 28: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Denote by L the cardinality of the set of possible type-state combination xit = (τ (i), sit).

L equals∑τ∈T

(Sτ+1) because the number of digital screens in a type-τ theater is an element of

the set Sτ = {0, 1, ..., Sτ}. Collecting the equilibrium ex-ante value functions Vτ (i),t(sit|ano)for all t = 1, . . . , T , all types τ ∈ T , and states s ∈ Sτ in matrix notation, the expected

value function as a function of a candidate structural parameter α can be written

V(α) =∑a

P(a)(Π(α)− a(p + e(a)

)+ β · F · V(α)

= (I− β · F)−1

{∑a

P(a)(Π(α)− a(p + e(a))

)} (16)

where V(α) is the (L · T ) × 1 dimensional vector of ex-ante value functions, I is the

(L · T )-dimensional identity matrix, F is the (L · T )× (L · T ) dimensional matrix consisting

of a type-τ theater’s conditional choice probabilities {Pt(a|τ , s)}Sτ−sa=0 in row (τ , s, t) and

columns {(τ , s + a, t + 1)}Sτ−sa=0 and zeros in the remaining columns; For t = T , all rows

{(τ , s, T )}τ ,s transition to the absorbing state (τ , Sτ , T ) with probability one. P(a) is an

(L · T )× (L · T )-dimensional matrix of CCP with diagonal elements equal to Pt(a|τ , s) and

off-diagonal elements equal to zero, Π(α) is an (L · T ) × 1 vector of single-period profits

with row (τ , s, t) containing πτ (s, ht), p is an (L · T ) × 1 block vector of adoption costs

with row (τ , s, t) containing pt, and e(a) is an (L · T )× 1 vector with row (τ , s, t) equal to

E[εi|a, (τ , s)]. The matrix (I−β ·F) is invertible because it is a strictly diagonally dominant

matrix (1 > βSτ−s∑a=0

Pt(a|τ , s)).44

Let V(α) = {Vτ ,t(s;α)}τ∈T ,s∈Sτ ,t=1,...,T be the solution of system (16), for a given can-

didate parameter α and the actual equilibrium CCP vector played in the data. The pre-

dicted nonstationary oblivious choice-specific value functions Wt(a|xit;α), and predicted

CCP Pt(ait|xit;α) can be derived as follows

Wt(a|xit;α) = βVτ (i),t+1(si,t+1;α) where xit = (τ (i), sit) and si,t+1 = sit + a (17)

Pt(anoit ≤ ak|xit;α) = 1− F

(∆Wt(ak+1, ak|xit;α)

ak+1 − ak− pt

)for all ak ∈ Supp(xit, t) (18)

44The discount factor used is β = 0.95, annually.

27

Page 29: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Second-step estimation:

In the second step, the underlying parametersα are set such that the predicted CCP Pt(anoit ≤

ak|τ (i), sit;α) match estimates of the actual equilibrium CCP Pt(anoit ≤ ak|τ (i), sit) (obtained

in step 1) for every firm type τ ∈ T , state s ∈ Sτ , period t = 1, . . . , T , and action ak ∈Supp(xit, t).

45 Equivalently, one can match the differences in choice-specific value functions,

which are a monotone transformation of the CCP. I follow this second approach.46 The

objective function is

Q(α) = ||∆Wt(ak+1, ak|τ , s;α)− ∆Wt(ak+1, ak|τ , s)||2

=

√∑τ ,s,t,k

(∆Wt(ak+1, ak|τ , s;α)− ∆Wt(ak+1, ak|τ , s)

)2 (19)

The estimator of the underlying parameters is the solution of

minαQ(α)

Standard errors are obtained by bootstrap sampling. One difficulty with non-parametric

bootstrap is the presence of correlation in decisions across local markets and firms, there-

fore, sampling market-histories (or firm-histories) with replacement, as is commonly done in

dynamic oligopoly games, is not a valid approach. Instead, a parametric boostrap procedure

is used. This approach is detailed in Appendix C.1.

6.2 Identification

This section examines the identification of the industry model. In a NOE, firms take the

evolution of the industry state as exogenous and deterministic. This path would be in-

deed deterministic with infinitely many firms playing an equilibrium nonstationary oblivious

strategy ano. In this section, I assume that the econometrician observes a single infinite

population of firms (fixing the set of firm types) playing strategy ano over a finite horizon

of length T . This approach differs from the common identification approach for MPE in

dynamic oligopoly games which relies on a cross-section of market-paths and requires, under

multiplicity of equilibria, that the same equilibrium is played in all markets. By using only

a single market, the paper sidesteps the multiplicity issue by identifying the parameters of

interest from observation of many firms of the same type.

45One action in Supp(xit, t) is uninformative and dropped because the CCP add up to one.46Matching differences in the choice-specific value functions is preferred here because the objective function

is quadratic in the parameter, and the gradient can be easily derived.

28

Page 30: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

I assume that the conditional choice probabilities Pt(a|τ , s) are identified from the data

for every firm type τ ∈ T , firm state s ∈ {0, 1, . . . , Sτ − 1}, action a ∈ {1, . . . , Sτ − s}, and

period t = 1, . . . , T − 1. The corresponding differences in the choice-specific value functions

∆Wt(a′, a|τ , s) are identified for actions played with non-zero probability (Equation (14)).

As is standard in the literature on identification of dynamic decision problems (Rust (1994),

Magnac and Thesmar (2002), and Bajari, Chernozhukov, Hong, and Nekipelov (2015)), the

discount factor and the distribution of firm shocks (β, F ) are assumed to be known.

The unknown elements are the (nonstationary) oblivious single-period profits which are

a function of the firm type and state (τ (i), sit), action ait, and time t (i.e., firms assume that

all aggregate state variables evolve exogenously and deterministically). Adoption costs are

assumed to enter linearly into single-period profits under the restriction

πτ (i)(sit, ht)− ait(pt + εit)

which fixes the dependence of profits on firm adoption decision ait. This restriction is

arguably well-informed given the structure of the problem. The structural parameters to

identify are, therefore, the set of single-period profits gross of the adoption cost

α ≡ Π = {πτ (s, ht)}τ∈T ,s∈Sτ

for every firm type τ ∈ T , state s ∈ Sτ , and period t = 1, . . . , T . The number of unknown

parameters is

(L× T ) =∑τ∈T

(Sτ + 1)× T

The necessary conditions for optimality can be expressed as

∆Wt(ak+1, ak|τ , s) = ∆Wt(ak+1, ak|τ , s; Π) (20)

for every firm type τ ∈ T , firm state s ∈ {0, 1, . . . , Sτ − 1}, actions (ak+1, ak) ∈Supp(τ , s, t), period t = 1, . . . , T − 1, and where ∆Wt(ak+1, ak|τ , s; Π) can be derived by

Equation (17). Importantly, there are no equilibrium conditions for s = Sτ (the theater has

completed its conversion) or t = T (the steady state is reached and all firms are restricted

to convert their remaining screens).

Due to restrictions imposed by the NOE (in particular, that a firm’s action or state does

not affect the deterministic trajectory of the industry state), some of the optimality condi-

tions are linearly dependent. I impose sufficient condition, which holds in the equilibrium

played in the data, under which the model is under-identified.

29

Page 31: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Assumption 3. Equilibrium strategies are such that action a = 1 is played with non-zero

probability for every firm type τ ∈ T , state s ∈ {0, 1, . . . , Sτ −1}, and period t = 1, . . . , T −1

(in addition to the lowest and highest actions (0 and Sτ − s)).

Let τ denote the cardinality of the type space T .

Lemma 1. Under Assumption 3, the necessary optimality conditions (20) form a system of

at most (L− τ) · (T − 1) linearly independent equations in the unknown parameter Π.

Proof. From Equation (17), we have that for all a ∈ {1, . . . , Sτ}

∆Wt(a, a− 1|τ , s; Π) = β(Vτ ,t+1(s+ a; Π)− Vτ ,t+1(s+ a− 1; Π)

)= Wt(1|τ , s+ a− 1; Π)− Wt(0|τ , s+ a− 1; Π)

= ∆Wt(1, 0|τ , s+ a− 1; Π)

(21)

The predicted differences in choice specific value functions can be rewritten as a linear

combination of predicted differences between actions 1 and 0, for all (ak+1, ak) ∈ Supp(τ , s, t)

∆Wt(ak+1, ak|τ , s; Π) = ∆Wt(ak+1, ak+1 − 1|τ , s; Π) + . . .+ ∆Wt(ak + 1, ak|τ , s; Π)

= ∆Wt(1, 0|τ , s+ ak+1 − 1; Π) + . . .+ ∆Wt(1, 0|τ , s+ ak; Π)(22)

Since actions a = 1 and a = 0 are played with non-zero probability, ∆Wt(1, 0|τ , s) is

identified from the data. The linear dependence between optimality conditions shown in

Equation (22) implies that only the set of conditions for a = 1 and a = 0 provide identifying

restrictions for Π. The set of restrictions is

∆Wt(1, 0|τ , s) = ∆Wt(1, 0|τ , s; Π) (23)

for every firm type τ ∈ T , firm state s ∈ {0, 1, . . . , Sτ −1}, and period t = 1, . . . , T −1. The

optimality conditions (23) form a (underdetermined) system of (L− τ) · (T − 1), equations

in the unknown parameter Π.47

Expressing the differences in choice-specific value functions as a function of the ex-ante

value function (Equation (17)), we obtain

∆Wt(1, 0|τ , s) = β(Vτ ,t+1(s+ 1;α)− Vτ ,t+1(s;α)

)47Recall that L equals

∑τ∈T

(Sτ + 1). Therefore the number of optimality conditions is∑τ∈T

(Sτ ) · (T − 1) =

(L− τ) · (T − 1).

30

Page 32: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Collecting these equations for every firm types τ ∈ T , firm state s ∈ {0, 1, . . . , Sτ − 1},and period t = 1, . . . , T − 1, in matrix form, and substituting the ex-ante value function

(Equation (16)), we obtain a system of linear equations in the unknown parameter Π

∆W = βHV

= βH(I− β · F)−1Π− βH(I− β · F)−1∑a

P(a)a[p + e(a)](24)

where H is ((L − τ) · (T − 1)) × (L × T )-dimensional with value −1 in row (τ , s, t) and

column (τ , s, t + 1) and value +1 in row (τ , s, t) and column (τ , s + 1, t + 1), ∆W is a

((L− τ) · (T − 1))× 1-dimensional vector with row (τ , s, t) equal to ∆Wt(1|τ , s), and V is

the (L ·T )× 1 dimensional vector of ex-ante value functions. The linear system in Π can be

written

Y = X ·Π (25)

where Y and X are a ((L− τ) · (T −1))×1-dimensional and ((L− τ) · (T −1))× (L×T )-

dimensional matrices which are function of the (known) CCP, prices, discount factor β, and

distribution of the firm-specific shock F . I augment this system of equations with two sets

of restrictions: profits under the film technology are normalized to zero and, if there are no

digital movies, profits under the digital technology are normalized to zero.

Assumption 4. The following payoff normalizations are imposed.

1. Single-period profits in state s = 0 are normalized to zero:

πτ (s = 0, ht) = 0

for every firm type τ ∈ T and period t = 1, . . . , T .

2. Single-period profits in period 1 (h1 = 0) are normalized to zero:

πτ (s, h1 = 0) = 0

for every firm type τ ∈ T and firm state s ∈ {0, 1, . . . , Sτ}.

Assumption 4 imposes a total of L + τ × (T − 1) restrictions. In matrix form, these

normalization are represented as:

R ·Π = 0L+τ×(T−1)

31

Page 33: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

where R is a matrix with L+ τ × (T − 1) rows and (L× T ) columns and 0L+τ×(T−1) is a

(L + τ × (T − 1))× 1-dimensional vector of zeros. Augmenting system (25) with the set of

normalization of Assumptions 4 yields a system of (L× T ) in (L× T ) equations in as many

unknowns [X

R

]Π =

[Y

0L+τ×(T−1)

]⇔ X ·Π = Y (26)

the parameter Π is exactly identified if X has full rank. If Assumption 3 does not

hold at the equilibrium strategies, then there are more restrictions than unknowns and Π is

over-identified (under the normalizations of Assumption 4).

Finally, note that πτ (s, ht) is only identified over the support of the deterministic path

{ht}t=1,...,T . Therefore, for every firm type τ ∈ T and firm state s ∈ {0, 1, . . . , Sτ}, at most T

restrictions are available to identify the dependence of πτ (s, ht) on ht.48 In the next section,

I impose parametric restriction that reduces the dimensionality of the parameter space Π

and allows identification of the effect of ht on profits.

6.3 Parameterization

This section details the model parameterization. Operating profits are obtained by summing

profits per digital screen (relative to film) over the sit digital screens converted by theater i

and time t. Profits from film screens are normalized to zero.

The parameterization aims to capture two features of the model: (i) the heterogeneity

in profits per digital screen across theaters and markets and (ii) the positive dependence of

these profits on the share of movies available in digital ht. In particular, when ht is low (early

phase of diffusion), theaters have a limited choice of digital movies and optimally convert

only a fraction of their screens to digital: the marginal benefit from converting an additional

screen to digital is decreasing.49 To capture this dependence parsimoniously, I define, for

each theater, two levels of profits per digital screen, denoted πd(τ ) and πd(τ , ht)

(1) πd(τ ) corresponds to the baseline profits per digital screen (relative to film), i.e., the

cost-reduction from switching a screen from film to digital if all movies were available in

digital (ht = 1).

48If the econometrician has access to a cross-section of markets (with infinite number of firms), e.g., differentcountries, multiplicity of equilibria can help with the identification of the effect of ht. Different deterministicpaths for ht would be observed across markets and one can use the time series and cross-sectional variationin ht to identify the effect of ht.

49If the marginal benefits per additional digital screen were constant across screens, a theater would convertall screens simultaneously.

32

Page 34: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

(2) πd(τ , ht) corresponds to profits per digital screen (relative to film), i.e., the cost-

reduction from switching a screen from film to digital if only a fraction ht of movies is

available in digital (ht < 1).

When ht is less than one, theater i has a limited choice of digital movies, therefore,

its profits from a digital screen (πd(τ , ht)) will be lower than if all movies were available

in digital (πd(τ )). Denote by δ(τ , ht) ≡ πd(τ ) − πd(τ , ht), the profit loss (or decay) from

limited availability of digital movies. As ht converges to one, δ(τ , ht) is expected to decrease

(or equivalently, πd(τ , ht) is expected to increase).

In summing profits across digital screen, the following functinal form is assumed: a

fraction min{ sitSτ(i)

, ht} of screens yields profits πd(τ ) per screen, whereas the remaining

max{0, sitSτ(i)− ht} yield profits πd(τ , ht). Theater i’s operating profits in state (xit, ht) are

therefore given by

πτ (si, ht) =

siπd(τ ) if siSτ(i)≤ ht

Sτ (i)

(htπd(τ ) + ( si

Sτ(i)− ht)πd(τ , ht)

)if si

Sτ(i)≥ ht

(27)

This specification captures decreasing marginal benefits from an additional digital screen.

For instance, a 5−screen theater with adoption rate sitSτ(i)

= ht = 0.4 receives profits per

digital screen equal to πd(τ ). If the firm were to convert its third screen to digital, the

marginal benefit would be πd(τ , ht) < πd(τ ).

The effect of a marginal increase in ht on profits (i.e., the “network benefit”) for a theater

that has fully converted ( sitSτ(i)

= 1) is given by

Sτ (i)

(πd(τ )− πd(τ , ht) + (1− ht)

∂πd(τ , ht)

∂ht

)> 0

The term πd(τ )− πd(τ , ht) corresponds to the change in profits for the marginal screen,

whereas ( sitSτ(i)−ht)∂πd(τ ,ht)∂ht

corresponds to the change in profits for the supra-marginal screens

(a share sitSτ(i)−ht = 1−ht of screens). Profits for infra-marginal screens (a share ht) are not

affected since constant and equal to πd(τ ).

Figure 3 illustrates this representation of operating profits in the case of a 5-screen the-

ater when ht = 0.5. The left-hand side figure shows a general form of profits per digital

screen, whereas the right-hand side represents the functional form restriction imposed. As

ht increases, the decay decreases and profits from the fraction max{0, sitSτ(i)− ht} of digital

screens increase from πd(τ , ht) to πd(τ ).

For profits per digital screen πd(τ ) and decay δ(τ , ht), a reduced form is used:

πd(τ ) = α0 + α1Sτ (i) + α21{arti = 1}+ α3S−i + αmarketi + αchaini(28)

33

Page 35: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

δ(τ , ht) = δ0 + δ1Sτ (i) + δ2ht (29)

where Sτ (i) is the number of screens in theater i, arti is an indicator for art house theaters,

S−i is the total number of screens owned by theater i’s competitors, and αmarketi and αchaini

are dummies for market size and chain identifier.50 Equation (28) specifies how profits per

digital screen depends on firm and market characteristics. Equation (29) specifies the decay

in profits. The decay depends on theater size and availability of digital movies. The decay

is expected to decrease with digital movie availability. Because larger theaters screen more

movies, the decay is expected to increase with theater size for a given level of ht.51

The parameters of interest are the vectorα = ({αi}i=0,,3, αmarket=1,,6, αchain=1,,3, {δi}i=0,...,2)

entering the profit per digital screen and decay. All parameters are dynamic in the sense

that they must be inferred from firms’ dynamic decision process. The distribution F of the

firm-specific shock εit is set to N (0, σ2).

6.4 Endogeneity of digital movie availability

The model is estimated under the assumption that a nonstationary oblivious equilibrium

generates the data (Assumption 2). In equilibrium, firms take the evolution of digital movie

availability as deterministic, following the process {ht(ano)}t=1,,T (Equation (9)).

If the mapping Γ(·) were known, ht(ano) could be computed by simple matrix multiplica-

tion from the knowledge of the equilibrium CCP and the initial industry state x1. Estimating

Γ(·) precisely is, however, infeasible due to the high-dimensionality of xt and the short panel

nature of the data (the asymptotics are in T ). Instead, I exploit the actual process {ht}t=1,,T

observed in the data as an estimator for {ht(ano)}t=1,,T : indeed, ht converges to ht(ano) as

the number of firms grows to infinity (the asymptotics are in N).

The actual process {ht}t=1,,T observed in the data is determined endogenously by the

realized adoption of digital screens. To estimate the parameter δ2 characterizing the effect

of ht on the decay in single-period profits, I follow an approach similar to Gowrisankaran,

Park, and Rysman (2014). First, I include a set of time dummies for t = 1, . . . , T in place

of ht in equation (29). Second, I regress the time dummy coefficients on ht, and instrument

the latter variable by digital movie availability in the U.S. hUS,t. This identification strategy

exploits the fact that whether a U.S.-produced movie is released in digital in France at time t

is at least partly a function of whether the movie was released in digital in the U.S. However,

digital release decisions in the U.S. were arguably not affected by the installed base of screens

50See Table 1 for the different categories of market size.51Smaller theaters might be able to delay their conversion longer because they screen fewer movies overall.

Ignoring this mechanism would affect the estimation results by predicting lower profits for smaller theaters.

34

Page 36: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

in France (Appendix B discusses the validity of this instrument in details). This two-step

approach addresses the issue of hierarchical variation in a way similar to that suggested by

Donald and Lang (2007) for the treatment effect literature.

The estimation approach differs from that of Gowrisankaran, Park, and Rysman (2014)

in two respects: in their paper, the time dummy coefficients capture the present discounted

value of the expected stream of future benefits from the complementary goods market,

whereas, in the current paper, these coefficients enter the (static) single-period profits of

theaters and, therefore, do not include the future benefits from digital movie availability;

second, their paper assumes a general auto-regressive structure (in particular with one lag)

on the time coefficients, allowing for “accumulated capital” of complementary goods. By

contrast, the current paper assumes that movies are short-lived from the perspective of the-

aters. This assumption seems reasonable with a period of 6 months and a focus on miniplexes

and multiplexes which rules out smaller “continuation” theaters.

6.5 Estimation results

6.5.1 First-step estimates

Theaters’ adoption-policy function. The nonstationary oblivious CCP are estimated using a

flexible reduced form, via an ordered probit model. To further control the size of the state

space, theaters’ strategy space (the number of screens that can be converted) is restricted

to lie on a grid. More precisely, miniplexes (theaters with 4 to 7 screens) are assumed to

adopt on the space sit/Sτ (i) ∈ {0, 14 ,12, 34, 1}, whereas multi- and megaplexes (theaters with

8 screens or more) are assumed to adopt on the space sit/Sτ (i) ∈ {0, 18 ,28, ..., 7

8, 1}. Figure 4

shows kernel density estimates of the within-firm adoption rates for miniplexes (panel (a))

and multi/megaplexes (panel (b)), conditional on partial adoption.52 For miniplexes, the

density has three identifiable modes. Additionally, 93.1% of observations are within 5%

of a grid point in {14, 12, 34}. For multi- and megaplexes, 96.2% of observations are within

5% of a grid point in {18, 28, ..., 7

8}. Overall, the coarsening of the state space reduces its

dimensionality without imposing too strong a restriction on firms’ admissible states.

Because theaters cannot divest and roll back the film technology, a firm cannot transition

to lower states. For instance, a four-screen theater with sit/Sτ (i) = 3/4 can only transition

to sit+1/Sτ (i) ∈ {3/4, 1}. In this sense, next period’s possible states depend on the firm’s

adoption rate in the current period. This dependence is accounted for in constructing the

likelihood (see Appendix C.2).

52A gaussian kernel is used and the bandwidth is selected using the biased cross-validation approach ofScott and Terrell (1987).

35

Page 37: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

In a nonstationary oblivious equilibrium, theaters’ adoption decision are a function of the

firm state xit = (τ (i), sit) and time. A theater’s share of screens converted to digital between

t and t + 1, denoted ait/Sτ (i), is explained by the number of screens in the theater (and its

square), the share of digital screens in the theater in period t, whether the theater is an art

house, competitors’ total number of screens, a polynomial in time, and its interaction with the

theater’s art house status. The polynomial in time captures the effect of the deterministic—

under nonstationary oblivious beliefs—processes {ht, pt, hUS,t}, which all vary in the time

series. A second specification augments the model by including market dummies to control

for market size. A third specification includes both market dummies and theater-chain

dummies for the three major French theater chains (Gaumont-Pathe, CGR, and UGC).

Finally, a fourth specification also controls for interactions between theater size Sτ (i) and all

other variables.

Table 5 presents the estimates of the ordered probit model under the four specifications.

In specification (4), where all variables are interacted with Sτ (i), interaction terms are omit-

ted. Marginal effects are, however, similar to the first three specifications. For instance, the

effect of the time coefficient for an 8-screen theater (on the scale of the linear predictors) is

0.787.

As expected, across the four specifications, the time effects are positive. This trend

reflects decreasing adoption costs and increasing digital movie availability. Larger theaters

are more likely to adopt, but the marginal effect is decreasing. The share of a theater’s

screens already converted to digital is negatively related to further adoption. This finding

is expected because, given a share of digital movies, theaters lagging in their adoption (low

sit/Sτ (i)) have a greater incentive to adopt.

Competitors’ total number of screens does not significantly impact a theater’s likelihood

of adoption. Theaters located in Paris are more likely to adopt than theaters located in the

small urban areas with fewer than 20, 000 inhabitants and rural areas, although the effect

is not significant. Among the chain dummies, CGR theaters are more likely to adopt than

single theaters or theaters belonging to smaller chains.

Appendix C.3 compares model predictions for the share of digital screens to actual shares

in the data. Overall, and given the limitations imposed by the parametric specification of

the policy function, the model captures the main trends in the data well. The rest of the

analysis uses specification (3) based on the AIC.

6.5.2 Second-step estimates

This section presents estimation results for the profit per digital screen (the baseline πd(τ (i))

and decay δ(τ (i), ht)). These components are combined, in equation (27), to obtain theaters’

single-period operating profits.

36

Page 38: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Table 6 shows estimates of the parameters entering the single-period profits per digital

screen relative to a film screen. First, there is heterogeneity in profits across theaters.

Profits per screen are increasing in theater size. This is the case if there are scale economies

in operating a theater, and these economies of scale are larger under digital technology than

under 35mm film. Art house status, market size, and competitors’ screens do not significantly

affect profits per digital screen. This is expected because these variables affect only revenues

not the cost of operating, whereas πd(τ (i)) reflects the cost-reduction from converting a

screen to digital.

The decay function is decreasing in the availability of digital movies ht. As ht increases, a

theater’s choice set of digital movies increases so profits per digital screen increase. Theater

size doesn’t significantly impact the decay. One would expect the decay to be larger for large

theaters, fixing digital movie availability, because larger theaters screen more movies.

Profits per digital screen relative to film implied by the structural estimates have the

correct order of magnitude. Figure 5 shows the distribution of single-period profits per

digital screen across theaters (over a period of 6 months) predicted by the model. Profits

per digital screen are between e2, 603 and e6, 423, with a mean of e4, 928 and median of

e4, 986. These results are contrasted with estimates of projectionist’s wages. Collective

bargaining agreements set a projectionist’s minimum monthly salary at e1, 500 over the

period of interest, or e9, 000 over a period of 6 months. The latter figure is an upper bound

on cost-reductions per screen because projectionists are replaced with lower-wage workers.

Therefore, profits levels implied by the structural model are economically plausible.

7 Counterfactual Analysis

This section uses the estimated model to quantify the magnitude of excess inertia and the

role of adoption externalities in the transition from 35mm film to digital cinema. Two coun-

terfactual adoption paths are simulated for the industry: in the first benchmark, a social

planner maximizes aggregate theater profits taking as given upstream distributors’ reaction

function; in the second benchmark, the social planner mandates coordination on digital dis-

tribution for all movie–periods, and maximizes aggregate theater profits. Results point to a

sizeable surplus loss, 41% of which is due to externalities among downstream theaters. The

last subsection analyzes distributors’ costs of multi-homing and provide evidence that they

benefit under the two counterfactual scenarios.

37

Page 39: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

7.1 Planner’s benchmark

By converting to digital, theaters raises distributors’ incentive to release movies in digital.

This increased number and variety of digital movies is a positive externality on other theaters

equipped with digital projectors.53 Additionally, by increasing digital movie availability,

non-adopters’ option value from digital technology also increases. Such increased profits and

option values are not internalized by the adopter, and as a result, industry profits are not

maximized under the non-cooperative market outcome.

In this benchmark, the social planner chooses a sequence of adoption decisions for each

theater {at = a1t, ..., aIt}t≥0 to maximize the discounted sum of aggregate theater profits

max{at}t≥1

E

[∑i∈I

∞∑t=1

βt−1Π(ait,xt, pt, hUS,t, εit)

]s.t. ht = Γ(xt, hUS,t) (30)

where the expectation is taken with respect to the sequence of theaters’ idiosyncratic

shocks and the planner takes distributors’ equilibrium best response Γ as given. The initial

industry state x1 is fixed to si1 = 0 for all i.

The Social Planner’s problem (30) is computationnally hard to solve because of the large

number of players in the industry: the industry state space is high-dimensional, and an

industry adoption policy rule is, in general, a function of the realization of the full vector of

adoption shocks {εit}i∈I ∈ RI , making the policy-rule space high-dimensional as well.

I deal with these computational issues in two steps. First, I replace the high-dimensional

industry state vector xt with its first moment (the aggregate share of digital screens in the

industry st/S) and parameterize distributors’ reaction function (equation (3)) as a function

of the aggregate share of digital screens in the industry and the share of digital movies in the

U.S. This approximation helps reduce the dimension of the state space and Γ mapping.54 I

use the following reduced-form specification

ht =(stS

)ηshηhUS,t (31)

The parameters (ηs, ηh) are calibrated using the short-panel of aggregate variables in the

data. The robustness of the results is also evaluated under alternative choices for (ηs, ηh).

Second, I leverage the fact that the object of interest is the maximized value of the

planner’s objective function (discounted sum of industry profits), not the social planner’s

53Without multi-homing, there would be a negative externality on non-adopters, as the fraction of moviesreleased in 35mm film would decrease.

54Replacing the industry state by moments has been proposed as an approximation method to MPE byIfrach and Weintraub (2017). This assumption is restrictive in that the social planner uses only a fractionof the data available (i.e., st

S instead of xt)).

38

Page 40: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

policy rule per se. Therefore, instead of solving the planner’s full dynamic decision problem,

I search for the maximum value of the objective function over the space of feasible industry

adoption paths {at = a1t, ..., aIt}t≥1. To do so, I generate a large number of random industry

adoption paths and select the path that maximizes the objective function in problem (30).

The simulated paths are generated by adding perturbations to the equilibrium cut-offs in each

theater’s CCP. For a given vector of perturbations {ξi}i∈I , the simulated path is obtained

using the following procedure.

1. Initialize the industry at x1 such that si1 = 0 for all i.

2. Draw firm specific adoption shocks {εi1}i∈I and corresponding adoption decisions dic-

tated by each firm’s perturbed CCP.

3. Calculate single-period industry profits∑i∈I

Π(ai1,x1, p1, hUS,1, εi1).

4. Update the current state y1 = (x1, p1, hUS,1) according to the adoption decisions and

transition of the exogeneous processes to next period state: y2. In particular,

h2 =(s2S

)ηshηhUS,2

and

si2 = si1 + ai1

5. Repeat steps 1-4 for T periods.

Given a vector of perturbations {ξi}i∈I , the objective function is obtained by averaging

L simulated paths. The paths have length T = 30 periods (or 15 years).55 For t ≥ T , the

industry is its long-run steady state and profits are constant (sit = Sτ (i) and ht = 1). An

estimate of the objective function is obtained as

1

L

L∑l=1

{∑i∈I

∞∑t=1

βt−1Πl(ait,xt, pt, hUS,t, εit)

}(32)

where Πl(ait,xt, pt, hUS,t, εit) is the single-period profit of firm i in simulation l at period t,

when the firm follows the adoption strategy obtained by adding the perturbation ξi to its

equilibrium CCP. In practice, I draw K = 15, 000 perturbation vectors and compute the

55In the equilibrium market outcome, firms have an incentive to free-ride on other firms’ adoption, sothe speed of digital conversion is slower than what the profit-maximizing industry planner would choose.Therefore, it is expected that the planner will have switched the industry to digital sooner than in thenon-cooperative market equilibrium.

39

Page 41: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

objective function for each vector {ξi}i∈I . I select the perturbation vector that yields the

highest value of the estimated objective function (32). Let {aPt }t≥1 be the planner’s policy

rule corresponding to the optimal perturbation vector.

Figure 6a shows an example of simulated industry adoption paths. Each simulated path

corresponds to a perturbation vector {ξi}i∈I .56 Figure 6b plots the effect of 500 additional

perturbation draws on the maximum value of the estimated objective function (expression

(32)) as a function of the total number of perturbation draws. Beyond 10, 000 perturbation

draws, the incremental improvement in the maximized objective function is less than 2, 000

e (i.e. less than 0.002% of aggregate industry profits).

As a robustness check, the social planner’s dynamic problem (30) is explicitly solved

after reducing the dimension of the state and strategy spaces. The results are presented in

Appendix D. The optimum policy rule obtained yields an industry diffusion path that is

consistent with {aPt }t≥1.

7.2 Coordination benchmark

To quantify the magnitude of excess inertia in the upstream distribution market, I assume

that the planner mandates multi-homing for all movies from the first period on and maximize

aggregate theater profits. In other words, ht = 1, ∀t. The social planner chooses a sequence

of adoption decisions for each theater {at = a1t, ..., aIt}t≥1 to maximize the discounted sum

of aggregate theater profits

max{at}t≥1

E

[∑i∈I

∞∑t=1

βt−1Π(ait, pt, εit)

]s.t. ht = 1 (33)

where the expectation is taken with respect to the sequence of theaters’ idiosyncratic shocks

and I omit the dependence of single-period profits on (xt, hUS,t) because ht is set to one.

The initial industry state x1 is fixed to si1 = 0 for all i. Because the value of ht is fixed,

network effects (in hardware adoption) are shut-down, and the problem is equivalent to

profits maximization by each theater: I solve a single-agent dynamic decision problem for

each theater i, given ht = 1,∀t.Each single-agent decision problem is an optimal stopping problem. Indeed, when multi-

homing is mandated for all movies-periods, the marginal benefit from converting a screen

to digital is constant across screens. As a consequence, theaters make a 0 − 1 adoption

decision. Decreasing adoption costs {pt}t≥1 and exogenous theater characteristics are the

56And one particular sequence of theaters’ idiosyncratic shocks. More precisely, for each firm i, I transformthe equilibrium cut-offs κ in firms CCP (estimated by ordered probit) into κ + ξi where ξi is drawn fromN (X,Y ) where X ∼ N (−1.5, 2) and Y ∼ U [0.5, 2].

40

Page 42: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

factors driving the diffusion process. Starting from T , each problem is solved by backward

induction.

Let {aCt }t≥1 be the solution of problem (33).

7.3 Counterfactual Results

Figure 7a and 7b show the diffusion paths under the non-cooperative market outcome and

the two benchmarks for digital screens (left) and digital movies (right).57 In the planner’s

benchmark, I use the parameters estimates (ηs, ηh) = (0.52, 0.21) obtained through calibra-

tion (by ordinary least squares, Equation (31)). I evaluate the sensitivity of the results with

respect to these parameters below.

The diffusion of digital is faster under the two counterfactual scenarios: by 0.8 to 3 years

for the time to 10% adoption, and 0.3 to 0.5 years for the time to 90% adoption. Addition-

ally, the diffusion path is steeper in the planner’s benchmark relative to the coordination

benchmark because, in the former, digital movie availability is endogenously determined by

the installed base of digital screen. The planner must build up the installed base of digital

screen to incentivize distributors to switch.

Next, I compare theaters’ aggregate profits under the three scenarios. Let {aMt }t≥1 be

the vector of adoption decisions in the non-cooperative market outcome. For given sequences

of adoption decisions {at}t≥1 and digital movie availability {ht}t≥1, define aggregate theater

surplus (net of adoption costs) as

TS({at}t≥1, {ht}t≥1) = E

[∑i∈I

∞∑t=0

βt(πτ (i)(sit, ht)− ait(pt + εit)

)](34)

Denote aggregate theater surplus under the non-cooperative market outcome, planner’s

benchmark, and coordination benchmark by TSM , TSP , and TSC , given adoption decisions

{aMt }t≥1, {aPt }t≥1, and {aCt }t≥1 and corresponding sequences of digital movies availabilities

{ht}t≥1.Table 7 shows industry profits under the three scenarios. Profits from film are normal-

ized to zero, so these figures are relative to the status-quo with no conversion to digital.

Industry profits net of adoption costs are 76.18 million euros under the market outcome,

82.25 million euros under the planner’s benchmark, and 90.66 million euros under the coor-

dination benchmark. The difference TSP − TSM is attributed to downstream excess inertia

(specifically, adoption externalities across theaters), and TSC−TSP to excess inertia among

57In all three scenarios, I compute the average over the sequence of firm idiosyncratic adoption shocks.

41

Page 43: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

upstream distributors.58 Overall, excess inertia causes theaters’ surplus to be 16% lower

than under full coordination. Adoption externalities across downstream firms explain 41%

of this surplus loss.

To verify the sensitivity of the results to the parameters governing the specification of ht

(Equation (31)), the planner’s problem (30) is solved repeatedly under alternative choices of

(ηs, ηh). I vary each parameter from 0.2 to 0.8 and calculate counterfactual industry profits

under the planner’s benchmark. Low values of ηs (respectively, ηh) correspond to more elastic

supply of digital movies with respect to digital screens (respectively, digital releases in the

U.S.). Net industry profits are between 78.76 and 83.32 million euros (i.e., 4.2% lower to

1.3% higher than net industry profits under the baseline estimates of (ηs, ηh) = (0.52, 0.21)).

The share of surplus loss explained by adoption externalities across downstream firms varies

between 17% and 49%. The lower figure obtains when digital movies are little responsive

to either stS

or hUS,t. If the reduced-form estimates for the elasticity of st with respect to ht

(presented in Appendix B and found equal to 0.806) are of any indication, the lower figure is

arguably a conservative lower bound for the surplus loss due to adoption externalities among

theaters.

7.4 Distributors’ surplus

The previous section focuses on theaters’ surplus. Does distributors’ surplus increase under

the two benchmarks? A priori, it is not clear: multi-homing is more prevalent and may be

costlier than 35mm film. This section shows that, given estimates of average distribution

costs under film and digital, distributors also benefit under the two benchmarks.

Digital distribution affects only printing, shipping and storage costs of movie copies

(PSS). Other costs incurred by distributors (e.g., advertising space purchase, advertising

content creation, promotional events) remain constant.

Multi-homing may be costlier than film distribution because of economies of scale in

PSS. When multi-homing, distributors split their production over two cost functions (film

and digital), and in doing so, lose some of the scale economies. Denote by Cf (Q) (resp.

Cd(Q)) the total PSS costs of Q film copies (resp. Q digital copies); and ACf (Q) (resp.

ACd(Q)) the corresponding decreasing average cost curves. Digital distribution is more

efficient so Cd(Q) < Cf (Q). For a distributor releasing Q copies of a movie, multi-homing

(with a fraction κ of digital copies and (1−κ) of film copies) is costlier than film distribution

if

Cf (Q) ≤ Cd(κQ) + Cf ((1− κ)Q)

58Upstream excess inertia is estimated as a residual and can arise because of coordination failure andadoption externalities across distributors.

42

Page 44: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

or equivalently

ACf (Q) ≤ κACd(κQ) + (1− κ)ACf ((1− κ)Q)

Whether the previous inequalities hold depends on the value of κ and the extents of scale

economies in film and digital.

To compare distributors’ costs under the market outcome and benchmarks, I collect data

on average PSS costs per copy for film and digital, from the CNC. The CNC publishes a

yearly report on the French movie distribution industry.59 The reports contain a breakdown

of total (industry-level) distribution costs into PSS costs, advertising space purchase, adver-

tising content creation, and promotional costs. In particular, this information is available by

number of copies (categorical variable). Table 8 shows average costs per copy for film and

digital by total number of copies. Digital distribution is about 80% less costly than film dis-

tribution and the average cost per copy is decreasing in the number of copies. Shortcomings

of the data are that it covers only French distributors, and for the “100-200 copies” category,

costs per copy on film seem to increase.

I use the data in Table 8 to compute the discounted sum of distribution costs for a

distributor releasing one movie per period on Q copies

C(Q) = Q×T∑t=0

βt{ht

(stSACd(

stSQ) + (1− st

S)ACf ((1−

stS

)Q))

+ (1− ht)ACf (Q)}

under the market outcome, planner’s benchmark, and coordination benchmark. The

first term inside the brackets corresponds to the average cost per copy when multi-homing

(κ = stS

). The second term inside the brackets corresponds to the average cost per copy when

distributing on film.

Figure 8 plots the average total cost C(Q)/Q for various values of the number of copies Q.

The results indicate that distribution costs are lower under the two benchmarks compared

to the market outcome for a typical range of Q: the cost-reduction from digital more than

compensate for the loss in scale due to multi-homing.

8 Conclusion

Industries with network effects tend to coordinate on a single technology, the standard, to

exploit the benefits of a larger network. However, once bound together by the benefits of

59The 2017 report, for instance, can be accessed at: https://www.cnc.fr/cinema/etudes-et-rapports/etudes-prospectives/les-couts-de-distribution-des-films-dinitiative-francaise-en-2017_

959089.

43

Page 45: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

the standard, firms may become reluctant to switch to new and better technologies as they

become available. In particular, adoption externalities create a wedge between the private

and social benefits from adoption. This paper studies empirically whether a new technology

standard can diffuse efficiently in a decentralized market. I exploit rich firm level data in the

French movie industry to assess the magnitude of excess inertia in the switch from 35mm to

the digital cinema standard.

To do so, I construct a dynamic game of digital hardware adoption by theaters and

digital movie supply by distributors. Using data on theaters’ adoption decisions and the

cross-sectional variation in theater and market characteristics, I estimate theaters’ payoff

from converting their screens to digital. I simulate two counterfactuals: first, a planner

maximizes aggregate theater profits taking as given upstream distributors reaction function;

second, the planner mandates coordination on digital distribution upstream and maximizes

aggregate theater profits.

The counterfactuals show that market forces did not provide enough incentives for an

efficient switch from 35mm to digital. Industry profits are lower under the non-cooperative

market outcome relative to under coordination. Additionally, 59% of the surplus loss can

be ascribed to excess inertia in the upstream distribution market, whereas the rest is due to

adoption externalities in the downstream exhibition market.

The results suggest that policy intervention in network industries can be beneficial, in

particular, to coordinate firms’ adoption of new standards. Moreover, the most effective

instrument (between adoption subsidies or mandates) depends on the source of inefficiency

and the side of the market that exhibits excess inertia. In the case of digital cinema, the

analysis indicates that inertia originates mainly from the upstream distribution market.

Recent interventions in other industries indicate that policy-makers have been taking a more

active role in coordinating standard adoption in hardware-software markets: a particularly

topical industry are consumer electronics, where E.U. states have been considering mandating

USB-C charging ports on all smartphone manufacturers (Abdul (2020)).

The current study can be extended in several directions; an important one is noted. The

digital manufacturing sector is modelled as competitive (“non-sponsored” technology). Re-

laxing this assumption would nuance the empirical findings. Indeed, a monopoly or oligopoly

in digital projector manufacturing may help align the market outcome with the optimal

diffusion path. For example, penetration pricing internalizes network effects across peri-

ods (between early and late adopters), and contingent contracts can be used to solve the

coordination problem. However, due to market power, the overall effect on theaters and

distributors’ surplus is ambiguous.

44

Page 46: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

References

Abdul, G. (2020). “A Common Charger for All Phones? The E.U. Is on the Case”. New York

Times (January 17, 2020) (cit. on p. 44).

Ackerberg, D. A. and G. Gowrisankaran (2006). “Quantifying equilibrium network external-

ities in the ACH banking industry”. The RAND Journal of Economics 37.3, pp. 738–761

(cit. on p. 5).

Aguiar, L. and J. Waldfogel (2018). “Quality Predictability and the Welfare Benefits from

New Products: Evidence from the Digitization of Recorded Music”. Journal of Political

Economy 126.2, pp. 492–524 (cit. on p. 8).

Aguirregabiria, V. and P. Mira (2007). “Sequential Estimation of Dynamic Discrete Games”.

Econometrica 75.1, pp. 1–53 (cit. on pp. 2, 19, 23, 24, 26).

Andrew Hanssen, F. (2010). “Vertical integration during the hollywood studio era”. Journal

of Law and Economics 53.3, pp. 519–543 (cit. on p. 5).

Augereau, A., S. Greenstein, and M. Rysman (2006). “Coordination versus Differentiation

in a Standards War: 56K Modems”. The RAND Journal of Economics 37.4, pp. 887–909

(cit. on p. 5).

Bajari, P., L. Benkard, and J. Levin (2007). “Estimating dynamic models of imperfect com-

petition”. Econometrica 75.5, pp. 1331–1370 (cit. on p. 23).

Bajari, P., V. Chernozhukov, H. Hong, and D. Nekipelov (2015). “Identification and Efficient

Semiparametric Estimation of a Dynamic Discrete Game” (cit. on p. 29).

Battisti, G. and P. Stoneman (2005). “The intra-firm diffusion of new process technologies”.

International Journal of Industrial Organization 23.1-2, pp. 1–22 (cit. on p. 5).

Bordwell, D. (2013). Pandora’s Digital Box: Films, Files, and the Future of Movies. Irvington

Way Institute Press (cit. on p. 8).

Chou, C.-f. and O. Shy (1990). “Network Effect without Network Externalities”. Interna-

tional Journal of Industrial Organization 8.2, p. 259 (cit. on p. 4).

Church, J. and N. Gandal (1992). “Network Effects, Software Provision, and Standardiza-

tion”. The Journal of Industrial Economics 40.1, pp. 85–103 (cit. on p. 4).

Church, J., N. Gandal, and D. Krauske (2008). “Indirect Network Effects and Adoption

Externalities”. Review of Network Economics 7.3, pp. 337–358 (cit. on p. 1).

Clements, M. T. and H. Ohashi (2005). “Indirect Network Effects and the Product Cycle:

Video Games in the U.S., 1994-2002”. The Journal of Industrial Economics 53.4, pp. 515–

542 (cit. on p. 4).

45

Page 47: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Corts, K. S. and M. Lederman (2009). “Software exclusivity and the scope of indirect net-

work effects in the U.S. home video game market”. International Journal of Industrial

Organization 27.2, pp. 121–136 (cit. on p. 4).

Davis, P. (2006a). “Measuring the Business Stealing , Cannibalization and Market Expansion

Effects of Entry in the U . S . Motion Picture Exhibition Market”. The Journal of

Industrial Economics 54.3, pp. 293–321 (cit. on pp. 5, 8).

— (2006b). “Spatial competition in retail markets: Movie Theaters”. Rand Journal of Eco-

nomics 37.4, pp. 964–982 (cit. on pp. 5, 14).

Donald, S. G. and K. Lang (2007). “Inference with Difference-in-Differences and Other Panel

Data”. The Review of Economics and Statistics 89.2, pp. 221–233 (cit. on p. 35).

Doraszelski, U. and M. Satterthwaite (2010). “Computable Markov-perfect industry dynam-

ics”. RAND Journal of Economics 41.2, pp. 215–243 (cit. on p. 21).

Dube, G. J. Hitsch, and Chintagunta (2010). “Tipping and Concentration in Markets with

Indirect Network Effects”. Marketing Science 29.2, pp. 216–249 (cit. on p. 4).

Efron, B. and R. J. Tibshirani (1993). An Introduction to the Bootstrap. Monographs on

Statistics and Applied Probability 57. Boca Raton, Florida, USA: Chapman & Hall/CRC

(cit. on p. 66).

Einav, L. (2007). “Seasonality in the U. S. motion picture industry”. The RAND Journal of

Economics 38.1, pp. 127–145 (cit. on p. 5).

— (2010). “Not all rivals look alike: Estimating an equilibrium model of the release date

timing game”. Economic Inquiry 48.2, pp. 369–390 (cit. on p. 5).

Farrell, J. and P. Klemperer (2007). “Chapter 31 Coordination and Lock-In: Competition

with Switching Costs and Network Effects”. Handbook of Industrial Organization 3.06,

pp. 1967–2072. arXiv: arXiv:1011.1669v3 (cit. on p. 4).

Farrell, J. and G. Saloner (1985). “Standardization, Compatibility, and Innovation”. The

RAND Journal of Economics 16.1, pp. 70–83 (cit. on p. 1).

— (1986). “Installed Base and Compatibility: Innovation, Product Preannouncements, and

Predation”. The American Economic Review 76.5, pp. 940–955 (cit. on p. 1).

Farrell, J. and T. Simcoe (2012). “Choosing the rules for consensus standardization”. RAND

Journal of Economics 43.2, pp. 235–252 (cit. on p. 4).

Fuentelsaz, I., J. Gomez, and Y. Polo (2003). “Intrafirm Diffusion of Technologies: An Em-

pirical Application”. Research Policy 32, pp. 533–551 (cit. on p. 5).

Gandal, N., M. Kende, and R. Rob (2000). “The Dynamics of Technological Adoption in

Hardware / Software Systems: The Case of Compact Disc Players”. The RAND Journal

of Economics 31.1, pp. 43–61 (cit. on p. 4).

46

Page 48: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Gil, R. (2009). “Revenue Sharing Distortions and Vertical Integration in the Movie Industry”.

The Journal of Law, Economics, & Organization 25.2 (cit. on p. 5).

Gil, R., Houde, and Y. Takahashi (2015). “Preemptive Entry and Technology Diffusion in

the Market for Drive-in Theaters” (cit. on p. 6).

Goettler, R. L. and B. R. Gordon (2011). “Does AMD Spur Intel to Innovate More?” Journal

of Political Economy 119.6, pp. 1141–1200 (cit. on p. 5).

Goolsbee, A. and P. J. Klenow (1999). “Evidence on Learning and Network Externalities in

the Diffusion of Home Computers” (cit. on p. 4).

Gowrisankaran, G., C. Lucarelli, P. Schmidt-Dengler, and R. Town (2010). “Government

policy and the dynamics of market structure: Evidence from Critical Access Hospitals”.

Eleventh CEPR Conference on Applied Industrial Organization (cit. on p. 25).

Gowrisankaran, G., M. Park, and M. Rysman (2014). “Measuring Network Effects in a

Dynamic Environment” (cit. on pp. 4, 34, 35).

Gowrisankaran, G. and J. Stavins (2004). “Network Externalities and Technology Adoption:

Lessons from Electronic Payments”. The RAND Journal of Economics 35.2, p. 260.

arXiv: arXiv:1011.1669v3 (cit. on pp. 4, 63).

Griliches, Z. and J. A. Hausman (1986). “Errors in variables in panel data”. Journal of

Econometrics 31.1, pp. 93–118 (cit. on p. 63).

Hotz, V. J., R. A. Miller, S. Sanders, and J. Smith (1994). “A Simulation Estimator for

Dynamic Models of Discrete Choice”. The Review of Economic Studies 61.2, pp. 265–289

(cit. on p. 24).

Hotz, V. J. and R. A. Miller (1993). “Conditional Choice Probabilities and the Estimation

of Dynamic Models”. The Review of Economic Studies 60.3, p. 497 (cit. on pp. 2, 24).

Ifrach, B. and G. Weintraub (2017). “A framework for dynamic oligopoly in concentrated

industries”. Review of Economic Studies 84.3, pp. 1106–1150 (cit. on p. 38).

Igami, M. (2017). “Estimating the Innovator’s Dilemma: Structural Analysis of Creative

Destruction in the Hard Disk Drive Industry, 1981–1998”. Journal of Political Economy

125.3, pp. 798–847 (cit. on p. 5).

Igami, M. and K. Uetake (2017). “Mergers, Innovation, and Entry-Exit Dynamics: The

Consolidation of the Hard Disk Drive Industry, 1996-2015” (cit. on p. 5).

Karaca-Mandic, P. (2003). “Network Effects in Technology Adoption: The Case of DVD

Players” (cit. on p. 4).

Katz, M. L. and C. Shapiro (1985). “Network Externalities, Competition, and Compatibil-

ity”. American Economic Review 75, pp. 424–440 (cit. on p. 1).

Kopp, P. (2016). Le cinema a l’epreuve des phenomenes de concentration (cit. on p. 9).

47

Page 49: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Lee, R. S. (2013). “Vertical Integration and Exclusivity in Two-Sided Markets”. American

Economic Review 103.7, pp. 2960–3000 (cit. on p. 5).

Levin, S. G., S. L. Levin, and J. B. Meisel (1992). “Market Structure , Uncertainty , and

Intrafirm Diffusion : The Case of Optical Scanners in Grocery Stores”. The Review of

Economics and Statistics 74.2, pp. 345–350 (cit. on p. 5).

Macher, J., N. H. Miller, and M. Osborne (2017). “Finding Mr. Schumpeter: An Empirical

Study of Competition and Technology Adoption” (cit. on p. 5).

Magnac, T. and D. Thesmar (2002). “Identifying Dynamic Discrete Decision Processes”.

Econometrica 70.2, pp. 801–816 (cit. on p. 29).

Mansfield, E. (1963). “Intrafirm Rates of Diffusion of an Innovation”. The Review of Eco-

nomics and Statistics 45.4, pp. 348–359 (cit. on p. 5).

Manski, C. F. (1993). “Identification of Endogenous Social Effects: The Reflection Problem”.

The Review of Economic Studies 60.3, p. 531. arXiv: arXiv:1011.1669v3 (cit. on p. 4).

Nabseth, L. and G. F. Ray (1974). “The Diffusion of new industrial processes: an interna-

tional study.” Cambridge: University Press (cit. on p. 5).

Ohashi, H. (2003). “The Role of Network Effects in the US VCR Market, 1978–1986”. Journal

of Economics & Management Strategy 12.4, pp. 447–494 (cit. on p. 5).

Pakes, A., M. Ostrovsky, and S. Berry (2007). “Simple estimators for the parameters of

discrete dynamic games”. RAND Journal of Economics 38.2, pp. 373–399 (cit. on p. 23).

Pesendorfer, M. and P. Schmidt-Dengler (2008). “Asymptotic least squares estimators for

dynamic games”. Review of Economic Studies 75.3, pp. 901–928 (cit. on pp. 2, 19, 23–26).

Rao, A. and W. R. Hartmann (2015). “Quality vs. Variety: Trading Larger Screens for More

Shows in the Era of Digital Cinema”. Quantitative Marketing and Economics (cit. on

p. 6).

Romeo, A. A. (1975). “Interindustry and Interfirm Differences in the Rate of Diffusion of an

Innovation”. The Review of Economics and Statistics 57.3, pp. 311–319 (cit. on p. 5).

Rust, J. (1994). “Structural Estimation Decision Processes”. Chapter 51 1, pp. 3139–3139

(cit. on p. 29).

Ryan, S. P. and C. Tucker (2012). “Heterogeneity and the dynamics of technology adoption”.

Quantitative Marketing and Economics 10.1, pp. 63–109 (cit. on p. 5).

Rysman, M. (2004). “Competition Between Networks: A Study of the Market for Yellow

Pages”. Review of Economics and Statistics 71.2, pp. 483–512 (cit. on p. 5).

— (2019). “The reflection problem in network effect estimation”. Journal of Economics &

Management Strategy 28.1, pp. 153–158. eprint: https://onlinelibrary.wiley.com/

doi/pdf/10.1111/jems.12301 (cit. on p. 4).

48

Page 50: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Rysman, M. and T. Simcoe (2008). “Patents and the performance of voluntary standard-

setting organizations”. Management Science 54.11, pp. 1920–1934 (cit. on p. 4).

Schmidt-Dengler, P. (Dec. 2006). The Timing of New Technology Adoption: The Case of

MRI. 2006 Meeting Papers 3. Society for Economic Dynamics (cit. on p. 5).

Scott, D. W. and G. R. Terrell (1987). “Biased and unbiased cross-validation in density

estimation”. Journal of the American Statistical Association 82.400, pp. 1131–1146 (cit.

on p. 35).

Simcoe, T. (2012). “Standard Setting Committees: Consensus Governance for Shared Tech-

nology Platforms”. The American Economic Review 102.1, pp. 305–336 (cit. on p. 4).

Srisuma, S. (2013). “Minimum distance estimators for dynamic games”. Quantitative Eco-

nomics 4.3, pp. 549–583 (cit. on pp. 18, 19, 24).

Stock, J. H. and M. Yogo (2005). “Testing for weak instruments in linear IV regression”. Iden-

tification and Inference for Econometric Models: Essays in Honor of Thomas Rothenberg.

Vol. 2001, pp. 80–108 (cit. on p. 64).

Takahashi, Y. (2015). “Estimating a war of attrition: The case of the US movie theater

industry”. American Economic Review, pp. 2204–2241 (cit. on p. 5).

Tucker, C. (2008). “Identifying Formal and Informal Influence in Technology Adoption with

Network Externalities”. Management Science 54.12, pp. 2024–2038. eprint: https://

doi.org/10.1287/mnsc.1080.0897 (cit. on p. 4).

Waldfogel, J. (2016). “Cinematic Explosion: New Products, Unpredictabilty and Realized

Quality in the Digital Era”. Journal of Industrial Economics 64.4, pp. 755–772 (cit. on

p. 6).

Weintraub, G., C. L. Benkard, P. Jeziorski, and B. Van Roy (2008). “Nonstationary Oblivious

Equilibrium”. Manuscript, Columbia (cit. on pp. 2, 14, 20–22).

Weintraub, G., C. L. Benkard, and B. Van Roy (2008). “Markov Perfect Industry Dynamics

With Many Firms”. Econometrica 76.6, pp. 1375–1411 (cit. on p. 20).

Yang, J., E. Anderson, and B. R. Gordon (2019). “Digitization and Distribution: The Ends

Against the Middle”. SSRN Electronic Journal (cit. on pp. 6, 16).

49

Page 51: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Tables

Table 1: List of variables

Type Variable Description

Movie theater digital screens: sit number of screens converted to digital by time tscreens : Sτ (i) total number of screensart house equals 1 if art house theaterchain identifier of movie theater chaincompetitors screens number of rivals’ screens in local market

Market demand region identifier for the 22 administrative regionsmarket size identifier for: Paris, Paris inner suburbs (”petite couronne”)

Paris outer suburbs (”grande couronne”),urban unit with more than 100 thousands inhabitants,urban unit with 20 to 100 thousands inhabitants,urban unit with less than 20 thousands inhabitants and rural

Digital projector adoption cost: pt list prices for 2K digital projectors(including VPF subsidies and ancillary costs) in 2010 euros

Movie distribution digital movies ht share of movies released in digital format

Note: The first three categories in market size (Paris and suburbs) are colinear with the regional dummy for”Ile-de-France”, the latter is therefore excluded.

Table 2: Summary statistics (theaters with at least 4 screens)

Minimum Mean Maximum St. Deviation

Theaters characteristicsNumber of screens 4 8.12 23 3.74Art house label 0 0.34 1 0.47Number of rival screens 0 8.70 44 11.27

Theaters sizeMiniplexes (4-7 screens) 0 0.53 1 0.50Multi- and Megaplexes (8 screens or more) 0 0.47 1 0.50

Theaters chains (indicator)UGC 0 0.08 1 0.28Gaumont-Pathe 0 0.17 1 0.37CGR 0 0.10 1 0.30

Cost of digital conversion(per screen, 2010 e) 56, 000 69, 862 84, 000 10, 166

50

Page 52: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Table 3: Summary statistics by market size (theaters with at least 4 screens)

Geographic location Theaters Markets Theater size Art house Screens per Screens per(mean) (share) market (mean) market (st.dev)

Urban unit - >100k inhab 174 101 9.17 0.21 15.79 8.43Urban unit - 20 to 100k inhab 126 116 7.02 0.56 7.63 2.99Urban unit - <20k inhab and rural 17 17 6.65 0.65 6.65 3.55Paris 37 15 7.19 0.14 17.73 9.57Paris - inner suburbs 18 18 9.22 0.28 9.22 4.98Paris - outer suburbs 27 26 7.93 0.19 8.23 4.32

National 399 293 8.12 0.34 11.05 7.26

Table 4: Reduced form analysis of theaters’ adoption

Dependent variable: Share of screens converted

(1) (2) (3)Estimate s.e Estimate s.e Estimate s.e

Time 0.671 0.066 0.672 0.066 0.692 0.068Time squared -0.021 0.003 -0.021 0.003 -0.020 0.003Time x Art house -0.211 0.147 -0.219 0.147 -0.287 0.147Time squared x Art house 0.016 0.007 0.016 0.007 0.019 0.007Own screens 0.091 0.031 0.086 0.032 0.070 0.034Own screens squared -0.003 0.001 -0.003 0.001 -0.001 0.002Art house 0.189 0.790 0.205 0.792 0.590 0.791Competitors’ screens -0.004 0.003 -0.006 0.003 -0.001 0.003Competitors’ digital screens 0.006 0.009 0.006 0.009 0.001 0.009Own share of d-screens -1.359 0.111 -1.379 0.112 -1.664 0.118

Market dummiesParis - outer suburbs -0.241 0.170 -0.106 0.172Urban unit - 20k-100k inhabitants -0.088 0.142 -0.090 0.144Urban unit - >100k inhabitants -0.083 0.148 -0.098 0.149Paris - inner suburbs -0.348 0.189 -0.172 0.191Paris -0.020 0.172 0.187 0.174

Chain dummiesGaumont-Pathe -0.154 0.080CGR 0.326 0.094UGC -0.917 0.133

Observations 4, 788 4, 788 4, 788- log Likelihood 2, 186 2, 182 2, 144AIC 4, 392 4, 395 4, 324

Note: For market dummies, the omitted category is “urban unit with fewer than 20k inhabitantsand rural units.” For the chain dummies, the omitted category is “single firm and small chains.”

51

Page 53: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Table 5: Adoption policy function

Dependent variable: Share of screens converted ait/Si

(1) (2) (3) (4)Estimate s.e Estimate s.e Estimate s.e Estimate s.e

Time 0.664 0.065 0.665 0.065 0.691 0.067 -0.243 0.056Time squared -0.021 0.003 -0.021 0.003 -0.020 0.003 0.032 0.004Time x Art house -0.211 0.147 -0.217 0.147 -0.291 0.147 0.129 0.009Time squared x Art house 0.016 0.007 0.016 0.007 0.019 0.007 -0.007 0.001Own screens 0.091 0.031 0.086 0.032 0.069 0.034 -0.219 0.043Own screens squared -0.003 0.001 -0.003 0.001 -0.001 0.002 -0.010 0.002Art house 0.194 0.792 0.202 0.791 0.612 0.787 0.001 0.143Competitors’ screens -0.003 0.002 -0.005 0.003 -0.001 0.003 0.012 0.006Own share of d-screens -1.360 0.111 -1.380 0.112 -1.666 0.118 -0.028 0.039

Market dummiesParis - outer suburbs -0.239 0.170 -0.108 0.172 -0.005 0.126Urban unit - 20k-100k inhabitants -0.086 0.142 -0.091 0.144 -0.004 0.124Urban unit - >100k inhabitants -0.078 0.148 -0.097 0.149 -0.010 0.118Paris - inner suburbs -0.345 0.189 -0.174 0.191 -0.005 0.069Paris -0.017 0.172 0.186 0.174 -0.004 0.079

Market dummiesGaumont-Pathe -0.151 0.080 -0.001 0.135CGR 0.328 0.094 -0.000 0.066UGC -0.914 0.133 -0.007 0.046

Interactions: own screens × other variables

Observations 4, 788 4, 788 4, 788 4, 788- log Likelihood 2, 186 2, 182 2, 144 2, 276AIC 4, 391 4, 394 4, 323 4, 613

Note: Specification (1) is the baseline specification. Specification (2) augments the baseline by including market dummies to controlfor market size. Specification (3) includes both market dummies and theater-chain dummies for the three major French theater chains(Gaumont-Pathe, CGR, and UGC). Specification (4) also controls for interactions between theater size Si and all other variables. Formarket dummies, the omitted category is urban unit with fewer than 20k inhabitants and rural units. For the chain dummies, the omittedcategory is single firm and small chains.

52

Page 54: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Table 6: Structural parameter estimates (in 2010 e)

Estimate s.e

Profit per digital screen: πd(τi)Constant 4, 024.86 959.18Own screens 112.00 51.17Art house 638.23 696.07Competitors’ screens -2.16 5.42

Market dummiesParis - outer suburbs -219.65 278.49Urban unit - 20k-100k inhabitants -150.87 223.03Urban unit - >100k inhabitants -177.61 230.50Paris - inner suburbs -325.05 310.92Paris 246.76 295.56

Chain dummiesGaumont-Pathe -229.05 146.82CGR 622.02 166.90UGC -1, 650.32 430.57

Decay function: δ(τi, ht)Constant 3, 920.84 1, 096.80Own screens 17.03 42.71Digital movies (%) -2, 196.62 444.29

Note: Standard errors are calculated using Nb = 600 bootstrapsamples. For market dummies, the omitted category is urban unitwith fewer than 20k inhabitants and rural units. For the chaindummies, the omitted category is single firm and small chains.The standard deviation of firm shocks σ is set to 10, 000 e.

Table 7: Theaters’ surplus under the three scenarios (in millions, 2010 e)

Market Planner Coordinationoutcome benchmark benchmark

Gross Industry Profits 167.54 193.75 223.14Adoption costs 91.36 111.49 132.48Net Industry Profits 76.18 82.25 90.66Change in Net Profits 0 6.07 14.47Change in Net Profits (in %) 0 7.97 19.00

Note: Profits from film are normalized to 0: all profits are relative to thestatus-quo with no adoption.

53

Page 55: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Table 8: Average printing, shipping, storage cost per copy, by total number of copies (inthousands, 2010 e)

Film DigitalCopies LB UB LB UB

<10 copies 1.14 2.28 0.50 1.0010-50 copies 0.88 2.19 0.13 0.6450-100 copies 0.79 1.58 0.13 0.26100-200 copies 1.01 2.02 0.12 0.23200-400 copies 0.59 1.17 0.09 0.18>400 copies 0.57 1.13 0.08 0.15

Note: Estimate for French distributors. In-puted from aggregate PSS costs divided by # ofmovies per category. For Film, I use costs from2007/2008; for digital use costs from 2015 to2017.

54

Page 56: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Figures

Figure 1: Data and descriptive statistics

(a) Observation times for the diffusion of digitalprojectors

(b) Hardware adoption cost (2010 e)

Figure 2: Within-theater adoption

(a) New digital screens (b) Decomposition: Inter/Intra-firm

Note: “Inter-firm” corresponds to screens installed by new adopters (no digital screensby t − 1). “Intra-firm” corresponds to screens installed by theaters with some digitalscreens by t− 1. Subsidized theaters (3 screens or fewer) excluded.

55

Page 57: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Figure 3: Parameterization of single-period profits

Figure 4: Density estimate of the intra-firm rate of adoption by firm size

(a) Miniplexes (4-7 screens) (b) Multi/Megaplexes (8 screens or more)

Notes: Both density estimates correspond to the distribution of sit/Sτ (i) conditional onsit/Sτ (i) > 0 and sit/Sτ (i) < 1

56

Page 58: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Figure 5: Histogram of πd(τ (i)) (in 2010 e)

Figure 6: Planner’s benchmark

(a) Simulated industry paths (20 perturbationdraws)

(b) Effect of 500 additional perturbation draws onmaximized industry profits

57

Page 59: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Figure 7: Diffusion paths under the three scenarios

(a) Digital screens (b) Digital movies

Figure 8: Average cost per copy by number of copies

Note: Film LB and Digital UB used.

58

Page 60: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

A Theater adoption panel data in France

Table 9 presents the observation dates for the panel of digital projector adoption by data

sources. The table also shows the periodic subsample selected. The periods selected are

such that there is a previous observation period 6 months earlier (in some exeption it is

5 or 7 months). For instance, “May 2012” is selected because the industry is observed on

November 2011. The observation periods selected are represented in blue in Figure 9.

Table 9: Observation times by data source

Date Source Periodic sample

July 2005 Media SallesJanuary 2006 Media SallesJuly 2006 Media SallesJanuary 2007 Media SallesJuly 2007 Media SallesJanuary 2008 Media SallesJuly 2008 Media SallesJanuary 2009 Media SallesJuly 2009 Media SallesSeptember 2009 CinegoJarnuary 2010 Media SallesApril 2010 CinegoJuly 2010 Media SallesJanuary 2011 Media SallesNovember 2011 CinegoJanuary 2012 Media SallesMay 2012 CinegoJune 2013 Cinego

B Evidence of network effects in digital screen adop-

tion

This appendix presents estimates of the causal effect of digital movie availability on digital

screen adoption. Estimating this effect using only data on the French market is challenging

because only one network is observed over a relatively short time horizon and all the varia-

tion in software availability is in the time-series. This appendix complements the analysis in

the main text by employing supplementary data on adoption for a large panel of countries.

59

Page 61: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Figure 9: Share of digitally equipped screens and observation times

The effect is identified from cross-sectional and time-series variation in digital movie avail-

ability. Identification relies on instrumental variables which affect digital movie availability

in country c at time t but are, otherwise, independent of other factors affecting demand for

digital screens in (c, t).

B.1 Aggregate data by country

This section presents the data used in the reduced-form analysis. Aggregate data on the

number of digital screens (quarterly, 2005− 2017) and the number of digital movies released

(semesterly, 2005− 2010) is obtained for 59 countries, from the Cinema Intelligence Service

database of Omdia (previously, IHS Markit), a market research firm.60

This data is complemented by a rich set of variables on domestic movie industries from

the UNESCO Institute of Statistics (UIS).61 The UIS data contains more than 75 variables

on the movie exhibition and distribution industries by country-year over the time period

1995−2017. I extract variables that are relevant controls for digital screen adoption: namely,

average box-office revenue per screen (converted to US$), and the distribution of theater sizes

60https://technology.informa.com/Services/424103/cinema-intelligence-service/Data, section“Digital Cinema”.

61http://data.uis.unesco.org, section “Culture, Feature Films”. One of the source for the UIS data,in particular concerning European countries, are the Cinema Yearbooks used in the main text.

60

Page 62: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Figure 10: Market share of U.S. distributors (and joint ventures) by country

(fraction of miniplexes, multiplexes etc.). Additionally, I extract information on the market

share (of box-office revenue) of U.S. distributors and their joint ventures with domestic

distributors in each country-year. The major U.S. distributors are: United International

Pictures (a venture of Paramount and Universal), Buena Vista Distribution Company (Walt

Disney Studios Motion Pictures), 20th Century Fox, Warner Bros. Entertainment Inc.,

Columbia Pictures (Sony Pictures). Joint ventures of major U.S. and domestic distributors

include, for example, Sandrew Metronome/Warner Bros in Sweden, Roadshow/Warner Bros

in Australia and New Zealand.

The availability of aggregate data by country is shown in Table 10. Due to missing

data for digital movies, the reduced form analysis is conducted on a subset of 29 countries

with complete semesterly information on digital movies and screens between 2005 and 2010.

Figure 10 shows the market share of U.S. distributors (and their joint ventures with domestic

firms) for the subset of countries in 2005.

Software variety and hardware adoption are determined jointly and endogenously, gen-

erating a simultaneity problem: the availability of digital movies could be correlated with

unobserved country-time level factors that shift the demand for digital screens which, in

turn, affects the supply of digital movies. I handle this issue using instrumental variables.

The identification strategy exploits the fact that whether a U.S.-produced movie is released

in digital in country c at time t is at least partly a function of whether the movie was released

61

Page 63: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Table 10: Aggregate data availability by country

Digital screens Digital movies RegressorsRegion Country (quarterly, 2005-2017) (semesterly, 2005-2010) (IV, controls)

Africa Middle-East EgyptIsraelMoroccoSouth AfricaUAE

Asia-Pacific AustraliaChinaHong KongIndiaIndonesiaJapanMalaysiaNew ZealandPhilippinesSingaporeSouth KoreaTaiwanThailandTurkey

Central and Eastern Europe BulgariaCroatiaCzech RepublicEstoniaHungaryLatviaLithuaniaPolandRomaniaRussiaSlovakiaSloveniaUkraine

Central and South America ArgentinaBrazilChileColombiaMexicoVenezuela

North America CanadaUnited States

Western Europe AustriaBelgiumCyprusDenmarkFinlandFranceGermanyGreeceIcelandIrelandItalyLuxembourgNetherlandsNorwayPortugalSpainSwedenSwitzerlandUnited Kingdom

62

Page 64: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

in digital in the U.S.—the more so if distributors in country c are subsidiaries of the major

U.S. studios or joint ventures of domestic distribution companies and U.S. distributors. The

release of digital movies in the U.S., on the other hand, was arguably not affected by the roll-

out of digital screens in relatively small foreign movie markets (e.g., Denmark, Luxembourg,

Mexico etc.).62

I construct a set of country-time-specific instruments for the number of digital movies in

country-time (c, t) by interacting: (i) the number of digital movies in the U.S. in t and (ii)

the market share of U.S. distributors (or joint ventures) in (c, t = 0). Importantly, market

shares are for t = 0 (i.e., at the beginning of the diffusion in 2005) because market shares

of U.S. distributors in t = 1, 2, .. may have been affected by the roll-out of digital screens in

country c (for instance, a positive demand shock for digital screens in (c, t) would raise the

demand for digital movies; if U.S. distributors are the only firms supplying movies in digital

in (c, t), their market share would increase).

B.2 Reduced form analysis

To estimate the effect of digital movie availability on digital screen adoption, I regress the

number of digital screens in country c (excluding the U.S.) at time t, on: the number of movies

released in digital in (c, t), average box-office revenue per screen in (c, t) (in, US$), and the

fraction of multiplexes in (c, t).63 Region-time effects are included to remove unobserved

time-varying factors at the regional level such as adoption costs (see Table 10 for the list of

regions). In addition, I control for time-invariant country-category effects, where I use the

total number of screens in each country to group countries into 6 categories. Standard errors

are clustered at the country-level.

I do not allow for country fixed effects due to the small amount of data. Such country

effects wash out a large part of the correlation between digital releases and screen adoption

coming from cross-sectional variation (this issue is related to the attenuation bias discussed in

Griliches and Hausman (1986)). The use of country-categories instead aims at navigating the

trade-off between removing country-level fixed unobservables from the error term that may

be correlated with the instrument and the attenuated effect of digital movies. I experiment

62This approach is close in spirit to the third identification strategy of Gowrisankaran and Stavins (2004)which measures network effects via the response of small banks to the adoption by small branches of largebanks. The identification strategy used in the current paper is arguably most informative about the effectof U.S.-produced digital movie availability on digital screen adoption.

63Revenues per screen and the fraction of multiplexes are assumed exogenous. For the former, this assump-tion is consistent with digital technology being a cost-reducing technology. The theater size distribution mayhave been affected by the conversion to digital; in particular, for developing movie markets such as China.The results are robust when the sample is restricted to established markets with a stable theater size distri-bution.

63

Page 65: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

with different definition of country-categories: e.g., total box office revenue in 2005, GDP

per capita, etc. These alternative definition of country categories yield similar estimates.

Table 11 (Columns 1–3) show the OLS results. In specification (3), the elasticity of

digital screens with respect to digital movies is 1.012. Adoption of digital screens is also

positively correlated with average revenues per screen (a function of average admissions and

ticket prices) and the fraction of multiplexes in the industry.

Table 11 (Columns 4–5) show the first and second-stage regression of the IV model.

Once the endogeneity of digital movie availability is accounted for, the elasticity of digital

screens with respect to digital movies decreases to 0.806. This is consistent with the expected

direction of the bias for hardware-software systems with positive network effects.64 Table 11

also reports the first-stage (Kleibergen-Paap Wald rk) F-statistic which is above the critical

thresholds derived in Stock and Yogo (2005), indicating that the instrument strongly predicts

the endogenous regressor.65

C Estimation

C.1 Parametric bootstrap procedure

This section provides details about the bootstrap procedure used to compute standard errors

for the structural parameter estimates. The non-parametric bootstrap approach used in the

dynamic oligopoly game literature typically relies on the availability of a cross-section of in-

dependent markets and assumes that the same equilibrium is played in all markets. Standard

errors can be calculated by sampling market-histories (with replacement) and estimating the

parameter of interest for each cross-section of markets sampled.

This approach would not be valid in the current setting because only a single industry

is observed and firms’ decisions are correlated through network effects (at the industry-

level). Sampling firm-histories (or even local markets, e.g. urban/rural units-histories)

would destroy the dependence between observations.

Instead I employ a parametric bootstrap approach. The main idea is to rely on the

(parametric) model, or data generating process, which incorporates the dependence between

observations, to generate the bootstrap samples on which estimation is conducted.

Let D = {x1, {ait}i∈N,t=1,...,T} denote the data observed, where

64Although a chi-squared test fails to reject the null hypothesis that the specified endogenous regressorcan be treated as exogenous.

65This F-statistic is used because the standard Cragg-Donald Wald F-statistic is not valid with cluster-robust standard errors.

64

Page 66: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Table 11: Effect of digital movie availability on digital screen adoption

OLS IV

Dep. variable: Digital screens (log) (1) (2) (3) (4) (5)First-stage Second-stage

Digital movies (log) 1.161 1.176 1.012 0.806(0.0614) (0.146) (0.202) (0.342)

Multiplexes (8+ screens, in %) 0.957 1.655 1.256(0.492) (0.415) (0.560)

Revenue per screen (log, US$) 0.286 0.358 0.380(0.114) (0.171) (0.229)

Digital movies in the US ×Market share of US distributors (log) 0.538

(0.138)

Country-category FEs No No Yes Yes Yes

Region × Time FEs No Yes Yes Yes YesKleibergen-Paap rk Wald F-stat 15.21Endogeneity test (p-value) 0.64 (0.42)Observations 300 288 288 288 288

Note: Unit of observation: country/semester for all countries in the sample excluding the U.S. Standarderrors (in parenthesis) are clustered at the country-level.“Digital movies in the US” refers to the number ofdigital releases in the U.S. at time t. “Market share of US distributors” is in country c by the end of year2005 (i.e., in period t = 0).

65

Page 67: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

x1 = ((τ (1), s11), . . . , (τ (N), sN1))

is the initial state of the industry including firm types, and si1 = 0 for all i (in the

first period, all firms operate using film screens).66 The data D is assumed to be generated

from the set of equilibrium CCP {Pt(ait|τ (i), sit)}. I assume that the distribution of CCP

is parametric, namely, it follows an ordered probit model with parameters θ (governing the

dependence of the CCP on (t, τ (i), sit)).67 The parameters in θ are reduced-form first-stage

parameters entering the CCP.

The structural parameter of interest α is a functional of the CCP. Let θ be an estimate

of the reduced-form parameters θ obtained from the data D (shown in Table 5). Let α be

an estimate of the structural parameters α obtained from D and θ (via the second step

of the two-step approach detailed in Section 6.1). I denote the estimates of the CCP by

{Pt(ait|τ (i), sit; θ)}.Standard errors for α are obtained by following a parametric bootstrap procedure (see

Efron and Tibshirani (1993), sections 8.4 and 8.5 on “more general data structures”).

Throughout the procedure, the initial industry state x1 is fixed.

1. SimulateB boostrap datasets (D1, . . . ,DB) from the estimated CCP {Pt(ait|τ (i), sit; θ)}.The bootstrap datasets have the same size as the original dataset D.

2. For each bootstrap dataset indexed by b ∈ {1, . . . , B}, follow the two-step procedure

of Section 6.1: first, obtain estimates of the reduced-form first-stage parameters θb.

Second, given data Db and estimate θb, compute an estimate αb of the structural

parameter.

The requirement that the initial industry state x1 (in particular, firm types) is kept

fixed—rather than, for instance, sampling firms with replacement and simulating bootstrap

samples—guarantees that the equilibrium played in dataset D is also an equilibrium in the

bootstrap dataset Db: the equilibrium CCP {Pt(ait|τ (i), sit; θ)} are still valid and can be

used to simulate firms’ adoption decision in the bootstrap samples. This a consequence of the

fact that the process {ht(ano)}t=1,...,T , to which firms are best-responding, is only a function

of the initial industry state x1 and equilibrium strategy ano.68

66The remaining aggregate variables (pt, hUS,t) are assumed deterministic and are index by t.67Note that observations {ait}i∈N for a given t are independent and identically distributed conditional

on (t, τ (i), sit). This a consequence of the distributional assumption on εit. The curse of dimensionalityprevents estimating the (non-parametric) empirical distribution of ait|τ (i), sit.

68If alternatively, firm types were also sampled with replacement, the deterministic process {ht} wouldchange, and firms’ best-responses to it would also change.

66

Page 68: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

While the parametric bootstrap approach relies heavily on the correct specification of the

(parametric) data generating process, it can be viewed as one path forward in dealing with

dependent data in the context of dynamic games with a small number of markets but a large

number of firms. Such dependence arises, in particular, when network effects are present.

C.2 First-step: adoption policy rule

First step estimates for the adoption policy rule are obtained by estimating an ordered probit

model. Denote by Pij the probability that theater i transitions to state j. Possible states

are {0, 14, 12, 34, 1} in the case of miniplexes (4 − 7 screens), and {0, 1

8, 28, ..., 7

8, 1} in the case

of multi/megaplexes (8 screens or more). In constructing the likelihood, one has to account

for the fact that theaters cannnot divest digital screens, and therefore, cannot transition

to lower states: the dependent variable sit/Sτ (i) satisfies sit ≥ si,t−1. The log likelihood is

constructed as follows:

LL =∑

i:4≤Sτ(i)<8

1∑j=si(t−1)/Sτ(i)

dij log (Pij) +∑

i:8≤Sτ(i)

1∑j=si(t−1)/Sτ(i)

dij log (Pij) (35)

where dij is an indicator for firm i transitioning to state j.

C.3 First step: goodness of fit

To check the goodness of fit of the first-step adoption policy (CCP), model predictions for

the share of digital screens are compared to actual shares in the data. Tables 12, 13, and

14 present the comparison for all firms, miniplexes only, and multi/megaplexes, respectively.

In each table, the aggregate share of digital screens, the share of adopters (theaters with

at least one digital screen), and the average within-theater share of digital screens (among

adopters) are shown from 2006 to 2013. Overall, given the limitations imposed by the

parametric specification of the policy function, the model captures the main trends in the

aggregate, inter-firm, and intra-firm diffusion rates, for all firms and by firm size (miniplexes

vs. multi/megaplexes).

The aggregate share of digital screens was constantly lower for miniplexes than for mul-

ti/megaplexes, as reflected in the predictions as well. Additionally, the intra-firm rates’

evolution over time is smoother in the prediction than in the data.69

69In particular, for miniplexes, the intra-firm rate jumps to 41% as early as 2007, whereas the modelpredicts a slow increase between 2006 and 2009 to reach 40%. The prediction is also smoother in the caseof multi/megaplexes, with an increase in the actual intra-firm rates from 13.5% in 2008 to 41.5% in 2009,whereas the model predicts a smoother transition. Due to the limited number of firms with at least onedigital screen in the initial years (2006 − 2008), predicting the intra-firm rate (computed only on theaters

67

Page 69: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Table 12: Predictions using the adoption policy function - All firms

Aggregate Inter-firm Intra-firm

Year Data Prediction Data Prediction Data Prediction

2006 0.003 0.000 0.021 0.000 0.127 0.1042007 0.006 0.000 0.028 0.001 0.210 0.1892008 0.012 0.006 0.056 0.020 0.182 0.2312009 0.068 0.049 0.122 0.129 0.420 0.2972010 0.236 0.186 0.431 0.386 0.459 0.3992011 0.460 0.416 0.684 0.691 0.583 0.5332012 0.791 0.663 0.841 0.907 0.880 0.6892013 0.939 0.878 0.934 0.988 0.985 0.877

Note: The column labelled “Aggregate” corresponds to the share of digitalscreens across all firms in the industry. The column labelled “Inter-firm” cor-responds to the share of theaters with at least one digital screen. The columnlabelled “Intra-firm” corresponds to the within-theater average share of digitalscreens among theaters with at least one digital screen. The predicted rates areobtained by averaging 500 simulation paths.

Table 13: Predictions using the adoption policy function - Miniplexes (4-7 screens)

Aggregate Inter-firm Intra-firm

Year Data Prediction Data Prediction Data Prediction

2006 0.002 0 0.008 0 0.267 02007 0.007 0.000 0.017 0.000 0.415 0.3342008 0.007 0.002 0.017 0.004 0.415 0.3322009 0.024 0.018 0.054 0.045 0.435 0.3672010 0.112 0.092 0.243 0.207 0.443 0.4212011 0.294 0.275 0.498 0.528 0.569 0.5042012 0.653 0.546 0.745 0.845 0.857 0.6362013 0.879 0.837 0.891 0.979 0.975 0.853

Note: The columns are defined in the same way as in Table 12, but the refer-ence group is miniplexes instead of all firms. The predicted rates are obtainedby averaging 500 simulation paths.

68

Page 70: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Table 14: Predictions using the adoption policy function - Multi/Megaplexes (8-23 screens)

Aggregate Inter-firm Intra-firm

Year Data Prediction Data Prediction Data Prediction

2006 0.003 0.000 0.037 0.000 0.087 0.1032007 0.005 0.000 0.043 0.002 0.107 0.1792008 0.015 0.008 0.106 0.037 0.135 0.2162009 0.092 0.065 0.207 0.226 0.415 0.2802010 0.307 0.235 0.670 0.591 0.466 0.3882011 0.554 0.490 0.920 0.878 0.593 0.5522012 0.870 0.724 0.963 0.978 0.903 0.7392013 0.973 0.899 0.989 0.998 0.997 0.904

Note: The columns are defined in the same way as in Table 12, but the ref-erence group is multi/megaplexes instead of all firms. The predicted rates areobtained by averaging 500 simulation paths.

D Computation: full-solution approach for the plan-

ner’s benchmark

This section presents the full-solution approach to the planner’s problem (30). The plan-

ner’s benchmark is solved by backward induction starting from T . To address the high-

dimensionality problem, I maintain the reduced-form specification characterizing ht (Equa-

tion (31)) as in section 7.1. Additionally, the planner’s adoption strategy space is coarsened

by assuming group-symmetric strategies: I assign theaters to G groups based on profits

πd(τ (i)) and the planner chooses the same adoption decision (adoption rate) for theaters

within the same group.

These assumptions reduce the dimensionality of the state space tracked by the planner

(excluding theaters’ idiosyncratic shocks). However, within-group adoption strategies may

in general still be a function of the realization of the vector of idiosyncratic shocks εit, a

high-dimensional vector. I deal with this issue by noting that adoption decisions depend

only on the average adoption cost across theaters within a group, i.e. (for Ng theaters in

group g):

pt +1

Ng

∑i∈g

εit

Appealing to the law of large numbers, the second term is close to zero. By this argument,

with at least one digital screen) in this initial adoption phase is more challenging.

69

Page 71: Estimating the Costs of Standardization: Evidence from the ...Estimating the Costs of Standardization: Evidence from the Movie Industry El Hadi Caoui October 4, 2020 Abstract This

Figure 11: Diffusion paths under the three scenarios (full-solution approach)

(a) Digital screens (b) Digital movies

I can approximate the planner’s problem as a non-stochastic problem.70

The solution of problem (30) is found for different values of G. Figures 11a and 11b show

the results for G = 10 (5 for miniplexes and 5 for multiplexes). The optimal diffusion path

under the planner’s benchmark is close to the optimal path found by simulation (Figure 7a).

70More precisely, it is a weighted average∑aitεit∑ait

, because the number of screens converted differ across

theaters of different sizes.

70