nonparametric least squares methods for stochastic ... · léopold simar, ingrid van keilegom,...
TRANSCRIPT
![Page 1: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/1.jpg)
Nonparametric Least Squares Methods forStochastic Frontier Models
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
* School of Economics and Centre for E�ciency and Productivity Analysis(CEPA), The University of Queensland, Australia
APPC 2014
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 2: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/2.jpg)
The Background
In productivity and e�ciency analysis, researchers are primarily
interested in two aspects:
the estimation of a function characterizing the productionfrontier and its characteristics (marginal productivity,
elasticities, etc.),
explaining of variation in ine�ciency.
One of the most popular approaches for studying these aspects is
referred to as stochastic frontier analysis (SFA), introduced by
Aigner, Lovell and Schmidt (1977) and Meusen and van den Broek
(1977) (henceforth ALSMB).
The SFA paradigm has a very appealing feature relative to other
methods�it allows the presence of both an ine�ciency term
modeling the distance of an observation to the frontier and the
more traditional error term (as in most regression models) allowing
for noise.
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 3: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/3.jpg)
The Background (cont.)
Here we work with SFA paradigm but in a non-parametric andsemi-parametric context. Speci�cally, the model is
y = m(x , z)− u + v , (1)
where y ∈ R+ represents the output that can be produced with the
inputs x ∈ Rp+ in the presence of heterogeneous conditions z ∈ Rq
+,
using the available technology characterized by the production
frontier m(·, ·); the output y is adjusted by some possible
ine�ciency level u ∈ R+ and by some statistical noise v ∈ R+
The two terms u and v are unobserved random variables which may
vary with the inputs x as well as with the other variables z .
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 4: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/4.jpg)
Estimation of Average Production Function
A relatively easy task is to estimate the `average productionfunction' relationship (i.e., if ine�ciency term u is ignored), and
this can be done in a fully non-parametric way.
Letting ε = v − u + E (u|x , z), and r1(x , z) = m(x , z)− E (u|x , z)we can rewrite (1) as
y = r1(x , z) + ε (2)
Since E (ε|x , z) = 0, and V (ε|x , z) = Vv (x , z) + Vu(x , z) ∈ (0,∞),we can apply any valid non-parametric estimator to estimate
r1(x , z) = E (y |x , z), e.g., local polynomial least squares (LPLS)estimator, which is a fully non-parametric approach with good
asymptotic properties and is relatively easy and fast to compute.
In one estimation LPLS can produce consistent and asymptoticallynormal estimates of r1(x , z), which we will denote here as
r1(x , z), and estimates of its kth-order partial derivatives (if the
order of the polynomial is chosen to be at least of order k).Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 5: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/5.jpg)
LPLS Est. of Average Production Function
From i.i.d. data {(Yi ,Xi ,Zi ) : i = 1, ...n} we can estimate the
local linear estimate of r1(x , z) by solving, for any given value (x , z)
(αx ,z , βx ,z) = argminα,β
n∑i=1
[Yi − (α+ β′(Wi − w))
]2Kh
((Wi − w)/h
),
(3)
where Wi = (Xi ,Zi ), w = (x , z) and h denotes the bandwidths
(with some abuse of notations). Then we have
r1(x , z) =αx ,z
∇r1(x , z) =βx ,z ,
where the second equation provides an estimate of the gradient of
r1(x , z) at (x , z).
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 6: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/6.jpg)
Est. of Average Production Function
Note that r1(x , z) will be an estimate not of the productionfrontier m(x , z) but of r1(x , z) = m(x , z)− E (u|x , z), which we
refer to as the average production function.
Since E (u|x , z) ≥ 0, we have r1(x , z) ≤ m(x , z), ∀x ∈ Rp+,
∀z ∈ Rd+. Thus, clearly, r1(x , z) = m(x , z), ∀x ∈ Rp
+, ∀z ∈ Rd+ if
and only if E (u|x , z) = 0, i.e., if and only if there is no ine�ciency.
Hence, if there is some ine�ciency, then r1(x , z) would be a
downward-biased estimator of m(x , z) at any level of inputs.
Moreover, the bias is E (u|x , z), and so it is varying with (x , z),unless E (u|x , z) = E (u).
Note, however, that some important information about theaverage production relationship (e.g., the marginal productivity
of inputs, the scale elasticity, etc.) is still contained in r1(x , z) andso can be inferred from r1(x , z).
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 7: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/7.jpg)
Three Moments of the Total Error
Without specifying a particular choice for the localdistributions of u and of v , we can also estimate the moments of
ε since under the symmetry assumption on v ,
E (ε|x , z) = 0, (4)
E (ε2|x , z) = Vv (x , z) + Vu(x , z) > 0, (5)
E (ε3|x , z) = −E[(u − E (u|x , z)
)3|x , z] ≤ 0. (6)
We will extend the idea of the Modi�ed OLS (MOLS), originated in
the full parametric, homoskedastic stochastic frontier models (see
Olson et al., 1980) to our semi-parametric setup.
The idea is to exploit the fact that the residuals in (2) may help
estimating the conditional moments of ε.
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 8: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/8.jpg)
Estimation of 2nd and 3rd Moments
So, the residuals can be evaluated at any data point, and in
particular, to obtain
εi = Yi − r1(Xi ,Zi ), i = 1, ..., n. (7)
It is known that for all i , conditionally on (Yi ,Xi ,Zi ), εi → εi . Thesame is true for the powers of ε.
Therefore, consider the following regressions:
ε2 = E (ε2|x , z) + e2 = r2(x , z) + e2, (8)
ε3 = E (ε3|x , z) + e3 = r3(x , z) + e3, (9)
where by construction E (e2|x , z) = 0 and E (e3|x , z) = 0.
Using a nonparametric technique (e.g., LPLS), the regression
functions r2(x , z) and r3(x , z) can be consistently estimated from
{(ε2i ,Xi ,Zi )|i = 1, ..., n} and {(ε3i ,Xi ,Zi )|i = 1, ..., n}.Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 9: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/9.jpg)
Estimation of 2nd and 3rd Moments
We prove that r2(x , z) and r3(x , z) are consistent andasymptotically normal estimates of E (ε2|x , z) and E (ε3|x , z)respectively.
Now from (5)�(6), note the link with conditional moments of u and
v , so plugging these estimates in the equations will provide
information on the moments of u and v .
In order to identify the frontier level and some important
parameters of the model, e.g., m(x , z), E (u|x , z), Vu(x , z) andVv (x , z) we need to select local parametric assumptions forboth the density of u|x , z and of v |x , z .
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 10: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/10.jpg)
Estimation of Frontier and of Ine�ciency
From our discussion above it must be clear that for implementing
this task we need to have information on E (u|x , z) and this is
where (for the cross-sectional data framework) we need to makelocal parametric assumptions on the types of distributions ofu|x , z and v |x , z .
(v |x , z) ∼ N(0, σ2v (x , z)), v ∈ (−∞,∞), (10)
(u|x , z) ∼ |N(0, σ2u(x , z))|, u ∈ (0,∞), (11)
where we also assume that, conditionally on (x , z), u and v are
independent.
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 11: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/11.jpg)
Estimation of Frontier and of Ine�ciency
As a result, we would have
E (u|x , z) =√
2
πσu(x , z) (12)
E (ε2|x , z) = V (ε|x , z) = σ2v (x , z) +(π − 2
π
)σ2u(x , z) (13)
E (ε3|x , z) =√
2
π
(1− 4
π
)σ3u(x , z). (14)
Rearranging the system of equations given in (13)-(14), and solving
it for σ3u(x , z) and σ2v (x , z) we get
σ3u(x , z) =
√π
2
( π
π − 4
)E (ε3|x , z) (15)
σ2v (x , z) = E (ε2|x , z)−(π − 2
π
)σ2u(x , z) (16)
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 12: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/12.jpg)
Estimation of Frontier and of Ine�ciency
The LPLS estimates r2(x , z) and r3(x , z) are asymptotically
equivalent to the corresponding true conditional moments of ε, sowe can use them to get consistent estimates of the conditionalvariances at each point of interest (x , z), i.e.,
σ3u(x , z) =
√π
2
( π
π − 4
)r3(x , z) (17)
σ2v (x , z) = r2(x , z)−(σ3u(x , z)
)2/3(π − 2
π
)(18)
Using these estimates, we can obtain the estimates of e�ciency
scores for each observation, e.g., by using the method of Jondrowet al (1982)�after generalizing it to the heteroskedasticcase, by estimating E (ui |εi , xi , zi ) instead of E (ui |εi ).
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 13: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/13.jpg)
Estimation of Average Production Function
Also, a useful information can be inferred from a consistent
estimate of the conditional mean of ine�ciency term,
conditional on (x , z),
E (u|x , z) =√π
2
(√π
2
π
π − 4r3(x , z)
)1/3(19)
Furthermore, estimates of E (u|x , z) at every combination of
interest (x , z) can then be used to recover a consistent estimate of
the stochastic frontier, m(x , z), via
m(x , z) := r1(x , z) + E (u|x , z). (20)
Asymptotic properties of m(x , z), E (u|x , z), σu(x , z) andσv (x , z) are inherited from the asymptotic properties of
r1(x , z), r2(x , z) and r3(x , z) and we provide details in appendix.
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 14: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/14.jpg)
Robust Analysis of Determinants of Ine�ciency
In the traditional SFA setup, when statistical noise is symmetricwhile ine�ciency term is asymmetric, all the information about
the ine�ciency is essentially contained in the negative skewness of
the composite error.
So, studying ine�ciency boils down into studying the conditionalskewness with respect to (x , z), which can be done via a
non-parametric regression approach.
Interestingly, to perform such analysis, we do not need to specify
the distribution of v |x , z : we can use any one-parameter scalefamily for the density u|x , z , without specifying which member of
the family is chosen.
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 15: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/15.jpg)
Robust Analysis of Determinants of Ine�ciency
The density fu(u|x , z) belongs to the one parameter scale familyif
fu(u|x , z) =1
σu(x , z)g( u
σu(x , z)
), (21)
where g(·) is any density on R+. Examples of this are the
exponential, the half-normal, the gamma with �xed shape
parameter, etc. In this family it is easy to show that for all j ≥ 1
E (uj |x , z) = σju(x , z)kj , (22)
as long as the j th moment of g , kj =´∞0 v j g(v) dv , exists. As a
result, we also have
E (ε3|x , z) = E[(u − E (u|x , z)
)3∣∣ x , z] = cσ3u(x , z). (23)
where c = (k3 − 3k2k1 + 2k1) is a constant (depends on g).
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 16: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/16.jpg)
Robust Analysis of Determinants of Ine�ciency
Often, practitioners are actually more interested in the
determinants of the ine�ciency rather than the ine�ciency or
the frontier per se.
Sometimes, researchers are even satis�ed with at least the direction
(sign) of the in�uence, although perhaps ideal information would be
about the elasticities of the ine�ciency w.r.t. certain variables,
since they do not depend on units of measurement of the variables
involved.
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 17: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/17.jpg)
Robust Analysis of Determinants of Ine�ciency
Let ψl be an element of (x , z), then the (partial) elasticitymeasure of E (u|x , z) w.r.t. ψl , denoted by ξu/ψl
(x , z), is
ξu/ψl(x , z) :=
∂E (u|x , z)∂ψl
ψl
E (u|x , z)(24)
assuming that E (u|x , z) 6= 0. Using (22) with j = 1 we
immediately get
ξu/ψl(x , z) =
∂σu(x , z)
∂ψl
ψl
σu(x , z)(25)
if σu(x , z) 6= 0 (i.e., if there is some ine�ciency at (x , z)).
Although ∂σu(x , z)/∂ψl is not directly estimable, we can stillrecover it from estimate of E (ε3|x , z).
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 18: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/18.jpg)
Robust Analysis of Determinants of Ine�ciency
Indeed, using (22) when j = 3 we get
∂E (ε3|x , z)∂ψl
ψl
E (ε3|x , z)= 3cσ2u(x , z)
∂σu(x , z)
∂ψl
ψl
cσ3u(x , z)(26)
= 3ξu/ψl(x , z)
Therefore, we can express the elasticity of ine�ciency only interms of the third moment of the total error, i.e.,
ξu/ψl(x , z) =
1
3
∂E (ε3|x , z)∂ψl
ψl
E (ε3|x , z)(27)
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 19: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/19.jpg)
Robust Analysis of Determinants of Ine�ciency
So, a non-parametric estimate of ξu/ψj(x , z) can be obtained by
replacing the true moment E (ε3|x , z) and ∂E (ε3|x , z)/∂ψl with
their non-parametric estimates, i.e., as
ξu/ψj(x , z) =
1
3
∂r3(x , z)
∂ψl
ψl
r3(x , z)(28)
where r3(x , z) and ∂r3(x , z)/∂ψl , l = 1, ..., p + q are, for example,
the LPLS estimates of (9), provided, of course, that r3(x , z) 6= 0
for the particular combination of interest (x , z).
See appendix for a proof of consistency and asymptotic normality of
the estimator ξu/ψl(x , z)
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 20: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/20.jpg)
Testing of Existance of Ine�ciency
We would expect r3(x , z) being negative and signi�cantly di�erent
from zero at some ranges of (x , z) where �rms in the sample have
signi�cant ine�ciency and insigni�cantly di�erent from zero for
some ranges of (x , z) where there is no signi�cant ine�ciency.
So, existing tests developed for LPLS framework can beadapted to test whether r3(x , z) is signi�cantly di�erent from zero
or not.
In particular, it is well known that the LPLS estimator of a
regression function (and of its derivatives) is asymptotically
normally distributed (under some regularity conditions).
So, for a combination of interest (x , z), the null hypothesis about
no ine�ciency at this particular (x , z), i.e., H0 : r3(x , z) = 0, would
be rejected in favor of the alternative hypothesis that ine�ciency at
(x , z) is present, i.e., H0 : r3(x , z) < 0, if the test statistic is beyond
the critical value corresponding to the chosen signi�cance level.
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 21: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/21.jpg)
Testing of Determinants of Ine�ciency
LPLS also allows inference about the sign and size of the impactof x and z on the ine�ciency, by using estimates of ∇x r3(x , z) and∇z r3(x , z), respectively, and testing their signi�cance from zero, at
a particular combination (x , z).
Speci�cally, the null hypothesis about no impact on ine�ciency by
a potential factor ψi , i.e., H0 : ξu/ψj(x , z) = 0, where ψi is an
element of (x , z), would be rejected in favor of the alternative
hypothesis H1 : ξu/ψj(x , z) 6= 0 if the statistic is beyond the critical
value corresponding to the chosen signi�cance level.
In practice, bootstrap-based inference adapted to LPLS, about
r3(x , z) = 0 or ξu/ψj(x , z) = 0, might give more accurate results
than the inference based on asymptotic normality results for our
estimator.
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*
![Page 22: Nonparametric Least Squares Methods for Stochastic ... · Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk* * School of Economics and Centre for E ciency and Productivity Analysis](https://reader036.vdocuments.us/reader036/viewer/2022081408/605b3bafc10e394e3a16f25e/html5/thumbnails/22.jpg)
Emperical Example
See our Working Paper (on CEPA and ISBA websites) for details
Léopold Simar, Ingrid Van Keilegom, Valentin Zelenyuk*