rls_beta
TRANSCRIPT
-
8/8/2019 RLS_beta
1/13
Quantitative Finance, 2009, 113, iFirst
Robust estimation with flexible parametricdistributions: estimation of
utility stock betas
JAMES B. MCDONALDy, RICHARD A. MICHELFELDER*z andPANAYIOTIS THEODOSSIOUzx
yDepartment of Economics, Brigham Young University, Provo, UT, USA
zSchool of BusinessCamden, Rutgers University, 227 Penn Street, Camden, NJ 08102, USA
xCyprus University of Technology, Cyprus
(Received 26 June 2006; in final form 16 January 2009)
The distributions of stock returns and capital asset pricing model (CAPM) regression residualsare typically characterized by skewness and kurtosis. We apply four flexible probabilitydensity functions (pdfs) to model possible skewness and kurtosis in estimating the parametersof the CAPM and compare the corresponding estimates with ordinary least squares (OLS) andother symmetric distribution estimates. Estimation using the flexible pdfs provides moreefficient results than OLS when the errors are non-normal and similar results when the errorsare normal. Large estimation differences correspond to clear departures from normality. Ourresults show that OLS is not the best estimator of betas using this type of data. Our resultssuggest that the use of OLS CAPM betas may lead to erroneous estimates of the cost of capitalfor public utility stocks.
Keywords: Robust estimation; Beta; Flexible distributions; Skewness; Kurtosis
1. Introduction and purpose
Consistent with the well-established literature on the
characteristics of the distributions of stock returns in
general, public utilities stock returns distributions have
thick tails (leptokurtosis) as well as skewness. Estimating
capital asset pricing model (CAPM) betas using ordinary
least squares (OLS) when the data (returns and regression
errors) are non-normal results in inefficient estimators.
Inefficient betas are prone to greater estimation error
as their distributions have larger dispersion. They aremore likely to be insignificant due to larger standard
errors. The major focus of this study is efficient robust
estimation with application to the CAPM for public
utilities. Its main motivation stems from the fact that
public utility regulators and utilities, in addition to
investors and stock analysts, regularly use CAPM betas
to estimate the cost of common equity for public utilities.
Harrington (1980) conducted two surveys on the use
of the CAPM for utility regulation and found that the
model had either been considered or was being used by
38 utility commissions. Cooley (1981) reviewed the use
of the CAPM in estimating the cost of equity capital for
public utility companies and concluded that its use
has not been merely nominal. In a review of surveys,
Cooley (1981) found that the Federal Communications
Commission and a minimum of 20 state utility commis-
sions had heard testimony involving the application of the
CAPM. Out of 54 jurisdictions surveyed in 1978, 16 rate
cases involve the use of the CAPM, and there were 12 more
the following year. A web search of the use of the CAPM in
public utility rate cases today will easily demonstrate its
widespread application. Bey (1983) found that the out-
comes of public utility rate cases had a tremendous impact
on financial health of both the consumers and the utility
companies. He concluded that the CAPM should be used
in such cases in the best possible manner.
Investor-owned public utilities are price and rate-
of-return regulated. Estimating the cost of common equity*Corresponding author. Email: [email protected]
Quantitative Finance
ISSN 14697688 print/ISSN 14697696 online 2009 Taylor & Francis
http://www.informaworld.comDOI: 10.1080/14697680902814241
DownloadedBy:
[informa
internalusers]At:16:484February
2010
-
8/8/2019 RLS_beta
2/13
for setting the regulated utilitys allowed rate of return with
inefficiently estimated betas results in larger errors in the
pricing of electricity, and therefore creates inequitable shifts
of wealth between the regulated firms and consumers.
Moreover, more precise cost of capital estimates result in
less uncertainty in the regulatory electricity price setting
process and capital investment. In a risk-averse world, more
precise cost of capital estimates will have a significant
positive impact on the societal economic welfare. From theinvestors point of view, more accurate estimates of cost of
capital and portfolio inputs in general will lead to the
construction of more efficient portfolios. Siegal and
Woodgate (2007) and Klein and Bawa (1976) discuss the
impact of estimation error on the optimal portfolio choice
and performance.
The major sources of beta estimates to investors,
utilities and regulators come from investor information
services such as those provided by Value Line, Merrill
Lynch and Goldman Sachs. These beta estimates are
mainly based on the OLS estimation method and as such
they are likely to possess larger estimation error. In this
paper, we show that the use of flexible pdfs in regressionestimation leads to betas which are more efficient
in that they may possess smaller variances than those
associated with OLS. We evaluate the effectiveness
of several flexible parametric probability distributions
for estimating more efficient betas with quasi-maximum
likelihood estimation and compare them with OLS
and the generalized method of moments (GMM). These
pdfs include the skewed generalized T (SGT), the skewed
generalized error distribution (SGED), the skewed expo-
nential generalized beta of the second kind (SEGB2)
and the inverse hyperbolic sine distribution (IHS).
The first three pdfs have been developed in the last 10
years. The IHS was first introduced in 1949 but has
remained obscure until the recent interest in robustestimation and addressing non-normality in regression.
The flexible pdfs accommodate wide ranges of skewness
and kurtosis and therefore may result in more efficient
estimated betas when the data are non-normally dis-
tributed. Although the applications herein involve electric
utility stocks, the estimation methods universally apply
to all types of company stock CAPM parameters as their
stock returns pdfs typically have thick tails and skewness.
Thus, we suggest that the application of inefficient betas
may be a source of general equity market mis-pricing
and inefficiency as stock returns and their regression
errors are typically non-normal.
2. Empirical distributions of stock returns
Mandlebrot (1963) and Fama (1965) initially established
that the distribution of stock returns regression residuals
have leptokurtosis. McDonald and Nelson (1989) and
Harvey and Siddique (1999) found skewness and thick
tails in tests of various stock indices and asset classes.
Harvey and Siddique (2000) found positive skewness and
co-skewness with the stock market for portfolios of
electric and water utility stocks, in addition to other
stock portfolios. Chan and Lakonishok (1992) concluded
that since the distribution of stock returns is non-normal
for so many studies due to kurtosis that OLS estimators
of beta will often be inefficient. They found substantial
efficiency gains using robust methods when returns
contain extreme outliers.
Efficient beta estimation addressing skewness or
kurtosis is also discussed by Fielitz and Smith (1972),Francis (1975), McDonald and Nelson (1989), and Butler
et al. (1990). Theodossiou (1998) rejected the assumption
of normality of returns for multiple stock exchanges
indices, exchange rates, and gold. Akgiray and Booth
(1988, 1991) considered a mixture of normal distributions
and non-normal empirical pdfs in modeling the statistical
property of exchange rates. Bali (2003) fits alternative
pdfs (non-normal) to model the extreme changes in the
US Treasury securities market. Bali and Weinbaum
(2007) also reject the normality of stock market returns
for various indices. The literature concluding that asset
returns pdfs have skewness and kurtosis and therefore are
non-normal pdfs is vast. Generally, stock and other assetreturns pdfs have fat tails due to extreme outliers and
are often asymmetric. OLS estimators are highly sensitive
to extreme values. Electric utility as well as non-utility
stock returns have pdfs that are thick-tailed and skewed.
In these cases, alternatives to OLS can yield more efficient
estimators.
3. Flexible probability distributions
and robust estimation
There are a myriad of robust estimation methods.
Although many are discussed in this paper, some methods
such as those of Yohai and Zamar (1997) and Martin andSimin (2003) are not, as this paper focuses on those
methods that reflect generality in pdf by accommodating
varying levels of skewness and kurtosis and that nest
many pdfs. Martin and Simin (2003), however, do find
some interesting results with the data-dependent weighted
least squares approach that they developed and tested.
Boyer et al. (2003) differentiate robust, or outlier-
resistant estimators, into reweighted least squares (RLS)
or least median squares (LMS) and partially adaptive
estimators. Partially adaptive estimation procedures can
be viewed as being quasi-maximum likelihood estimators
(QMLE) because they maximize a log-likelihood function
corresponding to an approximating error distributionover both regression and distributional parameters. RLS
and LMS address only the explicit choice of regression
parameters. Therefore the pdfs in this investigation are
referred to as flexible pdfs. Boyer et al. (2003) use Monte
Carlo simulations to test the efficiency of flexible pdfs,
RLS and LMS. Using one of the four flexible pdfs and
a more restrictive version of another pdf used in this
paper, they concluded that flexible pdfs were found to
produce more efficient estimators than outlier-resistant
methods that do not accommodate changes in pdf
2 J. B. Mcdonald et al.
DownloadedBy:
[informa
internalusers]At:16:484February
2010
-
8/8/2019 RLS_beta
3/13
parameters when regression models have skewness or
kurtosis. Therefore, among the myriad of robust estima-
tion methods, this paper focuses on flexible pdfs.
The flexible probability distributions considered in this
investigation can accommodate a wider range of data
characteristics than commonly used distributions such as
the normal, log-normal, Laplace, and T. Although the
Laplace is a pdf and least absolute deviations (LAD) is an
estimation method, they produce the same estimates,analogous to the normal pdf and OLS. The flexible
probability distributions are the SGT from Theodossiou
(1998), the SGED from Theodossiou (2001), the SEGB2
from McDonald and Xu (1995), and the IHS from
Johnson (1949). These distributions have been used by
Hansen et al. (2007) to model various financial time series
with skewed and leptokurtic distributions such as various
stock market index returns, exchange rates, and the price
of gold. The SEGB2 and the more restrictive, non-skewed
version of the SGT, the generalized T (GT), were used by
Boyer et al. (2003).
These distributions nest several well-known distribu-
tions often used in econometric modeling. Some of thedistributions that are nested in the flexible pdfs include:
the normal, T, skewed T (ST) (Hansen 1994), GT
(McDonald and Newey 1988), generalized error (GED)
(Box and Tiao 1962), Laplace, and uniform distributions.
See Appendix A.
4. Estimation of alphas and betas
The CAPM is formulated below in equation (1). This
version assumes that the intercept, , is equal to zero. A
number of empirical tests of the CAPM structure have
tested as evidence against the CAPM structure. Handa
et al. (1993) simultaneously test as a vector of s fora series of stocks within portfolios and find evidence that
s are non-zero with monthly returns data. Other studies
such as Black et al. (1972), Blume and Friend (1973), and
Fama and MacBeth (1974) perform empirical CAPM
tests by estimating the security market line and perform-
ing tests on the intercept and whether the slope is equal to
the market risk premium.
The estimation of the CAPM alpha and beta
parameters for each utility company stock return is
accomplished by estimating the following model via
maximization of the sample log-likelihood function of
equation (2):
Ri,t Rf,t i iRm,t Rf,t "i,t, 1
maxi,i,i,j
li,i, i,jji maxi,i,i,j
XTt1
lnfi,j"i,tji,j
( ), 2
where j SGT, SGED, SEGB2, IHS, Laplace, T, GED,
GT, ST, and Normal, "i,t is the error of the stock return-
generating process for utility stock i (i 1,2, . . . , 36),
t denotes the time period (t 1,2, . . . , T), Ri,t is the stock
return of utility i for period t, Rm,t is the stock market
return, Rf,t is the risk-free rate of return, i and i are the
alpha and beta for utility stock i, i is a vector of
distributional parameters in the pdf j, and i includes the
data for estimation. GMM is also applied as it requires no
prior assumption on the pdf of the error term. A complete
discussion of the special estimation methods and the
flexible pdfs is presented in appendix A.
5. Estimation results
The sample consists of 36 electric and electric and gas
combination companies that were continuously publicly
traded between January 1990 and December 2004. These
include all publicly traded companies with SICs 4911
and 4931. Any stock that stopped trading and did not
have continuous returns during the period was removed
from the sample. This exclusion involved only one utility
stock.
Market and utility stock returns are monthly total
stock returns that are obtained from the University of
Chicagos Center for Research in Security Prices (CRSP)database. The market is defined by the CRSP value-
weighted index that includes all stocks traded on the
NYSE, NASDAQ, and the AMEX. We used monthly
data to be generally consistent with practitioners use of
monthly data for estimation. Monthly data resulted in
180 stock return observations for each utility stock and
the market. The risk-free rate is the one-month return on
the one-month US Treasury Bill. The excess market
return is the same as defined in the FamaFrench
database.
A review of the descriptive statistics of the excess
returns data for the utility stocks (mean, standard
deviation, skewness, excess kurtosis, JarqueBera (JB)
statistic) was performed for the 36 utility stocks for theentire period January 1990 to December 2004. The mean
monthly excess return (in decimal format) is 0.0057 and
its standard deviation of 0.0631 results in an average
return-to-risk, or Sharpe ratio of 0.09. This is a typical
reward-to-risk ratio for stocks. By comparison, the
20-year US Treasury Bond average Sharpe ratio is 0.06
between 1961 and 2002. The mean excess kurtosis and
skewness values are 2.832 and 0.0941, respectively.
The JB statistics show that almost all of the utility stocks
returns distributions are non-normal. The JB statistic
is asymptotically 2 distributed with two degrees of
freedom and has a critical value of 5.99 at the 5% level of
significance. This test shows that the levels of skewnessand excess kurtosis of the returns distributions lead
to the conclusion that the returns are non-normally distri-
buted for 28 of the 36 companies. Rather than testing
the significance of skewness and kurtosis independently,
we reviewed their joint test with the normal pdf and
no excess kurtosis nor skewness as the null hypothesis.
Table 1 displays the beta estimates for each of the
alternative estimators. It also includes the GMM estima-
tor since GMM requires no prior distributional assump-
tion for the error term. GMM parameter estimates are
Robust estimation with flexible parametric distributions 3
DownloadedBy:
[informa
internalusers]At:16:484February
2010
-
8/8/2019 RLS_beta
4/13
-
8/8/2019 RLS_beta
5/13
asymptotically consistent despite the non-normality of
the error term.y Generally, the betas are similar in
magnitude across the various pdf estimates. The medians
of the GMM estimates are a little less than obtained using
other methods. The maximum GMM estimate is less than
the maximum of all other methods except for one stock.
However, it is common across the stocks for the OLS
(and GMM) estimate(s) to have the largest difference
from the other estimates, which may agree quite closely.
There does not seem to be any systematic under- or
under-estimation of the beta by OLS compared with the
flexible pdfs. We also did not observe any of the robust
estimators to have a tendency to be more or less similar to
OLS. The case of GMP is an example of a large difference
with the OLS estimate being 0.012 and the flexible pdf
estimates ranging between 0.109 and 0.167. CIN has
an OLS beta of 0.103 and a range of flexible pdf betas
ranging between 0.177 and 0.217. These two stocks betas
are substantially different from the OLS estimates.
The resulting risk premia, i(Rm,t Rft), to estimate
their costs of equity of capital and allowed rates of
return would differ by the same magnitude. Althoughmost of the OLS beta differences are not so dramatic, they
have the largest systematic difference from the other
estimates. The mean of the maximum difference between
the OLS and flexible pdfs betas is 0.0496. Given that
the value-weighted portfolio beta for the utility stocks is
0.21, this is a substantial difference, and would lead to
a large difference in the estimated cost of capital for the
portfolio. It appears that the agreement among the robust
estimators on the estimate of betas indicates that these
methods are controlling for the impact of large unduly
influential data points.
An inspection of electric utility company betas in Value
Line from the MarchMay 2004 issues that include
electric utilities have a mean adjusted beta (Blume 1975)
a 0.33 0.67u of 0.79 and unadjusted Value Line
mean beta of 0.69. Value Line uses OLS to estimate raw
betas then applies the Blume beta adjustment shown
above. To the extent that OLS is used to estimate betas to
compute estimates of the public utilities cost of common
equity capital and allowed (regulated) rates of return on
invested capital, the degree of the difference between OLS
and flexible pdf betas due to possible larger estimation
error should be an important regulatory policy question,
as well as a statistical problem. Note that utility betas are
adjusted by Value Line with the above Blume equation
that assumes that they converge to one, whereas in reality
they do not.
Although not presented, none of the alpha estimates
are statistically significantly different from zero. This is
consistent with the structure of the CAPM when using the
excess-return CAPM equation for empirical testing, given
that, according to theory, alpha should be equal to zero.
A comparison of the log-likelihood values corresponding
to the estimates for 11 regression error distributions
(not presented for brevity), including the four flexible
pdfs and their symmetric counterparts, normal or OLS,
the Laplace or LAD, T, and ST show that the log-
likelihood estimates are generally higher for the more
flexible distributions. The OLS results are associated with
the smallest log-likelihood value, which follows from
the normal being a special or limiting case of many of the
other pdfs being considered and due to its inability to
fit thick tails and skewness. Furthermore, in each case
(SGT, ST, SGED, SEGB2, and IHS) the more general pdf
is seen to provide a statistically significant improvement
relative to the normal for almost every stock using
a likelihood ratio (LR) test.
Table 2 reports values of the LR test statistic
corresponding to testing the hypothesis that the estimateddistributions of the regression errors are observationally
equivalent to the normal. This statistic is asymptotically
distributed as 2 with degrees of freedom equal to the
difference in the number of distributional parameters
when the normal is nested in the estimated pdf as in the
case of the SGED. The 2 is not appropriate for non-
nested pdfs. The LAD does not nest the normal and
therefore the test does not appear for that pdf.
The asymptotic distribution of LR is not 2 distributed
for limiting cases where the parameter is on the
boundary of the parameter space such as when comparing
a T with a normal pdf. While the excess returns are non-
normally distributed as shown from JB tests, we would
not be surprised if the errors behave similarly as found by
Blume (1968).
However, simulations conducted by McDonald and Xu
(1992) suggest that the statistical differences will be at
least as large as those based on the use of a chi-square
distribution.z Most of the reported LR values imply the
rejection of the normality assumption at the 5% level.
The exceptions are ED, DTE, HE, PGN, WPS, and WEC
stock returns. The tests for these stocks indicate that the
alternative pdf regression is not a better fit than the
normal. These stocks also have JB statistics for excess
returns that do not reject the null hypothesis that they are
normally distributed. PSD and PNM have some insignif-
icant chi-square statistics among the alternative pdfs.
yWe used the fully iterative GMM estimator as developed by Newey (1988). We used OLS estimates to provide the starting valuesfor the iterative process, which we allowed to iterate until convergence was achieved. We usedJ 4 moment conditions in definingthe objective function to be optimized, thus the convergence involved simultaneously minimizing the correlation between functionsof the first four moments of the estimated error with the independent variable as outlined by Newey (1988).zBased on simulations of 1000 replications, the size of the LR test associated with estimating various pdfs which nest the normal,exponential, or lognormal distribution was explored. When the nested distribution corresponded to a special case of the estimatedpdf, the size of the LR test was close to that predicted by the asymptotic Chi-square distribution. However, in the case where thenested distribution corresponded to a limiting case of the estimated distribution that violated a regularity condition, the size of theLR test appeared to be less than suggested by the asymptotic Chi-square for large sample sizes. This simulation included the GED,T, GT, and SEGB2, but not the SGT or IHS. This suggests that the statistical significance of the limiting cases in table 2 are evengreater than might be inferred from the Chi-square values.
Robust estimation with flexible parametric distributions 5
DownloadedBy:
[informa
internalusers]At:16:484February
2010
-
8/8/2019 RLS_beta
6/13
Table2.Loglikelihoodratiotestforpdfsv.Normalpdf.
Company
T
GED
EGB
GT
SGED
SEGB
IHS
ST
SGT
GMP
63.86
64.23
59.67
65.24
68.62
62.78
66.00
64.79
68.13
AEP
9.71
7.80
9.54
10.10
7.81
9.57
9.76
9.75
10.15
CMS
67.55
60.24
58.41
68.68
61.03
58.52
67.38
67.57
69.25
PGN
1.50
2.11
1.70
2.11
4.67
2.33
2.16
2.73
4.67
CIN
9.97
12.60
12.07
12.60
14.54
14.15
10.64
10.34
14.54
ED
1.77
2.44
1.77
2.44
2.68
2.09
2.07
1.91
2.68
DPL
14.26
15.45
15.47
15.59
16.04
15.79
14.88
14.33
16.05
DTE
0.00
0.18
0.01
0.18
0.65
0.53
0.54
0.35
0.65
D
12.15
12.77
13.62
13.00
14.24
15.18
14.33
13.96
14.59
DUK
20.91
25.40
25.32
25.40
25.60
25.54
22.63
21.23
25.60
EDE
20.99
17.30
19.86
21.08
18.35
19.87
20.86
21.17
21.20
FPL
24.89
24.06
24.34
25.16
24.15
24.50
25.31
24.98
25.24
HE
1.64
1.47
1.61
1.66
2.07
2.52
2.47
2.07
2.12
IDA
10.86
6.45
9.77
14.67
6.52
9.78
10.49
11.28
14.98
WR
27.16
26.96
27.40
27.31
29.74
29.53
28.11
27.45
29.74
ETR
12.77
9.57
12.15
13.67
10.23
12.89
13.30
13.32
14.23
NI
31.49
26.07
28.67
32.52
26.39
28.92
31.31
32.07
32.97
SRP
72.50
67.57
64.78
72.50
67.38
64.86
73.12
73.03
73.03
NU
11.51
16.18
15.92
16.18
16.60
16.36
12.44
11.57
16.60
OGE
8.24
8.76
8.89
8.85
11.09
10.27
9.89
9.40
11.09
PCG
44.66
45.75
43.98
45.79
45.76
43.98
45.62
44.66
45.81
PPL
36.33
35.63
35.90
36.86
38.12
38.08
38.72
37.63
38.62
PNW
33.41
32.98
33.37
34.17
33.30
33.74
34.11
33.63
34.55
POM
14.51
10.57
13.42
18.23
11.02
13.67
14.34
14.62
18.23
PNM
4.76
6.28
5.92
6.28
9.77
9.82
9.31
8.50
9.77
PEG
36.02
35.59
35.56
36.81
39.59
38.63
39.51
37.92
39.60
PSD
4.80
9.40
8.08
9.40
9.42
8.12
5.65
5.14
9.42
EIX
46.02
46.50
45.10
47.32
46.62
45.14
47.22
46.03
47.42
SCG
11.70
14.83
14.39
14.83
15.22
14.79
12.65
11.77
15.22
SO
6.10
6.68
6.42
6.71
6.74
6.46
6.28
6.10
6.74
TE
8.48
10.25
10.27
10.25
11.48
10.59
9.31
8.84
11.48
TXU
72.25
66.49
65.20
72.25
66.25
66.26
73.43
74.86
75.02
UIL
12.30
12.29
12.81
12.43
14.69
13.65
13.07
13.18
14.69
UTL
12.31
15.82
15.40
15.82
15.96
15.53
12.93
12.33
15.96
WEC
6.36
6.32
6.66
6.55
6.46
6.87
6.75
6.63
6.78
WPS
1.79
2.60
1.93
2.60
4.87
4.17
4.10
3.58
4.87
Minimum
0.00
0.18
0.01
0.18
0.65
0.53
0.54
0.35
0.65
Median
12.31
13.80
14.01
15.21
14.96
14.99
13.19
13.25
15.59
Maximum
72.50
67.57
65.20
72.50
68.62
66.26
73.43
74.86
75.02
Mean
21.54
21.27
21.26
22.92
22.32
22.10
22.52
22.19
23.94
StdDev.
20.95
19.59
18.76
20.69
19.67
18.85
21.03
21.03
20.84
Theratiosapproximatethe
2
distributionwiththedegreesoffreedomequaltothediffe
renceinthenumberofparameterscomparedto
thenormalpdf.Thepdfswiththelargestnumberofparametersis5andthe
normalhas2,sothegreatestdegreeso
ffreedomis3.Thecriticalvalueata5%level
ofsignificanceis7.81.
6 J. B. Mcdonald et al.
DownloadedBy:
[informa
internalusers]At:16:484February
2010
-
8/8/2019 RLS_beta
7/13
These stocks excess returns also have relatively higher
(than the stocks listed above) but insignificant JB
statistics.
The relative impacts of skewness and kurtosis are tested
to determine if departures from normal data and the
resulting inefficiency in beta estimates are generated
more dominantly from skewness or kurtosis. Skewness
is important to understand as negative or positive skew-
ness may be an indicator of adverse or favorable (fromthe shareholders perspective) regulation. See Brigham
and Crum (1978).y The SGT, ST, SGED, and SEGB2
regressions were estimated along with their symmetric
counterparts (GT, T, GED, EGB2) of these pdfs by
constraining the skewness parameters to the values that
represent no skewness. We have performed LR tests
(not shown) comparing their fits. All but two stocks 2
statistics are insignificant. None of the SGT-GT and ST-T
LR tests are significant and only 2.8% (one stock) of the
36 stocks LRs for the SEGB2-EGB2 and the SGED-GED
are significant. Therefore, kurtosis appears to be the
dominant non-normality parameter affecting the utility
stock regression fits.The empirical testing leads to the conclusion that
the SGT, IHS, and SEGB2 yield similar results in the
presence of kurtosis and skewness, which can differ
significantly from OLS. Additionally, since partially
adaptive estimation based on SGT includes the LAD,
OLS, and Lk (minimizes (sum of estimated errors)k; see
appendix A) estimators as special cases, the user may
want to consider SGT estimation. It performs better than
the SGED, T, LAD, and the normal based on LR tests.
It performs similarly to the SEGB2 and the IHS. One
must be careful to consider the loss of efficiency from
over-parameterization in using pdfs such as the SGT as it
is defined by five parameters. Note that likelihood ratio
tests between the SGT, SEGB2, and IHS cannot beperformed as they do not nest one another.
One issue that the empirical estimations do not address
is the performance of the flexible pdfs vis-a-vis OLS when
the CAPM regression errors are normally distributed.
When the errors are independently, identically, distri-
buted (i.i.d.) as normal, then OLS is the minimum
variance of any unbiased estimator. However, if the
errors are i.i.d. as non-normal, OLS is still the minimum
variance linear unbiased estimator, but nonlinear robust
estimators may be unbiased and have a smaller variance
than OLS. The next section involves a series of
simulations that compare the efficiency of betas estimated
with OLS and the flexible pdfs when the error term is
normally distributed, is mixed normally distributed
(varying variances and therefore has thick tails), and is
asymmetric (has skewness).
6. Simulations and estimator performance
McDonald and White (1993) and Boyer et al. (2003) used
Monte Carlo methods to compare the relative efficiency
of several regression estimators. Some of the estimators
considered included OLS, LAD, a normal kernel esti-
mator (Manski 1984), GMM (Newey 1988), and partially
adaptive maximum likelihood estimators based on the
assumption of the error terms being independently
and identically distributed as GT, GED, or SEGB2.
The actual error distributions considered included the
(1) normal, (2) a thick-tailed variance-contaminated
normal (normal mixture), and (3) a skewed log-normal.z
Major findings included (1) little efficiency loss for
partially adaptive estimation based on an over-parame-
terization of the distribution of normally distributed
errors; (2) very similar performance of fully iterative
adaptive and partially adaptive estimators in the case of
symmetric thick-tailed distributions;x and (3) clear dom-
inance of the SEGB2 partially adaptive estimator over all
other estimators considered in the case of a skewed error
distribution. Ramirez et al. (2003) used the same sample
design and found that partially adaptive estimation based
on the IHS distribution yields nearly identical results
to the SEGB2 for normal, symmetric thick-tailed, and
skewed error distributions.
The simulations reported in this section are based
on the same error distributions considered in thepapers by Johnson (1949), McDonald and Xu (1995),
and Theodossiou (1998), and use similar data-generating
processes as used by Manski (1984), Newey (1988),
McDonald and White (1993), and Ramirez et al. (2003).
The model simulated in this paper is
yt xt ut 0 0:21xt ut, 3
where the xts correspond to the 180 observed monthly
excess returns on the market, the error terms ut have zero
mean and unitary variance, and the scale parameter is
selected to generate an R2 similar to those obtained from
yThe importance of determining whether non-normality is due to asymmetry or thick tails is important since skewness can result inbiased intercepts (Jensens alphas) under specific conditions, as shown by McDonaldet al. (2009). This is also an importantestimation issue if using the single index model (RiiiRm "i) to predict stock returns. Note that the T-values of the interceptfor the log-normal pdf in the simulations shown in table 3 are substantially higher for the non-normal symmetric pdfs (LAD, GED,T, and GT). The LAD and T intercepts are statistically significant, which is an indicator of bias, as the expected value of the estimateof this parameter is 0. Secondly, negative or positive stock returns skewness can be generated by adverse or favorable regulatorytreatment of utility profits, respectively. Although beyond the scope of this paper, skewness caused by regulatory treatment ofutilities can effect the efficient and unbiased estimation of the models used to estimate the cost of capital.zAll three error distributions are standardized to have a zero mean and unitary variance. The first error distribution is merely a unitnormal generated by u1 z, where z$N(0,1). The thick-tailed variance contaminated error distribution is generated byu2 wz1 (1 w)z2, where z1$N(0, 1/9), z2$N(0,9), and w is 1 with probability 0.9 and 0, otherwise. This distribution is symmetricand has a standardized kurtosis of 24.33. The log-normal distribution is generated byu3 (e
ze0.5)/(e2e)0.5 where z$N(0, 1). Thisdistribution has standardized skewness and kurtosis of 6.185 and 113.94, respectively.xThe fully iterative adaptive kernel and GMM estimators performed much better than the corresponding two-step estimators.
Robust estimation with flexible parametric distributions 7
DownloadedBy:
[informa
internalusers]At:16:484February
2010
-
8/8/2019 RLS_beta
8/13
-
8/8/2019 RLS_beta
9/13
the regressions discussed in section 4. Appendix B
derives the standard error for scaling the error term in
equation (3) for the simulations. The selected value for
beta is 0.21, which is the market-value weighted OLS
beta for the portfolio of utility stocks. The selected value
of alpha is 0. This is based on the insignificance of the
estimates for the stocks. Ten thousand replications
were performed using the methods of OLS, LAD,
skewed Laplace (SL), GED, SGED, T, ST, GT, SGT,
IHS, SEGB2 and GMM estimators. These results extend
those reported by McDonald and White (1993) to include
the SGT, SGED, and IHS partially adaptive regression
estimators of the slope.
Table 3, Panel 1, presents the mean of the intercept,
slope and expected portfolio return and their T-values
based on 10,000 simulations. T-values for the intercept,
slope and expected portfolio return test for bias; the
expected value for the estimated parameter values of
the intercept and slope are 0 and 0.21, respectively.
The expected value for the T-value is 0. The error
distributions considered are the normal, thick-tailed
contaminated normal, and the skewed log-normal.Panel 2 contains the slope and expected portfolio returns
relative (compared with OLS) root mean square errors
(RRMSEs). Intercept RRMSEs are not presented for
brevity (they are available upon request). The expected
portfolio returns are simulated and also reported in
table 3 to test the relative pricing performance of the
CAPM with the various estimators. We predicted the
excess portfolio returns to assess those pdfs that generate
the least forecast error. The RRMSEs of the returns are
indicators of forecast error.
The results for the intercept show that it is not
significantly different from 0 and therefore is unbiased
except when the error pdf is skewed and a symmetricdensity was applied. Note that the LAD and T pdfs
estimate biased intercepts when the error term pdf is
skewed since their T-values are significantly different
from zero. The T-value for the GT is somewhat high.
Skewness generates intercept bias with the opposite
algebraic sign as all of the symmetric densities yield
intercepts with negative signs yet the log-normal is
positively skewed (see footnote 3). The RRMSEs show
that the efficiency of the intercept estimators of the
flexible pdfs is generally better than the remaining
estimators with skewed errors. The results for the slope
indicate no bias. Their values are close to 0.21 and all of
the T-values are insignificant.y All models, symmetric and
non-symmetric, appear to correspond to unbiased esti-mators of the slope coefficient. Unbiased estimates of the
slope do not mean that there will not be cost of capital
estimation errors since, on average, the slope will tend
toward its true value. Inefficiency leads to larger errors in
slope estimation since the distribution of the slope will
have greater dispersion with higher mean squared errors.
The LAD and SL are the only slope estimators that result
in substantial efficiency losses when the error term is
normally distributed. However, this is not unexpected
since the LAD distribution is the only pdf considered that
does not nest the normal pdf. For the thick-tailed and
symmetric errors distribution, OLS provides the largest
slope RMSE of any of the estimators considered. All of
the estimators except for LAD and SL yield similar
RMSEs with a thick-tailed symmetric error term. In the
case of skewed and thick-tailed error distributions, OLS is
again the worst performing slope estimator. It is in this
case that the partially adaptive estimators (ST, SGT, IHS,
SGED, and SEGB2) give evidence of the potential of
significant increased efficiency for the slope relative to
OLS. GMM also performs well given that it is an
estimator that is pdf independent.
The simulations of the portfolio of utility stocks
expected returns were performed to determine which of
the estimators produces the lowest asset pricing errors.
Simulations of the expected value of the portfolio returnwere performed with an assumed zero intercept and the
simulated slopes multiplied by 0.6, which is the expected
value of the market excess return for the sample period.
As expected, the predicted portfolio returns are unbiased
as shown by their values and their T-values. The efficiency
of the expected portfolio return is of interest since higher
efficiency is an indicator of lower returns prediction and
pricing errors. The portfolio returns simulations reflect
estimation risk, since the variance in the simulated error
term reflects the estimation risk associated with estimated
slopes to fit thick-tailed or skewed errors. The results
clearly show that when the error term is thick-tailed or
skewed, the flexible pdfs and GMM have lower RRMSEsand therefore lower pricing errors. This result essentially
shows that the flexible estimators and GMM are the most
reliable estimators of beta and produce the least pricing
errors.
The inefficiency of the OLS beta estimators can be
visualized as a probability distribution of OLS betas with
greater dispersion than a distribution of betas estimated
by robust methods. This will be reflected in higher
errors in beta estimation, and resulting predictions of
the cost of capital. Envision two beta distributions, one
that is generated from OLS and the other from the IHS
pdf estimates of beta. Although not shown (available
upon request), the beta RMSE for OLS is 0.07645 and
for the IHS the value is 0.04745. All of the four flexiblepdfs (SGT, EBG2, IHS, SGED) have lower RMSEs than
the others and OLS, with an average of about 0.05, but
the IHS is slightly lower than the others. If a beta estimate
from each CAPM beta distribution is chosen at the
yA T-test was used to compare the sample mean of beta from the 10,000 simulation estimates with the hypothesized value of 0.21.The RMSE was used as the standard error of the sample mean for the T-test. If the sample mean was significantly different than0.21, there was a bias in the sample mean estimate. A beta equal to 0.21, alpha equal to 0, a desired R2 of 0.04, 180 randomlygenerated observations, and a standard deviation of 2.1468 for x are the parametric assumptions based on averages of empiricalresults. The corresponding value of can be calculated, which is multiplied times the standardized error terms (u has a mean of 0and variance of 1 using the standard normal, scaled mixture of normals, and scaled and shifted log-normal variable).
Robust estimation with flexible parametric distributions 9
DownloadedBy:
[informa
internalusers]At:16:484February
2010
-
8/8/2019 RLS_beta
10/13
same point, such as one RMSE below the expected value
of 0.21, the OLS value is 0.134 and for IHS the value is
0.163. This would lead to 22% greater under-estimation
of the cost of capital risk premium from OLS due to lower
efficiency. The economic impact of such an allowed rate
of return on the public utility required by regulators
would be a windfall in consumer surplus to customers, an
adverse impact on the financial viability of the public
utilities, resulting in a reduction in economic welfare.Statistically, this may generate negative skewness in the
public utilities stock returns as discussed by Brigham and
Crum (1978).
We recommend that the final choice of slope and
intercept estimator is one of the flexible pdfs if thick tails
and/or asymmetry is present. If there are concerns about
over-parameterization (reduction in efficiency from esti-
mating unnecessary pdf parameters), we recommend the
use of the SGT, IHS, or SEGB2 with testing for statistical
improvements relative to special case pdfs, and then use
the simplest model that is observationally equivalent to
the most general formulation which allows for possible
skewness and/kurtosis. GMM is almost as efficient anestimator of the beta as the flexible pdfs if the flexible pdf
estimators are not readily computable.
7. Summary and conclusions
We apply several flexible pdfs for estimating the betas of
public utility company stocks. Estimation methods based
on the flexible pdfs for the error distributions in regression
models, or partially adaptive estimators, were tested
against OLS, other estimation methods involving the
choice of pdf, and GMM. Partially adaptive estimators
were found to be the most efficient of those considered in
the presence of skewed and fat-tailed distributions.The recommended partially adaptive estimator is based
on the SGT, SEGB2, or IHS if skewness/kurtosis are
present. GMM, which does not assume a particular pdf,
also performed well with non-normal errors.
OLS estimation is efficient when neither skewness nor
kurtosis exist in the pdf of the data, although the partially
adaptive methods that include the normal as a special case
yield similar results. Similar to other stocks, public utility
stock returns and their CAPM regression errors typically
have kurtosis and many have skewness. Therefore, OLS is
an inefficient estimator of beta. This leads to an inefficient
estimate of the cost of common equity capital and
produces the most prediction and pricing error whenthick tails are present. The magnitude of the error in
estimating the cost of common equity capital and allowed
(regulated) rate of return on utility investment is
proportional to the estimation error in beta. Errors in
setting the utilitys allowed rate of return create dis-
balances in consumer and producer surplus and therefore
losses in social economic welfare, result in inefficient
pricing of utility stocks, and general disequilibrium in
energy and related markets as other fuels and conserva-
tion inefficiently substitute mis-priced electric power.
More generally, our results here also suggest that further
research may demonstrate that the substitution of
statistical methods for OLS may result in more efficient
equity markets. Mean and variance estimation risk
affects the choice of optimal portfolios and portfolio
performance.
In appendix A, we review the main distributional
characteristics associated with the flexible pdfs, the
normal, and many other pdfs that they nest. Partially
adaptive estimation based on these distributions was used
to estimate the CAPM for 36 electric utility stocks.
The motivation for selecting electric utility companies
is due to the unique characteristic of regulated rates of
return that appear to be manifest in their skewed and
leptokurtic stock returns and the role estimated betas
have in calculating the cost of capital and in setting
electric utility rates. Based on LR tests and simulations,
the partially adaptive estimators provided significant
improvements relative to OLS in applications in which
the error distribution is skewed and/or thick-tailed.
The statistical performance of the partially adaptive
estimators was also explored using a Monte Carlo study.
The Monte Carlo design was adapted from one used ina number of other papers which explores the impact of
thick tails and asymmetry as well as the efficiency loss of
over-fitting in the presence of normally distributed errors.
Our results show that potential exists for significant
improvements in estimation efficiency in the presence of
leptokurtosis or skewness in the data, and the efficiency
loss from over-fitting the error distribution in the
standard normal linear model was modest.
Lastly, portfolio prediction simulations show that the
SGT, SEGB2, and IHS produce the lowest pricing
prediction errors, and the next best performer is GMM.
Future research should consider estimating time-varying
conditional betas using the conditional or intertemporal
CAPM with robust estimators as discussed by Bali (2008).
References
Akgiray, V. and Booth, G.G., Mixed diffusion-jump processmodeling of exchange rate movements. Rev. Econ. Statist.,1988, 70, 631637.
Akgiray, V. and Booth, G.G., Modeling the stochastic behaviorof Canadian foreign exchange rates. J. Multinat. Finan.Mgmt., 1991, 1, 4372.
Bali, T.G., An extreme value approach to estimating volatilityand value at risk. J. Bus., 2003, 76, 83108.
Bali, T.G., The intertemporal relation between expected returns
and risk. J. Finan. Econ., 2008, 87, 101131.Bali, T.G. and Weinbaum, D., A conditional extreme value
volatility estimator based on high-frequency returns. J. Econ.Dynam. Control, 2007, 31, 361397.
Bey, R., Market model stationarity of individual public utilities.J. Finan. Quant. Anal., 1983, 18, 6788.
Black, F., Jensen, M. and Scholes, M., The capital asset pricingmodel: some empirical tests, In Studies in the Theory ofCapital Markets, edited by M. Jensen, 1972 (New York:Praeger).
Blume, M., The assessment of portfolio performance. PhDdissertation, University of Chicago, 1968.
Blume, M., Betas and their regression tendencies. J. Finan.,1975, 30, 785795.
10 J. B. Mcdonald et al.
DownloadedBy:
[informa
internalusers]At:16:484February
2010
-
8/8/2019 RLS_beta
11/13
Blume, M. and Friend, I., A new look at the capital asset pricingmodel. J. Finan., 1973, 28, 1933.
Box, G.E.P. and Tiao, G.C., A further look at robustness viaBayes theorem. Biometrika, 1962, 49, 419432.
Boyer, B.H., McDonald, J.B. and Newey, W.K., A comparisonof partially adaptive and reweighted least squares estimation.Econometr. Rev., 2003, 22, 115134.
Brigham, E. and Crum, R., Reply to comments on the use ofthe CAPM in public utility rate cases. Finan. Mgmt., 1978, 7,
7276.Butler, R.J., McDonald, J.B., Nelson, R.D. and White, S.,Robust and partially adaptive estimation of regressionmodels. Rev. Econ. Statist., 1990, 72, 321327.
Chan, L.K.C. and Lakonishok, J., Robust measurement of beta.J. Finan. Quant. Anal., 1992, 27, 265282.
Cooley, P., A review of the use of beta in regulatoryproceedings. Finan. Mgmt., 1981, 10, 7581.
Fama, E., The behavior of stock market movements. J. Bus.,1965, 38, 17491778.
Fama, E. and MacBeth, J.D., Tests of the multiperiod two-parameter model. J. Finan. Econ., 1974, 1, 4366.
Fielitz, B.D. and Smith, E.W., Asymmetric distributionsof stock price changes. J. Am. Statist. Assoc., 1972, 67,813814.
Francis, J.C., Skewness and investors decisions. J. Finan.
Quant. Anal., 1975, 10, 163172.Handa, P., Kothari, S.P. and Wasley, C., Sensitivity ofmultivariate tests of the capital asset pricing model to thereturn measurement interval. J. Finan., 1993, 48, 15431551.
Hansen, B.E., Autoregressive conditional density estimation.Int. Econ. Rev., 1994, 35, 705730.
Hansen, C.J., McDonald, J.B. and Theodossiou, P., Someflexible models for partially adaptive estimators of econo-metric models. Economics E-Journal, 2007.
Harrington, D.R., The changing use of the capital assetpricing model in utility regulation. Publ. Util. Fortnightly,1980, 105, 2830.
Harvey, C.R. and Siddique, A., Autogressive conditionalskewness. J. Finan. Quant. Anal., 1999, 34, 465487.
Harvey, C.R. and Siddique, A., Conditional skewness in assetpricing tests. J. Finan., 2000, 55, 12631295.
Johnson, N.L., Systems of frequency curves generated bymethods of translation. Biometrika, 1949, 36, 149176.
Klein, R.W. and Bawa, V.S., The effect of estimation risk onoptimal portfolio choice. J.Finan. Econ., 1976, 3, 215231.
Koenker, R. and Bassett, G., Asymptotic theory of leastabsolute regression error. J. Am. Statist. Assoc., 1978, 73,618622.
Koenker, R. and Bassett, G., Robust methods in econometrics(with discussion). Econometr. Rev., 1982, 1, 213255.
Mandelbrot, B., The variation of certain speculative prices.J. Bus., 1963, 36, 394419.
Manski, C.F., Adaptive estimation of non-linear regressionmodels. Econometr. Rev., 1984, 3, 145194.
Martin, R.G. and Simin, T.T., Outlier-resistant estimates ofbeta. Finan. Anal. J., 2003, 59, 5669.
McDonald, J.B. and Nelson, R.T., Alternative beta estimationfor the market model using partially adaptive estimation.Commun. Statist.: Theory Meth., 1989, 18, 40394058.
McDonald, J.B. and Newey, W.K., Partially adaptive estima-tion of regression models via the generalized T distribution.Econometr. Theory, 1988, 4, 428457.
McDonald, J.B. and White, S.B., A comparison of some robust,adaptive, and partially adaptive estimators of regressionmodels. Econometr. Rev., 1993, 12, 103124.
McDonald, J.B. and Xu, Y.J., An empirical investigation of thelikelihood ratio test when the boundary condition is violated.Commun. Statist.: Simul., 1992, 21, 879892.
McDonald, J.B. and Xu, Y.J., A generalization of the betadistribution with applications. J. Econometr., 1995, 66,133152, Errata 1995, 69, 427428.
McDonald. J.B., Michelfelder, R.A. and Theodossiou, P.,Robust regression methods and intercept bias: a capitalasset pricing model application. Working Paper, 2009.
Newey, W.K., Adaptive estimation of regression models viamoment restrictions. J. Econometr., 1988, 38, 301339.
Ramirez, O.A., Misra, S.K. and Nelson, J., Efficient estimationof agricultural times series models with non-normal depen-dent variables. Am. J. Agric. Econ., 2003, 85, 10291040.
Siegel, A.F. and Woodgate, A., Performance of portfolios
optimized with estimation error. Mgmt. Sci., 2007, 53,10051015.Theodossiou, P., Financial data and the skewed generalized T
distribution. Mgmt. Sci., 1998, 44, 16501661.Theodossiou, P., Skewness and kurtosis in financial data and the
pricing of options. Mimeo, Rutgers University, 2001.Value Line Investment Survey, Ratings and Reports, March,
April, and May 2004.Yohai, V.J. and Zamar, R.H., Optimal locally robust
m-estimates of regression. J. Statist. Plann. Infer., 1997, 64,309324.
Appendix A: Special estimation methods and
flexible density functions
As indicated previously, many well-known distributions
are nested within the four flexible distributions. Thus,
maximization of the above likelihood function gives, in
the special cases of OLS,
i,iOLS arg mini,i
XTt1
"2i,t
!,
Laplace, the LAD estimator (Koenker and Bassett 1978),
i,iLAD arg mini,i
XTt1
j"i,tj
!,
and the GED (for fixed k), the Lk estimator,
i,iGED arg mini,i
XTt1
j"i,tjk
!,
for an arbitrary but fixed value of k. The skewed
Laplace (SL), the trimmed regression quantile (Chan
and Lakonishok 1992), and the estimators of Koenker
and Bassett (1982) are given by
mini,i
XTt1
"i,t or mini,i
XTt1
1
1 sign"i,tj"i,tj,
where, if"i,t j"i,tj and "i,t 0,
"i,t 1 j"i,tj if"i,t50for0551 or 1551,
and are therefore nested within the flexible pdfs.
Therefore, by definition, the flexible pdfs are increasingly
accommodating of the characteristics of the empirical
distributions of stock returns than typically used pdfs.
Characteristics of the flexible probability
density functions
All of the following flexible pdfs nest the normal among
many other symmetric and asymmetric distributions.
Robust estimation with flexible parametric distributions 11
DownloadedBy:
[informa
internalusers]At:16:484February
2010
-
8/8/2019 RLS_beta
12/13
They have either four or five parameters that describe
their shapes in contrast to the normal that has two
parameters (mean and variance distributions) and there-
fore is less flexible. As shown below, they exhibit similar
performance in fitting an approximating pdf to data that
has skewness and kurtosis. The choice of one of the
following four is recommended below.
Skewed generalized T
The SGT is a five-parameter pdf (mean, standard
deviation, two kurtosis, and skewness parameters).
The and parameters generate the conditional means.
The SGT pdf is
fSGT";,, ,, k, n
C
j"= jk
n 2=k1 sign" kk
n1=k,
where
C k=2n 2=k1=kB1=k, n=k,
S ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
1 32 4A22p
,
k=n 21=kB1=k, n=k0:5B3=k, n 2=k0:5S1,
A B2=k, n 1=kB1=k, n=k0:5B3=k, n 2=k0:5,
2AS1, B() is the beta function, and are the
mean and standard deviation of", k controls the peakness
of the distribution, n controls the thickness of the tails,
50 generate negative skewness, and 40 generates
positive skewness.
The major nested distributions, or special cases of the
SGT, are as follows: 0 gives the GT of McDonald and Newey (1988) and
Boyer et al. (2003),
k 2 gives the ST of Hansen (1994),
k 2 and 0 give the Students T,
n ! 1 gives the SGED of Theodossiou (2001),
n ! 1 and 0 give the GED of Box and Tiao (1962),
n ! 1 and k 1 give the skewed Laplace (SLAD),
n ! 1, k 1 and 0 give the Laplace,
n ! 1, k 2 and 0 give the normal, and
n ! 1, k!1 and 0 give the uniform.
Standardized values for regression error skewness
and kurtosis in the ranges (1, 1) and (1.8, 1) can be
modeled with the SGT. However, not all combinations ofskewness and kurtosis are defined by the SGT.
Skewed generalized error
The SGED is a four-parameter distribution (mean,
standard deviation, kurtosis, and skewness). The SGED
pdf is
fSGED";, , ,, k C
exp
j"= jk
1 sign" kk
:
where
C k=21=k,
2AS1,
1=k0:53=k0:5S1,
S
ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi1 32 4A22
p,
A 2=k1=k0:5
3=k,
and k controls the peakness of the distribution, 50
generates negative skewness, and 40 generates positive
skewness. The nested distributions, or special cases of the
SGED, are
0 gives the GED,
k 1 gives the SL,
k 1 and 0 give the Laplace,
k 2 and 0 give the normal, and
k ! 1 and 0 give the uniform.
Skewed exponential generalized beta of the second kind
The SEGB2 is a four-parameter pdf (mean, standarddeviation, and two joint skewness/kurtosis parameters).
The SEGB2 pdf is
fSEGB2";, , ,p, qCep="=
1 ep="=
pq,
C 1=Bp, q,
p q,
1=ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi0p 0q
p,
where p and q are positive scaling constants, B(p, q) is the
beta function, and (z) d ln (z)/dz is the psi function
and psi-prime; the first derivative of the psi function is
known as the digamma function. A smaller value ofp results in a more leptokurtic pdf, q4p reflects negative
skewness, and q5p reflects positive skewness. When
p q, the symmetric EGB2 obtains, and when p q ! 1
the normal distribution obtains.
Inverse hyperbolic sine
The IHS is a four-parameter pdf (mean, standard
deviation, skewness, kurtosis). The pdf is
fIHS";,,,, k kffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
22 "2=2 2p
exp k2
2ln = ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi2 "2=2 p ln 2 ,
w sinh z=k,
1=w,
w 1
21 e4jj 2 e2jjk
2
1=21 ek2
1=2ejjk2
,
w=w,
w sign1
21 e2jj ejj1=2k
2
,
where a smaller k results in a more leptokurtic pdf,
50 indicates negative skewness and 40 indicates
12 J. B. Mcdonald et al.
DownloadedBy:
[informa
internalusers]At:16:484February
2010
-
8/8/2019 RLS_beta
13/13
positive skewness. 0 gives the symmetric IHS. 0
and k ! 1 give the normal distribution.
Appendix B: Scaling the standardized error
term for simulations (r ut)
Given that
R2 1 SSE=n
SST=n,
for the simple bivariate regression model,
SST
n
1
n
Xnt1
yt "y2
1
n
Xnt1
xt "x ut "u2:
For OLS estimation and the related orthogonality
conditions, SST/n can be rewritten as
SST
n
1
n 2 Xn
t1
xt "x2X
n
t1
u2t" #:
Thus, for large n and stationary x, R2 approaches
R2 1 2
2varx 2
2varx
2varx 2and 1 R2
2
2varx 2:
Therefore,
2 1 R2
R2
2varx:
Given the variance of the xs used in the simulation,
a beta equal to 0.21, alpha equal to 0, a desired R2 of 0.04,
180 randomly generated observations similar to the 180
observations in the empirical estimations, and a standard
deviation of 2.1468 of x are the parametric assumptions
based on the means of the empirical results. The
corresponding value of can be calculated which is
multiplied by the standardized error terms (uihas a mean
of 0 and variance of 1 using the standard normal, scaled
mixture of normals, and scaled and shifted log-normal
variables).
Robust estimation with flexible parametric distributions 13
DownloadedBy:
[informa
internalusers]At:16:484February
2010