long-range forecasting from crystal ball to …forecastingprinciples.com/files/lrf-ch8b.pdf ·...

58
LONG-RANGE FORECASTING From Crystal Ball to Computer

Upload: lydieu

Post on 20-Feb-2018

216 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

LONG-RANGE FORECASTINGFrom Crystal Ball to Computer

Page 2: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Eight

ECONOMETRICMETHODS

Contents

Conditions Favoring the Use ofEconometric Methods 193

A Priori Analysis 194

Causal Variables: Less Is More 195

Direction of Relationships 198

Pattern of Causality 199

Functional Form 200Magnitudes of Relationships 202A Priori Model 203

Stepwise Regression 204Objective Data 205

Multiple Data Sources 210A Priori Analyses of the Data 214

INTERMISSION

Analyzing the Data 217Experience Tables 217Regression Analysis 219Value of Simplicity 225

Updating the Estimates 232Estimating Current Status 235Forecasting Change 238

Mitigation 238Reducing the Errors 240

Combined Forecasts 242

Assessing Uncertainty 243Summary 245

191

Page 3: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

192 Econometric Methods

As soon as I could safely toddleMy parents handed me a Model.My brisk and energetic paterProvided the accelerator.My mother, with her kindly gumption,The function guiding my consumption;And every week I had from herA lovely new parameter,With lots of little leads and lagsIn pretty parabolic bags.

With optimistic expectationsI started on my explorations,And swore to move without a swerveAlong my sinusoidal curve.Alas! I knew how it would end:I've mixed the cycle and the trend,And fear that, growing daily skinnier,I have at length become non-linear.I wander glumly round the houseAs though I were exogenous,And hardly capable of feelingThe difference tween floor and ceiling.I scarcely now, a pallid ghost,Can tell ex-ante from ex-post:My thoughts are sadly inelastic,My acts incurably stochastic.

"The Non-Econometrician's Lament"Sir Dennis Robertson (1955)*

I have some good news and some bad news. The good news is that thischapter is fairly simple, thanks to research done over the past quartercentury. The bad news is that some people are going to be upset bythe research. As a matter of fact, this research was hard on me emo-tionally because it indicated that many of the complex econometrictechniques that I had learned were of little practical value in fore-casting. Simplicity is a virtue in econometric methods.

In this chapter, advice is given on how to use econometric methodsin forecasting. As defined here, "econometric methods" includes allquantitative procedures that use causal relationships. Chapter 8 dis-

*Used by permission of Macmillan London and Basingstoke.

Page 4: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Conditions Favoring the Use of Econometric Methods 193

cusses what you should do, what you should not do, what is worthfurther study, and where you can save time.

A brief discussion is first presented on conditions that favor the useof econometric methods. This is followed by a description of a priorianalysis, the first stage in developing an econometric model. In effect,a priori analysis involves the use of judgment.

Sources of objective data are examined, and a checklist is providedto help in the search for data. The advantages and disadvantages ofthe various types of data are also discussed. An intermission is thenprovided because previous readers have said that the chapter is toolong to read at one sitting.

Methods are described for analyzing objective data. These includeexperience tables and regression analysis. Particular attention is givento the problems involved in the use of regression analysis.

The estimates from objective data are used to update the a prioriestimates. These estimates are then used in the two major types ofproblems: estimating current status (or current level) and forecastingchange.

The chapter concludes with brief sections on the use of combinedforecasts and on methods for assessing uncertainty.

This chapter describes many steps. Exhibit 8-1 is presented in anattempt to clarify its outline. It lists the major steps in the use ofeconometric methods.

Actually, Exhibit 8-1 oversimplifies matters. Although the processesare similar for estimating current status and for forecasting change,the models themselves may differ substantially. In other words, thedecisions in the first four boxes depend on whether you are trying toestimate current status or to forecast change.

CONDITIONS FAVORING THE USE OF ECONOMETRIC METHODS

Three basic conditions favor the use of econometric methods. The firstone is obvious: good information is needed on causal relationships. Thisinformation can come from subjective sources or from the analysis ofobjective data.

The second condition favoring econometric methods may not be ob-vious, because many econometricians ignore it: econometric methodsare appropriate for large changes (or large differences) in the causalvariables. For small changes, there is little reason to use econometricmethods. Surprisingly, most econometric forecasting; has involved small

Page 5: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

194 Econometric Methods

Exhibit 8-1 STEPS IN DEVELOPING AN ECONOMETRIC MODEL

Estimate current status

Develop a priori model

Select data

Analyze data

Update model

Forecast change

changes with the result that the method usually seems to be of littlevalue (see Chapter 15).

The third condition, or set of conditions, is that the direction ofchanges in the causal variables can be accurately predicted, and thatreasonable estimates can be made of the magnitudes of these changes.

A PRIORI ANALYSIS

Researchers and managers have useful information about the world.This information may come from experience. Better yet, it may comefrom previous empirical research, engineering studies, legislation, sur-veys of experts, and experiments. A priori analysis provides an efficientway to use this information. As noted in Chapter 4, it is better to utilizethis information before analyzing the objective data than after.

A priori analysis comes in many different forms. Subjective decisionsare required in the selection of data. Once the data are selected, ad-ditional analyses are possible as shown in Exhibit 8-2. The invertedtriangle is designed to emphasize the importance of the items at thetop of the triangle. The items in level A are always important in fore-

Page 6: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

A Priori Analysis 195

Exhibit 8-2 A PRIORI ANALYSIS IN ECONOMETRIC METHODS

Level A

Select causal variables.Specify signs of relationships.

Level B

Specify pattern of causality.

Specify functional form.Specify magnitudes of relationships.

Level C

Adjust for bias in data.Weight the observations.

casting; those in level B are often important; and those in level C aresometimes important. Within a level, it is difficult to distinguish gra-dations of importance.

Causal Variables: Less Is More

The first step in the development of an econometric model should bethe selection of the causal variables. Here there are two schools ofthought. One favors the exploratory approach, where the analyst useslittle judgment and, instead, develops a long list of potentially usefulcausal variables. At the other end of the continuum, a priori analysissuggests that a small number of causal variables be selected. (Evidenceagainst the exploratory approach was presented in Chapter 4.)

It is important to select the proper conceptual variables. For ex-ample, I find that a model using market size, ability to buy, consumerneeds, and price serves well for most problems in sales forecasting.

Page 7: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

196 Econometric Methods

These conceptual variables provide a guide for the selection ofoperational' variables.

One way to determine which variables are important is to identifythe actors in the situation and then outline the decision process usedby each actor. In economics, this has led to a consideration of supply(producer decisions) and demand (consumer decisions). The stakeholderapproach (LRF pp. 16, 28-29) may be useful in carrying out this anal-ysis of key actors.

To help in the selection of a small number of relevant variables foruse in a forecasting model, the analyst should first develop a long andcomplete list. Ideas on potential variables can be obtained from:

Previously published studiesExperts in this problem area

Brainstorming also may be useful; it may uncover variables that hadpreviously been ignored.

The second step is to use the following criteria to rate explicitly eachof the variables:

Is there a strong causal relationship (i.e., do changes in the causalvariable cause substantial changes in the dependent variable)?Can the relationship be estimated accurately?Will the causal variable show a substantial amount of change inthe forecast period?Can the changes occurring in the forecast period be forecast withfair accuracy?

These rules are summarized in Exhibit 8-3 for emphasis and to makeit easier for the reader to find this list when you need it. These questionsare necessary conditions. Should a variable score a "no" on any of thesequestions, it will be of little value in forecasting. Using these criteria,the researcher can screen out the less important variables. The re-maining variables can be ranked according to an index using thesecriteria. Just what type of index is used does not seem to be crucial;what is important is that the researcher makes a systematic effort toselect a small number of important variables. I also like the advice ofMOSTELLER and TUKEY 11977, pp. 270-2711 to chose variables thatare "reasonably presentable" and will avoid "hilarious newspaper col-umns."

By following the rules in Exhibit 8-3, you will usually find manyindicators for each given conceptual variable. Select only one indicator

Page 8: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

A Priori Analysis 197

Exhibit 8-3 CHECKLIST FOR SELECTING CAUSAL VARIABLES

Is a strong causal relationship expected?

Can the causal relationship be estimatedaccurately?

Will the causal variable changesubstantially?

Can the change in the causal variable beforecasted accurately?

Yes No

for each conceptual variable. The selection among these operationalvariables is typically of little importance. Unfortunately, this is anarea where I wasted much time in the past; I had operated under thedelusion that the choice of the operational measure was important.Despite the claims in the econometric literature, I have been unableto find empirical evidence that the choice of an operational measurehas much impact on forecast accuracy. Furthermore, I have failed whensearching for the best measure in my own studies (e.g., in Armstrong,1968a, different measures of buying power were shown to yield com-parable results).

Rather than selecting the best measure, use a combined estimatefrom a set of reasonable indicators. Although this procedure may in-crease costs slightly, it provides a safer approach to forecasting. Suchan approach was useful in my study of the photographic market, asdescribed here:

In Armstrong (1968a), the combination of different estimates ofcurrent camera sales led to substantial gains in accuracy; thecombination of different estimates of camera prices led to modest,but statistically significant, gains; and although the combinationof alternative estimates of buying power led to no gains, it didnot hurt forecast accuracy.

The advice to use a small number of causal variables in the analysishas also been a difficult one for me to accept. I falsely convince myself

Page 9: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

198 Econometric Methods

that certain variables are really important, as illustrated here in mystudy of the photographic market:

In ArMstrong (1968a), data from 11 variables were used to explaindifferences among countries. When this model was used to predictcamera sales in 11 "new" countries, the mean absolute percentageerror (MAPE) was 31%. An alternative model was developed usingonly two variables. This simpler and less expensive model wasalso used to predict camera sales in the 11 new countries; itsMAPE was 27%. There was no significant difference between theforecast accuracy of the two models.

AM*

This failure to gain accuracy from additional variables is consistentwith the more general evidence presented in Chapter 4. Additionalevidence was provided in finance by MOYER [1977] and in sociologyby Glaser (1954). Glaser's study was an exception to the work done inforecasting by criminologists over a period of 20 years. He was one ofthe first to advocate theoretically oriented research rather than thestatistical manipulation of available data. He provided evidence thata small number of theoretically selected variables yields better fore-casts than a large number of variables selected for statistical reasons.INCIARDI 119771 says that Glaser's advice is still ignored in crimi-nology.

Direction of Relationships

The a priori information should be strong enough so that the analystcan state the sign of the relationship. In other words, given a changein the causal variable, what will be the direction of change in thedependent variable? If this is not possible, then this causal variableshould not be used. This stage in the a priori analysis is a crucial one.

As an example of a variable that is subject to contrary yet reasonablehypotheses on direction, consider the uses of the consumer stock of adurable good. One might expect sales to be high when the stock is high.(If the stock is high, people are in the habit of using the product. If,in addition, the product gets broken or worn out, as happens withautomobiles, a need arises for replacement sales.) On the other hand,one might expect sales to be low when the stock is high because con-sumers have less need to purchase what they already own.

Page 10: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

A Priori Analysis 199

Pattern of Causality

Rather than treating Y as a direct function of the causal variables,one can describe causality in detail. This has been done by using causalchains and simultaneous equations.

Causal chains provide an apparently useful way to decompose prob-lems. Although the number of equations increases, the overall systemis easy to understand. The results from the first equation are used asinputs to the second equation; the results from the second equationare entered into the third equation; and so on. An example is drawnhere from my study of the photographic industry:

In Armstrong (1968b), a model was developed to make long-rangeforecasts of camera sales by country. The first stage in the causalchain was an econometric model to predict prices of cameras. Itused taxes, resale price maintenance, and quotas as predictors.These price predictions were then entered into a model that pre-dicted camera sales per capita as a function of personal con-sumption expenditures per capita and prices of cameras. The thirdstage was to predict the market size as a function of total popu-lation, literacy, age, and employment. Finally, the fourth stagecalculated total camera sales on the basis of the predictions ofcamera sales per capita and market size.

Rather than increasing complexity, causal chains help to organizea problem into more manageable pieces and to illustrate what is hap-pening. This method can turn a complex argument into a series ofsimple arguments. Using causal chains involves little danger, and itprovides the benefits of decomposition.

A popular activity among econometricians has been to develop com-plex approaches to represent "simultaneous causality." Simultaneouscausality occurs when a causal variable, say X, causes a change in Y,and the change in Y, in turn, causes a change in X. For example, asthe price of automobiles is lowered, sales increase; the increase in salesthen permits economies of scale that allow for further decreases inprice. This is the interaction that made Henry Ford wealthy. Bennion(1952) provides an interesting discussion of simultaneous causality.

To represent simultaneous causality, econometricians have used si-multaneous equations. That makes sense. Unfortunately, despite great

Page 11: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

200 Econometric Methods

expenditures of time and money by the best and the brightest econo-metricians, simultaneous equations have not been found to be of valuein forecasting. This picture may change . . . but I doubt it. The failuresof simultaneous equations have been noted for more than a quartercentury. Christ (1960) said that, although simultaneous equations couldsometimes be shown to do better on artificial data, he could find noreal-world examples. L'Esperance (1964) tried but found no advantagefor simultaneous equations over causal chains in an agricultural fore-cast.

If a priori information about the relationships were strong and ifaccurate data existed, then perhaps simultaneous equations would beof value. This is speculation. In the typical case, objective data mustbe used to analyze the relationships, and these data are prone to mea-surement error. The effects of measurement error may be greatly mag-nified by simultaneous equations. Stone (1966, p. 29) concluded thatsimultaneous equations are of little value if the causal variables aresubject to large measurement errors. Summers (1965), Houthakker andTaylor (1966, p. 7), and Evans (1974, pp. 171-173) concluded thatsimultaneous equations were of little value in forecasting. I have beenunable to find recent evidence on this issue.

Many have tried but few have succeeded. The moral of the researchto date: forget simultaneous equations! This will save both of us a lotof time and energy. Many organizations currently follow this adviceand rely on ordinary least squares regression [NAYLOR, 19811.

Functional Form

The selection of a functional form', that is, the way the dependentvariable relates to the causal variables, is of little importance for smallchanges. For large changes, however, this can become an importantdecision.

Some researchers have recommended the use of nonlinear models.This does not pose a major problem if the model can be transformedinto equivalent statements that are "linear in the parameters." If sucha transformation' is not possible, I'd suggest that you avoid the non-linear model. To date, I have been unable to find a published casewhere nonlinear models increased forecast accuracy.

A commonly used functional form in economies is the multiplicativemodel:

Y = aX'PX/P Xit),'

Page 12: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

A Priori Analysis 201

where the exponent b is interpreted as an elasticity", and y designatesthe number of variables. This form has a number of advantages:

It can be made linear in its parameters. This means that you canuse a standard regression program after taking natural logs (In) ofboth sides of the equation:

In Y = ln (a) + b, ln X, + b2 In X2 b In X

Thus it is often referred to as the loglog model. (Incidentally, thelogs can be to the base 10, the base e, or any other base.)This is a reasonable way to represent much human behavior. Con-stant elasticities are assumed; roughly speaking, this says that a1% change in X will lead to a given percentage change in Y, whichis equal to b. (This interpretation of elasticities provides a goodapproximation over a large range.)It lends itself easily to a priori analysis because the researcher doesnot have to worry about units of measurement when specifyingrelationships. This advantage is an important one for areas thathave been well studied; elasticities from different studies can becompared.It often makes more efficient use of the data because it may correctfor heteroscedasticity". In simple terms, if the errors in predictingY are proportional to the magnitude of the X's (big errors are foundwhen X is large), the loglog transformation will reduce the effectof these large errors. I have not found any evidence to suggest thatthis is an important problem in forecasting.

Serious disadvantages of the loglog model include its requirementfor ratio-scaled data and its sensitivity to measurement error.

In economic problems and in many psychological problems, the mul-tiplicative model is regarded as the standard. There is even some evi-dence for this view. O'Herlihy et al. (1967), for example, found themultiplicative model to be superior to the additive model in forecastingsales of consumer durables. Nevertheless, consideration of the advan-tages and disadvantages of the multiplicative model in a specific sit-uation may lead one to depart from this standard. For example, ifmeasurement error is substantial, a simple additive form may be pre-ferred. In general, departures from the multiplicative model should bein the direction of the simple additive model. Ward found that complexfunctions were not helpful in forecasting:

+

Page 13: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

202 Econometric Methods

Ward (1954) used four functional forms (e.g., parabolic) to predictsuccess in five types of aviator training (e.g., flight simulation).More complex functional forms yielded poorer accuracy in cross-validation.

Fancier functional forms have gained enormous popularity in someacademic disciplines. The topic has been of interest for many years inmarketing. For example, the logitG transformation has been widelyused. Based on the evidence to date, these functional forms have notproduced significant gains in accuracy (e.g., see REIBSTEIN andTRAVER11982] and SHOCKER and SRINIVASAN119791). Note, how-ever, that WILTON and PESSEMIER [19811 obtained promising re-sults.

Magnitudes of Relationships

The researcher can use her own judgment or the judgment of otherexperts in specifying magnitudes for the causal relationships. Also, itis likely that studies have been published for analogous situations.Previous studies are especially plentiful for economic problems. Forexample, in my work on the worldwide photographic industry, sourcessuch as Harberger's (1960) studies of consumer durables were useful.Often there is a literature review that fulfills this need (e.g., Burk,1968, on food consumption; Domencich and McFadden, 1975, on trans-portation; Salvendy and Seymour, 1973, on personnel; Cartter, 1976,on education; ARCHIBALD and GILLINGHAM11980], on gasoline de-mand).

The a priori specification of magnitudes is especially important when:

Information on causality is strong,Large changes are involved, andLittle information is available from objective data.

When these conditions do not hold, a priori specification of magnitudeis expected to be of little value.

The alternative to using a priori information for deriving weightsis to assume that you have no information on magnitudes. The impli-cation then is that each relationship should be weighted equally. Thisis a simple procedure and no information exists to suggest otherwise.The simplest way to weight equally is to use unit weights'.

Page 14: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

A Priori Analysis 203

Unit weights are appropriate when scaling is not a factor. This maybe the case when the causal variables are unit-free, or when the pre-dictions involve rankings rather than magnitudes (i.e., ordinal ratherthan interval or ratio data).

The advantages of unit weights over unrestricted a priori weightsare minor. Using unit weights provides an alternative approach thatoffers two benefits:

The unit weights model can be used as a baseline for comparison.If the unrestricted weights yield predictions of comparable accuracy,this provides evidence that precise estimates of magnitudes are notrequired.The unit weights model provides estimates of relationships that canbe combined with those from the unrestricted weights. This is partof an eclectic strategy.

Little research is available to test the value of unit weights relativeto unrestricted a priori weights.

A Priori Model

With the completion of the steps involved in levels A and B of Exhibit8-2, you have a completely operational model to use in forecastingchange. Although this extent of a priori analysis is rarely achieved inpractice, published reports include an economic study by Theil andG-oldberger (1961), a forecast of the demand for Ph.D.s by Cartter (1965),a forecast of bank assets by SMITH and BRAINARD [1976], and pre-dictions of the heights of five-year-old boys in Iowa by Ehrenberg (1975).An example of an a priori model was provided in my study of thephotographic market:

Armstrong (1968a) developed an a priori model that could be usedto forecast the changes in camera sales by country:

(Ct,h)-14Irt+h = (1.02)h (Mt±h)10 (It+h \ 13Y, \M) ) (7, f

where t = the current yearh = the number of years in the futureY = camera sales (units)

Page 15: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

204 Econometric Methods

M = the market size (the number of literate adults)/ = a measure of people's ability to buy (real personal

consumption expenditure per capita)C = the cost to consumers of camera goods of constant

quality

This model stated that the percentage change in camera salescould be predicted using changes due to quality improvementsand other excluded variables (2% per year growth), and to thepercentage changes in market size, income, and camera prices.Roughly speaking, a 1% increase in income would lead to a 1.3%increase in sales, while a 1% decrease in price would lead to a1.4% increase in sales.

This a priori analysis was used in obtaining backcasts' of thechange in camera sales from 1960 to 1954. These backcasts weremore accurate than those from a model assuming no change (MAPEsof 30% and 67%, respectively), and also more accurate than anextrapolation model based on a no-change component and a com-ponent assuming a constant trend within each country. (The MAPEfor this combined extrapolation was 43%.)

STEPWISE REGRESSION

Some analysts prefer to skip the a priori analysis and go directly tothe data. They let the data speak for themselves. Why do they ignoreall that good advice about a priori analysis? There are at least twoexplanations: one is related to technology, and the other to publicationstandards.

Computers have reduced the cost of exploratory research. Stepwiseregression procedures allow one to search through many possible pre-dictor variables in order to find the "best" model. By contrast a priorianalysis is slow and subjective. Unfortunately, the temptation to trustthe computer is great.

Publication practices by academic journals may also inhibit a priorianalysis. It is difficult to present the a priori analysis clearly andsuccinctly, not to mention that this part of the paper seems subjective.Some journals prefer to give the impression that science is untouchedby human hands.

Setwise regressionG, where the analyst decides what possiblegroupings of variables make sense, then tests the various groupings,

Page 16: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Objective Data 205

is more sensible, or should I say less dangerous. PEDHAZUR [1982,pp. 164-1671 decribes it in more detail. (He calls it "blockwise regres-sion.") Williams and Linden (1971) offer a computer program for set-wise regression.

MORRIS 119811 presents stepwise procedures (and a computer pro-gram) to maximize predictive validity in a cross-validated sample. I'mskeptical about this procedure.

If you insist on using stepwise regression, do not use traditionaltests of statistical significance. McINTYRE et al. 119831 provide tablesthat reflect the number of variables examined, rather than the numberused in the final model.

OBJECTIVE DATA

Although a priori information is often valuable, it also contains theerrors of the past. Sometimes our folklore on causality is incorrect. Itis useful, then, to obtain objective data to verify as well as to improveupon our a priori information. This section describes the types of datathat can be used, both experimental and historical. The value of mul-tiple data sources is also examined. Finally, consideration is given toa priori analysis of the data.

One major distinction is the use of experimental vs. historical(nonexperimental") data. With experimental data, the effects of eachvariable can be isolated for study instead of trusting to fate, as withhistorical data. Furthermore, the effects of large changes can be as-sessed in situations where historical data may not exist. Numeroussources describe how to design experiments (e.g., Boyd and Westfall,1981).

Laboratory experiments are especially advantageous for the assess-ment of large changes. The controls are more exact, enabling one tointroduce and monitor change. Also, confidentiality can be maintained.

Field experiments are advantageous because they are more realisticthan laboratory experiments. On the other hand, they are more ex-pensive, it is difficult to bring about the intended changes, and confi-dentiality is usually sacrificed. COOK and CAMPBELL119791describeprocedures for field experiments.

Historical data are advantageous because they are economical andrealistic. For this reason, most econometric work is done with suchdata. Blalock (1964) discusses the problems and opportunities in tryingto assess causality from historical data. The more important problemscan be summarized as follows:

Page 17: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

206 Econometric Methods

Objective data may be limited.The data may not be accurate.The observationsG may not be independent of one another. (Fortime series this is called autocorrelation, but it can also occur forcross-sectional data. For example, the subjects in an experimentmay talk to one another, so each subject's action is not independent.)The factors may not vary. To assess relationships, it is necessarythat both the dependent and the causal factors vary. The morevariation there is, the easier it will be to measure the relationship.The causal factors should vary independently of one another. Cor-relation among the independent variables, called multicollinearityG,makes it difficult to determine which of the causal factors shouldget the credit for changes in the dependent variable. In effect, mul-ticollinearity introduces uncertainty into the estimates of relation-ships.The measurement process may have affected the results. This istypical for data collected in organizations; the measures may becomegoals and cause people to act differently.

The preceding criteria (except for item 6) can be examined by con-sidering their effects upon regression analysis. The variance of a coef-ficient in a regression model is

Var (YIX) Var (YIX)Var (b)v E,D= - TO2 VVar X VT3

where D is the number of historical time periods.This equation says that uncertainty in the estimate of the regression

coefficient I Var (b)1. is due to variations in Y that are not explained bythe X's, or to a failure to secure a large number of observations thatdisplay wide fluctuations. Therefore, to reduce the variance of the es-timate of the coefficient (i.e., to increase the reliability of the estimate),one can:

Increase the denominator by increasing the spread (Xd X); thatis, find data sets with large variations in each causal variable. Thisis especially useful if the variation is centered around the likelyvalues expected during the forecast horizon (see Daniel and Heer-ema, 1950); this may require experimentation.Increase D, that is, increase the sample size of the historical data.Decrease Var (YIX); that is, reduce the variation in Y that is dueto factors other than X by:

=(Xd

Page 18: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Objective Data 207

Choosing more homogeneous sample observations (avoid er-rors, windsorize, define the sample in accordance with well-defined criteria),Ensuring that "other factors" do not vary (experiment),Measuring the other factors and including them in the mea-surement model,Grouping observations and averaging (e.g., use a 4-year mov-ing average), andObtaining multiple measures for each factor and averagingthem.

In summary, the ideal data set should provide many independent ob-servations in which the variables of interest (dependent and causal)show wide variations, and the effects of all other causal variables aresmall.

The problem presented by the interaction between the measurementand the behavior might be assessed experimentally. If this is not pos-sible, you might try to obtain data in ways where the measurement isnot obvious to people. One example: to estimate how many people havelooked at an exhibit in a museum, one could measure the amount ofwear in the floor covering in front of the exhibit. Art argument in favorof the use of such unobtrusive dataG and a collection of clever ex-amples can be found in Webb et al. (1973).

What should you do in the typical case where the forecasting budgetdoes not allow for experimentation and none of the data approachesthe ideal standards? You should use eclectic research and obtain dif-ferent types of historical data! Find data that suffer from differentdefects in the hope that the defects will compensate for one another.

To help in the search for different types of data, I have provided the"Data Matrix" in Exhibit 8-4. The rows represent decision units suchas people, organizations, or countries. The columns represent differenttime periods (days, weeks, months, quarters, or years).

Three types of data are illustrated in Exhibit 8-4. These data aredescribed here, and consideration is given to their advantages anddisadvantages in typical situations.

1. Time series' data take a given decision unit (e.g., c) and examineit at different points in time. For example, c could represent theU.S. photographic industry from 1960 to 1978. This type of data hasbeen used in economics since 1862 (Zarnowitz, 1968). Time seriesdata suffer from numerous problems. In many cases, there are fewobservations. In most cases, there is a lack of variation in the data,substantial measurement error, interactionG, autocorrelation, and

Page 19: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

208 Econometric Methods

Exhibit 8-4 THE DATA MATRIX

Longitudinal

multicollinearity. (These problems are discussed below in more de-tail.) So what do these data have in their favor? Well, they generallyprovide cheap, fast, and realistic information that is especially use-ful for estimating current status.

2. Cross-sectional data' take a given period of time (e.g., period 2)and examine differences among decision units. Such data have beenused in economics since 1883 (Zarnowitz, 1968). The advantages ofcross-sectional data over time series data are that larger variationscan generally be found in the dependent variable and in the causalfactors, there is less multicollinearity, and there is greater inde-pendence among the observations. The big disadvantage is the lossin realism for situations involving predictions of the future. Hereis an example of the analysis of cross-sectional data:

In Armstrong (1968a), household survey data for the United Stateswere used to obtain an estimate of income elasticity for camerapurchases. The following estimate was obtained:

Y = (0.0000124)(1)"

DecisionUnits 1 2 3

Time Periodst

a

b

, Time seriesTo-

---1-___

----I-

ö

i,ai

j,-",',..

1 . c.)__,

.o I

I- 1

r -r

n _I

Page 20: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Objective Data 209

where Y is the dollar expenditure per potential buyer for pho-tographic goods (per year), and I is the average income in theparticular income category (per year). The constant term is merelya scaling factor. The income elasticity, L5, is in line with priorexpectations.

3. Longitudinal data" take a sample of decision units (e.g., a throughn) and examine changes over time (e.g., the difference betweenperiods 2 and t). The use of this type of data is more recent thanthe use of time series and cross-sectional data. Longitudinal datagenerally cost more to obtain, but they provide one big advantageover time series data: each observation serves as its own control.In other words, the unique aspects of each decision unit are constantover time and are therefore less likely to enter into an explanationabout changes. An early example of the analysis of longitudinaldata is provided by Bandeen (1957). Another is drawn from my studyof the international photographic market:

In Armstrong (1968a), longitudinal data from 2.1 countries wereexamined to obtain estimates of price and income elasticity:

(Y64 65)= (1.02)d G6640 66)Y60 61

- 0.4(C64 65

C60 61

where the constant term 1.02 was fixed a priori to represent tech-nological change, and

Y = rate of unit camera salesper potential buyer (per year)

d = number of years between initialand final data

I a measure of incomeC = price of camera goods (as estimated

from changes in price ofidentical cameras in annual Sears catalogues)

64-65 the average for the years 1964 and 1965,60-61 = the average for the years 1960 and 1961.

=

=

Page 21: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

210 Econometric Methods

In other words, each observation consists of the change in camerasales for a given country from the beginning to the end of thehistorical period, and the corresponding changes in income andthe price of cameras for that country. The regression analyzesdifferences among these country change scores.

Multiple Data Sources

A listing of the various types of data is presented in Exhibit 8-5, alongwith a rating of their advantages and disadvantages. For laboratoryexperiments, only cross-sectional data are examined because it is un-usual to obtain longitudinal or time series data from such experiments.The ratings required much subjectivity, and a difference of one pointis not crucial. You may even prefer to insert your own ratings to assessthe data for a particular problem. The ratings are relevant withincolumns but should not be added across columns (unless one has de-veloped measures of importance for each column).

Exhibit 8-5 can be used after the analyst has identified the mostimportant criteria. The analyst can then see how well each type ofdata meets the important criteria by examining the columns in Exhibit8-5. Because some types of data are strong on certain criteria and weakon others, it is likely that more than one type of data should be usedto "supplement" the weak areas.

The importance of finding different types of data is illustrated bythe attempt to determine whether a causal link exists between smokingand cancer. Initial studies were criticized for numerous weaknesses.Over many years of study, however, a smoking-cancer relationship hasbeen identified from cross-sectional data, using comparisons across peo-ple and across countries. It has been demonstrated by time series data,and by longitudinal data where specific people or other animals werefollowed across time. The results of these analyses show that the re-lationship holds for different populations, different geographical re-gions, and different times. (But some people still do not believe that acausal relationship exists. They are generally smokers or workers inthe cigarette industry. For most of them, there is no information thatcould possibly change their minds. If you do want evidence, you cango as far back as Cornfield, 1954.)

Studies that have combined different types of data go back manyyears. Among them are:

Page 22: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Exh

ibit

8-5

RA

TIN

GS

OF

DA

TA

FO

R E

CO

NO

ME

TR

IC M

ET

HO

DS

(1 =

Unf

avor

able

and

5 =

Mos

t Fav

orab

le)

Lab

orat

ory

expe

rim

ent

Cro

ss-s

ectio

nal

33

55

55

55

1

Fiel

d ex

peri

men

tC

ross

-sec

tiona

l2

23

34

55

44

Lon

gitu

dina

l1

14

24

53

45

Tim

e se

ries

11

22

44

34

5

His

tori

cal D

ata

Cro

ss-s

ectio

nal

55

11

33

33

3

Lon

gitu

dina

l5

43

11

11

25

Tim

e se

ries

55

11

11

11

5

Mea

sure

men

t Con

fide

n-M

ulti-

Inde

pend

ent

Inte

r-T

ype

of D

ata

Spee

d C

ost

Err

ortia

lity

Var

iatio

n co

lline

arity

Obs

erva

tions

act

ion

Rea

lism

Page 23: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

212 Econometric Methods

Goreux (1957) in making long-range projections of food con-sumption, Kuh (1959) in a study on macroeconomics, Cook andSelltiz (1964) in studies on attitude measurement, Houthakker(1965) in an international study of consumer expenditures, Stout(1969) in studying the price elasticity of a specific beverage, andMize and Ulveling (1970) in a study of air travel.

The eclectic use of data is expected to be especially helpful in sit-uations where:

Uncertainty is high, andChanges are large.

These conditions occur often in the social sciences. The limited evidenceavailable suggests that there are small benefits for short-range fore-casting, that the gains for medium-term forecasting are modest, and,surprisingly, that the gains for long-range forecasting are also onlymodest. Overall, the eclectic use of data led to increased costs andprovided only small gains in accuracy, as shown in the following stud-ies, all of which were done some time ago:

Schupack (1962) found that a combination of time series and cross-sectional data did not yield more accurate forecasts than thoseobtained from time series data in 1-year forecasts for 32 fooditems. In fact, the combination did slightly poorer.

Armstrong (1968a) used various types of data to obtain backcastsof the photographic market. The accuracy from this approach wascompared with that from a single set of data. The combinationapproach used:

Cross-sectional data from 30 countries (averaged over1960-1965)Longitudinal data from 21 countries (1964-1965 to 1960-1961)Cross-sectional data on 1,200 U.S. households (aggregated bysix income categories)

Each of the data sets had its own advantages and its own limi-tations. The household data were useful because large variations

Page 24: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Objective Data 213

in income were found and the quality of cameras offered for salewas held constant; however, the lack of fluctuations in price pre-vented an estimate of price elasticity. Cross-sectional data bycountry provided large differences in both price and income, butother factors that caused differences among countries could notbe included. The longitudinal data across countries used eachcountry as its own control, thus avoiding many of the factors thatcaused variation among countries. The longitudinal data sufferedfrom collinearity between price and income and from large mea-surement errors in the causal variables. The combination andlongitudinal-data-only models were compared in an unconditionalbackcast for 17 countries for 1954 (assuming that nothing wasknown before 1960). The combination model was slightly moreaccurate with a MAPE of 21% vs. 26% for the longitudinal model.

Ferber (1969) reported that one of his thesis students, Abo-Baker,found a combination cross-section and time series approach to bebetter than one using only time series data or one using onlycross-sectional data in forecasting expenditures for new and usedcars. This finding was not upheld in his forecast of consumptionexpenditures for public transportation.

Lyon (1969) used aggregate time series data for 1935-1961 and11 different cross-sections over states (using each year from 19511961) in developing a model to forecast U.S. cigarette consump-tion over a five-year period from 1962-1966. The combinationmodel was superior (in four of the five years) to forecasts generatedfrom a model based only on time series.

Laughhunn (1969, Chapter 5) compared models developed fromcombinations of cross-section and time series, with models fromtime series only. Conditional forecasts' were made for a one-to five-year horizon for nine categories of consumer expenditures.For the one-year forecasts, the combination approach yielded moreaccurate predictions on only three of the nine categories. However,as the forecast horizon lengthened, the combination method gained

.

Page 25: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

214 Econometric Methods

relative to the time series approach until it was superior on sevenof nine categories for both the four- and the five-year horizons.

A Priori Analyses of the Data

A priori analysis can be carried out on the data as well as the model.(These analyses were listed in level C in Exhibit 8-2.) This sectionexamines the value of a priori adjustments for biases expected in thedata.

The procedure for cleaning the data is similar to that in Chapter 7(p. 161-162). Efforts should be made to deal with outliers, for example,and this should be done before using these data to estimate values forthe forecasting model.

The data for a given conceptual variable may incorporate variationdue to other variables. For example, income estimates for a countrymay be unduly influenced by exchange rates or by the black marketand thus may provide poor estimates of "ability to purchase." If yoususpect such biases in the data, you can consider two strategies tocompensate for them. One is to find alternative indicators for the sameconceptual variable, and then to average these in the hope of washingout the variation due to irrelevant factors (e.g., different estimates ofability to buy should be obtained).

The second strategy is to adjust the observations. Whether such anadjustment is useful is speculative. I tested this approach in my studyof the photographic market and was surprised to find that it had nopredictive validity.

In Armstrong (1968a), the value of adjusting observations wasexamined. This was an ideal situation for such a test becausethere were substantial differences among countries; also, articlesabout certain national camera markets claimed that unusual fac-tors, such as high tourist sales, were present. I made an intensiveexamination of 31 countries, and predicted the residual error be-fore the econometric model was used to fit the data in the cali-1bration sample. For 17 countries, no difference was expected from:the model; however, for the remaining 14 countries, the predic-tions of direction were correct on only 5 countries. This was a'disappointing result in view of the strong prior knowledge thatwas claimed by the various sourcesas well as by my own con-fidence in this strategy!

Page 26: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Objective Data 215

Because measurement error is so common, it is important to considerits effects. Errors can occur in either the dependent variable or thecausal variables.

The presence of measurement error in the dependent variable leadsto uncertainty in the estimates of the relationships. This uncertaintywill be reflected in the confidence intervals estimated by regressionanalysis. One solution is to gain additional information on the depen-dent variable by finding additional observations.

When the size of the measurement error varies over the range ofthe independent variables (heteroscedasticity), it may be possible toobtain a more efficient estimate by the use of a transformation. Thisis a risky strategy, however, for the transformation changes the state-ment about the type of relationship; the relationship should be decidedon the basis of the a priori analysis, not on the basis of what willprovide the most reliable estimate.

Measurement error in the causal variables will increase uncertaintyand lead to bias in the estimates of the coefficients. This is illustratedin Exhibit 8-6. Case A, with no error, is obvious. In case B, the 3'srepresent the observed value of X when the "true" value was 3.0. The4's are the observed values of X when the true value was 4.0, andsimilarly for the 5's. Case C follows the same pattern but with moreerror. The line in each case is chosen to minimize the sum of the squareof the errors in "fitting" or "explaining" Y (i.e., to minimize the squareof the vertical distances from the line to the observations). Note thebasic conclusion from Exhibit 8-6: as the error increases, the estimatedrelationship is biased toward zero, while the constant term is biasedaway from zero.

Adjustments can be made to compensate for certain types of errorsin the causal variables. In general, when a causal variable containsinformation of poor quality (high error), less emphasis should be placedupon that information. To place less emphasis upon a relationshipimplies that its absolute value should be reduced. This is what theregression model did in Exhibit 8-6. If one assumes that the measure-ment error is random, the extent of the bias toward zero can be esti-mated from:

b = (1 + E);

where 6 is the regression estimate of the relationship, b is the truerelationship, and E is the ratio of the error variance to the true vari-ance. If there were no errors (E = 0), then the regression estimatewould provide an unbiased estimate of the relationship. If the errorvariance were equal to the true variance, then E = 1 and b = (2)6;that is, the true coefficient would be twice as large as the estimated

Page 27: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Exhibit 8-6 BIAS DUE TO RANDOM ERROR

(Circled numbers represent true values of X)Case A: No Measurement Error in X Case B: Modest Measurement Error in X

3Y

2 Actual relationship:Y = 0.0 + 1.0X

i I

2 3 4 5 6 7

Y

5

4

3

2

1 E

Case C: Large Measurement Error in X

CPCD,®

54

3 '1.175 (3 6Y2

1

(1I II 'III

0 1 2 3 4 5 6 7

Note: The limiting solution, with infinitemeasurement error, would be thehorizontal line Y = 4.0, which isthe average value for Y in this case.

Estimated relationship:Y = 3.2 + 0.2X

Estimated relationship:Y = 2.0 + 0.5X

I I I I I

00 1 2 3 4 5 6 7

coefficient. (For a discussion on how this formula was obtained, see atextbook on econometrics. I used Johnston, 1963, p. 150.)

The difficulties with the preceding formula for obtaining the truerelationship from the estimated relationship are twofold. First, theassumption was made that there was no systematic error component;and, second, the estimate of the E factor must generally be estimatedjudgmentally.

The preceding discussion focused on the effect that error has uponthe estimate of a given causal relationship. The effect upon the constant

216 Econometric Methods

X

Page 28: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Analyzing the Data 217

term was also noted. There is still another effect, however: errors in agiven causal variable, say X1, will generally affect the estimates ofother causal relationships, say the relationship between X2 and Y.

Adjustments for bias caused by measurement error are importantwhen the measurement error is large relative to the true variation andwhen some precision is required in the estimate of the magnitude. Thissituation is expected only when large changes occur, as in long-rangeforecasting. I made a test of this hypothesis in my study of the inter-national camera market (Armstrong, 1968a). Although the measure-ment errors were large, virtually no improvement in the accuracy oflong-range backcasts of camera sales occurred when adjustments weremade for bias in the estimates of the coefficients. There is little evi-dence, then, to support adjusting for biasand little evidence to refuteit. For short-term forecasting, adjusting for bias is expected to be oflittle value.

INTERMISSION: Take a long break. You owe it to yourself!

ANALYZING THE DATA

A priori analysis provides one way to obtain coefficients for the causalrelationships. This section considers two methods to estimate theserelationships from objective data: experience tables and regressionmethods. More attention is given to regression methods; they offer morepromise and are more convenient than experience tables. The problemswith using regression analysis are reviewed, and suggestions are madefor overcoming these difficulties.

In spite of the promise of regression analysis, simplicity still hasgreat merit. Evidence is provided to compare simple and complex meth-ods.

Experience Tables

The origins of experience tables can be traced to Ben Franklin's "moralor prudential algebra" (Dawes and Corrigan, 1974). Experience tablesinvolve a scoring procedure, such as:

+1 for a favorable rating on a factor0 for an indeterminate rating1 for an unfavorable rating

Page 29: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

218 Econometric Methods

The score for an individual is simply the total across all factors. Insome situations, the scoring may allow for only two categories, favor-able and unfavorable, in which case + 1 and 0 can be used.

The direction of the relationship is generally specified a priori. Butat what point does a score become favorable or unfavorable? In general,the researcher experiments with the data and tries different levels todefine a favorable rating. It seems preferable, however, to make thesedecisions a priori if possible . . . and it is nearly always possible.

Finally, the experience table is constructed by examining the be-havior associated with each score. For example, what behavior is as-sociated with a score of + 12? With one of 6? It helps at this pointto have ample objective data.

Forecasts are made by calculating the score for an individual, andthen using the average behavior for people with that score.

Experience tables have been used in many fields. For example, theBurgess method in sociology, developed in the 1920s by Ernest W.Burgess, involved the examination of a large number of predictivefactors (Burgess, 1939). Each factor was classified as being "favorable"or "unfavorable." A score was obtained for each observation by addingthe number of favorable factors. For example, in predicting the prob-able success of paroling an eligible individual from prison, if 18 of 25factors were favorable the prisoner's score would be 18. An examinationof the data was then made to determine the percentage of successfulparolees among those who scored 18. Experience tables were developedto indicate the success rate on parole for each score.

Ernest Burgess used a large number of causal variables. He did notdifferentiate between variables; all were considered to be of equal im-portance. Also, Burgess did not give a heavier weight to a more positiverating on a given factor; both were simply called "favorable." To correctsome of these shortcomings, Glueck and Glueck (1959) suggested thatonly a small number of variables should be used and that variableweightings be obtained from data for each of the causal variables. Thus,if age were related to success on parole, middle-aged people would scorehigher than young people, and the geriatric set would score higherthan middle-aged people.

Although the improvements proposed by the Gluecks seem reason-able, a review by Gough (1962) was pessimistic. He concluded that theGlueck proposals would not yield substantial improvements over Bur-gess (thus providing more evidence on the importance of being Ernest).Similar conclusions have been reached in personnel psychology by Quinnand Mangione (1973), although Stuckert offers mild support for theGlueck index:

Page 30: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Analyzing the Data 219

Stuckert (1958) predicted the grade-point averages for about 500freshmen at Ohio State for both 1949 and 1950. A Glueck indexyielded slightly more accurate predictions than did a Burgessindex.

Problems with experience tables persist even after making the ad-justments proposed by the Gluecks. For instance, variables are ex-amined one at a time; this can lead to serious errors when the datasuffer from multicollinearity and interaction. An additional problemis the number of subjective inputs necessary. These problems can beovercome to a great extent by using regression analysis.

Regression Analysis

Before plunging ahead with regression analysis, it is important todistinguish between the measurement model and the forecastingmodele'. Previously the discussion related to obtaining estimates foruse in the forecasting model. To obtain these estimates, one creates ameasurement model to analyze the data. Regression analysis yields ameasurement model, and it is seldom that the measurement modelcorresponds exactly to the forecasting model. In retrospect this state-ment seems obvious. But it isn't! The confusion between the measure-ment model and the forecasting model has led to serious errors inforecasting. This has been especially true whenever regression analysiswas used. Although they frequently look the same, the measurementmodel differs from the forecasting model in three ways:

1. The measurement model contains "nuisance variables." These vari-ables explain differences among observations in the data, but theyare of no interest in the forecasting model. This situation occursbecause, as a general rule, the analyst should include all variablesthat are important in explaining variation. An example, drawn frommy study of the photographic market, is presented in the followingparagraph. The one exception to this rule is that, if the nuisancevariables are not correlated with the variables in the measurementmodel, they can be ignored altogether. (That is, they will not affectthe regression weights.) This may be a reasonable assumption forexperimental data, but it is a risky one for historical data:

Page 31: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

220 Econometric Methods

In Armstrong (1968a), data from an international cross sectionwere used to obtain estimates of income and price elasticity. Themeasurement model included measures of the climate of eachcountry. These variables were related to differences in camerasales among countries, but they were irrelevant in forecastingchange because the climate in each country was not expected tochange substantially over the forecast horizon, nor could suchchanges be forecast.

The estimates from a measurement model should be combined withestimates from the a priori analysis and with estimates from othermeasurement models.The estimates from the measurement model must be adjusted torepresent the conditions faced during the forecast horizon. The mea-surement model may use data that are not representative of theforecast situation.

The first point is considered in this section, and the second and thirdpoints are discussed later in the chapter.

I am not going to discuss the details of regression analysis. Othershave done so admirably:

Good descriptions of regression analysis are provided in Blalock(1979) and Johnston (1984). Mathematical wizards enjoy Malin-vaud (1980). Advice that is oriented toward practical applicationsmay be found in Cohen and Cohen (1975), Draper and Smith(1981), and PEDHAZUR [1982]. LEVENBACH and CLEARY[1984], and MAKRIDAKIS, WHEELWRIGHT, and McGEE [1983]provide a detailed treatment with emphasis on forecasting.

Discriminant analysis' is a special form of regression analysis. Itis used when the dependent variable refers only to categorical data.Typically, only two states are of interest (e.g., purchase or no purchase,innocent or guilty, pass or fail), although more categories can be han-dled. Because this version offers little more than convenience overregression analysis, no specific discussion is provided here. In general,the opportunities and problems correspond to those of regression anal-ysis (numerous sources describe discriminant analysise.g., Greenand Tull, 1978, PEDHAZUR [1982].

Page 32: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Analyzing the Data 221

Many of you already know about regression analysis. (If you do not,please bear with me, and pay attention to the glossary.)

Like experience tables, regression analysis is inexpensive. Butregression analysis has three major advantages over experience tables:

It uses the data in a more objective way; that is, the biases of theresearcher become less important.It estimates partial effects of each causal variable rather than grosseffects.It uses the data more efficiently.

In short, regression analysis has clear-cut advantages over experiencetables. Researchers in nearly every field have switched from experiencetables to regression tables.

Regression analysis does have its problems. Fortunately, much studyhas been done on these problems, and remedies have been proposed.In the following subsections, measurement error, interaction, multi-collinearity, autocorrelation, and simultaneous causality are discussed.

Measurement Error. For simple measurement models, the effect ofmeasurement error upon the estimate of the causal relationship ap-pears to be modest. Denton and Kuiper (1965), Denton and Oksanen(1972), and McDonald (1975) examined the impact of errors on esti-mates obtained from economic data. In each study, little change wasnoted when the preliminary data were replaced by more accurate data.

When measurement error is serious, it is important to keep themeasurement model simple. This can be done as follows:

Using a small number of causal variables (in cases with extrememeasurement error, the extrapolation method would be preferredas it uses no causal variables).Using a simple functional form. Addition is preferable to multipli-cation, and multiplication is preferable to raising to powers greaterthan 1.0. Complex forms tend to magnify errors.Using a simple set of equations. Avoid simultaneous equations.Minimizing the sum of the absolute errors rather than the sum ofthe squared errors (Wiginton, 1972).

Particular attention should be given to obtaining accurate measuresor the dependent variable:

Page 33: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

222 Econometric MethodsgrRirr^-

KEREN and NEWMAN 119781, using simulation, found that er-rors in the dependent variable are more damaging than errors inthe causal variable, which is not surprising.

Interaction. Interaction means that the relationship between twovariables, say Y and X1, is dependent upon the value of a third variable,say X2. For example, it has been found that the relationship betweeneducation and income varies according to race. Another example isthat certain chemicals do not react when placed together unless acatalyst is present.

Failure to account properly for interaction may lead to problems inforecasting. For example, a particular treatment for a given ailmentmay cure an adult but prove fatal to an infant. Efforts to rehabilitatea criminal may lead to different results depending upon whether theeffort is initiated by the criminal, a peer group, or a judge.

Two strategies have been used in dealing with interaction. The firsthas been to find a functional form that captures the effects of interaction(e.g., the multiplicative form can represent certain types of interaction).The other approach has been to use segmentation to account for in-teraction, and then to use econometric methods within segments. Thislatter strategy is discussed in Chapter 10.

Mu Iticoll inearity. Multiple regression analysis offers advantages overexperience tables especially in cases with multicollinearity. Still, highmulticollinearity makes it difficult to obtain estimates (Blalock, 1963).

Multicollinearity can be dealt with in a number of ways. Probablythe worst piece of advice is to drop one of the correlated variables. Thiswill reduce the standard error at the cost of introducing bias into theestimate of the coefficient. The regression analysis should include allimportant variables. The idea of dropping one of the important vari-ables to solve the problem of multicollinearity reminds me of the state-ment by a U.S. Army colonel that a Vietnamese village had to bedestroyed in order to save it.

Another approach to multicollinearity has been to use factor anal-ysis to create a small number of independent causal factors from theoriginal set. This approach is the technician's delight, but the theo-retician's nightmare. I side with the theoreticians. The a priori analysisshould indicate which variables to use. If more than one measure isemployed to represent a concept, the analyst can simply create an indexusing subjective weights (e.g., take an arithmetic average). Factoranalysis has disadvantages because it is more difficult to understand,

Page 34: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Analyzing the Data 223

and because it may yield factors that make little sense. Tom Swiftlearned this the hard way:

Armstrong (1967) reported on a study by Tom Swift. Tom, itseems, was given data on some blocks of metal. He had no priorknowledge about these blocks, but he did know how to use factoranalysis and data had been collected for 11 variables. Tom in-terpreted three key factors: (1) compactness, (2) intensity, and (3)shortness. In fact, the data were explained exactly by five factors:(1) length, (2) width, (3) height, (4) density, and (5) cost per pound.Hopefully, Tom's study does not indicate the need for furtherresearch of this type.

Studies from the real world did not do much better than Tom Swift.To date, the evidence suggests that factor analysis is not helpful inforecasting. I have found eight studies:

Ostlund (1973) reduced 18 causal variables to seven factors andused these factors to predict the adoption of six different consumerproducts. This factor analysis resulted in a loss of predictive power.

Lack of success for factor analysis was also reported by Rock etal. (1970) with simulated data; Mayfield (1972), GOMEZ-MEJIA,PAGE, and TORNOW [1982], and MITCHELL and KLIMOSKI[1982] in personnel psychology; ALUMBAUGH et al. [1978] inparole prediction; and by Massy (1965), and MOORE [1982] inmarketing.

The only good solutions to multicollinearity involve the use of in-formation from other sources. A priori estimates provide one such source.Exhibit 8-5 offers ideas about other types of data. Estimates from theseother sources can then be used to remove one of the variables from themeasurement model. For example, assume that you are predicting thesales of caribou chips (Y) from knowledge of income (I) and the cost toconsumers (C) for caribou chips and you are using household surveydata:

Y = a + b11 + b2C

Page 35: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

224 Econometric Methods

If I and C are highly correlated, it is difficult to estimate bi and b2.But b2 might be estimated separately by analyzing a field experimentwhere the price of caribou chips is varied by region. This estimate ofb2 can then be used to remove the price effect from the household surveydata:

Y' Y b2C

The income coefficient, b1, is then estimated from

Y' a +

Autocorrelation. For time series data, it is common for errors to becorrelated with one another over time. When this occurs, the analystis likely to falsely assume greater certainty in the estimates of therelationships. Autocorrelation may be detected by the Durbin-WatsonstatisticG (e.g., see Johnston, 1963, p. 192). This statistic picks upautocorrelation between adjacent time periods. The runs testG (Siegel,1956, p. 52) is another way to spot autocorrelation.

One solution to autocorrelation is to formulate the model in termsof first differences; that is, the period-to-period changes in the depen-dent variable are related to the period-to-period changes in the causalvariables. The resulting estimates of certainty are improved. But thecure is worse than the problem as far as long-range forecasting isconcerned; the first-differences model picks up short-range effects in-stead of long-range effects.

Another solution is to use longer time periods (e.g., yearly ratherthan quarterly data). This is especially useful for long-range forecast-ing.

In general, however, the estimates themselves are not affected muchby autocorrelation, and I have been unable to find much evidence thatforecast accuracy is impaired. (Although GRANGER and NEWBOLD11977, pp. 202-2141, and DIELMAN119851 provide some disconfirmingevidence to this). So it is seldom worth bothering to make any correc-tion. A great deal of fuss is created over autocorrelation, not becauseit interferes with the development of a forecasting model, but becauseit messes up the statistician's tests of significance. As discussed in PartIII of this book, statistical testing will cause few problems because thesame data do not have to be used for both estimating relationships andtesting the statistical significance of the forecasts.

Simultaneous Causality. Although simultaneous causality mightcreate difficulties in estimating causal relationships, it is not clear that

=

=

Page 36: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Analyzing the Data 225

these cause serious problems in forecasting. Certainly there has beenno lack of research on this topic. My conclusion from this research isthat attempts to model simultaneous causality have not improved ac-curacy.

The solutions for simultaneous causality increase the complexity ofthe model. This, in turn, increases the problems due to measurementerror, interferes with implementation, and raises the likelihood of mis-takes.

The simultaneous prediction of a set of dependent variables, com-monly called canonical correlation, is not recommended. It can interferewith causal thinking, add complexity, and, as found in PRISTO 119791and FRALICX and RAJUI1982 I, it harms predictive ability. So I refuseto provide references on how to use canonical correlation.

Summarizing. The problems involved with using regression analysisare summarized in Exhibit 8-7 along with ideas on what to do andwhat not to do.

Value of Simplicity

There are two important rules in the use of econometric methods, (1)keep it simple and (2) don't make mistakes. If you obey rule 1, rule 2becomes easier to follow.

Exhibit 8-7 PROBLEMS AND REMEDIES IN REGRESSION ANALYSIS

Measurement error Ignore

Interaction

Multicollinearity Omit a variableUse factor

analysis

Autocorrelation Get excited

Use a simple measurement modelAdjust coefficients for bias

Use the appropriate functionalform

Segment data first

Use other sources of dataForm an index from multiple

measures

Obtain more dataUse longer time periods

Simultaneouscausality Get excited Nothing

Problem Don't Do

Page 37: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

226 Econometric Methods

The advice to avoid mistakes would be unimportant if few mistakeswere made in applied work. I suspect errors are more common thanwe like to think. In addition to witnessing some of my own mistakes,I have, on occasion, seen errors by others.

The advice to keep things simple is not self-evident. To representreality better, it would help to use a more complex model. Furthermore,it seems reasonable to expect a better representation of reality to yieldmore accurate forecasts. Doesn't it?

Actually, increases in complexity may lead to less accurate forecasts.Carl Harrington and I examined a model that seemed to be too complex.The model, from Ueno and Tsurumi (1969), had an error twice as largeas those obtained from eyeball extrapolations in forecasting U.S. au-tomobile sales from 1966 through 1969. (A description of this study isprovided in Appendix H.)

Econometricians, however, believe that increased complexity yieldsimproved accuracy. This also seems to apply to psychometricians [CLIFF,19831. This seems evident from the movement toward more complexity.As noted by Leser (1968), the long-term trends have been toward theuse of more variables, more equations, more complex functional forms,and more complex interactions among the variables in econometricmodels. This increase in complexity can be observed by examiningvarious issues of Econometrica since 1933 or by comparing econometrictextbooks published over the last half century.

To gain further information on whether econometricians believe ina positive relationship between complexity and accuracy, I surveyed agroup of leading econometricians. There were 21 respondents from thisconvenience sample. They were all in economics, most were academics,some were consultants, and most were well known. They were askedthe following question by a mail survey: "Do complex methods gen-erally provide more accurate or less accurate forecasts than can beobtained from less complex econometric methods for forecasting in thesocial sciencesor is there no difference in accuracy?" They were pro-vided with the following definition of complexity: "Complexity" is tobe thought of as an index reflecting the methods used to develop theforecasting model:

The use of coefficients other than 0 or 1The number of variables (more variables being more complex)The functional relationship (additive being less complex than mul-tiplicative; nonlinear more complex than linear)The number of equationsWhether the equations involve simultaneity

Page 38: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Analyzing the Data 227

Definitions were also provided for the other key terms (see AppendixF).

There was substantial agreement on the value of complexity; 72%of the experts agreed that it was helpful and only 9% disagreed, asshown in Exhibit 8-8. Moreover, the experts were confident in theirratings on the value of complexity. The average confidence level was4.0, where 1 equals no confidence, 5 equals extremely confident, andthe respondents were asked to report a low confidence level if they didnot understand the question. The experts were also given the oppor-tunity to express any qualifications they considered important; 88%expressed qualifications, but no single factor was mentioned by morethan one respondent.

Exhibit 8-8 EXPERTS' ATTITUDES ON MODELCOMPLEXITY AND FORECASTINGACCURACY (n = 21)

Complex Methods Rated Percentage

Significantly more accurate 5

Somewhat more accurate 67No difference (or undecided) 19Somewhat less accurate 9Significantly less accurate

An examination was made of the published empirical evidence re-lated to the issue of complexity versus accuracy. The following studiesprovided indirect evidence on the value of complexity in forecasting:

Fox (1956) found small differences between regression coefficientswhen comparing single-equation and simultaneous equationmethods; this would imply little difference in the forecast accu-racy.

Friend and Taubman (1964) claimed that their simple model wassuperior to more complex models (unfortunately they did not in-clude the data from their study; furthermore, the study examinedonly ex pose predictive validity).

Page 39: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

228 Econometric Methods

Guilford (1965, P. 423), using data from a problem in education,obtained coefficients of .224 and .491 with a two-variable model.Virtually identical explanatory power was obtained when theweights were rounded to .2 and .5, then to 1 and 2, and finally to1 and 1.

Howrey (1969) found no advantage for a more complex model inex post forecasts in a transportation problem.

I reanalyzed data from the study by Jorgenson et al. (1970) andfound a perfect negative correlation between the stability of theregression coefficients from one time period to the next and thecomplexity of each of their four models (as judged by the numberof variables in the model). This lack of stability for more complexmethods suggests a loss in predictive validity.

Fair (1971) found little difference between his simple model andthe more complex Wharton model in a test of ex post predictivevalidity.

Elliott's (1973) study favored simplicity in national economic fore-casting.

McLaughlin (1973) examined the accuracy of forecasts from 12econometric services in the United States. These forecasts weremade by models that differed substantially in complexity (al-though all were complex). No reliable differences in accuracy werefound among these models, as the rankings of accuracy for themodels in 1971 were negatively correlated ( .3 Spearman rankcorrelation') with those for 1972. (If there are no reliable dif-ferences, then no differences will be found between accuracy andcomplexity.) Similar evidence was provided by PENCAVEL [1971],and Fromm and Klein (1973).

Page 40: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Analyzing the Data 229

HATJOULLIS and WOOD [1979] examined the relative accuracyof short-range forecasts by five leading British econometric modelsfrom 1974 to 1977. (Presumably they differ with respect to com-plexity.) I reanalyzed these data by calculating the Spearmanrank order correlations of the error in one forecast period versusthe error in the next period. A positive correlation means that amodel that is more accurate for one period will be more accuratethe next period. The results, shown here, indicate that the model'srelative accuracy in the period was not a good guide to its accuracyin the next period. Disappointing!

Rank Correlations for Accuracy of Five British Forecasting Models*

*To read the column headings, 75.1 means "first half of 1975".**Based on only three models.

More direct evidence on the value of complexity was sought by usingonly studies with ex ante forecasts. The survey was restricted to stud-ies that had been done in a competent manner. The results of thisliterature survey are summarized in Exhibit 8-9.

To determine whether the coding of the studies in Exhibit 8-9 wasreliable, 8 of the 11 studies (all but McNees, Grant and Bray, andJohnston and McNeal) were independently coded by two research as-

Variable

_751vs.

75.11

75.11vs.76.1

76.1vs.

76.11

76.11vs.77.1

77.1vs.

77.11 Average

Gross domestic product - .6 +.1 +1.0 - .1 - .7 - .06

Consumer expenditure - .1 -.4 - .8 - .8 + .4 -.34

Gross fixed domesticinvestment

+ .7 + .6 + .4 + .1 - .1 + .34

Exports + .1 +.1 - .1 +.4 -.3 +.04

Imports + .5 + .2 - .2 + .2 - .8 - .02

Consumer expenditureprice deflator

+ .3 - .1 * .9 - .2 + .2 + .22

Unemployment** -1.0 - .5 - .5 -.5 +.5 - .40

Average .01 .00 + .12 - .15 - .13 - .03

-

Page 41: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Exhibit 8-9 ACCURACY OF SIMPLE VS. COMPLEX METHODS

RelativeAccuracy of Test ofComplex Source of Forecast Criterion for Nature of StatisticalMethods Evidence Situation Accuracy Comparison Significance

Significantlymore accurate(p < .05)

More accurate Stuckert (1958) Academic Percent correct Unit Noneperformance weights vs.

regressionMcNees (1974) GNP Theil Small vs. None

coefficient; largeRMSE, mean modelsabsolute error

Grant and Personnel Correlation Unit ArmstrongBray (1970) coefficient weights vs.

regressionJohnston and Medicine Correlation Unit AuthorsMcNeal (1964) coefficient weights vs.

regression

No difference

Less accurate Dawes and Academic Correlation Unit ArmstrongCorrigan performance, coefficient weights vs.(1974) simulated data, regression

psychiatricratings

Lawshe and Academic Percent correct Unit NoneSchucker performance weights vs.(1959) regressionReiss (1951) Criminology Percent correct Few vs. None

manycausalvariables

Wesman and Academic Correlation Unit NoneBennett (1959) performance coefficient weights vs.

regressionScott and Personnel Percent Unit NoneJohnson (1967) selection correct, weights vs.

correlation regressioncoefficient

Significantly Claudy (1972) Simulated data Correlation Unit Armstrongless accurate (typical of coefficient weights vs.(p < .05) psychological regression

data)Summers and Political Correlation Linear vs. ArmstrongStewart (1968) judgments coefficients nonlinear

models

230

Page 42: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Analyzing the Data 231

sistants who were unaware of the hypothesis. Small discrepancies werenoted on two of these 8 studies.

The studies in Exhibit 8-9 suggest that complexity and accuracy arenot closely related. No study reported a significant positive relationshipbetween complexity and accuracy. Overall, seven comparisons favoredless complexity, and four favored more complexity. The studies thatassessed predictive validity directly were in agreement with the elevenstudies that provided indirect evidenceadded complexity did not yieldimprovements in accuracy. My study was published along with com-mentary by seven econometricians. The commentary did not produceadditional empirical evidence.

The implication for the analyst is clear cut: use simple methods! Inaddition to being as accurate, they are less expensive, less prone toerrors, and easier to implement. FILDES and HOWELL119791reacheda similar conclusion in their literature review.

Of particular importance is the issue of estimating causal relation-ships by regression or unit weights. Einhorn and Hogarth (1975) ex-amined the evidence on this issue and suggested that regression weightsare preferable to unit weights in the following cases:

Large sample size (d > 200) and a dependent variable that is accu-rately measured.Moderate sample size (50 < d < 200), a dependent variable that isvery accurately measured, and a strong causal relationship (1:12 >0.5).

Schmidt (1971), working with psychological data only, suggestedregression weights when:

Sample sizes are larger than 25.There are a small number of causal variables.

Additional evidence, with similar conclusions, has been reportedby DORANS and DRASGOW 119781, KEREN and NEWMAN119781, and PARSONS and HULIN P19821. PARKER and SRI-NIVASAN [1976] found contradictory evidence.

Although regression provides only modest benefits over equal weightsand then only in certain situations, it is more convenient to use, es-pecially when the estimate of the constant term is important (constant

Page 43: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

232 Econometric Methods

terms are difficult to estimate a priori) or when scaling factors areimportant.

Perhaps the most useful implication from the research on unit weightsvs. regression weights is that, in general, forecast accuracy is not verysensitive to the estimates of magnitudes. Once the sign is specified, arough estimate of magnitude is usually sufficient. In practical terms,the following advice is offered:

If the sign of the coefficient in a regression conflicts with the a priorisign, drop the variable.When uncertainty is high, combine unit weights and regressionweights.For most situations, pick the most convenient method (and thattypically means regression analysis).

In general, do not invest a lot of time and energy in learning about allthe details of regression analysis. The more sophisticated approacheshave not held up well in forecasting e.g., MORRIS, 19821.

The key factors for selecting between unit weights and regressionare summarized in Exhibit 8-10. The assumption is made that the costof using unit weights is about the same as that for regression weights.

UPDATING THE ESTIMATES

A priori estimates can be improved by incorporating estimates fromthe objective data. This updating process is especially important as aguard against folklore. For example, teachers assign strong causalrelationships between their actions and students' learning, and theyfeel confident about these relationships. An updating process that drawsupon objective data will bring the model closer to reality. In this ex-ample, it would reduce the importance of the teacher variable.

The weighting of new information has received much attention.Durbin (1953) presented a classical approach whereby each estimateis weighted inversely by its variance.' A crucial assumption behindthis procedure is that the estimates are unbiased. Such an assumptionis seldom realistic, especially when historical rather than experimentaldata are used. Furthermore, it is no easy matter to estimate the var-iance of a priori estimates.

Linguistics and statistics combined to yield a solution that was so-cially acceptable to econometricians: Bayesian analysis.' This pro-vides formal procedures for updating a priori estimates. The Bayesian

Page 44: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Updating the Estimates 233

Exhibit 8-10 UNIT WEIGHTS VS. REGRESSION WEIGHTS

Start

Accuratemeasure ofdependentvariable

approach is described in numerous sources (e.g., Pratt, Raiffa, andSchlaifer, 1965).

The Bayesian philosophy is appealing, but the implementation isdifficult. I was unable to find a Bayesian regression program that worked,though I gave up my search because the details are not important forimproving forecast accuracy. That is, forecast accuracy seldom is sen-sitive to the precise estimate of the relationship. On the other hand,there seems to be no danger in using Bayesian analysis as long as itis used correctly.

A simple way to implement the Bayesian philosophy is to use con-

Yes Regressionweights

Yes

Yes

Unitweights

ombine estimateswhen possible

Page 45: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

234 Econometric Methods

ditional regression analysis,' as described by Wold and Jureen (1953).This "poor man's approach to Bayesian regression analysis" involvesthe following steps:

Develop some simple rules to combine the a priori and regressionestimates (e.g., use equal weighting).Obtain a combined estimate of one of the relationships, and removethe effect of this relationship from the equation. Reestimate coef-ficients for the remaining variables.Repeat step 2 until all relationships have been estimated.Omit any combined estimate that violates the a priori sign, thenreestimate.

Obviously, this procedure involves subjectivity. But it is easy to un-derstand, and it should produce reasonable results. Examples of thisprocedure are presented in Armstrong (1970a) and in Tessier and Arm-strong (1977).

A simple and reasonable approach is to make the forecasting modelthe focus of attention, rather than the measurement model. An a prioriweighting scheme can be developed. When additional estimates areobtained, they are simply listed in a table, and then a weighting schemeis used to obtain a combined estimate.

The use of combined estimates is expected to be most useful when:

large changes are expected,a substantial number of historical data are available,a variety of types of data are available,there are a small number of important relationships, anduncertainty is great.

No empirical studies were found on the value of updating, givenconditions 1 to 5. I assessed updating under conditions 1, 2, 3, and 4in my study of the photographic market; the results, shown in thefollowing, provided no support for the value of updating. Thus, myadvice remains speculative.

In Armstrong (1968a, b), unconditional backcasts were made for1954 camera sales in 17 countries, using data from no earlierthan 1960. An a priori version produced forecasts with a MAPE

Page 46: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Estimating Current Status 235

of 30%. When the a priori model was updated, using results froman international crossc7section, from .longitudinal data acrosscountries, and from household survey data, the MAPE was re-duced to 23%, an improvement that was not statistically signif-icant. Furthermore, the gain was due almost entirely to an im-proved estimate of the initial (1962) sales. Unfortunately, however,uncertainty was not great in this test because the updated causalestimates were almost identical to the a priori estimates: theincome elasticity was updated from 1.3 to 1.34, and the priceelasticity from 1.4 to 1.80.

ESTIMATING CURRENT STATUS

It is frequently useful to decompose a problem to consider current statusand change separately (as noted in Chapter 4). Econometric modelscan be used for each part of the problem. This section of the chapterexamines the value of econometric models for estimating current sta-tus.

Econometric models are expected to be most useful for estimatingcurrent status in situations where:

uncertainty about current status is great, andgood information exists on the causal variables.

These situations occur frequently in politics (e.g., what is the militarystrength of Red China?), in foreign trade (e.g., what is the size of thephotographic market in Eastern Europe?), and in marketing. The lastarea, in particular, has been the subject of much study: estimatingmarket size was studied in the 1930s by Brown (1937a, b), Cowan(1936, 1937), Weld (1939), and Wellman (1939). Little study has beendone since 1940. (As an aside, I tried to get the U.S. Department ofCommerce interested in using this approach to estimate market sizebut the idea was turned down by the director.

Do econometric models provide improved estimates of current sta-tus? Reference was made in the preceding section of this chapter toArmstrong (1968a, b), where an econometric model led to improvedestimates of the current status. This, in turn, led to improvements inlong-range backcasts. The following studies provided additional evi-dence that econometric models yield better estimates of current status:

Page 47: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

236 Econometric Methods

In Armstrong (1970a), an econometric model was used to predictthe level of camera sales. The model was developed from an anal-ysis of 17 countries. The predictions were made for an 11-countryvalidation sample, and an R2 of 88% was found. The assumptionthat all countries look like the average country in the analysissample yields R2 0. A more meaningful measure is providedby the MAPE because it also accounts for systematic errors: forthe econometric model the MAPE was 31%; for the alternativemodel (assuming each country was the same as the average forthe validation sample) the MAPE was 117%.

-Tessier and Armstrong (1977) developed an econometric model toestimate the size of the lodging industry. A combination of theeconometric predictions and the preliminary sales data led toimproved estimates of lodging sales. A simple average of the pre-liminary sales data and the econometric estimates reduced theerror in predicting "final" sales from 10.5% to 5.7%. The combinedestimate also led to improvements in two forecasting tests.

Many forecasters do not bother to decompose a problem into currentstatus and change. In such cases, errors in the estimation of currentstatus will cause systematic errors in the forecasts. This problem iscommon in short-range economic forecasts. Some remedies are:

use the most recent errors to adjust the upcoming forecasts, oruse judgment to adjust the upcoming forecasts.

The evidence presented suggests that the econometric estimate yieldsimproved forecasts when combined with the most recent actual data.An adjustment for the error in an econometric time series model pro-vides an alternative way of accomplishing the same thing. The ad-justment is generally done mechanically as follows:

Yr+h - a + h + b2X2,,,h + ke,

The e, represents the error in predicting current status for Y (actualminus forecast). The k represents a weighting factor between 0 and 1;it would be 0 if e, was expected to be unrelated to et ,h, and would be

=

Page 48: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Estimating Current Status 237

close to 1 if e, was closely related to For example, if the forecastwas high by 8% in year t, and k = 0.5 for a 1-year forecast, next year'sforecast would be decreased by 4%. In general, k will be higher forshort-range than for long-range forecasting, because in the short rangethe error in estimating current status is large relative to the error inforecasting change. (The need for a correction factor is reduced if dif-ferencing is used or if the predictors include a lagged dependent vari-able.) The value of mechanical adjustments has shown up in a numberof studies:

Juster (1972) showed how errors in a short-term macroeconomicforecasting model were reduced when the errors from the latesttwo time periods were used in making adjustments. Evans, Hai-tovsky, and Treyz (1972) and PFAFF [1977] reached a similarconclusion in their analysis of short-term macroeconomic fore-casts. Mechanical adjustments reduced the error in one- and two-quarter forecasts by half, but there was little gain for long-rangeforecasts. Howrey (1969) obtained improvements from mechanicaladjustments in transportation forecasts as did PFAFF [1977] inquarterly forecasts of the money stock. AHLBURG [1984] showedhow Theil's decomposition can be used for such an adjustment.

Evidence presented in Chapter 6 (on judgmental methods) suggestedthat expert judgment is useful for estimating current status. As aresult, judgmental adjustments should also help to improve the fore-cast; studies from economics suggest that they do (e.g., Evans, Hai-tovsky, and Treyz, 1972; McNees, 1975, HIRSCH, GRIMM and NAR-ASIMHAM 119741). This provides an explanation for the apparentdiscrepancy between economics and psychology: Economists claim thatsubjective adjustments will improve forecasts of change, while the lit-erature in psychology indicates just the opposite. I am unaware ofevidence from either field that subjective adjustments can improveforecasts of change. Such adjustments are often useful, however, inestimating current status. Indeed, McNees (1975) found that subjectiveadjustments of macroeconomic forecasts were of more benefit in mea-suring current status than in measuring change. In summary, subjec-tive adjustments should be made for estimates of current status, butnot for change.

The particular way in which econometric models are used to estimatecurrent status may be largely a matter of style and convenience. In

e, h.

Page 49: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

238 Econometric Methods

my opinion, it is better to decompose the problem explicitly. This makesit easier to examine what is happening, and provides a more efficientway to incorporate evidence from different sources. Furthermore, as-suming that the model for estimating current status should be thesame as the model used to forecast change imposes an unnecessaryconstraint on the analysta constraint that results from the confusionbetween measurement models and forecasting models.

FORECASTING CHANGE

One reason for the distinction between measurement models and fore-casting models is that the coefficients in the measurement model shouldbe adjusted to represent the conditions expected over the forecast ho-rizon. It is possible, for example, that legislation may have been enactedthat will alter a relationship in the future; you can make a change inthe relationship to reflect this.

In general, the errors that may arise when econometric models areused to forecast change include the following types:

1. Errors in estimating the causal relationships that will hold overthe forecast horizon. These, in turn, may be due to:

Errors in estimating the current relationships, orChanges in the relationships over the forecast period.

2. Errors in forecasting changes in the causal variables over the fore-cast horizon.

Two strategies are examined for dealing with these problems.Mitigation' is a strategy that assumes the errors cannot be reduced.The second strategy is to try to reduce the errors.

Mitigation

Errors of types 1(a), 1(b), and 2 in the previous list cause forecasterrors.The solution is simple: place less emphasis on the causal explanationsand more on the trend due to unexplained factors. In other words, asthe errors in the model increase, the researcher should move from thecausal toward the naive end of the continuum.

A hypothetical example can be used to illustrate, in intuitive terms,why the strategy of mitigation is a good one. God has told you thatthe income elasticity for camera sales will be exactly + 1.3. Because

Page 50: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Forecasting Change 239

God is trustworthy, there is no uncertainty about the elasticity of 1.3.However, God did not tell you the extent to which income would changeover the next 10 years. To obtain forecasts of the change in income,you asked Ace Consultants. After receiving the income predictions,you learn that Ace Consultants are never truth tellers. The errors areso great that these consultants, in effect, provide random data (butthey do it with style, and they know all the latest dances). With somuch error in the forecasts of income, you should put no emphasis atall on Ace's "information." The same argument can be made if weassume the reverse situationthat is, you have perfect forecasts ofincome, but you have no idea about the income elasticity.

In the typical case there is some, but not complete, ignorance aboutchanges in the causal variables. Therefore, the impact of a causalvariable, such as income, should be reduced but not eliminated. Thiscan be accomplished either by reducing the magnitude ofthe forecastcoefficient, or by reducing the amount of change forecasted in the causalvariable. If it is assumed that

AY -- a + b(X)

where AY = the change in the dependent variablea = a constant, but unexplained, trendb = the causal relationship between X and YAX the change in the causal variable

then, if errors are expected in the model, mitigation can be carried outby reducing the magnitude of either b or X. (Ridge regression providesa sophisticated way to do this. See KEREN and NEWMAN, 1978.)

The strategy of reducing AX is useful when the quality of forecastsvaries across observations. Thus, in my camera study, different ad-justments might be made for controlled markets (where the price changemay be easy to forecast, at least for the controllers) and for free markets(where the price change may be difficult to forecast).

The degree of adjustment required in the model can be estimatedby either

or

1(AX)h = AX1 + E

=

E)

Page 51: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

240 Econometric Methods

where bh = the forecast coefficientE = the ratio of error variance to true variance

= the estimate of the true causal relationship(AX)h = the forecasted change in X

= the estimate of the true change in X

At the same time, a compensating change should be made in theconstant term. For example, if bh is used:

a = AY bh AX

The constant term a moves closer to the historical trend line, AY, asthe absolute value of b,, or AX is decreased.

The difficulty of these methods in compensating for errors in fore-casting the causal variables lies in estimating E. It is likely that Ewill have to be based on subjective estimates. Difficult though it maybe, you must find some estimate of E to complete the development ofthe model. To assume that the error is negligible is to assume a valuefor E of 0.0; this may lead to substantial errors in forecasting.

Mitigation proved to be of substantial importance in my study ofthe photographic market, but I am not aware of other studies on thevalue of this strategy.

In Armstrong (1968a), the forecast coefficients were adjusted tocompensate for errors. For example, E was estimated to be 1.0for the price forecast and 0.33 for income. These estimates yieldedforecast coefficients of 0.9 and 1.0 for price and income, re-spectively. (The causal elasticities for price and income had beenestimated to be 1.8 and + 1.3.) These crude adjustments forerror proved to be extremely valuable (compared to assumingE = 0.0) for improving the performance of the camera forecastingmodel. The adjustment for error led to a reduction in theMAPE from 43% to 23%. for the backcast of camera sales in 17countries in 1954, a result that was statistically significant (p <.05).

Reducing the Errors

One possibility for reducing errors is to obtain better estimates for therelationships over the forecast horizon. This strategy was discussedpreviously.

Page 52: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Forecasting Change 241

Another possibility for reducing errors is to obtain better forecastsof the causal variables. The econometric method is of little value if itis not possible to obtain reasonable forecasts of the causal variables.

It was suggested earlier that one criterion for using a variable inthe forecasting model is that the variable can itself be forecast withsome accuracy. Thus proper selection of variables for the model canhelp to ensure good forecasts of the causal variables. A popular strategyhas been to find leading indicators, that is, variables whose effects willbe felt after some time. For example, the sale of automobiles precedesthe sale of replacement tires. The signing of a trade agreement precedesa change in foreign trade. Measurement of the leading indicator pro-vides an alternative to forecasting the causal variable.

In some cases, considerable time and energy may have been investedby others in forecasting the causal variables. In economics, forecastsof income can be easily obtained from expert surveys and from econ-ometric models of the national economy.

The forecasting of the causal variables is analogous to the forecast-ing of the dependent variable. Numerous methods are available formaking the forecastjudgment, extrapolation, segmentation, or econ-ometrics. If you use the last two methods, you must generate a newset of causal variables (to forecast the original set of causal variables).This explosion of the problem is another argument for simplicity inthe original forecasting model.

One puzzling finding is that the point is quickly reached wheregreater accuracy in forecasting the causal variables does not lead togreater accuracy in forecasting the dependent variable. Obviously, youmust strive for fairly accurate forecasts of the causal variables, but inmost studies the unconditional forecasts" were more accurate thanthe conditional forecasts.

Of the 13 published studies that I found, two showed conditionalforecasts to be more accurate (Fair, 1971, 1976), one showed nodifference (Theil, 1966), and 10 showed unconditional forecaststo be more accurate: Neild and Shirley, 1961; Friend and Jones,1964; O'Herlihy et al., 1967; Kennedy, 1969; Stekler, 1970; Evans,Haitovsky, and Treyz, 1972; Juster, 1972; SCHOTT J19781 in areanalysis of forecasts by Suits, and also in a reanalysis of fore-casts by Klein and Goldberger; and HIRSCH, GRIMM, and NAR-ASIMHAM, 1974). All of these studies except O'Herlihy analyzedshort-range forecasts. The superiority of the unconditional fore-casts is statistically significant (p < .05, using the sign test).

Page 53: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

242 Econometric Methods

It is still a mystery why unconditional forecasts do better. My guessis that, in these studies, many of the causal variables were forecast byjudgment; as noted in Chapter 6, judgmental forecasts are conservative.In effect, the forecasts of the causal variables were mitigated. Becausenone of the analysts in the studies cited explicitly used mitigation, theinadvertent mitigation provided by the unconditional forecasts couldhave led to improved accuracy.

My recommendations for short-range forecasting are that you tryto obtain reasonably accurate forecasts of the causal variables and thatyou use mitigation. The first piece of advice implies that you shouldnot spend much money to obtain accurate forecasts of the causal vari-ables!

My hypothesis for long-range forecasting is that accurate forecastsof the causal variables are important. Unfortunately, there has beenlittle study on this issue. The one study (by O'Herlihy) was based ona small sample and no significant differences were found.

COMBINED FORECASTS

A combination of forecasts from different econometric models wouldbe expected to improve upon a forecast from a single econometric model.One way to obtain different forecasts is to vary key elements in theeconometric model; another approach is to have models developed in-dependently by different researchers. An a priori weighting scheme isrequired in each case.

The strategy of alternative models suggests the development of aset of possible econometric forecasting models rather than a singlemodel. This set of models should be based on various combinations ofwhat the researcher feels are reasonable ways to develop the model.For example, he might try a variety of functional forms, different op-erational measures for the conceptual variables or different sets of datato develop models, and then use each of these models to develop aforecast. Differences among these forecasts will indicate their sensi-tivity to the various decisions made by the researcher. The forecastscan then be combined. This combined forecast will be most useful ifthe preceding forecasts were highly sensitive to the alternative modelsthat were tried. Such a strategy was used successfully by BRANDON,FRITZ, and XANDER 11983] and by the following study:

Namboodiri and Lalu (1971) compared the average forecasts froma set of simple regressions to the forecasts from a single multiple

Page 54: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Assessing Uncertainty 243

regression for 10-year forecasts of population growth in 100 coun-ties in North Carolina. The MAPE for the combined forecasts was5.8 vs. 9.5 for the single regression.

It is difficult to select a priori the best econometric model for a givensituation (unless one of the models happens to be your model). Becausereasonable people differ in their biases, one strategy is to average theforecasts from models developed independently by different research-ers. If, in fact, biases are serious, and if they differ among researchers,this averaging process should lead to more accurate forecasts. The useof alternative model builders also makes it more difficult for analyststhat cheat.

The relative benefit of the strategies of alternative models and ofalternative model builders are based on limited empirical evidence. Ofthe two strategies my preference lies with the use of alternative modelbuilders, even though this is substantially more expensive than theuse of alternative models.

ASSESSING UNCERTAINTY

Numerous components contribute to uncertainty in econometric models:

Estimates of the current status,Estimates of the forecast coefficients,Forecasts of the causal variables, andExcluded variables.

Standard errors' from a regression program assess primarily theeffects of errors in estimating the forecast coefficients (item 2 above).Even then, regression analysis provides its most accurate estimates ofuncertainty near the midpoint of the causal variables. For example,income elasticity is measured most accurately near the average incomelevel for these data. It is typical for the forecast to depart substantiallyfrom this average. Sometimes the forecast of the causal variable goesoutside the range of the historical data. As this happens, the uncer-tainty due to the estimate of the causal relationship increases rapidly;forecasts outside the range of the historical data are risky.

An additional problem in using the standard errors from a regressionprogram is that they relate only to the particular set of data in question.The combination of subjective estimates and other objective estimates

Page 55: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

244 Econometric Methods

should be considered. In the case of estimates from time series data,autocorrelation will produce standard errors that are smaller than theyshould be.

Despite these problems, the standard errors of the regression areuseful. They provide a good measure of the information on each causalvariable present in the measurement model. Moreover, this informa-tion is available at no additional cost. Be on guard, however; the un-certainty as estimated from the measurement model is not an idealestimate of the uncertainty in the forecasting model.

Overall assessments of uncertainty can be made by using the fore-casting model in a way that is representative of the forecasting situ-ation. An ideal way to assess uncertainty is to withhold data and usethe forecasting models to make a series of forecasts; an estimate ofuncertainty is then based on the accuracy of these forecasts.

When it is difficult to represent a forecasting situation, you mightconsider a Monte Carlo simulation.' This requires an expected dis-tribution for each of the four types of errors listed previously. Thesedistributions can be estimated as follows:

Current status can be assessed subjectively or by comparing esti-mates drawn from different sources. Still another approach is toexamine the differences between the preliminary estimates (avail-able at the time of the forecast) and the final estimates (availablesome time later).The forecast coefficients can be estimated by pooling the standarderrors from the a priori estimate and from the various regressionanalyses. An adjustment should be made to reflect the uncertaintyabout the values of X that will be experienced in the future; thisadjustment can be done subjectively.The ability to forecast the causal variables can be assessed by ex-amining previous efforts to forecast these variables. Lacking these,it should be sufficient to use subjective estimates for these distri-butions: What is the highest you would expect variable Xi to be?What is the lowest?The errors due to excluded variables can be estimated subjectively.Some help may be gained by examining the proportion of variancethat is unexplained in the set of data that best represents the fore-casting situation.

The Monte Carlo simulation proceeds by using random numbers toselect from each of the distributions. Thus the first trial would requireselecting one value from each of the four distributions and using this

Page 56: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Summary 245

to calculate a forecast. The process is repeated many times, perhaps1,000, to build up a distribution of the forecast variables. In additionto providing a distribution for the forecast, the Monte Carlo approachcan also allow for an analysis showing which items are most responsiblefor the uncertainty in the forecast. Such information may be useful inguiding further measurement efforts.

The value of the Monte Carlo simulation seems reasonable, but Iwas unable to find any empirical evidence pointing one way or theother.

SUMMARY

Econometric models are expected to be useful when good informationis available on the causal relationships, and when large and predictablechanges are expected in the causal variables. A general approach tothe development of econometric models was outlined in Exhibit 8-1develop an a priori model, select data, analyze the data, and updatethe model.

The various steps in the a priori analysis were outlined in Exhibit8-2. Important areas are the selection of a small set of causal variablesand a statement regarding the direction of the relationships. A check-list for the selection of causal variables was summarized in Exhibit8-3.

By adding three steps to the a priori analysis ((1) specification ofthe pattern of causality, (2) of the functional form, and (3) of the mag-nitudes of relationships), it is possible to develop an a priori model thatcan be used directly in forecasting. The most important advice in de-veloping this a priori model is to keep each step simple.

Stepwise regression provides an alternative to a priori analysis, butit is not recommended. Setwise regression is a bit more reasonable,but I believe it is risky to use.

Various types of objective data were considered as sources of esti-mates for the forecasting model. These include experimental and his-torical data. They also include cross-sectional, time series, and longi-tudinal data as illustrated in the data matrix of Exhibit 8-4. Thesesources of data were rated against nine criteria in Exhibit 8-5. It isobvious even from these crude ratings that the sources of data havedifferent advantages and disadvantages. This suggests an eclecticstrategy: use a variety of sources, especially when uncertainty is highand large changes are expected.

A priori adjustments to the coefficients are important when the

Page 57: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

246 Econometric Methods

forecast situation is highly uncertain. The procedure for such an ad-justment was described. As uncertainty increases, less emphasis shouldbe placed on causality.

Estimates of relationships can be obtained from the data by usingeither experience tables or regression analysis. Regression analysis ismore intuitively appealing and is no more expensive. One danger ofthe regression model is that some analysts equate this measurementmodel with the forecasting model. This places an unnecessary and oftenmisleading restriction on the analyses.

Certain problems cause difficulties when using regression analysis.These problems, along with some brief solutions, were summarized inExhibit 8-7. In general, the solutions rely on simplicity and on the useof different sources of data.

Simplicity is a virtue in regression. Obviously, simple methods arecheaper, easier to understand, and less prone to mistakes. In addition,simple methods are as accurate as complex ones. Although this iscontrary to the beliefs of experts in econometrics, none of the 11 em-pirical studies examined showed a significant gain in accuracy fromusing complex methods. In fact, the data suggested the opposite, al-though this tendency was not significant.

Conditional regression analysis can be used to update the a prioriestimates in light of the objective data. One way to do this is to listthe estimates for each relationship and combine them, using an a prioriweighting scheme. The choice of a weighting scheme is not expectedto be of much importance.

The estimation of current status should be done independently offorecasting change; no one model is best for both purposes. If this isnot done, it is then important to use judgment or mechanical adjust-ments based on recent errors to adjust the forecasts, especially for short-range forecasts. These adjustments help to compensate for the errorsin estimating current status.

In forecasting change, it is important to mitigate the forecast coef-ficients to allow for various types of error over the forecast horizon.This step is ignored by people who forecast directly from measurementmodels. It is a dangerous omission for long-range forecasts.

Surprisingly, little benefit was found from obtaining good forecastsof the causal variables. This may be related to the absence of mitigationin the studies that were examined.

Combined forecasts were recommended. These may be generated byhaving the analyst consider reasonable variations of the judgmentsthat went into developing the model. Preferably, econometric modelsshould be developed independently by two or more researchers.

Page 58: LONG-RANGE FORECASTING From Crystal Ball to …forecastingprinciples.com/files/LRF-Ch8b.pdf · LONG-RANGE FORECASTING From Crystal Ball to Computer. Eight ECONOMETRIC METHODS Contents

Summary 247

Regression analysis yields useful and inexpensive estimates of theuncertainty in the measurement of causal relationships. To estimatethe uncertainty in the forecast, however, it is best to simulate theforecast situation by withholding actual data. If this is not possible, aMonte Carlo simulation can be used to examine the net effects of themajor types of error.