ifdp · March 31, 1976

Have Geometric Lag Hypotheses Outlived Their Time? Some Evidence in a Monte Carlo Framework

April 1976

HAVE GEOMETRIC LAG HYPOTHESES OUTLIVED THEIR TIME? SOME EVIDENCE IN A MONTE CARLO FRAMEWORK

John F. Wilson

NOTE: International Finance Discussion Papers are preliminary materials circulated to stimulate discussion and critical comment. References in publications to International Finance Discussion Papers (other than an acknowledgment by a writer that he has had access to unpublished material) should be cleared with the author or authors.

Most distributed lag models have almost no or only a very weak theoretical underpinning. Usually the form of the lag is assumed a priori rather than derived as an implication of a particular behavioral hypothesis. Exceptions to this statement are the adaptive expectations and partial adjustment models, which in part explains their recent popularity. But even here, the theoretical rationalizations offered are often only skin deep.

Zvi Griliches - "Distributed Lags: A Survey"

"Have Geometric Lag Hypotheses Outlived Their Time? Some Evidence in a Monte Carlo Framework" John F. Wilson*

I. Introduction

It is probably fair to say that there is today universal agreement that economic variables do not adjust instantaneously to their determinants. Full adjustment is in some sense distributed over time. A second proposition which would probably meet little opposition is that discerning the correct shape and length of an adjustment process is a very ticklish matter. Although distributed lag estimation has made great progress since the initial work undertaken by Fisher and Tinbergen in the 1930's, a great deal remains to be done.

The intention of the present paper is to illustrate, in a Monte Carlo framework, the results of applying several of the currently fashionable distributed lag estimating techniques to a body of real world data on which various known lag distributions have been imposed. In particular, the argument will be made (using the geometric lag hypothesis as an example) that investigators who begin with easily estimable, but unduly rigid, hypotheses run the risk of obtaining misleading results. The conclusion which emerges is that certain types of frequently-used initial hypotheses have to some degree been "obsoleted" by the development in recent years of more flexible techniques for evaluating distributed lags.

*Economist, International Finance Division, Board of Governors of the Federal Reserve System. The views expressed herein are solely those of the author and do not necessarily represent the views of the Federal Reserve System. The author is grateful to Scott Brown and Irene Cavanagh for capable research assistance in preparing the statistical material included in this paper.


II. Basic Lag-Estimating Techniques: A Thumbnail Sketch

Prior to the major contribution in 1954 by Koyck [12], an investigator studying distributed lag processes had only a narrow range of feasible options. Among these, simple OLS procedures may have been the least distasteful, despite the problems raised by time-series collinearity. Koyck's discovery that, in a certain type of bivariate equation, a simple transformation could replace an infinite string of lagged regressors with a single lagged dependent variable revolutionized the field.1/ Although it was necessary for Koyck to assume that the coefficients on the lagged regressor terms declined exponentially, estimation of geometrically declining lag coefficients subsequently enjoyed (and still enjoys) a great vogue. In the absence of more flexible procedures, there was initially hardly any gain to arguing that the world might not be governed entirely by exponential processes. Not much time had to pass, however, before Cagan [3] and Nerlove [16] came up with two rationales which lent more economic credence to Koyck's mathematical expedient.2/ This lag technique is still very much with us.


1/ Koyck himself made only modest claims, writing [12, p. 4]: "This study, it is hoped, is one minor step in a great number of successive steps still ahead in the field of estimating structural economic relations from time-series data."

2/ Cagan's "adaptive expectations" hypothesis concerns the adjustment rate of the regressor in such a relationship. Nerlove's "partial adjustment" mechanism hypothesizes a similar adjustment process for the regressand. In their simple forms, both models result in Koyck-type estimating equations, which leads to ambiguity about which hypothesis is being tested. The adaptive expectations approach also introduces serial correlation into the error term, even if it is absent in the structural hypothesis. The partial adjustment approach does not. Other complications are noted in Section VI below.
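For readers who want the algebra behind the transformation described above, a minimal sketch follows. The symbols $y_t$, $x_t$, $\alpha$, $\lambda$ and $u_t$ are generic notation assumed here for illustration; they are not taken from Koyck's text.

```latex
\begin{align*}
y_t &= \alpha \sum_{i=0}^{\infty} \lambda^{i} x_{t-i} + u_t, \qquad 0 < \lambda < 1, \\
\lambda y_{t-1} &= \alpha \sum_{i=1}^{\infty} \lambda^{i} x_{t-i} + \lambda u_{t-1}, \\
y_t - \lambda y_{t-1} &= \alpha x_t + (u_t - \lambda u_{t-1}), \\
y_t &= \alpha x_t + \lambda y_{t-1} + (u_t - \lambda u_{t-1}).
\end{align*}
```

In the last line a single lagged dependent variable stands in for the infinite string of lagged regressors, and the implied long-run response is $\alpha/(1-\lambda)$. In this sketch the composite error is a first-order moving average in $u_t$ even when $u_t$ itself is serially uncorrelated.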


In his 1967 survey on distributed lags, Griliches [8, pp. 24-25] summarized the case for geometric lags as follows: "Its main advantage is ease of estimation -- everything depends on only one additional parameter. This is done, however, at the cost of forcing a particular form of the lag on the data."

The second widely used distributed lag technique is that developed in 1965 by Almon [1], and its working assumption is that a lag structure lies along some low order polynomial. Following the appearance of Almon's paper, some researchers began to "think polynomial," and enjoyed the advantage of being able to develop separate distributions for as many variables as they chose to include in a given function.3/ Polynomial interpolation is not an unmixed blessing, however, because use of the Almon technique implies a willingness on the researcher's part to sort through various (unknown) lag lengths and curve degrees. He must be ready to "search" more actively for the lag structure than before.
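The polynomial assumption can be sketched in a few lines. The code below is an illustration only, not Almon's own computational procedure: it writes each lag coefficient as $b_i = \sum_j c_j i^j$, regresses on the transformed variables, and maps the result back to lag coefficients. The function names, the simulated series and the quadratic lag curve are all hypothetical, and no endpoint constraints are imposed.

```python
import numpy as np

def lag_matrix(x, n_lags):
    """Rows are [x_t, x_{t-1}, ..., x_{t-n_lags}] for t = n_lags .. T-1."""
    T = len(x)
    return np.column_stack([x[n_lags - i : T - i] for i in range(n_lags + 1)])

def almon_fit(y, x, n_lags, degree):
    """Polynomial lag sketch: b_i = sum_j c_j * i**j (no endpoint constraints)."""
    X = lag_matrix(x, n_lags)
    # H maps the (degree + 1) polynomial parameters c into the lag coefficients b.
    H = np.vander(np.arange(n_lags + 1), degree + 1, increasing=True).astype(float)
    Z = X @ H                                    # transformed regressors
    c, *_ = np.linalg.lstsq(Z, y[n_lags:], rcond=None)
    return H @ c                                 # implied lag coefficients

# Hypothetical data: a noise-free 2nd degree lag curve, b_i = -0.2 - 0.4 i + 0.1 i^2.
rng = np.random.default_rng(0)
x = rng.normal(size=200)
true_b = np.array([-0.2, -0.5, -0.6, -0.5, -0.2])
y = np.zeros(len(x))
y[4:] = lag_matrix(x, 4) @ true_b                # first four values are unused

b_hat = almon_fit(y, x, n_lags=4, degree=2)
print(np.round(b_hat, 4))
```

The "search" mentioned in the text corresponds to repeating such a fit over different values of `n_lags` and `degree`.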

The method recently developed by Shiller [22] carries the evolution of estimating techniques to yet a higher degree of flexibility. Shiller's procedure was developed from Bayesian priors by assuming that a linear combination of the coefficients (in this case coefficient differences of some degree) in a lag distribution is normally distributed with a zero mean and some variance. If this variance is also zero, Shiller shows that the method is equivalent to the Almon procedure.4/ However, the advantage of the method is that it is stochastic, which makes it possible for estimation results to deviate from the investigator's implicit expectations. That is, in the search for the "true" lag distribution, even if the estimator prior should misspecify the order or shape of the true curve, there is still a possibility that the regression results can approximately identify the correct pattern.

3/ This is also not entirely impossible with geometric assumptions.

While Shiller's estimator was originally derived in a Bayesian framework, it can also be described in terms of Theil-Goldberger type mixed estimation, as has been done by Shiller [23], Wilson [25] and Maddala [14]. A recent note by Taylor [24] also shows the equivalence between the two. Judging from the attention paid to it in the recent literature, this procedure is rapidly gaining in popularity.
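A smoothness-prior estimator of this kind can be sketched as follows. The sketch assumes the familiar mixed-estimation form $(X'X + k^2R'R)^{-1}X'y$, where $R$ takes differences of the lag coefficients and $k$ is the ratio of the error standard deviation to the prior standard deviation. The function names and the simulated data are hypothetical, and the code is not a reproduction of the programs used for this paper.

```python
import numpy as np

def difference_matrix(n_coef, order):
    """Matrix R such that R @ b returns the order-th differences of b."""
    R = np.eye(n_coef)
    for _ in range(order):
        R = np.diff(R, axis=0)
    return R

def shiller_fit(X, y, order, k):
    """Smoothness-prior estimate (X'X + k^2 R'R)^(-1) X'y."""
    R = difference_matrix(X.shape[1], order)
    return np.linalg.solve(X.T @ X + k**2 * (R.T @ R), X.T @ y)

# Hypothetical data: eight lag terms with geometrically declining weights.
rng = np.random.default_rng(0)
X = rng.normal(size=(60, 8))
b_true = -0.8 * 0.6 ** np.arange(8)
y = X @ b_true + rng.normal(scale=0.02, size=60)

b_loose = shiller_fit(X, y, order=2, k=0.0)   # k = 0 reduces to unconstrained OLS
b_tight = shiller_fit(X, y, order=2, k=1e4)   # very tight prior: near-zero 2nd differences
print(np.round(b_loose, 3))
print(np.round(b_tight, 3))
```

With a very large `k` the second differences are driven to zero and the estimates fall on a straight line, which is the sense in which a zero prior variance reproduces a polynomial constraint. Intermediate values of `k` let the data pull the estimates away from that line.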

A fourth estimator explored in this paper is the ridge regression technique. Originally proposed in two articles by Hoerl and Kennard [9, 10] simply as a method for overcoming matrix collinearity, ridge methods are also finding application in distributed lag problems. Recent examples can be found in notes by Rappoport [19] and Maddala [14]. In the latter paper, the Shiller estimator and various others are treated as special cases of ridge methods.

4/ It will be recalled that for a polynomial of degree "d", the "d + 1"th differences of the terms equal zero. Shiller's priors on any given level of coefficient differences, therefore, implicitly specify a polynomial curve one degree lower than the difference degree. References to Shiller's method in this paper will generally be in terms of "polynomial equivalents."
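The differencing property in this footnote is easy to check numerically. The lag curve below is a hypothetical 2nd degree polynomial chosen only for illustration.

```python
import numpy as np

i = np.arange(10)
b = 0.5 - 0.3 * i + 0.02 * i**2     # hypothetical 2nd degree lag curve

second_diff = np.diff(b, n=2)        # constant, equal to 2 * 0.02
third_diff = np.diff(b, n=3)         # zero up to floating-point rounding
print(np.round(second_diff, 6))
print(np.round(third_diff, 6))
```

A prior that pulls the third differences toward zero is therefore the "polynomial equivalent" of a 2nd degree curve.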


In most ways the foregoing discussion does little more than scratch the surface of the literature on lags, but it is assumed that the reader is familiar with the derivations of these estimators and examples of their use. Beyond the four methods explored in this paper, a variety of other more or less feasible techniques is available. Solow, for instance, explored Pascal distributions as lag priors; Jorgenson developed a rational distributed lag model (of which Almon and geometric lags are special cases); spectral methods are associated with Hannan; and Shiller [21] has recently extended his work by developing priors on differences in the logarithms of lag coefficients for the case where all coefficients are expected to be positive.5/ Along the way, certain more exotic forms have also emerged. Schmidt [20], for instance, proposed an estimator which is a sort of hybrid of the Almon and geometric methods. Corradi and Gambetta [4] advocate the use of spline functions. In this welter of developments, certain facets of distributed-lag estimation have become rather complicated. Just how complicated they can become will be appreciated by anyone who has read the recent book by Dhrymes [5] on the subject.6/

Each of the above-noted methods of estimating distributed lags imposes certain prior information on a body of data. This information implies something about the character of the lag curve which the investigator expects to find.7/ But it is also clear that the evolution over time of more flexible distributed-lag techniques has been characterized by progressively less restrictive priors concerning the structure of the lag process. The Almon method, for instance, rests on a less rigid view than the Koyck about what a true lag structure might look like. The prior information imposed by the Shiller technique is, in turn, less restrictive than the Almon priors. Each of these three methods, nonetheless, exploits priors which contain information bearing on the relationship between "adjacent" coefficients in the lag distribution. Ridge regression brings to bear information which is somewhat different in this regard. The prior imposed by this technique has little to do with the relation between a lag coefficient $b_j$ and an adjacent coefficient $b_{j+1}$, but rather with the allowable magnitude of the inner product of the entire coefficient vector, $b'b$.8/ Correspondingly, the ridge technique is better viewed as an expedient to break collinearity than as a method derived on the basis of an articulated hypothesis about the form of some particular lag distribution.

5/ In his recent paper [21, p. 12] Shiller also shows the equivalence of his new prior, applied in a certain way, to the geometric estimator. The advantage of this new version is that it could be used to estimate a (geometric) lag-pattern on a single variable.

6/ Dhrymes notes a variety of reasons why geometric lag hypotheses have gained "wide currency," but says also that one "should not turn to a theoretically deficient model simply because the estimation problems it presents can be easily tackled." [5, p. 55]

7/ Each method "works" statistically, however, largely due to data transformations which disrupt the massive collinearity which makes OLS estimation so difficult with many time series.

8/ The ridge priors are introduced by adding information, in the form of $\mu I$, to the moment matrix (or submatrix of lagged regressors) of the data prior to inversion. The regression thus takes the form $b = (X'X + \mu I)^{-1}X'y$, and $b'b$ is suppressed toward zero as the value of $\mu$ is increased. For this reason, when ridge methods are used and $\mu$ is set at a high level, individual coefficients are also "urged" toward zero. The effect is in some ways analogous to far-endpoint priors under the Almon or Shiller methods.
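The shrinkage described in this footnote can be illustrated with a short sketch. The ridge parameter is called `mu` in the code (an assumed name), and the trending regressor, lag length and lag weights are hypothetical stand-ins rather than the data used in this paper.

```python
import numpy as np

def ridge_fit(X, y, mu):
    """Ridge estimate: add mu * I to the moment matrix before solving."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + mu * np.eye(p), X.T @ y)

# Hypothetical data: lags of a random-walk series, hence strongly collinear columns.
rng = np.random.default_rng(0)
T, n_lags = 60, 9
x = np.cumsum(rng.normal(size=T + n_lags))
X = np.column_stack([x[n_lags - i : T + n_lags - i] for i in range(n_lags + 1)])
b_true = -0.8 * 0.6 ** np.arange(n_lags + 1)
y = X @ b_true + rng.normal(scale=0.02, size=T)

settings = (0.001, 0.1, 1.0, 100.0)
norms = []
for mu in settings:
    b = ridge_fit(X, y, mu)
    norms.append(float(b @ b))       # b'b, the squared length of the coefficient vector
    print(mu, round(norms[-1], 4))
```

The printed values of `b'b` fall as `mu` rises, whatever the true coefficients are, which is why a high setting pushes every estimated lag coefficient toward zero.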


III. Estimator Evolution and the Persistence of Older Forms: Some Examples

The purpose of the following sections is not to examine the virtues of these estimators per se. But if it is true, as the author believes, that the recent history of estimation techniques represents a true evolution, such an evolution implies that older methods are to some extent superseded by new ones. Behind this argument lies the observation that newer and more general estimators often at least approximately encompass the older ones as special cases. For example, although a geometric lag shape is not a simple low-order polynomial, it is encompassed by the general category of rational distributed lags. A polynomial curve of any given degree is also just a special case of all higher order curves in which, as it happens, certain coefficients are equal to zero. This fact suggests that the proper way to identify a (polynomial) lag of given degree -- leaving length questions aside for the moment -- might be to use programming which "assumes" the curve is in fact of higher degree than suspected. Such a deliberate high-side misspecification can in theory and fact identify lower-order lag structures.9/ Because the Shiller estimator can be roughly described as a "Stochastic Almon," the same comments are applicable to the use of this technique.
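The nesting of low-order curves within higher-order ones can be seen in a small numerical sketch. It fits a 4th degree polynomial directly to a hypothetical 2nd degree lag curve; it illustrates the nesting only and is not a lag regression on data.

```python
import numpy as np

lags = np.arange(9)
true_curve = -0.9 + 0.25 * lags - 0.02 * lags**2   # hypothetical 2nd degree lag curve

# Fit a deliberately over-specified 4th degree polynomial.
# polyfit returns coefficients ordered from the constant term upward.
coef4 = np.polynomial.polynomial.polyfit(lags, true_curve, deg=4)
print(np.round(coef4, 6))
```

The cubic and quartic terms come out at zero up to rounding, so the higher-degree assumption recovers the lower-degree curve. In an actual regression with noise the extra terms would be small rather than exactly zero.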

A general argument can thus be made that in order to demonstrate the correctness of a hypothesis which assumes a particular lag form, corroboratory evidence from a more general hypothesis (and its estimators) should be sought. Most frequently, however, an investigator will postulate a model, derive the corresponding lag estimator and make some runs in which "good" results are obtained. By implication, said "good" results lend support to the initial hypothesis. This does not really follow, especially in the absence of a strong a priori case in favor of the hypothesis itself, without regard to the estimation results. For instance, an investigator who has settled on a 2nd degree curve may remark that the equation fit was better than with other curves or that coefficient significance was acceptable, but often without telling the reader whether the results obtained in tests with a higher order curve happen to have even looked like a 2nd degree polynomial.10/ However, if supplementary tests do not support such conclusions, it seems fair to ask that this negative evidence be taken into account.11/

9/ Monte Carlo results obtained by the author bear this out. Known 2nd degree curves of various sorts (and with various error variances) were successfully identified using not only 2nd, but also 3rd and 4th degree assumptions when the lag length is also correctly found.

As a prime case in point, Koyck-type structures have been popular with the profession for the last twenty years. Once a lagged dependent variable has been jiggled to the RHS of an equation, one estimates geometric lags. More precisely, one estimates only geometric lags. Such a procedure makes rather uncharitable assumptions about the capacity of the data to tell its own story. Although one might have thought that newer techniques would have replaced this approach, this does not in fact seem to be the case. Just in the body of literature with which this writer is most familiar -- the area of international trade studies -- numerous Koyck-type models have been estimated in the years since the Almon method became generally available. Among these are studies by Branson [2], Kwack [13], Prachowny [17], Gregory [7], Rao [18], Hooper [11], Miller-Fratianni [15], and most recently Goldstein and Khan [6]. One suspects that, in the absence of a priori reasons why a geometric lag pattern should obtain, the main reason for the choice of this technique was ease of estimation. Sorting through an abundance of curves and lag lengths is, after all, rather messy and tiring compared with the simplicity of the alternative.12/

10/ In multivariate regressions, especially those on trended data, the difference in fit is usually marginal at best. The results shown in Tables 1-3 below illustrate this point.

11/ In practice, omitted variables and other functional misspecifications can produce such results, even when the low-order lag process exists. The point is, rather, that in the literature such evidence is usually omitted altogether.

From the above it should be clear that the writer does not find such a solution to be satisfying. The remainder of this paper will therefore be devoted to introducing evidence drawn from a recent Monte Carlo experiment to support the following two propositions:

a) Even if the true lag structure is not in fact characterized by geometric decay, the results of Koyck-type estimation can (spuriously) tend to support the Koyck hypothesis.

b) Supposing, in contrast, that the true structure is characterized by a geometric lag structure on one or more of the regressors, other forms of lag estimation are capable of approximately identifying such patterns, and there is thus no need to begin with a Koyck-type hypothesis. Experiments with such a structure have been made using Almon, Shiller and ridge estimators.

12/ The paper by Miller-Fratianni is especially interesting in that both Almon and geometric-type estimations are made. The results are strikingly different, but the authors offer no comment on why this might be or what it suggests for the validity of either set of estimates.


IV. Equation Forms and Error Structure

Drawing on a data-bank assembled by the author in connection with a disaggregated study of U.S. import demand [25], three variants of the following equation were constructed:

$$DV_t = k + a \ln(Y)_t + \sum_{i=0}^{n} b_i \ln\left(\frac{P_m}{P_d}\right)_{t-i} + e_t$$

This is a simplified replica of a fairly typical U.S. import demand relation, and the right-hand variables represent actual historical data for the period 1958.I-1971.IV.13/ Variable definitions follow below:

$DV$ = dependent variable (sum of RHS components, including the random error term);

$Y$ = United States GNP in billions of constant dollars (nominal GNP divided by the implicit deflator);

$P_m$ = implicit deflator for imports of goods and services;

$P_d$ = deflator for U.S. private, non-farm output;

$e$ = random error term.

Since this experiment is for illustrative purposes only, no lag structure was assigned to the income term.14/ No error component was added to right-hand variables, and the error term $e_t$ does not follow an autoregressive scheme. Using results from typical aggregate U.S. import equations, the "known" parameters (elasticities) in the above equation were set as follows:

$k = -5.0$

$a = 1.2$

$\sum b_i = -2.0$

13/ The net estimating "sample," allowing for various lag constructions, encompasses the 1962.III to 1971.IV period. Price terms were set on a 1963 base, although any other base could be used equally well.

14/ Most empirical trade studies not using Koyck transformations have in any case "found" lag structures to be much shorter on income than on price terms in such an equation.


The basic approach thus is to assume that there is reasonably reliable information on certain aspects of such an equation (e.g., long-run elasticities), but little on the shape or time-span of adjustment. The investigator searching for such information might therefore try a variety of lag techniques, most of which will, by definition, be incorrectly specified. On the basis of the regression results he will have to conclude what he can about the unknown process which generated the data. To illustrate some cases which might occur, three different forms of lag distributions were assigned to the relative price term of the basic equation. These are described below:

Case 1: No lag structure. The parameter values corresponding to this case are $b_0 = -2.0$, and $b_i = 0$ for all $i > 0$.

Case 2: A rudimentary, truncated lag structure which is not geometric in type. Here, for the basic case, $b_0 = -1.5$; $b_1 = -0.5$; and $b_i = 0$ for all $i > 1$. An alternative case will also be discussed.

Case 3: Geometric decay in the lag structure on the relative price term. The coefficient vector was set up to obey the relation $b_i = \lambda^i b_0$, where $b_0 = -0.8$ and $\lambda = 0.6$.

This process yields the following values for the individual lag coefficients:

$b_0$ = -.800        $b_5$ = -.06221
$b_1$ = -.480        $b_6$ = -.03733
$b_2$ = -.288        $b_7$ = -.022395
$b_3$ = -.1728       $b_8$ = -.01343
$b_4$ = -.10368      $b_9$ = -.00806
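The Case 3 coefficients follow mechanically from the stated rule, as the short sketch below shows. The variable names are chosen for illustration; the starting value and decay rate are those given in the text.

```python
b0, lam = -0.8, 0.6                    # starting coefficient and decay rate for Case 3

coeffs = [b0 * lam**i for i in range(10)]
for i, b in enumerate(coeffs):
    print(f"b{i} = {b:.6f}")

truncated_sum = sum(coeffs)            # response accumulated over lags 0 through 9
infinite_sum = b0 / (1 - lam)          # limit of the geometric series
print(round(truncated_sum, 4), round(infinite_sum, 4))
```

Ten terms capture nearly all of the full geometric response, so a nine-quarter lag is approximately, though not exactly, the correct length for this case.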


The error term $e_t$ in each equation for cases 1-3 was generated by the standard Fortran Subroutine Gauss, which produces a normally distributed variate, with mean and standard deviation specified by the user. Therefore $e_t \sim N(\bar{e}, s_e^2)$, where for the results tabulated $\bar{e} = 0$ and $s_e = .020$.15/

One further comment should be added before the empirical results are described. In a full scale Monte Carlo experiment a large number of replications are needed to approximate the large-sample properties of the estimator. Due to the limited scope of this study -- which, however, still required the tabulation of a goodly number of equations -- the results summarized in the following tables are based on only ten runs with the data. (The correlation matrix of current and lagged variables included in the regressions is shown in the Appendix.)
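The way the dependent variable is built up can be sketched as follows. The historical GNP and price series are not reproduced here, so `log_y` and `log_p` below are hypothetical stand-ins, and NumPy's normal generator takes the place of the Fortran routine. Only the parameter values, the error standard deviation and the number of runs come from the text.

```python
import numpy as np

def simulate_dv(log_y, log_p, k, a, b, s_e, rng):
    """DV_t = k + a*log_y_t + sum_i b_i*log_p_{t-i} + e_t, for t >= len(b) - 1."""
    n = len(b) - 1
    T = len(log_y)
    lagged = np.column_stack([log_p[n - i : T - i] for i in range(n + 1)])
    e = rng.normal(loc=0.0, scale=s_e, size=T - n)   # non-autoregressive normal errors
    return k + a * log_y[n:] + lagged @ np.asarray(b) + e

rng = np.random.default_rng(1976)
T = 56                                               # 1958.I-1971.IV spans 56 quarters
log_y = 6.0 + 0.01 * np.arange(T)                    # hypothetical trended log income
log_p = np.cumsum(rng.normal(scale=0.01, size=T))    # hypothetical log relative price

case2_b = [-1.5, -0.5]                               # Case 2 price lag coefficients
runs = [simulate_dv(log_y, log_p, k=-5.0, a=1.2, b=case2_b, s_e=0.020, rng=rng)
        for _ in range(10)]
print(len(runs), runs[0].shape)
```

Each run keeps the right-hand variables fixed and redraws only the error term, so differences across runs reflect the noise alone.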


15/ Increasing the level of s creates rising levels of "noise" in the dependent variable, which has detrimental effects on the fitting properties of the equation. A range of values from s = .002 to s = .040 was tried. The value .020 was chosen for tabulation largely because the resulting equation fits were about the same as those produced by aggregate import equations.


V. Estimation Results

Case 1: No lag distribution.

Letting Y be the log of real income and P stand for the log of relative prices, the equation to be estimated has the form:

$$DV_t = -5.0 + 1.2Y_t - 2.0P_t + e_t \qquad (s_e = .020)$$

Basic estimation results for this equation are shown in Table 1 and Graph 1 on the following pages.16/ From these it can be seen that a simple, correctly specified equation (i.e., no lags) estimates this structure quite well. (In fact, in terms of fit, long-run price elasticity and estimate of the constant, all of the equations are satisfactory.) The other two unconstrained OLS estimates each come up with results close to the true values of $a$, $k$, and $b_0$, although the 9 quarter misspecification shows the highly erratic coefficient pattern typical of such regressions.

If a geometric lag is incorrectly hypothesized and a lagged dependent variable is added to the RHS of such an equation, the results are still satisfactory in terms of the price structure. The mean coefficient on the lagged dependent variable comes out very low (.0340), and in all runs this coefficient was found to be insignificant. This suggests that if the true structure is of the restrictive Case 1 type, but the researcher incorrectly assumes a geometric type lag distribution exists, the error can be caught by the estimation results.17/

16/ Estimates shown in all tables are the means of the results from the 10 runs. To conserve space no measure of coefficient significance is shown, but comments on this point will be made in the text.

17/ That is, so long as there is no autocorrelation in the error structure. Griliches [8, pp. 33-34] cites results from an equation similar to that in Case 1. His equation is $y_t = ax_t + u_t$, but $u_t$ follows a first-order autoregressive scheme. The (erroneous) introduction of $y_{t-1}$ as a regressor in this structure produced a significant coefficient. Griliches therefore concludes that "the partial adjustment model will work even if it is wrong."

Table 1. Estimation Results for Sample Equation 1: No Lag Structure

[Table entries are not recoverable from the source scan. Rows: True Equation; OLS (No Lag, 1Q Lag, 9Q Lag); Koyck 1/ (implied coefficients and long-run values); Almon - 9Q (2nd and 3rd degree curves); Shiller 2/ - 9Q (1st, 2nd and 3rd degree curves); Ridge a/ - 9Q.]

1/ Lagged dependent variable included.
2/ Expressed in polynomial curve "equivalents." k = .05
a/ $\mu$ = .001

GRAPH 1. Coefficient patterns for equation with no lag structure. [Figure: coefficient estimate plotted against lag quarter for the Almon (2nd degree), geometric, Shiller (2nd degree) and ridge estimates.]

A comparison of the Almon and Shiller estimates of the Case 1 equation is also instructive. All five of the cases shown have been deliberately grossly misspecified.18/ None of these equations errs much in estimating the income coefficient or the constant. However, there is, for each estimator considered separately, a clear improvement in the estimate of $b_0$ as the curve prior increases. It can also be seen that for the coefficients $b_1$-$b_9$ (whose true value is zero) results generally improve for increases in this same prior. As between the two estimators, for any given explicit or implicit curve assumption the Shiller results are clearly superior for most coefficients.19/ Further, a number of the Almon coefficients are significant, whereas beyond $b_1$ the Shiller coefficients are almost uniformly insignificant at accepted levels.

Results obtained by the ridge regression have the same erratic, disappointing features as those obtained by the nine-quarter OLS functions.20/

18/ In Tables 1-3 only the nine-quarter results are shown for Almon, Shiller and ridge specifications. In Table 3 these lag lengths are approximately correct; in the others they are misspecifications. The author has also obtained less extreme five and seven quarter misspecifications which show similar features. The value of "k" (representing the ratio of the standard deviation of the random error in the structure to that of the prior) has been set at .05 for the tabulated Shiller equations. This value is about half that which would result from the rule of thumb suggested by Shiller [22, p. 779].

19/ The exceptions generally occur at points where the Almon curves cross the zero line.

20/ There is no well-defined theory about the proper setting for $\mu$ in ridge regression. Hoerl and Kennard suggest a search procedure. Maddala, after trying values of 0.1 and 0.005, concludes that the value "has to be really very low." (p. 11) Values used by the author ranged from 1.0 down to .001, with results from the lowest setting shown in the tables. Higher settings tended to depress all coefficients in the distribution toward zero, irrespective of their true values.


The equation considered under Case 1 has some theoretical interest, but is the most implausible of the structures which might obtain in the real world and is of limited usefulness. Most researchers would expect to find some sort of lag structure in such an equation, even if they had no clear notion about the exact nature of the lags. Let us therefore see what happens when even a simple lag-structure on prices is tacked onto the equation.

Case 2: Rudimentary lag on prices.

The specimen equation for this case is written as follows:

$$DV_t = -5.0 + 1.2Y_t - 1.5P_t - 0.5P_{t-1} + e_t \qquad (s_e = .020)$$

There is again no need to persuade anyone that such a structure is plausible, but it is interesting to note the difference in estimation results when even this one period of lagged influence is assumed. Table 2 and Graph 2 on the following pages summarize these findings.

As in the no-lag case, all equations do reasonably well in identifying the long-run price response. The correctly specified equation (OLS-1Q price lag), as expected, also performs well, and the nine-quarter OLS results have the same erratic properties as seen earlier. Since the first OLS equation contains no lagged terms, it has no choice but to "assign" the full price response to the current quarter.

More interesting results are obtained with the (again, incorrect) geometric specification. The estimate of $b_0$ is overstated and that of $b_1$ is understated; succeeding coefficients indeed go to near zero.

Table 2. Estimation Results for Sample Equation 2: Rudimentary Lag Structure on Prices

[Table entries are not recoverable from the source scan. Rows: True Equation; OLS (No Lag, 1Q Price Lag, 9Q Price Lag); Koyck 1/ (implied coefficients and long-run values); Almon - 9Q (2nd and 3rd degree curves); Shiller 2/ - 9Q (1st, 2nd and 3rd degree curves); Ridge a/ - 9Q.]

1/ Lagged dependent variable included.
2/ Expressed in polynomial curve "equivalents." k = .05
a/ $\mu$ = .001

GRAPH 2. Coefficient patterns for equation with rudimentary lag structure. [Figure: coefficient estimate plotted against lag quarter (0 through 8) for the true pattern and the Almon (2nd degree), geometric, Shiller (2nd degree) and ridge estimates.]

What is somewhat disturbing is that in about half the runs the coefficient on the lagged dependent variable was found to be significant, suggesting that such a specification can produce results which confirm an erroneous hypothesis. Moreover -- also as a result of the technique -- the estimate of the current income parameter is also affected. If there were other regressors in the true relation, estimates of their coefficients would be similarly biased.

To develop this argument somewhat further, it might be borne in mind that the structure of the true equation given in Table 2 shows the Koyck technique in a relatively favorable light, considering the misspecification. Most of the true lag coefficients are zero, and lag decay is monotonic. The constructed lag is, in fact, about the simplest imaginable. Even so, estimation results show a tendency to support an incorrect view of the structure. What would happen if this lag were lengthened and complicated a bit? To illustrate, the price lag in the Case 2 equation was extended by one quarter (to t-2) and the shape of the distribution was changed to a mild inverted V, but with the same lag sum. The equation which results from this alternate Case 2 is:

$$DV_t = -5.0 + 1.2Y_t - 0.5P_t - 1.0P_{t-1} - 0.5P_{t-2} + e_t \qquad (s_e = .020)$$

When a lagged dependent variable was included in the regression, the mean estimation results were

$$DV_t = -3.6718 + .8632Y_t - 1.1858P_t + .3105DV_{t-1} \qquad \bar{R}^2 = .9904$$


It is clear that a small modification of the lag process causes the Koyck-type misspecification to go to pieces in almost every imaginable way. The short-run price and income estimates are much further away from their true values than before. So also are the long-run estimates. By the usual procedure, in fact, the long-run income elasticity works out to 1.2519 and the long-run price elasticity to -1.7197 (a 14 per cent error). Finally, in all but one of the runs, the coefficient on the lagged dependent variable was found to be significant; in most cases it was highly significant. There is, of course, no hope that this estimating technique can accommodate the inverted V-form of the lag.21/ For this reason it might be argued that the particular lag-shape chosen for this second experiment "loads the dice" against a geometric estimator. This is not necessarily the case. First, lag shapes in the real world are unknown, but since inverted V distributions have found empirical support in various areas, this kind of shape is hardly farfetched. On these grounds alone it deserves inclusion in this experiment. Among the involved structures which may characterize real-world responses, in fact, even this form must be classed among the very simple. Secondly, for both Cases 1 and 2, the results we have chosen to tabulate are also not entirely fair to the capacities of the Almon and Shiller estimators.22/

21/ While not reported in detail, the Almon and Shiller estimates of this alternate equation were much more successful in picking up the change in the lag structure and adapting to the inverted V form. See Shiller /21, p. 25/ for another example of an ostensibly "exponential lag" which changes shape when less restrictive priors are used.

22/ As noted above, the tabulated Almon, Shiller and ridge estimation results were deliberately and rather sharply misspecified. This was partly to give comparability to the three tables in this paper, and partly to point up the "clues" given by these more flexible estimators which can help an investigator improve the specification. Coefficient signs and significance are useful in this regard, although differences in overall equation fit may not be very helpful without elaborate testing procedures.
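The "usual procedure" for long-run elasticities referred to above divides each short-run coefficient by one minus the coefficient on the lagged dependent variable. A minimal sketch of that arithmetic, using the mean coefficients reported in the text (the function name is hypothetical):

```python
def long_run_elasticity(short_run: float, lagged_dv_coef: float) -> float:
    """Long-run response implied by a Koyck-type equation: beta / (1 - lambda)."""
    if not 0.0 <= lagged_dv_coef < 1.0:
        raise ValueError("lagged-dependent-variable coefficient must lie in [0, 1)")
    return short_run / (1.0 - lagged_dv_coef)

# Mean estimates from the lagged-dependent-variable regression reported above
income_lr = long_run_elasticity(0.8632, 0.3105)
price_lr = long_run_elasticity(-1.1858, 0.3105)
print(round(income_lr, 4), round(price_lr, 4))
```

With the rounded coefficients this gives roughly 1.252 and -1.720, which agrees with the figures in the text up to rounding.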


Returning to the basic Case 2 results, the comments which might be made about the Almon, Shiller and ridge estimates are much the same as were made with respect to the Case 1 no-lag equation. Estimation results improve as curve priors increase, with the Shiller results outperforming the Almon for equivalent curve degrees. Overall, the 2nd degree Almon traces the lag curve less well than the Koyck, but again this is an artifact of the particular curve chosen (and the conclusion reverses for the two-period alternate lag case). Once again the ridge results show erratic sign changes and shifts in coefficient magnitudes; they seem generally inferior to the other estimates.
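The Almon estimates discussed above force the lag coefficients to lie on a low-degree polynomial in the lag index. A minimal sketch of that idea follows, assuming a single lagged regressor, a constant, and no end-point constraints; the function names and the test weights are hypothetical and are not taken from the paper.

```python
import numpy as np

def lag_matrix(x, n_lags):
    """Columns x_t, x_{t-1}, ..., x_{t-n_lags} for t = n_lags .. len(x)-1."""
    n = len(x)
    return np.column_stack([x[n_lags - i : n - i] for i in range(n_lags + 1)])

def almon_estimate(x, y, n_lags, degree):
    """Polynomial-lag estimate: b_i = sum_j a_j * i**j, with a_j fitted by OLS."""
    lags = lag_matrix(x, n_lags)
    # powers[i, j] = i**j maps polynomial coefficients a_j to lag weights b_i
    powers = np.vander(np.arange(n_lags + 1), degree + 1, increasing=True)
    constructed = lags @ powers                    # the "Almon variables"
    X = np.column_stack([np.ones(len(constructed)), constructed])
    coef, *_ = np.linalg.lstsq(X, y[n_lags:], rcond=None)
    return coef[0], powers @ coef[1:]              # constant, implied lag weights

# Noise-free check with hypothetical weights that are exactly quadratic in i
rng = np.random.default_rng(0)
x = rng.standard_normal(80)
true_b = np.array([0.0, 0.3, 0.4, 0.3, 0.0])       # 0.1 * i * (4 - i)
y = np.full(80, np.nan)
y[4:] = 2.0 + lag_matrix(x, 4) @ true_b
const_hat, b_hat = almon_estimate(x, y, n_lags=4, degree=2)
```

When the true weights are not polynomial of the assumed degree (an inverted V, for instance), the fitted curve can only approximate them, which is the kind of misspecification the tabulated low-degree results illustrate.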

Summarizing the Case 2 findings, the evidence shows that when a single lagged term is added to only one of the basic regressors in the relation, at least some of the coefficients (in various runs) on a lagged dependent variable will take on statistical significance. If the lag structure is enriched slightly, most such coefficients will turn up significant, despite the fact that the lag is still primitive (probably much less complex than in reality), and even when there is no lag attached to other regressors. The disturbing conclusion from these findings is that a researcher who erroneously begins with a "gap-closing" structural assumption may find his hypothesis "confirmed" by the estimation results. In some cases this may lead to serious misjudgments as to whether short-term response with respect to all variables is "elastic" or "inelastic," even if the long-term estimate is approximately correct.


We have here a situation where a false hypothesis can be upheld by the data. If this be true, then in the absence of further supportive evidence one can argue that estimation results obtained on the basis of Koyck-type hypotheses cannot really prove much at all about what the true lag-structure might be.23/ Given this circumstance, one may ask why so many researchers continue to start with such a hypothesis and what other evidence aside from (possibly false) regression results they can produce to show that geometric adjustment exists at all? One possible form of evidence can be illustrated by turning to the Case 3 equation described above.
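The risk described above, that a Koyck-type regression appears to "confirm" a geometric lag which is not in the data, can be explored with a small simulation. The sketch below is only an illustration under loud assumptions: the regressors are synthetic (the paper uses real-world series), and the two-term price lag and all numerical values are hypothetical rather than the paper's Case 2 design.

```python
import numpy as np

def lagged_dv_t_stat(rng, n_obs=60):
    """One run: the true model has a two-term price lag and NO lagged dependent variable."""
    # Hypothetical regressors, not the paper's data
    y = 4.0 + 0.01 * np.arange(n_obs) + 0.02 * rng.standard_normal(n_obs)
    p = np.cumsum(0.02 * rng.standard_normal(n_obs))
    e = 0.02 * rng.standard_normal(n_obs)

    # Hypothetical true process: dv_t = -5 + 1.2 y_t - 0.8 p_t - 0.8 p_{t-1} + e_t
    dv = -5.0 + 1.2 * y[1:] - 0.8 * p[1:] - 0.8 * p[:-1] + e[1:]

    # Misspecified Koyck-type regression: dv_t on constant, y_t, p_t, dv_{t-1}
    X = np.column_stack([np.ones(n_obs - 2), y[2:], p[2:], dv[:-1]])
    z = dv[1:]
    beta, *_ = np.linalg.lstsq(X, z, rcond=None)
    resid = z - X @ beta
    s2 = resid @ resid / (len(z) - X.shape[1])
    se = np.sqrt(np.diag(s2 * np.linalg.inv(X.T @ X)))
    return beta[3] / se[3]          # t-ratio on the lagged dependent variable

def share_significant(n_runs=200, seed=0, critical=2.0):
    """Fraction of runs in which the lagged-DV coefficient looks 'significant'."""
    rng = np.random.default_rng(seed)
    t_stats = np.array([lagged_dv_t_stat(rng) for _ in range(n_runs)])
    return float(np.mean(np.abs(t_stats) > critical))
```

Calling `share_significant()` returns the proportion of runs in which the spurious coefficient passes a rough |t| > 2 screen; the outcome depends on the assumed design and says nothing about the paper's own tabulated results.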

Case 3: Geometric lag structure on prices.

In this case it is postulated that there in fact exists (unknown to the researcher) a geometric lag-decay process on the relative price term of the hypothetical import equation. Such a process technically is of infinite duration, but beyond some point in time the influence of lagged regressors becomes negligible, and for estimation purposes it is convenient to truncate the distribution. The decay process was cut off at period t-9, so that the true function is

$DV_t = -5.0 + 1.2\,Y_t + \sum_{i=0}^{9} b_i P_{t-i} + e_t \quad (s_e = .020), \qquad b_0 = -0.8, \quad \lambda = 0.6$

The sum of the b_i comes to almost -2.0 over the ten included terms.
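The size of the truncation can be checked directly. A minimal sketch, assuming the weights decline as b_i = b_0 λ^i with the b_0 and λ given above (the function name is hypothetical):

```python
def truncated_geometric_weights(b0: float, lam: float, n_terms: int) -> list:
    """Lag weights b_i = b0 * lam**i for i = 0 .. n_terms-1."""
    return [b0 * lam ** i for i in range(n_terms)]

weights = truncated_geometric_weights(-0.8, 0.6, 10)   # periods t through t-9
truncated_sum = sum(weights)
infinite_sum = -0.8 / (1.0 - 0.6)                      # limit of the untruncated process
print(round(truncated_sum, 4), round(infinite_sum, 4))
```

The ten-term sum is about -1.988 against -2.0 for the infinite process, so cutting the decay off at t-9 drops well under one per cent of the total response.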

23/ Though the arguments made here are directed specifically against the use of geometric lags, they apply also with some force against drawing firm conclusions from results obtained by using any lag structure in the absence of corroboration by yet more general forms. The low-order Almon results in both Cases 1 and 2 above are another case in point.


Obviously there are numerous possible ways an investigator who is uncertain of the true distribution might misspecify such an equation while searching for the structure. The most probable of these is that, as for some of the results shown in Tables 1 and 2, the true length of the lag-distribution will be guessed incorrectly. It is also probable that it will be unknown whether or not there are other lag-distributions in the equation. For purposes of simplifying the third portion of this experiment and highlighting some of the results, we have made three assumptions: 1) the investigator knows or correctly guesses that there is a lag distribution on prices, but is unsure about the income term; 2) he further surmises that the length of the price distribution is about nine quarters, but 3) he does not know the shape of the distribution. He therefore again applies all of the types of estimators discussed in the above sections. The findings thus obtained are summarized in Table 3 and Graph 3 on the following pages.

Results produced by the OLS estimators show the expected features. The first two equations, which understate the lag-length, also underestimate the long-run price elasticity, and somewhat overstate the income elasticity built into the structure. In the third, correctly specified equation, estimates of both these parameters come quite close to their true values, and the main problem is the erratic coefficient pattern and significance attaching to the price coefficients.


Table 3. True Equation and Estimation Results for Sample Equation 3: Geometric Lag Structure on Prices

[Table values are not legible in the source scan. Legible table notes: 1/ Lagged dependent variable included. Expressed in polynomial curve "equivalents."]


Some remarks on the comparative performance of the Almon and Shiller estimates seem also to be in order. One point is that the Shiller "1st degree curve (equivalent)" estimate is produced by a method which, in a Bayesian sense, assumes that the underlying curve is really of the 1st degree, i.e., a straight line. This, too, is a misspecification of sorts. Nonetheless, over most of the lag range, this Shiller estimate outperforms an Almon of higher degree.26/ The results using higher level curve priors underscore this conclusion, especially as regards accuracy in estimating larger, near-period coefficients.27/ Again, the ridge estimates are clearly less stable than the others, and even seem less reliable than the OLS results.28/
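The contrast drawn above between Almon and Shiller estimates rests on how the smoothness prior enters. One common way to compute an estimator of this kind is as least squares on augmented data, with extra rows that penalize the (degree+1)-th differences of the lag coefficients. The sketch below assumes that formulation; the function name, the argument layout and the exact scaling of the tightness parameter k are assumptions of this illustration, not details confirmed by the paper.

```python
import numpy as np

def shiller_estimate(Z, L, y, degree=1, k=1.0):
    """Smoothness-prior estimate via augmented least squares.

    Z: unpenalized regressors (n x q), e.g. constant and income.
    L: lagged-regressor columns (n x m), e.g. P_t ... P_{t-9}.
    degree: "equivalent" curve degree; the (degree+1)-th differences of the
            lag weights are shrunk toward zero.
    k: tightness of the prior (k = 0 reproduces OLS).
    """
    m = L.shape[1]
    D = np.diff(np.eye(m), n=degree + 1, axis=0)          # difference operator on lag weights
    R = np.hstack([np.zeros((D.shape[0], Z.shape[1])), D])
    X = np.hstack([Z, L])
    X_aug = np.vstack([X, k * R])                          # prior rows stacked under the data
    y_aug = np.concatenate([y, np.zeros(R.shape[0])])
    beta, *_ = np.linalg.lstsq(X_aug, y_aug, rcond=None)
    return beta                                            # first q entries: Z; rest: lag weights
```

Unlike an Almon polynomial, the prior does not force the weights onto the curve exactly: with a low k the data can pull the estimates away from a straight line, which is consistent with the explanation offered in footnote 26.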


26/ This is due to choosing a low value for k, the tightness prior. As Shiller cautions, k is not independent of measurement units.

27/ In contrast to results shown in Tables 1 and 2, in Table 3 the Shiller "3rd degree equivalent" estimates of b_0 through b_9 show greater errors than those made by the lower level priors. The reasons for this are unclear, but it is probable that the anomaly would be resolved if more runs had been made.

An earlier version of this paper experimented with several values for the variance of the error term. As might be expected, the Shiller results obtained in individual runs are much more sensitive to this variance than are the Almon estimates. When the value of s_e is lowered to .002, the variance of the Shiller coefficients around their mean values is cut down greatly.

28/ Levels of u higher than .001 once again rapidly produced a pronounced "flattening" of the b_i estimates along the abscissa. These results suggest that even searching over small values of u is unlikely to be of much help in finding a sensible lag pattern.
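The "flattening" noted in footnote 28 is what a ridge penalty does mechanically: as u rises, the coefficient vector is pulled toward zero. A minimal sketch, assuming the textbook form (X'X + uI)^{-1}X'y applied to the regressor matrix as passed; whether the paper standardizes the regressors or exempts the constant is not stated here, so those choices are left to the caller, and the function name is hypothetical.

```python
import numpy as np

def ridge_path(X, y, penalties):
    """Ridge estimates (X'X + u I)^{-1} X'y, one row per penalty value u."""
    XtX = X.T @ X
    Xty = X.T @ y
    eye = np.eye(X.shape[1])
    return np.array([np.linalg.solve(XtX + u * eye, Xty) for u in penalties])
```

Plotting the rows of the returned array against u gives the usual ridge trace; the overall length of the coefficient vector can only fall as u increases, although individual coefficients may change sign along the way.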


VI. Conclusions

The results produced by the Case 3 experiment in this paper clearly show that several varieties of estimators are capable of approximately identifying a geometric lag decay if that is in fact the mechanism which produces a given body of data. From experiments with the more rudimentary Case 2 lag structures, we also know that using a too restrictive hypothesis may lead to the spurious conclusion that a false hypothesis is correct. In other words, Koyck-type estimation may wrongly "find" a geometric lag pattern where it does not exist, and higher level estimators are capable of finding it if it does exist. These findings tend to confirm the two propositions put forth above at the end of Section III.

We may thus return to the question posed at the outset of this paper: Under these circumstances, why is it that so many investigators continue to hypothesize that the world is governed by partial-, stock- and other types of "gap-adjustment" mechanisms which generate a Koyck-type estimator with its demonstrable deficiencies, when better methods are available? As suggested at the outset, the reason is probably that these starting points are known to lead to functions which are just plain easy to estimate.29/ This can be an important matter -- for instance, in the case of the kinds of trade studies cited in earlier sections. In such studies the question of short-term price and exchange-rate responsiveness is of some interest independently of the long-run considerations.30/

29/ Some studies even appear to make errors in the way the data-transformation is carried out. Almost without exception, formal derivations of gap-type models begin in a bivariate framework, out of which pops a lagged-dependent variable. If the original relation is multivariate, however, other forms can result. A Cagan-type adaptive expectations model (gap closed at some rate on one regressor) will give an estimation equation showing lags on the other regressors. So will the original Koyck-type equation in which the lag distribution on some regressor is directly assumed to decline geometrically. Of the forms mentioned here, only Nerlove's partial-adjustment mechanism (dependent variable adjusts) can be transformed in the multivariate framework by the simple addition of a lagged dependent variable.
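The point made in footnote 29 about multivariate relations can be seen by carrying out the Koyck transformation explicitly. The derivation below is a sketch in this paper's notation, with a geometric lag on prices only; the symbols a and c for the constant and the income coefficient are assumptions introduced for illustration.

```latex
\begin{align*}
DV_t &= a + c\,Y_t + b_0 \sum_{i=0}^{\infty} \lambda^{i} P_{t-i} + e_t, \\
\lambda\,DV_{t-1} &= \lambda a + \lambda c\,Y_{t-1} + b_0 \sum_{i=1}^{\infty} \lambda^{i} P_{t-i} + \lambda e_{t-1}, \\
DV_t &= (1-\lambda)\,a + c\,Y_t - \lambda c\,Y_{t-1} + b_0 P_t + \lambda\,DV_{t-1} + (e_t - \lambda e_{t-1}).
\end{align*}
```

Subtracting the second line from the first leaves a lagged income term with coefficient $-\lambda c$ alongside the lagged dependent variable, so an estimating equation that adds only $DV_{t-1}$ omits a regressor unless the adjustment applies to the dependent variable itself.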

While the criticism in this paper has been largely directed at one special form of estimating lag structures, the implications are somewhat more general. The basic point is that the estimation technique derived from any hypothesis shares the limitations of that hypothesis. Nothing in this paper should be construed to suggest the author believes that geometric (or polynomial) lag processes cannot or do not really exist somewhere out there; it is only that those who purport to find them should be obliged either to a) state a strong a priori case in favor of such findings, or b) give some additional evidence when the shape of the lag-process is of particular interest. But in the vast mass of published empirical conclusions on the shape of distributed lags, it is difficult to find a study in which an author actually backs up, in either of these two fashions, an assertion that a lag has particular properties. One result is that the literature is filled with masses of seemingly contradictory results. The "checking" process suggested in this paper has obvious limitations of its own, but it is at least apparent that such verification can help clarify what conclusions the data do or do not sustain.

30/ One thinks, for instance, of the chagrin evinced by most forecasters when the exchange rate changes set in December, 1971, under the Smithsonian Agreement failed to produce their desired results after a few quarters. If, as Table 3 suggests, geometric estimates overstate short-term responsiveness, this is one possible reason.

APPENDIX: Correlation Matrix for Sample Data

[Correlation matrix among DV, Y and P0 through P9; the entries are not legible in the source scan.]

NOTE: Dependent variable (DV) includes random error component.

References

1. Almon, Shirley. "The Distributed Lag Between Capital Appropriations and Expenditures," Econometrica, January, 1965, pp. 178-196.

2. Branson, William H. "A Disaggregated Model of the U.S. Balance of Trade," Staff Economic Studies #44, Board of Governors of the Federal Reserve System, February, 1968.

3. Cagan, P. "The Monetary Dynamics of Hyper Inflations," in M. Friedman, ed., Studies in the Quantity Theory of Money (Chicago: University of Chicago Press), 1956.

4. Corradi, C. and Gambetta, G. "The Estimation of Distributed Lags by Spline Functions" (processed), 1975.

5. Dhrymes, Phoebus J. Distributed Lags: Problems of Estimation and Formulation (San Francisco: Holden-Day), 1971.

6. Goldstein, Morris, and Khan, Mohsin S. "Large Versus Small Price Changes and the Demand for Imports," (mimeographed) International Monetary Fund, DM/74/20, February 27, 1975.

7. Gregory, R.G. "United States Imports and Internal Pressure of Demand: 1948-68," American Economic Review, March, 1971, pp. 27-47.

8. Griliches, Zvi. "Distributed Lags: A Survey," Econometrica, January, 1967, pp. 16-49.

9. Hoerl, Arthur E., and Kennard, Robert W. "Ridge Regression: Applications to Nonorthogonal Problems," Technometrics, February, 1970, pp. 69-82.

10. ________. "Ridge Regression: Biased Estimation for Nonorthogonal Problems," Technometrics, February, 1970, pp. 55-67.

11. Hooper, Peter III. "An Analysis of Aggregation Error in U.S. Merchandise Trade Equations," Ph.D. Dissertation in Economics, University of Michigan, 1974.

12. Koyck, L.M. Distributed Lags and Investment Analysis (Amsterdam: North Holland), 1954.

13. Kwack, Sung Y. "The Determination of U.S. Imports and Exports: A Disaggregated Quarterly Model, 1960.III - 1967.IV," Southern Economic Journal, January 1972, pp. 302-15.

14. Maddala, G.S. "Ridge Estimators for Distributed Lag Models," NBER Working Paper No. 69 (processed), October, 1974.

15. Miller, Joseph C. and Fratianni, Michele. "The Lagged Adjustment of U.S. Trade to Prices and Income," Journal of Economics and Business, Spring, 1974, pp. 191-198.

16. Nerlove, M. Distributed Lags and Demand Analysis, USDA, Agriculture Handbook No. 141, Washington, 1958.

17. Prachowny, Martin F. A Structural Model of the U.S. Balance of Payments (Amsterdam: North Holland), 1969.

18. Rao, Sista V. "An Empirical Study of U.S. Foreign Trade Sector According to Standard International Trade Classification During 1950.I - 1970.IV," Ph.D. Dissertation in Economics, University of Pennsylvania, 1971.

19. Rappoport, Paul N. "Least Squares and its Alternatives in the Estimation of Dynamic Economic Models" (processed), 1974.

20. Schmidt, Peter. "A Modification of the Almon Distributed Lag," Journal of the American Statistical Association, September, 1974, pp. 679-681.

21. Shiller, Robert J. "Alternative Prior Representations of 'Smoothness' for Distributed Lag Estimation," NBER Working Paper No. 89 (processed), June, 1975.

22. ________. "A Distributed Lag Estimator Derived from Smoothness Priors," Econometrica, July, 1973, pp. 775-787.

23. ________. "Estimation of the Investment and Price Equations of a Macroeconometric Model," Staff Economic Studies No. 61, Board of Governors of the Federal Reserve System,

24. Taylor, William E. "Smoothness Priors and Stochastic Prior Restrictions in Distributed Lag Models," International Economic Review, October, 1974, pp. 803-804.

25. Wilson, John F. "Yet Another Econometric Model of U.S. Imports, 1958-1971, Disaggregated by End-Use Commodity Groups and Region of Origin," Ph.D. Dissertation in Economics, University of Pennsylvania, 1974.
