ifdp · September 30, 2005

Alternative Procedures for Estimating Vector Autoregressions Identified with Long-Run Restrictions

Abstract

We show that the standard procedure for estimating long-run identified vector autoregressions uses a particular estimator of the zero-frequency spectral density matrix of the data. We develop alternatives to the standard procedure and evaluate the properties of these alternative procedures using Monte Carlo experiments in which data are generated from estimated real business cycle models. We focus on the properties of estimated impulse response functions. In our examples, the alternative procedures have better small sample properties than the standard procedure, with smaller bias, smaller mean square error and better coverage rates for estimated confidence intervals.

Board of Governors of the Federal Reserve System International Finance Discussion Papers Number 842 October 2005 Alternative Procedures for Estimating Vector Autoregressions Identified with Long-Run Restrictions Lawrence J. Christiano, Martin Eichenbaum and Robert Vigfusson NOTE: International Finance Discussion Papers are preliminary materials circulated to stimulate discussion and critical comment. References in publications to International Finance Discussion Papers (other than an acknowledgment that the writer has had access to unpublished material) should be cleared with the author or authors. Recent IFDPs are available on the Web at www.federalreserve.gov/pubs/ifdp/

Alternative Procedures for Estimating Vector Autoregressions Identified with Long-Run Restrictions ∗ Lawrence J. Christiano, Martin Eichenbaum, and Robert Vigfusson † Abstract We show that the standard procedure for estimating long-run identified vector autoregressions uses a particular estimator of the zero-frequency spectral density matrix of the data. We develop alternatives to the standard procedure and evaluate the properties of these alternative procedures usingMonteCarloexperimentsinwhichdataaregeneratedfromestimated realbusinesscyclemodels. Wefocusonthepropertiesofestimatedimpulse response functions. In our examples, the alternative procedures have bettersmall sample properties than thestandard procedure, with smaller bias, smallermeansquareerrorandbettercoverageratesforestimatedconfidence intervals. Keywords: technology shocks, hours worked, frequency domain, spectral density matrix JEL Codes: E24, E32, O3 1 Introduction There is a large literature in which researchers impose long-run identifying restrictions on vector autoregressions (VARs) to identify the dynamic effects of shocks to the economy. These restrictions are motivated by the implications that many economic models have for the long-run effects of shocks. We show that the standard procedurefor estimatinglong-run-identified vector autoregressionsuses The first two authors are grateful for the financial support of grants from the NSF to the ∗ NationalBureauofEconomicResearch. Theviewsinthispaperaresolely theresponsibilityof the authors and should not be interpreted as reflecting the views of the Board of Governors of the Federal Reserve System or of any person associated with the Federal Reserve System. Christiano: Northwestern University, email l-christiano@northwestern.edu, Eichenbaum: † Northwestern University, email: eich@northwestern.edu, Vigfusson: Board of Governors of the Federal Reserve System, email: robert.j.vigfusson@frb.gov 1

a particular estimator of the zero-frequency spectral density matrix of the data. This estimator is derived from the estimated coefficients of a finite order VAR. When the actual VAR of the data contains more lags of the variables than the econometrician uses, this estimator of the zero-frequency spectral density matrix can have poor properties. These poor properties stem from the difficulties involved in estimating the sum of the coefficients in a VAR. We develop alternative procedures that combine different zero-frequency spectral density matrix estimators with VARs. The specific estimators that we consider are the Bartlett estimator and the estimator proposed by Andrews and Monahan (1992). We evaluate the properties of our alternative procedures using Monte Carlo experiments in which data are generated from estimated Real Business Cycle (RBC) models. We focus on the properties of estimated impulse response functions. In our examples, the alternative procedures have better small sample properties thanthestandardprocedure: theyareassociatedwithsmallerbias,smallermean square error and better coverage rates for confidence intervals. The small sample properties of spectral density estimators can depend on the propertiesoftheunderlyingdatageneratingmechanism. Thereforesomecaution must be taken in extrapolating from our results. At a minimum our results suggest that applied econometricians should consider alternative procedures for estimating long-run identified vector autoregressions. 2 Long-Run Identifying Restrictions and VARs Thereisanimportantliterature in which researchersimposelong-runidentifying restrictionsonVAR.Forexample,Gali(1999)imposestheidentifyingassumption thattheonlyshockthataffectsthelogoflaborproductivity,a ,inthelongrunis t atechnologyshockεz. Gali’sidentifyingassumptioncorrespondstotheexclusion t restriction: lim [E a E a ] =f(εz only). (1) t t+j t 1 t+j t j − − →∞ In addition Gali imposes the sign restriction that f is an increasing function of εz. t Gali imposes the exclusion and sign restrictions on a VAR to compute εz and t identify its dynamic effects on macroeconomic variables. Denote the N variables 2

in a VAR by Y : t Y = B(L)Y +u , Eu u =V, (2) t+1 t t+1 t+1 0t+1 B(L) B +B L+...+B Lq 1, 1 2 q − ≡ ∆loga t Y = logl . t t   x t   Herel denotespercapitahoursworkedandx isanadditionalvectorofvariables t t that may be included in the VAR. In all of our applications we assume that q = 4. Suppose that the fundamental economic shocks are related to u via the t relationship: u =Cε , Eε ε =I, CC =V, (3) t t t 0t 0 where the first element in ε is εz. It is easy to verify that: t t lim E[a Ω ] E [a Ω ] =τ [I B(1)] 1Cε . (4) t+j t t 1 t+j t 1 − t j | − − | − − →∞ Here τ is a row vector with all zeros, except unity in the first location and B(1) is the sum, B + ... + B . Also, E is the expectation operator and is Ω 1 q = Y ,...,Y is the information set. t t q+1 { To comp − ute t } he dynamic effects of εz on the elements of Y we need to know t t B ,...,B and C , the first column of C. The matrix, V, and the B ’s can be 1 q 1 i estimatedbyanordinaryleastsquaresregression. TherequirementthatCC =V 0 is insufficient to determine a unique value of C . Adding the exclusion and sign 1 restrictions does uniquely determines C . The exclusion restriction implies that: 1 number 0 [I B(1)] 1C = 1 (N 1) . − − " numbers nu × mb − ers # The 0 matrix reflects the assumption that only a technology shock can have a long run effect on a . The sign restriction implies that the (1,1) element of t [I B(1)] 1C is positive. − − The standard algorithm for computing C requires the matrix 1 D [I B(1)] 1C. − ≡ − Although we cannot directly estimate D, we can estimate DD : 0 DD 0 =[I B(1)] − 1V I B(1) 0 − 1 . (5) − − £ ¤ 3

Therighthandsideof(5)canbecomputeddirectlyfromtheVARcoefficientsand the variance-covariance matrix of the u . There are many D matrices consistent t with (5) as well as the sign and exclusion restrictions. However, these matrices all have the same first column, D (see Christiano, Eichenbaum, and Vigfusson 1 CEV (2005)). This unique D implies a unique value for C : 1 1 C =[I B(1)]D . (6) 1 1 − Therighthandsideof(5)istheestimateofthezero-frequencyspectraldensity matrix of Y , S (0), that is implicit to the VAR of order q. When the true value t Y of q is larger than the one used by the econometrician, there is reason to be concernedaboutthestatisticalpropertiesofthisestimator. Amismatchbetween the true value of q and the value that the econometrician assumes can arise when the data generating process is a dynamic stochastic general equilibrium model. For these types of models, Y may have an infinite-ordered VAR, i.e. q = . t ∞ To understand the nature of the problem when q is misspecified, it is useful to consider a simple analytic expression closely related to results in Sims (1972). Equation (7) approximates what an econometrician who fits a misspecified VAR will find. The expression is an approximation because it assumes a large sample of data. Let Bˆ ,...,Bˆ and Vˆ denote the estimated parameters of a VAR with q 1 q lags. Then, 1 π Vˆ =V + min B e iω Bˆ e iω S (ω) B eiω Bˆ eiω 0dω, − − Y Bˆ 1,...,Bˆ q 2π Z− π h − i h − i ¡ ¢ ¡ ¢ ¡ ¢ ¡ ¢ (7) where B e iω is B(L) with L replaced by e iω.1 Here, B and V are the para- − − metersoftheactualVARrepresentationofthedata, andS (ω)istheassociated Y ¡ ¢ spectral density at frequency ω. According to (7), estimating an unconstrained VAR approximately involves choosing parameters to minimize a quadratic form in the difference between the estimated and true lag matrices. The quadratic form assigns greatest weight to the frequencies where the spectral density is the greatest. If the econometrician’s VAR is correctly specified, then Bˆ = B and Vˆ = V. If the econometrician estimatesaVARwithtoofewlagsrelativetothetrueVAR,thenthisspecification error implies that Bˆ =B and Vˆ >V.2 Equation (7) indicates that Bˆ(1) will be 6 a good approximation for B(1) only if S (ω) happens to be relatively large in Y a neighborhood of ω =0. This is not something we can rely on. Our alternative procedure for estimating long-run identified VARs involves 1The minimization is over the trace of the indicated integral. 2By Vˆ >V we mean that Vˆ V is a positive definite matrix. − 4

replacingthezero-frequencyspectraldensityestimatorin(5)byotherestimators specificallydesignedforthetask. Thereisaverylargeliteraturethatanalyzesthe properties of alternative estimators of zero-frequency spectral density matrices. One alternative is to use the Bartlett estimator: T 1 k S Y (0)= − g(k)Cˆ(k), g(k)= 1 − | r | | k | ≤ r , (8) 0 k >r k= (T 1) ½ | | −X− where, after removing the sample mean from Y , t T 1 Cˆ(k)= Y Y . T t t0 k − t=k+1 X Another alternative is to use the Andrews-Monahan (1992) estimator: S Y (0) = [I (B 1 +...+B p )] − 1F u (0) I (B 1 +...+B p ) 0 − 1 (9) − − r T k 1£ ¤ F (0) = 1 G(k), G(k)= u u u | − r| T t 0t − k k= r t=k X− X Inouralternativeprocedureforestimatingthedynamiceffectofatechnology shock, we replace the estimate of S (0), used to construct DD and D in the Y 0 1 standard procedure, with the estimator of S (0) given by either (8) or (9). Y We refer to these procedures as the Bartlett and Andrews-Monahan procedures, respectively. We then construct C using (6).3 In the alternative procedures C 1 1 still depends on the sum of the VAR coefficients even though D does not. To 1 implement (8) or (9) we use essentially all possible covariances in the data by choosing a large value of r, r = 150.4 In addition, we examine the properties of our alternative procedures when we use smaller values of r. 3 The Data Generating Process CEV (2005) show that a standard RBC model with a permanent technology shock and a labor supply shock implies the following reduced form model for 3The C matrix constructed with this C will not have the property that CC =V. 1 0 4Theruleofalwayssettingthebandwidth,r,equaltosamplesizedoesnotyieldaconsistent estimatorofthespectraldensityatfrequencyzero. Weassumethatasthesamplesizeincreases the bandwidth is increased sufficiently slowly to obtain a consistent estimator. 5

Y =[∆loga ,logl ] : t t t 0 Y =φ Y +φ Y +ε +θ ε +θ ε , (10) t 1 t 1 2 t 2 t 1 t 1 2 t 2 − − − − where φ and φ are scalars, θ is two by two matrix and ε is a normally dis- 1 2 i t tributed vector with variance-covariance matrix Ψ. The sign and exclusion restrictions are satisfied by the RBC model that CEV(2005) consider. Equation (10) implies that Y has an infinite-order VAR. Therefore, any finite-order VAR t is misspecified. Inthebenchmark version ofthemodel, weusevaluesfortheexogenousshock processes obtained by CEV (2005) who maximize the Gaussian log likelihood function for Y using postwar U.S. data over the sample 1959QIII to 2001QIV. t We summarize their results as follows: 1 0.039137 0.0095548 φ = 1.9439, φ = 0.9445, Ψ= − 1 2 − 1000 0.0095548 0.073729 µ − ¶ 1.926 0.027834 0.92682 0.027548 θ 1 = 0 − .0066342 0.97012 , θ 2 = 0 − 0 . µ − − ¶ µ ¶ In the CKM version of the model, we assume values for the parameters equal to those used by Chari, Kehoe, and McGrattan’s (CKM) (2005) in their benchmark model. CKM obtain these values using a Kalman filter framework to estimate the model. CKM impose the highly questionable identifying assumption that, up to a small measurement error, the change in government spending and net exports is a good measure of the change in technology. CEV (2005) show that there is overwhelming empirical evidence against the restrictions imposed by CKM. Despite the empirical implausibility of the CKM model, we use it as a data generating process because it provides us with an additional example to assess our alternative procedures. CKM’s estimated parameter values are given by: 1 0.11604 0.14605 φ = 1.9093, φ = 0.9114,Ψ= − 1 2 − 1000 0.14605 0.52697 µ − ¶ 1.8967 0.041277 0.89971 0.040268 θ 1 = 0 − .0064704 0.96777 , θ 2 = 0 − 0 . µ − ¶ µ ¶ 4 Results To analyze the properties of our different procedures we generate multiple data sets, each of length 180 observations. We generate 1000 data sets using the benchmark model and 1000 data sets using the CKM model as the data generat- 6

ing mechanism. For each data set, we estimate a four-lag VAR and we estimate S (0) using the standard VAR-based estimator, the Bartlett estimator, and the Y Andrews-Monahan estimator. We then calculate three different estimates of the dynamic response of hours worked to a technology shock, as described in Section 2. Figure 1 reports the average estimated response across all simulations (the solid line) and the true model response (the starred line). In addition, Figure 1 reports the true degree of sampling uncertainty, measured by the standard deviation of the estimated impulse response functions across the 1000 synthetic data sets (the dashed interval). To assess whether an econometrician would correctly estimate the true degree of sampling uncertainty, we proceed as follows. For each synthetic data set, and corresponding estimated impulse response function, we calculate the bootstrap standard deviation of each point in the impulse response function. Specifically, we estimate a VAR on each of the 1000 data sets that we simulate from the economic model. We use the VAR coefficients and fitted disturbances in a bootstrap procedure to generate 200 synthetic data sets, each of length 180 observations. For each of these 200 synthetic data sets, we estimate a new VAR and impulse response function. We then calculate the standarddeviationofthecoefficientsintheimpulseresponsefunctionsacrossthe200 data sets. We use this standard deviation to construct a two standard deviation confidence interval around the estimated responses. We also calculate coverage rates for the first coefficient in the impulse response function. Specifically, we report how often, across the 1000 data sets simulated from the economic model, the econometrician’s confidence intervals contain the first coefficient of the true impulse response function. In addition, Figure 1 reports the average confidence interval (the circles) that an econometrician would construct. To help quantify the statistical properties of the different procedures Table 1 reportssummarystatisticsforX,theestimatedcontemporaneousresponseofl to t a technology shock.: median, mean and standard deviation of X across the 1000 synthetic data sets for each model. A different metric for assessing the different procedures is to focus on the mean square error (MSE) of X. The MSE combines information about bias and sampling uncertainty. We calculate the square root 1/2 of the MSE, (1000) 1 1000(X ξ)2 , for our differentprocedures. HereX − i=1 i − i denotes the vhalue of X obtained in thie ith synthetic data set and ξ is the true P response. Table 1 reports the value of this statistic for the different procedures relative to the MSE associated with the standard procedure. Finally we report coverage rates for X. Consider first the results obtained using the CKM model as the data generating process. Figure 1 indicates that there is substantial upward bias associated 7

with the standard procedure, although the bias is small relative to sampling uncertainty. Figure1indicatesthatthesizeofthebiasassociatedwiththeAndrews - Monahan procedure is substantially smaller than the bias associated with the standard procedure. The bias associated with the Bartlett procedure is smaller still. Table 1 indicates that the bias for X falls from 0.63 using the standard procedure to 0.31 for the Andrews - Monahan procedure and to 0.08 for the Bartlett procedure. The square root of the MSE for X also falls with the modified procedures: the ratio of the MSE for the Andrews - Monahan and Bartlett procedures relative to the standard procedure is 0.91 and 0.77, respectively. Table 1 indicates that the coverage rate for X of our alternative procedures improves relative to the standard procedure. For the standard procedure, the coverage rate for X is only 74 percent which is substantially below the nominal sizeof95percent. FortheAndrews-Monahanprocedure, thecoveragerateis91 percent and the coverage rate is 94 percent for the Bartlett procedure. Finally, Table 1 indicates that for both alternative procedures, setting r to 150 leads to better properties than setting r to 25 or 50. Taken together, our results indicate that for data generated by the CKM model, our alternative procedures perform better than the standard procedure with smaller bias, smaller mean square error, and coverage rates that are closer to the nominal size. In this particular application, the Bartlett procedure outperforms the Andrews-Monahan procedure. We now briefly discuss results when the data generating mechanism is our benchmark model. Figure 1 shows that the bias associated with the standard procedure is much smaller than when the data are generated using the CKM model. Even so, both modified procedures outperform the standard procedure. Again the bias is smaller, the MSE is reduced and coverage rates are closer to nominal sizes. Judging between the Andrews-Monahan and Bartlett procedures is more difficult. Bias is larger for the Bartlett procedure but the MSE is smaller and the coverage rate is closer to the nominal size. 5 Conclusion We develop alternative procedures for estimating long-run identified vector autoregressions. We assess their properties relative to the standard procedure used in the literature. We evaluate the properties of the different procedures using Monte Carlo experiments where we generate data from estimated RBC models. Focusing on estimated response functions, we find that our alternative procedures have better small sample properties than the standard procedure: they are associatedwithsmallerbias, smaller meansquareerrorandbettercoveragerates for confidence intervals. 8

References [1] Andrews, D.W.K., and J.C. Monahan, 1992, An improved heteroskedasticity andautocorrelationconsistentcovariancematrixestimator,Econometrica 60, pp.953-966. [2] Chari,V.V.,PatrickKehoe,andEllenMcGrattan,2005,‘ACritiqueofStructuralVARsUsingRealBusinessCycleTheory,’FederalReserveBankofMinneapolis Staff Report 364 July. [3] Christiano, Lawerence J., Martin Eichenbaum and Robert Vigfusson, 2005, ‘Assessing Structural VARs’, manuscript, Northwestern University. [4] Gali, Jordi, 1999, ‘Technology, Employment, and the Business Cycle: Do Technology Shocks Explain Aggregate Fluctuations?’ American Economic Review, 89(1), 249-271. [5] Sims, Christopher, 1972, ‘The Role of Approximate Prior Restrictions in DistributedLagEstimation,’ Journal of the American Statistical Association, 67(337), 169-175. 9

Table 1: The Contemporaneous Response of Hours Worked to a Positive Technology Shock Standard Procedure r Median Mean MSE* Coverage Deviation CKM Model (Hours Response 0.32) Standard 1.03 0.95 0.68 0.74 Andrews-Monahan 25 1.16 1.04 0.81 1.17 0.76 50 0.98 0.85 0.92 1.14 0.87 150 0.74 0.63 0.79 0.91 0.91 Bartlett 25 0.53 0.48 0.52 0.58 0.90 50 0.60 0.51 0.78 0.86 0.92 150 0.44 0.40 0.71 0.77 0.94 Our Benchmark Model (Hours Response 0.28) Standard 0.43 0.33 0.42 0.84 Andrews-Monahan 25 0.44 0.34 0.48 1.13 0.86 50 0.39 0.29 0.53 1.25 0.88 150 0.28 0.21 0.44 1.06 0.87 Bartlett 25 0.16 0.14 0.22 0.63 0.89 50 0.19 0.15 0.35 0.89 0.92 150 0.15 0.12 0.33 0.87 0.91 Notes: Results from Monte Carlo experiments with data simulated from two parameterizations of a RBC model. *MSE represents the ratio of the square root of the mean squared error relative to the standard procedure 10

Figure 1: The Response of Hours Worked to a Positive Technology Shock CKM Benchmark Benchmark Standard Method 2 1 0.5 1 0 0 −0.5 −1 0 5 10 0 5 10 Bartlett Window 2 1 0.5 1 0 0 −0.5 −1 0 5 10 0 5 10 Andrews Monahan 2 1 0.5 1 0 0 −0.5 −1 0 5 10 0 5 10 11

Cite this document
APA
Lawrence J. Christiano, Martin Eichenbaum, & and Robert J. Vigfusson (2005). Alternative Procedures for Estimating Vector Autoregressions Identified with Long-Run Restrictions (IFDP 2005-842). Board of Governors of the Federal Reserve System, International Finance Discussion Papers. https://whenthefedspeaks.com/doc/ifdp_2005-842
BibTeX
@techreport{wtfs_ifdp_2005_842,
  author = {Lawrence J. Christiano and Martin Eichenbaum and and Robert J. Vigfusson},
  title = {Alternative Procedures for Estimating Vector Autoregressions Identified with Long-Run Restrictions},
  type = {International Finance Discussion Papers},
  number = {2005-842},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2005},
  url = {https://whenthefedspeaks.com/doc/ifdp_2005-842},
  abstract = {We show that the standard procedure for estimating long-run identified vector autoregressions uses a particular estimator of the zero-frequency spectral density matrix of the data. We develop alternatives to the standard procedure and evaluate the properties of these alternative procedures using Monte Carlo experiments in which data are generated from estimated real business cycle models. We focus on the properties of estimated impulse response functions. In our examples, the alternative procedures have better small sample properties than the standard procedure, with smaller bias, smaller mean square error and better coverage rates for estimated confidence intervals.},
}