ifdp · July 31, 2006

Assessing Structural VARs

Abstract

This paper analyzes the quality of VAR-based procedures for estimating the response of the economy to a shock. We focus on two key issues. First, do VAR-based confidence intervals accurately reflect the actual degree of sampling uncertainty associated with impulse response functions? Second, what is the size of bias relative to confidence intervals, and how do coverage rates of confidence intervals compare with their nominal size? We address these questions using data generated from a series of estimated dynamic, stochastic general equilibrium models. We organize most of our analysis around a particular question that has attracted a great deal of attention in the literature: How do hours worked respond to an identified shock? In all of our examples, as long as the variance in hours worked due to a given shock is above the remarkably low number of 1 percent, structural VARs perform well. This finding is true regardless of whether identification is based on short-run or long-run restrictions. Confidence intervals are wider in the case of long-run restrictions. Even so, long-run identified VARs can be useful for discriminating among competing economic models.

Board of Governors of the Federal Reserve System

International Finance Discussion Papers

Number 866

August 2006

Assessing Structural VARs

Lawrence J. Christiano, Martin Eichenbaum, Robert J. Vigfusson

NOTE: International Finance Discussion Papers are preliminary materials circulated to stimulate discussion and critical comment. References in publications to International Finance Discussion Papers (other than an acknowledgment that the writer has had access to unpublished material) should be cleared with the author or authors. Recent IFDPs are available on the Web at www.federalreserve.gov/pubs/ifdp/. This paper can be downloaded without charge from Social Science Research Network electronic library at http://www.ssrn.com/.

Assessing Structural VARs∗

Lawrence J. Christiano†, Martin Eichenbaum‡, and Robert Vigfusson§

August 2006

JEL Codes: C1

Keywords: Vector autoregression, dynamic stochastic general equilibrium model, confidence intervals, impulse response functions, identification, long run restrictions, specification error, sampling

∗ The first two authors are grateful to the National Science Foundation for financial support. We thank Lars Hansen and our colleagues at the Federal Reserve Bank of Chicago and the Board of Governors for useful comments at various stages of this project. The views in this paper are solely those of the authors and should not be interpreted as reflecting the views of the Board of Governors of the Federal Reserve System or of any other person associated with the Federal Reserve System.

† Northwestern University, the Federal Reserve Bank of Chicago, and the NBER.

‡ Northwestern University, the Federal Reserve Bank of Chicago, and the NBER.

§ Federal Reserve Board of Governors.

1. Introduction

Sims’s seminal paper Macroeconomics and Reality (1980) argued that procedures based on vector autoregression (VAR) would be useful to macroeconomists interested in constructing and evaluating economic models. Given a minimal set of identifying assumptions, structural VARs allow one to estimate the dynamic effects of economic shocks. The estimated impulse response functions provide a natural way to choose the parameters of a structural model and to assess the empirical plausibility of alternative models.¹

To be useful in practice, VAR-based procedures must have good sampling properties. In particular, they should accurately characterize the amount of information in the data about the effects of a shock to the economy. Also, they should accurately uncover the information that is there.

These considerations lead us to investigate two key issues. First, do VAR-based confidence intervals accurately reflect the actual degree of sampling uncertainty associated with impulse response functions? Second, what is the size of bias relative to confidence intervals, and how do coverage rates of confidence intervals compare with their nominal size? We address these questions using data generated from a series of estimated dynamic, stochastic general equilibrium (DSGE) models. We consider real business cycle (RBC) models and the model in Altig, Christiano, Eichenbaum, and Linde (2005) (hereafter, ACEL) that embodies real and nominal frictions.

We organize most of our analysis around a particular question that has attracted a great deal of attention in the literature: How do hours worked respond to an identified shock? In the case of the RBC model, we consider a neutral shock to technology. In the ACEL model, we consider two types of technology shocks as well as a monetary policy shock.

We focus our analysis on an unavoidable specification error that occurs when the data generating process is a DSGE model and the econometrician uses a VAR.
In this case the true VAR is of infinite order, but the econometrician must use a VAR with a finite number of lags.

We find that as long as the variance in hours worked due to a given shock is above the remarkably low number of 1 percent, VAR-based methods for recovering the response of hours to that shock have good sampling properties. Technology shocks account for a much larger fraction of the variance of hours worked in the ACEL model than in any of our estimated RBC models. Not surprisingly, inference about the effects of a technology shock on hours worked is much sharper when the ACEL model is the data generating mechanism.

Taken as a whole, our results support the view that structural VARs are a useful guide to constructing and evaluating DSGE models. Of course, as with any econometric procedure it is possible to find examples in which VAR-based procedures do not do well. Indeed, we present such an example based on an RBC model in which technology shocks account for less than 1 percent of the variance in hours worked. In this example, VAR-based methods work poorly in the sense that bias exceeds sampling uncertainty. Although instructive, the example is based on a model that fits the data poorly and so is unlikely to be of practical importance.

Having good sampling properties does not mean that structural VARs always deliver small confidence intervals. Of course, it would be a Pyrrhic victory for structural VARs if the best one could say about them is that sampling uncertainty is always large and the econometrician will always know it. Fortunately, this is not the case. We describe examples in which structural VARs are useful for discriminating between competing economic models.

Researchers use two types of identifying restrictions in structural VARs. Blanchard and Quah (1989), Gali (1999), and others exploit the implications that many models have for the long-run effects of shocks.² Other authors exploit short-run restrictions.³ It is useful to distinguish between these two types of identifying restrictions to summarize our results.

We find that structural VARs perform remarkably well when identification is based on short-run restrictions. For all the specifications that we consider, the sampling properties of impulse response estimators are good and sampling uncertainty is small. This good performance obtains even when technology shocks account for as little as 0.5 percent of the variance in hours. Our results are comforting for the vast literature that has exploited short-run identification schemes to identify the dynamic effects of shocks to the economy. Of course, one can question the particular short-run identifying assumptions used in any given analysis. However, our results strongly support the view that if the relevant short-run assumptions are satisfied in the data generating process, then standard structural VAR procedures reliably uncover and identify the dynamic effects of shocks to the economy.

The main distinction between our short-run and long-run results is that the sampling uncertainty associated with estimated impulse response functions is substantially larger in the long-run case. In addition, we find some evidence of bias when the fraction of the variance in hours worked that is accounted for by technology shocks is very small.

¹ See, for example, Sims (1989), Eichenbaum and Evans (1995), Rotemberg and Woodford (1997), Gali (1999), Francis and Ramey (2004), Christiano, Eichenbaum, and Evans (2005), and Del Negro, Schorfheide, Smets, and Wouters (2005).
However, this bias is not large relative to sampling uncertainty as long as technology shocks account for at least 1 percent of the variance of hours worked. Still, the reason for this bias is interesting. We document that, when substantial bias exists, it stems from the fact that with long-run restrictions one requires an estimate of the sum of the VAR coefficients. The specification error involved in using a finite-lag VAR is the reason that in some of our examples, the sum of VAR coefficients is difficult to estimate accurately. This difficulty also explains why sampling uncertainty with long-run restrictions tends to be large.

The preceding observations led us to develop an alternative to the standard VAR-based estimator of impulse response functions. The only place the sum of the VAR coefficients appears in the standard strategy is in the computation of the zero-frequency spectral density of the data. Our alternative estimator avoids using the sum of the VAR coefficients by working with a nonparametric estimator of this spectral density. We find that in cases when the standard VAR procedure entails some bias, our adjustment virtually eliminates the bias.

Our results are related to a literature that questions the ability of long-run identified VARs to reliably estimate the dynamic response of macroeconomic variables to structural shocks.

² See, for example, Basu, Fernald, and Kimball (2004), Christiano, Eichenbaum, and Vigfusson (2003, 2004), Fisher (2006), Francis and Ramey (2004), King, Plosser, Stock and Watson (1991), Shapiro and Watson (1988) and Vigfusson (2004). Francis, Owyang, and Roush (2005) pursue a related strategy to identify a technology shock as the shock that maximizes the forecast error variance share of labor productivity at a long but finite horizon.

³ This list is particularly long and includes at least Bernanke (1986), Bernanke and Blinder (1992), Bernanke and Mihov (1998), Blanchard and Perotti (2002), Blanchard and Watson (1986), Christiano and Eichenbaum (1992), Christiano, Eichenbaum and Evans (2005), Cushman and Zha (1997), Eichenbaum and Evans (1995), Hamilton (1997), Rotemberg and Woodford (1992), Sims (1986), and Sims and Zha (2006).

Perhaps the first critique of this sort was provided by Sims (1972). Although his paper was written before the advent of VARs, it articulates why estimates of the sum of regression coefficients may be distorted when there is specification error. Faust and Leeper (1997) and Pagan and Robertson (1998) make an important related critique of identification strategies based on long-run restrictions. More recently Erceg, Guerrieri, and Gust (2005) and Chari, Kehoe, and McGrattan (2005b) (henceforth CKM) also examine the reliability of VAR-based inference using long-run identifying restrictions.⁴

Our conclusions regarding the value of identified VARs differ sharply from those recently reached by CKM. One parameterization of the RBC model that we consider is identical to the one considered by CKM. This parameterization is included for pedagogical purposes only, as it is overwhelmingly rejected by the data.

The remainder of the paper is organized as follows. Section 2 presents the versions of the RBC models that we use in our analysis. Section 3 discusses our results for standard VAR-based estimators of impulse response functions. Section 4 analyzes the differences between short-run and long-run restrictions. Section 5 discusses the relation between our work and the recent critique of VARs offered by CKM. Section 6 summarizes the ACEL model and reports its implications for VARs. Section 7 contains concluding comments.

2. A Simple RBC Model

In this section, we display the RBC model that serves as one of the data generating processes in our analysis. In this model the only shock that affects labor productivity in the long run is a shock to technology. This property lies at the core of the identification strategy used by King et al. (1991), Gali (1999) and other researchers to identify the effects of a shock to technology. We also consider a variant of the model which rationalizes short-run restrictions as a strategy for identifying a technology shock.
In this variant, agents choose hours worked before the technology shock is realized. We describe the conventional VAR-based strategies for estimating the dynamic effect on hours worked of a shock to technology. Finally, we discuss parameterizations of the RBC model that we use in our experiments.

2.1. The Model

The representative agent maximizes expected utility over per capita consumption, $c_t$, and per capita hours worked, $l_t$:

$$E_0 \sum_{t=0}^{\infty} \left(\beta(1+\gamma)\right)^t \left[ \log c_t + \psi\, \frac{(1-l_t)^{1-\sigma} - 1}{1-\sigma} \right],$$

subject to the budget constraint:

$$c_t + (1+\tau_{x,t})\, i_t \le (1-\tau_{l,t})\, w_t l_t + r_t k_t + T_t,$$

where

$$i_t = (1+\gamma)\, k_{t+1} - (1-\delta)\, k_t.$$

⁴ See also Fernandez-Villaverde, Rubio-Ramirez, and Sargent (2005), who investigate the circumstances in which the economic shocks are recoverable from the VAR disturbances. They provide a simple matrix algebra check to assess recoverability. They identify models in which the conditions are satisfied and other models in which they are not.

Here, $k_t$ denotes the per capita capital stock at the beginning of period $t$, $w_t$ is the wage rate, $r_t$ is the rental rate on capital, $\tau_{x,t}$ is an investment tax, $\tau_{l,t}$ is the tax rate on labor income, $\delta \in (0,1)$ is the depreciation rate on capital, $\gamma$ is the growth rate of the population, $T_t$ represents lump-sum taxes and $\sigma > 0$ is a curvature parameter.

The representative competitive firm’s production function is:

$$y_t = k_t^{\alpha} (Z_t l_t)^{1-\alpha},$$

where $Z_t$ is the time $t$ state of technology and $\alpha \in (0,1)$. The stochastic processes for the shocks are:

$$\begin{aligned}
\log z_t &= \mu_z + \sigma_z \varepsilon^z_t \\
\tau_{l,t+1} &= (1-\rho_l)\,\tau_l + \rho_l\, \tau_{l,t} + \sigma_l\, \varepsilon^l_{t+1} \\
\tau_{x,t+1} &= (1-\rho_x)\,\tau_x + \rho_x\, \tau_{x,t} + \sigma_x\, \varepsilon^x_{t+1},
\end{aligned} \tag{2.1}$$

where $z_t = Z_t/Z_{t-1}$. In addition, $\varepsilon^z_t$, $\varepsilon^l_t$, and $\varepsilon^x_t$ are independently and identically distributed (i.i.d.) random variables with mean zero and unit standard deviation. The parameters, $\sigma_z$, $\sigma_l$, and $\sigma_x$ are non-negative scalars. The constant, $\mu_z$, is the mean growth rate of technology, $\tau_l$ is the mean labor tax rate, and $\tau_x$ is the mean tax on capital. We restrict the autoregressive coefficients, $\rho_l$ and $\rho_x$, to be less than unity in absolute value.

Finally, the resource constraint is:

$$c_t + (1+\gamma)\, k_{t+1} - (1-\delta)\, k_t \le y_t.$$

We consider two versions of the model, differentiated according to timing assumptions. In the standard or nonrecursive version, all time $t$ decisions are taken after the realization of the time $t$ shocks. This is the conventional assumption in the RBC literature. In the recursive version of the model the timing assumptions are as follows. First, $\tau_{l,t}$ is observed, and then labor decisions are made. Second, the other shocks are realized and agents make their investment and consumption decisions.

2.2. Relation of the RBC Model to VARs

We now discuss the relation between the RBC model and a VAR. Specifically, we establish conditions under which the reduced form of the RBC model is a VAR with disturbances that are linear combinations of the economic shocks.
Our exposition is a simplified version of the discussion in Fernandez-Villaverde, Rubio-Ramirez, and Sargent (2005) (see especially their section III). We include this discussion because it frames many of the issues that we address. Our discussion applies to both the standard and the recursive versions of the model.

We begin by showing how to put the reduced form of the RBC model into a state-space, observer form. Throughout, we analyze the log-linear approximations to model solutions. Suppose the variables of interest in the RBC model are denoted by $X_t$. Let $s_t$ denote the vector of exogenous economic shocks and let $\hat{k}_t$ denote the percent deviation from steady state of the capital stock, after scaling by $Z_t$.⁵ The approximate solution for $X_t$ is given by:

$$X_t = a_0 + a_1 \hat{k}_t + a_2 \hat{k}_{t-1} + b_0 s_t + b_1 s_{t-1}, \tag{2.2}$$

where

$$\hat{k}_{t+1} = A \hat{k}_t + B s_t. \tag{2.3}$$

Also, $s_t$ has the law of motion:

$$s_t = P s_{t-1} + Q \varepsilon_t, \tag{2.4}$$

where $\varepsilon_t$ is a vector of i.i.d. fundamental economic disturbances. The parameters of (2.2) and (2.3) are functions of the structural parameters of the model.

⁵ Let $\tilde{k}_t = k_t / Z_{t-1}$. Then, $\hat{k}_t = (\tilde{k}_t - \tilde{k})/\tilde{k}$, where $\tilde{k}$ denotes the value of $\tilde{k}_t$ in nonstochastic steady state.

The ‘state’ of the system is composed of the variables on the right side of (2.2):

$$\xi_t = \begin{pmatrix} \hat{k}_t \\ \hat{k}_{t-1} \\ s_t \\ s_{t-1} \end{pmatrix}.$$

The law of motion of the state is:

$$\xi_t = F \xi_{t-1} + D \varepsilon_t, \tag{2.5}$$

where $F$ and $D$ are constructed from $A$, $B$, $Q$, $P$. The econometrician observes the vector of variables, $Y_t$. We assume $Y_t$ is equal to $X_t$ plus i.i.d. measurement error, $v_t$, which has diagonal variance-covariance, $R$. Then:

$$Y_t = H \xi_t + v_t. \tag{2.6}$$

Here, $H$ is defined so that $X_t = H\xi_t$, that is, relation (2.2) is satisfied. In (2.6) we abstract from the constant term. Hamilton (1994, section 13.4) shows how the system formed by (2.5) and (2.6) can be used to construct the exact Gaussian density function for a series of observations, $Y_1, \dots, Y_T$. We use this approach when we estimate versions of the RBC model.

We now use (2.5) and (2.6) to establish conditions under which the reduced form representation for $X_t$ implied by the RBC model is a VAR with disturbances that are linear combinations of the economic shocks. In this discussion, we set $v_t = 0$, so that $X_t = Y_t$. In addition, we assume that the number of elements in $\varepsilon_t$ coincides with the number of elements in $Y_t$.

We begin by substituting (2.5) into (2.6) to obtain:

$$Y_t = HF\xi_{t-1} + C\varepsilon_t, \quad C \equiv HD.$$

Our assumption on the dimensions of $Y_t$ and $\varepsilon_t$ implies that the matrix $C$ is square. In addition, we assume $C$ is invertible. Then:

$$\varepsilon_t = C^{-1}Y_t - C^{-1}HF\xi_{t-1}. \tag{2.7}$$

Substituting (2.7) into (2.5), we obtain:

$$\xi_t = M\xi_{t-1} + DC^{-1}Y_t,$$

where

$$M = \left[I - DC^{-1}H\right]F. \tag{2.8}$$

As long as the eigenvalues of $M$ are less than unity in absolute value,

$$\xi_t = DC^{-1}Y_t + MDC^{-1}Y_{t-1} + M^2DC^{-1}Y_{t-2} + \dots \tag{2.9}$$

Using (2.9) to substitute out for $\xi_{t-1}$ in (2.7), we obtain:

$$\varepsilon_t = C^{-1}Y_t - C^{-1}HF\left[DC^{-1}Y_{t-1} + MDC^{-1}Y_{t-2} + M^2DC^{-1}Y_{t-3} + \dots\right],$$

or, after rearranging:

$$Y_t = B_1 Y_{t-1} + B_2 Y_{t-2} + \dots + u_t, \tag{2.10}$$

where

$$u_t = C\varepsilon_t \tag{2.11}$$

$$B_j = HFM^{j-1}DC^{-1}, \quad j = 1, 2, \dots \tag{2.12}$$

Expression (2.10) is an infinite-order VAR, because $u_t$ is orthogonal to $Y_{t-j}$, $j \ge 1$.

Proposition 2.1. (Fernandez-Villaverde, Rubio-Ramirez, and Sargent) If $C$ is invertible and the eigenvalues of $M$ are less than unity in absolute value, then the RBC model implies:

• $Y_t$ has the infinite-order VAR representation in (2.10)
• The linear one-step-ahead forecast error in $Y_t$ given past $Y_t$’s is $u_t$, which is related to the economic disturbances by (2.11)
• The variance-covariance of $u_t$ is $CC'$
• The sum of the VAR lag matrices is given by:

$$B(1) \equiv \sum_{j=1}^{\infty} B_j = HF\left[I - M\right]^{-1}DC^{-1}.$$

We will use the last of these results below.

Relation (2.10) indicates why researchers interested in constructing DSGE models find it useful to analyze VARs. At the same time, this relationship clarifies some of the potential pitfalls in the use of VARs. First, in practice the econometrician must work with finite lags. Second, the assumption that $C$ is square and invertible may not be satisfied. Whether $C$ satisfies these conditions depends on how $Y_t$ is defined. Third, significant measurement errors may exist. Fourth, the matrix, $M$, may not have eigenvalues inside the unit circle. In this case, the economic shocks are not recoverable from the VAR disturbances.⁶ Implicitly, the econometrician who works with VARs assumes that these pitfalls are not quantitatively important.

2.3. VARs in Practice and the RBC Model

We are interested in the use of VARs as a way to estimate the response of $X_t$ to economic shocks, i.e., elements of $\varepsilon_t$. In practice, macroeconomists use a version of (2.10) with finite lags, say $q$. A researcher can estimate $B_1, \dots, B_q$ and $V = Eu_tu_t'$.
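As an illustration of the mapping in Proposition 2.1, the following is a minimal numerical sketch, not part of the original analysis. It takes state-space matrices $F$, $D$, $H$ as given, forms $C$, $M$ and the lag matrices $B_j$ from (2.8) and (2.12), and compares a truncated sum of the $B_j$ with the closed form for $B(1)$. The function name `var_from_state_space` and the numerical values of `F`, `D`, `H` are hypothetical and chosen only so that $C$ is invertible and the eigenvalues of $M$ lie inside the unit circle; they are not taken from the RBC model.

```python
import numpy as np

def var_from_state_space(F, D, H, n_lags=200):
    """Map xi_t = F xi_{t-1} + D eps_t, Y_t = H xi_t into VAR objects."""
    C = H @ D                                   # u_t = C eps_t, equation (2.11)
    C_inv = np.linalg.inv(C)                    # requires C square and invertible
    n = F.shape[0]
    M = (np.eye(n) - D @ C_inv @ H) @ F         # equation (2.8)
    max_abs_eig = np.max(np.abs(np.linalg.eigvals(M)))
    # B_j = H F M^(j-1) D C^(-1), equation (2.12), truncated at n_lags
    B, M_pow = [], np.eye(n)
    for _ in range(n_lags):
        B.append(H @ F @ M_pow @ D @ C_inv)
        M_pow = M_pow @ M
    # Closed form for the sum of the lag matrices in Proposition 2.1
    B_sum_closed = H @ F @ np.linalg.inv(np.eye(n) - M) @ D @ C_inv
    return C, M, max_abs_eig, B, B_sum_closed

# Hypothetical matrices: three states, two shocks, two observables
F = np.array([[0.9, 0.1, 0.0],
              [0.0, 0.5, 0.2],
              [0.0, 0.0, 0.3]])
D = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [0.5, 0.5]])
H = np.array([[1.0, 0.0, 1.0],
              [0.0, 1.0, 1.0]])

C, M, max_abs_eig, B, B_sum_closed = var_from_state_space(F, D, H)
print("largest |eigenvalue| of M:", max_abs_eig)
print("truncated sum matches closed form:", np.allclose(sum(B), B_sum_closed))
```

If the largest eigenvalue of `M` were at or above one in absolute value, the lag matrices would not die out and the truncated sum would not settle down, which is the fourth pitfall listed above.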
To obtain the impulse response functions, however, the researcher needs the $B_i$’s and the column of $C$ corresponding to the shock in $\varepsilon_t$ that is of interest. Computing the required column of $C$ requires additional identifying assumptions. In practice, two types of assumptions are used. Short-run assumptions take the form of direct restrictions on the matrix $C$. Long-run assumptions place indirect restrictions on $C$ that stem from restrictions on the long-run response of $X_t$ to a shock in an element of $\varepsilon_t$. In this section we use our RBC model to discuss these two types of assumptions and how they are imposed on VARs in practice.

⁶ For an early example, see Hansen and Sargent (1980, footnote 12). Sims and Zha (forthcoming) discuss the possibility that, although a given economic shock may not lie exactly in the space of current and past $Y_t$, it may nevertheless be ‘close’. They discuss methods to detect this case.

2.3.1. The Standard Version of the Model

The log-linearized equilibrium laws of motion for capital and hours in this model can be written as follows:

$$\log \hat{k}_{t+1} = \gamma_0 + \gamma_k \log \hat{k}_t + \gamma_z \log z_t + \gamma_l \tau_{l,t} + \gamma_x \tau_{x,t}, \tag{2.13}$$

and

$$\log l_t = a_0 + a_k \log \hat{k}_t + a_z \log z_t + a_l \tau_{l,t} + a_x \tau_{x,t}. \tag{2.14}$$

From (2.13) and (2.14), it is clear that all shocks have only a temporary effect on $l_t$ and $\hat{k}_t$.⁷ The only shock that has a permanent effect on labor productivity, $a_t \equiv y_t/l_t$, is $\varepsilon^z_t$. The other shocks do not have a permanent effect on $a_t$. Formally, this exclusion restriction is:

$$\lim_{j \to \infty} \left[E_t a_{t+j} - E_{t-1} a_{t+j}\right] = f(\varepsilon^z_t \text{ only}). \tag{2.15}$$

In our linear approximation to the model solution $f$ is a linear function. The model also implies the sign restriction that $f$ is an increasing function. In (2.15), $E_t$ is the expectation operator, conditional on the information set $\Omega_t = \left(\log \hat{k}_{t-s}, \log z_{t-s}, \tau_{l,t-s}, \tau_{x,t-s};\ s \ge 0\right)$.

In practice, researchers impose the exclusion and sign restrictions on a VAR to compute $\varepsilon^z_t$ and identify its dynamic effects on macroeconomic variables. Consider the $N \times 1$ vector, $Y_t$. The VAR for $Y_t$ is given by:

$$Y_{t+1} = B(L)Y_t + u_{t+1}, \quad Eu_tu_t' = V, \tag{2.16}$$

$$B(L) \equiv B_1 + B_2 L + \dots + B_q L^{q-1},$$

$$Y_t = \begin{pmatrix} \Delta \log a_t \\ \log l_t \\ x_t \end{pmatrix}.$$

Here, $x_t$ is an additional vector of variables that may be included in the VAR.
Motivated by the type of reasoning discussed in the previous subsection, researchers assume that the fundamental economic shocks are related to $u_t$ as follows:

$$u_t = C\varepsilon_t, \quad E\varepsilon_t\varepsilon_t' = I, \quad CC' = V. \tag{2.17}$$

Without loss of generality, we assume that the first element in $\varepsilon_t$ is $\varepsilon^z_t$. We can easily verify that:

$$\lim_{j \to \infty} \left[\tilde{E}_t a_{t+j} - \tilde{E}_{t-1} a_{t+j}\right] = \tau \left[I - B(1)\right]^{-1} C\varepsilon_t, \tag{2.18}$$

where $\tau$ is a row vector with all zeros, but with unity in the first location. Here:

$$B(1) \equiv B_1 + \dots + B_q.$$

Also, $\tilde{E}_t$ is the expectation operator, conditional on $\tilde{\Omega}_t = \{Y_t, \dots, Y_{t-q+1}\}$. As mentioned above, to compute the dynamic effects of $\varepsilon^z_t$, we require $B_1, \dots, B_q$ and $C_1$, the first column of $C$.

⁷ Cooley and Dwyer (1998) argue that in the standard RBC model, if technology shocks have a unit root, then per capita hours worked will be difference stationary. This claim, which plays an important role in their analysis of VARs, is incorrect.

The symmetric matrix, $V$, and the $B_i$’s can be computed using ordinary least squares regressions. However, the requirement that $CC' = V$ is not sufficient to determine a unique value of $C_1$. Adding the exclusion and sign restrictions does uniquely determine $C_1$. Relation (2.18) implies that these restrictions are:

$$\text{exclusion restriction:} \quad \left[I - B(1)\right]^{-1} C = \begin{bmatrix} \text{number} & 0 \\ \text{numbers} & \text{numbers} \end{bmatrix},$$

where 0 is a row vector and

$$\text{sign restriction:} \quad (1,1) \text{ element of } \left[I - B(1)\right]^{-1} C \text{ is positive.}$$

There are many matrices, $C$, that satisfy $CC' = V$ as well as the exclusion and sign restrictions. It is well-known that the first column, $C_1$, of each of these matrices is the same. We prove this result here, because elements of the proof will be useful to analyze our simulation results. Let

$$D \equiv \left[I - B(1)\right]^{-1} C.$$

Let $S_Y(\omega)$ denote the spectral density of $Y_t$ at frequency $\omega$ that is implied by the $q$th-order VAR. Then:

$$DD' = \left[I - B(1)\right]^{-1} V \left[I - B(1)'\right]^{-1} = S_Y(0). \tag{2.19}$$

The exclusion restriction requires that $D$ have a particular pattern of zeros:

$$D = \begin{bmatrix} \underset{1 \times 1}{d_{11}} & \underset{1 \times (N-1)}{0} \\ \underset{(N-1) \times 1}{D_{21}} & \underset{(N-1) \times (N-1)}{D_{22}} \end{bmatrix},$$

so that

$$DD' = \begin{bmatrix} d_{11}^2 & d_{11} D_{21}' \\ D_{21} d_{11} & D_{21}D_{21}' + D_{22}D_{22}' \end{bmatrix} = \begin{bmatrix} S_Y^{11}(0) & S_Y^{21}(0)' \\ S_Y^{21}(0) & S_Y^{22}(0) \end{bmatrix},$$

where

$$S_Y(\omega) \equiv \begin{bmatrix} S_Y^{11}(\omega) & S_Y^{21}(\omega)' \\ S_Y^{21}(\omega) & S_Y^{22}(\omega) \end{bmatrix}.$$

The exclusion restriction implies that

$$d_{11}^2 = S_Y^{11}(0), \quad D_{21} = S_Y^{21}(0)/d_{11}. \tag{2.20}$$

There are two solutions to (2.20). The sign restriction

$$d_{11} > 0 \tag{2.21}$$

selects one of the two solutions to (2.20). So, the first column of $D$, $D_1$, is uniquely determined. By our definition of $C$, we have

$$C_1 = \left[I - B(1)\right] D_1. \tag{2.22}$$

We conclude that $C_1$ is uniquely determined.
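The steps in (2.19) to (2.22) translate directly into a few lines of linear algebra. The sketch below is illustrative only: the function name `long_run_first_column` and the numerical inputs are hypothetical, and the lag matrices and $V$ are taken as given rather than estimated. It builds $S_Y(0)$ from $B(1)$ and $V$, solves (2.20) under the sign restriction (2.21), and maps $D_1$ back to $C_1$ using (2.22).

```python
import numpy as np

def long_run_first_column(B_list, V):
    """Return C_1 implied by the long-run exclusion and sign restrictions."""
    N = V.shape[0]
    B1 = sum(B_list)                        # B(1) = B_1 + ... + B_q
    A = np.linalg.inv(np.eye(N) - B1)       # [I - B(1)]^(-1)
    S0 = A @ V @ A.T                        # S_Y(0) as defined in (2.19)
    d11 = np.sqrt(S0[0, 0])                 # (2.20), positive root by (2.21)
    D1 = np.concatenate(([d11], S0[1:, 0] / d11))
    return (np.eye(N) - B1) @ D1            # (2.22)

# Hypothetical one-lag, two-variable example with a known answer:
# choose a long-run matrix D_true with a zero in the (1,2) position,
# back out C_true and V, and check that C_1 is recovered from (B, V) alone.
B1 = np.array([[0.5, 0.1],
               [0.2, 0.3]])
D_true = np.array([[2.0, 0.0],
                   [-1.0, 0.5]])
C_true = (np.eye(2) - B1) @ D_true
V = C_true @ C_true.T

C1 = long_run_first_column([B1], V)
print("recovered first column:", C1)
print("matches C_true[:, 0]:", np.allclose(C1, C_true[:, 0]))
```

Note that only $B(1)$ and $V$ enter the calculation, which is why errors in the estimated sum of the VAR coefficients feed directly into the long-run identified impulse responses.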

2.3.2. The Recursive Version of the Model

In the recursive version of the model, the policy rule for labor involves $\log z_{t-1}$ and $\tau_{x,t-1}$, because these variables help forecast $\log z_t$ and $\tau_{x,t}$:

$$\log l_t = a_0 + a_k \log \hat{k}_t + \tilde{a}_l \tau_{l,t} + \tilde{a}'_z \log z_{t-1} + \tilde{a}'_x \tau_{x,t-1}.$$

Because labor is a state variable at the time the investment decision is made, the equilibrium law of motion for $\hat{k}_{t+1}$ is:

$$\log \hat{k}_{t+1} = \gamma_0 + \gamma_k \log \hat{k}_t + \tilde{\gamma}_z \log z_t + \tilde{\gamma}_l \tau_{l,t} + \tilde{\gamma}_x \tau_{x,t} + \tilde{\gamma}'_z \log z_{t-1} + \tilde{\gamma}'_x \tau_{x,t-1}.$$

As in the standard model, the only shock that affects $a_t$ in the long run is a shock to technology. So, the long-run identification strategy discussed in section 2.3.1 applies to the recursive version of the model. However, an alternative procedure for identifying $\varepsilon^z_t$ applies to this version of the model. We refer to this alternative procedure as the ‘short-run’ identification strategy because it involves recovering $\varepsilon^z_t$ using only the realized one-step-ahead forecast errors in labor productivity and hours, as well as the second moment properties of those forecast errors.

Let $u^a_{\Omega,t}$ and $u^l_{\Omega,t}$ denote the population one-step-ahead forecast errors in $a_t$ and $\log l_t$, conditional on the information set, $\Omega_{t-1}$. The recursive version of the model implies that

$$u^a_{\Omega,t} = \alpha_1 \varepsilon^z_t + \alpha_2 \varepsilon^l_t, \quad u^l_{\Omega,t} = \gamma \varepsilon^l_t,$$

where $\alpha_1 > 0$, $\alpha_2$, and $\gamma$ are functions of the model parameters. The projection of $u^a_{\Omega,t}$ on $u^l_{\Omega,t}$ is given by

$$u^a_{\Omega,t} = \beta u^l_{\Omega,t} + \alpha_1 \varepsilon^z_t, \quad \text{where } \beta = \frac{\operatorname{cov}\left(u^a_{\Omega,t}, u^l_{\Omega,t}\right)}{\operatorname{var}\left(u^l_{\Omega,t}\right)}. \tag{2.23}$$

Because we normalize the standard deviation of $\varepsilon^z_t$ to unity, $\alpha_1$ is given by:

$$\alpha_1 = \sqrt{\operatorname{var}\left(u^a_{\Omega,t}\right) - \beta^2 \operatorname{var}\left(u^l_{\Omega,t}\right)}.$$

In practice, we implement the previous procedure using the one-step-ahead forecast errors generated from a VAR in which the variables in $Y_t$ are ordered as follows:

$$Y_t = \begin{pmatrix} \log l_t \\ \Delta \log a_t \\ x_t \end{pmatrix}.$$

We write the vector of VAR one-step-ahead forecast errors, $u_t$, as:

$$u_t = \begin{pmatrix} u^l_t \\ u^a_t \\ u^x_t \end{pmatrix}.$$

We identify the technology shock with the second element in $\varepsilon_t$ in (2.17).
To compute the dynamic response of the variables in $Y_t$ to the technology shock we need $B_1, \dots, B_q$ in (2.16) and the second column, $C_2$, of the matrix $C$, in (2.17). We obtain $C_2$ in two steps. First, we identify the technology shock using:

$$\varepsilon^z_t = \frac{1}{\hat{\alpha}_1}\left(u^a_t - \hat{\beta} u^l_t\right),$$

where

$$\hat{\beta} = \frac{\operatorname{cov}\left(u^a_t, u^l_t\right)}{\operatorname{var}\left(u^l_t\right)}, \quad \hat{\alpha}_1 = \sqrt{\operatorname{var}\left(u^a_t\right) - \hat{\beta}^2 \operatorname{var}\left(u^l_t\right)}.$$

The required variances and covariances are obtained from the estimate of $V$ in (2.16). Second, we regress $u_t$ on $\varepsilon^z_t$ to obtain:⁸

$$C_2 = \begin{pmatrix} \dfrac{\operatorname{cov}(u^l, \varepsilon^z)}{\operatorname{var}(\varepsilon^z)} \\[2ex] \dfrac{\operatorname{cov}(u^a, \varepsilon^z)}{\operatorname{var}(\varepsilon^z)} \\[2ex] \dfrac{\operatorname{cov}(u^x, \varepsilon^z)}{\operatorname{var}(\varepsilon^z)} \end{pmatrix} = \begin{pmatrix} 0 \\[1ex] \hat{\alpha}_1 \\[1ex] \dfrac{1}{\hat{\alpha}_1}\left(\operatorname{cov}\left(u^x_t, u^a_t\right) - \hat{\beta} \operatorname{cov}\left(u^x_t, u^l_t\right)\right) \end{pmatrix}.$$

2.4. Parameterization of the Model

We consider different specifications of the RBC model that are distinguished by the parameterization of the laws of motion of the exogenous shocks. In all specifications we assume, as in CKM, that:

$$\begin{aligned}
&\beta = 0.98^{1/4}, \quad \theta = 0.33, \quad \delta = 1 - (1 - .06)^{1/4}, \quad \psi = 2.5, \quad \gamma = 1.01^{1/4} - 1, \\
&\tau_x = 0.3, \quad \tau_l = 0.242, \quad \mu_z = 1.016^{1/4} - 1, \quad \sigma = 1.
\end{aligned} \tag{2.24}$$

2.4.1. Our MLE Parameterizations

We estimate two versions of our model. In the two-shock maximum likelihood estimation (MLE) specification we assume that $\sigma_x = 0$, so that there are two shocks, $\tau_{l,t}$ and $\log z_t$. We estimate the parameters $\rho_l$, $\sigma_l$, and $\sigma_z$, by maximizing the Gaussian likelihood function of the vector, $X_t = (\Delta \log y_t, \log l_t)'$, subject to (2.24).⁹ Our results are given by:

$$\begin{aligned}
\log z_t &= \mu_z + 0.00953\, \varepsilon^z_t, \\
\tau_{l,t} &= (1 - 0.986)\, \tau_l + 0.986\, \tau_{l,t-1} + 0.0056\, \varepsilon^l_t.
\end{aligned}$$

The three-shock MLE specification incorporates the investment tax shock, $\tau_{x,t}$, into the model. We estimate the three-shock MLE version of the model by maximizing the Gaussian likelihood function of the vector, $X_t = (\Delta \log y_t, \log l_t, \Delta \log i_t)'$, subject to the parameter values in (2.24). The results are:

$$\begin{aligned}
\log z_t &= \mu_z + 0.00968\, \varepsilon^z_t, \\
\tau_{l,t} &= (1 - 0.9994)\, \tau_l + 0.9994\, \tau_{l,t-1} + 0.00631\, \varepsilon^l_t, \\
\tau_{x,t} &= (1 - 0.9923)\, \tau_x + 0.9923\, \tau_{x,t-1} + 0.00963\, \varepsilon^x_t.
\end{aligned}$$

⁸ We implement the procedure for estimating $C_2$ by computing $CC' = V$, where $C$ is the lower triangular Cholesky decomposition of $V$, and setting $C_2$ equal to the second column of $C$.

⁹ We use the standard Kalman filter strategy discussed in Hamilton (1994, section 13.4). We remove the sample mean from $X_t$ prior to estimation and set the measurement error in the Kalman filter system to zero, i.e., $R = 0$ in (2.6).
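Returning to the short-run identification of section 2.3.2, the equivalence asserted in footnote 8 between the two-step projection and the Cholesky factor can be checked numerically. The following is a minimal sketch under stated assumptions: the function name `short_run_second_column` and the covariance matrix `V` are hypothetical, and `V` stands in for an estimated forecast error covariance with the variables ordered as hours, productivity growth, and one additional variable.

```python
import numpy as np

def short_run_second_column(V):
    """Two-step estimate of C_2 when Y_t is ordered (log l, dlog a, x)."""
    var_l, var_a, cov_al = V[0, 0], V[1, 1], V[1, 0]
    beta_hat = cov_al / var_l                           # projection of u^a on u^l
    alpha1_hat = np.sqrt(var_a - beta_hat**2 * var_l)   # s.d. of the projection residual
    # cov(u, eps_z) with eps_z = (u^a - beta_hat * u^l) / alpha1_hat and var(eps_z) = 1
    return (V[:, 1] - beta_hat * V[:, 0]) / alpha1_hat

# Hypothetical positive definite covariance matrix of VAR forecast errors
V = np.array([[1.0, 0.3, 0.2],
              [0.3, 2.0, 0.5],
              [0.2, 0.5, 1.5]])

C2 = short_run_second_column(V)
C2_cholesky = np.linalg.cholesky(V)[:, 1]   # second column of the lower triangular factor
print("two-step C_2:", C2)
print("agrees with Cholesky column:", np.allclose(C2, C2_cholesky))
```

The first element of the result is zero by construction, which reflects the timing assumption that hours do not respond to the technology shock within the period.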

The estimated values of $\rho_x$ and $\rho_l$ are close to unity. This finding is consistent with other research that also reports that shocks in estimated general equilibrium models exhibit high degrees of serial correlation.¹⁰

2.4.2. CKM Parameterizations

The two-shock CKM specification has two shocks, $z_t$ and $\tau_{l,t}$. These shocks have the following time series representations:

$$\begin{aligned}
\log z_t &= \mu_z + 0.0131\, \varepsilon^z_t, \\
\tau_{l,t} &= (1 - 0.952)\, \tau_l + 0.952\, \tau_{l,t-1} + 0.0136\, \varepsilon^l_t.
\end{aligned}$$

The three-shock CKM specification adds an investment shock, $\tau_{x,t}$, to the model, and has the following law of motion:

$$\tau_{x,t} = (1 - 0.98)\, \tau_x + 0.98\, \tau_{x,t-1} + 0.0123\, \varepsilon^x_t. \tag{2.25}$$

As in our specifications, CKM obtain their parameter estimates using maximum likelihood methods. However, their estimates are very different from ours. For example, the variances of the shocks are larger in the two-shock CKM specification than in our MLE specification. Also, the ratio of $\sigma_l^2$ to $\sigma_z^2$ is nearly three times larger in the two-shock CKM specification than in our two-shock MLE specification. Section 5 below discusses the reasons for these differences.

2.5. The Importance of Technology Shocks for Hours Worked

Table 1 reports the contribution, $V_h$, of technology shocks to three different measures of the volatility in the log of hours worked: (i) the variance of the log hours, (ii) the variance of HP-filtered, log hours and (iii) the variance in the one-step-ahead forecast error in log hours.¹¹ With one exception, we compute the analogous statistics for log output. The exception is (i), for which we compute the contribution of technology shocks to the variance of the growth rate of output.

The key result in this table is that technology shocks account for a very small fraction of the volatility in hours worked. When $V_h$ is measured according to (i), it is always below 4 percent. When $V_h$ is measured using (ii) or (iii) it is always below 8 percent.
For both (ii) and (iii), in the CKM specifications, $V_h$ is below 2 percent.12 Consistent with the RBC literature, the table also shows that technology accounts for a much larger movement in output.

10 See, for example, Christiano (1988), Christiano, et al. (2004), and Smets and Wouters (2003).

11 We compute forecast error variances based on a four-lag VAR. The variables in the VAR depend on whether the calculations correspond to the two- or three-shock model. In the case of the two-shock model, the VAR has two variables, output growth and log hours. In the case of the three-shock model, the VAR has three variables: output growth, log hours, and the log of the investment to output ratio. Computing $V_h$ requires estimating VARs in artificial data generated with all shocks, as well as in artificial data generated with only the technology shock. In the latter case, the one-step-ahead forecast error from the VAR is well defined, even though the VAR coefficients themselves are not well defined due to multicollinearity problems.

12 When we measure $V_h$ according to (i), $V_h$ drops from 3.73 in the two-shock MLE model to 0.18 in the three-shock MLE model. The analogous drop in $V_h$ is an order of magnitude smaller when $V_h$ is measured using (ii) or (iii). The reason for this difference is that $\rho_l$ goes from 0.986 in the two-shock MLE model to 0.9994 in the three-shock MLE model. In the latter specification there is a near-unit root in $\tau_{l,t}$, which translates into a near-unit root in hours worked. As a result, the variance of hours worked becomes very large at the low frequencies. The near-unit root in $\tau_{l,t}$ has less of an effect on hours worked at high and business cycle frequencies.
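As an illustration of the CKM shock processes in Section 2.4.2, the following minimal sketch simulates 180 observations of the three shocks. The coefficients are those reported in the text; the means `mu_z`, `tau_l_bar`, and `tau_x_bar`, the burn-in length, and the function names are assumptions introduced here for illustration and are not taken from the paper.

```python
import numpy as np

def ar1_std(rho, sigma):
    """Unconditional standard deviation of x_t = (1-rho)*xbar + rho*x_{t-1} + sigma*eps_t."""
    return sigma / np.sqrt(1.0 - rho ** 2)

def simulate_ckm_shocks(T=180, mu_z=0.0, tau_l_bar=0.0, tau_x_bar=0.0,
                        burn=200, seed=0):
    """Simulate log z_t, tau_{l,t}, tau_{x,t} with i.i.d. N(0,1) innovations.

    The means and the burn-in length are hypothetical placeholders.
    """
    rng = np.random.default_rng(seed)
    n = T + burn
    eps = rng.standard_normal((n, 3))
    log_z = mu_z + 0.0131 * eps[:, 0]          # i.i.d. around mu_z
    tau_l = np.full(n, tau_l_bar, dtype=float)
    tau_x = np.full(n, tau_x_bar, dtype=float)
    for t in range(1, n):
        tau_l[t] = (1 - 0.952) * tau_l_bar + 0.952 * tau_l[t - 1] + 0.0136 * eps[t, 1]
        tau_x[t] = (1 - 0.98) * tau_x_bar + 0.98 * tau_x[t - 1] + 0.0123 * eps[t, 2]
    # Drop the burn-in so the retained sample starts near the stationary distribution.
    return log_z[burn:], tau_l[burn:], tau_x[burn:]

log_z, tau_l, tau_x = simulate_ckm_shocks()
std_tau_l = ar1_std(0.952, 0.0136)   # implied unconditional std of tau_l
std_tau_x = ar1_std(0.98, 0.0123)    # implied unconditional std of tau_x
```

The helper `ar1_std` makes explicit how a small innovation standard deviation combined with high persistence still produces a sizable unconditional standard deviation for the tax shocks.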

Figure 1 displays visually how unimportant technology shocks are for hours worked. The top panel displays two sets of 180 artificial observations on hours worked, simulated using the standard two-shock MLE specification. The volatile time series shows how log hours worked evolve in the presence of shocks to both $z_t$ and $\tau_{l,t}$. The other time series shows how log hours worked evolve in response to just the technology shock, $z_t$. The bottom panel is the analog of the top panel when the data are generated using the standard two-shock CKM specification.

3. Results Based on RBC Data Generating Mechanisms

In this section we analyze the properties of conventional VAR-based strategies for identifying the effects of a technology shock on hours worked. We focus on the bias properties of the impulse response estimator, and on standard procedures for estimating sampling uncertainty.

We use the RBC model parameterizations discussed in the previous section as the data generating processes. For each parameterization, we simulate 1,000 data sets of 180 observations each. The shocks $\varepsilon^z_t$, $\varepsilon^l_t$, and possibly $\varepsilon^x_t$, are drawn from i.i.d. standard normal distributions. For each artificial data set, we estimate a four-lag VAR. The average, across the 1,000 data sets, of the estimated impulse response functions allows us to assess bias.

For each data set we also estimate two different confidence intervals: a percentile-based confidence interval and a standard-deviation-based confidence interval.13 We construct the intervals using the following bootstrap procedure. Using random draws from the fitted VAR disturbances, we use the estimated four-lag VAR to generate 200 synthetic data sets, each with 180 observations. For each of these 200 synthetic data sets we estimate a new VAR and impulse response function. For each artificial data set the percentile-based confidence interval is defined by the 2.5th and 97.5th percentiles of the estimated coefficients in the dynamic response functions.
The standard-deviation-based confidence interval is defined as the estimated impulse response plus or minus two standard deviations, where the standard deviations are calculated across the 200 simulated estimated coefficients in the dynamic response functions.

We assess the accuracy of the confidence interval estimators in two ways. First, we compute the coverage rate for each type of confidence interval. This rate is the fraction of times, across the 1,000 data sets simulated from the economic model, that the confidence interval contains the relevant true coefficient. If the confidence intervals were perfectly accurate, the coverage rate would be 95 percent. Second, we provide an indication of the actual degree of sampling uncertainty in the VAR-based impulse response functions. In particular, we report centered 95 percent probability intervals for each lag in our impulse response function estimators.14 If the confidence intervals were perfectly accurate, they should on average coincide with the boundary of the 95 percent probability interval.

When we generate data from the two-shock MLE and CKM specifications, we set $Y_t = (\Delta \log a_t, \log l_t)'$. When we generate data from the three-shock MLE and CKM specifications, we set $Y_t = (\Delta \log a_t, \log l_t, \log i_t/y_t)'$.

13 Sims and Zha (1999) refer to what we call the percentile-based confidence interval as the ‘other-percentile bootstrap interval’. This procedure has been used in several studies, such as Blanchard and Quah (1989), Christiano, Eichenbaum, and Evans (1999), Francis and Ramey (2004), McGrattan (2006), and Runkle (1987). The standard-deviation-based confidence interval has been used by other researchers, such as Christiano, Eichenbaum, and Evans (2005), Gali (1999), and Gali and Rabanal (2004).

14 For each lag starting at the impact period, we ordered the 1,000 estimated impulse responses from smallest to largest. The lower and upper boundaries correspond to the 25th and the 975th impulses in this ordering.

3.1. Short-Run Identification

Results for the Two- and Three-Shock MLE Specifications

Figure 2 reports results generated from four different parameterizations of the recursive version of the RBC model. In each panel, the solid line is the average estimated impulse response function for the 1,000 data sets simulated using the indicated economic model. For each model, the starred line is the true impulse response function of hours worked. In each panel, the gray area defines the centered 95 percent probability interval for the estimated impulse response functions. The stars with no line indicate the average percentile-based confidence intervals across the 1,000 data sets. The circles with no line indicate the average standard-deviation-based confidence intervals.

Figures 3 and 4 graph the coverage rates for the percentile-based and standard-deviation-based confidence intervals. For each case we graph how often, across the 1,000 data sets simulated from the economic model, the econometrician’s confidence interval contains the relevant coefficient of the true impulse response function.

The 1,1 panel in Figure 2 exhibits the properties of the VAR-based estimator of the response of hours to a technology shock when the data are generated by the two-shock MLE specification. The 2,1 panel corresponds to the case when the data generating process is the three-shock MLE specification.

The panels have two striking features. First, there is essentially no evidence of bias in the estimated impulse response functions. In all cases, the solid lines are very close to the starred lines. Second, an econometrician would not be misled in inference by using standard procedures for constructing confidence intervals. The circles and stars are close to the boundaries of the gray area. The 1,1 panels in Figures 3 and 4 indicate that the coverage rates are roughly 90 percent.
So, with high probability, VAR-based confidence intervals include the true value of the impulse response coefficients.

Results for the CKM Specification

The second column of Figure 2 reports the results when the data generating process is given by variants of the CKM specification. The 1,2 and 2,2 panels correspond to the two- and three-shock CKM specifications, respectively.

The second column of Figure 2 contains the same striking features as the first column. There is very little bias in the estimated impulse response functions. In addition, the average value of the econometrician’s confidence interval coincides closely with the actual range of variation in the impulse response function (the gray area). Coverage rates, reported in the 1,2 panels of Figures 3 and 4, are roughly 90 percent. These rates are consistent with the view that VAR-based procedures lead to reliable inference.

A comparison of the gray areas across the first and second columns of Figure 2 clearly indicates that more sampling uncertainty occurs when the data are generated from the CKM specifications than when they are generated from the MLE specifications (the gray areas are wider). VAR-based confidence intervals detect this fact.
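The bootstrap procedure described at the start of Section 3 can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the helper names (`fit_var`, `irf`, `bootstrap_ci`), the use of a lower-triangular Cholesky factor with a particular shock ordering, the treatment of initial conditions, and the small bivariate VAR(1) used to generate demonstration data are all hypothetical choices made here.

```python
import numpy as np

def fit_var(Y, q):
    """OLS fit of a VAR(q) with intercept: returns (c, [B_1..B_q], residuals)."""
    T, n = Y.shape
    X = np.hstack([np.ones((T - q, 1))] + [Y[q - j:T - j] for j in range(1, q + 1)])
    coef, *_ = np.linalg.lstsq(X, Y[q:], rcond=None)
    B = [coef[1 + n * (j - 1):1 + n * j].T for j in range(1, q + 1)]
    return coef[0], B, Y[q:] - X @ coef

def irf(B, resid, horizon, shock=0, var=1):
    """Response of variable `var` to shock `shock` (Cholesky impact: an assumption)."""
    n = B[0].shape[0]
    C = np.linalg.cholesky(resid.T @ resid / resid.shape[0])
    theta = [np.eye(n)]
    for k in range(1, horizon + 1):
        theta.append(sum(B[j] @ theta[k - 1 - j] for j in range(min(k, len(B)))))
    return np.array([th[var] @ C[:, shock] for th in theta])

def bootstrap_ci(Y, q=4, horizon=10, n_boot=200, rng=None):
    """Percentile and +/- two-standard-deviation bootstrap bands for the IRF."""
    rng = np.random.default_rng(rng)
    Y = np.asarray(Y, dtype=float)
    T = Y.shape[0]
    c, B, resid = fit_var(Y, q)
    point = irf(B, resid, horizon)
    draws = []
    for _ in range(n_boot):
        Yb = np.empty_like(Y)
        Yb[:q] = Y[:q]  # initial conditions taken from the data (an assumption)
        e = resid[rng.integers(0, resid.shape[0], size=T - q)]  # resample residuals
        for t in range(q, T):
            Yb[t] = c + sum(B[j] @ Yb[t - 1 - j] for j in range(q)) + e[t - q]
        _, Bb, rb = fit_var(Yb, q)
        draws.append(irf(Bb, rb, horizon))
    draws = np.array(draws)
    pct = np.percentile(draws, [2.5, 97.5], axis=0)
    sd = draws.std(axis=0, ddof=1)
    return point, pct, (point - 2 * sd, point + 2 * sd)

# Demonstration data from a hypothetical stable bivariate VAR(1), T = 180.
rng = np.random.default_rng(0)
A = np.array([[0.5, 0.1], [0.2, 0.7]])
Y = np.zeros((180, 2))
for t in range(1, 180):
    Y[t] = A @ Y[t - 1] + rng.standard_normal(2)
point, pct_band, sd_band = bootstrap_ci(Y, rng=1)
```

Repeating the last step over many data sets simulated from a known model, and recording how often each band contains the true response, would give coverage rates of the kind plotted in Figures 3 and 4.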

3.2. Long-Run Identification

Results for the Two- and Three-Shock MLE Specifications

The first and second rows of column 1 in Figure 5 exhibit our results when the data are generated by the two- and three-shock MLE specifications. Once again there is virtually no bias in the estimated impulse response functions and inference is accurate. The coverage rates associated with the percentile-based confidence intervals are very close to 95 percent (see Figure 3). The coverage rates for the standard-deviation-based confidence intervals are somewhat lower, roughly 80 percent (see Figure 4). The difference in coverage rates can be seen in Figure 5, which shows that the stars are shifted down slightly relative to the circles. Still, the circles and stars are very good indicators of the boundaries of the gray area, although not quite as good as in the analog cases in Figure 2.

Comparing Figures 2 and 5, we see that Figure 5 reports more sampling uncertainty. That is, the gray areas are wider. Again, the crucial point is that the econometrician who computes standard confidence intervals would detect the increase in sampling uncertainty.

Results for the CKM Specification

The third and fourth rows of column 1 in Figure 5 report results for the two- and three-shock CKM specifications. Consistent with results reported in CKM, there is substantial bias in the estimated dynamic response functions. For example, in the two-shock CKM specification, the contemporaneous response of hours worked to a one-standard-deviation technology shock is 0.3 percent, while the mean estimated response is 0.97 percent. This bias stands in contrast to our other results.

Is this bias big or problematic? In our view, bias cannot be evaluated without taking into account sampling uncertainty. Bias matters only to the extent that the econometrician is led to an incorrect inference. For example, suppose sampling uncertainty is large and the econometrician knows it.
Then the econometrician would conclude that the data contain little information and, therefore, would not be misled. In this case, we say that bias is not large. In contrast, suppose sampling uncertainty is large, but the econometrician thinks it is small. Here, we would say bias is large.

We now turn to the sampling uncertainty in the CKM specifications. Figure 5 shows that the econometrician’s average confidence interval is large relative to the bias. Interestingly, the percentile confidence intervals (stars) are shifted down slightly relative to the standard-deviation-based confidence intervals (circles). On average, the estimated impulse response function is not in the center of the percentile confidence interval. This phenomenon often occurs in practice.15 Recall that we estimate a four-lag VAR in each of our 1,000 synthetic data sets. For the purposes of the bootstrap, each of these VARs is treated as a true data generating process. The asymmetric percentile confidence intervals show that when data are generated by these VARs, VAR-based estimators of the impulse response function have a downward bias.

Figure 3 reveals that for the two- and three-shock CKM specifications, percentile-based coverage rates are reasonably close to 95 percent. Figure 4 shows that the standard-deviation-based coverage rates are lower than the percentile-based coverage rates. However, even these coverage rates are relatively high in that they exceed 70 percent.

15 An extreme example, in which the point estimates roughly coincide with one of the boundaries of the percentile-based confidence interval, appears in Blanchard and Quah (1989).

In summary, the results for the MLE specification differ from those of the CKM specifications in two interesting ways. First, sampling uncertainty is much larger with the CKM specification. Second, the estimated responses are somewhat biased with the CKM specification. But the bias is small: It has no substantial effect on inference, at least as judged by coverage rates for the econometrician’s confidence intervals.

3.3. Confidence Intervals in the RBC Examples and a Situation in Which VAR-Based Procedures Go Awry

Here we show that the more important technology shocks are in the dynamics of hours worked, the easier it is for VARs to answer the question, ‘how do hours worked respond to a technology shock’. We demonstrate this by considering alternative values of the innovation variance in the labor tax, $\sigma_l$, and by considering alternative values of $\sigma$, the utility parameter that controls the Frisch elasticity of labor supply.

Consider Figure 6, which focuses on the long-run identification schemes. The first and second columns report results for the two-shock MLE and CKM specifications, respectively. For each specification we redo our experiments, reducing $\sigma_l$ by a half and then by a quarter. Table 1 shows that the importance of technology shocks rises as the standard deviation of the labor tax shock falls. Figure 6 indicates that the magnitude of sampling uncertainty and the size of confidence intervals fall as the relative importance of labor tax shocks falls.16

Figure 7 presents the results of a different set of experiments based on perturbations of the two-shock CKM specification. The 1,1 and 2,1 panels show what happens when we vary the value of $\sigma$, the parameter that controls the Frisch labor supply elasticity. In the 1,1 panel we set $\sigma = 6$, which corresponds to a Frisch elasticity of 0.63.
In the 2,1 panel, we set $\sigma = 0$, which corresponds to a Frisch elasticity of infinity. As the Frisch elasticity is increased, the fraction of the variance in hours worked due to technology shocks decreases (see Table 1). The magnitude of bias and the size of confidence intervals are larger for the higher Frisch elasticity case. In both cases the bias is still smaller than the sampling uncertainty.

We were determined to construct at least one example in which the VAR-based estimator of impulse response functions has bad properties, i.e., bias is larger than sampling uncertainty. We display such an example in the 3,1 panel of Figure 7. The data generating process is a version of the two-shock CKM model with an infinite Frisch elasticity and double the standard deviation of the labor tax rate. Table 1 indicates that with this specification, technology shocks account for a trivial fraction of the variance in hours worked. Of the three measures of $V_h$, two are 0.46 percent and the third is 0.66 percent. The 3,1 panel of Figure 7 shows that the VAR-based procedure now has very bad properties: the true value of the impulse response function lies outside the average value of both confidence intervals that we consider. This example shows that constructing scenarios in which VAR-based procedures go awry is certainly possible. However, this example seems unlikely to be of practical significance given the poor fit to the data of this version of the model.

16 As $\sigma_l$ falls, the total volatility of hours worked falls, as does the relative importance of labor tax shocks. In principle, both effects contribute to the decline in sampling uncertainty.

3.4. Are Long-Run Identification Schemes Informative?

Up to now, we have focused on the RBC model as the data generating process. For empirically reasonable specifications of the RBC model, confidence intervals associated with long-run identification schemes are large. One might be tempted to conclude that VAR-based long-run identification schemes are uninformative. Specifically, are the confidence intervals so large that we can never discriminate between competing economic models? Erceg, Guerrieri, and Gust (2005) show that the answer to this question is ‘no’. They consider an RBC model similar to the one discussed above and a version of the sticky wage-price model developed by Christiano, Eichenbaum, and Evans (2005) in which hours worked fall after a positive technology shock. They then conduct a series of experiments to assess the ability of a long-run identified structural VAR to discriminate between the two models on the basis of the response of hours worked to a technology shock.

Using estimated versions of each of the economic models as a data generating process, they generate 10,000 synthetic data sets, each with 180 observations. They then estimate a four-variable structural VAR on each synthetic data set and compute the dynamic response of hours worked to a technology shock using long-run identification. Erceg, Guerrieri, and Gust (2005) report that the probability of finding an initial decline in hours that persists for two quarters is much higher in the model with nominal rigidities than in the RBC model (93 percent versus 26 percent). So, if these are the only two models contemplated by the researcher, an empirical finding that hours worked decline after a positive innovation to technology will constitute compelling evidence in favor of the sticky wage-price model.
Erceg, Guerrieri, and Gust (2005) also report that the probability of finding an initial rise in hours that persists for two quarters is much higher in the RBC model than in the sticky wage-price model (71 percent versus 1 percent). So, an empirical finding that hours worked rise after a positive innovation to technology would constitute compelling evidence in favor of the RBC model versus the sticky wage-price alternative.

4. Contrasting Short- and Long-Run Restrictions

The previous section demonstrates that, in the examples we considered, when VARs are identified using short-run restrictions, the conventional estimator of impulse response functions is remarkably accurate. In contrast, for some parameterizations of the data generating process, the conventional estimator of impulse response functions based on long-run identifying restrictions can exhibit noticeable bias. In this section we argue that the key difference between the two identification strategies is that the long-run strategy requires an estimate of the sum of the VAR coefficients, $B(1)$. This object is notoriously difficult to estimate accurately (see Sims, 1972).

We consider a simple analytic expression related to one in Sims (1972). Our expression shows what an econometrician who fits a misspecified, fixed-lag, finite-order VAR would find in population. Let $\hat{B}_1, \ldots, \hat{B}_q$ and $\hat{V}$ denote the parameters of the $q$th-order VAR fit by the econometrician. Then:

$$\hat{V} = V + \min_{\hat{B}_1, \ldots, \hat{B}_q} \frac{1}{2\pi} \int_{-\pi}^{\pi} \left[ B\left(e^{-i\omega}\right) - \hat{B}\left(e^{-i\omega}\right) \right] S_Y(\omega) \left[ B\left(e^{i\omega}\right) - \hat{B}\left(e^{i\omega}\right) \right]' d\omega, \qquad (4.1)$$

where

$$B(L) = B_1 + B_2 L + B_3 L^2 + \cdots,$$
$$\hat{B}(L) = \hat{B}_1 + \hat{B}_2 L + \cdots + \hat{B}_4 L^3.$$

Here, $B(e^{-i\omega})$ and $\hat{B}(e^{-i\omega})$ correspond to $B(L)$ and $\hat{B}(L)$ with $L$ replaced by $e^{-i\omega}$.17 In (4.1), $B$ and $V$ are the parameters of the actual infinite-ordered VAR representation of the data (see (2.10)), and $S_Y(\omega)$ is the associated spectral density at frequency $\omega$.18 According to (4.1), estimation of a VAR approximately involves choosing VAR lag matrices to minimize a quadratic form in the difference between the estimated and true lag matrices. The quadratic form assigns greatest weight to the frequencies for which the spectral density is the greatest. If the econometrician’s VAR is correctly specified, then $\hat{B}(e^{-i\omega}) = B(e^{-i\omega})$ for all $\omega$, and $\hat{V} = V$, so that the estimator is consistent. If there is specification error, then $\hat{B}(e^{-i\omega}) \neq B(e^{-i\omega})$ for some $\omega$ and, because the minimized integral in (4.1) is added to $V$, $\hat{V} > V$.19 In our context, specification error exists because the true VAR implied by our data generating processes has $q = \infty$, but the econometrician uses a finite value of $q$.

To understand the implications of (4.1) for our analysis, it is useful to write in lag-operator form the estimated dynamic response of $Y_t$ to a shock in the first element of $\varepsilon_t$:

$$Y_t = \left[ I + \theta_1 L + \theta_2 L^2 + \cdots \right] \hat{C}_1 \varepsilon_{1,t}, \qquad (4.2)$$

where the $\theta_k$’s are related to the estimated VAR coefficients as follows:

$$\theta_k = \frac{1}{2\pi} \int_{-\pi}^{\pi} \left[ I - \hat{B}\left(e^{-i\omega}\right) e^{-i\omega} \right]^{-1} e^{k\omega i} \, d\omega. \qquad (4.3)$$

In the case of long-run identification, the vector $\hat{C}_1$ is computed using (2.22), and $\hat{B}(1)$ and $\hat{V}$ replace $B(1)$ and $V$ respectively. In the case of short-run identification, we compute $\hat{C}_1$ as the second column in the upper triangular Cholesky decomposition of $\hat{V}$.20

17 The minimization in (4.1) is actually over the trace of the indicated integral. One interpretation of (4.1) is that it provides the probability limit of our estimators, that is, what they would converge to as the sample size increases to infinity. We do not adopt this interpretation, because in practice an econometrician would use a consistent lag-length selection method. The probability limit of our estimators corresponds to the true impulse response functions for all cases considered in this paper.

18 The derivation of this formula is straightforward. Write (2.10) in lag operator form as follows:

$$Y_t = B(L) Y_{t-1} + u_t,$$

where $E u_t u_t' = V$. Let the fitted disturbances associated with a particular parameterization, $\hat{B}(L)$, be denoted $\hat{u}_t$. Simple substitution implies:

$$\hat{u}_t = \left[ B(L) - \hat{B}(L) \right] Y_{t-1} + u_t.$$

The two random variables on the right of the equality are orthogonal, so that the variance of $\hat{u}_t$ is just the sum of the variances of the two:

$$\mathrm{var}(\hat{u}_t) = \mathrm{var}\left( \left[ B(L) - \hat{B}(L) \right] Y_{t-1} \right) + V.$$

Expression (4.1) in the text follows immediately.

19 By $\hat{V} > V$, we mean that $\hat{V} - V$ is a positive definite matrix.

20 In the earlier discussion it was convenient to adopt the normalization that the technology shock is the second element of $\varepsilon_t$. Here, we adopt the same normalization as for the long-run identification, namely, that the technology shock is the first element of $\varepsilon_t$.
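The relationship in (4.3) between the VAR lag matrices and the moving-average coefficients $\theta_k$ can be checked numerically. The sketch below, offered as an illustration only, computes the $\theta_k$ in two ways: by the standard time-domain recursion $\theta_k = \sum_{j=1}^{\min(k,q)} \hat{B}_j \theta_{k-j}$ with $\theta_0 = I$, and by approximating the integral in (4.3) on a uniform frequency grid. The two lag matrices and the function names are hypothetical and are not taken from the paper.

```python
import numpy as np

def ma_coeffs_recursive(B, horizon):
    """theta_0..theta_horizon from VAR lag matrices B = [B_1, ..., B_q]."""
    n = B[0].shape[0]
    theta = [np.eye(n)]
    for k in range(1, horizon + 1):
        # theta_k = sum_{j=1}^{min(k,q)} B_j theta_{k-j}
        theta.append(sum(B[j] @ theta[k - 1 - j] for j in range(min(k, len(B)))))
    return theta

def ma_coeffs_frequency(B, horizon, n_grid=1024):
    """Approximate (4.3) by averaging over a uniform grid on [0, 2*pi).

    Integrating over [0, 2*pi) rather than [-pi, pi) is equivalent because
    the integrand is periodic.
    """
    n = B[0].shape[0]
    theta = [np.zeros((n, n), dtype=complex) for _ in range(horizon + 1)]
    for w in 2 * np.pi * np.arange(n_grid) / n_grid:
        # B(e^{-iw}) e^{-iw} = sum_j B_j e^{-iwj}
        lag_poly = sum(Bj * np.exp(-1j * w * (j + 1)) for j, Bj in enumerate(B))
        inv = np.linalg.inv(np.eye(n) - lag_poly)
        for k in range(horizon + 1):
            theta[k] += inv * np.exp(1j * w * k) / n_grid
    return [t.real for t in theta]

# Hypothetical stable bivariate VAR(2).
B = [np.array([[0.5, 0.1], [0.0, 0.4]]),
     np.array([[0.2, 0.0], [0.1, 0.1]])]
theta_time = ma_coeffs_recursive(B, horizon=8)
theta_freq = ma_coeffs_frequency(B, horizon=8)
max_gap = max(np.abs(a - b).max() for a, b in zip(theta_time, theta_freq))
```

Because each $\theta_k$ depends on $\hat{B}(e^{-i\omega})$ over the whole range of frequencies, an error in $\hat{B}$ at a single frequency has little effect on the dynamic part of the response.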

We use (4.1) to understand why estimation based on short-run and long-run identification can produce different results. According to (4.2), impulse response functions can be decomposed into two parts, the impact effect of the shocks, summarized by $\hat{C}_1$, and the dynamic part summarized in the term in square brackets. We argue that when a bias arises with long-run restrictions, it is because of difficulties in estimating $C_1$. These difficulties do not arise with short-run restrictions.

In the short-run identification case, $\hat{C}_1$ is a function of $\hat{V}$ only. Across a variety of numerical examples, we find that $\hat{V}$ is very close to $V$.21 This result is not surprising because (4.1) indicates that the entire objective of estimation is to minimize the distance between $\hat{V}$ and $V$. In the long-run identification case, $\hat{C}_1$ depends not only on $\hat{V}$ but also on $\hat{B}(1)$. A problem is that the criterion does not assign much weight to setting $\hat{B}(1) = B(1)$ unless $S_Y(\omega)$ happens to be relatively large in a neighborhood of $\omega = 0$. But a large value of $S_Y(0)$ is not something one can rely on.22 When $S_Y(0)$ is relatively small, attempts to match $\hat{B}(e^{-i\omega})$ with $B(e^{-i\omega})$ at other frequencies can induce large errors in $\hat{B}(1)$.

The previous argument about the difficulty of estimating $C_1$ in the long-run identification case does not apply to the $\theta_k$’s. According to (4.3), $\theta_k$ is a function of $\hat{B}(e^{-i\omega})$ over the whole range of $\omega$’s, not just one specific frequency.

We now present a numerical example, which illustrates Proposition 1 as well as some of the observations we have made in discussing (4.1). Our numerical example focuses on population results. Therefore, it provides only an indication of what happens in small samples.

To understand what happens in small samples, we consider four additional numerical examples. First, we show that when the econometrician uses the true value of $B(1)$, the bias and much of the sampling uncertainty associated with the two-shock CKM specification disappear.
Second, we demonstrate that bias problems essentially disappear when we use an alternative to the standard zero-frequency spectral density estimator used in the VAR literature. Third, we show that the problems are attenuated when the preference shock is more persistent. Fourth, we consider the recursive version of the two-shock CKM specification in which the effect of technology shocks can be estimated using either short- or long-run restrictions.

A Numerical Example

Table 2 reports various properties of the two-shock CKM specification. The first six $B_j$’s in the infinite-order VAR, computed using (2.12), are reported in Panel A. These $B_j$’s eventually converge to zero; however, they do so slowly. The speed of convergence is governed by the size of the maximal eigenvalue of the matrix $M$ in (2.8), which is 0.957. Panel B displays the $\hat{B}_j$’s that solve (4.1) with $q = 4$. Informally, the $\hat{B}_j$’s look similar to the $B_j$’s for $j = 1, 2, 3, 4$. In line with this observation, the sum of the true $B_j$’s, $B_1 + \cdots + B_4$, is similar in magnitude to the sum of the estimated $\hat{B}_j$’s, $\hat{B}(1)$ (see Panel C). But the econometrician using long-run restrictions needs a good estimate of $B(1)$. This matrix is very different from $B_1 + \cdots + B_4$. Although the remaining $B_j$’s for $j > 4$ are individually small, their sum is not. For example, the 1,1 element of $B(1)$ is 0.28, or six times larger than the 1,1 element of $B_1 + \cdots + B_4$.

21 This result explains why lag-length selection methods, such as the Akaike criterion, almost never suggest values of $q$ greater than 4 in artificial data sets of length 180, regardless of which of our data generating methods we used. These lag-length selection methods focus on $\hat{V}$.

22 Equation (4.1) shows that $\hat{B}(1)$ corresponds to only a single point in the integral. So other things equal, the estimation criterion assigns no weight at all to getting $\hat{B}(1)$ right. The reason $B(1)$ is identified in our setting is that the $B(\omega)$ functions we consider are continuous at $\omega = 0$.

The distortion in $\hat{B}(1)$ manifests itself in a distortion in the estimated zero-frequency spectral density (see Panel D). As a result, there is distortion in the estimated impact vector, $\hat{C}_1$ (Panel F).23 To illustrate the significance of the latter distortion for estimated impulse response functions, we display in Figure 8 the part of (4.2) that corresponds to the response of hours worked to a technology shock. In addition, we display the true response. There is a substantial distortion, which is approximately the same magnitude as the one reported for small samples in Figure 5. The third line in Figure 8 corresponds to (4.2) when $\hat{C}_1$ is replaced by its true value, $C_1$. Most of the distortion in the estimated impulse response function is eliminated by this replacement. Finally, the distortion in $\hat{C}_1$ is due to distortion in $\hat{B}(1)$, as $\hat{V}$ is virtually identical to $V$ (Panel E).

This example is consistent with our overall conclusion that the individual $B_j$’s and $V$ are well estimated by the econometrician using a four-lag VAR. The distortions that arise in practice primarily reflect difficulties in estimating $B(1)$. Our short-run identification results in Figure 2 are consistent with this claim, because distortions are minimal with short-run identification.

Using the True Value of B(1) in a Small Sample

A natural way to isolate the role of distortions in $\hat{B}(1)$ is to replace $\hat{B}(1)$ by its true value when estimating the effects of a technology shock. We perform this replacement for the two-shock CKM specification, and report the results in Figure 9. For convenience, the 1,1 panel of Figure 9 repeats our results for the two-shock CKM specification from the 3,1 panel in Figure 5. The 1,2 panel of Figure 9 shows the sampling properties of our estimator when the true value of $B(1)$ is used in repeated samples.
When we use the true value of $B(1)$ the bias completely disappears. In addition, coverage rates are much closer to 95 percent and the boundaries of the average confidence intervals are very close to the boundaries of the gray area.

Using an Alternative Zero-Frequency Spectral Density Estimator

In practice, the econometrician does not know $B(1)$. However, we can replace the VAR-based zero-frequency spectral density in (2.19) with an alternative estimator of $S_Y(0)$. Here, we consider the effects of using a standard Bartlett estimator:24

$$S_Y(0) = \sum_{k=-(T-1)}^{T-1} g(k)\, \hat{C}(k), \qquad g(k) = \begin{cases} 1 - \dfrac{|k|}{r}, & |k| \le r \\ 0, & |k| > r \end{cases}, \qquad (4.4)$$

where, after removing the sample mean from $Y_t$:

$$\hat{C}(k) = \frac{1}{T} \sum_{t=k+1}^{T} Y_t Y_{t-k}'.$$

23 A similar argument is presented in Ravenna (2005).

24 Christiano, Eichenbaum and Vigfusson (2006) also consider the estimator proposed by Andrews and Monahan (1992).
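The Bartlett estimator in (4.4) can be written compactly in code. The following is a minimal sketch, not the authors' implementation. The function name `bartlett_s0` is hypothetical, and the treatment of negative lags, $\hat{C}(-k) = \hat{C}(k)'$, is the standard convention and an assumption here, since the text states the sample autocovariance only for $k \ge 0$.

```python
import numpy as np

def bartlett_s0(Y, r):
    """Bartlett-weighted estimate of the zero-frequency spectral density, as in (4.4)."""
    Y = np.asarray(Y, dtype=float)
    Y = Y - Y.mean(axis=0)               # remove the sample mean
    T, n = Y.shape
    S = np.zeros((n, n))
    for k in range(-(T - 1), T):
        weight = 1.0 - abs(k) / r if abs(k) <= r else 0.0
        if weight == 0.0:
            continue
        m = abs(k)
        # C_hat(m) = (1/T) * sum_{t=m+1}^{T} Y_t Y_{t-m}'
        C_m = Y[m:].T @ Y[:T - m] / T
        # Assumed convention for negative lags: C_hat(-m) = C_hat(m)'.
        S += weight * (C_m if k >= 0 else C_m.T)
    return S

# Demonstration on hypothetical white-noise data with T = 180 and r = 150.
rng = np.random.default_rng(0)
Y_demo = rng.standard_normal((180, 2))
S_hat = bartlett_s0(Y_demo, r=150)
```

With `r = 1` only the lag-zero term receives positive weight, so the function returns the sample covariance matrix, which gives a simple check of the weighting scheme.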

We use essentially all possible covariances in the data by choosing a large value of $r$, $r = 150$.25 In some respects, our modified estimator is equivalent to running a VAR with longer lags.

We now assess the effect of our modified long-run estimator. The first two rows in Figure 5 present results for cases in which the data generating mechanism corresponds to our two- and three-shock MLE specifications. Both the standard estimator (the left column) and our modified estimator (the right column) exhibit little bias. In the case of the standard estimator, the econometrician’s estimator of standard errors understates somewhat the degree of sampling uncertainty associated with the impulse response functions. The modified estimator reduces this discrepancy. Specifically, the circles and stars in the right column of Figure 5 coincide closely with the boundary of the gray area. Coverage rates are reported in the 2,1 panels of Figures 3 and 4. In Figure 3, coverage rates now exceed 95 percent. The coverage rates in Figure 4 are much improved relative to the standard case. Indeed, these rates are now close to 95 percent. Significantly, the degree of sampling uncertainty associated with the modified estimator is not greater than that associated with the standard estimator. In fact, in some cases, sampling uncertainty declines slightly.

The last two rows of column 1 in Figure 5 display the results when the data generating process is a version of the CKM specification. As shown in the second column, the bias is essentially eliminated by using the modified estimator. Once again the circles and stars roughly coincide with the boundary of the gray area. Coverage rates for the percentile-based confidence intervals reported in Figure 3 again have a tendency to exceed 95 percent (2,2 panel). As shown in the 2,2 panel of Figure 4, coverage rates associated with the standard-deviation-based estimator are very close to 95 percent.
There is a substantial improvement over the coverage rates associated with the standard spectral density estimator.

Figure 5 indicates that when the standard estimator works well, the modified estimator also works well. When the standard estimator results in biases, the modified estimator removes them. These findings are consistent with the notion that the biases for the two CKM specifications reflect difficulties in estimating the spectral density at frequency zero. Given our finding that $\hat{V}$ is an accurate estimator of $V$, we conclude that the difficulties in estimating the zero-frequency spectral density in fact reflect problems with $B(1)$.

The second column of Figure 7 shows how our modified VAR-based estimator works when the data are generated by the various perturbations on the two-shock CKM specification. In every case, bias is substantially reduced.

²⁵ The rule of always setting the bandwidth, $r$, equal to sample size does not yield a consistent estimator of the spectral density at frequency zero. We assume that as sample size is increased beyond $T = 180$, the bandwidth is increased sufficiently slowly to achieve consistency.

Shifting Power to the Low Frequencies

Formula (4.1) suggests that, other things being equal, the more power there is near frequency zero, the less bias there is in $\hat{B}(1)$ and the better behaved is the estimated impulse response function to a technology shock. To pursue this observation we change the parameterization of the non-technology shock in the two-shock CKM specification. We reallocate power toward frequency zero, holding the variance of the shock constant, by increasing $\rho_l$ to 0.998 and suitably lowering $\sigma_l$ in (2.1). The results are reported in the 2,1 panel of Figure 9. The bias associated with the two-shock CKM specification almost completely disappears. This result is consistent with the notion that the bias problems with the two-shock CKM specification stem from difficulties in estimating $B(1)$.

The previous result calls into question conjectures in the literature (see Erceg, Guerrieri, and Gust, 2005). According to these conjectures, if there is more persistence in a non-technology shock, then the VAR will produce biased results because it will confuse the technology and non-technology shocks. Our result shows that this intuition is incomplete, because it fails to take into account all of the factors mentioned in our discussion of (4.1). To examine the effect of persistence, we consider a range of values of $\rho_l$ and show that the impact of $\rho_l$ on bias is in fact not monotone.

The 2,2 panel of Figure 9 displays the econometrician’s estimator of the contemporaneous impact on hours worked of a technology shock against $\rho_l$. The dashed line indicates the true contemporaneous effect of a technology shock on hours worked in the two-shock CKM specification. The dot-dashed line in the figure corresponds to the solution of (4.1), with $q = 4$, using the standard VAR-based estimator.²⁶ The star in the figure indicates the value of $\rho_l$ in the two-shock CKM specification. In the neighborhood of this value of $\rho_l$, the distortion in the estimator falls sharply as $\rho_l$ increases. Indeed, for $\rho_l = 0.9999$, essentially no distortion occurs. For values of $\rho_l$ in the region $(-0.5, 0.5)$, the distortion increases with increases in $\rho_l$.

The 2,2 panel of Figure 9 also allows us to assess the value of our proposed modification to the standard estimator. The line with diamonds displays the modified estimator of the contemporaneous impact on hours worked of a technology shock. When the standard estimator works well, that is, for large values of $\rho_l$, the modified and standard estimators produce similar results. However, when the standard estimator works poorly, e.g., for values of $\rho_l$ near 0.5, our modified estimator cuts the bias in half.
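The idea of shifting power toward frequency zero while holding the variance of the shock constant can be illustrated with a stylized example. The sketch below is not the law of motion (2.1) itself: it assumes a generic AR(1) process $x_t = \rho x_{t-1} + \sigma \varepsilon_t$ with unit-variance innovations, uses the convention that the spectral density integrates to the variance over $[-\pi, \pi]$, and all function names and numerical values are illustrative choices rather than anything taken from the paper.

```python
import math


def sigma_for_variance(rho, variance):
    """Innovation std. dev. that keeps the unconditional AR(1) variance fixed.

    The unconditional variance is sigma^2 / (1 - rho^2), so a higher rho
    requires a lower sigma to hold the variance constant.
    """
    return math.sqrt(variance * (1.0 - rho ** 2))


def ar1_spectrum(omega, rho, sigma):
    """Spectral density of x_t = rho * x_{t-1} + sigma * eps_t at frequency omega."""
    denom = 1.0 - 2.0 * rho * math.cos(omega) + rho ** 2
    return sigma ** 2 / (2.0 * math.pi * denom)


def power_share_below(cutoff, rho, variance=1.0, n_grid=20000):
    """Fraction of the variance located at frequencies |omega| <= cutoff.

    Uses a simple midpoint rule on [0, cutoff]; the factor 2 accounts for
    the symmetry of the spectrum around frequency zero.
    """
    sigma = sigma_for_variance(rho, variance)
    step = cutoff / n_grid
    area = sum(ar1_spectrum((k + 0.5) * step, rho, sigma) for k in range(n_grid)) * step
    return 2.0 * area / variance


if __name__ == "__main__":
    # Illustrative persistence values; the variance is held at one throughout.
    for rho in (0.5, 0.95, 0.998):
        sigma = sigma_for_variance(rho, 1.0)
        print(rho, sigma, ar1_spectrum(0.0, rho, sigma), power_share_below(0.1, rho))
```

In this stylized setting, raising the persistence parameter while lowering the innovation standard deviation leaves the total variance unchanged but concentrates a larger share of it in a narrow band around frequency zero.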
A potential shortcoming of the previous experiments is that persistent changes in $\tau_{l,t}$ do not necessarily induce very persistent changes in labor productivity. To assess the robustness of our results, we also considered what happens when there are persistent changes in $\tau_{x,t}$. These do have a persistent impact on labor productivity. In the two-shock CKM model, we set $\tau_{l,t}$ to a constant and allowed $\tau_{x,t}$ to be stochastic. We considered values of $\rho_x$ in the range $[-0.5, 1]$, holding the variance of $\tau_{x,t}$ constant. We obtain results similar to those reported in the 2,2 panel of Figure 9.

Short- and Long-Run Restrictions in a Recursive Model

We conclude this section by considering the recursive version of the two-shock CKM specification. This specification rationalizes estimating the impact on hours worked of a shock to technology using either the short- or the long-run identification strategy. We generate 1,000 data sets, each of length 180. On each synthetic data set, we estimate a four-lag, bivariate VAR. Given this estimated VAR, we can estimate the effect of a technology shock using the short- and long-run identification strategy. Figure 10 reports our results. For the long-run identification strategy, there is substantial bias. In sharp contrast, there is no bias for the short-run identification strategy. Because both procedures use the same estimated VAR parameters, the bias in the long-run identification strategy is entirely attributable to the use of $\hat{B}(1)$.

²⁶ Because (4.1) is a quadratic function, we solve the optimization problem by solving the linear first-order conditions. These are the Yule-Walker equations, which rely on population second moments of the data. We obtain the population second moments by complex integration of the reduced form of the model used to generate the data, as suggested by Christiano (2002).
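The contrast between the two identification strategies can be sketched in a few lines of code. The following Python fragment is a minimal illustration for the bivariate case, not the authors' implementation: the helper names, the variable ordering (the long-run-restricted variable and the technology shock come first), and the numerical values used for $B(1)$ and $V$ are all assumptions made for the example. Under a recursive short-run scheme, the impact matrix $C$ is the lower triangular factor of $V$ alone. Under a standard long-run scheme, $C$ also depends on the sum of the VAR lag matrices, $B(1)$, through the requirement that the long-run response matrix $[I - B(1)]^{-1}C$ be lower triangular.

```python
import math


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def transpose(a):
    return [[a[j][i] for j in range(2)] for i in range(2)]


def inverse(a):
    """Inverse of a 2x2 matrix."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det], [-a[1][0] / det, a[0][0] / det]]


def cholesky(s):
    """Lower triangular L with positive diagonal such that L L' = s (2x2 case)."""
    l11 = math.sqrt(s[0][0])
    l21 = s[1][0] / l11
    l22 = math.sqrt(s[1][1] - l21 ** 2)
    return [[l11, 0.0], [l21, l22]]


def short_run_impact(v):
    """Recursive (short-run) scheme: C is lower triangular with C C' = V."""
    return cholesky(v)


def long_run_impact(b_sum, v):
    """Long-run scheme: C satisfies C C' = V and [I - B(1)]^{-1} C is lower triangular.

    b_sum is B(1), the sum of the VAR lag matrices.
    """
    i_minus_b = [[(1.0 if i == j else 0.0) - b_sum[i][j] for j in range(2)] for i in range(2)]
    a_inv = inverse(i_minus_b)
    # Proportional to the spectral density of the data at frequency zero.
    s0 = matmul(matmul(a_inv, v), transpose(a_inv))
    d = cholesky(s0)  # long-run response matrix
    return matmul(i_minus_b, d)


if __name__ == "__main__":
    # Hypothetical values, chosen only so that the example runs.
    v = [[1.0, 0.3], [0.3, 0.5]]
    b_sum = [[0.1, 0.2], [0.0, 0.9]]
    print("short-run C:", short_run_impact(v))
    print("long-run C: ", long_run_impact(b_sum, v))
```

Because `short_run_impact` never uses $B(1)$, an error in the estimated sum of the lag matrices can affect only the long-run scheme, which is the logic behind the comparison reported in Figure 10.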

5. Relation to Chari-Kehoe-McGrattan

In the preceding sections we argue that structural VAR-based procedures have good statistical properties. Our conclusions about the usefulness of structural VARs stand in sharp contrast to the conclusions of CKM. These authors argue that, for plausibly parameterized RBC models, structural VARs lead to misleading results. They conclude that structural VARs are not useful for constructing and evaluating structural economic models. In this section we present the reasons we disagree with CKM.

CKM’s Exotic Data Generating Processes

CKM’s critique of VARs is based on simulations using particular DSGE models estimated by maximum likelihood methods. Here, we argue that their key results are driven by assumptions about measurement error. CKM’s measurement error assumptions are overwhelmingly rejected in favor of alternatives under which their key results are overturned.

CKM adopt a state-observer setup to estimate their model. Define:

$$Y_t = (\Delta \log a_t, \ \log l_t, \ \Delta \log i_t, \ \Delta \log G_t)',$$

where $G_t$ denotes government spending plus net exports. CKM suppose that

$$Y_t = X_t + v_t, \qquad E v_t v_t' = R, \tag{5.1}$$

where $R$ is diagonal, $v_t$ is a $4 \times 1$ vector of i.i.d. measurement errors and $X_t$ is a $4 \times 1$ vector containing the model’s implications for the variables in $Y_t$. The two-shock CKM specification has only the shocks $\tau_{l,t}$ and $z_t$. CKM model government spending plus net exports as:

$$G_t = g_t \times Z_t,$$

where $g_t$ is in principle an exogenous stochastic process. However, when CKM estimate the parameters of the technology and preferences processes, $\tau_{l,t}$ and $z_t$, they set the variance of the government spending shock to zero, so that $g_t$ is a constant. As a result, CKM assume that

$$\Delta \log G_t = \log z_t + \text{measurement error}.$$

CKM fix the elements on the diagonal of $R$ exogenously to a “small number”, leading to the remarkable implication that the growth rate of government purchases plus net exports, $\Delta \log G_t$, is roughly equal to $\log z_t$.
To demonstrate the sensitivity of CKM’s results to their specification of the magnitude of $R$, we consider the different assumptions that CKM make in different drafts of their paper. In the draft of May 2005, CKM set the diagonal elements of $R$ to 0.0001. In the draft of July 2005, CKM set the $i$th diagonal element of $R$ equal to 0.01 times the variance of the $i$th element of $Y_t$.

The 1,1 and 2,1 panels in Figure 11 report results corresponding to CKM’s two-shock specifications in the July and May drafts, respectively.²⁷ These panels display the log likelihood value (see LLF) of these two models and their implications for VAR-based impulse response functions (the 1,1 panel is the same as the 3,1 panel in Figure 5). Surprisingly, the log-likelihood of the July specification is orders of magnitude worse than that of the May specification.

²⁷ To ensure comparability of results we use CKM’s computer code and data, available on Ellen McGrattan’s webpage. The algorithm used by CKM to form the estimation criterion is essentially the same as the one we used to estimate our models. The only difference is that CKM use an approximation to the Gaussian function by working with the steady state Kalman gain. We form the exact Gaussian density function, in which the Kalman gain varies over dates, as described in Hamilton (1994). We believe this difference is inconsequential.

The 3,1 panel in Figure 11 displays our results when the diagonal elements of $R$ are included among the parameters being estimated.²⁸ We refer to the resulting specification as the “CKM free measurement error specification”. First, both the May and the July specifications are rejected relative to the free measurement error specification. The likelihood ratio statistics for testing the May and July specifications are 428 and 6,266, respectively. Under the null hypothesis that the May or July specification is true, these statistics are realizations of a chi-square distribution with 4 degrees of freedom. The evidence against CKM’s May or July specifications of measurement error is overwhelming. Second, when the data generating process is the CKM free measurement error specification, the VAR-based impulse response function is virtually unbiased (see the 3,1 panel in Figure 11). We conclude that the bias in the two-shock CKM specification is a direct consequence of CKM’s choice of the measurement error variance.

As noted above, CKM’s measurement error assumption has the implication that $\Delta \log G_t$ is roughly equal to $\log z_t$. To investigate the role played by this peculiar implication, we delete $\Delta \log G_t$ from $Y_t$ and re-estimate the system. We present the results in the right column of Figure 11. In each panel of that column, we re-estimate the system in the same way as the corresponding panel in the left column, except that $\Delta \log G_t$ is excluded from $Y_t$. Comparing the 2,1 and 2,2 panels, we see that, with the May measurement error specification, the bias disappears after relaxing CKM’s $\Delta \log G_t = \log z_t$ assumption.
Under the July specification of measurement error, the bias result remains even after relaxing CKM’s assumption (compare the 1,1 and 1,2 graphs of Figure 11). As noted above, the May specification of CKM’s model has a likelihood that is orders of magnitude higher than the July specification. So, in the version of the CKM model selected by the likelihood criterion (i.e., the May version), the $\Delta \log G_t = \log z_t$ assumption plays a central role in driving CKM’s bias result.

In sum, CKM’s examples, which imply that VARs with long-run identification display substantial bias, are not empirically interesting from a likelihood point of view. The bias in their examples is due to the way CKM choose the measurement error variance. When their measurement error specification is tested, it is overwhelmingly rejected in favor of an alternative in which the CKM bias result disappears.

²⁸ When generating the artificial data underlying the calculations in the 3,1 panel of Figure 11, we set the measurement error to zero. (The same assumption was made for all the results reported here.) However, simulations that include the estimated measurement error produce results that are essentially the same.

Stochastic Process Uncertainty

CKM argue that there is considerable uncertainty in the business cycle literature about the values of parameters governing stochastic processes such as preferences and technology. They argue that this uncertainty translates into a wide class of examples in which the bias in structural VARs leads to severely misleading inference. The right panel in Figure 12 summarizes their argument. The horizontal axis covers the range of values of $(\sigma_l/\sigma_z)^2$ considered by CKM. For each value of $(\sigma_l/\sigma_z)^2$ we estimate, by maximum likelihood, four parameters of the two-shock model: $\mu_z$, $\tau_l$, $\sigma_l$ and $\rho_l$.²⁹ We use the estimated model as a data generating process. The left vertical axis displays the small sample mean of the corresponding VAR-based estimator of the contemporaneous response of hours worked to a one-standard deviation technology shock.

Based on a review of the RBC literature, CKM report that they have a roughly uniform prior over the different values of $(\sigma_l/\sigma_z)^2$ considered in Figure 12. The figure indicates that for many of these values, the bias is large (compare the small sample mean, the solid line, with the true response, the starred line). For example, there is a noticeable bias in the two-shock CKM specification, where $(\sigma_l/\sigma_z)^2 = 1.1$.

We emphasize three points. First, as we stress repeatedly, bias cannot be viewed in isolation from sampling uncertainty. The two dashed lines in the figure indicate the 95 percent probability interval. These intervals are enormous relative to the bias. Second, not all values of $(\sigma_l/\sigma_z)^2$ are equally likely, and for the ones with greatest likelihood there is little bias. On the horizontal axis of the left panel of Figure 12, we display the same range of values of $(\sigma_l/\sigma_z)^2$ as in the right panel. On the vertical axis we report the log-likelihood value of the associated model. The peak of this likelihood occurs close to the estimated value in the two-shock MLE specification. Note how the log-likelihood value drops sharply as we consider values of $(\sigma_l/\sigma_z)^2$ away from the unconstrained maximum likelihood estimate. The vertical bars in the figure indicate the 95 percent confidence interval for $(\sigma_l/\sigma_z)^2$.³⁰ Figure 12 reveals that the confidence interval is very narrow relative to the range of values considered by CKM, and that within the interval, the bias is quite small. Third, the right axis in the right panel of Figure 12 plots $V_h$, the percent of the variance in log hours due to technology, as a function of $(\sigma_l/\sigma_z)^2$.
The values of $(\sigma_l/\sigma_z)^2$ for which there is a noticeable bias correspond to model economies where $V_h$ is less than 2 percent. Here, identifying the effects of a technology shock on hours worked is tantamount to looking for a needle in a haystack.

²⁹ We use CKM’s computer code and data to ensure comparability of results.

³⁰ The bounds of this interval are the upper and lower values of $(\sigma_l/\sigma_z)^2$ where twice the difference of the log-likelihood from its maximal value equals the critical value associated with the relevant likelihood ratio test.

The Metric for Assessing the Performance of Structural VARs

CKM emphasize comparisons between the true dynamic response function in the data generating process and the response function that an econometrician would estimate using a four-lag VAR with an infinite amount of data. In our own analysis in section 4, we find population calculations with four-lag VARs useful for some purposes. However, we do not view the probability limit of a four-lag VAR as an interesting metric for measuring the usefulness of structural VARs. In practice econometricians do not have an infinite amount of data. Even if they did, they would certainly not use a fixed lag length. Econometricians determine lag length endogenously and, in a large sample, lag length would grow. If lag lengths grow at the appropriate rate with sample size, VAR-based estimators of impulse response functions are consistent. The interesting issue (to us) is how VAR-based procedures perform in samples of the size that practitioners have at their disposal. This is why we focus on small sample properties like bias and sampling uncertainty.

Over-Differencing

The potential power of the CKM argument lies in showing that VAR-based procedures are misleading, even under circumstances when everyone would agree that VARs should work well, namely when the econometrician commits no avoidable specification error. The econometrician does, however, commit one unavoidable specification error. The true VAR is infinite-ordered, but the econometrician assumes the VAR has a finite number of lags. CKM argue that this seemingly innocuous assumption is fatal for VAR analysis. We have argued that this conclusion is unwarranted.

CKM present other examples in which the econometrician commits an avoidable specification error. Specifically, they study the consequences of over-differencing hours worked. That is, the econometrician first differences hours worked when hours worked are stationary.³¹ This error gives rise to bias in VAR-based impulse response functions that is large relative to sampling uncertainty. CKM argue that this bias is another reason not to use VARs. However, the observation that avoidable specification error is possible in VAR analysis is not a problem for VARs per se. The possibility of specification error is a potential pitfall for any type of empirical work. In any case, CKM’s analysis of the consequences of over-differencing is not new. For example, Christiano, Eichenbaum and Vigfusson (2003, hereafter, CEV) study a situation in which the true data generating process satisfies two properties: Hours worked are stationary and they rise after a positive technology shock. CEV then consider an econometrician who does VAR-based long-run identification when $Y_t$ in (2.16) contains the growth rate of hours rather than the log level of hours. CEV show that the econometrician would falsely conclude that hours worked fall after a positive technology shock. CEV do not conclude from this exercise that structural VARs are not useful. Rather, they develop a statistical procedure to help decide whether hours worked should be first differenced or not.
³¹ For technical reasons, CKM actually consider ‘quasi differencing’ hours worked using a differencing parameter close to unity. In small samples this type of quasi differencing is virtually indistinguishable from first differencing.

CKM Ignore Short-Run Identification Schemes

We argue that VAR-based short-run identification schemes lead to remarkably accurate and precise inference. This result is of interest because the preponderance of the empirical literature on structural VARs explores the implications of short-run identification schemes. CKM are silent on this literature. McGrattan (2006) dismisses short-run identification schemes as “hokey.” One possible interpretation of this adjective is that McGrattan can easily imagine models in which the identification scheme is incorrect. The problem with this interpretation is that all models are a collection of strong identifying assumptions, all of which can be characterized as “hokey”. A second interpretation is that, in McGrattan’s (2006) view, the type of zero restrictions typically used in short-run identification are not compatible with dynamic equilibrium theory. This view is simply incorrect (see Sims and Zha (2006)). A third possible interpretation is that no one finds short-run identifying assumptions interesting. However, the results of short-run identification schemes have had an enormous effect on the construction of dynamic, general equilibrium models. See Woodford (2003) for a summary in the context of monetary models.

Sensitivity of Some VAR Results to Data Choices

CKM argue that VARs are very sensitive to the choice of data. Specifically, they review the papers by Francis and Ramey (2004), CEV, and Gali and Rabanal (2004), which use long-run VAR methods to estimate the response of hours worked to a positive technology shock. CKM note that these studies use different measures of per capita hours worked and output in the VAR analysis. The bottom panel of Figure 13 displays the different measures of per capita hours worked that these studies use. Note how the low frequency properties of these series differ. The corresponding estimated impulse response functions and confidence intervals are reported in the top panel. CKM view it as a defect in VAR methodology that the different measures of hours worked lead to different estimated impulse response functions. We disagree. Empirical results should be sensitive to substantial changes in the data. A constructive response to the sensitivity in Figure 13 is to carefully analyze the different measures of hours worked, see which is more appropriate, and perhaps construct a better measure. It is not constructive to dismiss an econometric technique that signals the need for better measurement.

CKM note that the principal differences in the hours data occur in the early part of the sample. According to CKM, when they drop these early observations they obtain different impulse response functions. However, as Figure 13 shows, these impulse response functions are not significantly different from each other.

6. A Model with Nominal Rigidities

In this section we use the model in ACEL to assess the accuracy of structural VARs for estimating the dynamic response of hours worked to shocks. This model allows for nominal rigidities in prices and wages and has three shocks: a monetary policy shock, a neutral technology shock, and a capital-embodied technology shock. Both technology shocks affect labor productivity in the long run.
However, the only shock in the model that affects the price of investment in the long run is the capital-embodied technology shock. We use the ACEL model to evaluate the ability of a VAR to uncover the response of hours worked to both types of technology shock and to the monetary policy shock. Our strategy for identifying the two technology shocks is similar to the one proposed by Fisher (2006). The model rationalizes a version of the short-run, recursive identification strategy used by Christiano, Eichenbaum and Evans (1999) to identify monetary shocks. This strategy corresponds closely to the recursive procedure studied in section 2.3.2.

6.1. The Model

The details of the ACEL model, as well as the parameter estimates, are reported in Appendix A of the NBER Working Paper version of this paper. Here, we limit our discussion to what is necessary to clarify the nature of the shocks in the ACEL model. Final goods, $Y_t$, are produced using a standard Dixit-Stiglitz aggregator of intermediate goods, $y_t(i)$, $i \in (0,1)$. To produce a unit of consumption goods, $C_t$, one unit of final goods is required. To produce one unit of investment goods, $I_t$, $\Upsilon_t^{-1}$ units of final goods are required. In equilibrium, $\Upsilon_t^{-1}$ is the price, in units of consumption goods, of an investment good. Let $\mu_{\Upsilon,t}$ denote the growth rate of $\Upsilon_t$, let $\mu_\Upsilon$ denote the nonstochastic steady state value of $\mu_{\Upsilon,t}$, and let $\hat{\mu}_{\Upsilon,t}$ denote the percent deviation of $\mu_{\Upsilon,t}$ from its steady state value:

$$\mu_{\Upsilon,t} = \frac{\Upsilon_t}{\Upsilon_{t-1}}, \qquad \hat{\mu}_{\Upsilon,t} = \frac{\mu_{\Upsilon,t} - \mu_\Upsilon}{\mu_\Upsilon}. \tag{6.1}$$

The stochastic process for the growth rate of $\Upsilon_t$ is:

$$\hat{\mu}_{\Upsilon,t} = \rho_{\mu_\Upsilon} \hat{\mu}_{\Upsilon,t-1} + \sigma_{\mu_\Upsilon} \varepsilon_{\mu_\Upsilon,t}, \qquad \sigma_{\mu_\Upsilon} > 0. \tag{6.2}$$

We refer to the i.i.d. unit variance random variable, $\varepsilon_{\mu_\Upsilon,t}$, as the capital-embodied technology shock. ACEL assume that the intermediate good, $y_t(i)$, for $i \in (0,1)$ is produced using a Cobb-Douglas production function of capital and hours worked. This production function is perturbed by a multiplicative, aggregate technology shock denoted by $Z_t$. Let $z_t$ denote the growth rate of $Z_t$, let $z$ denote the nonstochastic steady state value of $z_t$, and let $\hat{z}_t$ denote the percentage deviation of $z_t$ from its steady state value:

$$z_t = \frac{Z_t}{Z_{t-1}}, \qquad \hat{z}_t = \frac{z_t - z}{z}. \tag{6.3}$$

The stochastic process for the growth rate of $Z_t$ is:

$$\hat{z}_t = \rho_z \hat{z}_{t-1} + \sigma_z \varepsilon_t^z, \qquad \sigma_z > 0, \tag{6.4}$$

where the i.i.d. unit variance random variable, $\varepsilon_t^z$, is the neutral shock to technology.

We now turn to the monetary policy shock. Let $x_t$ denote $M_t/M_{t-1}$, where $M_t$ denotes the monetary base. Let $\hat{x}_t$ denote the percentage deviation of $x_t$ from its steady state, i.e., $(x_t - x)/x$. We suppose that $\hat{x}_t$ is the sum of three components. One, $\hat{x}_{M,t}$, represents the component of $\hat{x}_t$ reflecting an exogenous shock to monetary policy. The other two, $\hat{x}_{z,t}$ and $\hat{x}_{\Upsilon,t}$, represent the endogenous response of $\hat{x}_t$ to the neutral and capital-embodied technology shocks, respectively. Thus monetary policy is given by:

$$\hat{x}_t = \hat{x}_{z,t} + \hat{x}_{\Upsilon,t} + \hat{x}_{M,t}. \tag{6.5}$$

ACEL assume that

$$\begin{aligned}
\hat{x}_{M,t} &= \rho_{xM} \hat{x}_{M,t-1} + \sigma_M \varepsilon_{M,t}, \qquad \sigma_M > 0, \\
\hat{x}_{z,t} &= \rho_{xz} \hat{x}_{z,t-1} + c_z \varepsilon_t^z + c_z^p \varepsilon_{t-1}^z, \\
\hat{x}_{\Upsilon,t} &= \rho_{x\Upsilon} \hat{x}_{\Upsilon,t-1} + c_\Upsilon \varepsilon_{\mu_\Upsilon,t} + c_\Upsilon^p \varepsilon_{\mu_\Upsilon,t-1}.
\end{aligned} \tag{6.6}$$

Here, $\varepsilon_{M,t}$ represents the shock to monetary policy and is an i.i.d. unit variance random variable.

Table 3 summarizes the importance of different shocks for the variance of hours worked and output.
Neutral and capital-embodied technology shocks account for roughly equal percentages of the variance of hours worked (40 percent each), while monetary policy shocks account for the remainder. Working with HP-filtered data reduces the importance of neutral technology shocks to about 18 percent. Monetary policy shocks become much more important for the variance of hours worked. A qualitatively similar picture emerges when we consider output. It is worth emphasizing that neutral technology shocks are much more important in hours worked in the ACEL model than in the RBC model. This fact plays an important role in determining the precision of VAR-based inference using long-run restrictions in the ACEL model.
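The exogenous processes in (6.2), (6.4), (6.5) and (6.6) are simple linear recursions, and a short simulation may help fix their timing conventions. The sketch below is only an illustration: the function names are hypothetical, the processes are started from zero, and the parameter values in `EXAMPLE_PARAMS` are placeholders chosen so that the code runs. They are not the ACEL estimates, which are reported in Appendix A of the NBER Working Paper version.

```python
import random


def ar1_path(rho, sigma, eps):
    """x_t = rho * x_{t-1} + sigma * eps_t, started from zero (form of (6.2) and (6.4))."""
    path, prev = [], 0.0
    for e in eps:
        prev = rho * prev + sigma * e
        path.append(prev)
    return path


def policy_response_path(rho_x, c, c_p, eps):
    """x_t = rho_x * x_{t-1} + c * eps_t + c_p * eps_{t-1} (last two lines of (6.6))."""
    path, prev_x, prev_e = [], 0.0, 0.0
    for e in eps:
        prev_x = rho_x * prev_x + c * e + c_p * prev_e
        prev_e = e
        path.append(prev_x)
    return path


# Placeholder values for illustration only; NOT the estimates used in the paper.
EXAMPLE_PARAMS = {
    "rho_z": 0.9, "sigma_z": 0.1,            # neutral technology, (6.4)
    "rho_mu": 0.2, "sigma_mu": 0.3,          # capital-embodied technology, (6.2)
    "rho_xM": 0.3, "sigma_M": 0.3,           # exogenous policy component, (6.6)
    "rho_xz": 0.3, "c_z": 2.0, "cp_z": 1.0,  # policy response to neutral shock
    "rho_xU": 0.8, "c_U": 0.2, "cp_U": 0.2,  # policy response to embodied shock
}


def simulate_money_growth(n_periods, params, seed=0):
    """Simulate the three shocks and the implied money growth deviation in (6.5)."""
    rng = random.Random(seed)
    eps_z = [rng.gauss(0.0, 1.0) for _ in range(n_periods)]
    eps_mu = [rng.gauss(0.0, 1.0) for _ in range(n_periods)]
    eps_m = [rng.gauss(0.0, 1.0) for _ in range(n_periods)]
    x_m = ar1_path(params["rho_xM"], params["sigma_M"], eps_m)
    x_z = policy_response_path(params["rho_xz"], params["c_z"], params["cp_z"], eps_z)
    x_u = policy_response_path(params["rho_xU"], params["c_U"], params["cp_U"], eps_mu)
    return {
        "z_hat": ar1_path(params["rho_z"], params["sigma_z"], eps_z),
        "mu_hat": ar1_path(params["rho_mu"], params["sigma_mu"], eps_mu),
        "x_M": x_m,
        "x_z": x_z,
        "x_upsilon": x_u,
        "x_hat": [a + b + c for a, b, c in zip(x_z, x_u, x_m)],  # equation (6.5)
    }


if __name__ == "__main__":
    sample = simulate_money_growth(180, EXAMPLE_PARAMS, seed=1)
    print(sample["x_hat"][:5])
```

Note that the same innovation sequence drives both a technology process and the corresponding endogenous component of money growth, which is how the model lets monetary policy respond to technology shocks.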

6.2. Results

We use the ACEL model to simulate 1,000 data sets, each with 180 observations. We report results from two different VARs. In the first VAR, we simultaneously estimate the dynamic effect on hours worked of a neutral technology shock and a capital-embodied technology shock. The variables in this VAR are:

$$Y_t = \begin{pmatrix} \Delta \ln p_{It} \\ \Delta \ln a_t \\ \ln l_t \end{pmatrix},$$

where $p_{It}$ denotes the price of capital in consumption units. The variable, $\ln(p_{It})$, corresponds to $\ln(\Upsilon_t^{-1})$ in the model. As in Fisher (2006), we identify the dynamic effects on $Y_t$ of the two technology shocks, using a generalization of the strategy in section 2.3.1.³² The details are provided in Appendix B of the NBER Working Paper version of this paper.

The 1,1 panel of Figure 14 displays our results using the standard VAR procedure to estimate the dynamic response of hours worked to a neutral technology shock. Several results are worth emphasizing. First, the estimator is essentially unbiased. Second, the econometrician’s estimator of sampling uncertainty is also reasonably unbiased. The circles and stars, which indicate the mean value of the econometrician’s standard-deviation-based and percentile-based confidence intervals, roughly coincide with the boundaries of the gray area. However, there is a slight tendency, in both cases, to understate the degree of sampling uncertainty. Third, confidence intervals are small, relative to those in the RBC examples. Both sets of confidence intervals exclude zero at all lags shown. This result provides another example, in addition to the one provided by Erceg, Guerrieri, and Gust (2005), in which long-run identifying restrictions are useful for discriminating between models. An econometrician who estimates that hours drop after a positive technology shock would reject our parameterization of the ACEL model.
Similarly, an econometrician with a model implying that hours fall after a positive technology shock would most likely reject that model if the actual data were generated by our parameterization of the ACEL model.

The 2,1 panel in Figure 14 shows results for the response to a capital-embodied technology shock as estimated using the standard VAR estimator. The sampling uncertainty is somewhat higher for this estimator than for the neutral technology shock. In addition, there is a slight amount of bias. The econometrician understates somewhat the degree of sampling uncertainty.

We now consider the response of hours worked to a monetary policy shock. We estimate this response using a VAR with the following variables:

$$Y_t = \begin{pmatrix} \Delta \log a_t \\ \log l_t \\ R_t \end{pmatrix}.$$

As discussed in Christiano, Eichenbaum, and Evans (1999), the monetary policy shock is identified by choosing $C$ to be the lower triangular decomposition of the variance-covariance matrix, $V$, of the VAR disturbances. That is, we choose a lower triangular matrix, $C$, with positive diagonal terms, such that $CC' = V$. Let $u_t = C\varepsilon_t$. We then interpret the last element of $\varepsilon_t$ as the monetary policy shock. According to the results in the 1,2 panel of Figure 14, the VAR-based estimator of the response of hours worked displays relatively little bias and is highly precise. In addition, the econometrician’s estimator of sampling uncertainty is virtually unbiased. Suppose the impulse response in hours worked to a monetary policy shock were computed using VAR-based methods with data generated from this model. We conjecture that a model in which money is neutral, or in which a monetary expansion drives hours worked down, would be easy to reject.

³² Our strategy differs somewhat from the one pursued in Fisher (2006), who applies a version of the instrumental variables strategy proposed by Shapiro and Watson (1988).

7. Concluding Remarks

In this paper we study the ability of structural VARs to uncover the response of hours worked to a technology shock. We consider two classes of data generating processes. The first class consists of a series of real business cycle models that we estimate using maximum likelihood methods. The second class consists of the monetary model in ACEL. We find that with short-run restrictions, structural VARs perform remarkably well in all our examples. With long-run restrictions, structural VARs work well as long as technology shocks explain at least a very small portion of the variation in hours worked.

In a number of examples that we consider, VAR-based impulse response functions using long-run restrictions exhibit some bias. Even though these examples do not emerge from empirically plausible data generating processes, we find them of interest. They allow us to diagnose what can go wrong with long-run identification schemes. Our diagnosis leads us to propose a modification to the standard VAR-based procedure for estimating impulse response functions using long-run identification. This procedure works well in our examples.

Finally, we find that confidence intervals with long-run identification schemes are substantially larger than those with short-run identification schemes. In all empirically plausible cases, the VARs deliver confidence intervals that accurately reflect the true degree of sampling uncertainty. We view this characteristic as a great virtue of VAR-based methods.
When the data contain little information, the VAR will indicate the lack of information. To reduce large confidence intervals the analyst must either impose additional identifying restrictions (i.e., use more theory) or obtain better data.

References

Altig, David, Lawrence Christiano, Martin Eichenbaum, and Jesper Linde (2005). “Firm-Specific Capital, Nominal Rigidities, and the Business Cycle,” NBER Working Paper Series 11034. Cambridge, Mass.: National Bureau of Economic Research, January.

Andrews, Donald W. K., and J. Christopher Monahan (1992). “An Improved Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimator,” Econometrica, vol. 60 (July), pp. 953-66.

Basu, Susanto, John Fernald, and Miles Kimball (2004). “Are Technology Improvements Contractionary?” NBER Working Paper Series 10592. Cambridge, Mass.: National Bureau of Economic Research, June.

Bernanke, Ben S. (1986). “Alternative Explanations of the Money-Income Correlation,” Carnegie Rochester Conference Series on Public Policy, vol. 25 (Autumn), pp. 49-99.

Bernanke, Ben S., and Alan S. Blinder (1992). “The Federal Funds Rate and the Channels of Monetary Transmission,” American Economic Review, vol. 82 (September), pp. 901-21.

Bernanke, Ben S., and Ilian Mihov (1998). “Measuring Monetary Policy,” Quarterly Journal of Economics, vol. 113 (August), pp. 869-902.

Blanchard, Olivier, and Roberto Perotti (2002). “An Empirical Characterization of the Dynamic Effects of Changes in Government Spending and Taxes on Output,” Quarterly Journal of Economics, vol. 117 (November), pp. 1329-68.

Blanchard, Olivier, and Danny Quah (1989). “The Dynamic Effects of Aggregate Demand and Supply Disturbances,” American Economic Review, vol. 79 (September), pp. 655-73.

Blanchard, Olivier, and Mark Watson (1986). “Are Business Cycles All Alike?” in Robert Gordon, ed., Continuity and Change in the American Business Cycle. Chicago: University of Chicago Press, pp. 123-56.

Chari, V. V., Patrick J. Kehoe, and Ellen McGrattan (2005a). “A Critique of Structural VARs Using Real Business Cycle Theory,” Working Paper Series 631. Minneapolis: Federal Reserve Bank of Minneapolis, May.

––– (2005b). “A Critique of Structural VARs Using Real Business Cycle Theory,” Working Paper Series 631. Minneapolis: Federal Reserve Bank of Minneapolis, July.

Christiano, Lawrence J. (1988). “Why Does Inventory Investment Fluctuate So Much?” Journal of Monetary Economics, vol. 21 (March-May), pp. 247-80.

––– (2002). “Solving Dynamic Equilibrium Models by a Method of Undetermined Coefficients,” Computational Economics, vol. 20 (October), pp. 21-55.

Christiano, Lawrence J., and Martin Eichenbaum (1992). “Identification and the Liquidity Effect of a Monetary Policy Shock,” in Alex Cukierman, Zvi Hercowitz, and Leonardo Leiderman, eds., Political Economy, Growth, and Business Cycles. Cambridge, Mass.: MIT Press, pp. 335-70.

Christiano, Lawrence J., Martin Eichenbaum, and Charles Evans (1999). “Monetary Policy Shocks: What Have We Learned and to What End?” in John B. Taylor and Michael D. Woodford, eds., Handbook of Macroeconomics, Volume 1A. Amsterdam: Elsevier Science, pp. 65-148.

Christiano, Lawrence J., Martin Eichenbaum, and Charles Evans (2005). “Nominal Rigidities and the Dynamic Effects of a Shock to Monetary Policy,” Journal of Political Economy, vol. 113 (February), pp. 1-45.

Christiano, Lawrence J., Martin Eichenbaum, and Robert Vigfusson (2003). “What Happens after a Technology Shock?” NBER Working Paper Series 9819. Cambridge, Mass.: National Bureau of Economic Research, July.

––– (2004). “The Response of Hours to a Technology Shock: Evidence Based on Direct Measures of Technology,” Journal of the European Economic Association, vol. 2 (April), pp. 381–95.
––– (forthcoming). “Alternative Procedures for Estimating Long-Run Identified Vector Autoregressions,” Journal of the European Economic Association.
Cooley, Thomas F., and Mark Dwyer (1998). “Business Cycle Analysis without Much Theory: A Look at Structural VARs,” Journal of Econometrics, vol. 83 (March–April), pp. 57–88.
Cushman, David O., and Tao Zha (1997). “Identifying Monetary Policy in a Small Open Economy under Flexible Exchange Rates,” Journal of Monetary Economics, vol. 39 (August), pp. 433–48.
Del Negro, Marco, Frank Schorfheide, Frank Smets, and Raf Wouters (2005). “On the Fit and Forecasting Performance of New Keynesian Models,” unpublished paper, University of Pennsylvania.
Eichenbaum, Martin, and Charles Evans (1995). “Some Empirical Evidence on the Effects of Shocks to Monetary Policy on Exchange Rates,” Quarterly Journal of Economics, vol. 110 (November), pp. 975–1009.
Erceg, Christopher J., Luca Guerrieri, and Christopher Gust (2005). “Can Long-Run Restrictions Identify Technology Shocks?” Journal of the European Economic Association, vol. 3 (December), pp. 1237–78.
Faust, Jon, and Eric Leeper (1997). “When Do Long-Run Identifying Restrictions Give Reliable Results?” Journal of Business and Economic Statistics, vol. 15 (July), pp. 345–53.
Fernandez-Villaverde, Jesus, Juan F. Rubio-Ramirez, and Thomas J. Sargent (2005). “A, B, C’s (and D’s) for Understanding VARs,” unpublished paper, New York University.
Fisher, Jonas (2006). “The Dynamic Effects of Neutral and Investment-Specific Technology Shocks,” Journal of Political Economy, vol. 114, no. 3 (June).
Francis, Neville, Michael T. Owyang, and Jennifer E. Roush (2005). “A Flexible Finite-Horizon Identification of Technology Shocks,” Working Paper Series 2005-024A. St. Louis: Federal Reserve Bank of St. Louis, April.
Francis, Neville, and Valerie A. Ramey (2005). “Is the Technology-Driven Real Business Cycle Hypothesis Dead? Shocks and Aggregate Fluctuations Revisited,” Journal of Monetary Economics, vol. 52 (November), pp. 1379–99.
Gali, Jordi (1999). “Technology, Employment, and the Business Cycle: Do Technology Shocks Explain Aggregate Fluctuations?” American Economic Review, vol. 89 (March), pp. 249–71.
Gali, Jordi, and Pau Rabanal (2005). “Technology Shocks and Aggregate Fluctuations: How Well Does the Real Business Cycle Model Fit Postwar U.S. Data?” in Mark Gertler and Kenneth Rogoff, eds., NBER Macroeconomics Annual 2004. Cambridge, Mass.: MIT Press.
Hamilton, James D. (1994). Time Series Analysis. Princeton: Princeton University Press.
––– (1997). “Measuring the Liquidity Effect,” American Economic Review, vol. 87 (March), pp. 80–97.
Hansen, Lars, and Thomas Sargent (1980). “Formulating and Estimating Dynamic Linear Rational Expectations Models,” Journal of Economic Dynamics and Control, vol. 2 (February), pp. 7–46.
King, Robert G., Charles I. Plosser, James H. Stock, and Mark W. Watson (1991). “Stochastic Trends and Economic Fluctuations,” American Economic Review, vol. 81 (September), pp. 819–40.
McGrattan, Ellen (2006). “A Critique of Structural VARs Using Business Cycle Theory,” presentation at the annual meeting of the American Economic Association, Boston, January 6–8, http://minneapolisfed.org/research/economists/mcgrattan/CEV/assa.htm.
Pagan, Adrian R., and John C. Robertson (1998). “Structural Models of the Liquidity Effect,” Review of Economics and Statistics, vol. 80 (May), pp. 202–17.

Ravenna, Federico (2005). “Vector Autoregressions and Reduced Form Representations of Dynamic Stochastic General Equilibrium Models,” unpublished manuscript.
Rotemberg, Julio J., and Michael Woodford (1992). “Oligopolistic Pricing and the Effects of Aggregate Demand on Economic Activity,” Journal of Political Economy, vol. 100 (December), pp. 1153–1207.
––– (1997). “An Optimization-Based Econometric Framework for the Evaluation of Monetary Policy,” in Ben S. Bernanke and Julio Rotemberg, eds., NBER Macroeconomics Annual 1997. Cambridge, Mass.: MIT Press.
Runkle, David E. (1987). “Vector Autoregressions and Reality,” Journal of Business and Economic Statistics, vol. 5 (October), pp. 437–42.
Shapiro, Matthew, and Mark Watson (1988). “Sources of Business Cycle Fluctuations,” in Stanley Fischer, ed., NBER Macroeconomics Annual 1988. Cambridge, Mass.: MIT Press.
Sims, Christopher (1972). “The Role of Approximate Prior Restrictions in Distributed Lag Estimation,” Journal of the American Statistical Association, vol. 67 (March), pp. 169–75.
––– (1980). “Macroeconomics and Reality,” Econometrica, vol. 48 (January), pp. 1–48.
––– (1986). “Are Forecasting Models Usable for Policy Analysis?” Federal Reserve Bank of Minneapolis, Quarterly Review, vol. 10 (Winter), pp. 2–16.
––– (1989). “Models and Their Uses,” American Journal of Agricultural Economics, vol. 71 (May), pp. 489–94.
Sims, Christopher, and Tao Zha (1999). “Error Bands for Impulse Responses,” Econometrica, vol. 67 (September), pp. 1113–55.
––– (forthcoming). “Does Monetary Policy Generate Recessions?” Macroeconomic Dynamics.
Smets, Frank, and Raf Wouters (2003). “An Estimated Dynamic Stochastic General Equilibrium Model of the Euro Area,” Journal of the European Economic Association, vol. 1 (September), pp. 1123–75.
Vigfusson, Robert J. (2004). “The Delayed Response to a Technology Shock: A Flexible Price Explanation,” International Finance Discussion Paper Series 2004-810. Washington: Board of Governors of the Federal Reserve System, July.
Woodford, Michael M. (2003). Interest and Prices: Foundations of a Theory of Monetary Policy. Princeton: Princeton University Press.

A. A Model with Nominal Wage and Price Rigidities

This appendix describes the ACEL model used in section 6. The model economy is composed of households, firms, and a monetary authority.

There is a continuum of households, indexed by $j \in (0,1)$. The jth household is a monopoly supplier of a differentiated labor service, and sets its wage subject to Calvo-style wage frictions. In general, households earn different wage rates and work different amounts. A straightforward extension of arguments in Erceg, Henderson, and Levin (2000) and in Woodford (1996) establishes that in the presence of state contingent securities, households are homogeneous with respect to consumption and asset holdings.33 Our notation reflects this result. The preferences of the jth household are given by:

$$E_t^j \sum_{l=0}^{\infty} \beta^{l-t} \left[ \log\left(C_{t+l} - b\,C_{t+l-1}\right) - \psi_L \frac{h_{j,t+l}^2}{2} \right],$$

where $\psi_L \geq 0$ and $E_t^j$ is the time t expectation operator, conditional on household j’s time t information set. The variable, $C_t$, denotes time t consumption and $h_{j,t}$ denotes time t hours worked. The household’s asset evolution equation is given by:

$$M_{t+1} = R_t\left[M_t - Q_t + (x_t - 1)M_t^a\right] + A_{j,t} + Q_t + W_{j,t}h_{j,t} + D_t - \left(1 + \eta(V_t)\right)P_t C_t.$$

Here, $M_t$ and $Q_t$ denote, respectively, the household’s stock of money, and cash balances at the beginning of period t. The variable $W_{j,t}$ represents the nominal wage rate at time t. In addition, $D_t$ and $A_{j,t}$ denote firm profits and the net cash inflow from participating in state-contingent security markets at time t. The variable, $x_t$, represents the gross growth rate of the economy-wide per capita stock of money, $M_t^a$. The quantity $(x_t - 1)M_t^a$ is a lump-sum payment to households by the monetary authority. The household deposits $M_t - Q_t + (x_t - 1)M_t^a$ with a financial intermediary. The variable, $R_t$, denotes the gross interest rate.
The variable, $V_t$, denotes the time t velocity of the household’s cash balances:

$$V_t = \frac{P_t C_t}{Q_t}, \qquad \text{(A.1)}$$

where $\eta(V_t)$ is increasing and convex.34 For the quantitative analysis of our model, we must specify the level and the first two derivatives of the transactions function, $\eta(V)$, evaluated in steady state. We denote these by $\eta$, $\eta'$, and $\eta''$, respectively. Let $\epsilon$ denote the interest semi-elasticity of money demand in steady state:

$$\epsilon \equiv -\frac{100 \times d\log\left(Q_t/P_t\right)}{400 \times dR_t}.$$

Let $V$ and $\eta$ denote the values of velocity and $\eta(V_t)$ in steady state. ACEL parameterize the second-order Taylor series expansion of $\eta(\cdot)$ about steady state. The values of $\eta$, $\eta'$, and $\eta''$ are determined by ACEL’s estimates of $\epsilon$, $V$, and $\eta$.

33 Erceg, Christopher J., Dale W. Henderson, and Andrew T. Levin (2000). “Optimal Monetary Policy with Staggered Wage and Price Contracts,” Journal of Monetary Economics, vol. 46 (October), pp. 281–313. Woodford, Michael M. (1996). “Control of the Public Debt: A Requirement for Price Stability?” NBER Working Paper Series 5684. Cambridge, Mass.: National Bureau of Economic Research, July.

34 Similar specifications have been used by authors such as Sims (1994) and Schmitt-Grohé and Uribe (2004). (Schmitt-Grohé, Stefanie, and Martin Uribe (2004). “Optimal Fiscal and Monetary Policy under Sticky Prices,” Journal of Economic Theory, vol. 114 (February), pp. 198–230. Sims, Christopher (1994). “A Simple Model for Study of the Determination of the Price Level and the Interaction of Monetary and Fiscal Policy,” Economic Theory, vol. 4 (3), pp. 381–99.)

The jth household is a monopoly supplier of a differentiated labor service, $h_{j,t}$. It sells this service to a representative, competitive firm that transforms it into an aggregate labor input, $H_t$, using the technology:

$$H_t = \left[\int_0^1 h_{j,t}^{1/\lambda_w}\,dj\right]^{\lambda_w}, \qquad 1 \leq \lambda_w < \infty.$$

Let $W_t$ denote the aggregate wage rate, i.e., the nominal price of $H_t$. The household takes $H_t$ and $W_t$ as given.

In each period, a household faces a constant probability, $1 - \xi_w$, of being able to re-optimize its nominal wage. The ability to re-optimize is independent across households and time. If a household cannot re-optimize its wage at time t, it sets $W_{j,t}$ according to:

$$W_{j,t} = \pi_{t-1}\,\mu_{z^*}\,W_{j,t-1},$$

where $\pi_{t-1} \equiv P_{t-1}/P_{t-2}$. The presence of $\mu_{z^*}$ implies that there are no distortions from wage dispersion along the steady state growth path.

At time t a final consumption good, $Y_t$, is produced by a perfectly competitive, representative final good firm. This firm produces the final good by combining a continuum of intermediate goods, indexed by $i \in [0,1]$, using the technology

$$Y_t = \left[\int_0^1 y_t(i)^{1/\lambda_f}\,di\right]^{\lambda_f}, \qquad \text{(A.2)}$$

where $1 \leq \lambda_f < \infty$ and $y_t(i)$ denotes the time t input of intermediate good i. The firm takes its output price, $P_t$, and its input prices, $P_t(i)$, as given and beyond its control.

Intermediate good i is produced by a monopolist using the following technology:

$$y_t(i) = \begin{cases} K_t(i)^{\alpha}\left(Z_t h_t(i)\right)^{1-\alpha} - \phi z_t^* & \text{if } K_t(i)^{\alpha}\left(Z_t h_t(i)\right)^{1-\alpha} \geq \phi z_t^* \\ 0 & \text{otherwise} \end{cases} \qquad \text{(A.3)}$$

where $0 < \alpha < 1$.
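The aggregator (A.2) and the technology (A.3) can be illustrated numerically. The following is a minimal sketch, not ACEL's code: it approximates the integral in (A.2) by an equally weighted average over a finite list of intermediate goods, and the function names and parameter values (`alpha`, `phi`, `lambda_f`) are hypothetical placeholders rather than estimates from the paper.

```python
def intermediate_output(K, h, Z, z_star, alpha=0.36, phi=0.1):
    """Technology (A.3): Cobb-Douglas output net of the fixed cost phi * z_star.

    Returns 0 when gross output does not cover the fixed cost.
    alpha and phi are illustrative values, not estimates from the paper.
    """
    gross = K ** alpha * (Z * h) ** (1.0 - alpha)
    fixed_cost = phi * z_star
    return gross - fixed_cost if gross >= fixed_cost else 0.0


def final_output(y, lambda_f=1.2):
    """Aggregator (A.2), with the integral over i in [0, 1] replaced by
    an equally weighted average over the goods listed in y."""
    mean_term = sum(y_i ** (1.0 / lambda_f) for y_i in y) / len(y)
    return mean_term ** lambda_f


# Symmetric case: every intermediate firm produces the same amount.
y_sym = [intermediate_output(K=10.0, h=1.0, Z=1.0, z_star=1.0)] * 100
Y_sym = final_output(y_sym)

# Dispersed case with the same average input: aggregate output is lower
# whenever lambda_f > 1, because the aggregator is concave in each y(i).
y_disp = [0.5 * y_sym[0]] * 50 + [1.5 * y_sym[0]] * 50
Y_disp = final_output(y_disp)
print(Y_sym, Y_disp)
```

In the symmetric case the aggregator returns the common input level, while dispersion across goods lowers aggregate output for a given average input when `lambda_f` exceeds one.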
In (A.3), $h_t(i)$ and $K_t(i)$ denote time t labor and capital services used to produce the ith intermediate good. The variable $Z_t$ represents a time t shock to the technology for producing intermediate output. The growth rate of $Z_t$, $Z_t/Z_{t-1}$, is denoted by $\mu_{zt}$. The non-negative scalar, $\phi$, parameterizes fixed costs of production. To express the model in terms of a stochastic steady state, we find it useful to define the variable $z_t^*$ as:

$$z_t^* = \Upsilon_t^{\frac{\alpha}{1-\alpha}} Z_t, \qquad \text{(A.4)}$$

where $\Upsilon_t$ represents a time t shock to capital-embodied technology. The stochastic process generating $Z_t$ is defined by (6.3) and (6.4). The stochastic process generating $\Upsilon_t$ is defined by (6.1) and (6.2).

Intermediate good firms hire labor in perfectly competitive factor markets at the wage rate, $W_t$. Profits are distributed to households at the end of each time period. We assume that the firm must borrow the wage bill in advance at the gross interest rate, $R_t$.

In each period, the ith intermediate goods firm faces a constant probability, $1 - \xi_p$, of being able to re-optimize its nominal price. The ability to re-optimize prices is independent across firms and time. If firm i cannot re-optimize, it sets $P_t(i)$ according to:

$$P_t(i) = \pi_{t-1} P_{t-1}(i). \qquad \text{(A.5)}$$

Let $\bar{K}_t(i)$ denote the physical stock of capital available to the ith firm at the beginning of period t. The services of capital, $K_t(i)$, are related to the stock of physical capital by:

$$K_t(i) = u_t(i)\,\bar{K}_t(i).$$

Here $u_t(i)$ is firm i’s capital utilization rate. The cost, in investment goods, of setting the utilization rate to $u_t(i)$ is $a(u_t(i))\bar{K}_t(i)$, where $a(\cdot)$ is increasing and convex. We assume that $u_t(i) = 1$ in steady state and $a(1) = 0$. These two conditions determine the level and slope of $a(\cdot)$ in steady state. To implement our log-linear solution method, we must also specify a value for the curvature of $a$ in steady state, $\sigma_a = a''(1)/a'(1) \geq 0$.

There is no technology for transferring capital between firms. The only way a firm can change its stock of physical capital is by varying the rate of investment, $I_t(i)$, over time. The technology for accumulating physical capital by intermediate good firm i is given by:

$$F\left(I_t(i), I_{t-1}(i)\right) = \left(1 - S\!\left(\frac{I_t(i)}{I_{t-1}(i)}\right)\right) I_t(i),$$

where

$$\bar{K}_{t+1}(i) = (1 - \delta)\bar{K}_t(i) + F\left(I_t(i), I_{t-1}(i)\right).$$

The adjustment cost function, $S$, satisfies $S = S' = 0$, and $S'' > 0$ in steady state.
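The accumulation technology above can be sketched in a few lines. Because the paper restricts $S$ only at steady state, the quadratic form used here, $S(x) = (\kappa/2)(x - \mu)^2$, is purely an assumption chosen so that $S = S' = 0$ and $S'' = \kappa > 0$ at a steady-state investment growth rate $\mu$. The names `kappa`, `mu`, and `delta`, and their values, are hypothetical.

```python
def adjustment_cost(x, mu=1.0, kappa=2.5):
    """Hypothetical S(x): zero level and slope at x = mu, curvature kappa > 0."""
    return 0.5 * kappa * (x - mu) ** 2


def effective_investment(I_t, I_prev, mu=1.0, kappa=2.5):
    """F(I_t, I_{t-1}) = (1 - S(I_t / I_{t-1})) * I_t."""
    return (1.0 - adjustment_cost(I_t / I_prev, mu, kappa)) * I_t


def next_capital(K_bar, I_t, I_prev, delta=0.025, mu=1.0, kappa=2.5):
    """K_bar_{t+1} = (1 - delta) * K_bar_t + F(I_t, I_{t-1})."""
    return (1.0 - delta) * K_bar + effective_investment(I_t, I_prev, mu, kappa)


# Constant investment (growth rate equal to mu): no adjustment cost is paid.
K_steady = next_capital(K_bar=40.0, I_t=1.0, I_prev=1.0)

# A 10 percent jump in investment: part of it is lost to adjustment costs.
K_jump = next_capital(K_bar=40.0, I_t=1.1, I_prev=1.0)
print(K_steady, K_jump)
```

With these illustrative numbers, investment at its steady-state growth rate adds one-for-one to the capital stock, whereas a sudden change in the investment rate adds slightly less than the amount invested.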
Given the log-linearization procedure used to solve the model, we need not specify any other features of the function $S$.

The present discounted value of the ith intermediate good’s net cash flow is given by:

$$E_t \sum_{j=0}^{\infty} \beta^j \upsilon_{t+j} \left\{ P_{t+j}(i)\,y_{t+j}(i) - R_{t+j} W_{t+j} h_{t+j}(i) - P_{t+j} \Upsilon_{t+j}^{-1} \left[ I_{t+j}(i) + a\left(u_{t+j}(i)\right)\bar{K}_{t+j}(i) \right] \right\}, \qquad \text{(A.6)}$$

where $R_t$ denotes the gross nominal rate of interest.

The monetary policy rule is defined by (6.5) and (6.6). Financial intermediaries receive $M_t - Q_t + (x_t - 1)M_t$ from the household. Our notation reflects the equilibrium condition,

$M_t^a = M_t$. Financial intermediaries lend all of their money to intermediate good firms, which use the funds to pay labor wages. Loan market clearing requires that:

$$W_t H_t = x_t M_t - Q_t. \qquad \text{(A.7)}$$

The aggregate resource constraint is:

$$\left(1 + \eta(V_t)\right) C_t + \Upsilon_t^{-1}\left[I_t + a(u_t)\bar{K}_t\right] \leq Y_t. \qquad \text{(A.8)}$$

We refer the reader to ACEL for a description of how the model is solved and for the methodology used to estimate the model parameters. The data and programs, as well as an extensive technical appendix, may be found at the following website: www.faculty.econ.northwestern.edu/faculty/christiano/research/ACEL/acelweb.htm.

B. Long-Run Identification of Two Technology Shocks

This appendix generalizes the strategy for long-run identification of one shock to two shocks, using the strategy of Fisher (2006). As before, the VAR is:

$$Y_{t+1} = B(L)Y_t + u_{t+1}, \qquad E u_t u_t' = V,$$
$$B(L) \equiv B_1 + B_2 L + \dots + B_q L^{q-1}.$$

We suppose that the fundamental shocks are related to the VAR disturbances as follows:

$$u_t = C\varepsilon_t, \qquad E\varepsilon_t \varepsilon_t' = I, \qquad CC' = V,$$

where the first two elements in $\varepsilon_t$ are $\varepsilon_{\mu_\Upsilon,t}$ and $\varepsilon_t^z$, respectively. The exclusion restrictions are:

$$\lim_{j \to \infty} \left[ \tilde{E}_t a_{t+j} - \tilde{E}_{t-1} a_{t+j} \right] = f_z\left(\varepsilon_{\mu_\Upsilon,t}, \varepsilon_t^z, \text{only}\right)$$
$$\lim_{j \to \infty} \left[ \tilde{E}_t \log p_{I,t+j} - \tilde{E}_{t-1} \log p_{I,t+j} \right] = f_\Upsilon\left(\varepsilon_{\mu_\Upsilon,t}, \text{only}\right).$$

That is, only technology shocks have a long-run effect on the log-level of labor productivity, whereas only capital-embodied shocks have a long-run effect on the log-level of the price of investment goods. According to the sign restrictions, the slope of $f_z$ with respect to its second argument and the slope of $f_\Upsilon$ are non-negative. Applying a suitably modified version of the logic in Section 2.3.1, we conclude that, according to the exclusion restrictions, the indicated pattern of zeros must appear in the following 3 by 3 matrix:

$$[I - B(1)]^{-1} C = \begin{bmatrix} a & 0 & 0 \\ b & c & 0 \\ \text{number} & \text{number} & \text{number} \end{bmatrix}$$

The sign restrictions are $a, c > 0$.
To compute the dynamic response of $Y_t$ to the two technology shocks, we require the first two columns of $C$. To obtain these, we proceed as follows. Let $D \equiv [I - B(1)]^{-1} C$, so that:

$$DD' = [I - B(1)]^{-1}\, V\, [I - B(1)']^{-1} = S_Y(0), \qquad \text{(B.1)}$$

where, as before, $S_Y(0)$ is the spectral density of $Y_t$ at frequency-zero, as implied by the estimated VAR. The exclusion restrictions require that $D$ have the following structure:

$$D = \begin{bmatrix} d_{11} & 0 & 0 \\ d_{21} & d_{22} & 0 \\ d_{31} & d_{32} & d_{33} \end{bmatrix}.$$

Here, the zero restrictions reflect our exclusion restrictions, and the sign restrictions require $d_{11}, d_{22} \geq 0$. Then,

$$DD' = \begin{bmatrix} d_{11}^2 & d_{11}d_{21} & d_{11}d_{31} \\ d_{21}d_{11} & d_{21}^2 + d_{22}^2 & d_{21}d_{31} + d_{22}d_{32} \\ d_{31}d_{11} & d_{31}d_{21} + d_{32}d_{22} & d_{31}^2 + d_{32}^2 + d_{33}^2 \end{bmatrix} = \begin{bmatrix} S_Y^{11}(0) & S_Y^{21}(0) & S_Y^{31}(0) \\ S_Y^{21}(0) & S_Y^{22}(0) & S_Y^{32}(0) \\ S_Y^{31}(0) & S_Y^{32}(0) & S_Y^{33}(0) \end{bmatrix}$$

and

$$d_{11} = \sqrt{S_Y^{11}(0)}, \qquad d_{21} = S_Y^{21}(0)/d_{11}, \qquad d_{31} = S_Y^{31}(0)/d_{11},$$
$$d_{22} = \sqrt{\frac{S_Y^{11}(0)\,S_Y^{22}(0) - \left(S_Y^{21}(0)\right)^2}{S_Y^{11}(0)}}, \qquad d_{32} = \frac{S_Y^{32}(0) - S_Y^{21}(0)\,S_Y^{31}(0)/d_{11}^2}{d_{22}}.$$

The sign restrictions imply that the square roots should be positive. The fact that $S_Y(0)$ is positive definite ensures that the square roots are real numbers. Finally, the first two columns of $C$ are calculated as follows:

$$\left[ C_1 \,\vdots\, C_2 \right] = [I - B(1)] \left[ D_1 \,\vdots\, D_2 \right],$$

where $C_i$ is the ith column of $C$ and $D_i$ is the ith column of $D$, i = 1,2.

To construct our modified VAR procedure, simply replace $S_Y(0)$ in (B.1) by (4.4).
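The steps of this appendix can be sketched in code. The block below is an illustration under stated assumptions, not the authors' programs: the 3 by 3 matrices `S_Y0` and `B_one` are made-up inputs standing in for an estimated $S_Y(0)$ and $B(1)$, and the function names are hypothetical. The formulas for $d_{11}$, $d_{21}$, $d_{31}$, $d_{22}$, and $d_{32}$ are the ones given above; $d_{33}$ is added only to complete the factor, which coincides with the lower-triangular Cholesky factor of $S_Y(0)$.

```python
import math


def long_run_factor(S):
    """Lower-triangular D with D D' = S and non-negative d11, d22,
    using the closed-form expressions for the first two columns."""
    d11 = math.sqrt(S[0][0])
    d21 = S[1][0] / d11
    d31 = S[2][0] / d11
    d22 = math.sqrt((S[0][0] * S[1][1] - S[1][0] ** 2) / S[0][0])
    d32 = (S[2][1] - S[1][0] * S[2][0] / d11 ** 2) / d22
    # d33 is not needed for the two technology shocks; included for completeness.
    d33 = math.sqrt(S[2][2] - d31 ** 2 - d32 ** 2)
    return [[d11, 0.0, 0.0], [d21, d22, 0.0], [d31, d32, d33]]


def impact_columns(B1, D):
    """First two columns of C, from C_i = [I - B(1)] D_i for i = 1, 2."""
    n = len(B1)
    I_minus_B = [[(1.0 if r == c else 0.0) - B1[r][c] for c in range(n)]
                 for r in range(n)]
    return [[sum(I_minus_B[r][k] * D[k][i] for k in range(n)) for r in range(n)]
            for i in range(2)]


# Hypothetical inputs: a symmetric positive definite S_Y(0) and a B(1).
S_Y0 = [[4.0, 2.0, 1.0],
        [2.0, 5.0, 3.0],
        [1.0, 3.0, 6.0]]
B_one = [[0.5, 0.1, 0.0],
         [0.0, 0.4, 0.1],
         [0.1, 0.0, 0.3]]

D = long_run_factor(S_Y0)
C1, C2 = impact_columns(B_one, D)
print(D)
print(C1, C2)
```

Replacing `S_Y0` by an alternative estimate of the zero-frequency spectral density, as in the modified procedure, changes only the input to `long_run_factor`; the remaining steps are unchanged.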

[Figure 1: A Simulated Time Series for Hours. Panels: Two-Shock MLE Specification; Two-Shock CKM Specification. Series: Both Shocks; Only Technology Shocks. Axes: ln(Hours) against Time.]

[Figure 2: Short-run Identification Results. Panels: Two-shock MLE specification; Two-shock CKM specification; Three-shock MLE specification; Three-shock CKM specification. Legend: True Response; Sampling Distribution; Estimated Response; Average CI Standard Deviation Based; Average CI Percentile Based. Axes: Percent against Period After Shock.]

[Figure 3: Coverage Rates, Percentile-Based Confidence Intervals. Panels: Short-Run Identification (Recursive MLE; Recursive CKM); Long-Run Identification (Nonrecursive MLE; Nonrecursive CKM). Legend: 2 Shock Standard; 2 Shock Bartlett; 3 Shock Standard; 3 Shock Bartlett. Horizontal axis: Period After Shock.]

[Figure 4: Coverage Rates, Standard Deviation-Based Confidence Intervals. Panels: Short-Run Identification (Recursive MLE; Recursive CKM); Long-Run Identification (Nonrecursive MLE; Nonrecursive CKM). Legend: 2 Shock Standard; 2 Shock Bartlett; 3 Shock Standard; 3 Shock Bartlett. Horizontal axis: Period After Shock.]

[Figure 5: Long-run Identification Results. Panels: Two-shock MLE specification, Standard and Bartlett; Three-shock MLE specification, Standard and Bartlett; Two-shock CKM specification, Standard and Bartlett; Three-shock CKM specification, Standard and Bartlett. Axes: Percent against Period After Shock.]

[Figure 6: Analyzing Precision in Inference. Panels: CKM specification with $\sigma_l$, $\sigma_l/2$, and $\sigma_l/4$; MLE specification with $\sigma_l$, $\sigma_l/2$, and $\sigma_l/4$. Axes: Percent against Period After Shock.]

[Figure 7: Varying the Labor Elasticity in the Two-shock CKM Specification. Panels: $\sigma = 6$, Standard and Bartlett; $\sigma = 0$, Standard and Bartlett; $\sigma = 0$ and $2\sigma_l$, Standard and Bartlett. Axes: Percent against Period After Shock.]

[Figure 8: Impact of $C_1$ on Distortions. Legend: Standard Long-Run Identification; Response Using Estimated B(L) and True $C_1$; True Response. Axes: Percent against Period After Shock.]

[Figure 9: Analysis of Long-run Identification Results. Panels: Two-shock CKM specification, Standard; Two-shock CKM specification, True B(1); Increased Persistence in Preference Shock ($\rho_l = 0.998$, $\sigma_l = 0.0028$); Contemporaneous Impact of Technology on Hours (plotted against $\rho_l$). Legend: Standard; Bartlett; True Response.]

[Figure 10: Comparing Long- and Short-Run Identification, Recursive Two-Shock CKM Specification. Panels: Short-Run Identification; Long-Run Identification. Legend: True Response; Sampling Distribution; Estimated Response; Average CI Standard Deviation Based; Average CI Percentile Based. Axes: Percent against Period After Shock.]

[Figure 11: The Treatment of CKM Measurement Error. Panels: CKM specification, July measurement error (LLF = -329); CKM July measurement error, No G (LLF = 1833); CKM May measurement error (LLF = 2590); CKM May measurement error, No G (LLF = 2034); CKM Free measurement error (LLF = 2804); CKM Free measurement error, No G (LLF = 2188). Axes: Percent against Period After Shock.]

[Figure 12: Stochastic Process Uncertainty. Panels: Concentrated Likelihood Function (with 95 Percent Critical Value); Technology Shocks and Hours (Percent Response of Hours to a Technology Shock, with 95 Percent Confidence Interval); Percent of variance in log employment due to technology ($V_h = 3.8$ and $V_h = 0.8$ marked). Horizontal axis: Ratio of Innovation Variances, $(\sigma_l/\sigma_z)^2$.]

[Figure 13: Data Sensitivity and Inference in VARs. Panels: Estimated Hours Response Starting in 1948 (FR Data; CEV Data; GR Data); Estimated Hours Response Starting in 1959 (FR Data; CEV Data; GR Data); Hours Per Capita, 2000q1 = 1 (FR; CEV; GR). Response panels plot Percent against Period After the Shock.]

[Figure 14: Impulse Response Results when the ACEL Model is the DGP. Panels: Neutral Technology Shock; Monetary Policy Shock; Investment-Specific Technology Shock. Axes: Percent against Period After Shock.]

Table 1: Contribution of Technology Shocks to Volatility

Measure of variation: Unfiltered ($\ln l_t$, $\Delta\ln y_t$); HP-filtered ($\ln l_t$, $\ln y_t$); One-step-ahead forecast error ($\ln l_t$, $\Delta\ln y_t$).

| Model specification | Unfiltered $\ln l_t$ | Unfiltered $\Delta\ln y_t$ | HP-filtered $\ln l_t$ | HP-filtered $\ln y_t$ | Forecast error $\ln l_t$ | Forecast error $\Delta\ln y_t$ |
|---|---|---|---|---|---|---|
| MLE, Base, Nonrecursive | 3.73 | 67.16 | 7.30 | 67.14 | 7.23 | 67.24 |
| MLE, Base, Recursive | 3.53 | 58.47 | 6.93 | 64.83 | 0.00 | 57.08 |
| MLE, $\sigma_l/2$, Nonrecursive | 13.40 | 89.13 | 23.97 | 89.17 | 23.77 | 89.16 |
| MLE, $\sigma_l/2$, Recursive | 12.73 | 84.93 | 22.95 | 88.01 | 0.00 | 84.17 |
| MLE, $\sigma_l/4$, Nonrecursive | 38.12 | 97.06 | 55.85 | 97.10 | 55.49 | 97.08 |
| MLE, $\sigma_l/4$, Recursive | 36.67 | 95.75 | 54.33 | 96.68 | 0.00 | 95.51 |
| MLE, $\sigma = 6$, Nonrecursive | 3.26 | 90.67 | 6.64 | 90.70 | 6.59 | 90.61 |
| MLE, $\sigma = 6$, Recursive | 3.07 | 89.13 | 6.28 | 90.10 | 0.00 | 88.93 |
| MLE, $\sigma = 0$, Nonrecursive | 4.11 | 53.99 | 7.80 | 53.97 | 7.73 | 54.14 |
| MLE, $\sigma = 0$, Recursive | 3.90 | 41.75 | 7.43 | 50.90 | 0.00 | 38.84 |
| MLE, Three, Nonrecursive | 0.18 | 45.67 | 3.15 | 45.69 | 3.10 | 45.72 |
| MLE, Three, Recursive | 0.18 | 36.96 | 3.05 | 43.61 | 0.00 | 39.51 |
| CKM, Base, Nonrecursive | 2.76 | 33.50 | 1.91 | 33.53 | 1.91 | 33.86 |
| CKM, Base, Recursive | 2.61 | 25.77 | 1.81 | 31.41 | 0.00 | 24.93 |
| CKM, $\sigma_l/2$, Nonrecursive | 10.20 | 66.86 | 7.24 | 66.94 | 7.23 | 67.16 |
| CKM, $\sigma_l/2$, Recursive | 9.68 | 58.15 | 6.88 | 64.63 | 0.00 | 57.00 |
| CKM, $\sigma_l/4$, Nonrecursive | 31.20 | 89.00 | 23.81 | 89.08 | 23.76 | 89.08 |
| CKM, $\sigma_l/4$, Recursive | 29.96 | 84.76 | 22.79 | 87.91 | 0.00 | 84.07 |
| CKM, $\sigma = 6$, Nonrecursive | 0.78 | 41.41 | 0.52 | 41.33 | 0.52 | 41.68 |
| CKM, $\sigma = 6$, Recursive | 0.73 | 37.44 | 0.49 | 40.11 | 0.00 | 37.42 |
| CKM, $\sigma = 0$, Nonrecursive | 2.57 | 20.37 | 1.82 | 20.45 | 1.82 | 20.70 |
| CKM, $\sigma = 0$, Recursive | 2.44 | 13.53 | 1.73 | 18.59 | 0.00 | 12.33 |
| CKM, $\sigma = 0$ and $2\sigma_l$, Nonrecursive | 0.66 | 6.01 | 0.46 | 6.03 | 0.46 | 6.12 |
| CKM, $\sigma = 0$ and $2\sigma_l$, Recursive | 0.62 | 3.76 | 0.44 | 5.41 | 0.00 | 3.40 |
| CKM, Three, Nonrecursive | 2.23 | 30.73 | 1.71 | 31.11 | 1.72 | 31.79 |
| CKM, Three, Recursive | 2.31 | 23.62 | 1.66 | 29.67 | 0.00 | 25.62 |

Note: (a) $V_h$ corresponds to the columns denoted by $\ln(l_t)$. (b) In each case, the results report the ratio of two variances: the numerator is the variance for the system with only technology shocks and the denominator is the variance for the system with both technology shocks and labor tax shocks. All statistics are averages of the ratios, based on 300 simulations of 5000 observations for each model. (c) ‘Base’ means the two-shock specification, whether MLE or CKM, as indicated. ‘Three’ means the three-shock specification. (d) For a description of the procedure used to calculate the forecast error variance, see footnote 13. (e) ‘MLE’ and ‘CKM’ refer, respectively, to our and CKM’s estimated models.
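The calculation described in note (b) can be illustrated with a toy example. The sketch below does not use the paper's models: it assumes, purely for illustration, that log hours follow an AR(1) driven by two independent Gaussian shocks, that the technology-only system is fed the same technology shocks as the two-shock system, and that the ratio is expressed in percent. The function names, the parameter values, and the smaller default simulation sizes (chosen to keep the example fast) are all hypothetical.

```python
import random


def variance(x):
    """Population variance of a list of floats."""
    m = sum(x) / len(x)
    return sum((v - m) ** 2 for v in x) / len(x)


def simulate_hours(n_obs, sd_tech, sd_tax, rho, rng):
    """Toy AR(1) for log hours driven by two independent Gaussian shocks.
    This process is a stand-in, not the paper's model."""
    both, tech_only = [], []
    h_both = h_tech = 0.0
    for _ in range(n_obs):
        e_tech = rng.gauss(0.0, sd_tech)
        e_tax = rng.gauss(0.0, sd_tax)
        h_both = rho * h_both + e_tech + e_tax
        h_tech = rho * h_tech + e_tech
        both.append(h_both)
        tech_only.append(h_tech)
    return tech_only, both


def average_variance_ratio(n_sims=50, n_obs=2000, sd_tech=0.5, sd_tax=1.0,
                           rho=0.9, seed=0):
    """Average over simulations of 100 * var(tech only) / var(both shocks)."""
    rng = random.Random(seed)
    ratios = []
    for _ in range(n_sims):
        tech_only, both = simulate_hours(n_obs, sd_tech, sd_tax, rho, rng)
        ratios.append(100.0 * variance(tech_only) / variance(both))
    return sum(ratios) / len(ratios)


ratio = average_variance_ratio()
print(round(ratio, 1))
```

In this toy process the population counterpart of the ratio is `sd_tech**2 / (sd_tech**2 + sd_tax**2)`, so the simulated average should lie close to 20 percent for the default values; setting `n_sims=300` and `n_obs=5000` matches the simulation sizes stated in the note.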

Table 2: Properties of Two-Shock CKM Specification

Panel A: First Six Lag Matrices in Infinite-Order VAR Representation

$$B_1 = \begin{bmatrix} 0.013 & 0.041 \\ 0.0065 & 0.94 \end{bmatrix}, \quad B_2 = \begin{bmatrix} 0.012 & -0.00 \\ 0.0062 & -0.00 \end{bmatrix}, \quad B_3 = \begin{bmatrix} 0.012 & -0.00 \\ 0.0059 & -0.00 \end{bmatrix},$$
$$B_4 = \begin{bmatrix} 0.011 & -0.00 \\ 0.0056 & -0.00 \end{bmatrix}, \quad B_5 = \begin{bmatrix} 0.011 & -0.00 \\ 0.0054 & -0.00 \end{bmatrix}, \quad B_6 = \begin{bmatrix} 0.010 & -0.00 \\ 0.0051 & -0.00 \end{bmatrix}$$

Panel B: Population Estimate of Four-lag VAR

$$\hat{B}_1 = \begin{bmatrix} 0.017 & 0.043 \\ 0.0087 & 0.94 \end{bmatrix}, \quad \hat{B}_2 = \begin{bmatrix} 0.017 & -0.00 \\ 0.0085 & -0.00 \end{bmatrix}, \quad \hat{B}_3 = \begin{bmatrix} 0.012 & -0.00 \\ 0.0059 & -0.00 \end{bmatrix}, \quad \hat{B}_4 = \begin{bmatrix} 0.0048 & -0.0088 \\ 0.0025 & -0.0045 \end{bmatrix}$$

Panel C: Actual and Estimated Sum of VAR Coefficients

$$\hat{B}(1) = \begin{bmatrix} 0.055 & 0.032 \\ 0.14 & 0.94 \end{bmatrix}, \quad B(1) = \begin{bmatrix} 0.28 & 0.022 \\ 0.14 & 0.93 \end{bmatrix}, \quad \sum_{j=1}^{4} B_j = \begin{bmatrix} 0.047 & 0.039 \\ 0.024 & 0.94 \end{bmatrix}$$

Panel D: Actual and Estimated Zero-Frequency Spectral Density

$$S_Y(0) = \begin{bmatrix} 0.00017 & 0.00097 \\ 0.00097 & 0.12 \end{bmatrix}, \quad \hat{S}_Y(0) = \begin{bmatrix} 0.00012 & 0.0022 \\ 0.0022 & 0.13 \end{bmatrix}.$$

Panel E: Actual and Estimated One-Step-Ahead Forecast Error Variance

$$V = \hat{V} = \begin{bmatrix} 0.00012 & -0.00015 \\ -0.00015 & 0.00053 \end{bmatrix}$$

Panel F: Actual and Estimated Impact Vector

$$C_1 = \begin{pmatrix} 0.00773 \\ 0.00317 \end{pmatrix}, \quad \hat{C}_1 = \begin{pmatrix} 0.00406 \\ 0.01208 \end{pmatrix}$$

Table 3: Percent Contribution of Shocks in the ACEL model to the Variation in Hours and in Output

| Statistic | Monetary Policy | Neutral Technology | Capital-Embodied |
|---|---|---|---|
| variance of logged hours | 22.2 | 40.0 | 38.5 |
| variance of HP filtered logged hours | 37.8 | 17.7 | 44.5 |
| variance of ∆y | 29.9 | 46.7 | 23.6 |
| variance of HP filtered logged output | 31.9 | 32.3 | 36.1 |

Note: Results are average values based on 500 simulations of 3100 observations each. ACEL: Altig, Christiano, Eichenbaum and Linde (2005).
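As a rough cross-check on Table 2, Panels C to E are linked by the formula $S_Y(0) = [I - B(1)]^{-1} V [I - B(1)']^{-1}$ from (B.1). The sketch below applies it to the rounded entries for $B(1)$ and $V$, taking the off-diagonal elements of $V$ to be negative (an assumption about the sign placement in the printed matrix). Because the inputs carry only two significant digits and $1 - 0.93$ is small, only approximate agreement with Panel D should be expected; the helper names are hypothetical.

```python
def inv2(M):
    """Inverse of a 2x2 matrix."""
    (a, b), (c, d) = M
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def matmul(A, B):
    """Product of two small matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def transpose(M):
    """Transpose of a matrix stored as nested lists."""
    return [list(row) for row in zip(*M)]


def zero_frequency_spectral_density(B_one, V):
    """S_Y(0) = [I - B(1)]^{-1} V [I - B(1)']^{-1} for a two-variable VAR."""
    n = len(B_one)
    I_minus_B = [[(1.0 if i == j else 0.0) - B_one[i][j] for j in range(n)]
                 for i in range(n)]
    A = inv2(I_minus_B)
    return matmul(matmul(A, V), transpose(A))


# Rounded entries as reported in Table 2 (Panels C and E); the negative
# off-diagonal elements of V are an assumption about the printed signs.
B_true = [[0.28, 0.022], [0.14, 0.93]]
V = [[0.00012, -0.00015], [-0.00015, 0.00053]]

S = zero_frequency_spectral_density(B_true, V)
print(S)
```

Passing the Panel C entries for $\hat{B}(1)$ instead of `B_true` gives the analogous check for $\hat{S}_Y(0)$, with the same caveat about rounding.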

Cite this document

APA

Lawrence J. Christiano, Martin Eichenbaum, & Robert Vigfusson (2006). Assessing Structural VARs (IFDP 2006-866). Board of Governors of the Federal Reserve System, International Finance Discussion Papers. https://whenthefedspeaks.com/doc/ifdp_2006-866

BibTeX

@techreport{wtfs_ifdp_2006_866,
  author = {Lawrence J. Christiano and Martin Eichenbaum and Robert Vigfusson},
  title = {Assessing Structural VARs},
  type = {International Finance Discussion Papers},
  number = {2006-866},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2006},
  url = {https://whenthefedspeaks.com/doc/ifdp_2006-866},
  abstract = {This paper analyzes the quality of VAR-based procedures for estimating the response of the economy to a shock. We focus on two key issues. First, do VAR-based confidence intervals accurately reflect the actual degree of sampling uncertainty associated with impulse response functions? Second, what is the size of bias relative to confidence intervals, and how do coverage rates of confidence intervals compare with their nominal size? We address these questions using data generated from a series of estimated dynamic, stochastic general equilibrium models. We organize most of our analysis around a particular question that has attracted a great deal of attention in the literature: How do hours worked respond to an identified shock? In all of our examples, as long as the variance in hours worked due to a given shock is above the remarkably low number of 1 percent, structural VARs perform well. This finding is true regardless of whether identification is based on short-run or long-run restrictions. Confidence intervals are wider in the case of long-run restrictions. Even so, long-run identified VARs can be useful for discriminating among competing economic models.},
}