feds · January 3, 2021

Bias in Local Projections

Abstract

Local projections (LPs) are a popular tool in macroeconomic research. We show that LPs are often used with very small samples in the time dimension. Consequently, LP point estimates can be severely biased. We derive simple expressions for this bias and propose a way to bias-correct LPs. Small sample bias can also lead autocorrelation-robust standard errors to dramatically understate sampling uncertainty. We argue they should be avoided in LPs like the ones we study. Using identified monetary policy shocks, we demonstrate that the bias in point estimates can be economically meaningful and the bias in standard errors can affect inference. Accessible materials (.zip) Original paper: PDF | Accessible materials (.zip)

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. Bias in Local Projections Edward P. Herbst and Benjamin K. Johannsen 2020-010 Please cite this paper as: Herbst, Edward P., and Benjamin K. Johannsen (2021). “Bias in Local Projections,” FinanceandEconomicsDiscussionSeries2020-010r1. Washington: BoardofGovernorsofthe Federal Reserve System, https://doi.org/10.17016/FEDS.2020.010r1. NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

Bias in Local Projections Edward P. Herbst and Benjamin K. Johannsen∗ December 31, 2020 Abstract Local projections (LPs) are a popular tool in macroeconomic research. We show that LPs are often used with very small samples in the time dimension. Consequently, LP point estimates can be severely biased. We derive simple expressions for this bias andproposeawaytobias-correctLPs. Smallsamplebiascanalsoleadautocorrelationrobust standard errors to dramatically understate sampling uncertainty. We argue they should be avoided in LPs like the ones we study. Using identified monetary policy shocks, we demonstrate that the bias in point estimates can be economically meaningful and the bias in standard errors can affect inference. 1 Introduction Weshowthatifatimeseriesispersistent—asisgenerallythecasewhenresearchersareinterested in impulse responses—then estimators of impulse responses by local projections (LPs) can be severely biased in sample sizes commonly found in the empirical macroeconomics literature. Starting with Jordà (2005), LPs have been used by researchers as an alternative to other time series methods, such as vector autoregressions (VARs). We survey the literature and ∗edward.p.herbst@frb.gov;benjamin.k.johannsen@frb.gov. ThisdoesnotreflecttheviewsoftheFederal ReserveBoardortheFederalReserveSystem. WethankToddClark,MarcoDelNegro,EricGhysels,Òscar Jordà,LutzKilian,MikkelPlagborg-Møller,FrankSchorfheide,andMinchulShin,aswellasseminarparticpants, for helpful comments and discussions. We thank Nicholas Zevanove for excellent research assistance. Any errors are our own. 1

find that, over the past 15 years, LPs have been applied in a variety of settings that are notably different than the setting studied in Jordà (2005). In particular, we find that sample sizes in the time dimension are typically much smaller than the sample sizes studied in Jordà (2005) and that LPs have become increasingly prevalent when researchers also have a cross section of data (i.e., panel data). Additionally, researchers often approach LPs with identified structural shocks in hand, rather than identifying those shocks as a part of the estimation.1 We focus on this idealized case in this paper, as it is a natural benchmark for understanding the methodology. Using Monte Carlo analysis, we demonstrate that the magnitude of the bias in LPs can be large when sample sizes in the time dimension are similar to those typically found in the empirical macroeconomics literature. Our Monte Carlo simulations use simple, linear data generating processes. While researchers may be drawn to LPs because they invoke fewer parametric restrictions than other methods, an important standard for this methodology is that it performs well in simple scenarios. Notably, we show that the bias in LPs persists even when the shock on the right-hand-side of the regressions in our Monte Carlo analysis is iid, as is often the case when researchers have access to a time series of identified structural shocks. We analyze the small-sample bias in LPs using a higher-order expansion of the LP estimator, building on the related work of Kendall (1954), Rilstone et al. (1996), Anatolyev (2005), and Bao and Ullah (2007). We show that the bias of the LP estimator at horizon h is a function—specifically, a weighted sum—of the (population) impulse response function at other horizons. As a result, if LP estimators across horizons have the same sign (as is the case for hump-shaped impulse responses), then the least-squares estimators are biased toward zero at every horizon. Additionally, our analysis highlights that the small-sample estimates from LPs are not “local” because the small-sample biases of those estimates depend on the true impulse responses at other horizons. Weusethehigher-orderexpansionoftheLPestimatortodevelopasimplebiascorrection that can added to the ordinary least squared (OLS) estimator. In Monte Carlo simulations, on average our bias-corrected estimators are markedly closer to the true values of the impulse responses.2 In addition, using coverage probabilities as a metric, we show that our estimator 1In what follows, we always refer to the regressor associated with the LP coefficient as the “shock.” 2Becauseourbiascorrectiondoesnotcompletelyeliminatesmallsamplebias,insomesettingsresearchers 2

performs relatively well when compared to the OLS estimator. We extend our analysis to settings using panel data and show that—when using fixed effects—the bias we document persists. We provide formulas for bias correcting the OLS estimator in panel settings with and without controls. Additionally, we show that increasing the number of entities in the panel will not eliminate the bias. Finally, we also study the downward bias in the standard errors of LP estimators. Our analysis of standard errors is related to recent work by Olea and Plagborg-Møller (2020), who suggest that researchers using LPs should use heteroskedasticity-consistent, but not autocorrelation-robust, standard errors.3 Our focus on finite sample issues leaves us at similar conclusions, but for different reasons. We show that, like the LPs themselves, commonly-used standard errors for LPs that are heteroskedasticity- and autocorrelation-robust (HAR) are also severely downward biased whensamplesizesaresimilartothosecommonlyfoundintheliterature. Whentheregression scores—the regression residuals times the regressors—are autocorrelated, HAR standard errorsarenecessaryforvalidinference. PopularHARestimators,suchastheNeweyandWest (1987) estimator, rely on estimators of the autocorrelation of the regression score. Like the LPpointestimators, infinitesamples, theestimatorsoftheseautocorrelationswillbebiased. In most empirically-realistic settings, the bias will be downwards. This downward bias is large enough that, in LP settings like the ones we study, estimates of the autocorrelation are, on average, negative in small samples, even when the true autocorrelation is zero or mildly positive! The possibility of the bias leading to an incorrectly signed estimate of autocorrelations suggests researchers may prefer standard errors that are heteroskedasticity-consistent, but not autocorrelation-robust, such as Huber-White standard errors. In fact, in our empirical examples, switching from Newey-West to Huber-White estimators generally increases the estimates of standard errors, sometimes dramatically so. In the case that researchers need to use autocorrelation-robust standard errors, our small sample analysis suggests that they should be aware that these standard errors may be dramatically biased. Additionally, when researchers have the shocks in hand, we argue that under the null may prefer methods, such as VARs, that estimate the same impulse responses as LPs (see Plagborg-Møller and Wolf (2019)) and have well-understood, effective methods for bias correction (see Kilian (1998)). 3Olea and Plagborg-Møller (2020) specifically advocate the use of lag-augmented LPs. We analyze the effects of lag augmentation in our setup in Appendix G. 3

hypothesis that is typically of interest, HAR standared errors are not even necessary for valid inference. We provide an analytical characterization of the true autocorrelation function of the regression score in a simple setting. In this case, using HAR standard errors is not conservative; in fact, it will typically understate uncertainty relative to a standard error calculated without attempting to take autocorrelation into account. We analyze bias in three examples drawn from the empirical monetary economics literature. We show that the bias in point estimates can be economically meaningful. In these examples, consistent with our simulations and analytical results, the use of HAR estimators typically leads to smaller standard error estimates than the use of the Huber-White estimator; in many cases this difference is large enough to affect inference. This is true even when the estimated bias in the point estimate is small. Our paper is related to work by Kilian and Kim (2011), who study the coverage probabilities for confidence intervals for LP estimators using bootstrap methods. Their work focuses onthecasewhenshocksareidentifiedasapartoftheLPestimationandusestheblockbootstrap to approximate the finite sample distribution of the OLS estimate. By contrast, our paper considers the case when a time series of identified shocks is available to the researcher, so right-hand-side variables are iid. Our analysis relies on higher-order expansions of the OLS estimator, which illustrate the reasons that the LP estimator is biased and provide a natural bias correction without bootstrapping. In addition, we extend the analysis to panel data settings, which are common settings for LPs in practice. In Appendix F, we compare the bias correction from the block bootstrap used in Kilian and Kim (2011) to our proposed bias correction and use insights from our higher-order expansion of the OLS estimator to explain why the bootstrap performs relatively poorly unless block lengths are relatively long. More generally, our paper is related to work on bias in least-squares estimators of autocorrelation (such as Kendall (1954) and Shaman and Stine (1988)), in dynamic panel data settings (such as Nickell (1981) and Hahn and Kuersteiner (2002)), and in generalized method of moments systems (Rilstone et al. (1996), Anatolyev (2005), and Bao and Ullah (2007)). We extend this work to our LP setting. 4

2 Some evidence on the use of LPs To get a sense of how LPs are used in the literature, we examine the 100 “most relevant” papers citing Jordà (2005) on Google Scholar.4 Google Scholar’s relevance ranking weights the text of the document, the authors, the source of the publication, and the number of citations. Of these 100 papers, 71 employed LPs in an empirical project (rather than merely citing but not applying LP).5 The focus of this paper is parameter bias associated with short time series, so for each of the studies we recorded the length of the time series, T, in the main LP in each of these papers. About two-thirds of the papers surveyed employed panel data. As mentioned in the introduction and discussed later, with entity-specific fixed effects, the time dimension is still the relevant component of the sample size for determining the LP bias. Because many of the panel data sets are unbalanced, constructing a single summary T is challenging. For unbalanced panels, we summarize the size of the time dimension using the mean T across entities, when readily available, or using the largest value of T across entities. In general, our assessment of T is extremely conservative in the sense that it overestimates the time series dimension of the data for many of the LP applications. It is not unusual, for example, to see unbalanced panels that have an average T that is less than half of the time-series dimension of the entire panel or to see robustness exercises that use a small fraction of the data series. In these cases, we use the entire time series dimension of the panel, which biases our estimates of T up. Figure 1 displays a histogram of the sample of 71 Ts collected in our literature review. The median T (the red dash dotted line) is around 95. These sample sizes are significantly smaller than those typically used in empirical macroeconometrics papers, as most of the papers surveyed here use the increasingly popular strategy of using observed shocks, such as the monetary policy shocks of Romer and Romer (2004), rather than identified shocks from a VAR, as in Jordà (2005). Constructing these observed shocks is often difficult and costly, and so the time series typically have short length. The application of LPs to such short time series does not seem to have been anticipated in the early literature on LPs. In fact, the Monte Carlo studies in Jordà (2005) used T = 300 4We conducted this search in October 2019. See Appendix I for the list of citations. 5If a paper appeared as both a working paper and a published paper, we excluded the working paper version from our analysis. 5

20.0 17.5 15.0 12.5 10.0 7.5 5.0 2.5 0.0 0 100 200 300 400 500 T srepaP Median Jordà (2005) Monte Carlos Figure 1: T is small in the literature using LPs. and T = 496 (the orange, dashed lines in Figure 1). Less than 6 percent of the surveyed studies use sample sizes at least that large. Of course, it is difficult to fault Jordà (2005) for not anticipating how researchers would subsequently apply LP methods. While many studies in our survey use annual or quarterly data, Jordà (2005) used monthly data. In general, however, increasing T by using monthly data rather than quarterly or annual data will not completely eliminate the issue of small-sample bias in LPs because the monthly series are likely to be more persistent, and the bias in LPs is more severe when the data are more persistent. 3 Bias in LPs In this section, we demonstrate that LPs can be severely biased with sample sizes like those documented in Section 2. We explore this bias using both Monte Carlo evidence and a new analytic approximation of the bias. The analytic approximation yields insights into the bias associated with LPs at different horizons and suggests a correction of the bias. 6

3.1 Bias in LPs using an AR(1) example To demonstrate that LPs can be severely biased in small samples, we first consider Monte CarloanalysisusinganAR(1)datageneratingprocess. Ourobjectiveistostudytheaccuracy of estimated impulse responses via LPs for various sample sizes. For a given T, we simulate N = 10;000 time series, fy gT , for the data generating process: mc t t=1 y =(cid:26)y +" +(cid:23) : (1) t t(cid:0)1 t t Here, " and (cid:23) are iid standard normal random variables. In these Monte Carlo simulations, t t to be consistent with the high persistence of macroeconomic data, we set (cid:26) = 0:95.6 We use the AR(1) time series model because LPs were designed to capture the dynamics of a wide range of data generating processes and one would hope that they would perform well in the simplest examples. We assume that the researcher does not know the true data generating process but is otherwise in a near-ideal setting for estimating the impulse response function of y using t LPs. The researcher observes fy ;" gT ; that is, the researcher directly observes the shock t t t=1 " which is independent over time and uncorrelated with past values of y . In addition, the t t researcher may control for other variables, denoted by the vector c . When we include such t controls in the Monte Carlo exercise, we assume c = y . We stress that our regressions t t(cid:0)1 with controls are ideal in the sense that no useful additional information from earlier periods could be added to the regressors and we include the correct number of lags of y as controls. t The LP model is the set of regression models indexed by the impulse response horizon h, ′ y = (cid:11) +(cid:12) x +u ; h = 0;:::;H: (2) t+h h h t t;h where x (cid:17) [" ;c′]′.7 Thus, the first elements of the coefficient vectors f(cid:12) gH trace out the t t t h h=0 impulse response of interest. We denote the H + 1 vector describing the impulse response by (cid:18) with elements (cid:18) for h = 0;:::;H. As in the empirical macroeconomics literature, we h b estimate each (cid:12) using least squares. We denote the estimator of the (cid:12) by (cid:12) and the h h h;LS b estimator of the impulse response by (cid:18) . LS Using Monte Carlo simulations, we can compute, for any T, the finite sample expectation b oftheleast-squaresestimator,E[(cid:18) ]. Figure2displaystheexpectationfortheLPestimators LS 6In Appendix C, we provide results for alternative values of (cid:26). In each simulation, we initialize y at a 0 draw from the unconditional distribution of y . The variances of " and (cid:23) are set to (cid:27)2 =(cid:27)2 =1: t t t " (cid:23) 7When we do not include controls, x =" . t t 7

(a) Without Controls (b) With Controls 1.0 1.0 0.8 0.8 0.6 True 0.6 True E[ LS], T=200 E[ LS], T=200 0.4 E[ LS], T=100 0.4 E[ LS], T=100 E[ LS], T=50 E[ LS], T=50 0.2 0.2 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h Figure 2: LP estimators are biased in empirically-relevant samples when y is an AR(1) with t (cid:26) = 0:95. with and without controls for the AR(1) data generating process with T 2 f50;100;200g. Recall that about half of the surveyed literature uses T less than 100. When controls are not included in the LP (the left panel), the estimator is biased even at short horizons. This is true even for moderately long time series—i.e., T = 200. As the horizon of the impulse response increases, the bias becomes worse. When y is included as t(cid:0)1 a control (the right panel), the bias diminishes substantially at short horizons. Intuitively, addingcontrolsmakestheleast-squareserrortermslessautocorrelatedwithatshorthorizons. However, even for the impulse response only 10 periods ahead (2.5 years with quarterly data), the controls alleviate only a small fraction of the bias. The reason that controls are less effective at reducing the bias as h increases is that they are less effective at forecasting y . Note also that, at these longer horizons, the overlapping nature of the left-hand-side t+h variables in the LP implies that the error terms are autocorrelated.8 8Following Jordà (2005), for each regression of horizon h, researchers typically use all available data, meaning that the regression error term is autocorrelated for at least h(cid:0)1 periods. With autocorrelated regressionerrorterms, thegeneralizedleast-squaresestimatorasymptoticallyperformsbetterthantheordinaryleast-squaresestimator. However,researcherstypicallyusetheordinaryleast-squaresestimatorbecause of the small-sample shortcomings of the feasible generalized least-squares estimator. That said, recent work by Lusompa (2019) suggests that well-behaved small sample GLS estimators may be obtained for LPs. 8

3.2 Approximating the small sample bias in LPs In this subsection we derive an analytic approximation to the bias of the least-squares estib b mator, (cid:18) , given by E[(cid:18) ](cid:0)(cid:18). The expressions we derive are, to the best of our knowledge, LS LS new to the literature and highlight the interdependence of the bias in LPs at different horizons. In order to illustrate the point clearly (and to avoid tedious matrix algebra), we first focus on LPs without controls. We generalize our analytic approximation to the case when controls are included in the LP, and the intuition for the bias is similar (we provide the associated derivations in our Appendix A). In all of our derivations, we make the following assumptions. Let w = [y ;" ;c′]′. t t t t Assumption 1. The data fw g is stationary and ergodic. The demeaned series fw (cid:0)(cid:22) g, t t w where (cid:22) = E[w ], is purely non-deterministic with absolutely summable Wold decomposition w t coefficients and E[jjw w′jj2] < 1. t t Under Assumption 1, the Wold representation can be inverted and the data has a VAR(1) representation. If fw g is jointly Gaussian, the data satisfies the Assumption 1. t The next assumption explicitly characterizes the relationship between y and " as well the t t conditional moments of " . t Assumption 2. The time series y can be decomposed as t ∑1 y = (cid:18) " +y(cid:22) t j t(cid:0)j t j=0 where y(cid:22) is independent of " for j = (cid:0)1;:::;1. Moreover, t t(cid:0)j E[" jfw g ;f" g ] = E[" ] = (cid:22) t (cid:28) (cid:28)<t (cid:28) t<(cid:28) t " and E[(" (cid:0)(cid:22) )2jfw g ;f" g ] = E[(" (cid:0)(cid:22) )2] = (cid:27)2 > 0: t " (cid:28) (cid:28)<t (cid:28) t<(cid:28) t " " The first part of this assumption characterizes y as the sum of a casual linear filter in t f" g1 and an additional component which is independent of all " . The filter coefficients t(cid:0)j j=0 j are the impulse response coefficients of interest. This is a stark assumption that is meant to represent an idealized case where “direct causal inference”—as in Nakamura and Steinsson (2018)—is possible. A jointly Gaussian time series trivially satisfies this portion of the assumption. The second part of this assumption precludes first and second moment dependence in " . The idealized case where the researcher has iid shocks in hand trivially satisfies t 9

the second part of Assumption 2. Such a stark assumption is consistent with the emerging practice of constructing such shocks with desirable statistical properties. We stress that a violation of Assumption 2 does not eliminate the small sample bias discussed here, it merely makes it difficult to characterize analytically. This assumption allows us to obtain sharp analytical results, a necessary first step in understanding finite sample issues in LPs. Recall that in our framework, the researcher observes fy ;" gT . For a given h, the t t t=1 ordinary least-squares estimator of (cid:18) can be written as h ( )( ) ∑ ∑ ∑ 1 T(cid:0)h" y (cid:0) 1 T(cid:0)h" T(cid:0)hy b T(cid:0)h t=1 t t+h (T(cid:0)h)2 t=1 t t=1 t+h (cid:18) = ( ) h;LS ∑ ∑ 2 1 T(cid:0)h"2 (cid:0) 1 T(cid:0)h" T(cid:0)h t=1 t (T(cid:0)h)2 t=1 t ccov[" ;y ] t t+h = : (3) vcar[" ] t Here, ccov and vcar are the sample covariance and variance, respectively.9 Equation (3) makes it clear that (cid:18) in population is the scaled covariance between y and " . h t+h t Analytical Result 1 (Expression for the bias in an LP without controls). Under Assumptions 1 and 2, the bias of the LP estimator in equation (3) is given by ( ) [ ] T∑(cid:0)h(cid:0)1 ( ) E (cid:18) b (cid:0)(cid:18) = (cid:0) 1 1(cid:0) j ((cid:18) +(cid:18) )+O T (cid:0)3=2 : (4) h;LS h T (cid:0)h T (cid:0)h h+j h(cid:0)j j=1 A proof of this claim is given in Appendix A, but we provide some discussion here. To [ ] b b derive an expression for the approximate bias of (cid:18) , we need to compute E (cid:18) . In h;LS h;LS such a simple example, one can calculate higher-order expansions of equation (3). This is a widely adopted approach to these kinds of problems. We apply the methodology of Bao and Ullah (2007), which does not require, for example, that the shocks be normally distributed. b Finally, note that equation (4) gives an approximate expression for the bias of (cid:18) that h;LS consists of only the impulse response coefficients at different horizons.10 To gain intuition for equation (4), it is useful to consider the case when " is symmetric t around the origin.11 In this case, [ ] E (cid:18) b = E[ccov[" t ;y t+h ]] +O ( T (cid:0)3=2 ) : (5) h;LS E[vcar[" ]] t 9Note that for the variance to be consistent with the least-squares estimator, we use only the T (cid:0)h observations of " . t 10Given our maintained assumption that " t is uncorrelated with past values of y t , (cid:18)(cid:0)j =0 for j >0. b 11In this case, the derivation of our expression for the bias of (cid:18) is similar to the derivation in Kendall h;LS (1954) for the well-known bias of estimators of autocorrelation—see also Shaman and Stine (1988). 10

The numerator of equation (5) is given by ( ) 1 E[ccov[y ;" ]] = 1(cid:0) cov[y ;" ] (6) t+h t T (cid:0)h t+h t ( ) T∑(cid:0)h(cid:0)1 1 j (cid:0) 1(cid:0) (cov[y ;" ]+cov[y ;" ]): T (cid:0)h T (cid:0)h t+h+j t t+h(cid:0)j t j=1 Here, cov is the true covariance. The summation in equation (6) comes from the plug-in estimator for the mean in ccov. The denominator of equation (5) is ( ) 1 E[vcar[" ]] = 1(cid:0) var[" ]: (7) t T (cid:0)h t Here, var is the true variance. Noting that (cid:18) = cov[" ;y ]=var[" ], we obtain equah+j t t+h+j t tion(4).12 Fromthisderivation, itisclearthatthebiasarisesbecauseoftheneedtoestimate the means of the variables in the OLS calculations. Several more remarks regarding equation (4) are in order. First, as expected, the bias is a decreasing function of T; for fixed h, the least-squares estimator is consistent. Second, b the bias of (cid:18) is a function of the impulse response coefficients at all other horizons. h;LS Intuitively, the data generating process affects the bias of OLS estimators at similar horizons in similar ways. The interdependence of LP estimates across h highlights that, in finite samples, LPs are not “local.” Third, the contribution of the horizon h+j impulse response coefficients to the bias in the least-squares estimate of the h impulse response coefficient decreases only at linear rate as j increases or decreases. In practice, this means that the bias in the portion of the impulse response of interest—typically, say, the first 20 periods in quarterlymacroeconomicapplications—canbemeaningfullyaffectedbytheimpulseresponse atmuchlongerhorizons. Thisisespeciallytrueforextremelypersistenttimeseries(likemany macroeconomic series). We next consider the case when a researcher includes controls in the LP. We maintain the following assumptions. Assumption 3. (cid:11) +x′(cid:12) is an optimal linear forecast for y . h t h t+h The assumption that the LP produces an optimal linear forecast implies that, using the true parameters, forecast errors are MA(h+1) processes. 12In this derivation, the expression for the bias would be divided by T (cid:0)h(cid:0)1 rather than T (cid:0)h, but that difference is absorbed in the term O(T(cid:0)3=2). 11

Assumption 4. The matrix E[x x′] is full rank. t t This assumption ensures our estimator is consistent and that our setup satisfies the assumptions of Rilstone et al. (1996). With these additional assumptions, we can state our next analytical result. Analytical Result 2 (Expression for the bias in an LP with controls). Under Assumptions 1-4, the bias for the LP with controls is given by ( ) [ ] ∑h ( { }) ( ) E (cid:18) b (cid:0)(cid:18) = (cid:0) 1 1(cid:0) j 1+tr (cid:6) (cid:0)1(cid:6) (cid:18) +O T (cid:0)3=2 ; (8) h;LS h T (cid:0)h T (cid:0)h c;0 c;j h(cid:0)j j=1 [ ] where (cid:6) (cid:17) E (c (cid:0)(cid:22) )(c (cid:0)(cid:22) ) ′ . c;j t(cid:0)j c t c The claim again relies on the results of Bao and Ullah (2007). A detailed derivation is in Appendix A. Several remarks regarding equation (8) are in order. First, as in the case withoutcontrols,thebiasisadecreasingfunctionofT; forfixedh,theleast-squaresestimator b is consistent. Second, the bias of (cid:18) is a function of the impulse response coefficients at all h;LS horizons up to h. Intuitively, the data generating process affects the bias of OLS estimators at similar horizons in similar ways, however controlling for past data truncates the terms that are important for the bias by truncating the correlation in the regression errors. Third, the contribution of the horizon h(cid:0)j impulse response coefficients to the bias in the least- { } squares estimate of the h impulse response coefficient is scaled by 1 + tr (cid:6) (cid:0)1(cid:6) . As a c;0 c;j result, when controls are persistent or when unneeded persistent controls are added to the LP, the bias increases. [ ] b Using our AR(1) example, figure 3 shows E (cid:18) calculated using the approximation in LS equation (5), assuming that the true values of (cid:18) are known. The figure also shows the exact h finite sample value from Monte Carlo simulations. Notably, the approximation works quite well in population. For the no-controls case, the analytic approximation is nearly exact for T 2 f50;100;200g. With controls, the analytic approximation to the impulse response is somewhat above the true finite-sample expectation, though it still captures most of the bias associated with the least-squares estimator. 3.3 A bias-corrected estimator Equations 4 and 8 lend themselves to constructing bias corrected estimators for (cid:18) using h plug-in estimators for (cid:18) and (cid:6) . In the case of no controls, the bias depends on the values j c;j 12

(a) Without Controls (b) With Controls 1.0 1.0 0.8 0.8 0.6 True 0.6 True E[ LS], T=100 E[ LS], T=100 MC (dashed) MC (dashed) 0.4 0.4 Approx. (solid) Approx. (solid) E[ LS], T=50 E[ LS], T=50 0.2 MC (dashed) 0.2 MC (dashed) Approx. (solid) Approx. (solid) 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h Figure 3: The bias approximation is accurate in our LPs. of (cid:18) for all j ̸= h and jjj (cid:20) T. Given the inability to estimate all of these parameters with j a sample size of T, in practice a researcher could truncate the horizon of the coefficients used in the bias correction. Without theory on how to optimally pick the maximum horizon used in the bias correction, H, we set H to 20, 25, and 50 for T equal to 50, 100, and 200, respectively. In the case when controls are included in the LP, all of the needed values of (cid:18) j and (cid:6) are easily computed. c;j Notably, the researcher could bias correct the coefficients using the OLS estimates of (cid:18) . When we construct the bias-corrected estimator in this way, we denote the estimator as j b (cid:18) . Alternatively, theresearchercoulditeratethebias correctiononallvaluesof(cid:18) . When BC;h j b we construct the bias-corrected estimator in this way, we denote the estimator as (cid:18) . BCC;h For the case of an LP without controls and an LP with controls, respectively, figures b b b 4 and 5 show the average value of (cid:18) , (cid:18) , and (cid:18) over our Monte Carlo simulations LS BC BCC when y follows an AR(1) with (cid:26) = 0:95. Clearly, our bias correction does not completely t b correct for the bias in (cid:18) in either case, indicating that our bias-corrected estimator is not LS b b b a panacea for bias in LPs. Nevertheless, (cid:18) and (cid:18) are markedly closer than (cid:18) to (cid:18) on BC BCC LS average. 13

(a) T = 50 (b) T = 100 1.0 1.0 0.8 0.8 True 0.6 True 0.6 E[ BCC] E[ BCC] E[ BC] 0.4 E[ BC] 0.4 E[ LS] E[ LS] 0.2 0.2 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h b b b Figure 4: (cid:18) and (cid:18) are closer than (cid:18) to (cid:18), on average, in our LPs without controls BC BCC LS when y is an AR(1) with (cid:26) = 0:95. t (a) T = 50 (b) T = 100 1.0 1.0 0.8 0.8 True 0.6 True 0.6 E[ BCC] E[ BCC] E[ BC] 0.4 E[ BC] 0.4 E[ LS] E[ LS] 0.2 0.2 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h b b b Figure 5: (cid:18) and (cid:18) are closer than (cid:18) to (cid:18), on average, in our LPs with controls when BC BCC LS y is an AR(1) with (cid:26) = 0:95. t 4 Extension to panel data In this section, we demonstrate that LPs can be severely biased with sample sizes in the time dimension commonly found in the empirical macroeconomic literature even when researchers have access to a large cross-section (i.e. panel data). Of course, parameter bias in dynamic panel data models has been studied since at least Nickell (1981). We illustrate the bias in LPs using the expansion of the OLS estimator that is similar to one in the previous section. 14

As before, the bias for an LP at horizon h is linked directly to the LP population coefficients at other horizons. In all of our derivations, we maintain the assumptions from the previous section, and, for algebraic simplicity, we assume that the panel is balanced. 4.1 Bias in LPs with panel data using the AR(1) example TodemonstratethatLPscanbeseverelybiasedinsmallsampleswithpaneldata,wegenerate data, fy gT for each entity i = 1;:::;I, using the data generating process specified in i;t t=1 equations (1). For simplicity, we assume that all the data are independent across entities, but our derivations for the approximate bias do not depend on this assumption. We show results for panels containing I = 10;25;and 50 entities. As in the previous section, we assume (cid:26) = 0:95. In panel settings, the LP model is the set of regression models indexed by the impulse response horizon h, ′ y = (cid:11) +(cid:12) x +u ; h = 0;:::;H: (9) i;t+h i;h h i;t i;t;h where x (cid:17) [" ;c′ ]′.7 The first element of the coefficient vectors f(cid:12) gH trace out the i;t i;t i;t h h=0 impulse response of interest, which we denote f(cid:18) gH . Note that we incorporate fixed h h=1 effects in this panel setting. Figure 6 displays the Monte-Carlo mean of LP estimators with and without controls for T = 100 and different numbers of entitites in the panels. As was the case without panel data, the LP estimates of the impulse responses are severely biased. Notably, the bias does not approach zero as the number of entities in the panel grows large (see Nickell (1981)). As in the non-panel setting, the inclusion of controls is less effective at reducing the bias in LP estimators as h increases. The reason that controls are less effective at reducing the bias as h increases is that they are less effective at forecasting y . Thus, for large h, the i;t+h bias is similar to the LP estimator without controls. Papers like Acemoglu et al. (2019) have argued that T as small as 40 should make the bias in panel LP estimators relatively small. While the bias documented by Nickell (1981) is small at h = 1 when controls are included, thatbiascanbelargeath = 10evenwhenT isrelativelylarge. Ingeneral, impulseresponses are most of interest at moderate-to-large values of h. 15

(a) Without Controls (b) With Controls 1.0 1.0 0.8 0.8 0.6 True 0.6 True I=10 I=10 0.4 I=25 0.4 I=25 I=50 I=50 0.2 0.2 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h Figure 6: LP estimators without controls are biased in empirically-relevant samples when y is an AR(1) with (cid:26) = 0:95. i;t 4.2 Understanding bias in LPs with panel data In this subsection we derive an approximate bias function for the LP estimator in the context of panel data with entity fixed effects. We do our expansions under the assumption that the time series is growing, but that the number of entities in the panel is constant. This seems like the most relevant setting for macroeconomic applications, where panels generally consists of countries, states, or even counties. We analyze the implications of having a larger number of entities in a panel for the size of the bias. Throughout this sub-section, we maintain the following assumption. Assumption 5. Either " ? " for j ̸= i, or if " and " are correlated then E[" (y (cid:0) i;t j;t i;t j;t i;t j;t+h (cid:11) (cid:0)(cid:12) " )] = 0. j;h h j;t Thisassumptionsaysthateithertheshocksareindependentortheyarevalidinstruments for one another. Analytical Result 3 (Bias in Panel LPs without controls.). Under the maintained assumptions from this and the previous sections, the series expansion of the OLS estimator without controls implies ( ) [ ] T∑(cid:0)h(cid:0)1 ( ) E (cid:18) b (cid:0)(cid:18) = (cid:0) 1 1(cid:0) j ((cid:18) +(cid:18) )+O T (cid:0)3=2 : (10) h;LS h T (cid:0)h T (cid:0)h h+j h(cid:0)j j=1 16

Equation (10) is identical to equation (4), meaning that the expression for the bias in a panel setting is identical to the expression without panel data.13 As a result, without controls the bias-corrected estimator from the previous section could be applied to the panel data setup we analyze here, where T is used as the number of relevant observations rather than I (cid:2)T. When controls are included, the bias calculation becomes more complicated, although the intuition about the size and scope of bias is essentially unchanged. Define covariance of the "s between panelists as (cid:27) = E[(" (cid:0)(cid:22) )(" (cid:0)(cid:22) )]: j;k j;t j;" k;t k;" Next, write the covariance of t(cid:0)u and t controls of panelists j and k as [ ( )] (cid:6) = E (c (cid:0)(cid:22) ) c ′ (cid:0)(cid:22) ′ : j;k;u j;t(cid:0)u c;j k;t k;z Then, the average of the variance of the controls is given by ∑I 1 (cid:22) (cid:6) = (cid:6) : 0 j;j;c;0 I j=1 Analytical Result 4 (BiasinPanelLPswithcontrols.). Under the maintained assumptions from this and the previous sections, the series expansion of the OLS estimator with controls implies ( ) [ ] ∑h ( ) E (cid:18) b (cid:0)(cid:18) = (cid:0) 1 1(cid:0) u [1+! ](cid:18) +O T (cid:0)3=2 (11) h;LS h T (cid:0)h T (cid:0)h u h(cid:0)u u=1 where { } 1 ∑I ∑I tr (cid:6) (cid:22)(cid:0)1(cid:6) (cid:27) [ ( )] ! = 0∑ k;i;c;u i;k;" and (cid:6) = E (c (cid:0)(cid:22) ) c ′ (cid:0)(cid:22) ′ : u I2 1 I (cid:27) j;k;c;u j;t(cid:0)u c;j k;t k;z i=1 k=1 I j=1 j;j;" 13The moment conditions with and without controls are given by 2 3 (y (cid:0)(cid:11) (cid:0)(cid:12)x ) 6 1;t+h 1;t 1;t 7 6 6 (y 2;t+h (cid:0)(cid:11) 2;t (cid:0)(cid:12)x 2;t ) 7 7 6 7 E6 6 . . . 7 7=0: 6 7 6 7 4 (y (cid:0)(cid:11) (cid:0)(cid:12)x ) 5 I;t+h I;t I;t ∑ 1 I x (y (cid:0)(cid:11) (cid:0)(cid:12)x ) I i=1 i;t i;t+h i;t i;t 17

A few comments are in order regarding equation (11). First, as was the case without panel data and because the OLS estimator is consistent, the bias goes to zero as the sample size goes to infinity. Second, the cross-autocovariance of the control variables plays a role in the bias. Notably, if the controls are not correlated, the bias is smaller than if they are correlated. Third, as was the case without panel data, if the controls are positively autocorrelated, or if unnecessary positively autocorrelated controls are included in the LP, the bias is larger. Fourth, even with controls that are independent across entities or over time, the bias does not go to zero as the number of panelists increases. 5 Beyond point estimation: bias in standard errors Since Jordà (2005), the conventional wisdom has been that heteroskedasticity and autocorrelation robust (HAR) standard errors are necessary because the regression residuals of LPs are autocorrelated. That is the reason that most practioners use the HAR standard errors of Newey-West or more recent ones detailed in Sun (2014) and Lazarus et al. (2018). However, under Assumption 2 in the LP with controls, the regression score—the product of the " and t the regression residuals—is serially uncorrelated.14 Thus, in large samples HAR standard errors are not necessary; instead, Huber-White (heteroskedasticity-robust) standard errors are valid. In an LP without controls, under the AR(1) DGP, one can show that the autocovariance function of the regression score, r = " (y (cid:0)(cid:18) " ), is given by t t t+h h t 8 > < (cid:26)2h((cid:27)2) 2 (cid:28) = 1;:::;h " cov[r ;r ] = (12) t t(cid:0)(cid:28) > : 0 (cid:28) > h: Thus, unless " is uncorrelated with y , the regression score will be serially correlated, as t t suggested by the early LP literature. However, in an LP without controls, researchers are generally interested in rejecting the null hypothesis that (cid:18) is zero. Under the null hypothesis h that the (cid:18) = 0 for all h, the regression score is uncorrelated in population, and Huber-White h standard errors are asymptotically valid for the purposes of hypothesis testing. 14Olea and Plagborg-Møller (2020) use lag augmentation to achieve (population) residualized regressors, whereas our setup does not require this step because of Assumption 2. 18

HAR standard errors are often justified by researchers as being somehow more conservativethanstandarderrorsthatdonotcontrolforautocorrelation. WhileHARstandarderrors are asymptotically valid even when Huber-White will do, using popular HAR estimators, like the Newey-West estimator, can dramatically reduce the size of the estimated standard errors. The reason is that these estimators rely on estimators of the long-run variance (LRV) of the regression score that are functions of the autocovariance of the regression score. In a linear regression like an LP, standard errors are typically a function of the LRV and an estimator of E[x x′]. In fact, when one has " ? c , it is enough to focus on the estimator of the LRV t t t t to analyze the size of the standard errors. In finite samples, estimates of the autocovariance of the regression score are typically downward biased. Thus, when the actual autocorrelation function is zero, the bias in the autocorrelation function makes it negative, on average, in finite samples. We derive a simple expression for this downward bias in the case of an LP without controls in Appendix B. In the case of LP with controls, where the autocovariance function of the regression score is zero in population, the downward bias means that increasing the bandwidth of a Newey- West estimator, on average, reduces the size of the estimated LRV. In the case of LP without controls, the downward bias means increasing the bandwidth reduces the size of the standard errors, on average, even though the regression score is autocorrelated. Thus, HAR standard errors based on estimated autocovariances of the regression score will tend to underestimate b the uncertainty associated with (cid:18) . h;LS To explore the effects of the downward bias in standard errors, we consider Monte Carlo evidence using our AR(1) example. Figure 7 shows the true values of the LRV for the regression score in an LP without controls (panels (a) and (b)) and in an LP with controls (panels (c) and (d)). The horizon of the LP is h = 5. The figures also show the Monte Carlo average of Newey-West estimators that use a bandwidth u. The left panels (panels (a) and (c)) show the estimators in the case when (cid:18) = 0 for all h—that is, the shock is noise h uncorrelated with y —which is typically the maintained null hypothesis to reject. The right t panels (panels (b) and (d)) show the estimators in the case when the shock is correlated with y , so (cid:18) = (cid:26)h.15 t h Several features of these figures are woth noting. First, in all cases the Newey-West estimators dramatically under-estimate the long-run variance. Second, increasing the band- 15We keep the variance of y fixed across all of our specifications. t 19

(a) Without controls, (cid:18) = 0 (b) Without controls, (cid:18) = (cid:26)h h h True LRV 12 12 True LRV 10 10 8 8 T=100 6 6 T=100 4 4 T=50 T=50 2 2 0 0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 Newey-West bandwidth (u) Newey-West bandwidth (u) (c) With controls, (cid:18) = 0 (d) With controls, (cid:18) = (cid:26)h h h 5 5 True LRV True LRV 4 4 T=100 3 3 T=100 2 T=50 2 T=50 1 1 0 0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 Newey-West bandwidth (u) Newey-West bandwidth (u) Figure7: EstimatorsofstandarderrorsinanLPwithh = 5arebiasedinempirically-relevant samples when y is an AR(1) with (cid:26) = 0:95. t width of the Newey-West estimator tends to reduce the estimate of the LRV. Third, even when there is autocorrelation in the population regression score (panel (b)), in empirically relevant sample sizes, increasing the bandwidth of the the Newey-West estimator does little to improve the estimate of the long-run variance and can reduce that estimate. Fourth, the Huber-White estimators, which are given by the Newey-West estimator with u = 0, are also downward biased. However, the Huber-White estimator is not further attenated by the estimates of the autocovariances of the regression score included in the Newey-West estimator. 20

Often, standarderrorsareinterpretedasprovidingacredibleregionaroundtheestimated impulse response without reference to a null hypothesis. To this end, it is also useful to considertheimplicationsof biasinpointestimatesandstandard errorsunderthe maintained assumption that (cid:18) ̸= 0. We analyze this issue using Monte Carlo analysis and an AR(1) h generating process for y . The OLS estimator of the impulse response function is biased t down and typical estimators of standard errors are also biased down. Both sources of bias may make confidence intervals fail to contain the true impulse response function. Using Huber-White and Newey-West standard errors, Table 1 displays the percentage of confidence sets with 95% nominal coverage probability (based on the asymptotic normal approximation) that contain the true impulse response function at horizon h in a Monte Carlo simulation of an LP without controls where y is an AR(1) with (cid:26) = 0:95 and a sample t size of T = 50.16 Several features of these results are worth noting. First, implementing our proposed bias correction and using controls can increase coverage probabilities. Second, even with our bias correction and controls, the coverage probabilities are markedly lower than 95%. Third, the Huber-White standard errors appear to perform better than the Newey-West standard errors. Overall, our analysis of standard errors indicates that bias in LPs is an important problem to address when conducting inference. In small samples, Huber-White standard errors are preferable to HAR standard errors when using critical values from the normal asymptotic limiting distribution. When researchers are interested in credible regions around point b estimates, implementing our bias adjustment to (cid:18) can improve the coverage probabilities. h;LS 16When constructing the Newey-West estimator, we use a bandwidth of 0:7(T (cid:0)h)1=3. We use this bandwidth as the “textbook” choice from Lazarus et al. (2018). When one uses the fixed-b critical values suggested by Lazarus et al. (2018), the performance of the Newey-West estimator improves somewhat (see ourAppendix),thoughthecoverageprobabilitiesarenotuniformlybetterthanthosefromtheHuber-White standard errors. Fixed-b asymptotics involve using a larger bandwidth for the Newey-West estimator and larger critical values than those implied by the asymptotic normal approximation. It turns out that the bandwidth is not that much larger in sample sizes typically found in the literature. As a result, it is not surprising that using a larger critical value improves the coverage probabilities given that the confidence intervals are too small. 21

b Table 1: Coverage probability of different estimators of standard errors for (cid:18) in LP when h y is an AR(1) with (cid:26) = 0:95 and T = 50 t b b b (cid:18) , no controls (cid:18) , no controls (cid:18) , controls h;LS h;BCC h;BCC h HW NW HW NW HW NW 0 0.87 0.82 0.86 0.82 0.92 0.91 1 0.83 0.80 0.82 0.80 0.90 0.88 2 0.80 0.77 0.79 0.78 0.87 0.86 3 0.78 0.75 0.76 0.75 0.85 0.83 4 0.76 0.73 0.75 0.73 0.83 0.81 5 0.75 0.72 0.74 0.72 0.81 0.79 6 0.75 0.72 0.73 0.71 0.80 0.78 7 0.74 0.70 0.73 0.70 0.78 0.76 8 0.74 0.71 0.73 0.70 0.77 0.75 9 0.74 0.71 0.73 0.70 0.76 0.74 10 0.74 0.71 0.73 0.70 0.75 0.74 Note: HW stands for Huber-White standard errors, NW stands for Newey-West standard errors. 6 Application to monetary policy shocks In this section we provide empirical examples of estimated bias in LPs and also show the sensitivity of inference to the construction of standard errors. We highlight three examples from the literature on the effect of monetary policy shocks on the macroeconomy. These shocks are constructed using either the narrative approach of Romer and Romer (2004) or through asset prices as pioneered by Kuttner (2001). 6.1 The effects of monetary policy shocks on output and inflation Using a setup similar to Gorodnichenko and Lee (2019), we estimate the effects of Romer and Romer (2004) monetary policy shocks on output and inflation.17 The data sample runs from 1969:Q1-2008:Q4. We estimate LPs of the form in equation (2) on real output growth 17The shock series was extended to 2008 by Coibion et al. (2017). 22

(a) Inflation (b) Output 0.25 2 Bias Corrected Bias Corrected Least Squares Least Squares 0.00 1 0.25 0 0.50 1 0.75 2 1.00 3 1.25 4 1.50 0 5 10 15 20 0 5 10 15 20 h h Figure 8: The effect of monetary policy shocks. and annualized inflation. We include controls consisting of four lags of real output growth, inflation, the federal funds rate, and the monetary policy innovation.18 The estimated impulse responses of inflation and output to a monetary policy shock are displayed in Figure 8. As in Gorodnichenko and Lee (2019), we cumulate the impulse response of output growth. Figure 8 also shows the bias-corrected estimate of the impulse response. To focus attention on the difference between the two impulse responses in this illustrative example, we omit confidence bands. The estimated inflation impulse response roughly accords with Gorodnichenko and Lee (2019): a contractionary 100 basis-point monetary policy shock causes inflation to be little changed for the first few periods after the shock and then eventually decline persistently. The bias-corrected impulse response indicates that inflation responds somewhat more to a monetary policy shock than under the conventional estimates. On average, the response of inflation is about 15 basis points lower in the bias-corrected impulse response, a moderate but nontrivial difference. The bias-corrected estimator is lower than the least-squares LP estimator because the estimated least-squares impulse response is negative for almost hori- 18The LPs here are slightly different from the ones in Gorodnichenko and Lee (2019) in two ways. First, weusey t+h ratherthany t+h (cid:0)y t(cid:0)1 . Thebiasdiscussedinthispaperisstillpresentunderthelatterformula. Second, we omit TFP innovations because the objective here is not to study relative variance contributions. Taken together, these differences lead to only minor changes in the estimated LPs. 23

zons, and the bias-correction is a weighted average of the impulse responses at all previous horizons. The estimated output impulse response also broadly accords with the results in Gorodnichenko and Lee (2019): a contractionary 100 basis-point monetary policy shock causes the level of output to contract by about 4 percent after two and half years, after which effects of the shock slowly dissipate. Notably, the bias-corrected estimator implies the output decline is about 1/2 percentage point larger. Essentially, this is because the bias-correction at, say, horizon h = 15 is influenced by the LP coefficients at previous horizons. For both inflation and output, the corrections are moderate but economically meaningfully. The corrections are larger at longer horizons, a consequence of the single-sided nature of our bias correction, given in equation (8).19 It is perhaps not surprising that the bias correction is moderate given that the sample size used here is markedly larger than those typically found in the LP literature. 6.2 State dependence in the effects of monetary policy shocks One of the key advantages of LPs, relative to other popular methods in macroeconomics, are their ability to handle nonlinearities. Tenreyro and Thwaites (2016) use a “smoothly transitioninglocalprojectionmodel”toinvestigatewhethermonetarypolicyhaslargereffects during recessions or expansions. The LP specification is given by y = (cid:28)t+F(z )((cid:11)b +(cid:18)b" +(cid:13)b′ c )+(1(cid:0)F(z ))((cid:11)r +(cid:18)r" +(cid:13)r′ c )+u : (13) t+h t h h t t t h h t t t Here y is the endogenous variable of interest (output), c is a vector of controls and F(z ) t+h t t is a smooth increasing function of an indicator of the state of the economy z . The value t of F(z ) is exogenous from the perspective of the regression. The shock " is once again a t t variant of the Romer and Romer (2004) measure. The coefficients of interest are (cid:18)b and h (cid:18)r, the effects of a monetary policy shock at horizon h in a boom (b) and recession (r), h respectively. 19Anearlierworkingpaperversionofthispaperpresentedaslightlydifferentformulaforthebiascorrection inthecaseofLPwithcontrols. Whilebotharevalid,thecorrectioninthepreviousversionforhorizonhwas influenced by estimates of impulse responses at all horizons. The current formula depends only on horizons j (cid:20)h. This is much easier to implement in practice. 24

Since F(z ) is exogenous, we can estimate this LP model by OLS. We proceed by first t removing the trend and estimate (13) and then omit the linear trend in the regression specification.20 We estimate the model on quarterly data using a sample that runs from 1969:Q1 to 2002:Q4. Following, Tenreyro and Thwaites (2016), the control variables c are t one lag of detrended output and the federal funds rate. Table 2: State-Dependent Effects of Monetary Policy Shocks Horizon (cid:12) ^b (cid:12) ^r Diff. (cid:12) ^b (cid:12) ^r Diff. LS LS BC BC Point Est. -1.27 -0.70 -0.58 -1.36 -0.80 -0.56 h = 9 NW SE ( 0.39) ( 0.29) ( 0.59) ( 0.39) ( 0.29) ( 0.58) HW SE ( 0.45) ( 0.64) ( 0.89) ( 0.46) ( 0.65) ( 0.90) Point Est. -1.49 -0.41 -1.08 -1.61 -0.52 -1.09 h = 10 NW SE ( 0.37) ( 0.25) ( 0.53) ( 0.37) ( 0.28) ( 0.55) HW SE ( 0.50) ( 0.74) ( 1.01) ( 0.51) ( 0.75) ( 1.02) Point Est. -1.10 -0.38 -0.73 -1.26 -0.49 -0.77 h = 11 NW SE ( 0.38) ( 0.34) ( 0.64) ( 0.39) ( 0.39) ( 0.67) HW SE ( 0.60) ( 0.72) ( 1.05) ( 0.60) ( 0.73) ( 1.06) Point Est. -0.64 -0.47 -0.17 -0.81 -0.58 -0.23 h = 12 NW SE ( 0.45) ( 0.44) ( 0.80) ( 0.46) ( 0.49) ( 0.85) HW SE ( 0.65) ( 0.62) ( 1.01) ( 0.64) ( 0.64) ( 1.02) Notes: Table shows the LS and BC estimates of (cid:12)b and (cid:12)r with Newey-West (NW) and Huber-White (HW) standard errors. Table 2 shows the LS and BC estimates of with (cid:12)b and (cid:12)r, along with with Newey- West and Huber-White standard errors in parenthesis, for h = 9;:::;12. The coefficient estimates can be interpreted as the percentage point effect on real output in period t + h of a 100 point monetary policy shock at time t when the economy is definitely in either a boom or recession, respectively. The point estimates under both LS and BC indicate that positive shocks occuring during booms are more contractionary than identically-sized shocks occuring during recessions, consistent with Tenreyro and Thwaites (2016). Relative to the LS estimates, the bias-corrected estimates are more negative for both (cid:12)b and (cid:12)r, consistent h h 20Results are similar under alternative ways of detrending output. 25

with the earlier simulation study. The difference between the coefficients for booms and recessions is larger when using the bias-corrected estimators. That is, there is moderately more evidence for differential effects of monetary policy shocks depending on the state of the business cycle. As in Tenreyro and Thwaites (2016), the uncertainty surrounding these estimates is large. Notably, the Huber- White standard errors are markedly larger than the Newey-West standard errors, consistent with our analysis in Section 5. For a number of the entries in Table 2, conclusions about statisticalsignificancewouldbedifferentusingtheHuber-Whitestandarderrorsascompared to the Newey-West standard errors. 6.3 Time dependence in the effect of monetary policy shocks Our final example assesses the evidence for a change in transmission of monetary policy over the early part of the 2000s. Lunsford (2020) identifies two monetary policy shocks from high frequency movements in assets in prices. The first shock is to the level of current federal fundsrateandthesecondshock, theforward guidance shock, istothemarket’sexpectedpath of federal funds rate beyond the current month. Lunsford (2020) argues that the responses of asset prices and macroeconomic aggregates to forward guidance shocks changed in 2003, as the policy statement issued by the Federal Open Market Committee (FOMC) following monetary policy decisions shifted to emphasize future policy inclinations rather than risks to the economic outlook. Using the identified federal funds rate ("FFR) and forward guidance t "FG shocks, Lunsford (2020) effectively estimates LPs of the form t y (cid:0)y = (cid:11) +(cid:18)FFR"FFR +(cid:18)FG"FG +u ; h = 1;:::;H: (14) t+h t h h t h t t;t+h where y is a macroeconomic aggregate. The focus here will be on estimates of the coeffit+h cient (cid:18)FG and whether they have changed over two samples of monthly data from February h 2000 to June 2003 and August 2003 to May 2006. Following Lunsford (2020), the horizon of interest is h = 12, so (cid:18)FG measures the effect of a forward guidance shock one year after its 12 realization (after netting the contemporaneous effect). The size of the two samples deserves emphasis as they are 28 and 23 observations, respectively. These are extremely small—about half of the smallest sample considered in the Monte Carlo simulations in this paper.21 21For ease of exposition, the paper assumes that the econometrician observes T total observations of both y and" (andpotentiallycontrols.) SothereareT(cid:0)hobservationsforthehthhorizonLP.Insomepractical t t 26

Before describing the regression results, we note two things about the LP model in (14). First, subtracting y from the dependent variables tends to the reduce the parameter bias t substantially. The model in equation (14) is essentially an LP without controls. One can see, either from the trajectories in Figure 2 or the expression in equation (4), that in an LP without controls a sizable portion of the bias associated with least squares estimate of (cid:18) at h horizons greater than zero is accounted for by the bias in (cid:18) . Informally, subtracting y from 0 t y out removes this portion of the bias. t+h Table 3 displays (cid:18) ^FG from the LP model in equation (14) along with Newey-West 12;LS and Huber-White standard errors over the two samples for three dependent variables: the growth in real personal consumption expenditures (PCE), unemployment rate changes, and the growth in industrial production (IP). The Newey-West standard errors are computed using u = 10 lags. The point estimates, as mentioned above, are computed using least squares, as the bias correction would likely be minor when using y (cid:0)y as the dependent t+h t variable. The point estimates indicate that while contractionary–i.e, positive—forward guidance shocks were associated with increases in real PCE growth and a fall in the unemployment rate in the first sample, they were associated with a fall in consumption growth and an increase in unemployment over the second sample. The point estimates associated with IP are both negative. Consistent with standard practice, Lunsford (2020) uses Newey-West standard errors, which are replicated in Table 3; when using the fixed-b asymptotic critical values from Sun (2014), which Table 3 employs, the coefficients associated with PCE growth and the unemployment rate are moderately statistically significant. Table 3 also displays the Huber-White standard errors. With one exception—the unemployment rate in the August to May 2006 sample—these are larger than the Newey-West standard errors, sometimes substantially so. The standard error estimates associated with PCE growth and IP growth in the second sample increases from 3.82 to 6.15 and 4.37 to 10.90, respectively.22 applications,theeconometricianislimitedonlybytheobservationsofshock. Thatis,theyobserveasample of size T +H of y and a sample of size T of " , where H is the maximum horizon considered. In this case, t t thereareT observationsforeachofthehthhorizonLPs–thisisthecaseinLunsford(2020). Usingaconstant rather than shrinking sample size results in extremely minor modifications of the analytic expressions and essentially no changes to the Monte Carlo simulations in this paper. 22In this case, whether the critical values used to assess statistical significance should change depending on the construction of the standard errors is complicated and requires an explicit statement of the null 27

Table 3: Response to Forward Guidance Shock Feb. 2000 to Jun. 2003 Aug. 2003 to May 2006 PCE Growth Results Point Estimate 2.88 -10.51 NW SE (2.26) (3.82) HW SE (2.61) (6.15) Unemployment Results Point Estimate -4.01 3.49 NW SE (1.13) (1.24) HW SE (2.45) (1.04) IP Growth Results Point Estimate -9.87 -10.10 NW SE ( 8.53) ( 4.37) HW SE (12.03) (10.90) Notes: The Table shows (cid:18) ^FG from the LP model in (14) along with Newey-West (NW) 12;LS and Huber-White (HW) standard errors. The NW standard errors are computed using a bandwidth of u = 10. As argued in Section 5, in small samples, even when appropriate, HAC standard errors tend to underestimate the long-run variance relative to the Huber-White estimator. Note that this is true even when the estimate of the coefficient of interest exhibits little bias itself, as we have argued is plausible in this case. The Huber-White standard errors indicate that the uncertainty surrounding these regression coefficients is considerably larger than implied the Newey-West standard errors. 7 Conclusion WehaveshownthatLPscanbeseverelybiasedinsamplesizescommonlyfoundintherelated literature. We derived an approximate bias function that shows that LPs are intimately hypothesis. In any event, given such small sample sizes, relying on critical values associated with limiting distributions could be problematic. 28

linked across horizons in small samples. The bias of LPs persists even when researchers have access to a large cross-section (panel data). We used our approximate bias function to bias correct LPs. In Monte Carlo, analysis our bias correction does not completely correct for the bias in LPs. These results suggest that other time series models with well-understood, effective methods for bias correction (such as VARs) may be better alternatives for estimated impulse responses if researchers have data samples in the time dimension that are similar to those typically found in empirical macroeconomic research. In particular, specifying time series models that are generative for the time series of interest would allow researchers to use likelihood methods. We also analyzed bias in standard errors computed for estimated impulse response functions from LPs. We showed that, in small samples, standard errors that rely on estimated autocovariances of the regression score, like the Newey-West estimator, typically understate the amount of uncertainty surrounding the estimated impulse response functions. We argued that, in LPs similar to those we consider, researchers should prefer standard errors that are heteroskedasticity consistent, but not autocorrelation robust. Recent work on standard errors in time series regression has focused on limiting distributionsotherthanthenormaldistribution(seeSun(2014)andLazarusetal.(2018)). However, with samples typically found in the LP literature, it is difficult to appeal to limiting critical values as accurate approximations. As a result, if researchers are going to use HAR standard errors, they may want to check to see if Huber-White standard errors would lead to different conclusions. If the Huber-White standard errors are larger than the HAR standard errors, researchers should consider what might lead to the apparent negative autocovariance in the regression score. Without another reasonable theory, it may be that the negative estimates of the autocovariance of the regression score are the result of small sample bias. References Acemoglu, D., S. Naidu, P. Restrepo, and J. A. Robinson (2019): “Democracy Does Cause Growth,” Journal of Political Economy, 127, 47–100. Anatolyev, S. (2005): “GMM, GEL, Serial Correlation, and Asymptotic Bias,” Econometrica, 73, 983–1002. 29

Bao, Y. and A. Ullah (2007): “The Second-Order Bias and Mean Squared Error of Estimators in Time-Series Models,” Journal of Econometrics, 140, 650 – 669. Coibion, O., Y. Gorodnichenko, L. Kueng, and J. Silvia (2017): “Innocent Bystanders? Monetary Policy and Inequality,” Journal of Monetary Economics, 88, 70–89. Gorodnichenko, Y. and B. Lee (2019): “Forecast Error Variance Decompositions with Local Projections,” Journal of Business & Economic Statistics, 0, 1–24. Hahn, J. and G. Kuersteiner(2002): “AsymptoticallyUnbiasedInferenceforaDynamic Panel Model with Fixed Effects When Both n and T are Large,” Econometrica, 70, 1639– 1657. Hall, P. (1992): The Bootstrap and Edgeworth Expansion, Springer New York. Hall, P., J. L. Horowitz, and B.-Y. Jing(1995): “OnBlockingRulesfortheBootstrap with Dependent Data,” Biometrika, 82, 561–574. Jordà, Ò. (2005): “Estimation and Inference of Impulse Responses by Local Projections,” American Economic Review, 95, 161–182. Kendall, M. G. (1954): “Note on Bias in the Estimation of Autocorrelation,” Biometrika, 41, 403–404. Kilian, L. (1998): “Small-Sample Confidence Intervals for Impulse Response Functions,” Review of economics and statistics, 80, 218–230. Kilian, L. and Y. J. Kim (2011): “How Reliable are Local Projection Estimators of Impulse Responses?” Review of Economics and Statistics, 93, 1460–1466. Kuttner, K. N. (2001): “Monetary Policy Surprises and Interest Rates: Evidence From the Federal Funds Futures Market,” Journal of Monetary Economics, 47, 523–544. Lazarus, E., D. J. Lewis, J. H. Stock, and M. W. Watson (2018): “HAR Inference: Recommendations for Practice,” Journal of Business & Economic Statistics, 36, 541–559. Lunsford, K. G. (2020): “Policy Language and Information Effects in the Early Days of Federal Reserve Forward Guidance,” American Economic Review, 110, 2899–2934. 30

Lusompa, A. (2019): “Local Projections, Autocorrelations, and Efficiency,” Unpublished Manuscript, UC-Irvine. Nakamura, E. and J. Steinsson (2018): “Identification in Macroeconomics,” Journal of Economic Perspectives, 32, 59–86. Newey, W. K. and K. D. West(1987): “ASimple, PositiveSemi-Definite, Heteroskedasticity and Autocorrelation,” Econometrica, 55, 703–708. Nickell, S. (1981): “Biases in Dynamic Models with Fixed Effects,” Econometrica, 49, 1417–1426. Olea, J. L. M. and M. Plagborg-Møller (2020): “Local Projection Inference is Simpler and More Robust Than You Think,” . Plagborg-Møller, M. and C. K. Wolf(2019): “LocalProjectionsandVARsEstimate the Same Impulse Responses,” Unpublished Manuscript, Princeton University. Rilstone, P., V. Srivastava, and A. Ullah(1996): “TheSecond-OrderBiasandMean Squared Error of Nonlinear Estimators,” Journal of Econometrics, 75, 369 – 395. Romer, C. D. and D. H. Romer (2004): “A New Measure of Monetary Shocks: Derivation and Implications,” American Economic Review, 94, 1055–1084. Shaman, P. and R. A. Stine(1988): “TheBiasofAutoregressiveCoefficientEstimators,” Journal of the American Statistical Association, 83, 842–848. Sun, Y. (2014): “Let’s Fix It: Fixed-b Asymptotics Versus Small-b Asymptotics in HeteroskedasticityandAutocorrelationRobustInference,”JournalofEconometrics,178,659– 677. Tenreyro, S. and G. Thwaites (2016): “Pushing on a String: US Monetary Policy is Less Powerful in Recessions,” American Economic Journal: Macroeconomics, 8, 43–74. 31

A Derivations of approximate bias In this appendix, we derive our expressions for the bias of the LP estimators studied in our paper. To do so, we employ the framework proposed by Rilstone et al. (1996) and extended to time series models by Bao and Ullah (2007). These papers derive expressions for finite-sample moments for a wide class of estimators via an approximation of an estimator (cid:12)^ of the form: (cid:12)^(cid:0)(cid:12) =a(cid:0)1=2 +a(cid:0)1 +O p (T (cid:0)3=2): (A.1) It can be verified that under Assumption 1, combined with the least squares estimation framework, satisfies the necessary assumptions of Rilstone et al. (1996). Assumptions 2 and 3 allow one to obtain tractable expressions. In our derivation, we use the notation of Bao and Ullah (2007) where possible. For each derivation, we will cast the OLS estimator as a GMM problem with moment conditions given by q((cid:12);w ), where the data t vector w = [y ;x′] ′ for the LP models with and without controls. For panel settings, the data vector is t t t expanded to include the additional observables. The objective function is thus given by: T∑(cid:0)h 1 T(cid:0)h ((cid:12);W 1:T )= T (cid:0)h q((cid:12);w t ): t=1 Let ▽iA((cid:12)) be the matrix of ith order partial derivatives of A with respect to (cid:12). In what follows, write T(cid:0)h ((cid:12);W 1:T ) as T(cid:0)h and q((cid:12);w t ) as q t . Define the series of matrices H i =▽i T(cid:0)h and H i =E[H i ] with Q=H (cid:0) 1 1 ;V =H 1 (cid:0)H 1 and W =H 2 (cid:0)H 2 : Bao and Ullah (2007) show that the expressions for the terms in (A.1) are given by: [ ] 1 a(cid:0)1=2 =(cid:0)Q T(cid:0)h and a(cid:0)1 =(cid:0)QVa(cid:0)1=2 (cid:0) 2 QH 2 a(cid:0)1=2 (cid:10)a(cid:0)1=2 : [ ] [ ] We are interested in computed in the bias, that is E (cid:12)^(cid:0)(cid:12) . It is obvious that E a(cid:0)1=2 = 0. Moreover, because the moment conditions associated with the LP estimator are linear in (cid:12), we have H =H =0. It 2 2 can be verified that the expression for the bias to O(T(cid:0)1) thus simplifies considerably to [ ] E (cid:12)^(cid:0)(cid:12) (cid:25)E[QH 1 Q T(cid:0)h ]:=B: (A.2) Our objective is to derive the element of the vector B that is associated with the shock of interest " for the LP model with and without controls. Before proceeding with introduce notation to define first and second moments of the shocks, " , and controls, c : t t [ ] [ ] (cid:22) =E[" ]; (cid:27)2 =E (" (cid:0)(cid:22) )2 ; (cid:22) =E[c ]; and (cid:6) =E (c (cid:0)(cid:22) )(c (cid:0)(cid:22) ) ′ : " t " t " c t c t c t c A.1 LP without controls For the LP without controls, the moment conditions associated with the OLS estimator are defined as 2 3 y (cid:0)(cid:11) (cid:0)(cid:18) " q (cid:17)4 t+h h h t 5 ; t " (y (cid:0)(cid:11) (cid:0)(cid:18) " ) t t+h h h t 1

wherewehavedepartedslightlyfromthenotationofBaoandUllah(2007)indefiningtheparametervector. Beforeconstructingthematricesofderivatives,notethatwecanrearrangethemomentconditionstodeduce: E[(" (cid:0)(cid:22) )(y (cid:0)(cid:11) )] E[(" (cid:0)(cid:22) )(y (cid:0)(cid:11) )] (cid:18) = t " t+h h = t " t+h h h E[(" (cid:0)(cid:22) )2] (cid:27)2 t " " Additionally for s<t, we have E[(" (cid:0)(cid:22) )(y (cid:0)E [y ]+E [y ](cid:0)(cid:11) )] E[(" (cid:0)(cid:22) )(y (cid:0)E [y ](cid:0)(cid:11) )] (cid:18) = t " t+h s t+h s t+h h = t " t+h s t+h h (A.3) h (cid:27)2 (cid:27)2 Where the second equality follows from the fact that E [" ]=(cid:22) for all s<t. It is easy to see that: s t 2 3 2 3 2 ( ) 3 H 1 = T (cid:0) 1 h T∑ t= (cid:0) 1 h 4 (cid:0) (cid:0) " 1 t (cid:0) (cid:0) " " 2 t t 5 ; H 1 = 4 (cid:0) (cid:0) (cid:22) 1 " (cid:0) ( (cid:27) (cid:0) " 2 (cid:22) + " (cid:22)2 " ) 5 ; and Q= 4 (cid:0) 1 (cid:27) (cid:22) + 2 " (cid:27) (cid:22) " 2 " 2 (cid:0) (cid:27) (cid:22) (cid:27) " 2 " 1 2 5 : " " Tedious algebra yields: 2 ( ( ) ) ( ) 3 2 QH 1 Q= T (cid:0) 1 h T∑ t= (cid:0) 1 h6 4 (cid:0) 1 (cid:0) (cid:0) (cid:27) 1 2 " 2 (cid:27) (cid:22) ( " 2 " " t (" (cid:0) t (cid:0) (cid:22) " (cid:22) ) " + )+ ((cid:27) (cid:22) " 2 " ) (cid:27) (cid:22) 2 " 2 " (" t (cid:0) (" t (cid:22) (cid:0) " )2 (cid:22) " )2 (cid:0) (cid:27) 1 " 2 (" t (cid:0) (cid:0) ((cid:27) (cid:22) 1 " 2 " )2 )(cid:0) (" t ((cid:27) (cid:0) (cid:22) " 2 " ) (cid:22) 2 " ( ) " 2 t (cid:0)(cid:22) " )2 7 5: Recallthatweonlyneedtoobtainthesecondelementof B whichcorrespondswiththebiasassociatedwith (cid:18)^ h : Thus, we only need to calculate [QH 1 Q] 2;(cid:1) T(cid:0)h . This is given by: ( ) T∑(cid:0)h T∑(cid:0)h 1 1 (cid:22) [QH 1 Q] 2;(cid:1) T(cid:0)h = (T (cid:0)h)2 (cid:0) (cid:27)2 (" t (cid:0)(cid:22) " )+ ((cid:27)2 " )2 (" t (cid:0)(cid:22) " )2 (y t+h (cid:0)(cid:11) h (cid:0)(cid:18) h " t ) t=1 " " t=1 T∑(cid:0)h T∑(cid:0)h 1 1 (cid:0) (" (cid:0)(cid:22) )2 " (y (cid:0)(cid:11) (cid:0)(cid:18) " ) (T (cid:0)h)2 ((cid:27)2)2 t " t t+h h h t t=1 " t=1 T∑(cid:0)hT∑(cid:0)h 1 1 1 = (cid:0) (" (cid:0)(cid:22) )(y (cid:0)(cid:11) (cid:0)(cid:18) " )(cid:0) (" (cid:0)(cid:22) )2(" (cid:0)(cid:22) )(y (cid:0)(cid:11) (cid:0)(cid:18) " ) (T (cid:0)h)2 (cid:27)2 t " s+h h h s ((cid:27)2)2 t " s " s+h h h s t=1 s=1 " " T∑(cid:0)hT∑(cid:0)h 1 = (ϕ (t;s)+ϕ (t;s)) (T (cid:0)h)2 I II t=1 s=1 Consider the expectation of the term ϕ . We have: I 8 >< 0 if t=s E[ϕ (t;s)]= I >:(cid:0)(cid:18) h+s(cid:0)t otherwise : Now consider the expectation of the term ϕ , II 1 E[ϕ (t;s)]= E[(" (cid:0)(cid:22) )2(" (cid:0)(cid:22) )(y (cid:0)(cid:11) (cid:0)(cid:18) " )] II ((cid:27)2)2 t " s " s+h h h s " 1 = E[(" (cid:0)(cid:22) )2E[(" (cid:0)(cid:22) )(y (cid:0)(cid:11) (cid:0)(cid:18) " )j" ]] ((cid:27)2)2 t " s " s+h h h s t " 2 2 0 1 33 1 6 6 B C 77 = ((cid:27)2)2 E4(" t (cid:0)(cid:22) " )2E4(" s (cid:0)(cid:22) " )@ | y s+h (cid:0)(cid:11) h(cid:0)(t(cid:0){sz) (cid:0)(cid:18) h(cid:0)(t(cid:0)s) " }t + | (cid:11) h(cid:0)(t(cid:0)s) +(cid:18) h(cid:0)(t{(cid:0)zs) " t (cid:0)(cid:11) h (cid:0)(cid:18) h " }s Aj" t 55: " (cid:14)I(t;s) (cid:14)II(t;s) 2

ByAssumption2," doesnotenterinto(cid:14) (t;s). Moreover,underAssumption2,conditioningonitdoesnot t I convey any useful information about " or other components of y . s t+s E[(" (cid:0)(cid:22) )(cid:14) (t;s)j" ]=E[(" (cid:0)(cid:22) )(cid:14) (t;s)]: (A.4) s " I t s " I Direct calculation yields: 8 >< 0 if t=s E[(" (cid:0)(cid:22) )(cid:14) (t;s)]= s " I >: (cid:18) (cid:27)2 otherwise: h " Consider now the term involving (cid:14) (t;s). By direct calculation: II 8 >< 0 if t=s E[(" (cid:0)(cid:22) )(cid:14) (t;s)j" ]= s " II t >:(cid:0)(cid:18) (cid:27)2 otherwise: h " Thus E[ϕ (t;s)]=0 for all t and s. Combining these results, we have: II T∑(cid:0)hT∑(cid:0)h 1 E[[QH 1 Q] 2;(cid:1) T(cid:0)h ]= (T (cid:0)h)2 (cid:0)(cid:18) h+s(cid:0)t 1ft̸=sg: t=1 s=1 Tedious arithmetic confirms that: ( ) T(cid:0)∑h(cid:0)1 1 j B =(cid:0) T (cid:0)h 1(cid:0) T (cid:0)h ((cid:18) h+j +(cid:18) h(cid:0)j ): j=1 This delivers equation (4). A.2 LP with controls In the case of controls, we have x = [" ;c ]′, so the moment conditions associated with the OLS estimator t t t are defined so that 2 3 2 3 y (cid:0)(cid:11) (cid:0)x′(cid:12) y (cid:0)(cid:11) (cid:0)x′(cid:12) 6 t+h h t h 7 q t (cid:17)4 x ( t y +h (cid:0)(cid:11) h (cid:0)x t ′(cid:12) h ) 5 = 6 4 " t (y t+h (cid:0)(cid:11) h (cid:0)x′ t (cid:12) h ) 7 5: t t+h h t h c (y (cid:0)(cid:11) (cid:0)x′(cid:12) ) t t+h h t h Recall the object of interest, (cid:18) , is the first element of (cid:12) . We have that h h 2 3 2 3 (cid:0)1 (cid:0)" (cid:0)c′ (cid:0)1 (cid:0)(cid:22) (cid:0)(cid:22)′ H 1 = T (cid:0) 1 h T∑(cid:0)h6 6 4 (cid:0)" t (cid:0)"2 t t (cid:0)" t c t(cid:0) ′ t(cid:0) 1 1 7 7 5; H 1 = 6 6 4 (cid:0)(cid:22) " (cid:0) ( (cid:27) " 2+ " (cid:22)2 " ) (cid:0)(cid:22) " (cid:22) c ′ c 7 7 5; t=1 2 (cid:0)c t ( (cid:0)1 (cid:0)" t c t(cid:0)1 (cid:0)c t(cid:0) ) 1 c′ t(cid:0)1 3 (cid:0)(cid:22) c (cid:0)(cid:22) " (cid:22) c (cid:0)((cid:6) c +(cid:22) c (cid:22)′ c ) 6 (cid:0) 1+ (cid:27) (cid:22)2 " 2 +(cid:22)′ c (cid:6)(cid:0) c 1(cid:22) c (cid:27) (cid:22) 2 " (cid:22)′ c (cid:6)(cid:0) c 1 7 6 " " 7 and Q=4 (cid:22)" (cid:0) 1 0 5: (cid:27)2 (cid:27)2 " " (cid:6)(cid:0)1(cid:22) 0 (cid:0)(cid:6)(cid:0)1 c c c 3

As with previous deriviation, we need only calculate the second row of QH Q. Direct calculation yields 1 2 3 ′ T∑(cid:0)h 6 1(cid:0) (cid:27) (cid:22) 2 " (" t (cid:0)(cid:22) " )(cid:0)(cid:22)′ c (cid:6)(cid:0) c 1(c t(cid:0)1 (cid:0)(cid:22) c ) 7 [QH 1 Q] 2;(cid:1) = T (cid:0) 1 h (cid:27) 1 2 (" t (cid:0)(cid:22) " ) 6 4 " (cid:27) 1 2 (" t (cid:0)(cid:22) " ) 7 5 : t=1 " (cid:6)(cid:0) c 1 " (c t(cid:0)1 (cid:0)(cid:22) c ) Then the second element of QH 1 Q T(cid:0)h is given by 2 3 ′ T∑(cid:0)h 6 1(cid:0) (cid:27) (cid:22) 2 " (" t (cid:0)(cid:22) " )(cid:0)(cid:22)′ c (cid:6)(cid:0) c 1(c t(cid:0)1 (cid:0)(cid:22) c ) 7 [QH 1 Q T(cid:0)h ] 2 = T (cid:0) 1 h (cid:27) 1 2 (" t (cid:0)(cid:22) " ) 6 4 " (cid:27) 1 2 (" t (cid:0)(cid:22) " ) 7 5 t=1 " (cid:6)(cid:0) c 1 " (c t(cid:0)1 (cid:0)(cid:22) c ) 2 3 y (cid:0)(cid:11) (cid:0)x′(cid:12) T∑(cid:0)h6 s+h h s h 7 (cid:2) T (cid:0) 1 h 6 4 " s (y s+h (cid:0)(cid:11) h (cid:0)x′ s (cid:12) h ) 7 5: s=1 c (y (cid:0)(cid:11) (cid:0)x′(cid:12) ) s s+h h s h Explicit calculation of this object yields: T∑(cid:0)hT∑(cid:0)h 1 1 [QH 1 Q T(cid:0)h ] 2 = (cid:0) (T (cid:0)h)2 (cid:27)2 (" t (cid:0)(cid:22) " )(y s+h (cid:0)(cid:11) h (cid:0)x ′ s (cid:12) h ) t=1 s=1 " [ ] 1 T∑(cid:0)hT∑(cid:0)h 1 2 (cid:0) (" (cid:0)(cid:22) ) (" (cid:0)(cid:22) )(y (cid:0)(cid:11) (cid:0)x ′ (cid:12) ) (T (cid:0)h)2 (cid:27)2 t " s " s+h h s h t=1 s=1 " T∑(cid:0)hT∑(cid:0)h (cid:0) (T (cid:0) 1 h)2 (cid:27) 1 2 (" t (cid:0)(cid:22) " )(c t(cid:0)1 (cid:0)(cid:22) c ) ′ (cid:6) (cid:0) c 1(c s(cid:0)1 (cid:0)(cid:22) c )(y s+h (cid:0)(cid:11) h (cid:0)x ′ s (cid:12) h ) t=1 s=1 " T∑(cid:0)hT∑(cid:0)h 1 =(cid:0) ϕ (t;s)+ϕ (t;s)+ϕ (t;s): (T (cid:0)h)2 I II III t=1 s=1 Consider first E[ϕ (t;s)]. Direct calculation yields: I 8 E[ϕ (t;s)]= >< (cid:18) h(cid:0)(t(cid:0)s) if s<t(cid:20)s+h I >: 0 otherwise: Note, unlike the no controls case, for t < s, E[ϕ (t;s)] = 0 as a direct consequence of Assumption 3. By I similarargumenttotheprevioussection,E[ϕ (t;s)]=0foralltands. Considertheexpectionofϕ (t;s): II III [ ] E[ϕ III (t;s)]=E (cid:27) 1 2 (" t (cid:0)(cid:22) " )(c t(cid:0)1 (cid:0)(cid:22) c ) ′ (cid:6) (cid:0) c 1(c s(cid:0)1 (cid:0)(cid:22) c )(y s+h (cid:0)(cid:11) h (cid:0)x ′ s (cid:12) h ) " By direct calculation, we have: 8 E[ϕ (t;s)]= >< (cid:18) h(cid:0)(t(cid:0)s) E [ (c t(cid:0)1 (cid:0)(cid:22) c ) ′ (cid:6)(cid:0) c 1(c s(cid:0)1 (cid:0)(cid:22) c ) ] if s<t(cid:20)s+h III >: 0 otherwise: Plugging these expectations into the expression for B delivers equation (8). 4

B Derivation of bias in standard errors We are going to consider a case where a researcher uses an LP without controls and wants to estimate the standard error of the estimator. To do so, the researcher needs to compute the variance of the regression score, which is composed of terms of the form [ ] (cid:13) h;u (cid:17)E " t (y t+h (cid:0)(cid:11) h (cid:0)" t (cid:12) h )(y t+h(cid:0)u (cid:0)(cid:11) h (cid:0)" t(cid:0)u (cid:12) h )" t(cid:0)u : Theresearcherwantstoreportstandarderrorsunderthenullhypothesisthatthecoefficientontheshockof interestiszero. Ifthishypothesisismaintainedatallhorizons,thentheresearcher’smaintainedassumption is that " ?y t s for all t and s. We will use this maintained assumption. In addition, we assume that E["8]<1 in order to t satisfytheassumptionsinRilstoneetal.(1996). Inthissection,foreaseofexposition,weassumeE[" ]=0, t b but this is without loss of generality. Notably, typical HAC estimators of the standard error of (cid:12) are h;LS functions only of (cid:13) , the sample size, and a bandwidth parameter. h;u Whenresearcherscalcluate(cid:13) (forexample,whencomputingNewey-Weststandarderrors), theytypih;u cally use plug-in estimators derived from the empirical regression scores. To understand how this procedure affects the small-sample properties of (cid:13)b , it is useful to think about estimating the regression coefficients h;u and and (cid:13) jointly. In this case h;u 2 3 (y (cid:0)(cid:11) (cid:0)" (cid:12) ) 6 t+h h t h 7 6 7 q t =4 " t (y t+h (cid:0)(cid:11) h (cid:0)" t (cid:12) h ) 5: " t (y t+h (cid:0)(cid:11) h (cid:0)" t (cid:12) h )" t(cid:0)u (y t+h(cid:0)u (cid:0)(cid:11) h (cid:0)" t(cid:0)u (cid:12) h )(cid:0)(cid:13) h;u Unlike our previous derivations, because of the (cid:12) (cid:12)′ term in q , the matrix H is not a matrix of zeros. h h t 2 2 3 (cid:0)1 0 0 6 7 E(▽q t )= 6 4 0 (cid:0)(cid:27) " 2 0 7 5 0 0 (cid:0)1 2 3 (cid:0)1 0 0 6 7 6 7 Q=4 0 (cid:0) 1 0 5 (cid:27)2 " 0 0 (cid:0)1 2 3 (y (cid:0)(cid:11) (cid:0)" (cid:12) ) T∑(cid:0)h 6 t+h h t h 7 Q T(cid:0)h(cid:0)u =(cid:0) T (cid:0) 1 h(cid:0)u 6 4 (cid:27) "t 2 (y t+h (cid:0)(cid:11) h (cid:0)" t (cid:12) h ) 7 5 " t=u+1 " t (y t+h (cid:0)(cid:11) h (cid:0)" t (cid:12) h )" t(cid:0)u (y t+h(cid:0)u (cid:0)(cid:11) h (cid:0)" t(cid:0)u (cid:12) h )(cid:0)(cid:13) h;u 5

2 3 1 0 0 6 7 T∑(cid:0)h QVQ T(cid:0)h(cid:0)u = 6 4 0 (cid:27) 1 2 0 7 5 (cid:2) T (cid:0) 1 h(cid:0)u ▽q t " t=1+u 0 0 1 2 3 (y (cid:0)(cid:11) (cid:0)" (cid:12) ) T∑(cid:0)h 6 s+h h s h 7 (cid:2) T (cid:0) 1 h(cid:0)u 6 4 (cid:27) "s 2 (y s+h (cid:0)(cid:11) h (cid:0)" s (cid:12) h ) 7 5 " s=1+u " s (y s+h (cid:0)(cid:11) h (cid:0)" s (cid:12) h )" s(cid:0)u (y s+h(cid:0)u (cid:0)(cid:11) h (cid:0)" s(cid:0)u (cid:12) h )(cid:0)(cid:13) h;u We care about the third row of QVQ T(cid:0)h(cid:0)u . Fix s and t, the terms in the third row are of the form [(cid:0)" t " t(cid:0)u (y t+h (cid:0)(cid:11) h (cid:0)" t (cid:12) h )(cid:0)" t " t(cid:0)u (y t+h(cid:0)u (cid:0)(cid:11) h (cid:0)" t(cid:0)u (cid:12) h )](y s+h (cid:0)(cid:11) h (cid:0)" s (cid:12) h ) [ ] " + (cid:0)" t "2 t(cid:0)u (y t+h (cid:0)(cid:11) h (cid:0)" t (cid:12) h )(cid:0)"2 t " t(cid:0)u (y t+h(cid:0)u (cid:0)(cid:11) h (cid:0)" t(cid:0)u (cid:12) h ) (cid:27) s 2 (y s+h (cid:0)(cid:11) h (cid:0)" s (cid:12) h ) " (cid:0)" s (y s+h (cid:0)(cid:11) h (cid:0)" s (cid:12) h )" s(cid:0)u (y s+h(cid:0)u (cid:0)(cid:11) h (cid:0)" s(cid:0)u (cid:12) h )+(cid:13) h;u Clearly, E[(cid:0)" s (y s+h (cid:0)(cid:11) h (cid:0)" s (cid:12) h )" s(cid:0)u (y s+h(cid:0)u (cid:0)(cid:11) h (cid:0)" s(cid:0)u (cid:12) h )+(cid:13) h;u ]=0 for all s. Consider (cid:0)[" t " t(cid:0)u (y t+h (cid:0)(cid:11) h (cid:0)" t (cid:12) h )+" t " t(cid:0)u (y t+h(cid:0)u (cid:0)(cid:11) h (cid:0)" t(cid:0)u (cid:12) h )](y s+h (cid:0)(cid:11) h (cid:0)" s (cid:12) h ) If u>0 and " ?y , then this is zero in expectation for all t and s. If u=0, then the expectation is t s [ ] (cid:0)2(cid:27)2E (y (cid:0)(cid:22) )2 " t y Consider [ ] " (cid:0) " t "2 t(cid:0)u (y t+h (cid:0)(cid:11) h (cid:0)" t (cid:12) h )+"2 t " t(cid:0)u (y t+h(cid:0)u (cid:0)(cid:11) h (cid:0)" t(cid:0)u (cid:12) h ) (cid:27) s 2 (y s+h (cid:0)(cid:11) h (cid:0)" s (cid:12) h ) " If u>0 and " ?y , then this is zero in expectation if s̸=t and s̸=t(cid:0)u. If s=t or s=t(cid:0)u, then this t s equals (cid:0)(cid:27)2E(y (cid:0)(cid:22) ): " t y If u=0, then the expectation is { } [ ] " (cid:0)E 2"3(y (cid:0)(cid:11) (cid:0)" (cid:12) ) s (y (cid:0)(cid:11) (cid:0)" (cid:12) ) : t t+h h t h (cid:27)2 s+h h s h " If s̸=t and " ?y , then the expectation is zero. If s=t, the the expectation is t s ( ) E "4 (cid:0)2 t E(y (cid:0)(cid:22) ): (cid:27)2 t y " Note that 2 3 0 0 0 0 0 0 0 0 0 6 7 ▽2q t = 6 4 0 0 0 0 0 0 0 0 0 7 5; 2" t " t(cid:0)u " t "2 t(cid:0)u +"2 t " t(cid:0)u 0 "2 t " t(cid:0)u +"2 t(cid:0)u " t 2"2 t "2 t(cid:0)u 0 0 0 0 2 3 0 0 0 0 0 0 0 0 0 ( ) 6 7 H 2 =E ▽2q t = 6 4 0 0 0 0 0 0 0 0 0 7 5; ( ) 0 0 0 0 2 (cid:27)2 2 0 0 0 0 " 6

2 3 0 0 0 0 0 0 0 0 0 6 7 (cid:0) 1 2 QH 2 = 6 4 0 0 0 0 ( 0 ) 0 0 0 0 7 5: 0 0 0 0 (cid:27)2 2 0 0 0 0 " To compute 1 (cid:0) 2 QH 2 E[Q T(cid:0)h(cid:0)u (cid:10)Q T(cid:0)h(cid:0)u ] we need to evaluate [ ] T∑(cid:0)h T∑(cid:0)h 1 " 1 " E t (y (cid:0)(cid:11) (cid:0)" (cid:12) ) s (y (cid:0)(cid:11) (cid:0)" (cid:12) ) : T (cid:0)h(cid:0)u (cid:27)2 t+h h t h T (cid:0)h(cid:0)u (cid:27)2 s+h h s h t=1+u " s=1+u " If t̸=s and " ?y , then t s [ ] " " E t (y (cid:0)(cid:11) (cid:0)" (cid:12) ) s (y (cid:0)(cid:11) (cid:0)" (cid:12) ) =0: (cid:27)2 t+h h t h (cid:27)2 s+h h s h " " If t=s and " ?y , then t s [ ] [ ] "2 1 E t (y (cid:0)(cid:11) )2 =E (y (cid:0)(cid:11) )2 : ((cid:27)2)2 t+h h (cid:27)2 t+h h " " Putting all of this together, if u>0, the bias is [ ] 1 T (cid:0)h(cid:0)2u (cid:0) (cid:27)2E (y (cid:0)(cid:22) )2 : T (cid:0)h(cid:0)u T (cid:0)h(cid:0)u " t y If u=0, the bias is ( ) [ ] 1 E "4 1 (cid:0) 2 t E(y (cid:0)(cid:22) )(cid:0) (cid:27)2E (y (cid:0)(cid:22) )2 T (cid:0)h(cid:0)u (cid:27)2 t y T (cid:0)h(cid:0)u " t y " In the case of normal variation, this is 1 (cid:0) 7(cid:27)2E(y (cid:0)(cid:22) ) T (cid:0)h(cid:0)u " t y Notice that under the null hypothesis, [ ] (cid:13) =(cid:27)2E (y (cid:0)(cid:22) )2 : h;0 " t y Under the null hypothesis that " ?y and under normal variation, t s ( ) ( ) 7 E((cid:13)b )=(cid:13) 1(cid:0) +O T (cid:0)3=2 h;0 h;0 T (cid:0)h ( ) 1 E((cid:13)b )=(cid:13) (cid:0) (cid:13) +O T (cid:0)3=2 : h;u h;u T (cid:0)h h;0 Clearly, when T is small, these distortions can be substantial, which explains why increasing the bandwidth of a Newey-West estimator makes the standard errors even smaller in expectation. C Additional figures for the AR(1) example In this appendix, we show figures analogous to those in the text for different values of (cid:26) in the AR(1) example. In the main text, we set (cid:26) = 0:95. Here, we consider (cid:26) = 0:9 and (cid:26) = 0:99. 7

C.1 (cid:26) = 0:9 (a) Without Controls (b) With Controls 1.0 1.0 0.8 0.8 0.6 0.6 0.4 0.4 True True E[ LS], T=200 E[ LS], T=200 0.2 E[ LS], T=100 0.2 E[ LS], T=100 E[ LS], T=50 E[ LS], T=50 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h Figure C.1: LP estimators are biased in empirically-relevant samples when y is an AR(1) t with (cid:26) = 0:90. (a) Without Controls (b) With Controls 1.0 1.0 0.8 0.8 0.6 0.6 True True 0.4 E[ LS], T=100 0.4 E[ LS], T=100 MC (dashed) MC (dashed) 0.2 Approx. (solid) 0.2 Approx. (solid) E[ LS], T=50 E[ LS], T=50 MC (dashed) MC (dashed) 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10Approx. (solid) 0 1 2 3 4 5 6 7 8 9 10Approx. (solid) h h Figure C.2: The bias approximation is accurate in our LPs. 8

(a) T = 50 (b) T = 100 1.0 1.0 0.8 0.8 0.6 0.6 0.4 0.4 True True E[ BCC] E[ BCC] 0.2 E[ BC] 0.2 E[ BC] E[ LS] E[ LS] 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h b b b Figure C.3: (cid:18) and (cid:18) are closer than (cid:18) to (cid:18), on average, in our LPs without controls BC BCC LS when y is an AR(1) with (cid:26) = 0:90. t (a) T = 50 (b) T = 100 1.0 1.0 0.8 0.8 0.6 0.6 0.4 0.4 True True E[ BCC] E[ BCC] 0.2 E[ BC] 0.2 E[ BC] E[ LS] E[ LS] 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h b b b Figure C.4: (cid:18) and (cid:18) are closer than (cid:18) to (cid:18), on average, in our LPs with controls BC BCC LS when y is an AR(1) with (cid:26) = 0:90. t 9

C.2 (cid:26) = 0:99 (a) Without Controls (b) With Controls 1.0 1.0 True True 0.8 0.8 E[ LS], T=200 0.6 E[ LS], T=200 0.6 E[ LS], T=100 E[ LS], T=100 0.4 0.4 E[ LS], T=50 E[ LS], T=50 0.2 0.2 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h Figure C.5: LP estimators are biased in empirically-relevant samples when y is an AR(1) t with (cid:26) = 0:99. (a) Without Controls (b) With Controls 1.0 1.0 True True 0.8 0.8 E[ LS], T=100 MC (dashed) 0.6 E[ LS], T=100 0.6 Approx. (solid) MC (dashed) E[ LS], T=50 Approx. (solid) 0.4 0.4 MC (dashed) E[ LS], T=50 Approx. (solid) MC (dashed) 0.2 0.2 Approx. (solid) 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h Figure C.6: The bias approximation is accurate in our LPs. 10

(a) T = 50 (b) T = 100 1.0 1.0 True True 0.8 0.8 E[ BCC] 0.6 0.6 E[ BCC] E[ BC] 0.4 E[ BC] 0.4 E[ LS] E[ LS] 0.2 0.2 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h b b b Figure C.7: (cid:18) and (cid:18) are closer than (cid:18) to (cid:18), on average, in our LPs without controls BC BCC LS when y is an AR(1) with (cid:26) = 0:99. t (a) T = 50 (b) T = 100 1.0 1.0 True True 0.8 0.8 E[ BCC] E[ BCC] E[ BC] 0.6 E[ BC] 0.6 E[ LS] 0.4 0.4 E[ LS] 0.2 0.2 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h b b b Figure C.8: (cid:18) and (cid:18) are closer than (cid:18) to (cid:18), on average, in our LPs with controls BC BCC LS when y is an AR(1) with (cid:26) = 0:99. t 11

D An AR(2) example In this appendix, we show figures analogous to those in the text for an AR(2) example. We specify the example so that y = ((cid:26)+ )y (cid:0) (cid:26)y +" +(cid:23) : t t(cid:0)1 t(cid:0)2 t t This process delivers hump-shaped impulse response functions. We set (cid:26) = 0:95 and = 0:4. When we include controls, we set c = [y ;y ]′. t t(cid:0)1 t(cid:0)2 (a) Without Controls (b) With Controls 1.4 1.4 1.2 1.2 True True 1.0 1.0 E[ LS], T=200 E[ LS], T=200 0.8 0.8 E[ LS], T=100 E[ LS], T=100 0.6 0.6 E[ LS], T=50 0.4 0.4 E[ LS], T=50 0.2 0.2 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h Figure D.1: LP estimators are biased in empirically-relevant samples when y is an AR(2) t with (cid:26) = 0:95 and = 0:4. (a) Without Controls (b) With Controls 1.4 1.4 1.2 1.2 1.0 True 1.0 True E[ LS], T=100 E[ LS], T=100 0.8 MC (dashed) 0.8 MC (dashed) Approx. (solid) Approx. (solid) 0.6 E[ LS], T=50 0.6 E[ LS], T=50 MC (dashed) MC (dashed) 0.4 0.4 Approx. (solid) Approx. (solid) 0.2 0.2 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h Figure D.2: The bias approximation is accurate in our LPs. 12

(a) T = 50 (b) T = 100 1.4 1.4 1.2 1.2 True True 1.0 1.0 E[ BCC] 0.8 E[ BCC] 0.8 E[ BC] E[ BC] E[ LS] 0.6 0.6 E[ LS] 0.4 0.4 0.2 0.2 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h b b b Figure D.3: (cid:18) and (cid:18) are closer than (cid:18) to (cid:18), on average, in our LPs without controls BC BCC LS when y is an AR(1) with (cid:26) = 0:95 and = 0:4. t (a) T = 50 (b) T = 100 1.4 1.4 1.2 1.2 True True 1.0 1.0 E[ BCC] 0.8 E[ BCC] 0.8 E[ BC] E[ BC] E[ LS] 0.6 0.6 0.4 E[ LS] 0.4 0.2 0.2 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h b b b Figure D.4: (cid:18) and (cid:18) are closer than (cid:18) to (cid:18), on average, in our LPs with controls BC BCC LS when y is an AR(2) with (cid:26) = 0:90 and = 0:4. t 13

E Newey-West standard errors and fixed-b asymptotics Sun (2014) and Lazarus et al. (2018) suggest that researchers should use fixed-b asymptotics when conducting inference using HAR estimators. For the Newey-West estimator, they p suggest using a bandwidth of 1:3 T (cid:0)h. The asymptotic limiting distribution of the test statistic is not standard. Table 1 shows results that are analogous to the results shown in Table 1, but using the bandwidth for the Newey-West estimator suggested by Lazarus et al. (2018) and the nonstandard critical values to construct confidence sets using the Newey-West standard error. When one uses the fixed-b critical values, the performance of the Newey-West estimator improves somewhat, though the coverage probabilities are not uniformly better than those from the Huber-White standard errors. Fixed-b asymptotics involve using a larger bandwidth for the Newey-West estimator and larger critical values than those implied by the asymptotic normal approximation. It turns out that the bandwidth is not that much larger in sample sizes typically found in the literature. As a result, the bias is not that much larger when the larger bandwidth is used. It is then not surprising that using a larger critical value improves the coverage probabilities given that the confidence intervals are too small. 14

b Table 1: Coverage probability of different estimators of standard errors for (cid:18) in LP without h controls when y is an AR(1) with (cid:26) = 0:95 and T = 50 using fixed-b critical values t b b b (cid:18) , no controls (cid:18) , no controls (cid:18) , controls h;LS h;BCC h;BCC h Huber-White Newey-West Huber-White Newey-West Huber-White Newey-West 0 0.87 0.82 0.86 0.85 0.92 0.93 1 0.83 0.81 0.82 0.83 0.90 0.90 2 0.80 0.80 0.79 0.82 0.87 0.89 3 0.78 0.77 0.76 0.79 0.85 0.86 4 0.76 0.76 0.75 0.78 0.83 0.85 5 0.75 0.75 0.74 0.78 0.81 0.83 6 0.75 0.75 0.73 0.77 0.80 0.82 7 0.74 0.73 0.73 0.76 0.78 0.82 8 0.74 0.74 0.73 0.77 0.77 0.82 9 0.74 0.75 0.73 0.77 0.76 0.81 10 0.74 0.75 0.73 0.78 0.75 0.81 15

F Bias and the block bootstrap An alternative approach to achieve bias correction in LPs is through bootstrapping.1 Bootstrap methods construct approximation to the distribution of an estimator (for example), by resampling observables or the errors from a parametric model. In a time series context, where the dynamic relationship between observables or errors is important to preserve, block bootstrappingtechniques—inwhichtheresamplingschemeseekstopreservesomeofthecorrelation in the original data set—are typically used, as in Kilian and Kim (2011). We revisit the Monte Carlo simulations in Section 3.1 and attempt to correct for the finite-sample bias in LPs using the block bootstrap.2 Figure F.1 displays the bias correction from the block bootstrap when y is our AR(1) example. t Similar to results reported by Kilian and Kim (2011), the block bootstrap offers little in the way of LP bias correction. As in Kilian and Kim (2011), we set the block length to 4.3 Given equation (4), it is not surprising that with a short block length the block bootstrap does little to bias correct. The block bootstrap works by maintaining the autocorrelation structure of the data in LPs within a given block, but by destroying the autocorrelation across blocks. By destroying the autocorrelation across blocks, the block bootstrap destroys some of the autocorrelation information needed to adjust the estimates. As a result, the block bootstrap underestimates the bias in LPs, rendering bias correction based on the block bootstrap relatively ineffective. Longer block lengths make the block bootstrap more effective at bias correcting, but they reduce the number of non-overlapping blocks in a dataset. Kilian and Kim (2011) report that longer block lengths lead to worse coverage probabilities. 1See Hall (1992) for a textbook treatment. 2We use the same block length as Kilian and Kim (2011). 3Results are similar when we set the block length to (T (cid:0)h)1=3, as suggested by Hall et al. (1995). 16

(a) Without Controls T=50 T=100 T=200 1.0 1.0 1.0 0.8 0.8 0.8 0.6 0.6 E[ Bootstrap] 0.6 E[ Bootstrap] E[ Bootstrap] E[ LS] 0.4 0.4 E[ LS] 0.4 E[ LS] 0.2 0.2 0.2 0.00 2 4 6 8 10 0.00 2 4 6 8 10 0.00 2 4 6 8 10 h h h (b) With Controls T=50 T=100 T=200 1.0 1.0 1.0 0.8 0.8 0.8 E[ Bootstrap] E[ Bootstrap] 0.6 0.6 0.6 E[ Bootstrap] E[ LS] E[ LS] 0.4 0.4 0.4 E[ LS] 0.2 0.2 0.2 0.00 2 4 6 8 10 0.00 2 4 6 8 10 0.00 2 4 6 8 10 h h h b Figure F.1: (cid:12) provides little bias correction in our LPs y is an AR(1) with (cid:26) = 0:95. Bootstrap t 17

G Lag augmentation in the AR(1) example In this section, we consider the effects of lag augmentation in our AR(1) example. Olea and Plagborg-Møller (2020) argue that lag augmentation improves the performance of LPs. There are some important differences between the LP setting we consider and the setting in Olea and Plagborg-Møller (2020). First, we consider an LP in which a constant must be estimated, while Olea and Plagborg-Møller (2020) do not. Second, we consider LP estimates of impulse responses to identified shocks that that the researcher brings to the LP, rather than identifying the shock as a part of the LP system. Here, we consider the effects of lag augmentation in our setup. The main example in Olea and Plagborg-Møller (2020) is an AR(1) without a constant, so the example from our paper has few other differences than the two important differences identified above. (a) LP estimators are biased (b) The bias approximation is accurate 1.0 1.0 0.8 0.8 0.6 True 0.6 True E[ LS], T=200 E[ LS], T=100 0.4 E[ LS], T=100 0.4 MC (dashed) Approx. (solid) 0.2 E[ LS], T=50 0.2 E[ LS], T=50 MC (dashed) Approx. (solid) 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h Figure G.1: LP with lag augmentation when y is an AR(1) with (cid:26) = 0:95. t b FigureG.1showstheMonteCarlomeanof(cid:18) inourAR(1)examplewhen(cid:26) = 0:95. The LS bias in the impulse response estimator is essentially the same as without lag augmentation. b b b Figure G.2 shows the Monte Carlo mean of (cid:18) , (cid:18) , and (cid:18) in our AR(1) example LS BC BCC when(cid:26) = 0:95. Themeanbiascorrectionisessentiallythesameaswithoutlagaugmentation. Table2showsthecoverageprobabilitiesforconfidenceintervalsconstructedusingHuber- White and Newey-West standard errors in our AR(1) example with lag agumentation when (cid:26) = 0:95. The coverage probabilities are similar to their values without lag augmentation. 18

(a) T = 50 (b) T = 100 1.0 1.0 0.8 0.8 0.6 True 0.6 True E[ BCC] E[ BC],E[ BCC] 0.4 E[ BC] 0.4 E[ LS] 0.2 E[ LS] 0.2 0.0 0.0 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 h h Figure G.2: Bias correction in an LP with lag augmentation when y is an AR(1) with t (cid:26) = 0:95. b Table 2: Coverage probability of different estimators of standard errors for (cid:18) in LP with lag h augmentation when y is an AR(1) with (cid:26) = 0:95 and T = 50 t b b (cid:18) (cid:18) h;LS h;BCC h Huber-White Newey-West Huber-White Newey-West 0 0.92 0.91 0.92 0.91 1 0.89 0.87 0.90 0.88 2 0.85 0.83 0.87 0.85 3 0.83 0.80 0.84 0.82 4 0.80 0.77 0.82 0.80 5 0.77 0.74 0.80 0.78 6 0.76 0.73 0.78 0.76 7 0.74 0.71 0.77 0.75 8 0.73 0.69 0.76 0.74 9 0.71 0.68 0.75 0.73 10 0.70 0.67 0.74 0.72 19

H Derivation of bias in an LP without controls using differences in y t Some authors use y t+h (cid:0)y t(cid:0)1 or y t+h (cid:0)y t as the left hand side variable for their LP. If controls are included and the researcher uses y t+h (cid:0)y t(cid:0)1 as the left-hand side variable, it has no effect on our earlier derivations. If (cid:18) = 0 and controls are included, it will also have no effect on our earlier derivation if the researcher 0 uses y (cid:0)y as the left-hand-side variable. If controls are not included in the regression, then using the t+h t difference of y reduces the bias. Intuitively, using the difference of y removes much of the persistence in t t thedependentvariable, andfornear-unit-rootprocessesis almostwell-specifiedsothat theregressionerrors are nearly MA(h+1) processes. Lunsford (2020) uses an LP without controls and y (cid:0)y , so we focus on t+h t this setup. Given that Lunsford (2020) works with high-frequency monetary policy and forward guidance shocks, along with monthly macroeconomic data, it is reasonable to assume that (cid:18) = 0. The algebra for 0 the case when y t+h (cid:0)y t(cid:0)1 is the left-hand-side variable is similar. To show that using y (cid:0) y as the left-hand-side variable reduces bias, consider that the moment t+h t conditions associated with the OLS esitmator are E[q ]=0 t where 2 3 y (cid:0)y (cid:0)(cid:11) (cid:0)(cid:12) " q (cid:17)4 t+h t h h t 5 : t " (y (cid:0)y (cid:0)(cid:11) (cid:0)(cid:12) " ) t t+h t h h t We have that 2 3 (cid:0)1 (cid:0)" ▽1q = 4 t 5 t (cid:0)" (cid:0)"2 t t 2 3 1 T∑(cid:0)h 4 (cid:0)1 (cid:0)" t 5 H = 1 T (cid:0)h (cid:0)" (cid:0)"2 t=1 t t 2 3 (cid:0)1 (cid:0)(cid:22) H = 4 ( " ) 5 1 (cid:0)(cid:22) (cid:0) (cid:27)2+(cid:22)2 " " " 2 3 0 0 ▽2q =H =H = 4 5 t 2 2 0 0 2 3 V = 1 T∑(cid:0)h 4 0 ( (cid:0)(" t( (cid:0)(cid:22) " ) )) 5 T (cid:0)h (cid:0)(" (cid:0)(cid:22) ) (cid:0) "2(cid:0) (cid:27)2+(cid:22)2 t=1 t " t " " 2 ( ) 3 (cid:0) 1+ (cid:22)2 " (cid:22)" Q= 4 (cid:27) " 2 (cid:27) " 2 5 (cid:22)" (cid:0) 1 (cid:27)2 (cid:27)2 " " Because H is a matrix of zeros, 2 B =EfQVQ T(cid:0)h g: 20

Note that { ( ) } B =E Q H 1 (cid:0)H 1 Q T(cid:0)h =EfQH 1 Q T(cid:0)h g: Also, 2 ( ) 32 3 QH 1 = T (cid:0) 1 h T∑ t= (cid:0) 1 h 4 2 (cid:0) ( 1 (cid:27) (cid:22) + " 2 " (cid:27) (cid:22) " 2 " 2 (cid:0) (cid:27) (cid:22) (cid:27) " 2 " 1 " 2 ) 5 ( 4 (cid:0) (cid:0) " 1 t (cid:0) (cid:0) " " 2 t t 5 ) 3 = 1 T∑(cid:0)h 4 1(cid:0) (cid:27) (cid:22) " 2 " (" t (cid:0)(cid:22) " ) 1(cid:0) (cid:27) (cid:22) " 2 " (" t (cid:0)(cid:22) " ) " t 5 T (cid:0)h 1 (" (cid:0)(cid:22) ) 1 (" (cid:0)(cid:22) )" t=1 (cid:27)2 t " (cid:27)2 t " t " " and 2 ( ) ( ) 32 ( ) 3 QH 1 Q= T (cid:0) 1 h T∑(cid:0)h 4 1(cid:0) 1 (cid:27) (cid:22) ( " 2 " " (" (cid:0) t (cid:0) (cid:22) (cid:22) ) " ) 1(cid:0) 1 (cid:27) (cid:22) ( " 2 " " (" (cid:0) t (cid:0) (cid:22) (cid:22) ) " " ) " t 54 (cid:0) 1 (cid:22) + " (cid:27) (cid:22) " 2 " 2 (cid:0) (cid:27) (cid:22) " 2 " 1 5 t=1 2 ((cid:27) " 2 t " ( (cid:27) " 2 ) t " t ) ( (cid:27) " 2 (cid:27) " 2 ) 3 2 = 1 T∑(cid:0)h6 4 (cid:0) 1(cid:0)2 (cid:27) (cid:22) " 2 " (" t (cid:0)(cid:22) " )+ (cid:27) (cid:22) " 2 " (" t (cid:0)(cid:22) " )2 (cid:0) (cid:27) 1 " 2 (" t (cid:0)(cid:22) " )(cid:0) ((cid:27) (cid:22) " 2 " )2 (" t (cid:0)(cid:22) " )2 7 5 T (cid:0)h t=1 (cid:0) (cid:27) 1 " 2 (" t (cid:0)(cid:22) " )+ ((cid:27) (cid:22) " 2 " )2 (" t (cid:0)(cid:22) " )2 (cid:0) ((cid:27) 1 " 2)2 (" t (cid:0)(cid:22) " )2 Then 2 ( ( ) ) ( ) 3 2 QH 1 Q T(cid:0)h = T (cid:0) 1 h T∑ t= (cid:0) 1 h6 4 (cid:0) 2 1 (cid:0) (cid:0) (cid:27) 1 2 " 2 (cid:27) (cid:22) ( " 2 " " t (" (cid:0) t (cid:0) (cid:22) " (cid:22) ) " + )+ ((cid:27) (cid:22) " 2 " ) (cid:27) (cid:22) 2 " 2 " (" t (cid:0) ( 3 " t (cid:22) (cid:0) " )2 (cid:22) " )2 (cid:0) (cid:27) 1 " 2 (" t (cid:0) (cid:0) ((cid:27) (cid:22) 1 " 2 " )2 )(cid:0) (" t ((cid:27) (cid:0) (cid:22) " 2 " ) (cid:22) 2 " ( ) " 2 t (cid:0)(cid:22) " )2 7 5 (cid:2) 1 T∑(cid:0)h 4 y s+h (cid:0)y s (cid:0)(cid:11) h (cid:0)(cid:12) h " s 5 T (cid:0)h " (y (cid:0)y (cid:0)(cid:11) (cid:0)(cid:12) " ) s=1 s s+h s h h s We only need the expectation of the second row of this expression. To calculate it, fix t and consider { ( ) } 1 (cid:22) 1 E (cid:0) (" (cid:0)(cid:22) )(cid:0) " (" (cid:0)(cid:22) )2 (y (cid:0)y (cid:0)(cid:11) (cid:0)(cid:12) " )(cid:0) (" (cid:0)(cid:22) )2" (y (cid:0)y (cid:0)(cid:11) (cid:0)(cid:12) " ) = (cid:27)2 t " ((cid:27)2)2 t " s+h s h h s ((cid:27)2)2 t " s s+h s h h s " { " " } 1 1 E (cid:0) (" (cid:0)(cid:22) )(y (cid:0)y (cid:0)(cid:11) (cid:0)(cid:12) " )(cid:0) (" (cid:0)(cid:22) )2(" (cid:0)(cid:22) )(y (cid:0)y (cid:0)(cid:11) (cid:0)(cid:12) " ) = (cid:27)2 t " s+h s h h s ((cid:27)2)2 t " s " s+h s h h s " " { } 1 E (cid:0) (" (cid:0)(cid:22) )(y (cid:0)y (cid:0)(cid:11) (cid:0)(cid:12) " ) = (cid:27)2 t " s+h s h h s { " } 1 E (cid:0) (" (cid:0)(cid:22) )(y (cid:0)y +y (cid:0)y (cid:0)(cid:11) (cid:0)(cid:12) " ) (cid:27)2 t " s+h t t s h h s " If s=t, or s<t(cid:0)h, then this expression is zero. If t(cid:0)h(cid:20)s<t, then this expression is equal to (cid:0)(cid:12) s+h(cid:0)t . If t<s, this expression is equal to (cid:0)(cid:12) s+h(cid:0)t +(cid:12) s(cid:0)t The existence of the second term mitigates the bias relative to the case where the difference is not taken. 21

I Sources for meta study Abiad, M. A., D. Furceri, and P. Topalova (2015): The macroeconomic effects of public investment: evidence from advanced economies, 15-95, International Monetary Fund. Acemoglu, D., S. Naidu, P. Restrepo, and J. A. Robinson (2019): “Democracy Does Cause Growth,” Journal of Political Economy, 127, 47–100. ADB, A. A., D. Furceri, and P. T. IMF (2016): “The macroeconomic effects of public investment: Evidence from advanced economies,” Journal of Macroeconomics, 50, 224–240. Adler, G., M. R. A. Duval, D. Furceri, K. Sinem, K. Koloskova, M. Poplawski- Ribeiro, et al. (2017): Gone with the headwinds: Global productivity, International Monetary Fund. Alesina, A., O. Barbiero, C. Favero, F. Giavazzi, and M. Paradisi (2015a): “Austerity in 2009–13,” Economic Policy, 30, 383–437. ——— (2017): “The effects of fiscal consolidations: Theory and evidence,” Tech. rep., National Bureau of Economic Research. Alesina, A., C. Favero, and F. Giavazzi (2015b): “The output effect of fiscal consolidation plans,” Journal of International Economics, 96, S19–S42. Amior, M. and A. Manning (2018): “The persistence of local joblessness,” American Economic Review, 108, 1942–70. Andreasen, M. M., J. Fernández-Villaverde, and J. F. Rubio-Ramírez (2017): “The pruned state-space system for non-linear DSGE models: Theory and empirical applications,” The Review of Economic Studies, 85, 1–49. Angrist, J. D., Ò. Jordà, and G. M. Kuersteiner(2018): “Semiparametricestimates of monetary policy effects: string theory revisited,” Journal of Business & Economic Statistics, 36, 371–387. 22

Arezki, R., V. A. Ramey, and L. Sheng (2017): “News shocks in open economies: Evidence from giant oil discoveries,” The quarterly journal of economics, 132, 103–155. Arregui, N., J. Beneš, I. Krznar, and S. Mitra (2013): “Evaluating the net benefits of macroprudential policy: A cookbook,” . Auerbach, A. J. and Y. Gorodnichenko (2012): “Fiscal multipliers in recession and expansion,” in Fiscal policy after the financial crisis, University of Chicago Press, 63–98. ——— (2017): “Fiscal multipliers in Japan,” Research in Economics, 71, 411–421. Ball, L. M., D. Furceri, M. D. Leigh, and M. P. Loungani (2013): The distributional effects of fiscal consolidation, 13-151, International Monetary Fund. Banerjee, R., M. B. Devereux, and G. Lombardo (2016): “Self-oriented monetary policy, globalfinancialmarketsandexcessvolatilityofinternationalcapitalflows,” Journal of International Money and Finance, 68, 275–297. Banerjee, R. N. and H. Mio (2018): “The impact of liquidity regulation on banks,” Journal of Financial intermediation, 35, 30–44. Barnichon, R. and C. Brownlees (2019): “Impulse response estimation by smooth local projections,” Review of Economics and Statistics, 101, 522–530. Basher, S. A., A. A. Haug, and P. Sadorsky (2012): “Oil prices, exchange rates and emerging stock markets,” Energy Economics, 34, 227–240. Baumeister, C. and L. Kilian (2016): “Lower oil prices and the US economy: Is this time different?” Brookings Papers on Economic Activity, 2016, 287–357. Bayer, C., R. Lütticke, L. Pham-Dao, and V. Tjaden (2019): “Precautionary savings, illiquid assets, and the aggregate consequences of shocks to household income risk,” Econometrica, 87, 255–290. Ben Zeev, N. and E. Pappa (2017): “Chronicle of a war foretold: The macroeconomic effects of anticipated defence spending shocks,” The Economic Journal, 127, 1568–1597. Berton, F., S. Mocetti, A. F. Presbitero, and M. Richiardi (2018): “Banks, firms, and jobs,” The Review of Financial Studies, 31, 2113–2156. 23

Bhattacharya, U., N. Galpin, R. Ray, and X. Yu (2009): “The role of the media in the internet IPO bubble,” Journal of Financial and Quantitative Analysis, 44, 657–682. Bluedorn, J. C. and C. Bowdler (2011): “The open economy consequences of US monetary policy,” Journal of International Money and Finance, 30, 309–336. Borio, C., M. Drehmann, and K. Tsatsaronis (2014): “Stress-testing macro stress testing: does it live up to expectations?” Journal of Financial Stability, 12, 3–15. Borio, C. E., E. Kharroubi, C. Upper, and F. Zampolli (2016): “Labour reallocation and productivity dynamics: financial causes, real consequences,” . Born, B., G. J. Müller, and J. Pfeifer (2014): “Does austerity pay off?” Review of Economics and Statistics, 1–45. Boss, M., G. Fenz, J. Pann, C. Puhr, M. Schneider, E. Ubl, et al. (2009): “modelingcreditriskthroughtheAustrianbusinesscycle: AnupdateoftheOeNbmodel,” Financial Stability Report, 17, 85–101. Brady, R. R. (2011): “Measuring the diffusion of housing prices across space and over time,” Journal of Applied Econometrics, 26, 213–231. Butt, N., R. Churm, M. F. McMahon, A. Morotz, and J. F. Schanz (2014): “QE and the bank lending channel in the United Kingdom,” . Caggiano, G., E. Castelnuovo, V. Colombo, and G. Nodari (2015): “Estimating fiscal multipliers: News from a non-linear world,” The Economic Journal, 125, 746–776. Caldara, D. and E. Herbst (2019): “Monetary policy, real activity, and credit spreads: Evidence from bayesian proxy svars,” American Economic Journal: Macroeconomics, 11, 157–92. Caselli, F. G. and A. Roitman(2016): “Nonlinearexchange-ratepass-throughinemerging markets,” International Finance. Chiţu, L., B. Eichengreen, and A. Mehl (2014): “When did the dollar overtake sterling as the leading international currency? Evidence from the bond markets,” Journal of Development Economics, 111, 225–245. 24

Chodorow-Reich, G. and L. Karabarbounis (2016): “The limited macroeconomic effects of unemployment benefit extensions,” Tech. rep., National Bureau of Economic Research. Chong, Y., Ò. Jordà, and A. M. Taylor (2012): “The Harrod–Balassa–Samuelson Hypothesis: Real Exchange Rates And Their Long-Run Equilibrium,” International Economic Review, 53, 609–634. Cloyne, J. and P. Hürtgen (2016): “The macroeconomic effects of monetary policy: a new measure for the United Kingdom,” American Economic Journal: Macroeconomics, 8, 75–102. Coibion, O. and Y. Gorodnichenko (2012): “What can survey forecasts tell us about information rigidities?” Journal of Political Economy, 120, 116–159. Coibion, O., Y. Gorodnichenko, L. Kueng, and J. Silvia (2017): “Innocent Bystanders? Monetary Policy and Inequality,” Journal of Monetary Economics, 88, 70–89. Dabla-Norris, M. E., M. S. Guo, M. V. Haksar, M. Kim, M. K. Kochhar, K. Wiseman, and A. Zdzienicka (2015): The new normal: A sector-level perspective on productivity trends in advanced economies, International Monetary Fund. De Cos, P. H. and E. Moral-Benito (2016): “Fiscal multipliers in turbulent times: the case of Spain,” Empirical Economics, 50, 1589–1625. Dessaint, O. and A. Matray (2017): “Do managers overreact to salient risks? Evidence from hurricane strikes,” Journal of Financial Economics, 126, 97–121. Dupor, B., J. Han, and Y.-C. Tsai (2009): “What do technology shocks tell us about the New Keynesian paradigm?” Journal of Monetary Economics, 56, 560–569. Eichler, M. (2013): “Causal inference with multiple time series: principles and problems,” PhilosophicalTransactionsoftheRoyalSocietyA:Mathematical, PhysicalandEngineering Sciences, 371, 20110613. Favara, G. and J. Imbs (2015): “Credit supply and the price of housing,” American Economic Review, 105, 958–92. 25

Fazzari, S. M., J. Morley, and I. Panovska (2015): “State-dependent effects of fiscal policy,” Studies in Nonlinear Dynamics & Econometrics, 19, 285–315. Francis, N., M. T. Owyang, J. E. Roush, and R. DiCecio (2014): “A flexible finitehorizon alternative to long-run restrictions with an application to technology shocks,” Review of Economics and Statistics, 96, 638–647. Funke, M., M. Schularick, and C. Trebesch (2016): “Going to extremes: Politics after financial crises, 1870–2014,” European Economic Review, 88, 227–260. Furceri, D., M. L. E. Bernal-Verdugo, and M. D. M. Guillaume (2012a): Crises, labor market policy, and unemployment, 12-65, International Monetary Fund. Furceri, D., S. Guichard, and E. Rusticelli (2012b): “The effect of episodes of large capital inflows on domestic credit,” The North American Journal of Economics and Finance, 23, 325–344. Furceri, D., P. Loungani, and A. Zdzienicka (2018): “The effects of monetary policy shocks on inequality,” Journal of International Money and Finance, 85, 168–186. Furceri, D. and A. Zdzienicka (2011): “How costly are debt crises?” IMF working papers, 1–29. ——— (2012): “How costly are debt crises?” Journal of International Money and Finance, 31, 726–742. Gal, P. N. and A. Hijzen (2016): The short-term impact of product market reforms: A cross-country firm-level analysis, International Monetary Fund. Gertler, M. and S. Gilchrist (2018): “What happened: Financial factors in the great recession,” Journal of Economic Perspectives, 32, 3–30. Hall, A. R., A. Inoue, J. M. Nason, and B. Rossi(2012): “Informationcriteriaforimpulse response function matching estimation of DSGE models,” Journal of Econometrics, 170, 499–518. Hamilton, J. D. (2011): “Nonlinearities and the macroeconomic effects of oil prices,” Macroeconomic dynamics, 15, 364–378. 26

Hautsch, N. and R. Huang (2012): “The market impact of a limit order,” Journal of Economic Dynamics and Control, 36, 501–522. Hollo, D., M. Kremer, and M. Lo Duca (2012): “CISS-a composite indicator of systemic stress in the financial system,” . Jordà, Ò. (2009): “Simultaneous confidence regions for impulse responses,” The Review of Economics and Statistics, 91, 629–647. Jordà, Ò. and M. Marcellino (2010): “Path forecast evaluation,” Journal of Applied Econometrics, 25, 635–662. Jordà, Ò., M. Schularick, and A. M. Taylor (2013): “When credit bites back,” Journal of Money, Credit and Banking, 45, 3–28. ——— (2015a): “Betting the House,” Journal of International Economics, 96, S2–S18. ——— (2015b): “Leveraged bubbles,” Journal of Monetary Economics, 76, S1–S20. ——— (2016): “Sovereigns versus banks: credit, crises, and consequences,” Journal of the European Economic Association, 14, 45–79. Jordà, Ò. and A. M. Taylor (2016): “The time for austerity: estimating the average treatment effect of fiscal policy,” The Economic Journal, 126, 219–255. Kilian, L. and Y. J. Kim (2011): “How Reliable are Local Projection Estimators of Impulse Responses?” Review of Economics and Statistics, 93, 1460–1466. Kilian, L. and R. J. Vigfusson (2011): “Nonlinearities in the oil price–output relationship,” Macroeconomic Dynamics, 15, 337–363. ——— (2017): “The role of oil price shocks in causing US recessions,” Journal of Money, Credit and Banking, 49, 1747–1776. Kraay, A. (2014): “Government spending multipliers in developing countries: evidence from lending by official creditors,” American Economic Journal: Macroeconomics, 6, 170– 208. 27

Krishnamurthy, A. and T. Muir (2017): “How credit cycles across a financial crisis,” Tech. rep., National Bureau of Economic Research. Leduc, S. and D. Wilson (2013): “Roads to prosperity or bridges to nowhere? Theory and evidence on the impact of public infrastructure investment,” NBER Macroeconomics Annual, 27, 89–142. Leigh, M. D., W. Lian, M. Poplawski-Ribeiro, R. Szymanski, V. Tsyrennikov, and H. Yang (2017): Exchange rates and trade: A disconnect?, International Monetary Fund. Listorti, G. and R. Esposti (2012): “Horizontal price transmission in agricultural markets: fundamental concepts and open empirical issues,” Bio-based and Applied Economics Journal, 1, 81–108. Luetticke, R. (2018): “Transmission of monetary policy with heterogeneity in household portfolios,” . Menkhoff, L., L. Sarno, M. Schmeling, and A. Schrimpf (2016): “Currency value,” The Review of Financial Studies, 30, 416–441. Mertens, K. and J. L. Montiel Olea (2018): “Marginal tax rates and income: New time series evidence,” The Quarterly Journal of Economics, 133, 1803–1884. Mian, A., A. Sufi, and E. Verner (2017): “Household debt and business cycles worldwide,” The Quarterly Journal of Economics, 132, 1755–1817. Miyamoto, W., T. L. Nguyen, and D. Sergeyev (2018): “Government spending multipliers under the zero lower bound: Evidence from Japan,” American Economic Journal: Macroeconomics, 10, 247–77. Moretti, E. and D. J. Wilson (2017): “The effect of state taxes on the geographical location of top earners: evidence from star scientists,” American Economic Review, 107, 1858–1903. Nakamura, E. and J. Steinsson (2018): “Identification in Macroeconomics,” Journal of Economic Perspectives, 32, 59–86. 28

Ottonello, P. and T. Winberry (2018): “Financial heterogeneity and the investment channel of monetary policy,” Tech. rep., National Bureau of Economic Research. Owyang, M. T., V. A. Ramey, and S. Zubairy (2013): “Are government spending multipliers greater during periods of slack? Evidence from twentieth-century historical data,” American Economic Review, 103, 129–34. Ramey, V. A. (2016): “Macroeconomic shocks and their propagation,” in Handbook of macroeconomics, Elsevier, vol. 2, 71–162. Ramey, V. A. and S. Zubairy (2018): “Government spending multipliers in good times andinbad: evidencefromUShistoricaldata,” Journal of Political Economy, 126, 850–901. Riera-Crichton, D., C. A. Vegh, and G. Vuletin (2015): “Procyclical and countercyclical fiscal multipliers: Evidence from OECD countries,” Journal of International Money and Finance, 52, 15–31. Romer, C. D. and D. H. Romer (2015): “New evidence on the impact of financial crises in advanced countries,” Tech. rep., National Bureau of Economic Research. ——— (2017): “New evidence on the aftermath of financial crises in advanced countries,” American Economic Review, 107, 3072–3118. Salomons, A. et al. (2018): “Is automation labor-displacing? Productivity growth, employment, and the labor share,” Tech. rep., National Bureau of Economic Research. Santoro, E., I. Petrella, D. Pfajfar, and E. Gaffeo (2014): “Loss aversion and the asymmetric transmission of monetary policy,” Journal of Monetary Economics, 68, 19–36. Schaller, J. (2013): “For richer, if not for poorer? Marriage and divorce over the business cycle,” Journal of Population Economics, 26, 1007–1033. Stock, J. H. and M. W. Watson (2018): “Identification and estimation of dynamic causal effects in macroeconomics using external instruments,” The Economic Journal, 128, 917–948. 29

Stoyanov, A. and N. Zubanov (2012): “Productivity spillovers across firms through worker mobility,” American Economic Journal: Applied Economics, 4, 168–98. Taylor, A. M. (2015): “Credit, financial stability, and the macroeconomy,” Annu. Rev. Econ., 7, 309–339. Tenreyro, S. and G. Thwaites (2016): “Pushing on a String: US Monetary Policy is Less Powerful in Recessions,” American Economic Journal: Macroeconomics, 8, 43–74. Teulings, C. N. and N. Zubanov (2014): “Is economic recovery a myth? Robust estimation of impulse responses,” Journal of Applied Econometrics, 29, 497–514. Yang, J., H. Guo, and Z. Wang (2006): “International transmission of inflation among G-7 countries: A data-determined VAR analysis,” Journal of Banking & Finance, 30, 2681–2700. Zidar, O. (2019): “Tax cuts for whom? Heterogeneous effects of income tax changes on growth and employment,” Journal of Political Economy, 127, 1437–1472. 30

Cite this document
APA
Edward P. Herbst and Benjamin K. Johannsen (2021). Bias in Local Projections (FEDS 2020-010). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2020-010
BibTeX
@techreport{wtfs_feds_2020_010,
  author = {Edward P. Herbst and Benjamin K. Johannsen},
  title = {Bias in Local Projections},
  type = {Finance and Economics Discussion Series},
  number = {2020-010},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2021},
  url = {https://whenthefedspeaks.com/doc/feds_2020-010},
  abstract = {Local projections (LPs) are a popular tool in macroeconomic research. We show that LPs are often used with very small samples in the time dimension. Consequently, LP point estimates can be severely biased. We derive simple expressions for this bias and propose a way to bias-correct LPs. Small sample bias can also lead autocorrelation-robust standard errors to dramatically understate sampling uncertainty. We argue they should be avoided in LPs like the ones we study. Using identified monetary policy shocks, we demonstrate that the bias in point estimates can be economically meaningful and the bias in standard errors can affect inference. Accessible materials (.zip) Original paper: PDF | Accessible materials (.zip)},
}