feds · July 18, 2018

Attenuating the Forward Guidance Puzzle: Implications for Optimal Monetary Policy

Abstract

We examine the implications of less powerful forward guidance for optimal policy using a sticky-price model with an effective lower bound (ELB) on nominal interest rates as well as a discounted Euler equation and Phillips curve. When the private-sector agents discount future economic conditions more in making their decisions today, an announced cut in future interest rates becomes less effective in stimulating current economic activity. While the implication of such discounting for optimal policy depends on its degree, we find that, under a wide range of plausible degrees of discounting, it is optimal for the central bank to compensate for the reduced effect of a future rate cut by keeping the policy rate at the ELB for longer. Accessible materials (.zip)

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. Attenuating the Forward Guidance Puzzle: Implications for Optimal Monetary Policy Taisuke Nakata, Ryota Ogaki, Sebastian Schmidt, and Paul Yoo 2018-049 Please cite this paper as: Nakata, Taisuke, Ryota Ogaki, Sebastian Schmidt, and Paul Yoo (2018). “Attenuating the Forward Guidance Puzzle: Implications for Optimal Monetary Policy,” Finance and Economics Discussion Series 2018-049. Washington: Board of Governors of the Federal Reserve System, https://doi.org/10.17016/FEDS.2018.049. NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

Attenuating the Forward Guidance Puzzle: Implications for Optimal Monetary Policy∗ Taisuke Nakata† Ryota Ogaki‡ Sebastian Schmidt§ Federal Reserve Board University of Tokyo European Central Bank Paul Yoo¶ UNC Kenan-Flagler First Draft: May 2017 This Draft: June 2018 Abstract We examine the implications of less powerful forward guidance for optimal policy using a sticky-price model with an effective lower bound (ELB) on nominal interest rates as well as a discountedEulerequationandPhillipscurve. Whentheprivate-sectoragentsdiscountfuture economicconditionsmoreinmakingtheirdecisionstoday,anannouncedcutinfutureinterest rates becomes less effective in stimulating current economic activity. While the implication ofsuchdiscountingforoptimalpolicydependsonitsdegree, wefindthat, underawiderange of plausible degrees of discounting, it is optimal for the central bank to compensate for the reduced effect of a future rate cut by keeping the policy rate at the ELB for longer. JEL: E52, E58, E61 Keywords: ForwardGuidance,OptimalPolicy,DiscountedEulerEquation,DiscountedPhillips Curve, Effective Lower Bound. ∗We thank Roc Armenter, Flint Brayton, Hess Chung, Gaetano Gaballo, Klodiana Istrefi, Eric Mengus, MatthiasPaustian,JohnRoberts,AkatsukiSukeda,YukiTeranishi,MartinUribe,MattWaldron,andseminar participants atthe13th DynareConference, the21st T2MConference, the24th InternationalConference for Computing in Economics and Finance, and Mannheim University for useful comments. The views expressed in this paper should be regarded as solely those of the authors and are not necessarily those of the Federal Reserve Board of Governors, the Federal Reserve System, or the European Central Bank. All errors and omissions are solely our own. †Division of Research and Statistics, Federal Reserve Board, 20th Street and Constitution Avenue N.W. Washington, D.C. 20551; Email: taisuke.nakata@frb.gov ‡University of Tokyo, Faculty of Economics, 7-3-1 Hongo, Bunkyo, Tokyo, 113-0033, Japan; Email: rogaki92@g.ecc.u-tokyo.ac.jp §European Central Bank, Monetary Policy Research Division, 60640 Frankfurt, Germany; Email: sebastian.schmidt@ecb.int ¶UNC Kenan-Flagler Business School (CB 3490, McColl Building), 300 Kenan Center Dr., Chapel Hill, NC 27599; Email: paul yoo@kenan-flagler.unc.edu 1

1 Introduction When current policy rates are at or close to their effective lower bounds (ELB), central banks often turn to communication about the future path of policy rates—known as forward guidance—asanalternativemeanstostimulateeconomicactivity. Accordingtothestandard sticky-price model often used in academia and central banks to analyze monetary policy, forward guidance is a powerful substitute for a change in the current policy rate and should be used by central banks to improve welfare when the current policy rate is constrained by the ELB (see Eggertsson and Woodford (2003); and Jung, Teranishi, and Watanabe (2005)). In particular, in this model, the central bank finds it optimal to announce that it will keep the policy rate at the ELB for longer than would be warranted by future output and inflation stabilization considerations alone. An intriguing feature of this standard model is that the economic effects of forward guidance can be implausibly large (Carlstrom, Fuerst, and Paustian (2015); Del Negro, Giannoni, and Patterson (2015); and Kiley (2016)). This feature—often referred to as the forward guidance puzzle—has generated concern among researchers that the standard model is of limited use for the analysis of forward guidance policies and, as a result, has also generated an interest in modifying the standard model to mitigate the implausibly large effects of forward guidance. As we will review shortly, a number of recent papers have shown that various economically sensible departures from the standard framework go a long way in attenuating the forward guidance puzzle, but they have done so under the assumption that the interest rate policy is characterized by a simple feedback rule. In this paper, we examine the implications of attenuating the forward guidance puzzle for the optimal design of forward guidance policy. We do so by introducing private-sector discounting—discountingoftheexpectedfutureincomeintheEulerequationanddiscounting of the expected future marginal costs of production in the Phillips curve—into an otherwise standard sticky-price model and characterizing how the degree of discounting affects optimal commitment policy. We assume that the central bank is concerned about inflation and output stabilization. Our setup is motivated by McKay, Nakamura, and Steinsson (2016b) and Gabaix (2016), among others, who relate their models of less powerful forward guidance tothediscountedEulerequationand/orthediscountedPhillipscurve. Webeginouranalysis by examining the effect of discounting on optimal policy in a three-period model in which the cost and the benefit of adjusting future policy rates can be transparently described. We then move on to examining the effect of discounting in a series of infinite-horizon models and study how the degree of discounting affects optimal policy. A priori, the implication of less powerful forward guidance for optimal policy is not clear. When forward guidance is less powerful, the central bank may want to promise to keep the policy rate at the ELB for longer, as the economy would not be sufficiently stimulated otherwise. However, it is also plausible that the central bank wants to keep the policy rate at the ELB for a shorter period when forward guidance is less powerful; in an extreme case 2

in which forward guidance has no effect on current economic activities, it would be pointless for the central bank to promise that it will keep the policy rate at the ELB after the natural rate of interest becomes positive. The main insight from the three-period model in the first part of the paper is as follows. The cost of a commitment by the central bank during a crisis involving the binding ELB constraint to keep the policy rate at the ELB for longer is that the economy experiences a temporary overheating in the aftermath of the crisis. The benefit of such a commitment is that, becauseoftheoverheating, thedeclinesinoutputandinflationaremitigatedduringthe crisis. Because the private-sector discounting makes the economy less sensitive to a future rate change, the discounting reduces both the cost and the benefit of a commitment to keep the policy rate at the ELB for longer. Since the cost and the benefit are both lower, whether the discounting makes it optimal for the central bank to promise to keep future policy rates low for longer or shorter than in the standard model is a quantitative question and depends on the parameters of the model. Turning to the infinite-horizon models, when there is discounting only in the Euler equation, we find that optimal monetary policy in a liquidity trap entails committing to keeping the policy rate at the ELB for longer than in the standard model without discounting. Even though the policy rate is kept at the ELB for longer with higher discounting, the declines in output and inflation at the outset of the liquidity trap are larger, as the extension of the ELB duration is not large enough to compensate for the reduced power of forward guidance. Likewise, we find that the central bank will find it optimal to keep the policy rate at the ELB for longer in the model with discounting only in the Phillips curve than in the standard model without any discounting. EvenwhenweallowfordiscountinginboththeEulerequationandthePhillipscurve, the optimaldurationofkeepingthepolicyrateattheELBcontinuestobelongerwithdiscounting than without discounting, unless the degree of discounting is large in both equations. When households and price-setters heavily discount their expected future income and marginal costs, respectively, in making their decisions, forward guidance becomes so ineffective that the central bank finds it optimal to keep the policy rate at the ELB for a shorter period than in the standard model. Given our finding that the implication of less powerful forward guidance importantly depends on the degree of discounting, we will end our analysis by reviewing the evidence regardingthedegreeofdiscountingintheEulerequationandthePhillipscurve. Wefindthat, through the lens of existing micro-founded models of less powerful forward guidance, minor deviations of the standard Euler equation and the standard Phillips curve are empirically plausible and are enough to meaningfully attenuate the forward guidance puzzle. According to our analysis of the infinite-horizon models, a small degree of discounting makes it optimal forthecentralbanktokeepthepolicyrateattheELBforlonger. Theincreaseintheoptimal ELB duration due to discounting can be quantitatively large. Under some specifications 3

consideredintheliterature,wefindthatthediscountingincreasestheexpectedELBduration associated with optimal policy by about one year. An important caveat to our analysis is that it is semi-structural. We are silent—and purposefullyso—aboutthemodel’sprimitivesthatjustifythediscountinginthelog-linearized equilibrium conditions characterizing the households and price-setters. Also, we assume that the central bank’s objective function is given by the standard quadratic function of inflation and the output gap. Although this objective function can be justified as the secondorder approximation to household welfare in the standard model, it may not be necessarily consistent with household welfare in more micro-founded models of Euler equation and/or Phillips curve discounting. While it would be ideal to characterize the optimal commitment policy in each of the growing number of micro-founded models of less powerful forward guidance, we think that our semi-structural approach has some merits. First, as we discussed, many different microfounded models will end up with something that looks like the discounted Euler equation and Phillips curve. As a result, the insights from our analysis can be seen as providing a useful starting point for understanding optimal policy in various micro-founded models of less powerful forward guidance. Second, the minimal departure from the standard model entailed by our approach allows us to sharply characterize the cost and benefit of keeping interest rates at the ELB for an extended period and the way they are affected when the forward guidance becomes less powerful, and our approach facilitates the interpretation of theresultsonoptimalpolicy. Finally,fromamorepragmaticperspective,solvingtheoptimal commitment policy problem in some of the micro-founded models discussed above without abstracting from the ELB constraint is computationally very hard or infeasible.1 Our paper builds on recent papers that either examine the forward guidance puzzle in a standard sticky-price model, or propose ways to attenuate the puzzle, or both. Carlstrom, Fuerst, and Paustian (2015) and Kiley (2016) show that the effects of a future rate cut are much smaller in sticky-information models than in sticky-price models. Del Negro, Giannoni, and Patterson (2015) attenuate the forward guidance puzzle by introducing an overlappinggenerations structure into an otherwise standard New Keynesian model, while Angeletos and Lian (2016); Gabaix (2016), Haberis, Harrison, and Waldron (2017); and Wiederholt (2015) attenuate the puzzle by departing from the assumption of common knowledge, rational expectations, perfect credibility, and perfect information, respectively. McKay, Nakamura, and Steinsson (2016a) and Kaplan, Moll, and Violante (2016) show that the effect of forward guidance is much smaller in a model with heterogeneous households and incomplete markets than in the standard New Keynesian model with a representative household and complete markets. All of these papers restrict the scope of their analysis to Taylor-type interest rate 1InAppendixG,wecharacterizeoptimalcommitmentpolicyinastructuralmodeloflesspowerfulforward guidance—an overlapping-generation New Keynesian model with perpetual youth—and find that the results from the structural analysis are consistent with those from our semi-structural analysis. 4

rules in analyzing the dynamics of the economy at the ELB.2 Our contribution is to depart from the assumption of a suboptimal monetary policy and analyze the implications of attenuated forward guidance for optimal commitment policy when the policy rate is constrained by the ELB. The two papers closest to ours are Bilbiie (2017) and Andrade, Gaballo, Mengus, and Mojon (2017). Bilbiie (2017) uses a novel framework developed by Bilbiie (2016) to analyze theimplicationsofdiscountingintheEulerequationfortheoptimalELBduration. Andrade, Gaballo, Mengus, and Mojon (2017) build a model in which the presence of “pessimists” who do not believe in the central bank’s commitment to keeping the policy rate at the ELB for an extended period attenuates the forward guidance puzzle, and they examine their model’s implication for the optimal ELB duration. Our work differs methodologically from theirs, as we characterize the optimal commitment policy whereas they compute the optimal ELB duration assuming that, after liftoff, the policy rate either returns to its steady state immediately (Bilbiie (2017)) or is determined by a simple feedback rule (Andrade, Gaballo, Mengus, and Mojon (2017)). There are also interesting substantive differences between—and similarities across—these two papers on one hand and our paper on the other, which will be discussed in detail in the main body of the paper. Finally, our work is related to the literature on optimal monetary policy under commitmentintheNewKeynesianmodelwiththeELBconstraint(EggertssonandWoodford(2003); Jung, Teranishi, and Watanabe (2005); Adam and Billi (2006); and Nakov (2008)). These papers established the desirability of keeping the policy rate at the ELB for an extended periodintheNewKeynesianmodelwiththestandardEulerequationandthe standardPhillips curve. Our paper extends their analysis to models with a discounted Euler equation and a discounted Phillips curve, showing that the desirability of “low-for-long” policy survives even in models with discounting and that the central bank often finds it optimal to keep the policy rate low for longer with discounting than without discounting. Theremainderofthepaperisorganizedasfollows. Section2presentsourstylizedanalysis based on a three-period model. Section 3 extends the analysis to an infinite-horizon model. Section 4 reviews the evidence on the degree of discounting and Section 5 briefly summarizes the result of an optimal policy analysis in a micro-founded model of less powerful forward guidance (discussed in detail in Appendix G). Section 6 concludes. 2 A three-period model Inthissection,weexaminetheimplicationofprivate-sectordiscountingforoptimalpolicy in a deterministic three-period model. For simplicity, we only allow discounting in the Euler equation. The three-period structure allows us to transparently describe how the Euler 2Gabaix (2016) studies optimal policy in his model but abstracts from the ELB constraint when characterizing the optimal interest rate policy under commitment. 5

equation discounting affects the marginal costs and benefits of forward guidance and thus optimal policy. 2.1 Private sector Time is discrete and indexed by t. The economy starts at t = 1 and ends in t = 3.3 There is no uncertainty. In periods 1 and 2, the private-sector equilibrium conditions are given by the following two equations: y = (1−α)y −σ(i −π −rn), (1) t t+1 t t+1 t π = κy +βπ , (2) t t t+1 where y is the output gap, π denotes the inflation rate between periods t−1 and t, i is the t t t nominal interest rate between periods t and t+1 on a risk-free bond, and rn is a demand t shock that captures exogenous fluctuations in the natural real rate of interest. Equation (1) is the Euler equation, and equation (2) is the Phillips curve. α is a discounting parameter on the expected output next period. If α = 0, these two equations represent the textbook New Keynesian model studied in detail in Woodford (2003) and Gali (2015), among others. In the final period 3, the private-sector equilibrium conditions are given by y = −σ(i −rn), (3) 3 3 3 π = κy . (4) 3 3 The demand shock is assumed to be negative in period 1 and stays constant at a positive value in periods 2 and 3. That is, rn = rn < 0, and rn = rn = rn > 0. 1 L 2 3 2.2 Monetary policy The central bank has the following objective function: u(y,π) = − 1 (cid:0) π2+λy2(cid:1) . (5) 2 At the beginning of the initial period t = 1, the central bank chooses sequences of the output gap, inflation, and the nominal interest rate in order to maximize the discounted sum of current and future utility flows. The central bank’s optimization problem is given by max u(y ,π )+βu(y ,π )+β2u(y ,π ), 1 1 2 2 3 3 {yt,πt,it}3 t=1 3Moreprecisely,weareworkingwithaninfinite-horizonmodelinwhichtheeconomyisassumedtobeat the steady state from period 4 onwards (in a pure three-period model, there would not be any demand for government bonds at period 3). We simply refer to this setup as “the three-period model” for the sake of brevity. 6

subject to the private-sector equilibrium conditions described earlier and the ELB constraint on the policy rate: i ≥ i , t ELB for all t ∈ {1,2,3}. As a reference, we will also consider how the economy behaves when the nominal interest rate is determined according to the following simple interest rate feedback rule: i = max[rn+φ π ,i ], (6) t t π t ELB where φ > 1. π 2.3 Results Whileoursimplethree-periodmodelcanbesolvedinclosedforminprinciple,thesolution turns out to depend on the discounting parameter α in a complicated, nonlinear way. We therefore use a numerical example to describe how discounting affects optimal policy.4 Table 1 shows the parameter values used for the numerical analysis. For this three-period model, they are chosen not based on empirical realism, but to make the key takeaways from the analysis transparent. Table 1: Parameter Values: Three-Period Model Parameter Description Values β Discount factor 0.9925 σ Intertemporal elasticity of substitution 1 κ Slope of the Phillips Curve 0.2 θ Relative price elasticity of demand 10 λ Relative weight on output gap volatility (κ/θ) 0.02 α Discounting parameter in the Euler equation [0,0.5] i The effective lower bound 0 ELB rn Long-run natural real rate (1/β−1) 0.0075 rn Natural real rate in period 1 −0.03825 L Figure 1 shows the paths of the output gap, inflation and the policy rate in the model with discounting (α = 0.5) and in the model without discounting (α = 0).5 Under the simple interest-rate rule, output and inflation are fully stabilized at their steady state values from period 2—when the crisis shock disappears—regardless of the value of α. Because the allocation in period 2 does not depend on α, and because the policy rate in period 1 is constrained at the ELB regardless of α, output and inflation in period 1 do not depend on α.6 Output declines by about 4 percent and inflation drops by 3 percentage points in period 1. 4In Appendix A, we provide some analytical results in a version of the three-period model with a static Phillips curve. 5The parameterization α=0.5 in the model with discounting is chosen for illustrative purposes. 6Inthethree-periodmodelofthissectionandtheinfinite-horizonmodelofthenextsection,theallocation under the simple interest rate rule is identical to the allocation under the optimal discretionary policy. 7

Figure 1: Optimal Policy in a Three-Period Model with and without Discounting Policy Rate Output Gap Inflation 4 2 2 1 1 3 0 0 2 −1 −1 −2 −2 1 Optimal Policy (α = 0) −3 −3 Optimal Policy (α = 0.5) Simple Rule 0 −4 −4 1 2 3 1 2 3 1 2 3 Time Time Time Note: Units are annualized percent, annualized percentage points, and percent deviation for policy rate, inflation, and output gap, respectively. Under the optimal policy, the central bank keeps the policy rate at the ELB in period 2 and chooses the policy rate below the natural rate of interest in period 3 (i < rn). This 3 3 low-for-longer policy results in overshooting of output and inflation in periods 2 and 3, which mitigates the declines in output and inflation in period 1 through expectations. The policy rate in period 3 depends on the degree of discounting, with the central bank in the economy with discounting choosing a lower policy rate than the central bank in the economy without discounting. Even though the policy rate in period 3 is lower under discounting, the initial declines in output and inflation are larger under discounting, reflecting the reduced power of forward guidance under discounting. However, regardless of the degree of discounting, the declines in output and inflation in period 1 are smaller under optimal policy than under the interest rate feedback rule. To understand how the degree of discounting affects optimal policy, Figure 2 shows how differently an announcement of a future rate cut—here a reduction in i by 75 basis points 3 below the optimal level—affects the paths of output and inflation in the models with and without discounting. In both models, a promise to lower the policy rate further in period 3 from its optimal level mitigates the declines in output and inflation in period 1, but amplifies the overshooting of output and inflation in period 2, as the comparison of dashed and solid linesineachpanelofFigure2demonstrates. Hence, theratecutinperiod3bearsthebenefit of smaller output and inflation deviations from their targets in period 1 and the cost of larger target deviations in period 2 in both models.7 Because a rate cut in period 3 has a smaller effect on inflation and output in earlier 7The optimal commitment policy accounts for this tradeoff when choosing the level of the policy rate for period 3 so the central bank has no incentive in period 1 to announce a period-3 policy rate that deviates from the one that solves the optimization problem under commitment. 8

Figure 2: Effects of a Reduction in the Future Policy Rate on the Macroeconomy with and without Discounting (a) Model w/o Discounting (α=0) 2 1 0 −1 −2 −3 1 2 3 paG tuptuO i = i opt (α = 0) − 75 b.p. 3 3 i = i opt (α = 0) 3 3 Time 2 1.5 1 0.5 0 −0.5 −1 1 2 3 noitalfnI (b) Model w/ Discounting (α=0.5) 2 1 0 −1 −2 −3 1 2 3 Time paG tuptuO i = i opt (α = 0.5) − 75 b.p. 3 3 i = i opt (α = 0.5) 3 3 Time 2 1.5 1 0.5 0 −0.5 −1 1 2 3 noitalfnI Time Note: The thin solid line represents the steady states where u is at maximum. Units are percent deviation t and annualized percentage points for output gap and inflation, respectively. periods, it reduces and increases the target deviation in periods 1 and 2, respectively, by less in the model with discounting than in the standard model. In our three-period model, for any given level of the third-period policy rate, the degree of discounting does not matter for allocations in period 3, and thus the level of overshooting in period 2 is smaller with discounting than without discounting for any i < rn. As a result, for any given level of the 3 third-period policy rate, a smaller increase in the target deviation in period 2 induced by a third-period rate cut means a smaller increase in the welfare cost of overshooting in period 2 in the model with discounting than in the standard model. However, the smaller reduction in the target deviation in period 1 induced by a smaller increase in the target overshooting in period 2 does not necessarily mean that the welfare benefit of a less severe recession is smaller in the model with discounting; for any given level of the third-period policy rate, the recession is more severe to begin with in the model with discounting than in the standard model, and the objective function is quadratic. Thus, a smaller increase in inflation and 9

output in period one can lead to a larger increase in welfare in the model with discounting than in the standard model. Figure 3: Tradeoff of Adjusting the Future Policy Rate with and without Discounting Discounted future utilities Utility at time 1 (u ) x 10−3 1 x 10−3 (β u + β 2 u ) 0 0 2 3 −1 −1 −2 −2 −3 −3 −4 w/o Discounting (α = 0) w Discounting (α = 0.5) −5 −4 0 1 2 3 0 1 2 3 i i 3 3 Marginal Cost and Benefit x 10−3 Welfare x 10−3 −2 3 −3 2 MC −4 1 −5 0 MB −6 −1 0 1 2 3 0 1 2 3 i i 3 3 Note: Thesolid(dashed)verticallinedenotesmaximumwelfareforα=0(α=0.5)inthebottomtwopanels. MB and MC stand for marginal welfare benefit and cost, respectively. Figure 3 shows how the discounting affects the cost and benefit of a future rate cut in detail. In each panel, solid and dashed lines are for the standard model and the model with discounting, respectively. For different values of i , the upper left panel plots the central 3 bank’s utility flow in period 1, while the upper right panel plots the discounted sum of utility flows in periods 2 and 3. Consistent with the discussion in the previous paragraph, in both the models with and without discounting, the utility flow of the initial period is decreasing in i , whereas the discounted sum of utility flows of the second and third periods is increasing in 3 i (as long as i < rn). The bottom left panel shows welfare, the sum of the top two panels. 3 3 The optimal i that maximizes welfare equates the marginal benefit and cost of reducing the 3 third-period policy rate, which are shown in the bottom right panel. Consistent with the discussion in the previous paragraph, the marginal cost of reducing the third-period policy rate—in the form of a larger overshooting—is smaller in the model with discounting than in the standard model for any level of i . However, the marginal benefit of reducing the 3 10

third-period policy rate—in the form of a less severe recession—may be smaller or larger in the model with discounting than in the standard model. In our numerical example, the marginal benefit is higher in the model with discounting if i is sufficiently small, but is lower 3 otherwise. Appendix B reinforces the idea that the effect of discounting for optimal policy is ambiguous by providing an alternative parameterization of the three-period model in which the optimal third-period policy rate is higher with discounting than without discounting. Given our finding that the effect of discounting for optimal policy depends importantly on the specifics of the model, we now turn our attention to an infinite-horizon model that is stylized, but is parameterized in a way that can speak to the implications of discounting in a more empirically relevant setup. 3 An infinite-horizon model In this section, we examine the effect of the private-sector discounting on optimal policy in infinite-horizon models. In addition to discounting in the Euler equation, we allow for discounting in the Phillips curve. The economy is buffeted by two exogenous disturbances, a demand shock (rn) and a cost-push shock (e ). t t 3.1 The private sector and monetary policy The aggregate private-sector behavior is summarized by the following two loglinear equilibrium conditions: y = (1−α )E y −σ(i −E π −rn), (7) t 1 t t+1 t t t+1 t π = κy +(1−α )βE π +e , (8) t t 2 t t+1 t where α and α are the discount parameters in the Euler equation and the Phillips curve, 1 2 respectively.8 The two exogenous shocks, (rn,e ), are perfectly correlated, as in Boneva, t t Braun, andWaki(2016)andHillsandNakata(2018), andfollowatwo-stateMarkovprocess. In the first period, the economy is in the crisis state. It moves to the normal state—which is assumed to be an absorbing state—with a positive probability (1−µ) each period.9 In the crisis state, rn = rn and e = e . In the normal state, rn = rn = 1/β −1 and e = 0. t L t L t t At the beginning of the first period, the central bank chooses a state-contingent sequence of (y ,π ,i ) in order to maximize the expected discounted sum of future utility flows given t t t 8Somepapers,includingMcKay,Nakamura,andSteinsson(2016b),haveconsideredanalternativespecificationofthediscountedEulerequationinwhichthereisanadditionaldiscountingparameterfortheexpected real rate. The key results of our analysis are robust to this alternative specification of the discounted Euler equation. These results are available upon request. 9The assumption of an absorbing state is common in the literature and is without loss of generality for the purpose of this paper. However, the departure from this assumption does have interesting implications, as explored by Nakata (2018) and Nakata and Schmidt (2014). 11

by ∞ (cid:88) max E βt−1u(y ,π ) 1 t t {yt,πt,it}∞ t=1 t=1 subject to equations ((7) - (8)) and the ELB constraint on the policy rate (i ≥ i ). The t ELB utility flow, u(·), is given by the standard quadratic objective function (see equation (5)). Further details on the optimization problem and the associated optimality conditions are provided in Appendix D. 3.2 Parameterization Table 2 shows the baseline parameter values. Our choice of β = 0.9925 implies a real interest rate in the normal state of 3 percent (annualized). σ = 1 is consistent with the logutilityspecificationofthehouseholdpreferenceforconsumptioninthestandardintertemporal optimization problem of the household. θ = 8 is within the range of values found in the literature (see Broda and Weinstein (2006); Denes, Eggertsson, and Gilbukh (2013)). κ = 0.007 is consistent with a Calvo parameter of 0.84 in the Calvo model of price-setting. Our choice of µ = 5/6 is within the range of values used in the literature and implies an expected duration of crisis of one and a half year. The value of λ is set to be consistent with the value implied by the second-order approximation of the household’s welfare.10 The main exercise of this section is to examine how α and α affect optimal policy. We will consider the range 1 2 from 0 to 1 for both parameters. Table 2: Parameter Values: Infinite-Horizon Model Parameter Description Values β Discount factor 0.9925 σ Intertemporal elasticity of substitution 1 κ Slope of the Phillips Curve 0.007 θ Relative price elasticity of demand 8 λ Relative weight on output gap volatility (κ/θ) 0.875×10−3 i The effective lower bound on the policy rate 0 ELB α Discounting parameter in the Euler equation [0,1] 1 α Discounting parameter in the Phillips curve [0,1] 2 µ Persistence probability of the crisis state 5/6 (rn,e ) (demand shock,cost-push shock) *Chosen so that t t (y ,π )=(−0.07,−0.01/4) 1 1 under the simple rule Conditional on these parameter values, we choose the size of the two shocks, rn and L e , such that output and inflation fall by 7 percent and 1 percentage point, respectively, if L the central bank follows the simple interest rate rule described earlier; see equation (6).11 10In Appendix E, we conduct an analysis in which the value of λ is much higher and is consistent with placing equal weights on inflation and output stabilization objectives, a common practice in central banks. 11In the infinite-horizon model with an absorbing state, the allocation under policy rule (6) is identical to the allocation under the optimal discretionary policy. 12

This way of choosing the shock size means that the values of rn and e are different when L L we choose different values of α and α . The approach of keeping the severity of the crisis 1 2 constant as one varies the model’s parameter values is adopted by Boneva, Braun, and Waki (2016) and Hills and Nakata (2018), who adjust the size of shocks in their models to keep the magnitudes of the declines in output and inflation unchanged when conducting sensitivity analyses.12,13 3.3 Results We will first examine the effect of α on optimal policy while holding α = 0 (i.e., the 1 2 model with discounting only in the Euler equation). We then examine the effect of α on 2 optimal policy while holding α = 0 (i.e., the model with discounting only in the Phillips 1 curve). Finally, we will explore the implication of varying both α and α . 1 2 Figure4plotstheimpulseresponsefunctionsofthepolicyrate,inflationandoutputunder the simple interest rate rule and under the optimal monetary policy for various degrees of Euler equation discounting, when the crisis state persists for eight quarters. Figure 4: Optimal Policy w/ Discounting in the Euler Equation Policy Rate Output Gap Inflation 4 5 1 2.5 0.5 3 0 0 2 Simple Rule -2.5 -0.5 α = 0 1 1 α = 0.5 1 -5 -1 α = 1 1 0 -7.5 -1.5 0 10 20 30 40 0 10 20 30 40 0 10 20 30 40 Time Time Time Note: Inallmodels,α =0. Unitsareannualizedpercent,annualizedpercentagepoints,andpercentdeviation 2 for the policy rate, inflation, and the output gap, respectively. By construction, the allocation under the simple rule does not depend on α , because, as 1 we vary α , we vary the size of the shocks so that the declines in output and inflation in the 1 crisis state are unchanged under the simple rule. A key feature of the economy under the simple rule is that the economy returns to its steady state once the crisis shock disappears. 12Unlikeinthethree-periodmodel,thediscountingfactorsintheprivate-sectorbehavioralconstraintswill influence the level of output and inflation in the crisis state when monetary policy follows the simple rule. This dependency reflects the fact that, conditional on being in the crisis state, the crisis shock will persist in the subsequent period with positive probability µ, affecting private-sector expectations about future output and inflation. 13In Appendix F, we conduct an alternative analysis in which the size of shocks are kept constant as we vary discounting parameters. 13

Undertheoptimalpolicywithα = 0—whichcorrespondstothestandardEulerequation— 1 the central bank keeps the policy rate at the ELB beyond the point in time where the shocks jump back to the normal state. The policy of keeping the interest rate at the ELB for an extended period stimulates the economy by raising expectations about future output and inflation. The optimality of the low-for-long policy in this standard version of the model is well known (Jung, Teranishi, and Watanabe (2005); Eggertsson and Woodford (2003)). Comparingtheoptimalpolicyforalternativedegreesofdiscounting,wefindthatthemore households discount expected future real interest rates, the longer the central bank keeps the policy rate at zero. When α = 0, the policy rate is kept at the ELB for 13 quarters. As 1 α increases, the ELB duration increases monotonically, reaching 26 quarters when α = 1. 1 1 Nevertheless, as α increases, the initial declines in output and inflation are larger, reflecting 1 the diminished effectiveness of a cut in future interest rates. The initial decline in output is more sensitive to the degree of discounting than the initial decline of inflation, reflecting the fact that the discounting is in the Euler equation. One way to summarize the effect of discounting on optimal policy is to inspect the expected ELB duration under different degrees of discounting. As shown in the left panel of Figure 6, when α = 0, the expected ELB duration is about 10 quarters. As α increases, 1 1 the expected ELB duration increases monotonically and in a quantitatively significant way. When α = 1, the expected ELB duration is about 18 quarters, two years longer than when 1 α = 0. 1 Figure 5: Optimal Policy w/ Discounting in the Phillips Curve Policy Rate Output Gap Inflation 4 6 0.5 3 3 0 0 2 -0.5 Simple Rule -3 α = 0 2 1 -1 α = 0.5 2 -6 α = 1 2 0 -9 -1.5 0 10 20 30 40 0 10 20 30 40 0 10 20 30 40 Time Time Time Note: Inallmodels,α =0. Unitsareannualizedpercent,annualizedpercentagepoints,andpercentdeviation 1 for the policy rate, inflation, and the output gap, respectively. We now turn to the analysis of optimal policy in the model with discounting only in the Phillips curve. Figure 5 plots the same set of impulse response functions with alternative degrees of α when the crisis shock lasts for eight quarters. As in the previous model with 2 discounting only in the Euler equation, by construction, the allocations under the simple rule 14

donotdependonthedegreeofdiscounting. Underoptimalpolicy, thecentralbankkeepsthe policy rate at the ELB for longer in the models with discounting than in the model without discounting. However, unlike in the model with discounting only in the Euler equation, the effect of discounting on the optimal ELB duration is not monotonic, as can be seen in the right panel of Figure 6 which shows the effect of α on the expected ELB duration. As for 2 the allocation, the initial decline in inflation is larger when α is higher, as can be seen in 2 the right panel of Figure 5. Interestingly, the initial decline in output depends on the degree of discounting in a non-monotonic way. As α increases from 0, the initial output decline 2 becomes smaller at first, but increases eventually as α approaches 1, as can be seen in the 2 middle panel of Figure 5. Figure 6: Expected ELB Durations Discounting in Euler Eq. Discounting in Phillips Curve 20 20 15 15 10 10 5 5 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 α α 1 2 Note: The thin dashed lines represent the expected duration of the crisis shock. Thin dash-dotted lines represent expected ELB duration in the absence of discounting. Units are in quarters. Finally, we examine the effect of discounting in the model that allows for discounting in both the Euler equation and in the Phillips curve. Figure 7(a) plots the expected ELB duration for various combinations of α and α , while Figure 7(b) demarcates the parameter 1 2 spaceintothepartinwhichtheexpectedELBdurationisshorterorlongerthantheexpected ELB duration in the standard model (that is, the model with α = α = 0). According to 1 2 these figures, as long as either α or α is sufficiently low, the expected ELB duration is 1 2 higher with discounting than without discounting, in line with our previous results. The forward-looking element in households’ and firms’ decision making is too small to allow for an effective use of forward guidance as a means to steer private sector behavior and the expected ELB duration is lower with discounting than without discounting, only when α 1 and α are both high. The effect of discounting on the ELB duration can be large. According 2 to Figure 7(c), when both α and α are around 0.15, the expected ELB duration is about 1 2 five quarters longer than when they are 0. When both α and α are 1, the expected ELB 1 2 duration is about four quarters shorter than when they are 0. 15

Figure 7: Optimal Policy w/ Discounting in the Euler Equation and the Phillips Curve (7.a) E[ELB Duration] with α ∈ [0,1] and α ∈ [0,1] 1 2 20 15 10 0 0.2 5 0.4 1 0.8 0.6 0.6 0.4 0.2 0 1 0.8 α 2 α 1 ]noitaruD BLE[E 18 16 14 12 10 8 6 3.4 Relation to Bilbiie (2017) and Andrade, Gaballo, Mengus, and Mojon (2017) Bilbiie (2017) studies the implication of discounting in the Euler equation in a model with a static Phillips curve for the optimal ELB duration using a novel framework developed in Bilbiie (2016) in which the central bank optimally chooses—before the crisis shock hits the economy—the probability of keeping the policy rate at the ELB each period after the crisis shock disappears. He finds that the optimal expected ELB duration is lower when the discounting in the Euler equation is higher. A corresponding analysis in our paper is to vary α while keeping α = 1. According to Figure 7(a), the effect of increasing α is 1 2 1 non-monotonic when α = 1. One key factor accounting for the difference in the result 2 is that, as discussed in Section 3.2, we keep the severity of the shock constant as we vary discountingparameters,whereasBilbiie(2017)keepsthesizeoftheexogenousshockconstant. Accordingly, as the discounting parameter increases in Bilbiie (2017), the crisis becomes less severe, adding a force to reduce the optimal expected ELB duration. Andrade, Gaballo, Mengus, and Mojon (2017) study the implication for the optimal ELB duration of less powerful forward guidance associated with the presence of pessimists who do not believe in the central bank’s commitment to keep the policy rate at the ELB after the natural rate of interest becomes positive. They assume that the policy rate will be set according to a feedback rule after liftoff and let the central bank to choose the duration of keeping the policy rate at the ELB. They do find a nonlinear effect of less powerful forward guidance on the optimal ELB duration, which is reminiscent of Figure 7(c) in which we vary discounting parameters in both Euler equation and Phillips curve at the same time. The 16

(7.b) Pairs of (α ,α ) under which 1 2 E[ELB Duration] is shorter than in the standard model α 1 α 2 (7.c) E[ELB Duration] with α = α ∈ [0,1] 1 2 1 20 0.8 15 0.6 0.4 10 0.2 0 0 0.2 0.4 0.6 0.8 1 5 0 0.2 0.4 0.6 0.8 1 α = α 1 2 ]noitaruD BLE[E qualitative similarity makes sense, as the fraction of pessimists affect both the Euler equation and the Phillips curve in their model. Like Bilbiie (2017), they keep the size of the exogenous shock constant as they vary the fraction of pessimists. Interestingly, their nonlinearity is much more pronounced than ours. Also, the optimal ELB duration becomes the same as the ELB duration under optimal discretionary policy when a fraction of pessimists reaches a certain threshold in their model, while we find that the expected ELB duration under a fully optimal policy gradually approaches the ELB duration under optimal discretionary policy, as seen in Figure 7(c). 4 Discussion A key finding from the previous two sections is that the implication of private-sector discounting for optimal policy depends importantly on its degree. Given this finding, a natural question is, “what are the plausible values of discounting parameters?” In this section, we investigate this question by reviewing the values of the discounting parameters in the Euler equation and the Phillips curve consistent with plausible parameterizations of different micro-founded models of less powerful forward guidance in the literature.14 DelNegro,Giannoni,andPatterson(2015)showthattheNewKeynesianmodelcombined withanoverlapping-generationsstructurealaBlanchard(1985)andYaari(1965)goesalong wayinattenuatingtheforwardguidancepuzzle. Thelog-linearizedprivate-sectorequilibrium conditions of their overlapping-generations model feature a discounted Euler equation and a discounted Phillips curve akin to those considered in our paper. The discounting parameters in their Euler equation and the Phillips curve are related to the probability of death, which can be interpreted more broadly to capture different forms of wealth resetting, such as a 14There are vast empirical literatures estimating the consumption Euler equation and Phillips curve. A comprehensive review of these literatures is beyond the scope of this section. 17

default. They consider two parameterizations; one implies α = 0.013 and α = 0.191 and 1 2 the other implies α = 0.04 and α = 0.209. 1 2 McKay, Nakamura, and Steinsson (2016a) show that the forward guidance puzzle is substantially muted in an incomplete-markets model in which households face uninsurable income risks, and McKay, Nakamura, and Steinsson (2016b) show that the dynamics of the model of McKay, Nakamura, and Steinsson (2016a) is well approximated by the sticky-price model with the discounted Euler equation and the standard Phillips curve. They show that α = 0.03 and α = 0.06 are consistent with the dynamics of their incomplete-markets model 1 1 with low-risk and high-risk calibrations, respectively. Gabaix (2016) proposes a version of the New Keynesian model in which agents are nonrational, demonstrating that forward guidance is less powerful in his model and that the dynamics of his model are characterized by a discounted Euler equation and a discounted Phillips curve. While his analysis is mainly analytical, he presents a numerical example in which he sets α = α = 0.15, informed by the estimates of a hybrid Phillips curve from 1 2 Gal´ı and Gertler (1999). Angeletos and Lian (2016) show the attenuation of the forward guidance puzzle in a sticky-price model with imperfect common knowledge and derive the discounted Euler equation and the discounted Phillips curve. In their numerical example, the valuesassignedtotwoparametersgoverningthedegreeofdeparturefromcommonknowledge imply α = 0.003 and α = 0.037. 1 2 Carlstrom, Fuerst, and Paustian (2015) and Kiley (2016) show that the forward guidance puzzle is attenuated in sticky-information models. In sticky-information models, inflation todaydependsontheinfinitesumoftheexpectedcurrentmarginalcoststhatwereforecastin the past. As a result, there is no straightforward one-to-one mapping between our discounted Phillips curve and the sticky-information model. However, to the extent that the expected future inflation does not show up at all in the sticky-information model, the corresponding α can be thought of as 1. Both papers use the standard Euler equation (that is, α = 0). 2 1 Finally, Chung (2015) shows that the effect of a future interest rate change is much more muted in FRB/US—a macroeconomic model used by the staff at the Board of Governors of the Federal Reserve System for policy analysis—than in other more standard sticky-price models, such as Smets and Wouters (2007). FRB/US developers in the 1990s introduced a discounting coefficient on the expected future income in the consumption Euler equation on the grounds that (i) risk-averse households would respond less to changes in the future income when they face uninsurable income risks than when they do not and (ii) households are finitely-lived.15. The discounting parameter in the Euler equation in FRB/US is 0.06. While there is various built-in inertia in the FRB/US Phillips curve, there is no explicit discounting of the expected inflation, so α = 0.16 2 15Interestingly, Reifschneider (1996)—a technical document that describes the consumption sector of FRB/US—motivates the introduction of the discounting parameter on the expected future income to the consumptionEulerequationbyreferringtotheearlyliteratureonuninsurableincomeriskandprecautionary saving,suchasCarroll(1997),aswellasthemodelsofperpetualyouthbyBlanchard(1985)andYaari(1965). 16Kiley and Roberts (2017) find that inflation and output are better stabilized under an inertial interest 18

All told, according to the existing models of less powerful forward guidance, minor deviations from the standard Euler equation and the standard Phillips curve are empirically plausible and are enough to meaningfully attenuate the forward guidance puzzle. According to our analysis in the previous section, when the degree of discounting is small, the central bank wants to keep the policy rate at the ELB for longer than in the standard model. Under some choices of α and α considered in the literature (Del Negro, Giannoni, and Patterson 1 2 (2015); McKay, Nakamura, and Steinsson (2016b); Angeletos and Lian (2016); and Chung (2015)), the optimal ELB duration is only slightly longer than in the standard model. However, under other choices of α and α (Gabaix (2016); Carlstrom, Fuerst, and Paustian 1 2 (2015); and Kiley (2016)), the optimal ELB duration is extended in a quantitatively significant way. With α = α = 0.15 considered in Gabaix (2016), the expected ELB duration is 1 2 more than one year longer than in the standard model, as can be seen in Figure 7(c). With α = 0 and α = 1 implied by the sticky-information model, the expected ELB duration is 1 2 about half a year longer than in the standard model, as can be seen in the right panel of Figure 6. 5 Analysis based on a micro-founded model of less powerful forward guidance So far, we have focused on analyzing the implication of less powerful forward guidance in New Keynesian models with a discounted Euler equation and/or Phillips curve without taking a stance on the structural model that generates discounting in these two equations. In Appendix G, we characterize an optimal commitment policy in a structural model of less powerful forward guidance. The structural model we examine is a New Keynesian model with overlapping generations (OLG) developed by Piergallini (2006), Castelnuovo and Nistic´o (2010), and Nistic´o (2012). Recently, Del Negro, Giannoni, and Patterson (2015) show that the log-linearization of the OLG New Keynesian model leads to discounting in both the consumption Euler equation and the Phillips curve and that the effects of anticipated monetary policy shock on today’s economic activities is smaller in this model than in the standard New Keynesian model. In particular, the higher the probability of dying, the smaller the effects of a future interest rate adjustment are on today’s economic activities. In Appendix G, we find that the result from the OLG New Keynesian model is consistent with the main result from the model with a discounted Euler equation and Phillips curve. That is, the higher the death probability is—the less powerful forward guidance is—the more future accommodation the central bank would choose to promise. rate rule that implements low-for-long policy than under other more standard rules in the FRB/US model, whereasBrayton,Laubach,andReifschneider(2014)showthatlow-for-longpolicyisakeyfeatureofoptimal commitment policy in FRB/US model. These results are consistent with our result that low-for-long policy remains desirable in the model with discounting, unless the discounting parameter is close to one in both the Euler equation and Phillips curve. 19

It would be useful to characterize optimal commitment policy in other micro-founded models of less powerful forward guidance. We leave that effort to future research. Our hope is that our analysis based on the discounted Euler equation and Phillips curve can serve as a useful reference for other researchers when they analyze the optimal conduct of forward guidance policy in other micro-founded models of less powerful forward guidance. 6 Conclusion Standard sticky-price models imply implausibly large responses of output and inflation to announcements about interest rate changes far in the future. As a result, there is widespread concern that they may be of limited use as a framework for analyzing forward guidance policies. Researchershaverecentlyproposedvariousmicro-foundedmodificationsofthestandard modelingframeworkthatattenuatetheeffectofforwardguidanceoneconomicactivities. The dynamics of these modified models are often consistent with those of an otherwise standard New Keynesian model with a discounted Euler equation and/or Phillips curve. In this paper, we have examined the implication of less powerful forward guidance for the optimal design of forward guidance policy by analyzing how discounting on the part of the private sector affects the optimal commitment policy of the central bank caught in a liquidity trap. We have shown that forward guidance not only continues to be part of the optimal policy toolkit, but should also be used more heavily by the central bank than in the standard model under a wide range of the degree of discounting. In particular, it is optimal for the central bank to keep the policy rate at the ELB for a longer period of time in the model with discounting than in the standard model. This low-for-longer strategy allows the central bank to partially compensate for the reduced power of forward guidance due to discounting on current economic activities. 20

References Adam, K., and R. Billi(2006): “OptimalMonetaryPolicyUnderCommitmentwithaZeroBound on Nominal Interest Rates,” Journal of Money, Credit, and Banking, 38(7), 1877–1905. Andrade, P., G. Gaballo, E. Mengus, and B. Mojon (2017): “Forward Guidance and Heterogeneous Beliefs,” HEC Paris Research Paper Series 1192, HEC Paris Research Paper Series. Angeletos, G.-M., andC.Lian(2016): “ForwardGuidancewithoutCommonKnowledge,”Working Paper 22785, National Bureau of Economic Research. Bilbiie, F. O. (2016): “Optimal Forward Guidance,” CEPR Discussion Papers 11251, C.E.P.R. Discussion Papers. (2017): “The Puzzle, the Power, and the Dark Side: Forward Guidance Redux,” CEPR Discussion Papers 12231, C.E.P.R. Discussion Papers. Blanchard, O. J. (1985): “Debt, Deficits, and Finite Horizons,” Journal of Political Economy, 59(2), 135–149. Boneva, L. M., R. A. Braun, and Y. Waki(2016): “SomeUnpleasantPropertiesofLoglinearized Solutions when the Nominal Rate is Zero,” Journal of Monetary Economics, 84, 216–232. Brayton, F., T. Laubach, and D. Reifschneider(2014): “Optimal-ControlMonetaryPolicyin the FRB/US Model,” FEDS Notes 2014-11-21, Board of Governors of the Federal Reserve System (U.S.). Broda, C., and D. E. Weinstein (2006): “Globalization and the Gains From Variety,” The Quarterly Journal of Economics, 121(2), 541–585. Calvo, G. A., and M. Obstfeld (1988): “Optimal Time-Consistent Fiscal Policy with Finite Lifetimes,” Econometrica, 56(2), 411–432. Carlstrom, C. T., T. S. Fuerst, and M. Paustian (2015): “Inflation and Output in New Keynesian Models with a Transient Interest Rate Peg,” Journal of Monetary Economics, 76, 230– 243. Carney, M. (2017): “Lambda,” Remarks at the London School of Economics. Carroll, C. D. (1997): “Buffer-Stock Saving and the Life Cycle/Permanent Income Hypothesis,” Quarterly Journal of Economics, 112(1), 1–55. Castelnuovo, E., and S. Nistico´ (2010): “Stock Market Conditions and Monetary Policy in a DSGE Model for the US,” Journal of Economic Dynamics and Control, 34(9), 1700–1731. Christiano, L. J., and J. D. M. Fisher (2000): “Algorithms for Solving Dynamic Models with OccasionallyBindingConstraints,” Journal of Economic Dynamics and Control, 24(8), 1179–1232. Chung, H. (2015): “The Effects of Forward Guidance in Three Macro Models,” FEDS Notes 2015- 02-26, Board of Governors of the Federal Reserve System (U.S.). 21

Del Negro, M., M. Giannoni, and C. Patterson(2015): “TheForwardGuidancePuzzle,”Staff reports, Federal Reserve Bank of New York. Denes, M., G. B. Eggertsson, and S. Gilbukh (2013): “Deficits, Public Debt Dynamics and Tax and Spending Multipliers,” Economic Journal, 0, 133–163. Eggertsson, G., and M. Woodford (2003): “The Zero Bound on Interest Rates and Optimal Monetary Policy,” Brookings Papers on Economic Activity, 34(1), 139–235. Gabaix, X. (2016): “A Behavioral New Keynesian Model,” Working Paper. Gali, J.(2015): MonetaryPolicy, Inflation, andtheBusinessCycle.Princeton: PrincetonUniversity Press. Gal´ı, J., and M. Gertler (1999): “Inflation Dynamics: A Structural Econometric Analsis,” Journal of Monetary Economics, 44, 195–222. Haberis, A., R. Harrison, and M. Waldron (2017): “Uncertain Forward Guidance,” Staff Working Pape Series 654, Bank of England. Hills, T. S., and T. Nakata (2018): “Fiscal Multipliers at the Zero Lower Bound: The Role of Policy Inertia,” Journal of Money, Credit and Banking, 50(1), 155–172. Juillard, M., D. Laxton, P. McAdam, and H. Pioro (1998): “An Algorithm Competition: First-order Iterations versus Newton-based Techniques,” Journal of Economic Dynamics and Control, 22(8-9), 1291–1318. Jung, T., Y. Teranishi, and T. Watanabe (2005): “Optimal Monetary Policy at the Zero- Interest-Rate Bound,” Journal of Money, Credit, and Banking, 35(7), 813–35. Kaplan, G., B. Moll, and G. Violante (2016): “A Note on Unconventional Monetary Policy in HANK,” Working paper, Princeton University. Kiley, M.(2016): “PolicyParadoxesintheNewKeynesianModel,” Review of Economic Dynamics, 21, 1–15. Kiley, M., and J. Roberts (2017): “Monetary Policy in a Low Interest Rate World,” Brookings Papers on Economic Activity, Spring, 317–372. Maliar, L., and S. Maliar(2015): “MergingSimulationandProjectionApproachestoSolveHigh- Dimensional Problems with an Application to a New Keynesian Model,” Quantitative Economics, 6(1), 1–47. McKay, A., E. Nakamura, and J. Steinsson (2016a): “The Power of Forward-Guidance Revisited,” American Economic Review, 106(10), 3133–3158. (2016b): “The Discounted Euler Equation: A Note,” NBER Working Paper Series 22129, National Bureau of Economic Research. Nakata, T. (2017): “Uncertainty at the Zero Lower Bound,” American Economic Journal: Macroeconomics, 9(3), 186–221. 22

(2018): “Reputation and Liquidity Traps,” Review of Economic Dynamics, 28, 252–268. Nakata, T., and S. Schmidt(2014): “ConservatismandLiquidityTraps,”FinanceandEconomics Discussion Series 2014-105, Board of Governors of the Federal Reserve System (U.S.). Nakov, A. (2008): “Optimal and Simple Monetary Policy Rules with Zero Floor on the Nominal Interest Rate,” International Journal of Central Banking, 4(2), 73–127. Nistico´, S. (2012): “Monetary Policy and Stock-Price Dynamics in a DSGE Framework,” Journal of Macroeconomics, 34(1), 126–146. Piergallini, A. (2006): “Real Balance Effects and Monetary Policy,” Economic Inquiry, 44(3), 497–511. Reifschneider, D. (1996): “Household Consumption and Investment in the FRB/US Model,” Mimeo. Reifschneider, D. L., W. Wascher, and D. Wilcox (2015): “Aggregate Supply in the United States: RecentDevelopmentsandImplicationsfortheConductofMonetaryPolicy,”IMFEconomic Review, 63(1), 71–109. Smets, F., and R. Wouters (2007): “Shocks and Frictions in US Business Cycles: A Bayesian DSGE Approach,” American Economic Review, 97(3), 586–606. Wiederholt, M. (2015): “Empirical Properties of Inflation Expectations and the Zero Lower Bound,” Working Paper. Woodford, M.(2003): InterestandPrices: FoundationsofaTheoryofMonetaryPolicy.Princeton: Princeton University Press. Yaari, M. E.(1965): “UncertainLifetime,LifeInsurance,andtheTheoryoftheConsumer,”Review of Economic Studies, 32(2), 137–150. Yellen, J. L. (2012): “Revolution and Evolution in Central Bank Communications,” Remarks at the Haas School of Business, University of California, Berkeley, Berkeley, California. 23

For Online Publication: Technical Appendix The technical appendix is organized as follows. Appendix A provides necessary and sufficient conditions under which the optimal time-three policy rate declines as the discounting parameter increases in the three-period model with a static Phillips curve, while Appendix B shows a numerical example from the three-period model in which an increase in the discounting parameter leads to a higher time-three policy rate. Appendix C describes the details of optimal commitment policy in the three-period model, while Appendix D does so in the infinite-horizon model, including the solution method and its accuracy. Appendix E analyzes the implication of discounting for optimal policy in an infinitehorizon model with equal weights on inflation and output stabilization terms in the central banks objective function, while Appendix F analyzes the implication of discounting for optimal policy in an infinite-horizon model in a setup where we keep the shock size constant as we change discounting coefficients. Appendix G analyzes the implication of less powerful forward guidance for optimal policy in a New Keynesian model with overlapping generations and perpetual youth. A Some analytical results in the three-period model This section derives a necessary and sufficient condition for the marginal effect of an increase in the degree of discounting for the optimal policy rate and output in period 3 to be positive and negative, respectively. The condition is derived under the assumption that the ELB is binding in periods 1 and 2 and not binding in period 3. The perfect-foresight New Keynesian model with a static Phillips curve and a Consumption Euler equation with discounting consists of the following two private sector behavioral constraints: π = κy (A.1) t t y = (1−α)y −σ(i −π −rn) (A.2) t t+1 t t+1 t We can substitute (A.1) into (A.2), and obtain y = (1−α+κσ)y −σ(i −rn) (A.3) t t+1 t t Assuming that the central bank faces the objective function introduced in the main text, the optimization problem of a central bank acting under commitment is ∞ (cid:20) (cid:21) (cid:88) 1 max βt − (κ2+λ)y2+φEE(y −(1−α+κσ)y +σ(i −rn))+φELBi , (A.4) yt,it 2 t t t t+1 t t t t t=0 where we have used (A.1) to substitute out the inflation rate in the central bank objective function. The first order necessary conditions can be combined to 1−α+κσ σ(κ2+λ)y = −φELB + φELB, (A.5) t t β t−1 24

where φELB ≥ 0. As in the main text, we assume rn = rn < 0 and rn = rn = rn > 0. 1 L 2 3 Assumption 1 (for local analysis): For a given parameterization of the model, rn and rn are L such that under the optimal commitment policy φELB,φELB > 0 and φELB = 0. 1 2 3 Define Ω(α) ≡ 1−α+κσ. (A.6) The system of equilibrium conditions for the three-period model conditional on Assumption 1 being satisfied then reads σ(κ2+λ)y +φELB = 0, (A.7) 1 1 y −Ωy = σrn, (A.8) 1 2 L Ω σ(κ2+λ)y +φELB − φELB = 0, (A.9) 2 2 β 1 y −Ωy = σrn, (A.10) 2 3 Ω σ(κ2+λ)y − φELB = 0, (A.11) 3 β 2 y +σi = σrn, (A.12) 3 3 where I use Ω as a shortcut for Ω(α). Substituting (A.9) into (A.11) Ω2 Ω y = φELB − y (A.13) 3 β2σ(κ2+λ) 1 β 2 Substituting out φELB using (A.7) 1 Ω2 Ω y = − y − y (A.14) 3 β2 1 β 2 Substituting out y and y using (A.8) and (A.10), to get the policy function for y 1 2 3 (cid:18) Ω2 Ω4(cid:19) Ω2 (cid:18) Ω Ω3(cid:19) 1+ + y = −σ rn −σ + rn (A.15) β β2 3 β2 L β β2 One can then show that ∂y3 > 0 if and only if ∂α (cid:18) Ω5 Ω (cid:19) (cid:18) 1 Ω2 Ω4 Ω6(cid:19) − 2σ(−rn)+ +2 −2 − σrn > 0. (A.16) β4 β2 L β β2 β3 β4 We can further simplify this expression. Note that from (A.11), y > 0. Hence, from 3 (cid:16) (cid:17) (A.15), we have Ω2 (−rn)− Ω + Ω3 rn > 0. Let us rewrite (A.16): β2 L β β2 Ω3 (cid:20) Ω2 (cid:18) Ω Ω3(cid:19) (cid:21) (cid:18) 1 Ω2 Ω6(cid:19) Ω 2 (−rn)− + rn + +2 + rn−2 (−rn), (A.17) β2 β2 L β β2 β β2 β4 β2 L 25

where we know that the term in square brackets has to be positive. Further reorganizing terms, we get 1 (cid:18) Ω4 (cid:19)(cid:20) Ω2 (cid:18) Ω Ω3(cid:19) (cid:21) 1 (cid:18) Ω6 (cid:19) 2 −1 (−rn)− + rn + −1 rn. (A.18) Ω β2 β2 L β β2 β β3 Again, weknowthattheterminsquarebracketsispositiveifAssumption1issatisfied. Also, we know that Ω(α) > 0. √ √ Thus, ∂y3 > 0 if and only if Ω(α) > β, or, equivalently, 1− β+κσ > α. An interesting ∂α implication of this condition is that, if Assumption 1 is satisfied, a marginal increase in α at α = 0 (no Euler equation discounting) will always lead to an increase in y , and a reduction 3 in i . 3 Numerical example We use the parameterization from Section 2 in the main text for the three-period model. The only difference (besides having a static Phillips curve) is that we choose a slightly lower value of −0.03 (instead of −0.03825) for rn. This ensures that Assumption 1 is satisfied for L α ∈ [0,0.9]. Figure A.1 shows {y ,y ,y ,φELB,φELB,i } for α ∈ [0,0.9]. 1 2 3 1 2 3 Figure A.1: Allocations in the three-period model with a static Phillips curve y 1 10-3 1 ELB -1 2 -2 1 -3 0 0 0.2 0.4 0.6 0.8 0 0.2 0.4 0.6 0.8 y 2 10-4 2 ELB 1.5 4 1 2 0.5 0 0 0.2 0.4 0.6 0.8 0 0.2 0.4 0.6 0.8 y i 3 3 4 0.5 2 0 0 0 0.2 0.4 0.6 0.8 0 0.2 0.4 0.6 0.8 √ Figure A.2 plots 1− β +κσ−α (solid black line) as a function of α. Hence, for those values of α for which the solid black line is above the horizontal black zero line, the necessary and sufficient condition for ∂y3 > 0 is satisfied. ∂α 26

Figure A.2: Indicators for ∂y3 > 0 and ∂i3 < 0 ∂α ∂α 0.4 0 -0.4 -0.8 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 √ Note: Solid black line: 1− β+κσ−α. For those values of α for which the solid black line is above the horizontal black zero line, the necessary and sufficient condition for ∂y3 >0 and ∂i3 <0 is satisfied. ∂α ∂α B Additional results for the three-period model In Section 2 of the main text, we analyze the implications of discounting in the Euler equation for optimal policy in a three-period model. One takeaway from that section is that whether the discounting makes optimal policy more or less accommodative depends on parameter values. In this section, we present a case in which the discounting make optimal policy less accommodative. The parameter value for κ = 0.02 in this example is excerpted fromEggertssonandWoodford(2003). AllotherparametervaluesarethesameasinTable1. The setup is the same as in Section 2, i.e. the contractionary shock lasts only for one period. As before, we set the discounting parameters equal to 0.5 in the model with discounting. Figure B.1: Optimal Policy in a Three-Period Model (An Alternative Parameterization) Policy Rate Output Gap Inflation 4 2 0.2 1 0.1 3 0 0 2 −1 −0.1 1 Optimal Policy (α = 0) −2 −0.2 Optimal Policy (α = 0.5) Simple Rule 0 −3 −0.3 1 2 3 1 2 3 1 2 3 Time Time Time Figure B.1 shows the paths of the policy rate, inflation and the output gap under the alternative parameterization. Under the optimal policy, the central bank keeps the policy 27

rate at zero in period 2 both in the model with α = 0 and in the model with α = 0.5. The optimal level of the policy rate in period 3 is higher in the model with discounting than in the model without discounting. C Solving the three-period model under optimal commitment policy The system of equilibrium conditions under optimal commitment policy are given by: y = (1−α)y −σ(i −π )+σrn 1 2 1 2 1 y = (1−α)y −σ(i −π )+σrn 2 3 2 3 y = −σi σrn 3 3 π = κy +βπ 1 1 2 π = κy +βπ 2 2 3 π = κy 3 3 1 0 = λy +κφPC + φELB 1 1 σ 1 1 1−α 0 = λy +κφPC + φELB − φELB 2 2 σ 2 βσ 1 1 1−α 0 = λy +κφPC + φELB − φELB 3 3 σ 3 βσ 3 0 = π −φPC 1 1 1 0 = π −φPC +φPC − φELB 2 2 1 β 1 1 0 = π −φPC +φPC − φELB 3 3 2 β 2 and (i −i )φELB = 0, (i −i )φELB = 0, (i −i )φELB = 0 1 ELB 1 2 ELB 2 3 ELB 3 i ≥ i , i ≥ i , i ≥ i 1 ELB 2 ELB 3 ELB φELB ≥ 0, φELB ≥ 0, φELB ≥ 0 1 2 3 where φPC and φELB denote the Lagrange multipliers associated with the Phillips curve and t t the ELB constraint. The system of nonlinear equations can be solved as follows. First, assume that the ELB constraint is never binding. Set φELB = 0 for t = 1,2,3 and solve the resulting system of t linear equations. If i ≥ 0 for all t, the model is solved. If i < i , set i = i and t 1 ELB 1 ELB solve the system of linear equations with φELB > 0. If i ,i ≥ i , the model is solved. If 1 2 3 ELB i < i , set i = i and solve the system of linear equations with φELB,φELB > 0. If 2 ELB 2 ELB 1 2 i ≥ i , the model is solved. If i < i , set i = i and solve the system of linear 3 ELB 3 ELB 3 ELB equations with φELB,φELB,φELB > 0. 1 2 3 28

D Details of the optimal commitment policy D.1 The recursive characterization Followingthecommonpracticeintheliterature, wecharacterizetheoptimalcommitment policy recursively based on the saddle-point functional equation. The saddle-point functional equation corresponding to the infinite-horizon problem of the Ramsey planner is given by, W (rn,e ,ω ,ω ) t t t 1,t−1 2,t−1 =min max h(y ,π ,i ,ω ,ω ,γ ,γ )+βE W (rn ,e ,ω ,ω ) {γ1,t,γ2,t} {yt,πt,it} t t t 1,t−1 2,t−1 1,t 2,t t t+1 t+1 t+1 1,t 2,t where h(y ,π ,i ,ω ,ω ,γ ,γ ) t t t 1,t−1 2,t−1 1,t 2,t (cid:20) (cid:21) = − 1 (cid:0) π2+λy2(cid:1) −ω ((1−α )y +σπ )−ω (1−α )βπ 2 t t 1,t−1 1 t t 2,t−1 2 t +γ (y +σ(i −rn))+γ (π −κy −e ) 1,t t t t 2,t t t t and the optimization is subject to the the ELB constraint, and the following law of motions for ω and ω , 1,t 2,t ω = γ 1,t 1,t ω = γ 2,t 2,t Note that ω (the Lagrange multiplier on the Euler equation) and ω (the Lagrange mul- 1,t 2,t tiplier on the Phillips curve) are the only relevant state variables because the expectation terms show up only in these two equations.17 Define the state S := [rn,e ,ω ,ω ]. A t t t 1,t−1 2,t−1 Ramsey equilibrium can be defined as the set of time-invariant value and policy functions {W(S ), y(S ), π(S ), i(S ), ω (S ), ω (S ), ω (S )} that solve the saddle-point functional t t t t 1 t 2 t 3 t equation equation above. The FONC are 1−α 1 ∂y : −λy +ω − ω −κω = 0 t t 1,t 1,t−1 2,t β σ ∂π : −π − ω +ω −(1−α )ω = 0 t t 1,t−1 2,t 2 2,t−1 β ∂i : σω +ω = 0, t 1,t 3,t as well as the private sector behavioral constraints. 17ω is the Lagrange multiplier on the ELB constraint. 3,t 29

D.2 Solution method The problem is to find a set of policy functions, {y(S ), π(S ), i(S ), ω (S ), ω (S ), t t t 1 t 2 t ω (S )}, that solves the following system of functional equations 3 t y(S ) = (1−α )E y(S )−σ(i(S )−E π(S )−rn) (D.1) t 1 t t+1 t t t+1 t π(S ) = κy(S )+(1−α )βE π(S )+e (D.2) t t 2 t t+1 t i(S ) ≥ 0 ≡ i (D.3) t ELB 1−α ω (S ) = λy(S )+ 1 ω (S )+ω (S ) (D.4) 1 t t 1 t−1 2 t β σ ω (S ) = π(S )+ ω (S )+(1−α )ω (S ) (D.5) 2 t t 1 t−1 2 2 t−1 β ω (S ) = −σω (S ) (D.6) 3 t 1 t Following the approach of Christiano and Fisher (2000), we decompose these policy functions into two parts using an indicator function: one in which the policy rate is allowed to be less than i , and the other in which the policy rate is assumed to be i . That is, for ELB ELB any variable Z, Z(·) = I Z (·)+(1−I )Z (·). (D.7) {R(·)≥iELB} unc {R(·)≥iELB} ELB The problem then becomes finding a set of a pair of policy functions, {[y (·), y (·)], unc ELB [π (·), π (·)], [i (·), i (·)], [ω (·), ω (·)], [ω (·), ω (·)], [ω (·), unc ELB unc ELB 1,unc 1,ELB 2,unc 2,ELB 3,unc ω (·)]} that solves the system of functional equations above. This method can achieve a 3,ELB given level of accuracy with a considerable less number of grid points relative to the standard approach. The time-iteration method starts by specifying a guess for the values policy functions take on a finite number of grid points. The values of the policy function that are not on any of the grid points are interpolated or extrapolated linearly. Let X(·) be a vector of policy functions that solves the functional equations above and let X(0) be the initial guess of such policy functions.18 At the s-th iteration, given the approximated policy function X(s−1)(·), we solve the system of nonlinear equations given by equations (D.1)-(D.6) to find today’s y ,π ,i ,ω ,ω , and ω at each point of the state space. In solving the system of nonlinear t t t 1,t 2,t 3,t equations, we evaluate the value of future variables that are not on the grid points with linear interpolation. The system is solved numerically by using a nonlinear equation solver, dneqnf, provided by the IMSL Fortran Numerical Library. If the updated policy functions aresufficientlyclose to thepreviously approximatedpolicyfunctions, thenthe iterationends. Otherwise, using the former as the guess for the next period’s policy functions, we iterate on this process until the difference between the guessed and updated policy functions is sufficiently small ( (cid:13) (cid:13)vec(Xs(δ)−Xs−1(δ)) (cid:13) (cid:13) < 1e-12 is used as the convergence criteria). ∞ 18Forallmodelsandallvariables,weuseflatfunctionsatthedeterministicsteady-statevaluesastheinitial guess. 30

D.3 Solution accuracy We assess the accuracy of our solution for the allocation under the optimal commitment policy through the following two residual functions: R = |y −(1−α )E y +σ(i −E π −rn)|, 1,t t 1 t t+1 t t t+1 t R = |π −κy −(1−α )βE π −e |, 2,t t t 2 t t+1 t where the residual R measures the difference between the chosen output gap today and 1,t today’s output gap consistent with the optimization behavior of the household, as a percent deviation from steady-state output gap. Similarly, the residual R captures the same 2,t difference in inflation today. It is common to simulate the economy once for a very long time, and report summary statisticsof theseresidualsfromthe simulatedpathto quantifythesolutionaccuracy (Maliar andMaliar(2015)andNakata(2017),amongothers). However,suchapproachdoesnotmake sense in our model with an absorbing state; after a certain point in time, the economy is at the steady state forever in which the residuals are zero. We take the following approach to quantify the solution accuracy. We simulate 500,000 different economies. In each economy, the initial state is given by the crisis state in which rn = rn and e = e . How long the t L t L crisis shock lasts depends on the realizations of the crisis shock. In each economy, we stop our simulation after 8 quarters from the lift-off quarter so as not to include observations in which the economy are very close to or at the steady state.19 Table D.1 reports the average and the maximum of the residuals collected from 500,000 economies of varying lengths for three values of α = α = 0, 0.5, 1. 1 2 Table D.1: Solution Accuracy (Discounting in Euler Equation) α ξ =[log (R )] E[ξ] max[ξ] 1 10 k,t k =1: Euler equation error −11.0 −2.9 α =α =0 1 2 k =2: Sticky-price equation error −13.1 −5.7 k =1: Euler equation error −13.5 −3.3 α =α =0.5 1 2 k =2: Sticky-price equation error −15.6 −5.6 k =1: Euler equation error −19.8 −16.9 α =α =1 1 2 k =2: Sticky-price equation error −29.2 −18.4 E Sensitivity analysis (I) In this section, we examine the implications of less powerful forward guidance on optimal policy when the weight on the output stabilization term in the central bank’s objective function—given by λ—is higher than in the baseline. We consider two alternative values for λ. The first value we consider—the value we focus on—is 1/64, which is substantially higher than the baseline and gives equal weights to the volatility of annualized inflation and the volatility of employment gap.20 When the optimal 19The choice of 8 quarters is somewhat arbitrarily, but the economy typically have returned to the steady state 8 quarters after liftoff, regardless of how long the crisis shock has lasted. 201/64 = 1/16∗1/4. The first term, 1/16(= 1/4∗1/4), is the term to take into account the fact that π t in the model is a quarterly rate of inflation. The second-term, 1/4(= 1/2∗1/2), translates the output gap volatility into the employment volatility, using the Okun’s law with coefficient of 2. 31

policy analysis is conducted for policy purposes in central banks, it is common to put equal weightstothepricestabilityandemploymentstabilityterms. See,forexample,Yellen(2012), Reifschneider,Wascher,andWilcox(2015),andCarney(2017). Thesecondvalueweconsider is 0.0045, which is five times as large as the baseline value and is lower than 1/64. Figure E.1: IRFs in the Model with Discounted Euler Equation (λ = 1/64) Policy Rate Output Gap Inflation 4 5 1 2.5 3 0.5 0 2 0 -2.5 Simple Rule α = 0 1 1 -0.5 α = 0.5 -5 1 α = 1 1 0 -7.5 -1 0 10 20 30 40 0 10 20 30 40 0 10 20 30 40 Time Time Time Figure E.1 shows the IRFs under the optimal commitment policy in the model with discounting in the Euler equation only under λ = 1/64. The IRFs with the standard Euler equation—shown by the dashed black lines—are qualitatively similar to those under the baseline λ shown in Figure 4. Quantitatively, the deviation of inflation from the target is larger, and the deviation of output from the target level is smaller, under λ = 1/64 than under the baseline calibration. The effect of discounting on the optimal interest-rate path is qualitatively different under λ = 1/64 than under the baseline value of λ. In Figure E.3, the liftoff quarter is the same under α = 0.5 as that under α = 0. Under α = 1, the liftoff quarter is earlier, albeit only by two quarters. Interestingly, even though the lift-off quarter is earlier under α = 1 than—or, the same under α = 0.5 as—under α = 0, the speed of convergence to the steady state interest rate is slower the higher the discounting parameter is. The left panel of Figure E.3 shows the non-monotonic effect of the discounting on the expectedELBdurationunderλ = 1/64; althoughtheexpectedELBdurationinitiallyincreases as α increases from zero, an increase in α leads to a shorter expected ELB duration after some threshold value of α around 0.2. Note that, for any given α, a higher λ is associated with a shorter expected ELB duration. Moving on to the effect of discounting in the Phillips curve, the effect of discounting on the optimal policy-rate path under λ = 1/64 is similar to that under the baseline λ shown in Figure E.2. As shown in the middle panel of Figure E.3, a higher discounting is associated with a longer expected ELB duration, though the effect of increasing the discounting parameter is negligible when alpha is sufficiently large. For any given α, a higher λ is associated with a shorter expected ELB duration. Finally, the right panel of Figure E.3 shows the effect on the expected ELB duration of increasing the discounting parameters in the Euler equation and the Phillips curve at the same time. Qualitatively, the effect is similar to that under the baseline λ; although the expected ELB duration increases initially as α increases from zero, an increase in α leads to a shorter expected ELB duration after some threshold value of α. As in the model with discounting only in the Euler equation or the Philip curve, for any given α, a higher λ is 32

Figure E.2: IRFs in the Model with Discounted Phillips Curve (λ = 1/64) Policy Rate Output Gap Inflation 4 5 1 2.5 3 0.5 0 2 0 -2.5 Simple Rule α = 0 2 1 -0.5 α = 0.5 -5 2 α = 1 2 0 -7.5 -1 0 10 20 30 40 0 10 20 30 40 0 10 20 30 40 Time Time Time Figure E.3: Expected ELB Durations with various λs Discounting in Euler Eq. Discounting in Euler Eq. Discounting in Phillips Curve & Phillips Curve 20 20 20 λ = Baseline λ = 5*Baseline λ = 1/64 15 15 15 10 10 10 5 5 5 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 α α α =α 1 2 1 2 associated with a shorter expected ELB duration. F Sensitivity analysis (II) Throughout the paper, we have examined the implications of less powerful forward guidance in a setup where, as we vary α, the size of the shocks are modified to keep the severity of the recession unchanged under the simple rule. In this section, we demonstrate the results from the alternative experiment in which the size of the shocks are kept constant as we vary α. With the size of shocks unchanged, a decline in α makes the declines in output and inflation in the crisis state smaller, as shown in Figure F.1. For any given α, faced with a less severe recession, the central bank finds it optimal to keep the policy rate at the ZLB for a shorter duration in this alternative experiment than in the baseline experiment, as captured by the fact that black line is below the red line in the left panel of Figure F.5. Under the alternative setup—shown by the black line—an increase in α leads to a reduction in the expected ELB duration, whereas an increase in α leads to an increase in the expected ELB duration under the baseline setup—shown by the red line. The effect of increasing α on the IRFs are shown in Figure F.2. 33

Figure F.1: IRFs in the Model with Discounted Euler Equation under the Simple Rule: Keeping the Magnitudes of the Shocks Constant Policy Rate Output Gap Inflation 4 3 0.5 3 0 0 2 -3 -0.5 1 -6 -1 α = 0 α = 0.5 α = 1 0 -9 -1.5 0 10 20 30 40 0 10 20 30 40 0 10 20 30 40 Time Time Time Figure F.2: IRFs in the Model with Discounted Euler Equation under the Optimal Commitment Policy: Keeping the Magnitudes of the Shocks Constant Policy Rate Output Gap Inflation 4 4 0.3 α = 0 α = 0.5 3 2 0.2 α = 1 2 0 0.1 1 -2 0 0 -4 -0.1 0 10 20 30 40 0 10 20 30 40 0 10 20 30 40 Time Time Time MovingontothemodelwithdiscountingonlyinthePhillipscurve,ahigherαisassociated withalesssevererecessionunderthesimplerule, asinthemodelwithdiscountingonlyinthe Eulerequation,andasshowninFigureF.3. Foranygivenα,facedwithalesssevererecession, the central bank finds it optimal to keep the policy rate at the ZLB for a shorter duration in this alternative experiment than in the baseline experiment, as captured by the fact that black line is below the red line in the middle panel of Figure F.5. Under the alternative experiment—shown by the black line—an increase in α does not alter the expected ELB duration much, whereas an increase in α leads to an increase in the expected ELB duration under the baseline experiment—shown by the red line. The effect of increasing α on the IRFs are shown in Figure F.4. Finally, the right panel of Figure F.5 shows the effect on the expected ELB duration of increasing the discounting parameter in both the Euler equation and the Phillips curve. Similarly to the model with the discounting only in the Euler equation, the expected ELB duration declines as α increases under the alternative experiment—shown by the black line. 34

Figure F.3: IRFs in the Model with Discounted Phillips Curve under the Simple Rule: Keeping the Magnitudes of the Shocks Constant Policy Rate Output Gap Inflation 4 3 0.5 3 0 0 2 -3 -0.5 1 -6 -1 α = 0 α = 0.5 α = 1 0 -9 -1.5 0 10 20 30 40 0 10 20 30 40 0 10 20 30 40 Time Time Time Figure F.4: IRFs in the Model with Discounted Phillips Curve under the Optimal Commitment Policy: Keeping the Magnitudes of the Shocks Constant Policy Rate Output Gap Inflation 4 5 0.3 α = 0 α = 0.5 3 0.2 α = 1 2 0 0.1 1 0 0 -5 -0.1 0 10 20 30 40 0 10 20 30 40 0 10 20 30 40 Time Time Time G Analysis of optimal policy in a micro-founded model of less powerful forward guidance In the main text, we analyzed the implication of less powerful forward guidance in a New Keynesian models with discounted Euler equation and Phillips curve without taking a stance on the structural model that generates discounting in these two equations. In this section, we characterize optimal commitment policy in a structural model of less powerful forward guidance. The structural model we examine is a New Keynesian model with overlapping generations developed by Piergallini (2006), Castelnuovo and Nistic´o (2010), and Nistic´o (2012). Recently, Del Negro, Giannoni, and Patterson (2015) show that the effect of anticipated monetary policy shock is smaller in this model than in the standard New Keynesian model. 35

Figure F.5: Expected ELB Duration: Keeping the Magnitudes of the Shocks Constant Discounting in Euler Eq. Discounting in Euler Eq. Discounting in Phillips Curve & Phillips Curve 20 20 20 Baseline Excercise Alternative Excercise 15 15 15 10 10 10 5 5 5 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 α α α =α 1 2 1 2 G.1 Model G.1.1 Household Every period j, a new cohort is born with mass p, and each cohort has a constant probability of dying—denoted by p—which does not depend on j. At time t, agents born in period j seek to maximize their discounted sum of future utility flows subject to budget constraints. ∞ (cid:34) s (cid:35) (cid:88) (cid:89) max (β(1−p))t+s−1 δ [lnC +vln(1−L )] t+u−1 j,t+s j,t+s {Cj,t+s,Lj,t+s}∞ s=0s=0 u=0 subject to (cid:90) 1 P C +B + Q∗ (i)Z (i)di t+s j,t+s j,t+s t+s j,t+s 0 1 (cid:90) 1 =W∗ L +P T + [B R + (Q∗ (i)+D∗ (i))Z (i)di] t+s j,t+s t+s j,t+s 1−p j,t+s t+s−1 t+s t+s j,t+s 0 C and L are consumption and labor supply of cohort j at time t. B and Z (i) are j,t j,t j,t j,t respectively a nominal bond holding and the equity share issued by monopolistic firms, i, owned by cohort j at time t. Q∗(i) is the nominal price of equity and D∗(i) is the dividend. t t W∗ is the nominal wage and T is the lump-sum transfer. ∗ denotes the nominal price. t j,t Each agent faces the constant death rate p for each period. As in Blanchard (1985), each agent has entered an annuity contract in which the fraction p of the same cohorts dying in each period leaves its wealth to those who remain alive. The wealth [(B R + (cid:82)1 (Q∗(i)+ j,t t−1 0 t D∗(i))Z (i)di] is divided by 1−p fraction of cohort members alive. By adding the annuity t j,t and his own wealth, each agent has ( p +1)[B R + (cid:82)1 (Q∗(i)+D∗(i))Z (i)di] for each 1−p j,t t−1 0 t t j,t period. The discount rate at time t is given by βδ where δ is the discount factor shock altering t t the weight of future utility at time t+1 relative to the period utility at time t. The shock 36

process is described as follows: (cid:40) δ if 1 ≤ t ≤ τ H δ = t 1 otherwise. Note that, unlike in the infinite-horizon model in the main text, there is no uncertainty. In particular, the duration of the shock, τ, is given. For any variables X, aggregate variable X is defined by X ≡ (cid:80)t p(1−p)t−jX . t t j=−∞ j,t G.1.2 Firms There is a final good producer and a continuum of intermediate goods producers indexed by i ∈ [0,1]. The final good producer purchases the intermediate goods Y at P , produces i,t i,t the final good by CES technology, and sells it to households. The problem is summarized as: (cid:90) 1 max P Y − P Y di t t i,t i,t {Yi,t i∈[0,1]} 0 subject to the CES production function, (cid:20)(cid:90) 1 θ−1 (cid:21) θ− θ 1 Y = Y θ di t i,t 0 Intermediate goods producers use labor as an input of producing intermediate goods accordingtoalinearproductionfunction(Y = L )andsellthemtothefinalgoodproducer. i,t i,t Eachfirmfacesaquadraticadjustmentcostwhenhechangeshispricetomaximizehisprofit. We assume that each firm receives a production subsidy τ∗ so that economy is fully efficient in the steady state. max (cid:88) ∞ F (cid:34) (1+τ∗)P Y −W∗L −P ϕ (cid:18) P i,t −1 (cid:19)2 Y (cid:35) {Pi,t} t=1 1,t i,t i,t t i,t t 2 P i,t−1 t subject to (cid:18) P (cid:19)−θ i,t Y = Y i,t t P t where F is the aggregate stochastic discount factor. That is, 1,t t (cid:88) F ≡ p(1−p)t−jF 1,t j,1,t j=−∞ with P C F ≡ βt−1 1 j,1 j,1,t P C t j,t The time zero price is assumed to be the same across firms (i.e. P = P > 0). i,0 0 37

G.1.3 Government policies The supply of the government bond is assumed to be zero. With this assumption, the government budget constraint is given by t (cid:88) τ∗P Y +P p(1−p)t−jT = 0 t t t j,t j=−∞ We will consider two cases regarding how the central bank sets the short-term nominal interestrate. Inthefirstcase,thecentralbanksetstheinterestrateaccordingtothefollowing interest-rate feedback rule; R R = max{ ss ΠΨ, 1} t δ t t where Π := Pt and, t Pt−1 1 R := ss β(1−ηQ) Y R is the steady state nominal interest rate and where η = p 1−β(1−p) . The intercept of ss 1−p (1+v)β the policy rule is time-varying and depends on δ in such a way that, in the absence of the t ZLB constraint, the effect of variation in δ would be fully neutralized by a corresponding variation in the policy rate. In the second case, the central bank choose the sequence of interest rates at time one in order to maximize the household’s welfare subject to the private-sector equilibrium conditions. G.1.4 Market clearing conditions The market clearing conditions for the final good, labor, government bond and equity are given by ϕ Y = C + (Π −1)2Y t t t t 2 (cid:90) 1 L = L (i) di t t 0 B = 0 t (cid:90) 1 Z (i) di = 1 t 0 38

G.1.5 Private-sector equilibrium conditions Private sector equilibrium conditions are given by: p (1−p)ω 1 t+1 C = Q + Π C t t t+1 t+1 (1+v)(ω −1) ω −1 R t t t 1 Q = Π (Q +D ) t t+1 t+1 t+1 R t vC = W (1−L ) t t t θ φ D = Y −W Y − (Π −1)2Y t t t t t t θ−1 2 φ C = [1− (Π −1)2]Y t t t 2 1 Y θ Π (Π −1) = Π2 (Π −1) t+1 − (1−W ) t t R t+1 t+1 Y φ t t t Y = L t t where ∞ s (cid:88) (cid:89) ω = 1+ (1−p)sβs( δ )) t t+u−1 s=1 u=1 As death rate p rises, η also increases and households respond more to the asset price at time t. Ifp = 0holds, η = 0andthiseconomydegeneratesintoastandardNKmodel. Inthiscase, the wealth effect of an asset price Q does not directly affect the movement of consumption. t For notational simplicity, we will also write the consumption Euler equation as follows. Π C t+1 t+1 C = κ Q +κ t 1,t t 2,t R t G.2 Welfare Welfare function at t is defined as the weighted average of time-zero value functions of 0 all cohorts, including those who are already born and those who are yet to be born. σ is the subjective discount factor of the social planner.21 For expositional simplicity, we abstract 21The second term of the first line shows that the policy maker discounts time t instantaneous utilities of already-born agents back to birth dates, rather than the current period. This assumption looks odd, but, without the assumption, the problem of the time-inconsistency in preference arises, as noted by Calvo and Obstfeld (1988). 39

from the time-variation in the household’s discount rate.   (cid:88) ∞ (cid:88) ∞ (cid:88) t0 (cid:32) (cid:88) ∞ (cid:33) W t0 = σj p [β(1−p)]t−ju(j,t)+ σj p [β(1−p)]t−ju(j,t) j=t0+1 t=j j=−∞ t=t0 (cid:124) (cid:123)(cid:122) (cid:125) (cid:124) (cid:123)(cid:122) (cid:125) unborn already born   (cid:88) ∞ (cid:88) ∞ (cid:18) β(1−p) (cid:19)t−j (cid:88) t0 (cid:34) (cid:88) ∞ (cid:18) β(1−p) (cid:19)t−j (cid:35) = p  u(j,t)σt +p u(j,t)σt σ σ j=t0+1 t=j j=−∞ t=t0   (cid:88) ∞ (cid:88) t (cid:18) β(1−p) (cid:19)t−j = p σt  u(j,t) σ t=t0 j=−∞ ∞ (cid:88) = βtu(C ,L ) t t t=t0 Thelastequalityfollowsfromthefollowingtwoassumptions. Thefirstassumptionisthat the social planner can implement a distributional policy so that the aggregate consumption and labor supply becomes equal for each cohort at time t. That is, C = C , L = L , and j,t t j,t t u = u . The second assumption is that the subjective discount factor of social planner and j,t t that of each cohort are the same, that is σ = β.22 G.3 Ramsey Problem At time t = 1, the government’s problem is to find an allocation that maximizes the household welfare and price and policy variables that decentralize the allocation as an equilibrium. ∞ t (cid:88) (cid:89) max βt−1( δ )[lnC +vln(1−Y )] s−1 t t t=1 s=1 subject to Π C t+1 t+1 C = κ Q +κ t 1,t t 2,t R t 1 Q = Π (Q +D ) t t+1 t+1 t+1 R t vC = W (1−Y ) t t t θ φ D = Y −W Y − (Π −1)2Y t t t t t t θ−1 2 φ C = [1− (Π −1)2]Y t t t 2 1 Y θ Π (Π −1) = Π2 (Π −1) t+1 − (1−W ) t t R t+1 t+1 Y φ t t t R ≥ 1 t 22Notice that (cid:80)∞ (cid:16) β(1−p) (cid:17)z = 1, which is also used in the last equality. z=0 σ p 40

The associated Lagrangian problem is given by: ∞ t (cid:88) (cid:89) L = max βt−1( δ ){lnC +vln(1−Y ) t s−1 t t t=1 s=1 Π C t+1 t+1 +λ (κ Q +κ −C ) 1,t 1,t t 2,t t R t 1 +λ ( Π (Q +D )−Q ) 2,t t+1 t+1 t+1 t R t +λ (W (1−Y )−vC ) 3,t t t t θ φ +λ ( Y −W Y − (Π −1)2Y −D ) 4,t t t t t t t θ−1 2 φ +λ ([1− (Π −1)2]Y −C ) 5,t t t t 2 1 Y θ +λ ( Π2 (Π −1) t+1 − (1−W )−Π (Π −1)) 6,t R t+1 t+1 Y φ t t t t t +λ (R −1)} 7,t t FONCs are given by: κ 1 2,t−1 C : Π λ +δ β −δ βλ −δ βvλ −δ βλ = 0 t t 1,t−1 t t 1,t t 3,t t 5,t R C t−1 t Π t Q : λ +δ βκ λ −δ βλ = 0 t 2,t−1 t 1,t 1,t t 2,t R t−1 Π t D : λ −δ βλ = 0 t 2,t−1 t 4,t R t−1 v 1 1 θ φ Y : −δ β +λ Π2(Π −1) −δ βλ W +δ βλ ( −W − (Π −1)2) t t 1−Y 6,t−1 R t t Y t 3,t t t 4,t θ−1 t 2 t t t−1 t−1 φ 1 Y +δ βλ [1− (Π −1)2]−δ βλ Π2 (Π −1) t+1 = 0 t 5,t 2 t t 6,t R t+1 t+1 Y2 t t θ W : λ (1−Y )−λ Y +λ = 0 t 3,t t 4,t t 6,t φ C 1 1 Y Π : λ κ t +λ (Q +D )+λ (3Π2−2Π ) t t 1,t−1 2,t−1 R 2,t−1 R t t 6,t−1 R t t Y t−1 t−1 t−1 t−1 −δ βλ φ(Π −1)Y −δ βλ φ(Π −1)Y −δ βλ (2Π −1) = 0 t 4,t t t t 5,t t t t 6,t t Π C Π (Q +D ) Π2 (Π −1)Y R : −λ κ t+1 t+1 −λ t+1 t+1 t+1 −λ t+1 t+1 t+1 +λ = 0, t 1,t 2,t R2 2,t R2 6,t R2 Y 7,t t t t t (λ ≥ 0, R −1 ≥ 0, λ (R −1) = 0) 7,t t 7,t t G.4 Calibration and solution method We solve the model in its original nonlinear form using the modified Newton method developed by Juillard, Laxton, McAdam, and Pioro (1998). This method modifies the standard Newton algorithm to take advantage of the recursive structure common in infinite-horizon macroeconomicmodels. Nakata(2017)usesthismethodtosolveoptimalfiscalandmonetary policyinNewKeynesianmodelwiththeZLBconstraint. Seetheformerpaperforthegeneral description of the method and the latter for how to apply the method to fully nonlinear New 41

Keynesian models with the ZLB constraint. Parameter values are listed in Table G.1. The weight on leisure in the utility function is close to those values used in Castelnuovo and Nistic´o (2010), Nistic´o (2012), and Del Negro, Giannoni, and Patterson (2015). Price adjustment costs implies a slope of the Phillips curve thatisconsistentwithCalvoparameterof0.9. Thisadjustmentcostparameterisabithigher than what is commonly used in the literature and is chosen so that inflation does not decline too much in recessions. We consider 21 different values of p between 0 and 0.1. For each, p, the size of the discount rate shock is chosen so that the initial decline in consumption is 10 percent. Table G.1: Parameter Values Parameter Description Values β Discount factor 0.9925 v Utility weight on L 1.25 j,t p Turnover rate [0,0.1] θ Relative price elasticity of demand 11 ϕ Price adjustment cost 1200 Ψ Coefficient of the Taylor rule 1.5 R The effective lower bound 1 ELB Chosen so that δ Demand shock C declines 10% from the SS H 1 under the time varying Taylor rule G.5 Results Figure G.1 shows the dynamics of the economy for three different values of p under the simple rule. According to the figure, allocations are not noticeably different across different values of p. For all values of p, consumption declines 10 percent and inflation declines by 2 percentage points in period one. They gradually increase and, once the shock disappears at period 21, the economy is at the steady state. One unique feature of the model is that the steady state nominal interest rates depends on p. In particular, a higher p is associated with a higher steady state nominal interest rate.23 Note that, in this model, the magnitudes of the declines in consumption and inflation at the ZLB depend on the time, whereas they do not in the infinite-horizon model considered in Section 3 in the main text. This difference is driven by the difference in the shock process. In the model of this section, the shock is not stochastic; the duration of the shock is set to 20 periods. In the infinite-horizon model considered in Section 3 in the main text, the shock is stochastic and is governed by a Markov process. Figure G.2 shows the dynamics of the economy for three different values of p under the optimal commitment policy. For any values of p, the policy rate is kept at the ZLB even after the shock disappears. Consistent with what we saw in models with discounted Euler equation and Phillips curve, this low-for-long policy creates overshooting of consumption and inflation, which in turn mitigates the decline in consumption and inflation at the beginning of recessions through expectations. The additional duration of holding the policy rate at the ZLB is shorter when p is higher. With p = 0, the additional ZLB duration is 4 periods, whereas it is 3 periods under p = 0.02 23If we modify β as we modify p, the power of forward guidance will not get weakened. 42

Figure G.1: IRFs under the simple rule Policy Rate Consumption Inflation 7 1 0.5 6 -1 -0.5 5 -3 -1.5 4 -5 3 -2.5 -7 2 -3.5 p=0 1 -9 p=0.02 p=0.04 0 -11 -4.5 10 20 30 40 10 20 30 40 10 20 30 40 Time Time Time Figure G.2: IRFs under optimal policy Policy Rate Consumption Inflation 7 2 0.4 6 1 0.2 5 0 4 0 3 -1 2 -0.2 -2 p=0 1 p=0.02 p=0.04 0 -3 -0.4 10 20 30 40 10 20 30 40 10 20 30 40 Time Time Time and p = 0.04. The allocations are similar, but there is slightly more overshooting in a model with a higher p. Inthismodel,itwouldbemisleadingtousetheadditionalZLBdurationasthemeasureof theextenttowhichthecentralbankusesforwardguidancetostimulateeconomicactivitiesat the ZLB, because the steady state nominal interest rates are different across different values of p. For example, when the central bank keeps the policy rate at the ZLB for one additional period in the model with p = 0, it is setting the interest rate 3 percentage points below the rate required to stabilize the economy. However, when the central bank keeps the policy rate at the ZLB for one additional period in the model with p = 0.04, it is setting the interest rate 5.5 percentage points below the rate required to stabilize the economy. Thus, by keeping the policy rate at the ZLB for one additional period, the central bank is providing more future policy accommodation under p = 0.04 than under p = 0. To account for the difference in steady state nominal interest rates across different value of p, we use the following statistics—which we call “EA (Extra Accommodation)”—which is the cumulative difference between the path of policy rates under the simple rule (RSR) and t that under the optimal commitment policy (ROCP))—as the measure of how actively the t central bank uses forward guidance: 43

∞ (cid:88) EA := (RSR−ROCP) t t t=0 FigureG.3showshowthismeasureoffuturepolicyaccommodationvarieswithp. According to the figure, the central bank promises more future accommodation when p is higher— that is, when forward guidance is less powerful. This result is consistent with the analysis of the infinite-horizon model in the main text; the central bank finds it optimal to compensate the reduced power of forward guidance by more extensively using it. Figure G.3: Extra accommodation 0.03 0.028 0.026 0.024 0.022 0.02 0.018 0.016 0 0.02 0.04 0.06 0.08 0.1 p 44

Cite this document
APA
Taisuke Nakata, Ryota Ogaki, Sebastian Schmidt, & and Paul Yoo (2018). Attenuating the Forward Guidance Puzzle: Implications for Optimal Monetary Policy (FEDS 2018-049). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2018-049
BibTeX
@techreport{wtfs_feds_2018_049,
  author = {Taisuke Nakata and Ryota Ogaki and Sebastian Schmidt and and Paul Yoo},
  title = {Attenuating the Forward Guidance Puzzle: Implications for Optimal Monetary Policy},
  type = {Finance and Economics Discussion Series},
  number = {2018-049},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2018},
  url = {https://whenthefedspeaks.com/doc/feds_2018-049},
  abstract = {We examine the implications of less powerful forward guidance for optimal policy using a sticky-price model with an effective lower bound (ELB) on nominal interest rates as well as a discounted Euler equation and Phillips curve. When the private-sector agents discount future economic conditions more in making their decisions today, an announced cut in future interest rates becomes less effective in stimulating current economic activity. While the implication of such discounting for optimal policy depends on its degree, we find that, under a wide range of plausible degrees of discounting, it is optimal for the central bank to compensate for the reduced effect of a future rate cut by keeping the policy rate at the ELB for longer. Accessible materials (.zip)},
}