feds · May 15, 2019

Credible Forward Guidance

Abstract

We analyze credible forward guidance policies in a sticky-price model with an effective lower bound (ELB) constraint on nominal interest rates by solving a series of optimal sustainable policy problems indexed by the duration of reputational loss. Lower-for-longer policies---while effective in stimulating the economy at the ELB---are potentially time-inconsistent, as the associated overheating of the economy in the aftermath of a crisis is undesirable ex post. However, if reneging on a lower-for-longer promise leads to a loss of reputation and prevents the central bank from effectively using lower-for-longer policies in future crises, these policies can be time-consistent. We find that, even without an explicit commitment technology, the central bank can still credibly keep the policy rate at the ELB for an extended period---though not as extended under the optimal commitment policy---and meaningfully mitigate the adverse effects of the ELB constraint on economy activity. Accessible version (.zip)

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. Credible Forward Guidance Taisuke Nakata and Takeki Sunakawa 2019-037 Please cite this paper as: Nakata, Taisuke, and Takeki Sunakawa (2019). “Credible Forward Guidance,” Finance and Economics Discussion Series 2019-037. Washington: Board of Governors of the Federal Reserve System, https://doi.org/10.17016/FEDS.2019.037. NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

Credible Forward Guidance∗ Taisuke Nakata† Takeki Sunakawa‡ Federal Reserve Board Kobe University First Draft: November 2018 This Draft: April 2019 Abstract We analyze credible forward guidance policies in a sticky-price model with an effective lower bound (ELB) constraint on nominal interest rates by solving a series of optimal sustainable policy problems indexed by the duration of reputational loss. Lower-for-longer policies— while effective in stimulating the economy at the ELB—are potentially time-inconsistent, as the associated overheating of the economy in the aftermath of a crisis is undesirable ex post. However, if reneging on a lower-for-longer promise leads to a loss of reputation and prevents the central bank from effectively using lower-for-longer policies in future crises, these policies can be time-consistent. We find that, even without an explicit commitment technology, the central bank can still credibly keep the policy rate at the ELB for an extended period— though not as extended under the optimal commitment policy—and meaningfully mitigate the adverse effects of the ELB constraint on economy activity. JEL: E32, E52, E61, E62, E63 Keywords: Credibility, Effective Lower Bound, Forward Guidance, Sustainable Plan, Time- Consistency. ∗WethankRobertoBilli,TimothyHills,KiminoriMatsuyama,SebastianSchmidt,andseminarparticipants at the 5th End-of-Year Conference held at the Canon Institute of Global Studies, Asian Development Bank Institute,BankofJapan,BoardofGovernorsoftheFederalReserveSystem,KobeUniversity,andUniversity of Tokyo forcommentsand suggestions. Wealso thankKatherine Arnoldand PhilipCoylefor theireditorial assistance. Sunakawa is supported by JSPS Grant-in-Aid for Scientific Research (KAKENHI) for Young Scientists, Project Number 18K12743. The views expressed in this paper, as well as all errors and omissions, should be regarded as those solely of the author and are not necessarily those of the Federal Reserve Board of Governors or the Federal Reserve System. †Division of Research and Statistics, Federal Reserve Board, 20th Street and Constitution Avenue N.W. Washington, D.C. 20551; Email: taisuke.nakata@frb.gov ‡Kobe University; Email: takeki.sunakawa@gmail.com. 1

1 Introduction I believe the FOMC should seriously consider pursuing a lower-for-longer or makeup strategy for setting short rates when the zero lower bound binds and should articulate its intention to do so before the next zero lower bound episode. Janet L. Yellen, September 20181 Developing effective strategies to deal with the effective lower bound (ELB) constraint on nominal interest rates is an important task for economists and central bankers. In forwardlooking models with an ELB, the commitment to keeping the policy rate at the ELB for an extended period—and temporarily overshooting inflation and output targets—is known to be effective in stimulating economic activity during a deep recession, as the anticipation of an overheated economy leads forward-looking households and firms to increase consumption and set higher prices (Reifschneider and Williams (2000); Eggertsson and Woodford (2003); Jung, Teranishi, and Watanabe (2005); and Adam and Billi (2006)). While the effectiveness of such overheating commitment or lower-for-longer policy in theory is widely known, central banks that recently faced, or are currently facing, the ELB constraint have not adopted this type of policy, with an exception of the Bank of Japan.2 One key argument against overheating commitment policy is its potential time-inconsistency. Ex ante, it is desirable to promise to overheat the economy in the future, as the expectations of future overheating stimulate inflation and output when the economy faces headwinds and the ELB is a binding constraint. However, once the headwinds dissipate, the central bank will have an incentive to renege on the promise of overheating the economy by raising the policy rate, because the overheating is ex post undesirable. A number of policymakers have stated that this time-inconsistency problem is one reason for why commitment policy may be not as effective in reality as in theory.3 Nakata (2018) has recently shown that, once the central bank’s reputational concern is taken into account, optimal overheating commitment policy can be made time-consistent. In his analysis, if the central bank were to renege on its promise of overheating the economy in the aftermath of a crisis, it would suffer reputational losses, and private-sector agents would not believe similar promises in future crises. If private-sector agents do not believe the central bank’s promise to overheat the economy, future ELB episodes will be associated with larger declines in inflation and output than otherwise. Thus, concern for maintaining its reputation gives the central bank an incentive to fulfill the promise of overheating the 1Yellen(2018),“CommentsonMonetaryPolicyattheEffectiveLowerBound”https://www.brookings. edu/wp-content/uploads/2018/09/ES_20180910_Yellen-ELB.pdf. 2This type of policy is also referred to as “make-up” policy. Price-level targeting, temporary pricelevel targeting, and average inflation targeting are alternative frameworks to implement lower-for-longer and overheating commitment policy. 3See Appendix G of this paper or Nakata (2015) for quotes from various policymakers discussing the time-inconsistency of commitment policy. 2

economy. According to Nakata (2018), if the policy rate is expected to fall into the ELB in the future with sufficient frequency and the loss of reputation lasts for a sufficiently long duration, the incentive for the central bank to maintain reputation outweighs the short-run incentive for it to eliminate the overheating of the economy, making optimal commitment policy credible as a result. However, Nakata (2018) is silent about what the central bank can credibly achieve when the optimal commitment policy is not credible. In this paper, we study credible lower-for-longer policies in a sticky-price model with the ELB focusing on understanding what the central bank can credibly achieve when optimal commitment policy is not time-consistent. Specifically, we formulate and solve a series of optimal sustainable policy problems in which the central bank chooses state-contingent allocations to maximize welfare subject to not only private-sector equilibrium conditions and an incentive compatibility constraint. The incentive compatibility constraint—which we will refer to as the sustainability constraint—requires that the continuation value associated with the chosen state-contingent allocation has to be at least as large as the continuation value associated with deviating from that allocation—and falling into a discretionary regime for N periods—at any time and after any history of shocks. Each of the optimal sustainability problemsisindexedbythedurationforwhichtheeconomyspendsinthediscretionaryregime if the central bank were to deviate from the chosen allocations and lose its reputation. As shown by Nakata (2018), optimal commitment policy is credible when certain conditions are met. Undersuchconditions, thesustainabilityconstraintdoesnotbind, andoptimalsustainable policy coincides with optimal commitment policy. Our main interest is to characterize optimal sustainable policies when the sustainability constraint occasionally binds. Our main result is that, even when optimal commitment policy is not credible, the central bank can still credibly keep the policy rate at the ELB for an extended period in the aftermath of a crisis—though not as extended as under optimal commitment policy. As in optimal commitment policy, such lower-for-longer policy generates a temporary post-crisis overheating of the economy and mitigates the declines in output and inflation in a crisis through expectations. Under reasonable assumptions regarding how long the central bank suffers from a loss of reputation after reneging on the promise of lower-for-longer, the welfare cost of the ELB constraint is substantially lower under an optimal sustainable policy than under optimal discretionary policy. Onekeyfeatureofoptimalsustainablepoliciesisthattheyarelesshistorydependentthan optimalcommitmentpolicy. AsdiscussedindetailbyEggertssonandWoodford(2003),akey feature of optimal commitment policy is history dependence. In particular, under optimal commitment policy, the additional period at which to keep the policy rate at the ELB in the aftermath of a crisis—as well as the magnitude of output and inflation overshoot— increases as the realized crisis shock duration increases. When the reputational force is strong, optimal sustainable policies exhibit qualitatively similar history dependence, though the degree of history dependence is weaker than under optimal commitment policy. When 3

the reputational force is sufficiently weak, optimal sustainable policies do not feature any history dependence. That is, the additional ELB duration as well as the magnitude of output and inflation overshoot do not depend on the realized crisis shock duration. Our optimal sustainable policies are of interest to central banks for two reasons. First, by construction, the optimal sustainable policies are time-consistent; thus, it is immune to the criticism that the promised overshoot of inflation and output associated with any lowerfor-longer strategies may not be credible. Second, when the duration of reputational loss is sufficiently short, optimal sustainable policies are not history dependent or not as history dependent as optimal commitment policy is. Thus, it overcomes the criticism that, because the policy rate path associated with a lower-for-longer strategy is complex, it is difficult for central banks to clearly explain these strategies to the public. Our analysis contributes to the policy debate on how to best conduct forward guidance policies in future ELB episodes. Although most central banks have refrained from using lower-for-longer policies in the past,4 there is a growing interest in adopting this type of policy in the future (Bernanke (2017), Yellen (2018), and Williams (2018)). Our analysis suggests that it is possible for central banks to credibly adopt lower-for-longer policies, but there may be some limit on how long they can promise to keep the policy rate at the ELB. Of course, there are other factors absent in our model that may limit the effectiveness and implementability of lower-for-longer policies, such as the possibilities that lower-for-longer policies induce financial instability or the unanchoring of long-run inflation expectations (Kohn (2012) and Yellen (2018)). It would be useful to carefully examine the implications of these other factors for lower-for-longer policies in future research. Our paper builds on the literature on optimal monetary policy in the New Keynesian model with the ELB. This literature has demonstrated the effectiveness of lower-for-longer policies in stimulating the economy at the ELB, assuming that the central bank is equipped with an explicit commitment technology (Eggertsson and Woodford (2003); Jung, Teranishi, and Watanabe (2005); Adam and Billi (2006); and Nakov (2008)). Our paper contributes to this body of work by characterizing optimal sustainable policies in a model with the ELB and showing that the central bank can credibly engage in lower-for-longer policies even in the absence of an explicit commitment technology. WithintheliteratureonoptimalpolicyandtheELB,someauthorshaveexploredwaysto implement lower-for-longer policies at the ELB in a time-consistent way. Eggertsson (2006) and Burgert and Schmidt (2014) show that in models with non-Ricardian fiscal policy, a discretionary government can provide incentives to a future government to keep the policy rate at the ELB for longer by adopting expansionary fiscal policy and raising the nominal level of government debt. Jeanne and Svensson (2007), Berriel and Mendes (2015), and Bhattarai, Eggertsson, and Gafarov (2013) show that central banks’ balance sheet policies can act as a commitment device that allows the central bank to credibly implement lower- 4See Appendix G for quotes from various policymakers expressing concerns for time-inconsistency. 4

for-longer policies. Billi (2017) and Nakata and Schmidt (2019) analyze policy delegation in models with the ELB, showing that lower-for-longer policies can be implemented in a timeconsistent way if the discretionary central bank’s standard dual-mandate objective function is replaced by a nominal-income stabilization objective or augmented with an interest-rate smoothing objective, respectively. Unlike these papers that either introduce a new policy instrument or modify the central bank’s objective function, we use reputation to achieve lower-for-longer policies in a time-consistent way. Our paper is closely related to Nakata (2018) and Walsh (2018). Nakata (2018) has shown that optimal commitment policy in the New Keynesian model with the ELB can be made time-consistent by a particular trigger strategy capturing the reputational concern of the central bank. Our paper is different from Nakata (2018) because we study the best allocations the central bank can credibly achieve when the optimal commitment policy is not credible, whereas Nakata (2018) characterizes the conditions under which the optimal commitment policy is credible. Walsh (2018) examines credibility of simple policy rules with forward guidance—those that keeps the policy rate at the ELB for a fixed number of periods aftercrises—andreachesaconclusionsimilar tothatofNakata(2018).5 Ourpaperis different from Walsh (2018) because we characterize the optimal allocation the central bank can credibly achieve subject to a sustainability constraint, whereas Walsh (2018) studies credibility of simple policy rules that may or may not be optimal.6 It turns out that there is aninterestingrelationshipbetweenouroptimalsustainablepoliciesandtheforwardguidance policy of Walsh (2018), which will be discussed in detail in Section 4.1.7 This paper is also closely related to the work of Dong and Young (2019). Using the recursive method of Abreu, Pearce, and Stacchetti (1990), Chang (1998), and Phelan and Stacchetti (2001), Dong and Young (2019) characterize the entire set of sustainable plans in a fully nonlinear New Keynesian model with the ELB. They find that the commitment outcome is not sustainable in their model and that the central bank does not lower the policy rate to the ELB under the best sustainable plan. Our paper is methodologically different from theirs because we characterize a particular (countable) subset of sustainable plans that have a trigger structure and in which on-equilibrium paths are given by the solutions to a sequence of optimal sustainable policy problems indexed by the duration of reputational loss. Finally, our paper is related to a set of papers that characterize optimal allocations in macroeconomic models with a sustainability constraint. Kehoe and Perri (2002) characterize 5See also Sukeda (2018), which extends the analysis of Walsh (2018) to a model with a discounted Euler equation and a discounted Phillips curve. 6Nakata(2018)andWalsh(2018)inturnbuildonearlierworkofreputationinmacroeconomics,including Barro and Gordon (1983), Rogoff (1987), Chari and Kehoe (1990), Chang (1998), and Phelan and Stacchetti (2001), among many others. Recent contributions include Kurozumi (2008) and Loisel (2008). 7See also Barth´elemy and Mengus (2018) who examine sustainability of optimal commitment policy in a model with the ELB constraint in which the central bank’s objective function—either a benevolent or conservative kind—is unknown to private-sector agents and there is an inflationary bias. In their model, the benevolent central bank can make the optimal commitment policy sustainable by raising inflation prior to a liquidity trap and signaling its type to private-sector agents. 5

the optimal allocation in an international business cycle model in which a deviation from the promised plan would push the economy to autarky. Fujiwara, Kam, and Sunakawa (2016) study the optimal sustainable policy in a two-country model in which the deviation from the promised cooperative plan would push the countries into a non-cooperative regime. Most closely related to our paper is Sunakawa (2015) who characterizes the optimal sustainable policy in a New Keynesian model with cost-push shocks but without the ELB constraint. Our paper applies the same analytical framework and methodology used in these papers to models with the ELB. Therestofthepaperisorganizedasfollows. Section2describesthemodelandthecentral bank’s optimization problems. Section 3 presents the results. Section 4 provides additional discussions and results. Section 5 concludes. 2 Model 2.1 Private sector Our main model is a semi-loglinear New Keynesian model with a static Phillips curve. The private-sector equilibrium conditions of this model are given by: y (st) = E y (st+1)−σ(i (st)−E π (st+1)−r∗)+s (1) t t t+1 t t t+1 t π (st) = κy (st) (2) t t i (st) ≥ i (3) t ELB where y is output, π is inflation, and i is the policy rate. Equations (1) and (2) are t t t the Euler equation and the static Phillips curve, respectively. Inequality (3) imposes the ELB constraint, denoted by i , on the policy rate. σ is the intertemporal elasticity of ELB substitution, r∗ > 0 is the natural rate of interest at the deterministic steady state, and κ is the slope of the static Phillips curve. s is a natural rate shock. st denotes a history of shocks t up to time t. That is, st := {s }t . Because there is uncertainty, allocations are statek k=1 contingent and depend on st. We refer to the state-contingent sequence of consumption, inflation, and the nominal interest rate, {y (st),π (st),i (st)}∞ , as an outcome. Given a t t t t=1 process for s , an outcome is said to be competitive if, for all t ≥ 1 and st ∈ St, (i) y (st) ∈ R, t t π (st) ∈ R, i (st) ∈ R, where R denotes a set of real numbers, and (ii) equations (1)-(3) are t t satisfied. We assume that s follows a two-state Markov process. s = r∗ > 0 in the “high” or t t “normal” state, whereas s = r < 0 in the “low” or “crisis” state. The probability of moving t c from the high/normal state to the low/crisis state is denoted by p and will be referred to H as the crisis frequency, whereas the probability of moving from the low/crisis state to the low/crisis state is denoted by p and will be referred to as the crisis persistence. Following L 6

Nakata (2018), we allow p to be non-zero, which opens up the possibility for a reputational H concern to make lower-for-longer policies credible. The central bank’s value at period t is given by ∞ V (st) := E (cid:88) βju (cid:0) π (st+j),y (st+j) (cid:1) (4) t t t+j t+j j=0 where the per-period objective function is given by the following function. 1(cid:104) (cid:105) u(π,y) := − π2+λy2 (5) 2 This quadratic objective function can be obtained as the second-order approximation to the household’s welfare.8 For any outcome, there is an associated state-contingent sequence of values, {V (st)}∞ , which will be referred to as the value sequence. t t=1 We use this model with a static Phillips curve as our baseline model for a computational reason. As described in Section 2.4, we use a time-iteration method—a commonly used numerical method for nonlinear models—to solve our model. Regardless of the specification of the Phillips curve, this solution method fails to converge if the duration of reputational loss—a key parameter governing how long the central bank is prohibited from engaging in state-contingent policies after it reneges on a previously announced policy rate path—is sufficiently short. As discussed in Appendix E, we can devise an alternative solution method if the duration of reputational loss is sufficiently short in the model with a static Phillips curve, while we cannot do so in the model with a forward-looking Phillips curve.9 Thus, the model can be solved for a wider range of durations of reputational loss in the model with a static Phillips curve than in the model with a forward-looking Phillips curve.10 In Section 4.3, we present some select results from the model with a forward-looking Phillips curve and confirm that key features of the optimal sustainable policy in the model with the static Phillips curve survive in the model with a forward-looking Phillips curve.11 2.2 Central bank We will consider three classes of competitive outcomes that differ in how the central bank sets its interest rate policy: the discretionary outcome, the commitment outcome, and the sustainable outcomes. 8See, for example, Woodford (2003) and Gal´ı (2015). 9Thekeyingredientforthealternativesolutionmethodisthatthereisnoinflation-outputtradeoffinthe normal state. Thus, the method also works for the model with perfectly sticky prices in which the inflation rate is always zero and the only relevant private-sector equilibrium condition is the Euler equation. 10Theinstabilityofthetime-iterationmethodinthepresenceofasustainabilityconstraintwithlowvalues ofN isnotspecifictothemodelwithELB.Thesecondauthorofthispaperexperiencedasimilarinstability issueinFujiwara,Kam,andSunakawa(2019). Weleavethetaskofdevelopingarobustalgorithmformodels with sustainability constraints to future research. 11See also Bilbiie (Forthcoming) who uses a model with a static Phillips curve. 7

2.2.1 Discretionary outcome At each time t, the discretionary central bank’s optimization problem is to choose {y , t π , i } to maximize the value today, taking as given the value function (W (·)) and policy t t t+1 functions for inflation and output (π (·) and y (·)) in the next period. That is, t+1 t+1 W (s ) = max u(y ,π )+βE W (s ), (6) t t t t t t+1 t+1 πt,yt,it subject to equations (1), (2), and (3). Let {W (·), π (·), y (·), i (·)} be the set of time-invariant value and policy functions that d d d d solve the Bellman equation above and in which the ELB binds only in the crisis state.12 They are functions of today’s shock realization, s . The discretionary outcome is defined t as, and denoted by, the state-contingent sequence of output, inflation, and the policy rate, {y (st),π (st),i (st)}∞ suchthaty (st) = y (s ),π (st) = π (s ),andi (st) = i (s ) d,t d,t d,t t=1 d,t d t d,t d t d,t d t and the discretionary value sequence is defined as, and denoted by, {V (st)}∞ such that d,t t=1 V (st) = W (s ). We will also refer to the discretionary outcome as the outcome under the d,t d t optimal discretionary policy (ODP). 2.2.2 Commitment outcome At the beginning of time one, the central bank with commitment technology chooses a state-contingent allocation, {y (st), π (st), i (st)}∞ , to maximize the time-one value. That t t t t=1 is, ∞ (cid:88) V (s ) = max E βt−1u(y (st),π (st)), (7) c,1 1 1 t t {yt(st),πt(st),it(st)}∞ t=1 t=1 subject to equations (1), (2), and (3) for all t ≥ 1 and after all histories of shocks st. The commitment outcome, or the Ramsey outcome, is defined as the solution to this optimization problem. Inotherwords,thecommitmentoutcomeisacompetitiveoutcomewiththehighest time-one value. We denote the commitment outcome by {y (st), π (st), i (st)}∞ . The c,t c,t c,t t=1 value sequence associated with the commitment outcome is denoted by {V (st)}∞ and will c,t t=1 be referred to as the commitment value sequence. We will also refer to the commitment outcome as the outcome under the optimal commitment policy (OCP). 2.2.3 Sustainable outcomes At the beginning of time one, the central bank chooses a state-contingent allocation, {y (st), π (st), i (st)}∞ , to maximize the time-one value: t t t t=1 12There also exists a time-invariant solution to this discretionary government’s problem in which the ELB binds in both states. See Armenter (2017), Nakata (2018), and Nakata and Schmidt (Forthcoming) for extensive analyses of such deflationary Markov-perfect equilibrium. 8

∞ (cid:88) V (s ) = max E βt−1u(y ,π ), (8) s,1 1 1 t t {yt(st),πt(st),it(st)}∞ t=1 t=1 subject to equations (1), (2), and (3), and the following sustainability constraint, ∞ (cid:88) E βku(y (st+k),π (st+k)) ≥ WN(s ), (9) t t+k t+k d t k=0 for all t ≥ 1 and after all histories of shocks, st. The left-hand side of the sustainability constraint is the continuation value of implementing a chosen state-contingent allocation at time t after st. The right-hand side, WN(s ), is the continuation value if the central bank d t deviates from the chosen state-contingent allocation, with N indicating how many periods it takes for the central bank to restore its lost reputation (“punishment” duration). During the periods of reputational loss, the central bank cannot engage in state-contingent policies. That is, the central bank has to act under discretion. WN(s ) is recursively defined as follows. For N = 0, d t W0(s) := V (s), π0(s) := π (s), y0(s) := y (s). d s,1 d s,1 d s,1 In this case with N = 0, the punishment duration is zero and the central bank is not allowed to deviate from the sustainable outcome. For any N > 0, WN(s) = max u(π,y)+βE[WN−1(s(cid:48))|s] d d π,y,i where the maximization is subject to the private-sector equilibrium conditions, taking as given the value and policy functions for the next period (that is, WN−1(·), πN−1(·), and d d yN−1(·)). d Notethatthesustainabilityconstrainthastoberespectedeachperiodandforeachhistory of shocks, just as the Euler equation, the Phillips curve, and the ELB constraint have to be respected each period and for each history of shock. The sustainable outcome with N-period reputational loss is defined as the solution to this infinite-horizon optimization problem. We will also refer to the sustainable outcome with N-period reputational loss as the outcome under the optimal sustainable policy (OSP) with N-period reputational loss. Note that the punishment value, WN(s ), is determined jointly with the sustainable d t outcome, except when N = ∞. When N = ∞, the punishment lasts forever and its value is givenbythediscretionaryvalue,W (s),whichisindependentofthesustainableoutcome. For d any finite N, the central bank eventually restores its reputation and the economy returns to the allocations consistent with the sustainable outcome. Thus, the punishment value and the sustainable outcome are not independent of each other. An increase (decrease) in the value associated with the sustainable outcome implies an increase (decrease) in the punishment value. 9

As described in detail in Appendix A, once the sustainable outcome is computed from the optimization problem above, we can construct a plan—a pair of central bank and privatesectorstrategies—thatinducesthesustainableoutcomeandthathasatrigger-typestructure. In particular, we can construct a revert-to-discretion plan in which (i) the economy follows the sustainable outcome as long as the central bank has never deviated from the policy rate path consistent with the sustainable outcome in the past, and (ii) the economy follows the discretionary outcome, or a temporary deviation to a discretionary regime, otherwise. By construction, such a revert-to-discretion plan is credible, meaning that neither the central bank nor private-sector agents have incentives to deviate from the instructions given by the strategies. The central bank does not have an incentive to deviate from the policy rate path consistent with the sustainable outcome because the sustainability constraint is imposed on the central bank’s optimization problem, ensuring that the continuation value under the sustainable outcome is at least as large as the punishment continuation value. Private-sector agents do not have incentives to deviate from the private-sector strategy because the Euler equation and the Phillips curves are satisfied, meaning that the output and inflation are consistent with their optimizing behaviors given the central bank strategy. Even though the deviation does not occur in equilibrium, the specification of what would happen if the central bank were to deviate from the sustainable outcome does affect what happens under the sustainable outcome. Ifthesustainabilityconstraintdoesnotbindatanytimetandafteranyhistoriesofshocks, the sustainable outcome coincides with the commitment outcome. Also, if the sustainability constraint always binds—which happens, for example, when the punishment length (N) is zero or when the crisis frequency (p ) is zero—the sustainable outcomes coincides with the H discretionaryoutcome. Ourmaininterestisthosecasesinwhichthesustainabilityconstraint occasionally binds. 2.3 Parameter values Table 1 shows the baseline parameter values. The quarterly frequency of crises is set to 0.5/100 (=2/400). This choice is motivated by the fact that, in the United States, there have been two large crises that pushed the short-term nominal interest rate to the ELB over roughly the last 100 years (400 quarters) since the creation of the Federal Reserve System. The crisis shock persistence is set to 3/4, which implies the expected duration of the crisis shock of 4 quarters. σ is set to 1. r is chosen so that output declines 7 percent in the crisis c state under the optimal discretionary policy. Conditional on the value of r , the value of κ is c chosen so that inflation declines 1 percentage point (annualized) in the crisis state under the optimal discretionary policy. This severity of the crisis is consistent with that considered in Boneva, Braun, and Waki (2016) and Nakata (2018), and is intended to capture the severity of the Great Recession of 2007-2009 in the United States. We consider three values for the duration of reputational loss (20, 60, and ∞), which 10

Table 1: Baseline Parameter Values Parameter Description Values β Discount factor 0.9925 σ Intertemporal elasticity of substitution 1 κ Slope of the Phillips Curve 0.25/7 i Effective lower bound on the policy rate 0 ELB p Frequency of the crisis state 0.5/100 H p Persistence of the crisis state 3/4 L r∗ Natural rate in the normal state 3/400 r Natural rate in the crisis state **Chosen so that c y (s =r )=−0.07 d t c N Duration of reputational loss [20, 60, ∞] are chosen to cover qualitatively distinct cases that can arise. To put these values into perspective, note that the assumption that the central bank can restore its reputation after a finite number of periods can be motivated by the fact that the tenure of governorship at central banks is finite as well as the possibility that reputation may be specific to the leader of the central bank, as opposed to the institution. As shown in Appendix F, the average tenure of the governorship in central banks in economies that have recently faced, or are currently facing, the ELB ranges from about 5 years (20 quarters) in the Bank of Japan to about 10 years (40 quarters) for the Bank of Canada. The maximum tenure duration exceeds 15 years (60 quarters) at several central banks (the Federal Reserve, Bank of Canada, Bank of England, and Sveriges Riksbank). 2.4 Solution method The model is highly nonlinear, featuring two inequality constraints—the ELB constraint and the sustainability constraint—and cannot be solved analytically. Following Kehoe and Perri (2002) and Sunakawa (2015), we recursify the infinite-horizon optimization problem of the central bank into a saddle-point functional equation using the Lagrange multiplier on the Euler equation as a pseudo-state variable. We then apply a time-iteration method to find the set of time-invariant policy functions that solve the saddle-point functional equation. Appendix C describes the details of the solution method as well as its accuracy.13 13Asdiscussedearlier,ourtime-iterationmethodfailstoconvergewhenN issufficientlysmall. Theuseof alternativegridpoints,suchasnon-rectanglegridsandsimulation-basedgrids,andtheuseofalternativebasis functionsdidnothelp. ForthosesmallvaluesofN underwhichthetime-iterationfails,weuseanalternative solutionmethoddescribedinAppendixE,whichcansolvethemodelforasubsetofthosesmallNs. Alltold, with these two solution methods, we could not solve the model for N between 28 and 36. 11

3 Results 3.1 Dynamics Figure 1 shows the dynamics of the economy under the ODP, the OCP, and OSPs with N = [20,60,∞]. In this figure, the crisis shock hits the economy at time 1 and stays there until time 8. The crisis shock disappears at time 9 and the economy is in the normal state from that point on. Figure 1: Dynamics Policy Rate (ann. %) Output Gap (%) Inflation (ann. %) 3 1 4 2 0.5 2 0 0 -2 1 OCP/OSP w/ N= -4 -0.5 OSP w/ N=60 OSP w/ N=20 -6 ODP 0 -1 1 10 20 1 10 20 1 10 20 Note: ODP, OCP, and OSP stand for optimal discretionary policy, optimal commitment policy, and optimal sustainable policy, respectively. The policy rate and the inflation rate are expressed in annualized percent. The output gap is expressed in percent. Under the ODP—shown by the solid red lines—the central bank keeps the policy rate at the ELB as long as the crisis shock persists and raises the policy rate immediately after the crisis shock disappears. Under the OCP—shown by the solid black lines—the central bank keeps the policy rate at the ELB even after the crisis shock disappears, engineering the overshooting of inflation and output above their targets. Since households are forward looking, the anticipation of high inflation and high output in the aftermath of the crisis stimulates economy activity during the crisis. The declines in inflation and output are substantially smaller under the OCP than under the ODP. The allocations under the OSP with N = ∞ are identical to those under the OCP in this crisisscenario. AsshownintheleftpanelofFigure2,thevalueundertheOSPwithN = ∞— shown by the solid black—is always above the value in case the central bank deviates from the OSP with N = ∞—shown by the dashed red line. That is, the sustainability constraint does not bind. In our calibration, the crisis shock is sufficiently frequent so that the cost of being unable to use lower-for-longer policies in the future forever outweighs the benefit of eliminating the temporary overshooting of inflation and output targets. This result is consistent with the finding of Nakata (2018) that a very small probability of being hit by the crisis shock suffices to make the OCP credible. When the loss of reputation is not as long, the cost of reneging on the lower-for-longer 12

Figure 2: Value of fulfilling versus reneging on the promised allocations Value Value Value (N= ) (N=60) (N=20) 0 0 0 -100 -100 -100 -200 -200 -200 OSP -300 -300 -300 Reneging 1 10 20 1 10 20 1 10 20 Note: OSP stands for optimal sustainable policy. promise in the aftermath of the crisis shock is smaller. In other words, the continuation value in case of deviation is higher with a smaller N. The middle panel of Figure 2 shows the value of the OSP with N = 60 and the value of deviating from the OSP with N = 60 (solid black and dashed red lines, respectively). According to the panel, the sustainability constraint binds right after the crisis shock disappears, limiting the magnitude of the overshooting in the aftermath of the crisis. The smaller overshoot means that inflation and output decline by more during the crisis under the OSP with N = 60 than under the OCP and the OSP with N = ∞. However, the declines in inflation and output are still much smaller under the OSP with N = 60 than under the ODP. Similarly, the sustainability constraint binds right after the crisis shock disappears under the OSP with N = 20, as can be seen in the right panel of Figure 2, limiting the magnitude of the overshoot. The overshoot in the aftermath of the crisis is smaller—and as a result, the declines in inflation and output are larger—under the OSP with N = 20 than under the OSP with N = 60. Even with N = 20, the declines in inflation and output are still much smaller under the OSP with N = 60 than under the ODP. Reflecting the less severe crisis under the OSP with N = 20 and N = 60, the welfare cost of the ELB—shown in Table 2—is substantially lower under these OSPs than under the ODP. With N = 60, welfare cost of the ELB is about 20 percent of that under the ODP and is only slightly larger than under the OCP. Even with N = 20, welfare cost of the ELB is only about half of the ODP. 3.2 History Dependence Figure 3 shows the dynamics of output and the policy rate—displayed in the top and bottom rows, respectively—under three alternative realized durations of the crisis shock. The first, second, and third columns are for the realized crisis shock duration of 1, 4, and 8 quarters, respectively. 13

Table 2: Welfare Cost of the ELB abs(E[V]) Optimal commitment policy 29.5 (0.23) Optimal sustainable policy with N = ∞ 29.5 (0.23) with N = 60 34.6 (0.27) with N = 20 63.3 (0.49) Optimal discretionary policy 128.1 (1.00) Note: NumbersinparenthesesarethewelfarecostoftheELBrelativetothatundertheoptimaldiscretionary policy. Figure 3: History Dependence (I) —Dynamics with Alternative Realized Crisis Shock Durations— Output Gap (%) Output Gap (%) Output Gap (%) (Shock Duration=1) (Shock Duration=4) (Shock Duration=8) 5 5 5 0 0 0 -5 -5 -5 OCP/OSP w/ N= OSP w/ N=60 OSP w/ N=20 ODP -10 -10 -10 1 10 20 1 10 20 1 10 20 Policy Rate (ann. %) Policy Rate (ann. %) Policy Rate (ann. %) (Shock Duration=1) (Shock Duration=4) (Shock Duration=8) 3 3 3 2 2 2 1 1 1 0 0 0 1 10 20 1 10 20 1 10 20 Note: ODP, OCP, and OSP stand for optimal discretionary policy, optimal commitment policy, and optimal sustainable policy, respectively. The policy rate and the inflation rate are expressed in annualized percent. Under the OCP—shown by the solid black lines—the additional ELB duration is 2, 4, and 6 quarters when the realized crisis shock duration is 1, 4, and 8 quarters, respectively, as can be seen in the bottom panels of Figure 3. The magnitude of the output overshoot is 2 percentagepoints, 4percentagepoints, and5percentagepointswhentherealizedcrisisshock duration is 1, 4, and 8 quarters, respectively, as can be seen in the top panels of Figure 3. Thus, both the additional ELB duration and the size of the inflation and output overshoot 14

Figure 4: History Dependence (II) Size of Output Gap Additional ELB Duration Overshooting 8 8 6 6 4 4 2 2 0 0 5 10 15 20 5 10 15 20 Realized Shock Duration Realized Shock Duration OCP OSP w/ N= OSP w/ N=60 OSP w/ N=20 ODP Note: ODP, OCP, and OSP stand for optimal discretionary policy, optimal commitment policy, and optimal sustainable policy, respectively. The output gap is expressed in percent. depend on the realized crisis shock duration. This dependence can be seen in Figure 4, which shows how the additional ELB duration and the size of the output gap overshoot—shown in the left and right panels, respectively—vary with the realized crisis shock duration.14 The history dependence of the OCP is in sharp contrast with the lack of history dependence in the ODP. Under the ODP—shown by the solid red lines in Figure 3 and 4—the additional ELB duration and the size of output gap overshoot do not depend on the realized crisis shock duration: they are 0 quarter and 0 percent, regardless of the realized crisis shock duration. OSPs with sufficiently large Ns exhibit qualitatively similar history dependence to that of the OCP. With N = ∞, the dynamics of the economy under the OSP—shown by the dashed lines in Figure 3 and 4—are identical to those under the OCP when the realized crisis duration is sufficiently small. However, according to the right panel of Figure 4, when the realized crisis duration is sufficiently long—longer than 12 quarters—the size of the output gap overshoot does not increase further with an increase in the realized crisis shock duration, because the sustainability constraint bindsand limits the magnitude of the outputovershoot. Because the crisis persistence is 0.75, the probability that the crisis shock lasts for more than 12 quarters is very small, but it is not zero. Thus, the OSP with N = ∞ is less history dependent than the OCP. Under the OSP with N = 60, the additional ELB duration and the size of the output gap overshoot increase with the realized crisis duration when the realized crisis duration is 14In computing the additional ELB duration and the size of the output gap overshoot, we assume that, priortothecrisisshock,theeconomyhasbeeninthenormalstateforsometimeandtheLagrangemultiplier on the Euler equation is zero in the period right before the crisis shock materializes. 15

short, as can be seen by the dash-dotted lines in Figure 3 and 4. In particular, when the realizedcrisisdurationislongerthan3quarters,theydonotincreasefurtherwiththerealized crisis duration, as the sustainability constraint binds and limits the magnitude of the output overshoot. The OSPs with N = 60 is history dependent but is less history dependent than the OCP or the OSP with N = ∞. OSPsarenothistorydependentatallwhenthedurationofreputationallossissufficiently short. As can be seen by the dotted lines in Figure 3 and 4, with N = 20, the additional ELB duration and the size of the output gap overshoot do not depend at all on the realized crisis shock duration: they are 2 quarters and 2 percentage points. That is, the OSP with N = 20 is history independent. Note that the magnitude of overshooting is slightly larger under the OSP with N = ∞ thanundertheOCPwhentherealizedcrisisshockdurationis9to13quarters—atorslightly below the shortest crisis shock duration associated with the binding sustainability constraint under the OSP with N = ∞. Consistent with this pattern, the post-crisis path of the policy rate is slightly lower under the OSP with N = ∞ than under the OCP when the realized crisis shock duration is 9 to 13 quarters (not shown). In fact, the post-crisis ELB duration is one period longer under the OSP with N = ∞ than under the OCP when the realized crisisshockdurationis12quarters. Similarly, themagnitudeofovershootingisslightlylarger under the OSP with N = 60 than under the OCP or the OSP with N = ∞ when the realized crisis shock duration is 2 to 4 quarters—at or slightly below the lowest crisis shock duration associated with the binding sustainability constraint under the OSP with N = 60. How can the magnitude of overshooting slightly be larger with a smaller N for certain realized crisis shock durations? Suppose that the crisis shock has persisted for a long time, but if the economy were to return to the normal state in the next period, the magnitude of post-crisis overshooting will not be limited by the sustainability constraint. However, if the magnitude of post-crisis overshooting will be limited by the sustainability constraint in case thecrisislastsforafewmoreperiods,thepossibilityofbeingconstrainedbythesustainability constraint down the road exerts downward pressures on output and inflation now. Because the sustainability constraint will not be binding if the economy returns to the normal state in the next period, the central bank tries to counteract that force by promising a larger postcrisis overshooting—larger than what it would promise in the absence of the sustainability constraint—if the economy were to return to the normal state in the next period. 3.3 On “reasonable” duration of reputational loss When the loss of reputation lasts for a long time, the power of reputation is strong and OSPs resemble the OCP. When the loss of reputation lasts for a short time, the power of reputation is weak and OSPs resemble the ODP. A natural question that arises is what reasonable values of the duration of reputational loss are. One way to think about the reasonable duration of reputational loss is to hypothetically 16

ask how long it might take for a central bank to restore its reputation once it loses it. The reasonable value of the duration of reputational loss based on this thought experiment may depend on various factors: whether one believes the central bank’s reputation is individualspecific or institution-specific and the tenure duration of the central bank’s governors or chairs, if one believes in the individual specific nature of reputation. Anotherwaytothinkaboutthereasonabledurationofreputationallossistotheoretically refine the concept of sustainability. In models in which the commitment and discretionary outcomes are different, there are multiple—typically infinitely many—sustainable plans. In this paper, we study infinitely many (countable) sustainable plans indexed by the duration of lost reputation. By imposing further restrictions on the set of sustainable plans and thus refining the concept of sustainability, we can select one of these sustainable plans as being more reasonable than others. One refinement concept for any sequential equilibria—a sustainable plan in our setup— developed by game theorists in the context of two-player games is renegotiation proofness. Roughly speaking, renegotiation proofness requires that, even if the deviation from an equilibrium were to occur hypothetically, two players would have no incentives to renegotiate the contract—strategiesinoursetup—thattheyhaveinitiallyagreedon. Accordingtoonedefinitionofrenegotiation-proofnessproposedbyPearce(1987),asustainableplanisrenegotiationproof if the punishment value associated with that plan is lower than the punishment value of any other sustainable plans.15 To apply this concept of renegotiation proofness to our model, the dash-dotted red line in Figure 5 shows the punishment value associated with deviating from the OSPs with different values for N. According to the figure, the punishment continuation value is non-monotonic. When N is large, a reduction in the punishment duration increases the punishment value: all else equal, it is good to stay in the discretionary regime for a shorter duration, as the value under the discretionary regime is lower than the value under the OSP. However, when the punishment duration is sufficiently short and the sustainability constraint binds, a shorter punishmentdurationlowersthevalueassociatedwiththeOSPs. Thisnon-monotonicityarises because a shorter punishment duration limits the size of the overshoot in the aftermath of crises and lowers the value associated with OSPs. As a result, when the punishment regime ends, the economy will return to a sustainable outcome that is not as good as the sustainable outcome with a longer punishment duration. When N is sufficiently small, this second effect dominates the first effect, and a shorter punishment duration lowers the punishment value. According to Figure 5, in our model, the sustainable outcome with the least severe punishment value is the sustainable outcome with N = 37. The dash-dotted red lines in Figure 6 15Farrell and Maskin (1989) proposed an alternative definition of renegotiation-proofness whereby a sustainable plan is renegotiation proof if there is no Pareto-improving move to another sustainable plan after deviating from the on-equilibrium path at any point in time. As pointed out by Matsuyama (1997), in models with benevolent government where the government’s objective function and the private-sector’s objective function coincide, this definition rules out any chance for the economy to achieve allocations better than the ODP. 17

Figure 5: Values in case of reneging Note: Continuation value under the sustainable outcome after a crisis that has lasted for 8 quarters. show the dynamics of the economy under the OSP with N = 37, together with those under the OCP and the ODP. The post-crisis ELB duration is 3 quarters under the OSP with N = 37, 3 quarters shorter than that under the OCP. The size of the post-crisis overshooting is smaller—and output and inflation decline by more—under the OSP with N = 37 than under the OCP. However, the declines in inflation and output are much smaller under the OSP with N = 37 than under the ODP. The welfare cost of the ELB constraint is about one-third of that under the ODP. Figure 6: Dynamics Policy Rate (ann. %) Output Gap (%) Inflation (ann. %) 3 1 4 2 0.5 2 0 0 -2 1 OCP -4 -0.5 OSP w/ N=37 -6 ODP 0 -1 1 10 20 1 10 20 1 10 20 Note: ODP, OCP, and OSP stand for optimal discretionary policy, optimal commitment policy, and optimal sustainable policy, respectively. The policy rate and the inflation rate are expressed in annualized percent. The output gap is expressed in percent. 18

4 Additional results and discussion 4.1 Relation to the simple forward guidance policies of Walsh (2018) We have shown that, when N is sufficiently small, the OSPs are not history dependent; thepolicyratepathafterthecrisisshockdisappearsdoesnotdependontherealizedduration of the crisis shock. Thus, the OSPs bear some resemblance to the simple forward guidance policies considered by Walsh (2018). Under the simple forward guidance policies of Walsh (2018), the central bank keeps the policy rate at the ELB for a fixed number of periods after the crisis shock disappears, regardless of the realized duration of the crisis shock, and lets the policy rate return to the steady-state level immediately thereafter. The only (minor) difference is that, under the optimal sustainable policy, the policy rate does not return to the steady state level immediately after liftoff. Instead, there is typically one period after liftoff in which the policy rate is still below the steady-state level. The similarity between the simple forward guidance policies and OSPs with small Ns points to one benefit of OSPs over the OCP; it may be easier for central banks to explain these OSPs to the public than the OCP. One key criticism against the OCP is that it is complex. As Walsh (2018) argues, because of its complexity, it may be difficult for the central bank in practice to steer the private-sector agents’ expectations in a way consistent with the OCP. One dimension of complexity is history dependence. The OSPs have an advantage over the OCP because they are less history dependent and thus simpler. Note that, in our model, the ODP and the OSPs with small Ns are history independent but state-contingent. They are state-contingent because the policy rate path—in particular the liftoff date—depends on the realized crisis shock duration. Thus, these policies are different from so-called calendar-based forward guidance that specifies the likely liftoff date, if that guidance were to be narrowly or mistakenly interpreted as a non-state-contingent commitment to raising the policy rate from the ELB at a particular date regardless of the evolution of the economy.16 4.2 Relation to the loose commitment approach of Bodenstein, Hebden, and Nunes (2012) Under OSPs, the central bank achieves crisis-state allocations that are “in between” that under the ODP and that under the OCP. This feature of OSPs is reminiscent of the optimal policy obtained in a loose commitment approach in which the central bank reoptimizes with a constant probability every period regardless of the incentive to renege on the prior 16EvenwhencentralbanksindicatealikelydateofliftofffromtheELB,theytypicallyemphasizethatthe liftoffdatewilldependontheevolutionoftheeconomicoutlook. Thatis,iftheeconomyweretorecoverfaster or more slowly than in the baseline economic projection, the central bank will raise the policy rate from the ELB earlier or later than the most likely liftoff date under the baseline projection. In practice, it is unlikely that any central bank will ever engage in non-state-contingent forward guidance, though market participants may not interpret the forward guidance specifying the likely date of liftoff as state-contingent as the central bank intends. 19

commitment.17 While these two approaches differ from each other in many ways, both approaches share the same spirit that they are intended to shed light on what the central bank may be able to achieve when no explicit commitment technology is available. Indeed, recent work by Fujiwara, Kam, and Sunakawa (2019) shows that, when using a model without the ELB, the allocations under the loose commitment approach with an appropriately chosen re-optimization probability can approximate the allocation under the OSP with N-period punishment reasonably well for any N. While we believe their result is likely to extend to the model with ELB, it would be useful to verify the validity of their claim in our model in future research. 4.3 Results from the model with a forward-looking Phillips curve As discussed earlier, we have focused on the model with a static Phillips curve instead of the model with a forward-looking Phillips curve, because the range of durations of reputational loss under which we can solve the model is wider in the model with a static Phillips curve. A natural question is whether our key results thus far would extend to the model with a forward-looking Phillips curve. In this section, we discuss some select results from the model with a forward-looking Phillips curve. The private sector equilibrium conditions of the model with a forward-looking Phillips curve are characterized by the Euler equation given by equation (1), the forward-looking Phillips curve, π (st) = κy (st)+βE π (st+1) (10) t t t t+1 and the ELB constraint given by inequality (3). The central bank’s objective function is the same as that in the model with a static Phillips curve. The ODP, the OCP, and the OSPs are defined in ways that are similar to how they are defined in the model with the static Phillips curve. The parameter values used are shown in Table 3. The values for β, σ, p , H and s are the same as in the previous section. κ is set to 0.005. p is set to 0.5, implying H L the expected duration of the crisis state of 2 quarters. λ is set to 1/16, a value consistent with equal weights on the volatility of the output gap and the volatility of the annualized rate of inflation. A high value of λ and a low value of p increase a range of the duration of L reputational loss values under which we can solve the model. With these parameter values, we could solve the model for N ≥ 75. We show the dynamics of the model under the OSPs with three values of N = [80,160,∞]. Figure 7 shows the dynamics of the economy under the OCP, the ODP, and OSPs with the three values of N listed above. The dynamics of the economy under the OCP and the ODP are consistent with those in the model with a static Phillips curve as well as those in the existing studies. Under the OCP—shown by the solid black lines—the central bank keeps the policy rate at the ELB 17See Bodenstein, Hebden, and Nunes (2012) for an analysis of optimal monetary policy under loose commitment in the model with ELB 20

Table 3: Parameter Values —Model with the Forward-Looking Phillips Curve Parameter Description Parameter Value β Discount rate 1 ≈0.9925 1+0.0075 σ Intertemporal elasticity of substitution 1 κ Slope of the Phillips curve 0.005 λ Relative weight on output volatility [1/16] p Crisis shock frequency 0.5/100 H p Crisis shock persistence 0.5 L r∗ Natural rate in the normal state 3/400 r Natural rate in the crisis state −0.0125 c N Punishment length [80, 160, ∞] even after the crisis shock disappears and engineers the overshooting of inflation and output above their targets. Under the ODP—shown by the solid red lines—the central bank keeps the policy rate at the ELB as long as the crisis shock continues and raises the policy rate immediately once the crisis shock disappears. Figure 7: Dynamics —Model with Forward-Looking Phillips Curve— Policy Rate (ann. %) Output Gap (%) Inflation (ann. %) 4 1 5 3 0 0.5 2 -5 0 OCP 1 OSP w/ N= OSP w/ N=160 OSP w/ N=80 ODP 0 -10 -0.5 1 10 20 1 10 20 1 10 20 Note: ODP, OCP, and OSP stand for optimal discretionary policy, optimal commitment policy, and optimal sustainable policy, respectively. The policy rate and the inflation rate are expressed in annualized percent. The output gap is expressed in percent. Under the OSPs—shown by the dashed, dash-dotted, and dotted black lines for N = ∞, N = 160, and N = 80, respectively—the central bank keeps the policy rate at the ELB after the crisis shock disappears, but not as long as it would do under the OCP. The magnitudes of inflation and output overshoots are smaller under the OSPs than under the OCP, with the magnitudes smaller when N is smaller. Consistent with the magnitudes of the overshoot in the aftermath of the crisis, the paths of output and inflation during the crisis state are lower when N is smaller. While the OSPs are not as stimulative as the OCP, they still substantially reduce the welfare cost of the ELB constraint relative to the ODP, which can 21

be seen in Table 4. Table 4: Welfare Cost of ELB —Model with Forward-Looking Phillips Curve— abs(E[V]) Optimal commitment policy 26.8 (0.39) Optimal sustainable policy with N = ∞ 27.1 (0.39) with N = 160 28.0 (0.40) with N = 80 29.9 (0.43) Optimal discretionary policy 68.9 (1.00) Note: NumbersinparenthesesarethewelfarecostoftheELBrelativetothatundertheoptimaldiscretionary policy. One key feature of OSPs in the model with a static Phillips curve is that they are less history dependent than the OCP. To examine whether this feature holds in the model with a forward-looking Phillips curve, Figure 8 shows how the additional ELB duration, the size of the output gap overshoot, and the size of the inflation overshoot vary with the realized crisis shock duration. In the figure, the solid black and red lines are for the OCP and the ODP, respectively, whereas the dashed, dash-dotted, and dotted black lines are for the OSPs with N = ∞, N = 160, and N = 80, respectively. According to the figure, the additional ELB duration, the size of the output gap overshoot, and the size of the inflation overshoot are less sensitive to the realized crisis shock duration under the OSPs than under the OCP. That is, the OSPs are less history dependent than the OCP. Also, the OSPs are less history dependent with smaller Ns. All told, qualitatively, the key insights from the model with a static Phillips curve carry over to the model with a forward-looking Phillips curve. 5 Conclusion In this paper, we have characterized OSPs in models with the ELB constraint. We find that, even when the OCP is not credible, the central bank can still credibly commit to keeping the policy rate at the ELB in the aftermath of a crisis—though not as long as under the OCP—and meaningfully mitigate the adverse consequences of the ELB constraint on economic activity in crises. By construction, our OSPs are time-consistent and thus overcome the criticism that the temporary overheating of the economy associated with lower-for-longer strategies is not credible. When the loss of reputation is sufficiently short-lived, these OSPs are not history dependent or not as history dependent as the OCP. Thus, it overcomes the criticism that the implied policy rate path is too complex for the central bank to be able to explain to the public, making the OSPs even more attractive. 22

Figure 8: History Dependence —Model with Forward-Looking Phillips Curve— Additional Size of Size of ELB Duration Output Gap Overshooting Inflation Overshooting 7 6 0.2 6 5 0.15 4 4 0.1 3 2 2 0.05 1 0 0 0 2 4 6 8 2 4 6 8 2 4 6 8 Realized Shock Duration Realized Shock Duration Realized Shock Duration OCP OSP w/ N= OSP w/ N=160 OSP w/ N=80 ODP Note: ODP, OCP, and OSP stand for optimal discretionary policy, optimal commitment policy, and optimal sustainable policy, respectively. The inflation rate are expressed in annualized percent. The output gap is expressed in percent. Althoughwefocusonthetime-consistencyaspectoflower-for-longerpoliciesinthispaper, there are other aspects of these policies that could make them less attractive in reality than in theory. For example, the public may not understand the temporary nature of the inflation overshooting, resulting in unanchoring of the long-run inflation expectations (Kohn (2012) andYellen(2018)). Theoverheatingoftheeconomymaybelessdesirableforpolicymakersin reality than what’s implied by our model if the overheating of the economy leads to financial instability (Yellen (2018)). It would be useful to formally analyze how these factors affect the effectiveness and implementability of lower-for-longer strategies. We leave such analysis to future research. 23

References Abreu, D., D. Pearce, and E. Stacchetti (1990): “Toward a Theory of Discounted Repeated Games with Imperfect Monitoring,” Econometrica, 58(5), 1041–1063. Adam, K., and R. Billi (2006): “Optimal Monetary Policy Under Commitment with a Zero Bound on Nominal Interest Rates,” Journal of Money, Credit, and Banking, 38(7), 1877–1905. Armenter, R. (2017): “The Perils of Nominal Targets,” Review of Economic Studies, 0, 1–37. Barro, R., and D. Gordon (1983): “Rules, Discretion, and Reputation in a Model of Monetary Policy,” Journal of Monetary Economics, 12, 101–121. Barthe´lemy, J., and E. Mengus (2018): “The Signaling Effect of Raising Inflation,” Journal of Economic Theory, 178, 488–516. Bean, C. (2013): “Global Aspects of Unconventional Monetary Policies,” Remarks at the Federal Reserve Bank of Kansas City Economic Policy Symposium, Jackson Hole, Wyoming. Bernanke, B. S. (2017): “Temporary Price-Level Targeting: An Alternative Framework for Monetary Policy,” Ben Bernanke’s Blog (October 12, 2017). Berriel, T. C., and A. Mendes (2015): “Central Bank Balance Sheet, Liquidity Trap, and Quantitative Easing,” Mimeo. Bhattarai, S., G. Eggertsson, and B. Gafarov(2013): “TimeConsistencyandtheDuration of Government Debt: A Signalling Theory of Quantitative Easing,” NBER Working Paper No. 21336. Bilbiie, F. O. (Forthcoming): “Optimal Forward Guidance,” American Economic Journal: Macroeconomics. Billi, R. (2017): “A Note on Nominal GDP Targeting and the Zero Lower Bound,” Macroeconomic Dynamics, 21(8), 2138–2157. Bodenstein, M., J. Hebden, and R. Nunes (2012): “Imperfect Credibility and the Zero Lower Bound,” Journal of Monetary Economics, 59(2), 135–149. Boneva, L. M., A. R. Braun, and Y. Waki (2016): “Some Unpleasant Properties of Loglinearized Solutions when the Nominal Rate is Zero,” Journal of Monetary Economics, 84, 216–232. 24

Bullard, J. (2013): “Monetary Policy in a Low Policy Rate Environment,” OMFIF Golden Series Lecture, London, United Kingdom. Burgert, M., and S. Schmidt (2014): “Dealing with a Liquidity Trap when Government Debt Matters: Optimal Time-Consistent Monetary and Fiscal Policy,” Journal of Economic Dynamics and Control, 47, 282–299. Carney, M. (2012): “Guidance,” Remarks at the CFA Society Toronto, Toronto, Ontario. Chang, R. (1998): “Credible Monetary Policies in an Infinite Horizon Model: Recursive Approaches,” Journal of Economic Theory, 81, 431–461. Chari, V. V., and P. Kehoe (1990): “Sustainable Plans,” Journal of Political Economy, 98(4), 783–802. Clarida, R. H. (2019): “The Federal Reserve’s Review of Its Monetary Policy Strategy, Tools, and Communication Practices,” Remarks at the 2019 U.S. Monetary Policy Forum, sponsored by the Initiative on Global Markets at the University of Chicago Booth School of Business, New York City. Cœure´, B. (2013): “The Usefulness of Forward Guidance,” Remarks at the Money Marketeers Club of New York, New York City, New York. Dong, B., and E. Young (2019): “Forward Guidance and Credible Monetary Policy,” Mimeo. Dudley, W. C. (2013): “Unconventional Monetary Policies and Central Bank Independence,” Remarks at the Central Bank Independence Conference: Progress and Challenges in Mexico, Mexico City, Mexico. Eggertsson, G. (2006): “The Deflation Bias and Committing to Being Irresponsible,” Journal of Money, Credit, and Banking, 38(2), 283–321. Eggertsson, G., and M. Woodford (2003): “The Zero Bound on Interest Rates and Optimal Monetary Policy,” Brookings Papers on Economic Activity, 34(1), 139–235. Farrell, J., and E. Maskin (1989): “Renegotiation in repeated games,” Games and Economic Behavior, 4, 327–360. Fujiwara, I., T. Kam, and T. Sunakawa (2016): “Sustainable International Monetary Policy Cooperation,” Working Paper. (2019): “On Two Notions of Imperfect Credibility in Optimal Monetary Policies,” Economics Letters, 174, 22–25. Gal´ı, J. (2015): Monetary Policy. Inflation, and the Business Cycle. Princeton: Princeton University Press. 25

George, E. L. (2019): “The Economic Outlook, Monetary Policy and the Federal Open Market Committees Inflation Objective,” Remarks at the The Money Marketeers of New York University, New York City. Jeanne, O., and L. E. O. Svensson (2007): “Credible Commitment to Optimal Escape from a Liquidity Trap: The Role of the Balance Sheet of an Independent Central Bank,” American Economic Review, 97(1), 474–490. Jung, T., Y. Teranishi, and T. Watanabe (2005): “Optimal Monetary Policy at the Zero-Interest-Rate Bound,” Journal of Money, Credit, and Banking, 35(7), 813–35. Kehoe, P. J., and F. Perri (2002): “International Business Cycles with Endogenous Incomplete Markets,” Econometrica, 70(3), 907–928. Kohn, D. L.(2012): “InterviewwithDonaldKohn,”Top of Mind, Goldman Sachs Research. Kurozumi, T. (2008): “Optimal Sustainable Monetary Policy,” Journal of Monetary Economics, 55, 1277–1289. Lacker, J. (2013): “Monetary Policy in the United States: The Risks Associated With Unconventional Policies,” Remarks at the Swedbank Economic Outlook Seminar, Stockholm, Sweden. Loisel, O. (2008): “Central bank reputation in a forward-looking model,” Journal of Economic Dynamics and Control, 11, 3718–3742. Matsuyama, K. (1997): “Credibility and Intertemporal Consistency: A Note on Strategic Macroeconomic Policy Models,” Macroeconomic Dynamics, 1, 658–665. Nakata, T. (2015): “Credibility of Optimal Forward Guidance at the Interest Rate Lower Bound,” Feds notes, Board of Governors of the Federal Reserve System (U.S.). (2018): “Reputation and Liquidity Traps,” Review of Economic Dynamics, 28, 252–268. Nakata, T., and S. Schmidt (2019): “Gradualism and Liquidity Traps,” Review of Economic Dynamics, 31, 182–199. (Forthcoming): “Conservatism and Liquidity Traps,” Journal of Monetary Economics. Nakov, A. (2008): “Optimal and Simple Monetary Policy Rules with Zero Floor on the Nominal Interest Rate,” International Journal of Central Banking, 4(2), 73–127. Pearce, D. G. (1987): “Renegotiation Proof Equilibria: Collective Rationality and Intertemporal Cooperation,” Mimeo. 26

Phelan, C., and E. Stacchetti (2001): “Sequential Equilibria in a Ramsey Tax Model,” Econometrica, 69(6), 1491–1518. Plosser, C. (2013): “Forward Guidance,” Remarks at Stanford Institute for Economic Policy Researchs (SIEPR) Associates Meeting, Stanford, California. Powell, J. H. (2019): “Monetary Policy: Normalization and the Road Ahead,” the 2019 SIEPR Economic Summit, Stanford Institute of Economic Policy Research, Stanford, California. Reifschneider, D., and J. C. Williams (2000): “Three Lessons for Monetary Policy in a Low-Inflation Era,” Journal of Money, Credit and Banking, 32(4), 936–966. Rogoff, K. (1987): “Reputational Constraints on Monetary Policy,” Carnegie-Rochester Conference Series on Public Policy, 26, 141–182. Sukeda, A. (2018): “Credible Forward Guidance with Financial Disruption,” Mimeo. Sunakawa, T.(2015): “AQuantitativeAnalysisofOptimalSustainableMonetaryPolicies,” Journal of Economic Dynamics and Control, 52, 119–135. Ueda, K. (2013): “A Long and Bumpy Journey ahead for Forward Guidance,” in Forward Guidance: Perspectives from Central Bankers, Scholars and Market Participants, ed. by W. den Haan, pp. 165–170. VoxEU.org Ebook. Walsh, C. (2018): “Simple Sustainable Forward Guidance at the ELB,” Working Paper. Williams, J.C.(2012): “TheFederalReserve’sUnconventionalPolicies,”FRBSFEconomic Letter, 2012-34 (November 13). (2018): “Monetary Policy Strategies for a Low-Neutral-Interest-Rate World,” Remarks at the 80th Plenary Meeting of the Group of Thirty, Federal Reserve Bank of New York, New York City. Woodford, M. (2003): Interest and Prices: Foundations of a Theory of Monetary Policy. Princeton: Princeton University Press. Yellen, J. (2018): “Comments on Monetary Policy at the Effective Lower Bound,” Brookings Papers on Economic Activity, Fall. 27

Technical Appendix for Online Publication This technical appendix is organized as follows: • Appendix A defines some key concepts. • AppendixBdescribestheequilibriumconditionscharacterizingthesustainableoutcome in detail. • AppendixCdescribesthenumericalsolutionmethodandreportsthesolutionaccuracy. • Appendix D analyzes the dynamics of the model with a static Phillips curve in detail through policy and value functions. • Appendix E describes the solution method for the model with a static Phillips curve when N is small. • Appendix F documents the average tenure of chairpersons/governors in select central banks. • Appendix G collects policymakers’ speeches in which the time-inconsistency problem of the lower-for-longer policy is discussed. A Definition of a plan and credibility This section defines a plan, credibility, and the revert-to-discretion plan. The definitions closely follow Chang (1998) and Nakata (2018). A.1 Plan A government strategy, denoted by σ := {σ }∞ , is a sequence of functions that maps g g,t t=1 a history of the nominal interest rates up to the previous period and a history of states up to today into today’s nominal interest rate. Formally, σ is given by σ : S → R and g,t g,1 ≥0 σ : Rt−1×St → R for all t ≥ 2.18 Given a particular realization of {s }∞ , a sequence of g,t ≥0 ≥0 t t=1 nominal interest rates will be determined recursively by i = σ (s ) and i = σ (it−1,st) 1 g,1 1 t g,t for all t > 1 and for all st ∈ St. A government strategy is said to induce a sequence of the nominal interest rates. A private-sector strategy, denoted by σ := {σ }∞ , is a sequence of p p,t t=1 functionsmappingahistoryofnominalinterestratesuptotodayandahistoryofstatesupto todayintotoday’sconsumptionandinflation. Formally,σ isgivenbyσ : Rt×St → (R,R) p,t p,t for all t. Given a government and private-sector strategy, a sequence of consumption and inflation will be determined recursively by (y ,π ) = σ (it,st) for all t ≥ 1 and for all st ∈ St. A t t p,t private sector strategy, together with a government strategy, is said to induce a sequence of consumption and inflation.19 A plan is defined as a pair of government and private sector 18The first period is a special case, as there is no previous policy action. 19Note that, while the nominal interest rate today depends on the history of nominal interest rates up to the previous period, consumption and inflation today depend on the history of nominal interest rates up to today. The implicit within-period-timing protocol behind this setup is that the government moves before the private sector does. 28

strategies, (σ ,σ ). Notice that a plan induces an outcome—a state-contingent sequence of g p consumption, inflation, and the nominal interest rate. As discussed earlier, there is a value sequence {w (st)}∞ , associated with any outcome. t t=1 A.2 Credibility Let us use CEI(s) to denote a set of state-contingent sequences of the nominal interest t rate consistent with the existence of a competitive equilibrium when s = s. Formally, for t each s ∈ S, CEI(s) := {i (s) ∈ I∞| ∃ (y (s),π (s)) s.t. (y (s),π (s),i (s)) ∈ CE (s)}. σ is t t t t t t t t g said to be admissible if, after any history of policy actions, it−1, and any history of states, st, i (s) induced by the continuation of σ belongs to CEI(s ). t g t t Aplan, (σ ,σ ), iscredible if(i)σ isadmissible, (ii)afteranyhistoryofpolicyactions, it, g p g and any history of states, st, the continuation of σ and σ induce a (y (s ),π (s ),r (s )) ∈ p g t t t t t t CE (s ), and (iii) after any history it−1 and st, i (s ) induced by σ maximizes the governt t t t g ment’s objective over CEI(s ) given σ . In plain languages, a plan is said to be credible if t t p neither the private sector nor the government has incentive to deviate from the strategies associated with it. Anoutcomeissaidtobecredibleifthereisacredibleplanthatinducesit. Whenacertain plan A is credible and the plan A induces a certain outcome α, we say that the outcome α can be made credible, or time-consistent, by the plan A. A.3 The revert-to-discretion plan I now define a key object of this paper, the revert-to-discretion plan, and discuss the condition under which this plan is credible. Therevert-to-discretionplan,(σrtd,σrtd),consistsof(i)thefollowinggovernmentstrategy: g p σrtd = i (s ) for any s ∈ S, σrtd(it−1,st) = i (st) if i = i (sj) for all j ≤ t−1, and g,1 s,1 1 1 g,t s,t j s,j σrtd(it−1,st) = i (st) otherwise, and (ii) the following private-sector strategy: σrtd(it,st) = g,t d,t p,t (y (st),π (st)) if r = i (sj) for all j ≤ t, σrtd(it,st) = (y (s ,i ),π (s ,i )) otherwise,20 s,t c,t j c,j p,t br t t br t t where (cid:104) (cid:105) y (s ,r ) = E y (st+1)−σ (cid:2) i −E π (st+1)−r∗(cid:3) +s (11) br t t t d,t+1 t t d,t+1 t π (s ,r ) = κy (s ,r )+βE π (st+1) (12) br t t br t t t d,t+1 The government strategy instructs the government to choose the nominal interest rate consistent with the sustainable outcome, but chooses the interest rate consistent with the discretionary outcome if it has deviated from the sustainable outcome at some point in the past. The private sector strategy instructs the household and firms to choose consumption and inflation consistent with the sustainable outcome as long as the government has never deviated from the sustainable outcome. If the government has ever deviated from the nominal interestrateconsistentwiththesustainableoutcome, theprivatesectorstrategyinstructsthe household and firms to choose output and inflation today based on the belief that the government in the future will choose the nominal interest rate consistent with the discretionary outcome. By construction, the revert-to-discretion plan induces the sustainable outcome, and the implied value sequence is identical to the sustainable value sequence. 20Subscript br stands for best response. 29

It is relatively straightforward to show that the revert-to-discretion plan is credible. By construction, V (st) ≥ V (st) for all t ≥ 1 and all st ∈ St, making sure that the government s,t d,t does not have an incentive to deviate from the instruction given by the government strategy after any history it−1 and st in which the optimal sustainable policy has been followed. The revert-to-discretion plan that induces the optimal sustainable outcome with a finite period punishment is defined in a similar way (see Nakata (2018) for rigorous exposition). It is also straightforward to show that such a plan is credible. B Model with a static Phillips Curve The policymaker maximizes ∞ (cid:88) V = −E βty2, 0 0 t t=0 subject to y = (1+σ−1κ)E y −σ−1(i −r∗)+s , (13) t t t+1 t t i ≥ i , (14) t ELB ∞ (cid:88) V = −E βjy2 ≥ W(s ), (15) t t t+j t j=0 for all t ≥ 0, where we used a relationship based on the static Phillips curve, π = κy to t t obtain equation 13. The shock, s , follows two-state Markov chain, s ∈ {s ,s } where t t H L (cid:20) (cid:21) 1−p p s > s . Transition probability matrix is given as P = H H , where p is the H L 1−p p H L L frequency of the crisis and p is the persistence of the crisis. W(s ) is the value under the L t optimal discretionary policy. The analytical solution for the discretionary outcome: The ZLB is slack in the high/normal state, φ = 0, and the ZLB is binding in the low/crisis state, φ = φ > 0. H L Thus, the equilibrium conditions become y = 0, H y = −φ, L y +σ−1(i −r∗−σs )−(1+σ−1κ)(p y +(1−p )y ) = 0, H H H H L H H y +σ−1(i −r∗−σs )−(1+σ−1κ)(p y +(1−p )y ) = 0, L ELB L L L L H V = −y2 +β(p V +(1−p )V ), H H H L H H V = −y2 +β(p V +(1−p )V ). L L L L L H The solution to the system of equations is given by the following: −i +r∗+σs ELB L y = 0, y = −φ = , H L σ(1−(1+κσ−1)p ) L i = r∗+σs +σ(1+κσ−1)p y , i = i , H H H L L ELB −βp y2 −y2 +β(1−p )V V = H L , V = L L H . H L (1−β)(1−β(p −p )) (1−βp ) L H L 30

Solving for the sustainable equilibrium: The Lagrangean is given by ∞ L ≡ E (cid:88) βt(cid:8) −y2+2φ (cid:0) y +σ−1(i −r∗)−s −(1+σ−1κ)E y (cid:1) 0 t t t t t t t+1 t=0   ∞ (cid:88)  +ψ t−E t βjy t 2 +j −W(s t ) ,  j=0 ∞ (cid:26) (cid:27) = E (cid:88) βt −Ψ y2+2φ (cid:0) y +σ−1(i −r∗)−s (cid:1) − 1 2φ (1+σ−1κ)y −ψ W(s ) , 0 t t t t t t β t−1 t t t t=0 whereΨ = ψ +ψ +...+ψ < ∞isthesumoftheLagrangemultipliersonthesustainability t −1 0 t constraint. The FOCs are given by ∂y : −Ψ y −φ +β−1(1+σ−1κ)φ = 0, t t t t t−1 ∂φ : y +σ−1(i −r∗)−s −(1+σ−1κ)E y = 0. t t t t t t+1 Normalizing the first equation by Ψ , we have t −y −φ˜ +β−1(1+σ−1κ)z φ˜ = 0, t t t t−1 where φ˜ = φ /Ψ and z = Ψ /Ψ ∈ (0,1]. The Karush-Kuhn-Tucker conditions (KKTCs) t t t t t−1 t must be satisfied as well φ (i −i ) = 0, t t ELB φ ≥ 0, t ψ (V −W(s )) = 0, t t t ψ ≥ 0. t The initial conditions on the Lagrange multipliers are such that φ = 0 and ψ = 1, which −1 −1 implies φ˜ = 0 and Ψ = 1. −1 −1 C Time-iteration method Weexplicitlyconsideravectorofpolicyandvaluefunctionsς(ξ ) = [y(ξ ),i(ξ ),φ˜(cid:48)(ξ ),z(ξ ),V(ξ )](cid:48) j j j j j j asfunctionsofthestatevariablesξ = (φ˜,s )forj = 1,...,N. Wehavethefollowingasystem j j of functional equations: e (ξ ) ≡−y(ξ )−φ(cid:48)(ξ )+β−1(1+σ−1κ)z(ξ )φ˜= 0, 1 j j j j N (cid:88) e (ξ ) ≡−y(ξ )−σ−1(i(ξ )−r∗)+s +(1+σ−1κ) p(s |s )y(φ˜(cid:48)(ξ ),s ) = 0, 2 j j j j k j j k k=1 N (cid:88) e (ξ ) ≡−V(ξ )−y(ξ )2+β p(s |s )V(φ˜(cid:48)(ξ ),s ) = 0. 3 j j j k j j k k=1 31

Note that we also have two occasionally binding constraints i(ξ ) ≥ i , j ELB V(ξ ) ≥ W(s ). j j Algorithm The time iteration method takes the following steps: 1. Make an initial guess for the policy function ς(0)(ξ ) for j = 1,...,N. i 2. For n = 1,2,... (n is an index for the number of iteration), given the policy function previously obtained ς(n−1)(ξ ) for each j, solve j −y−φ˜(cid:48)+β−1(1+σ−1κ)zφ˜= 0, N (cid:88) −y−σ−1(i−r∗)+s +(1+σ−1κ) p(s |s )y(n−1)(φ˜(cid:48),s ) = 0, j k j k k=1 N (cid:88) −V −y2+β p(s |s )V(n−1)(φ˜(cid:48),s ) = 0, k j k k=1 i ≥ i , ELB V ≥ W(s ), j for (y,r,φ˜(cid:48),z,V). 3. Update the policy function by setting y = y(n)(ξ ), i = i(n)(ξ ), φ˜(cid:48) = φ˜(cid:48)(n)(ξ ), z = j j j z(n)(ξ ), V = V(n)(ξ ) for j = 1,...,N. j j (cid:13) (cid:13) 4. Repeat 2-3 until (cid:13)ς(n)(ξ j )−ς(n−1)(ξ j )(cid:13) is small enough. In Step 2, there are four patterns of binding constraints due to the KKTCs. (i) i > i and V > W(s ): We immediately know φ˜(cid:48) = 0 and z = 1. Note that, ELB j taking the policy functions in the previous iteration, y(n−1)(φ˜(cid:48),s ) and V(n−1)(φ˜(cid:48),s ) k k as given, the expected values of the next period’s variables are obtained by evaluating the policy functions at φ˜(cid:48) = 0. Then we obtain y = β−1(1+σ−1κ)φ˜, N (cid:88) i = r∗+(σ+κ) p(s |s )y(n−1)(0,s )−σy+σs , k j k j k=1 N (cid:88) V = −y2+β p(s |s )V(n−1)(0,s ). k j k k=1 (ii) i > i and V ≤ W(s ): φ˜(cid:48) = 0 and z ∈ (0,1). The sustainability constraint is ELB j 32

binding, V = W(s ). Then we obtain j (cid:32) N (cid:33)1 2 (cid:88) y = −Vd(s )+β p(s |s )V(n−1)(0,s ) , t k j k k=1 N (cid:88) i = r∗+(σ+κ) p(s |s )y(n−1)(0,s )−σy+σs , k j k j k=1 z = β(1+σ−1κ)−1y/φ˜. (iii) i ≤ i and V > W(s ): φ˜(cid:48) > 0 and z = 1. The ZLB is binding, i = i . (y,φ˜(cid:48)) ELB j ELB are obtained by solving the following nonlinear equations: y = −φ˜(cid:48)+β−1(1+σ−1κ)φ˜, N (cid:88) i = r∗+(σ+κ) p(s |s )y(n−1)(φ˜(cid:48),s )−σy+σs . ELB k j k j k=1 (iv) i ≤ i and V ≤ W(s ): φ˜(cid:48) > 0 and z ∈ (0,1). Both constraints are binding, ELB j i = i and V = W(s ). (y,φ˜(cid:48)) are obtained by solving the following nonlinear ELB j equations: N (cid:88) W(s ) = −y2+β p(s |s )V(n−1)(φ˜(cid:48),s ), k k j k k=1 N (cid:88) i = r∗+(σ+κ) p(s |s )y(n−1)(φ˜(cid:48),s )−σy+σs . ELB k j k j k=1 Note that these equations do not depend on φ˜. Having (y,φ˜(cid:48)) at hand, z = β(1 + σ−1κ)−1(y+φ˜(cid:48))/φ˜ is also obtained. In solving the problem, we discretize the state space of φ˜∈ [0,φ ¯˜]. We use 401 points for each state variable. The upper bound φ ¯˜ is endogenously set so that the sequence of the Lagrange multipliers {φ˜ } under consecutive negative shocks s = s for t = 1,2,...,T are bounded t t L 1 above.21 We divide the state space by log-spaced grid points so that more grid points are toward the origin. We use piecewise-linear functions to approximate the policy functions off the grid points. Figure 9 shows the impulse response of the residual functions {e ,e ,e } under the 1,t 2,t 3,t OSP with N = 6022. As in the impulse responses in Figure 1 and 2 in the main text, the crisis shock hits the economy at time 1 and stays until time 8. We can see that all the values of the residual functions are reasonably small. When the sustainability constraint binds at time 9, the value of e increases as z < 1 and the equation becomes nonlinear. Also, the 1,t t values of e increase when the values of V are in transition from one value to another (i.e., 3,t t in each of the first few periods after time 1 and after time 9). 21Taking as given the cutoff probability p¯=0.005, T is set to the highest integer that satisfies pT1 >p¯. 1 L 22WecannotcalculatetheEulerequationerrorsundertheOSPwithN =20astheonlyanalyticalsolutions are available. 33

Figure 9: Residuals (under the OSP with N = 60) |e | |e | |e | 10-3 1 10-3 2 10-3 3 1 1 1 0.8 0.8 0.8 0.6 0.6 0.6 0.4 0.4 0.4 0.2 0.2 0.2 0 0 0 1 10 20 1 10 20 1 10 20 Value z -20 30 1 -40 20 0.8 -60 10 0.6 -80 0.4 -100 0 1 10 20 1 10 20 1 10 20 Table 5 shows the the average and maximum of the residuals in absolute terms (log 10 units) based on a 100,000-period stochastic simulation under the OCP, the OSP with N = ∞ and the OSP with N = 60. The values of e tend to be worse than others as we use linear 3 interpolation, whereas the value functions are at least quadratic. Note that the equation for e holds with equality (up to the machine precision) under the OCP, as the equation is linear 1 (i.e., z = 1) and we use the equation to substitute variables other than the ones we solve for with the other equations. Table 5: Residuals |e | |e | |e | 1 2 3 L L L L L L 1 ∞ 1 ∞ 1 ∞ OCP -16.59 -14.15 -7.55 -3.48 -4.83 -2.36 OSP with N = ∞ -7.43 -3.38 -7.27 -3.66 -4.84 -2.28 with N = 60 -6.22 -3.46 -8.01 -3.94 -4.99 -2.66 Notes: L is the average and L is the maximum of the residuals in absolute terms (log 10 1 ∞ units) based on a 100,000-period stochastic simulation. 34

D A detailed account of the model with a static Phillips curve In this section, we provide a detailed account of the dynamics of the model with a static Phillips curve from the vantage point of the policy functions. There are two purposes. First, seeing the dynamics of the economy in this way helps us better understand what’s happening in themodel. Second, it helps usto understand why the dynamics of theeconomyare history independent when N is sufficiently small and why it is possible to solve the model in this case in an alternative way that we later describe in Appendix E. We can accomplish the first with the baseline calibration of the main text, but cannot accomplish the second because the time-iteration does not converge for any small values of N consistent with history independent dynamics. Accordingly, we will use an alternative calibrationinwhichthetime-iterationmethodconvergesforsomevaluesofN consistentwith history independent dynamics. In this alternative calibration, p = 0.2/100 and p = 0.5. H L The values for other parameters are the same as the baseline values from the main text. We first describe the dynamics of the economy under the optimal commitment policy (OCP). We then describe the dynamics of the economy under optimal sustainable policies (OSPs) with N = ∞ and N = 120. D.1 Optimal commitment policy Suppose that the economy is initially at its risky steady state at time 0 where the policy rate is positive and the Lagrange multiplier is zero. The economy falls into the crisis state at time 1 and stays there until time 4. The economy is back in the normal state from time 5 on. Figure 10: Dynamics with an alternative calibration Policy rate (ann. %) Output gap (%) Inflation (ann. %) 3 4 1 2 0.5 2 0 0 -2 1 OCP -4 -0.5 OSP w/ N= OSP w/ N=120 -6 ODP 0 -1 0 5 10 0 5 10 0 5 10 Note: ODP, OCP, and OSP stand for optimal discretionary policy, optimal commitment policy, and optimal sustainable policy, respectively. The policy rate and the inflation rate are expressed in annualized percent. The output gap is expressed in percent. The economy’s dynamics under the OCP in this recession scenario are shown by the solid grey lines in Figure 10. Consistent with what we saw in the main body of the paper, the central bank keeps the policy rate at the ELB even after the crisis shock disappears and engineers an overheating of the economy. The solid black lines in Figure 11 are the value and policy functions associated with the OCP.TheblackdotsinFigure11tracethedynamicsoftheeconomyinthisrecessionscenario along the value and policy functions. The dynamics of the economy from time 1 to time 4 35

are governed by the crisis-state value and policy functions shown in the right panel. As the economystaysinthecrisisstate, theLagrange multiplier—thesoleendogenousstatevariable ofthemodel—increases. Thepaceoftheincreasedeceleratesthelongertheeconomyisinthe crisis state. If the economy were to stay in the crisis state forever, the Lagrange multiplier converges to a finite value. The evolution of output in the crisis state mirrors that of the Lagrange multiplier. As the Lagrange multiplier increases, output increases. If the economy were to stay in the crisis state forever (which is a zero probability event), output converges to a finite value, just as the Lagrange multiplier converges to a finite value. Figure 11: Value/policy functions —OCP and ODP— 5 0 0 5 10 15 tuptuO s=s s=s h l 0 OCP ODP -5 0 5 10 15 0 -0.5 -1 0 5 10 15 etar yciloP 0 -0.5 -1 0 5 10 15 10 5 0 0 5 10 15 egnargaL reilpitlum 30 20 10 0 0 5 10 15 0 -50 -100 0 5 10 15 -1 eulaV -50 -100 0 5 10 15 -1 Note: ODP and OCP stand for optimal discretionary policy and optimal commitment policy, respectively. The policy rate and the inflation rate are expressed in annualized percent. The output gap is expressed in percent. Once the economy is back in the normal state, the dynamics of the economy are governed bythenormal-statevalueandpolicyfunctionsshownintheleftpanel. Inthefirstperiodback in the normal state, the lagged Lagrange multiplier is positive, which implies positive output at time 1. As the time continues, the Lagrange multiplier gradually declines, eventually returns to zero. Together with the Lagrange multiplier, output declines as the normal state continues, eventually returning to zero. 36

D.2 Optimal sustainable policies The economy’s dynamics under the OSPs with N = ∞ and N = 120 in this recession scenario are shown by the solid and dash-dotted black lines in Figure 10. For the set of parameter values considered in this section, the sustainability constraint binds right after the crisis state ends even under the OSP with N = ∞, shortening the ELB duration and limiting the size of output and inflation overshoots. With N = 120, the ELB duration is shorter and the size of the overshoot is smaller than with N = ∞. Thevalue/policyfunctionsassociatedwiththeOSPswithN = ∞andN = 120areshown bythesolidblacklinesinFigure12and13, respectively. Thedynamicsoftheeconomyunder the OSPs in this recession scenario are traced by the black dots in these figures. Figure 12: Value/policy functions —OSP with N = ∞— 5 0 0 5 10 15 tuptuO s=s s=s h l 0 OCP OSP w/ N= ODP -5 0 5 10 15 0 -0.5 -1 0 5 10 15 etar yciloP 0 -0.5 -1 0 5 10 15 10 5 0 0 5 10 15 egnargaL reilpitlum 30 20 10 0 0 5 10 15 0 -50 -100 0 5 10 15 -1 eulaV -50 -100 0 5 10 15 -1 Note: ODP, OCP, and OSP stand for optimal discretionary policy, optimal commitment policy, and optimal sustainable policy, respectively. The policy rate and the inflation rate are expressed in annualized percent. The output gap is expressed in percent. Under the OSP with N = ∞, the crisis-state Lagrange multiplier increases as the crisis state persists, as seen in the right column of Figure 12. The crisis-state output increases with the Lagrange multiplier, but only when the lagged Lagrange multiplier is very small. Otherwise, the crisis-state policy function for output is flat. Once the economy returns to the normal state, the dynamics of the economy are governed by the normal-state value and policy functions shown in the left column of Figure 12. The sustainability constraint binds if the lagged Lagrange multiplier is sufficiently large. For those values of the lagged Lagrange 37

multiplier, the value and policy functions are flat. As a result, the Lagrange multiplier takes a low positive value in the first period of the normal state. Thus, the Lagrange multiplier declines quickly under the OSP with N = ∞ than under the OCP. Figure 13: Value/policy functions —OSP with N = 120— 5 0 0 5 10 15 tuptuO s=s s=s h l 0 OCP OSP w/ N=120 ODP -5 0 5 10 15 0 -0.5 -1 0 5 10 15 etar yciloP 0 -0.5 -1 0 5 10 15 10 5 0 0 5 10 15 egnargaL reilpitlum 30 20 10 0 0 5 10 15 0 -50 -100 0 5 10 15 -1 eulaV -50 -100 0 5 10 15 -1 Note: ODP, OCP, and OSP stand for optimal discretionary policy, optimal commitment policy, and optimal sustainable policy, respectively. The policy rate and the inflation rate are expressed in annualized percent. The output gap is expressed in percent. Under the OSP with N = 120, the crisis-state Lagrange multiplier increases as the crisis state persists, as seen in the right column of Figure 13. However, the policy function for output is flat so that output is constant in the crisis state. When the economy returns to the normalstate, thedynamicsoftheeconomyaregovernedbythenormal-statevalueandpolicy functions shown in the left column of Figure 12. The normal-state value and policy functions are qualitatively similar to those under the OSP with N = ∞. However, one interesting feature of the economy under the OSP with N = 120 is that the value of the lagged Lagrange multiplier in the first period of the normal state does not depend on the realized crisis shock duration. No matter how long the crisis shock lasts, the Lagrange multiplier takes a certain value given by the flat part of the normal-state policy function for the Lagrange multiplier. As a result, post-crisis dynamics do not depend on the realized crisis shock duration. In other words, the OSP with N = 120 is history independent. In this case, the dynamics of the economy in the normal state can be fully characterized by a finite sequence of output, policy rate, Lagrange multiplier, and the value. Notethatthekeydifferencebetweenthehistorydependentandhistoryindependentcases 38

iswhetherthevalueoftheLagrangemultiplierinthe first period of the crisis state isaboveor below the value of the lagged Lagrange multiplier above which the sustainability constraint binds in the normal state. This threshold value of the lagged Lagrange multiplier is indicated by thin vertical lines in the left panels of Figure 12 and 13. If the value of the Lagrange multiplier in the first period of the crisis state is above the thresholdvalueofthelaggedLagrangemultiplier,thenthevalueoftheLagrangemultiplierin anyperiodofthecrisisstate isalsoabovethethresholdvalueofthelaggedLagrangemultiplier becausetheLagrangemultiplierincreasesasthecrisispersists. Inthiscase,whentheeconomy returns to the normal state, output, inflation, value, policy rate, and the Lagrange multiplier in the first period of the normal state will be given by the value of the normal state policy function in the flat region where the sustainability constraint binds, regardless of how long the crisis lasts. Thus, the allocations in the first period back in the normal state do no depend on the realized crisis shock duration. If the Lagrange multiplier in the first period of the normal state does not the realized crisis duration, the dynamics of the economy will not depend on the realized crisis shock duration. If the value of the Lagrange multiplier in the first period of the crisis state is below the thresholdvalueofthelaggedLagrangemultiplier, thentheallocationsinthefirstperiodback in the normal state depend will differ depending on whether the realized crisis duration is one period or longer. This is because the normal-state value and policy functions are not in the flat region. In this case, the economys dynamics exhibit history dependence. The higher the threshold value of the lagged Lagrange multiplier is relative to the value of the Lagrange multiplier in the first period of the crisis state, the more history dependent the economys dynamics become. E Alternative solution method when N is low The previous section describes why the dynamics are history independent when N is sufficiently small and suggests that the dynamics of the economy are fully characterized by a vector of scalars satisfying a system of certain nonlinear equations and inequality constraints. As a result, it is not necessary to rely on the time-iteration method to compute the dynamics of the model. In this section, we provide the details of the solution algorithm we use when the economy’s dynamics are history dependent. The big picture of the solution algorithm is as follows. Assuming that the additional ELB duration, denoted by τ, is k, compute the allocation satisfying the model’s equilibrium conditionsthatarecapturedbyequalityconstraints. Check(i)whetherthemodel’sinequality constraints (ELB and sustainability constraints) are satisfied and (ii) whether the history independence assumption is indeed valid. If these two requirements are met, then we have foundthesolution. Ifnot,continuetosearchforthevalueofτ thatsatisfiesalltheequilibrium conditions, both those represented by equality and inequality constraints. As discussed in Section 2, this alternative solution method works in the model with a staticPhillips curve, but notinthe modelwitha forward-lookingPhillipscurve. As analyzed in Nakata and Schmidt (Forthcoming), in the model with a forward-looking Phillips curve, when the crisis shock is recurring, inflation and the output gap in the normal state are not zero even after the effect of a past crisis dissipated. However, in the model with a static Phillips curve, inflation and the output gap converge to zero in the normal state once the effect of a past crisis shock dissipates (because there is no trade-off between inflation and output stabilization). The alternative solution method we describe below takes advantage of 39

this feature of the model with a static Phillips curve.23 Below, we first describe the set of equality and inequality constraints the solution has to satisfy given τ. We then describe how to verify that the history independence assumption is satisfied. Case 1: τ = 0 (liftoff occurs immediately after the crisis shock disappears) Wedescribehowtofindthesolutiontothemodeliftheeconomy’sdynamicsarehistoryindependentandthepolicyrateisabovetheELBrightafterthecrisisshockdisappears. Inthis case,thedynamicsoftheeconomyarefullycharacterizedby{i ,i ,i ,y ,y ,y ,v ,v ,v } H M L H M L H M L where the subscript denote one of the three states of this economy: • In L state, the crisis shock is present. Only the ELB constraint is binding. • In M state, the crisis shock is absent. Only the sustainability constraint is binding. This state follows L state. • In H state, the crisis shock is absent. No constraint is binding. By definition of this case, the following equality and inequality constraints must be satisfied: i > −r∗ H i > −r∗ M i = −r∗ L and v > v (v ,v ,y ,y ) H Punish,H H L H L v = v (v ,v ,y ,y ) M Punish,H H L H L v > v (v ,v ,y ,y ) L Punish,L H L H L where v (v ,v ,y ,y ) and v (v ,v ,y ,y ) are given by: Punish,H H L H L Punish,L H L H L vi−1 = β (cid:2) (1−p )vi +p vi (cid:3) , D,H H D,H H D,L vi−1 = −y2 +β (cid:2) (1−p )vi +p vi (cid:3) , D,L D,L L D,H L D,L yi−1 =−σ−1(−r∗−s )+(1+σ−1κ) (cid:2) (1−p )yi +p yi (cid:3) , D,L L L D,H L D,L for i = 1,...,K, where (vK ,vK ,yK ,yK ) = (v ,v ,y ,y ) and v = v0 . K is D,H D,L D,H D,L H L H L Punish,H D,H 23Notethatwecouldalsousethisalternativesolutionmethodtothemodelwithperfectlystickyprices—in which the inflation rate is zero at any time and after any history of shocks—. In the model with perfectly sticky prices, there is no inflation-output trade-off and the output gap eventually converges to zero once the effect of a past crisis dissipates. 40

the length of punishment. In this case, the equilibrium conditions are given by: y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(i −s ), H H H H L H H y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(i −s ), M H H H L M H y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(−r∗−s ), L L M L L L v = −y2 +β[(1−p )v +p v ], H H H H H L v = −y2 +β[(1−p )v +p v ] = v (v ,v ,y ,y ), M M H H H L Punish,H H L H L v = −y2 +β[(1−p )v +p v ], L L L M L L where we used i = −r∗. Because the central bank only cares about output, y = 0. Thus, L H i = σ(1+σ−1κ)p y +s , H H L H i = σ(1+σ−1κ)p y +s −σy , M H L H M y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(−r∗−s ), L L M L L L v = β[(1−p )v +p v ], H H H H L v = −y2 +β[(1−p )v +p v ], M M H H H L v = −y2 +β[(1−p )v +p v ], L L L M L L v = v (v ,v ,y ,y ) M Punish,H H L H L There are 7 equations and 7 unknowns: (i ,i ,y ,y ,v ,v ,v ). H M M L H M L Guess (y ,v ). Then, L L • From the first three equations, we can determine (i ,i ,y ). H M M • From the fifth and sixth equations, we can determine (v ,v ). H M We need to check whether the fourth and seventh equations hold: v = β[(1−p )v +p v ], H H H H L v = v (v ,v ,y ,y ) M Punish,H H L H L That is, we can basically reduce the system to two-unknowns in two equations. Once you solve the system of equations, we need to verify the following four inequalities: i > −r∗ H i > −r∗ M v > v (v ,v ,y ,y ) H Punish,H H L H L v > v (v ,v ,y ,y ) L Punish,L H L H L When τ = 0, the Lagrange multiplier is zero in the first period after the crisis shock disappears. Thus, the dynamics of the economy cannot depend on the realized duration of the shock. That is, the economy’s dynamics are history independent by construction. Case 2: τ > 0 (liftoff occurs at least two periods after the crisis shock disappears) 41

In this section, we discuss how to solve for cases in which the policy rate stays at the ZLB for at least one period after the crisis shock disappears: • In L state, the crisis shock hits the economy. Only the ZLB constraint is binding. • In M state, the crisis shock is absent. Both the ZLB and sustainability constraints are 1 binding. This state follows L state. • In M state from i = 2 to i = τ−1, the crisis shock is absent. Only the ZLB constraint i is binding. This state follows M state. i−1 • In M state, the crisis shock is absent. No constraint is binding. This state follows τ M state. τ−1 • In H state, the crisis shock is absent. No constraint is binding. And, output gap is zero. By construction, the following equality and inequality constraints must be satisfied: i > −r∗ H i > −r∗ M,τ i = −r∗ M,τ−1 ... i = −r∗ M,1 i = −r∗ L and v > v (v ,v ,y ,y ) H Punish,H H L H L v > v (v ,v ,y ,y ) M,τ Punish,H H L H L ... v > v (v ,v ,y ,y ) M,2 Punish,H H L H L v = v (v ,v ,y ,y ) M,1 Punish,H H L H L v > v (v ,v ,y ,y ) L Punish,L H L H L where v (v ,v ,y ,y ) and v (v ,v ,y ,y ) are given by: Punish,H H L H L Punish,L H L H L vi−1 = β (cid:2) (1−p )vi +p vi (cid:3) , D,H H D,H H D,L vi−1 = −y2 +β (cid:2) (1−p )vi +p vi (cid:3) , D,L D,L L D,H L D,L yi−1 =−σ−1(−r∗−s )+(1+σ−1κ) (cid:2) (1−p )yi +p yi (cid:3) , D,L L L D,H L D,L for i = 1,...,K, where (vK ,vK ,yK ,yK ) = (v ,v ,y ,y ) and v = v0 . K is D,H D,L D,H D,L H L H L Punish,H D,H the length of punishment. In this case, the equilibrium conditions are given by: 42

y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(i −s ), H H H H L H H y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(i −s ), M,τ H H H L M,τ H y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(−r∗−s ), M,τ−1 H M,τ H L H ... y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(−r∗−s ), M,k H M,k+1 H L H ... y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(−r∗−s ), M,1 H M,2 H L H y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(−r∗−s ), L L M,1 L L L and v = −y2 +β[(1−p )v +p v ], H H H H H L v = −y2 +β[(1−p )v +p v ], M,τ M,N H H H L v = −y2 +β[(1−p )v +p v ], M,τ−1 M,τ−1 H M,τ H L ... v = −y2 +β[(1−p )v +p v ], M,k M,k H M,k+1 H L ... v = −y2 +β[(1−p )v +p v ], M,2 M,2 H M,3 H L v = −y2 +β[(1−p )v +p v ], M,1 M,1 H M,2 H L v = −y2 +β[(1−p )v +p v ], L L L M,1 L L v = v (v ,v ,y ,y ) M,1 Punish,H H L H L Because the central bank only cares about output stabilization, y = 0. Using y = 0, we H H obtain i = σ(1+σ−1κ)p y +s , H H L H i = σ(1+σ−1κ)p y +s −σy , M,τ H L H M,τ y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(−r∗−s ), M,τ−1 L M,τ L L H ... y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(−r∗−s ), M,k L M,k+1 L L H ... y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(−r∗−s ), M,2 L M,3 L L H y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(−r∗−s ), M,1 L M,2 L L H y = (1+σ−1κ)[(1−p )y +p y ]−σ−1(−r∗−s ), L L M,1 L L L 43

and v = β[(1−p )v +p v ], H H H H L v = −y2 +β[(1−p )v +p v ], M,τ M,τ H H H L v = −y2 +β[(1−p )v +p v ], M,τ−1 M,τ−1 H M,τ H L ... v = −y2 +β[(1−p )v +p v ], M,k M,k H M,k+1 H L ... v = −y2 +β[(1−p )v +p v ], M,2 M,2 H M,3 H L v = −y2 +β[(1−p )v +p v ], M,1 M,1 H M,2 H L v = −y2 +β[(1−p )v +p v ], L L L M,1 L L v = v (v ,v ,y ,y ) M,1 Punish,H H L H L We are solving for (y ,i ,i ), {y }τ , (v ,v ) and {v }τ . 2τ +5 unknowns in L H M,τ M,i i=1 H L M,i i=1 2τ +5 equations. Guess (y ,v ). Then, L L • From the first τ +2 equations, we can determine (i ,i ) and {y }τ . H M,τ M,i i=1 • From the equations for v and v (τ + 1 equations), we can determine v and M,i L H {y }τ . M,i i=1 Then, we need to check whether the following two equations hold: v = β[(1−p )v +p v ], H H H H L v = v (v ,v ,y ,y ) M Punish,H H L H L That is, we can basically reduce the system to two unknowns in two equations. Once you solve the system of equations, we need to verify the following N +3 inequalities: i > −r∗ H i > −r∗ M,N v > v (v ,v ,y ,y ) H Punish,H H L H L v > v (v ,v ,y ,y ) M,τ Punish,H H L H L ... v > v (v ,v ,y ,y ) M,2 Punish,H H L H L v > v (v ,v ,y ,y ) L Punish,L H L H L Iftheseinequalityconstraintsaresatisfied, thenwehavetherightτ andproceedtoverifythe history independence assumption. Otherwise, we adjust τ and solve the system of equations described above again. Finally, we need to check whether the dynamics of the economy are indeed history independent. As we saw in the previous section, whether the history independence assumption is valid depends on whether the value of the Lagrange multiplier in the first period of the crisis state is larger than a certain threshold value. While we need to know policy functions to know the exact value of the threshold value, we can compute an approximate threshold 44

value by extrapolating the implicit mapping from the lagged Lagrange multiplier to today’s Lagrange multiplier implied by the solution computed via the alternative solution method. If the value of the Lagrange multiplier in the first period of the crisis state is larger than the approximated threshold, we conclude that the history independence assumption is satisfied. In our model, we find that the history independence assumption is satisfied for any values of N below 27. F Tenure duration of leadership at central banks Table 6: Average Tenure Duration of Chairpersons in Select Central Banks Year of No. of leaders No. of leaders Average tenure Average tenure Central Bank foundation since foundation since 1946 since foundation since 1946 Federal Reserve System 1914 16 10 6.9 8.1 European Central Bank 1998 3 3 6.5 6.5 Bank of Canada 1934 9 8 9.9 9.7 Bank of Japan 1882 31 15 4.4 4.9 Bank of England 1694 120 9 2.7 8.5 Sveriges Riksbank 1901 14 11 8.1 6.1 Swiss National Bank 1907 14 10 8.1 7.4 Table 7: Maximum Tenure Duration of Chairpersons in Select Central Banks Max duration Max duration Central Bank since foundation since 1946 Federal Reserve System 18 yrs and 10 months (Martin) 18 yrs and 10 months (Martin)* European Central Bank 8 (Trichet) 8 (Trichet) Bank of Canada 20 yrs and 4 months (Towers) 14 (Boey) Bank of Japan 8 yrs and 6 months (Ichimada) 8 yrs and 6 months (Ichimada) Bank of England 24 (Norman) 12 (Cobbold) Sveriges Riksbank 19 (Rooth) 18 (Asbrink) Swiss National Bank 14 (Bachmann) 11 (Leutwiler) Note: The tenure of Alan Greenspan lasted for 18 years and 6 months. G Time-inconsistency of the commitment policy in the words of policymakers The time-inconsistency of the commitment policy at the ELB is not a mere theoretical curiosity. Policymakers in many central banks have pointed out the potential timeinconsistency of the commitment-type forward guidance policy. Some have argued that the time-inconsistency is one key reason for why most central banks refrained from making the overheating commitment. Below are some examples: G.1 Bean (2013) “In particular, we signalled our intention not to countenance tightening policy until unemployment has fallen to at least 7 percent.” 45

“This guidance is intended primarily to clarify our reaction function and thus make policy more effective, rather than to inject additional stimulus by precommitting to a time-inconsistent lower for longer’ policy path in the manner of Woodford (2012). While such a time-inconsistent policy may be desirable in theory, in an individualistic committee like ours, with a regular turnover of members, it is not possible to implement a mechanism that would credibly bind future members in the manner required.” G.2 Bullard (2013) “The New Keynesian, sticky price literature has been influential in U.S. monetary policymaking. The literature has been led by Michael Woodford. This line of research argues that policy accommodation can be provided even when the policy rate is near zero. The extra accommodation comes from a promise to maintain the near zero policy rate into the future, beyondthepointwhenordinarypolicymakerbehaviorwouldcallforanincreaseinthepolicy rate. This promise must be credible to have an impact. The “Woodford period” approach to forward guidance relies on a credible announcement made today that future monetary policy will deviate from normal. The central bank does not actually behave differently today. One might argue that such an announcement is unlikely to be believed. Why should future monetary policy deviate from normal once the economy is growing and inflation is rising? But if the announcement is not credible, then the private sector will not react with more consumption and investment today. That is, any effects would be minimal.” G.3 Carney (2012) “Today, to achieve a better path for the economy over time, a central bank may need to commit credibly to maintaining highly accommodative policy even after the economy and, potentially, inflation picks up. Market participants may doubt the willingness of an inflation-targeting central bank to respect this commitment if inflation goes temporarily above target. These doubts reduce the effective stimulus of the commitment and delay the recovery.” G.4 Clarida (2019) “The benefits of the makeup strategies rest heavily on households and firms believing in advance that the makeup will, in fact, be delivered when the time comes–for example, that a persistent inflation shortfall will be met by future inflation above 2 percent. As is well known from the research literature, makeup strategies, in general, are not time consistent because when the time comes to push inflation above 2 percent, conditions at that time will not warrant doing so. Because of this time inconsistency, any makeup strategy, to be successful, would have to be understood by the public to represent a credible commitment. That important real-world consideration is often neglected in the academic literature, in which central bank “commitment devices” are simply assumed to exist and be instantly credible on decree. Thus, one of the most challenging questions is whether the Fed could, in practice, attain 46

the benefits of makeup strategies that are possible in models.” G.5 Cœur´e (2013) “Mostnotably, thecentralbankmaytrytoconvincemarketsthatitwouldkeepinterestrates low, even if this would imply inflation well above its previous objective, at least temporarily. The promise of higher future inflation, if credible, induces private agents to substitute future for current consumption, hence providing additional stimulus today. This type of forward guidance is closer to the academic concept of forward guidance in the strict sense—as discussed, for example, in Woodford (2012). The main challenge of such guidance is its inherent inconsistency over time and thus lack of credibility. When the time comes, the central bank may be tempted to deviate from its prior commitment: once the benefits of higher inflation expectations in terms of front-loaded spending have been reaped, the central bank may not be willing to pay the bill in terms of higher inflation afterward. If the public foresees this temptation, expectations might remain unaffected in the first instance and the desired inter-temporal substitution of spending might not materialise. This is a possible explanation why, in practice, central banks have refrained from using forward guidance in a way that implies a major change in strategy. Therefore, central banks’ forward guidance has rather aimed at providing greater clarity on the reaction function and the assessment of future economic conditions.” G.6 Dudley (2013) “With respect to forward guidance, it is important to distinguish between two specific forms thatthisguidancemaytake. Inthefirstformthecentralbankprovidesitsforecastforthefuture path of the policy rate and, possibly, some sense of the degree of uncertainty around this path. Inthe second, thecentralbank pre-commits toa specific future path forits policy rate. Providing a forecast for the policy rate by itself does not create any budget or reputational risk for the Federal Reserve, so I generally do not see the first form of forward guidance as posing much risk to central bank independence. The second form of forward guidance—pre-commitment to a policy rate path— could create more risk for the central bank. In particular, consider a scenario in which the central bank decided to increase monetary accommodation by committing to maintain a low short-term interest rate for a long time even if this commitment resulted in inflation overshooting the central bank’s objective in the future. I could see how this could create a potential threat to the central bank’s independence. That is because the commitment could force the central bank in the future to conduct monetary policy in a way that was inconsistent with the inflation portion of its mandate. Although this second form of forward guidance could create greater risk for the central bank with respect to its future independence, this is not a policy that has been adopted by the Federal Reserve. There are implementation challenges with this approach. In particular, it is difficult for a monetary policy committee today to institutionally bind 47

future monetary policy committees to follow actions that could conflict with their objectives in the future. Without such a credible forward commitment, such policies would likely be ineffective in affecting expectations in the manner needed to provide additional monetary policy accommodation.” G.7 George (2019) “Third, a price-level targeting strategy is time inconsistent unless policymakers can credibly commit to following it. If the goal is to have inflation of 2 percent on average, a period of below 2 percent inflation would require an equal period of inflation above 2 percent. But once inflation has moved up to 2 percent, policymakers might be tempted to renege on their prior commitment and not allow inflation to go higher. This would undermine the future credibility of the price-level targeting strategy. To the extent the public understood this time inconsistency problem, price-level targeting would not be credible to begin with, absent a commitment device. With regular turnover among members of the FOMC, it would be difficult for one Committee to commit a future Committee to a particular course of action.” G.8 Lacker (2013) “Designing such conditional guidance involves trade-offs, however. Credibility requires consistency, over time, between a central bank’s statements and its actual subsequent actions. A central bank’s statements will have greater immediate effect on the public’s expectations the more they are seen as limiting the central bank’s future choices. Yet there are likely to be circumstances, ex post, in which the central bank feels constrained by past statements. Yieldingtothetemptationtoimplicitlyrenegebyreworkingdecisioncriteriaorcitingunforeseen economic developments may have short-term appeal, but widely perceived discrepancies between actual and foreshadowed behavior will inevitably erode the faith people place in future central bank statements. So central banks face an ex ante trade-off, as well, between the short-run value of exercising discretion and the ability to communicate effectively and credibly in the future.” G.9 Plosser (2013) “Note,however,thatthecentralbank’sabilitytoinfluencethepublic’sbeliefaboutthefuture path of policy and the economy depends critically on the bank’s commitment to that policy path and the credibility of that commitment in the eyes of the public. The public must believe that even after the economy begins to strengthen, the central bank will hold rates lower than it otherwise might have found desirable to do had it not been at the zero bound in the past.” G.10 Powell (2019) “By the time of the crisis, there was a well-established body of model-based research suggesting that some kind of makeup policy could be beneficial. In light of this research, one might ask why the Fed and other major central banks chose not to pursue such a policy. The answer lies in the uncertain distance between models and reality. For makeup strategies to achieve their stabilizing benefits, households and businesses must be quite confident that the 48

“makeup stimulus” is really coming. This confidence is what prompts them to raise spending and investment in the midst of a downturn. In models, confidence in the policy is merely an assumption. In practice, when policymakers considered these policies in the wake of the crisis, they had major questions about whether a central bank’s promise of good times to come would have moved the hearts, minds, and pocketbooks of the public. Part of the problem is that when the time comes to deliver the inflationary stimulus, that policy is likely to be unpopular–what is known as the time consistency problem in economics.” G.11 Ueda (2013) “If Max (the Taylor rule rate, zero) describes the usual central bank’s reaction function to the macroeconomic environment, the central bank can generate easing effects by offering a new reaction function to the market with a promise of a longer period at the zero rate than theaboverulesuggests. To the extent that the Taylor rule represents an optimal response of the central bank to macroeconomic environment, however, this forward guidance strategy amounts to “irresponsible” central bank behaviour. In other words, the strategy is time-inconsistent. This means that when the economy no longer requires a zero rate, it is better to raise the interest rate, reneging on the promise made. If people foresaw this ex ante, however, the strategy would become ineffective. Thus, the central bank would be sending a confusing signal if it was using forward guidance in this sense and insisted that it was still behaving in a “responsible” way. Also, the central bank does not seem to get much mileage out of a vague promise, such as the maintenance of a low policy rate “for an extended period,” unless there is much confusion in the market as to where the policy rate would go in the short term. The BOJ seems to have faced the time-inconsistency problem in 2000.” G.12 Williams (2012) “Although forward policy guidance has proven to be a very useful policy tool, it’s not a perfect substitute for the kind of monetary stimulus that comes from lower interest rates. Oneissueisthat, fortheforwardguidancepolicytoworkasdesired, thepublichastobelieve thattheFOMCwillreallycarryoutthepolicyasitsaysitwill. But,theFeddoesn’thavethe abilitytotieitshandsthatway. ThispointwasmadebyFinnKydlandandEdwardPrescott in the late 1970s. Let me explain. For forward policy guidance to have its maximum effect, the Fed must commit to keeping the short-term policy rate lower than it otherwise would to compensate for the fact that the short-term interest rate cannot be lowered today. But when the time comes to carry out the commitment made in its forward guidance, it may no longer want to do so. For instance, it might be hard to resist raising rates earlier than promised to head off an increase in inflation. So, even when central bankers say they will keep rates unusually low for a set time, the public may worry that the central bank will raise rates earlier to fight budding inflation pressures.” 49

Cite this document
APA
Taisuke Nakata and Takeki Sunakawa (2019). Credible Forward Guidance (FEDS 2019-037). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2019-037
BibTeX
@techreport{wtfs_feds_2019_037,
  author = {Taisuke Nakata and Takeki Sunakawa},
  title = {Credible Forward Guidance},
  type = {Finance and Economics Discussion Series},
  number = {2019-037},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2019},
  url = {https://whenthefedspeaks.com/doc/feds_2019-037},
  abstract = {We analyze credible forward guidance policies in a sticky-price model with an effective lower bound (ELB) constraint on nominal interest rates by solving a series of optimal sustainable policy problems indexed by the duration of reputational loss. Lower-for-longer policies---while effective in stimulating the economy at the ELB---are potentially time-inconsistent, as the associated overheating of the economy in the aftermath of a crisis is undesirable ex post. However, if reneging on a lower-for-longer promise leads to a loss of reputation and prevents the central bank from effectively using lower-for-longer policies in future crises, these policies can be time-consistent. We find that, even without an explicit commitment technology, the central bank can still credibly keep the policy rate at the ELB for an extended period---though not as extended under the optimal commitment policy---and meaningfully mitigate the adverse effects of the ELB constraint on economy activity. Accessible version (.zip)},
}