feds · November 30, 2005

Robustifying Learnability


Finance and Economics Discussion Series
Divisions of Research & Statistics and Monetary Affairs
Federal Reserve Board, Washington, D.C.

Robustifying Learnability
Robert J. Tetlow and Peter von zur Muehlen
2005-58

NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

Robert J. Tetlow∗ and Peter von zur Muehlen†

November 2005

Abstract

In recent years, the learnability of rational expectations equilibria (REE) and determinacy of economic structures have rightfully joined the usual performance criteria among the sought-after goals of policy design. Some contributions to the literature, including Bullard and Mitra (2001) and Evans and Honkapohja (2002), have made significant headway in establishing certain features of monetary policy rules that facilitate learning. However, a treatment of policy design for learnability in worlds where agents have potentially misspecified their learning models has yet to surface. This paper provides such a treatment. We begin with the notion that because the profession has yet to settle on a consensus model of the economy, it is unreasonable to expect private agents to have collective rational expectations. We assume that agents have only an approximate understanding of the workings of the economy and that their learning the reduced forms of the economy is subject to potentially destabilizing perturbations. The issue is then whether a central bank can design policy to account for perturbations and still assure the learnability of the model. Our test case is the standard New Keynesian business cycle model. For different parameterizations of a given policy rule, we use structured singular value analysis (from robust control theory) to find the largest ranges of misspecifications that can be tolerated in a learning model without compromising convergence to an REE. In addition, we study the cost, in terms of performance in the steady state, of a central bank that acts to robustify learnability on the transition path to REE. (Note: This paper contains full-color graphics.)

JEL Classifications: C6, E5.
Keywords: monetary policy, learning, E-stability, learnability, robust control.

∗ Contact author: Robert Tetlow, Federal Reserve Board, 20th and C Streets, NW, Washington, D.C. 20551. Email: rtetlow@frb.gov. We thank George Evans, Steve Durlauf, John C. Williams and participants at a conference on learning and monetary economics at UC-Santa Cruz for helpful comments, and Brian Ironside and Sean Taylor for help with the charts. The views expressed in this paper are those of the authors alone and do not represent those of the Board of Governors of the Federal Reserve System or other members of its staff. This paper and others may be found at http://www.roberttetlow.com
† von zur Muehlen & Associates, Vienna, VA 22181. E-mail: pvzmuehlen@cox.net

1 Introduction

It is now widely accepted that policy rules, and in particular monetary policy rules, should not be chosen solely on the basis of their performance in a given model of the economy. There is simply too much uncertainty about the true structure of the economy to warrant taking the risk of so narrow a criterion for selection. Rather, policy should be designed to operate "well" in a wide range of models. There has been substantial progress in a relatively short period of time in the literature on robustifying policy. The first strand of the literature examines the performance of rules given the presence of measurement errors in either model parameters or unobserved state variables.¹ The second strand focuses on comparing rules in rival models to see if their performance spans reasonable sets of alternative worlds.² The third considers robustifying policy against unknown alternative worlds, usually by invoking robust control methods.³

¹ Brainard [4] is the seminal reference. Among the many, more recent references in this large literature are Sack [40], Orphanides et al. [38], Soderstrom [42] and Ehrmann and Smets [16].
² See, e.g., Levin et al. [28] and [29].
³ Hansen and Sargent [26] and [27], Tetlow and von zur Muehlen [46] and Coenen [13]. These strands of the robustness literature are named in the text in chronological order but the three methods should be seen as complementary rather than substitutes.

At roughly the same time, another literature was developing on the learnability (or E-stability) of models.⁴ The learnability literature takes a step back from rational expectations and asks whether the choices of uninformed private agents could be expected to converge on a rational expectations equilibrium (REE) as the outcome of a process of learning. Important early papers in this literature include Bray [5], Bray and Savin [6] and Marcet and Sargent [33]. Evans and Honkapohja summarize some of their many contributions to this literature in their book [17]. The question arises: could monetary policy help or hinder private agents in learning the REE?

⁴ In this paper, as in most of the rest of the literature, the terms learnability, E-stability and stability under learning will all be used interchangeably. These terms are distinct from stable (without the "E-" or "under learning" added), which should be taken to mean saddle-point stable. The terms saddle-point stable, determinate and regular are taken as equivalent adjectives describing equilibria.

The common features of the robust policy literature include, first, that it is the government that does not understand the true structure of the economy, and second, that the government’s ignorance will not vanish simply with the collection of more data.⁵ By contrast, in the learning literature it is usually the private sector that is assumed not to have the information necessary to form rational expectations, but this situation has at least the prospect of being alleviated with the passage of time and the collection of more data.

⁵ The concept of "truth" is a slippery one in learning models. In some sense, the truth is jointly determined by the deep structural parameters of the economy and what people believe them to be. Only in steady state, and then only under some conditions, will this be solely a function of deep parameters and not of beliefs. Nevertheless, in this paper, when we refer to a "true model" or "truth" we mean the REE upon which successful learning eventually converges.

In this paper, we take the robust policy rules literature and marry it with the learnability literature. Since the profession has been unable to agree on a generally acceptable workhorse model of the economy, it is unreasonable to expect private agents to have rational expectations at all points in time. The most that one can expect is that agents have an approximate understanding of the workings of the economy and that they are on a transition path toward learning the true structure. And as Evans and McGough [21] show, designing policy as if the process of learning has already been completed can result in indeterminacy or unstable equilibria. So we retain the assumption of adaptive learning that is common to most contributions to this literature. If the presumption of rational expectations is questionable, and the hazards of learning cannot be ignored, then from a policy perspective, it follows that the job of facilitating the transition to REE is logically prior to the job of maximizing the performance of the economy once the transition is complete.

In this paper, we consider two issues. The first is how a policy maker might choose policy to maximize the set of worlds inhabited by private agents that are able to learn the REE. The second is an assessment of the welfare cost of assuring learnability in terms of forgone stability in equilibrium. Or, put differently, we measure the welfare cost of learnability insurance. Each of these questions is important. In worlds of model uncertainty, an ill-chosen policy rule (or policy maker) could lead to explosiveness or indeterminacy. At the same time, excessive concern for learnability will imply costs in terms of forgone welfare.

Ours is not the first paper to consider the issue of choosing monetary policies for their ability to deliver determinacy and learnability. Bernanke and Woodford [2] argue that inflation-forecast-based (IFB) policy rules, that is, rules that feed back on forecasts of future output or inflation, can lead to indeterminacy in linear rational expectations (LRE) models. Clarida, Gali and Gertler [11] show that violation of the so-called Taylor principle in the context of an IFB rule may have been the source of the inflation of the 1970s.⁶ Bullard and Mitra [8], in an important paper, show that higher persistence in instrument setting, meaning a large coefficient on the lagged instrument in a Taylor-type rule, can facilitate determinacy in the same class of models. Evans and Honkapohja [19] note similar problems in a wider class of rules and argue for feedback on structural shocks, although questions regarding the observability of such shocks leave open the issue of whether such a policy is implementable. Evans and McGough [21] compute optimal simple rules conditional on their being determinate in rival models. Each of these papers makes an important contribution to the literature, but all are special cases within broader sets of policy choices. In this paper, we follow a somewhat different approach and consider the design of policies to maximize learnability of the economy.

The remainder of the paper is organized as follows. The second section lays out the theory, beginning with a review of the literature on least-squares learnability and determinacy, and following with methods from the robust control literature. Section 3 introduces the models with which we work, beginning with the case of the very simple Cagan model of money demand in hyperinflations and then moving on to the New Keynesian business cycle (NKB) model. For the NKB model, we study the design of time-invariant simple monetary policy rules to robustify learnability of three types: a lagged-information rule, a contemporaneous-information rule and a forecast-based policy rule. We close the section by covering the insurance cost of robustifying learnability. A fourth and final section sums up and concludes.
⁶ Levin et al. [29] and Batini and Pearlman [1] study the robustness properties of different types of inflation-forecast-based rules for their stability and determinacy properties.

2 Theoretical overview

2.1 Expectational equilibrium under adaptive learning

The theory of E-stability or learnability in linear rational expectations models dates back more than 20 years to Bray [5], who showed that agents using recursive least squares would, if the arguments to their regressions were properly specified, eventually converge on the correct REE. This convergence property gave a considerable shot in the arm to rational expectations applications since proponents had an answer to the question "how could people come to have rational expectations?" The theory has been advanced by the work of Marcet and Sargent [33] and Evans and Honkapohja [various]. Our rendition follows Evans and Honkapohja [17], chapters 8-10.

Begin with the following linear rational expectations model:

$$y_t = A + M E_t y_{t+1} + N y_{t-1} + P v_t, \tag{1}$$

where $y_t$ is a vector of $n$ endogenous variables, including, possibly, policy instruments, and $v_t$ comprises all $m$ exogenous variables. Equation (1) is general in that both non-predetermined variables, $E_t y_{t+1}$, and predetermined variables, $y_{t-1}$, are represented. By defining auxiliary variables, e.g., $y^j_t = y_{t+j}$, $j \neq 0$, arbitrarily long (finite) lead or lag lengths can also be accommodated. Finally, extensions to allow lagged expectations formation, e.g., $E_{t-1} y_t$, and exogenous variables are straightforward to incorporate with no significant changes in results.

Next, define the prediction error for $y_{t+1}$ to be $\xi_{t+1} = y_{t+1} - E_t y_{t+1}$. Under rational expectations, $E_t \xi_{t+1} = 0$, so $\xi_{t+1}$ is a martingale difference sequence. Evans and Honkapohja [17] show that for at least one rational expectations equilibrium to exist, the stochastic process, $y_t$, that solves (1) must also satisfy:

$$y_{t+1} = -M^{-1}A + M^{-1} y_t - M^{-1} N y_{t-1} - M^{-1} P v_t + \xi_{t+1}. \tag{2}$$

Equation (2) follows from solving (1) for $E_t y_{t+1}$, which requires $M$ to be invertible, and substituting $E_t y_{t+1} = y_{t+1} - \xi_{t+1}$. We can express (2) as a first-order system:

$$\begin{bmatrix} y_{t+1} \\ y_t \end{bmatrix} = \begin{bmatrix} -M^{-1}A \\ 0 \end{bmatrix} + \begin{bmatrix} M^{-1} & -M^{-1}N \\ I_n & 0 \end{bmatrix} \begin{bmatrix} y_t \\ y_{t-1} \end{bmatrix} + \begin{bmatrix} -M^{-1} \\ 0 \end{bmatrix} P v_t + \begin{bmatrix} I_n \\ 0 \end{bmatrix} \xi_{t+1},$$

or, rewriting:

$$Y_{t+1} = F + B Y_t + C v_t + D \xi_{t+1}, \tag{3}$$

where $Y_t = [y_t, y_{t-1}]'$.

It is then easy to show that when (3) satisfies the Blanchard-Kahn [3] conditions for stability, namely, that the number of characteristic roots of the matrix $B$ of norm less than unity equals the number of predetermined variables (taking $y_t$ to be scalar, this is one), the model is determinate, and there is just one martingale difference sequence, $\xi_{t+1}$, that will render (2) stationary. If there are fewer roots inside the unit circle than there are predetermined variables, the model is explosive, meaning that there is no martingale difference sequence that will satisfy the system. If there are more roots inside the unit circle than there are predetermined variables, the model is said to be indeterminate, and there are infinitely many martingale difference sequences that make (2) saddle-point stable. In the scalar case, the roots of $B$ are determined by the solution to the characteristic equation:

$$\lambda^2 - M^{-1}\lambda + M^{-1}N = 0.$$

Determinacy is one thing; learnability is quite another. As Bullard and Mitra [8] have emphasized, determinacy does not imply learnability, and indeterminacy does not imply a lack of learnability. We can address this question by postulating a representation for the REE that a learning agent might use. For the moment, we consider the minimum state variable (MSV) representation, advanced by McCallum [34]. Let us assume that $v_t$ is observable and follows a first-order stochastic process,

$$v_t = \rho v_{t-1} + \epsilon_t,$$

where $\epsilon_t$ is an iid white noise process. The $\rho$ matrix is assumed to be diagonal. Under these assumptions, we can write the following perceived law of motion (PLM):

$$y_t = a + b y_{t-1} + c v_t. \tag{4}$$

Rewrite equation (1) slightly, and designate expectations formed using adaptive learning with a superscripted asterisk on the expectations operator, $E^*_t$:

$$y_t = A + M E^*_t y_{t+1} + N y_{t-1} + P v_t. \tag{5}$$

Then, leading (4) one period, taking expectations, substituting (4) into the result, and finally substituting into (5), we obtain the actual law of motion (ALM), that is, the model under the influence of the PLM:

$$y_t = A + M(I + b)a + (N + M b^2) y_{t-1} + (M(bc + c\rho) + P) v_t. \tag{6}$$

So the MSV solution will satisfy the mapping from PLM to ALM:

$$A + M(I + b)a = a, \qquad N + M b^2 = b, \qquad M(bc + c\rho) + P = c.$$
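The root-counting rule and the fixed-point conditions above can be checked numerically in the scalar case. The following is a minimal sketch, not the authors' code: the parameter values and the function names `classify_determinacy`, `t_map` and `msv_fixed_point` are illustrative assumptions, and the sketch assumes the quadratic in $b$ has real roots.

```python
import numpy as np

# Illustrative scalar parameters (assumptions, not taken from the paper).
A, M, N, P, RHO = 0.1, 0.5, 0.3, 1.0, 0.5


def classify_determinacy(M, N, n_predetermined=1):
    """Count roots of lambda^2 - lambda/M + N/M = 0 inside the unit circle.

    The labels follow the classification stated in the text above.
    """
    roots = np.roots([1.0, -1.0 / M, N / M])
    n_inside = int(np.sum(np.abs(roots) < 1.0))
    if n_inside == n_predetermined:
        return roots, "determinate"
    if n_inside < n_predetermined:
        return roots, "explosive"
    return roots, "indeterminate"


def t_map(a, b, c, A, M, N, P, rho):
    """Scalar T-mapping from PLM coefficients (a, b, c) to ALM coefficients."""
    return (A + M * (1.0 + b) * a, N + M * b**2, M * (b * c + c * rho) + P)


def msv_fixed_point(A, M, N, P, rho):
    """Solve T(a, b, c) = (a, b, c) in the scalar case.

    Picks the root of M b^2 - b + N = 0 with the smallest modulus and
    assumes that root is real (true for the illustrative values above).
    """
    candidates = np.roots([M, -1.0, N])
    b = float(np.real(candidates[np.argmin(np.abs(candidates))]))
    a = A / (1.0 - M * (1.0 + b))      # from a = A + M(1 + b)a
    c = P / (1.0 - M * (b + rho))      # from c = M(bc + c rho) + P
    return a, b, c


roots, label = classify_determinacy(M, N)
a_star, b_star, c_star = msv_fixed_point(A, M, N, P, RHO)
print("roots:", roots, "->", label)
print("MSV fixed point (a, b, c):", a_star, b_star, c_star)
print("T-map at fixed point:", t_map(a_star, b_star, c_star, A, M, N, P, RHO))
```

In the scalar case the quadratic for $b$ and the characteristic equation share the same roots, so the stationary MSV coefficient coincides with the root of $B$ inside the unit circle.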

Learnability depends then on the mapping of the PLM on to the ALM, defined from (6):

$$T(a, b, c) = [A + M(I + b)a,\; N + M b^2,\; M(bc + c\rho) + P]. \tag{7}$$

The fixed point of this mapping is an MSV representation of an REE, and its convergence is given by the matrix differential equation:

$$\frac{d}{d\tau}(a, b, c) = T(a, b, c) - (a, b, c). \tag{8}$$

Convergence is assured if certain eigenvalue conditions for the following matrix differential equations are satisfied:

$$\frac{da}{d\tau} = A + M(I + b)a - a, \qquad \frac{db}{d\tau} = M b^2 + N - b, \qquad \frac{dc}{d\tau} = M(bc + c\rho) + P - c. \tag{9}$$

As shown by Evans and Honkapohja (2001), the necessary and sufficient conditions for E-stability are that the eigenvalues of the following matrices have negative real parts:

$$DT_a - I = M(I + b) - I,$$
$$DT_b - I = b' \otimes M + I \otimes Mb - I,$$
$$DT_c - I = \rho' \otimes M + I \otimes Mb - I.$$

The important points to take from equations (9) are that the conditions are generally multivariate in nature, meaning that the coefficients constraining the intercept term, $a$, can be conflated with those of the slope term, $b$, and that the coefficients of both the PLM and the ALM come into play. Learnability applications in the literature to date have been confined to very simple, small-scale models where these problems rarely come into play.⁷ In the kind of medium- to large-scale models that policy institutions use, these issues cannot be safely ignored.⁸ Without taking away anything from the important contributions of Bullard and Mitra [8] and Evans and Honkapohja [19], the choice of monetary policy rules must consider not only how they foster learnability in a given model but also whether they do so for the broader class of models within which the true learning model might be found. Similarly, taking as given the true model, the initial beliefs of private agents can affect learnability both through the inclusion and exclusion of states in the PLM and through the initial values attached to parameters. In the context of the above example, values of $a$, $b$, and $c$ that are initially "too far" from equilibrium can block convergence. The choice of a particular policy can shrink or expand the range of values for $a$, $b$, and $c$ that is consistent with E-stability.⁹ This is our concern in this paper: how can a policy maker deal with uncertainty in agents’ learning mechanisms in the choice of his or her policy rule and thereby maximize the prospect that the economy will converge successfully on a rational expectations equilibrium? For this, we work with perturbations to the T-mapping described by equations (8) or systems like it. We take this up in the next subsection.

⁷ A notable exception is Garratt and Hall [22], but even then the learning problem was constrained to exchange rate determination. The rest of the London Business School model that they used was taken as known.
⁸ At the Federal Reserve Board, for example, the staff use a wide range of models to analyze monetary policy issues, including a variety of reduced-form forecasting models, a calibrated multi-country DSGE model called SIGMA, a medium-scale DSGE U.S. model, and the FRB/US model, a larger-scale, partly micro-founded estimated model.

2.2 Structured robust control

In the preceding subsection, we outlined the theory of least-squares learning in a relatively general setting. In this subsection we review some useful methods from robust control theory. Recall that our objective is to uncover the conditions under which monetary policy can maximize the prospect that the process of learning will converge on an REE, that is, to robustify learnability, so the integration of the theories of these two subsections is what will provide us with the tools we seek.

The argument that private agents might have to learn the true structure of the economy takes a useful step back from the assumption of known and certain linear rational expectations models.
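The E-stability conditions of Section 2.1, and the remark there that initial beliefs "too far" from equilibrium can block convergence, can be illustrated with a short numerical sketch. It is not taken from the paper: the helper names `e_stability_eigs`, `is_e_stable` and `integrate_b`, the Euler discretization of the $b$-equation in (9), and the scalar values $M = 0.5$, $N = 0.3$, $\rho = 0.5$ are all assumptions chosen for illustration.

```python
import numpy as np


def e_stability_eigs(M, b, rho):
    """Eigenvalues of DT_a - I, DT_b - I and DT_c - I.

    M and b are n x n arrays and rho is an m x m array.
    """
    n, m = M.shape[0], rho.shape[0]
    dta = M @ (np.eye(n) + b) - np.eye(n)
    dtb = np.kron(b.T, M) + np.kron(np.eye(n), M @ b) - np.eye(n * n)
    dtc = np.kron(rho.T, M) + np.kron(np.eye(m), M @ b) - np.eye(n * m)
    return [np.linalg.eigvals(x) for x in (dta, dtb, dtc)]


def is_e_stable(M, b, rho):
    """True if every eigenvalue of the three matrices has a negative real part."""
    return all(np.all(np.real(e) < 0.0) for e in e_stability_eigs(M, b, rho))


def integrate_b(b0, M, N, step=0.01, n_steps=5000, bound=1e6):
    """Euler-integrate the scalar ODE db/dtau = M b^2 + N - b from b0.

    Returns infinity if the path leaves [-bound, bound], which is used
    here as a simple (assumed) divergence flag.
    """
    b = float(b0)
    for _ in range(n_steps):
        b += step * (M * b**2 + N - b)
        if abs(b) > bound:
            return float("inf")
    return b


# Illustrative scalar values (assumptions, not taken from the paper).
M_S, N_S, RHO_S = 0.5, 0.3, 0.5
b_low = 1.0 - np.sqrt(0.4)    # smaller root of M b^2 - b + N = 0
b_high = 1.0 + np.sqrt(0.4)   # larger root

as_matrix = lambda x: np.array([[x]])
stable_low = is_e_stable(as_matrix(M_S), as_matrix(b_low), as_matrix(RHO_S))
stable_high = is_e_stable(as_matrix(M_S), as_matrix(b_high), as_matrix(RHO_S))

b_from_near = integrate_b(1.0, M_S, N_S)  # starts between the two roots
b_from_far = integrate_b(2.0, M_S, N_S)   # starts above the larger root

print("E-stable at smaller root:", stable_low)
print("E-stable at larger root:", stable_high)
print("limit from b0 = 1.0:", b_from_near)
print("limit from b0 = 2.0:", b_from_far)
```

Under these assumed values, the path started between the two fixed points settles on the smaller root, while the path started above the larger root leaves any bounded region, which is the sense in which initial beliefs matter in this simple case.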
⁹ In fact, in this example, the intercept coefficient, $a$, turns out to be irrelevant for the determination of learnability, although this result is not general.

However, what the literature to date has usually taken as given is, first, that agents use least-squares learning to adapt their perceptions of the true economic structure, and second, that they know the correct linear or linearized form of the REE solution.¹⁰ Both of these assumptions can be questioned. It is a commonplace convenience of macroeconomists to formulate a dynamic stochastic general equilibrium model and then linearize that model. It is certainly possible that ill-informed agents use only linear approximations of their true decision rules. But it is hard to argue that the linearized decision rule is any more valid than some other approximation. Similarly, least-squares learning is the subject of research more because of its analytical tractability than its empirical plausibility. The utility of tractable, linear formulations of economic forms is undeniable; at the same time, however, the risk in overreliance on such forms should be just as apparent. There would appear to be at least a prima facie case for retaining the simplicity of linear state-space representations and linear rules, while taking seriously the consequences of such approximations.

¹⁰ Evans and Honkapohja [17] survey variations on least-squares learning, including under- and over-parameterized learning models and discounted (or constant-gain) least-squares learning. Still, in general, either least-squares learning or constant-gain learning is assumed. An exception is Marcet and Nicolini [32].

With this in mind, we retain the assumption of a linear reference model, and least-squares learning on the part of agents, but assume that the process of learning is subject to uncertainty. Such uncertainty may arise because agents take their decision rules as simplifications of truly optimal decision rules due to the complexity of such rules. Attributing model uncertainty to agents can also be justified if we assume that least-squares learning is untenable in worlds where agents pick and choose the information to which they respond in forming and updating beliefs. The point is that from the perspective of the monetary authority, there are good reasons to be wary of both the assumed learning rules and the underlying models, and yet there is very little guidance on how to model those doubts. We motivate the present approach by positing a central bank that is concerned about ensuring learnability of the true model when learning is subject to errors, the distributions of which are unknown. Accordingly, we analyze these doubts using only a minimum amount of structure, drawing on the literature on structured model uncertainty and robust control.
For the most part, treatments of robust control have regarded model uncertainty as unstructured, that is, as uncertainty not ascribed to particular features of the model but instead represented by one or more additional shock variables wielded by some "evil agent" bent on causing harm.¹¹ The approach taken here differs in that we consider a central bank worried about how large agents’ learning errors can be before learning fails to drive the economy towards eventual equilibrium. The central bank will employ techniques of robust control to set the parameters of her policy rule in a way that makes room for the largest degree of estimation errors by agents without rendering the true model unlearnable.

¹¹ See, in particular, Sargent [41], Giannoni [24], Hansen and Sargent ([26], [27]), Tetlow and von zur Muehlen ([45], [46]), Onatski and Stock [36], and Onatski and Williams [37].

To develop strategies for setting policies that determine maximum allowable misspecifications in agents’ learning models while keeping the economy just shy of becoming unlearnable, we need to consider structured model uncertainty. Structured model uncertainty shares with its unstructured sibling a concern for uncertainty in the sense of Knight, meaning that the uncertainty is assumed to be nonparametric. But structured robust control differs in that it associates the uncertainty with perturbations to particular parts of the model. More importantly, to such perturbations we can assign a variety of structures, including time-series specifications. Work in this field was initiated by Doyle [15] and further developed in Dahleh and Diaz-Bobillo [52] and Zhou et al. [51], among others. Recent applications of this strand of robust control to monetary policy can be found in Onatski and Stock [36], Onatski [35], and Tetlow and von zur Muehlen [45].

While most contributions to the literature on monetary policy have been concerned with maximizing economic performance, our concern is maximizing the prospects for learnability. Thus our metric of success is not the usual one of maximized utility or a quadratic approximation thereof, although we will have a look at the "insurance premium" for robustness. Boiled down to its essence, the five steps to designing policies subject to the constraint that agents must adapt to those policies and a learning model that may be misspecified are:

1. Write down a structural model of the economy and compute the conditions necessary for the model to attain a unique saddle-point stationary equilibrium.

2. Given the structural model, formulate the central bank’s depiction of the perceived law of motion used by agents to learn the structural model. Substitute this into the structural model to arrive at the actual law of motion. We refer to this as the reference model. While the reference model is the central bank’s best guess of the ALM, the bank understands that the reference model is only an approximation of the true ALM, and doubts remain about its local accuracy.¹²

3. Specify a set of perturbations to the reference model structured in such a way as to isolate the possible misspecifications to which the reference model is regarded to be most vulnerable.

4. For a given policy, use structured singular value analysis to determine the maximum allowable perturbations to the ALM that will bring the economy up to, but not beyond, the point of E-instability.

5. Finally, compute the policy for which the allowable range of misspecifications is the largest.

¹² It is sometimes argued that robust control (by which people mean minmax approaches to model uncertainty) is unreasonable on the face of things. The argument is that the worst-case assumption is too extreme; to quote a common phrase, "if I worried about the worst case outcome every day, I wouldn’t get out of bed in the morning". Such remarks miss the point that the worst-case outcome should be thought of as local in nature. Decision makers are envisioned as wanting to protect against uncertainties that are empirically indistinguishable from the data generating process underlying their reference models.

When, in the agents’ learning model (the MSV-based PLM described in (4)), the parameters $a$, $b$, and $c$, or $\Pi$, have been correctly estimated by agents, this model should be considered to be the true reduced form of the structural model in (1). Note, however, that even if individuals manage to specify their learning model correctly in terms of included variables and lag structures, the expectations of future output and inflation they base on these estimates are (at best) in a period of transition towards being truly rational. The model that agents actually estimate may differ from (1) in various ways that may be persistent. We want to determine how far from the true model agents’ learning model can stray before the true model becomes in principle unlearnable.

To begin, we rewrite the ALM from (6) and vectorize the disturbance, $\epsilon_t$, to emphasize the stochastic nature of the estimating problem faced by agents,

$$Y_{t+1} = \Pi Y_t + \tilde{\epsilon}_t,$$

where $Y_t = [1, y_{t-1}, v_t]'$ is of dimension $n+1$, $\tilde{\epsilon}_t = [0 \;\; 0 \;\; \epsilon_t]'$ and

$$\Pi = \begin{bmatrix} 1 & 0 & 0 \\ A + M(I + b)a & N + M b^2 & M(bc + c\rho) + P \\ 0 & 0 & \rho \end{bmatrix}.$$

Notice that by using the ALM, we are modeling the problem from the policy authority’s point of view. The authority is taken as knowing, up to the perturbations we are about to add to the model, the structure of private agents’ learning problems. As a consequence, the authority is in a position to influence the resolution of that problem.

Potential errors in parameter estimation are then represented by a perturbation block, $\Delta$. In principle, the $\Delta$ operator can be structured to implement a variety of misspecifications, including alternative dynamic features. Robust control theory is remarkably rich in how it allows one to consider omitted lag dynamics, inappropriate exogeneity restrictions, missing nonlinearities, and time variation. This being the first paper of its kind, we keep our goals modest: in the language of linear operator theory, we will confine our analysis to linear time-invariant scalar (LTI-scalar) perturbations. LTI-scalar perturbations represent such events as one-time shifts and structural breaks in model parameters, as agents perceive them. Such perturbations have been the subject of study of parametric model uncertainty; see, e.g., Bullard and Eusepi [7].¹³ With this restriction, the perturbed model becomes:¹⁴

$$\tilde{\epsilon}_t = Y_{t+1} - [\Pi + W_1 \Delta W_2] Y_t = [Q - W_1 \Delta W_2] Y_t, \tag{10}$$

where $Q = I_n L^{-1} - \Pi$, $L$ is the lag operator, $\Delta$ is a $k \times k$ linear, time-invariant block-diagonal operator representing potentially destabilizing learning errors, and $W_1$ and $W_2$ are, respectively, $(n+1) \times k$ and $k \times (n+1)$ selector matrices of zeros and ones that select which parameters in which equations are deemed to be subject to such errors. Either $W_1$ or $W_2$ can, in addition, be chosen to attach scalar weights to the individual perturbations so as to reflect the relative uncertainties with which model estimates are to be regarded. The second line is convenient for analyzing stability of the perturbed model under potentially destabilizing learning errors.
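The construction of the perturbed reference model can be made concrete with a small sketch. The code below is illustrative only: the scalar parameter values, the choice of selector matrices that target the coefficient on $y_{t-1}$, and the function names `build_pi`, `perturbed_pi` and `stochastic_block_is_stable` are assumptions, not the authors' implementation. Because the constant row of $\Pi$ contributes a unit eigenvalue by construction, the stability check here looks only at the block for $(y_{t-1}, v_t)$, which is a further assumption of the sketch.

```python
import numpy as np

# Illustrative scalar values (assumptions, not taken from the paper).
A, M, N, P, RHO = 0.1, 0.5, 0.3, 1.0, 0.5

# Closed-form scalar MSV coefficients solving T(a, b, c) = (a, b, c).
b = (1.0 - np.sqrt(1.0 - 4.0 * M * N)) / (2.0 * M)
a = A / (1.0 - M * (1.0 + b))
c = P / (1.0 - M * (b + RHO))


def build_pi(A, M, N, P, rho, a, b, c):
    """Reference-model matrix Pi for the state Y_t = [1, y_{t-1}, v_t]'."""
    return np.array([
        [1.0, 0.0, 0.0],
        [A + M * (1.0 + b) * a, N + M * b**2, M * (b * c + c * rho) + P],
        [0.0, 0.0, rho],
    ])


def perturbed_pi(Pi, W1, Delta, W2):
    """Perturbed transition matrix Pi + W1 Delta W2 from equation (10)."""
    return Pi + W1 @ Delta @ W2


def stochastic_block_is_stable(Pi_pert):
    """Check eigenvalues of the (y_{t-1}, v_t) block against the unit circle.

    The constant row and column are dropped (an assumption of this sketch),
    since they carry an eigenvalue of exactly one by construction.
    """
    eigs = np.linalg.eigvals(Pi_pert[1:, 1:])
    return bool(np.max(np.abs(eigs)) < 1.0)


Pi = build_pi(A, M, N, P, RHO, a, b, c)

# Selector matrices: a single perturbation (k = 1) to the coefficient on
# y_{t-1} in the y-equation, i.e. element (2, 2) of Pi.
W1 = np.array([[0.0], [1.0], [0.0]])
W2 = np.array([[0.0, 1.0, 0.0]])

stable_small = stochastic_block_is_stable(perturbed_pi(Pi, W1, np.array([[0.5]]), W2))
stable_large = stochastic_block_is_stable(perturbed_pi(Pi, W1, np.array([[0.7]]), W2))
print("stable with Delta = 0.5:", stable_small)
print("stable with Delta = 0.7:", stable_large)
```

With these assumed values the perturbed coefficient on $y_{t-1}$ is $b + \Delta$, so the model stays stable only while $\Delta$ is smaller than $1 - b$; the selector matrices simply route the scalar error to that one coefficient.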
¹³ See Evans and Honkapohja [17] for some treatment of learning with an over-parameterized PLM.
¹⁴ Multiplicative errors in specification would be modeled in a manner analogous to (10): $\tilde{\epsilon}_t = [A(1 - W_1 \Delta W_2)] Y_t$.

Using this construction, the perturbation operator, $\Delta$, and the weighting matrices can be structured so that misspecifications are focused on particular features of the model deemed especially susceptible to learning errors involving the model’s variables for any chosen lag or lags.

The essence of this paper is to find out how large, in a sense to be defined presently, the misspecifications represented by the perturbations in (10), called the radius of allowable perturbations, can become without eliciting a failure of convergence to rational expectations equilibrium. Any policy that expands the set of affordable perturbations is one that allows the widest room for misspecifications committed by agents and thus offers an improved chance that policy will not be destabilizing. To do this we bring to bear the tools of structured robust control analysis mentioned earlier.

Let $D$ denote the class of allowable perturbations to the set of parameters of a model, defined as those that carry with them the structure information of the perturbations. Let $r > 0$ be some finite scalar and define $D_r$ as the set of perturbations in (10) that obey $\|\Delta\| < r$, where $\|\Delta\|$ is the induced norm of $\Delta$ considered as an operator acting in a normed space of random processes.¹⁵ The scalar, $r$, can be considered a single measure of the maximum size of errors in estimation. A policy authority wishing to operate with as much room to maneuver as possible will act to maximize this range. For the tools to be employed here, norms will be defined in complex space. In what follows, much use is made of the concept of maximum singular value, conventionally denoted by $\bar{\sigma}$.¹⁶ For reasons that will become clearer below, the norm of $\Delta$ that we shall use will be the $L_\infty$ norm of the function $\Delta(e^{i\omega})$, defined as the largest singular value of $\Delta(e^{i\omega})$ on the frequency range $\omega \in [-\pi, \pi]$:

$$\|\Delta\|_\infty = \sup_{\omega} \left\{ \max \operatorname{eig}\left[\Delta'(e^{-i\omega})\,\Delta(e^{i\omega})\right] \right\}^{1/2}, \tag{11}$$

where $\max \operatorname{eig}$ denotes the maximum eigenvalue. The choice of $\|\Delta\|_\infty$ as a measure of the size of perturbations conveys a sense that the authority is concerned with worst-case outcomes.

¹⁵ Induced norms are defined as follows. Let $X$ be a vector space. A real-valued function $\|\cdot\|$ defined on $X$ is said to be a norm on $X$ if it satisfies: (i) $\|x\| \geq 0$, (ii) $\|x\| = 0$ only if $x = 0$, (iii) $\|\alpha x\| = |\alpha| \, \|x\|$ for any scalar $\alpha$, (iv) $\|x + y\| \leq \|x\| + \|y\|$ for any $x \in X$ and $y \in X$. For $x \in \mathbb{C}^n$, the vector $p$-norm on $x$ is defined as $\|x\|_p = \left(\sum_{i=1}^{n} |x_i|^p\right)^{1/p}$, where $1 \leq p \leq \infty$. For $p = 2$, $L_2 = \|x\|_2 = \sqrt{\sum_{i=1}^{n} |x_i|^2}$, that is, the quadratic problem. Correspondingly, we also have $L_1 = \sum_{i=1}^{n} |x_i|$ and $L_\infty = \max_{1 \leq i \leq n} |x_i|$. Finally, let $A = [a_{ij}] \in \mathbb{C}^{m \times n}$ in an equation $y_t = A x_t$, where $x_t$ may be some random vector. The matrix norm induced by a vector $p$-norm, $\|x\|_p$, is $\|A\|_p \equiv \sup_{x \neq 0} \|Ax\|_p / \|x\|_p$. Note that for $p = 2$, $\|A\|_2 = \sqrt{\max \operatorname{eig}(A^{*} A)}$ and $\|A\|_1 = \max_{1 \leq j \leq n} \sum_{i=1}^{m} |a_{ij}|$. More details are given in Tetlow and von zur Muehlen [45].
¹⁶ As is apparent from the expression in (11), the largest singular value, $\bar{\sigma}(X)$, of a matrix, $X$, is the square root of the largest eigenvalue of $X'X$.

Imagine two artificial vectors, $h_t = [h_{1t}, h_{2t}, \ldots, h_{kt}]'$ and $p_t = [p_{1t}, p_{2t}, \ldots, p_{kt}]'$, connected to each other and to $Y_t$ via¹⁷

$$p_t = W_2 Y_t, \qquad h_t = \Delta \cdot p_t.$$

Then we may recast the perturbed system (10) as the augmented feedback loop¹⁸

$$\begin{bmatrix} Y_{t+1} \\ p_t \end{bmatrix} = \begin{bmatrix} \Pi & W_1 \\ W_2 & 0 \end{bmatrix} \begin{bmatrix} Y_t \\ h_t \end{bmatrix}, \tag{12}$$

$$h_t = \Delta \cdot p_t. \tag{13}$$

A reduced-form representation of this loop (from $h_t$ to $Y_t$ and $p_t$) is the transfer function

$$\begin{bmatrix} Y_t \\ p_t \end{bmatrix} = \begin{bmatrix} G_1 \\ G_2 \end{bmatrix} h_t, \tag{14}$$

where $G_1 = (I_n L^{-1} - \Pi)^{-1} W_1$, and $G_2 = W_2 (I_n L^{-1} - \Pi)^{-1} W_1$ is a $k \times k$ matrix, where $k$ is the number of diagonal elements in $\Delta$. As we shall see, the stability of the interconnection between $h_t$ and $p_t$, representing a feedforward $p_t = G_2 h_t$ and a feedback $h_t = \Delta \cdot p_t$, is critical. Note first that, together, these two relationships imply the homogeneous matrix equation

$$0 = (I_k - G_2 \Delta) p_t. \tag{15}$$

An E-stable ALM is also dynamically stable, meaning that $\Pi$ has all its eigenvalues inside the unit circle. To make this link, the following theorem is critical.

¹⁷ See Dahleh and Diaz-Bobillo [14], chapter 10.
¹⁸ Because the random errors in this model play no role in what follows, we leave out the $\tilde{\epsilon}_t$ vector.

Theorem 1 (The Small Gain Theorem). Let $\mathrm{Re}(s)$ denote the real part of $s \in \mathbb{C}$, where $\mathbb{C}$ is the field of complex numbers, and let $H_\infty$ denote the set of $L_\infty$ functions analytic in $\mathrm{Re}(s) > 0$. Furthermore, let $RH_\infty$ designate the set of real, rational values in the $H_\infty$-normed space. Suppose $G_2 \in RH_\infty$ and $r > 0$. Then the interconnected system in (12)-(13) is well posed and internally stable for all $\Delta(s) \in RH_\infty$ with

(a) $\|\Delta\| \leq 1/r$, if and only if $\|G_2(s)\|_\infty < r$,

(b) $\|\Delta\| < 1/r$, if and only if $\|G_2(s)\|_\infty \leq r$.

Proof: See Zhou and Doyle [52], p. 137. □

By assumption, $Q$, defined in (10), is invertible on the unit circle, allowing us to write^19

$$\begin{aligned} \det(Q)\det(I_k - G_2\Delta) &= \det(Q)\det(I_k - W_2 Q^{-1} W_1 \Delta) \\ &= \det(Q)\det(I_n - Q^{-1} W_1 \Delta W_2) \\ &= \det(Q - W_1 \Delta W_2). \end{aligned}$$

The preceding expressions establish the link between stability of the interconnection, $G_2$, and stability of the perturbed model: if $\det(I_k - G_2\Delta) = 0$, then the perturbed model (10) is no longer invertible on the unit circle, hence unstable, and vice versa. Thus, any policy rule that stabilizes $G_2$ also stabilizes the augmented system (12)-(13). The question to be asked then is how large, in the sense of $\|\cdot\|_\infty$, $\Delta$ can become without destabilizing the feedback system (12)-(13).

The settings we consider involve linear time-invariant perturbations, where the object is to find the minimum of the largest singular value of the matrix, $\Delta$, from the class $\mathcal{D}_r$ such that $I - G_2\Delta$ is not invertible. The inverse of this minimum, expressed in the frequency domain, is the structured singular value^20 of $G_2$ with respect to $\mathcal{D}_r$, defined at each frequency, $\omega \in [-\pi, \pi]$,

$$\mu[G_2(e^{i\omega})] = \frac{1}{\min\left\{ \bar{\sigma}[\Delta(e^{i\omega})] : \Delta \in \mathcal{D}_r,\ \det(I - G_2\Delta)(e^{i\omega}) = 0 \right\}}, \tag{16}$$

with the provision that if there is no $\Delta \in \mathcal{D}_r$ such that $\det(I - G_2\Delta)(e^{i\omega}) = 0$, then $\mu[G_2(e^{i\omega})] = 0$. The small gain theorem then tells us that, for some $r > 0$, the loop (12)-(13) is well posed and internally stable for all $\Delta(\cdot) \in \mathcal{D}_r$ with $\|\Delta\| < r$, if and only if $\sup_{\omega \in \mathbb{R}} \mu[G_2(e^{i\omega})] \leq 1/r$. Let $\phi$ denote a vector of policy parameters. Thus we can now formally state the problem that interests us as seeking a best $\phi = \phi^*$ by finding the value of $\mu$ satisfying

$$\mu(\phi^*) = \inf_{\phi} \sup_{\omega \in \mathbb{R}} \mu[G_2(e^{i\omega})]$$

subject to the satisfaction of the saddle-point stability condition for the relevant model. The solution to this problem is not amenable to analytical methods, except in special cases, an example of which we explore in the next section. Instead, we will employ efficient numerical techniques to find the lower bound on the structured singular value. The minimum of $\mu^{-1}(G_2)$ over $\omega \in [0, \pi]$ is exactly the maximal allowable range of misspecification for a given policy. A monetary authority wishing to give agents the widest latitude for learning errors that nevertheless allow the system to converge on REE selects those parameters in its policy rule that yield the largest value of $r$.

Figure 1 below provides a schematic representation of what is done. Given a policy rule, the ALM (and hence the reference model) is represented by the transition matrix, $\Pi$. By assumption it is in the stable and determinate region of the space. The central bank chooses perturbation, $\Delta$, to $\Pi$. The largest feasible perturbation, $\Pi^*_\Delta$, the one that renders the largest radius, $r$, defines a region shown by the ellipse within which any ALM, including but not restricted to the reference model ALM, will converge on a rational expectations equilibrium. The weights, $W_1$ and $W_2$, determine the shape of the ellipse. Perturbations larger (in norm) than $r$ will push the ellipse over the line into indeterminate or explosive regions; smaller perturbations provide less robustness than the optimal one.

^19 The small gain theorem links the stability of the loop between $p$ and $h$ under perturbations to the full system subject to model uncertainty. For some sufficiently small number $r$, such that $\|\Delta\|_\infty < r$, the determinant $\det(I_k - G_2\Delta) \neq 0$. Now raise $r$ to some value $\bar{r}$ such that $\det(I_k - G_2\Delta) = \det(I_k - W_2 Q^{-1} W_1 \Delta) = 0$.

^20 The singular value is said to be "structured" in recognition of structure built into the perturbation matrix, $\Delta$. Through selection of the structure, $\Delta$ can encompass uncertainties about one-time shifts in selected parameters, unmodeled dynamics in parts of the model, and nonlinearities, among other things.

3 Two examples

We study two sample economies, one the very simple model of money demand in hyperinflations of Cagan [10], the other the linearized neo-Keynesian model originated by Woodford ([48], [49]), Rotemberg and Woodford [39] and Goodfriend and King [25]. Closed-form solutions for $\mu$, being non-linear functions of the eigenvalues of models, are not generally feasible. However, some insight is possible through considering simple scalar example economies like the Cagan model.
The second has the virtue of having been studied extensively in the literature on monetary policy design. It thus provides some solid benchmarks for comparison.
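The robustness radius defined in Section 2 can be approximated numerically. What follows is a minimal sketch under stated assumptions, not the procedure used in the paper: it treats the perturbation as unstructured (a single full block), in which case the structured singular value reduces to the largest singular value of $G_2$, and it evaluates $G_2 = W_2 (z^{-1} I - \Pi)^{-1} W_1$ on a finite frequency grid. The function name, the grid size, and the example transition matrix are all hypothetical.

```python
import numpy as np

def unstructured_radius(Pi, W1, W2, n_freq=2001):
    """Approximate 1 / sup_w sigma_max(G2(e^{iw})) on a frequency grid.

    G2(z) = W2 (z^{-1} I - Pi)^{-1} W1, mirroring the transfer function in the text.
    Assumes Pi has no eigenvalue on the unit circle, so the inverse exists at
    every grid point. For a structured Delta the result is only a conservative
    bound on the radius, because sigma_max(G2) >= mu(G2).
    """
    n = Pi.shape[0]
    peak = 0.0
    for w in np.linspace(-np.pi, np.pi, n_freq):
        z_inv = np.exp(-1j * w)
        # Solve (z^{-1} I - Pi) X = W1 instead of forming the inverse explicitly.
        G2 = W2 @ np.linalg.solve(z_inv * np.eye(n) - Pi, W1.astype(complex))
        peak = max(peak, np.linalg.svd(G2, compute_uv=False)[0])
    return 1.0 / peak

# Hypothetical stable transition matrix with eigenvalues 0.5 and -0.4.
Pi = np.diag([0.5, -0.4])
radius = unstructured_radius(Pi, np.eye(2), np.eye(2))
print(round(radius, 4))
```

In this illustrative case the radius comes out at about 0.5, which is the distance from the eigenvalue 0.5 to the unit circle. A structured analysis would restrict the perturbation to chosen elements and could therefore tolerate a larger norm.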

Figure 1: Schematic representation of robust learnability

3.1 A simple univariate example

Consider a version of Cagan's monetary model, cited in Evans and Honkapohja [17], although our rendition differs slightly. The model has two equations, one determining (the log of) the price level, $p_t$, and the other a simple monetary feedback rule determining the (log of the) money supply, $m_t$:

$$m_t - p_t = -\kappa(E_t p_{t+1} - p_t)$$

$$m_t = \chi - \phi p_{t-1}.$$

Normally, all parameters should be greater than zero; for $\kappa$ this means that money demand is inversely related to expected inflation. We will relax this assumption a bit later. Combining the two equations leads to:

$$p_t = \alpha + \beta E_t p_{t+1} - \gamma p_{t-1}, \tag{17}$$

where $\alpha = \chi/(1+\kappa)$, $\beta = \kappa/(1+\kappa)$, and $\gamma = \phi/(1+\kappa)$. To set the stage for what follows, let us consider the conditions for a unique rational expectations equilibrium. First off, we will clearly want to assume that $\beta, \gamma \neq 0$ to avoid degenerate models. We will go a bit further and assume that $\beta - \gamma \neq 1$, which is a mild invertibility assumption. We will also assume that $\kappa \neq 0$ and $\kappa \neq -1$. Finally, for simplicity we will focus on the case where solutions are real. Standard methods then reveal:

$$\beta\lambda_i^2 - \lambda_i - \gamma = \frac{\kappa}{1+\kappa}\lambda_i^2 - \lambda_i - \frac{\phi}{1+\kappa} = 0 \tag{18}$$

where $\lambda_i$, $i = 1, 2$, are the roots of the quadratic equation associated with (17). As is well known from Blanchard and Kahn [3], among other sources, equation (18) is the key to establishing existence and uniqueness of saddle-point equilibrium. Letting $\lambda_1 \geq \lambda_2$, without loss of generality, if $1 < \lambda_2 \leq \lambda_1$, then the model is explosive, meaning that there are no initial conditions which can establish a saddle-point equilibrium; if $\lambda_2 \leq \lambda_1 < 1$, then the model is said to be irregular, meaning that every set of initial conditions leads to a different equilibrium. Putting the same thing in different words, the equilibrium is said to be indeterminate (or non-unique). Finally, the condition $\lambda_2 < 1 < \lambda_1$ is the condition for saddle-point stability; that is, the condition by which all (local) sets of initial conditions converge, in expectation, on the same equilibrium. In this instance, the equilibrium is said to be determinate (or regular, or unique). Inspection of the expression to the right of the first equality in equation (18) shows that the determinacy of the model is governed by the interaction between the structural money-demand parameter, $\kappa$, and the policy feedback parameter, $\phi$.

Proposition 1 Assume that $\kappa \neq -1$. For $\kappa > -1$, determinacy requires: $\phi > -1$ and $\phi < 1 + 2\kappa$. For $\kappa < -1$, determinacy requires $\phi < -1$ and $\phi > 1 + 2\kappa$.

Proof: Equation (18) means that $|\beta - \gamma| = |(\kappa - \phi)/(1+\kappa)| < 1$ is the condition for exactly one eigenvalue to be above unity. There are two cases. Assume first that $\kappa > -1$, which means that the denominator is always positive. Then simple arithmetic shows that $\left\{\phi, \kappa \in \mathbb{R} : \left|\frac{\kappa - \phi}{1+\kappa}\right| < 1\right\} = \{\phi > -1\} \cap \{\phi < 1 + 2\kappa\}$ is implied. Now consider $\kappa < -1$. In this instance, $\left\{\phi, \kappa \in \mathbb{R} : \left|\frac{\kappa - \phi}{1+\kappa}\right| < 1\right\} = \{\phi < -1\} \cap \{\phi > 1 + 2\kappa\}$. □

Now let us assume that agents form expectations employing adaptive learning, and designate expectations formation in this way with the operator, $E^*_t$. The perceived law of motion for this model is assumed to be $p_t = a + b p_{t-1}$, the minimum state variable (MSV) solution, implying $E^*_t p_{t+1} = (1+b)a + b^2 p_{t-1}$. The actual law of motion is found by substituting the PLM into the structural model:

$$p_t = [\alpha + \beta a(1+b)] + (\beta b^2 - \gamma) p_{t-1}. \tag{19}$$

Following the steps outlined earlier, the ALM is $p_t = T_a(a,b) + T_b(a,b) p_{t-1}$, where $T$ defines the mapping $(a, b)' = T(a, b)'$. This is

$$T_a : a = \alpha + \beta a(1+b) \tag{20}$$

$$T_b : b = \beta b^2 - \gamma. \tag{21}$$

It is equation (21) that is key for the learnability of the model. But notice that (21) is identical to equation (18) with $b = \lambda_i$. This means that the conditions for learnability and determinacy are tightly connected in this particular model, as we shall discuss in detail below. The solutions to equations (20) and (21) are:

$$a = \alpha/[1 - \beta(1+b)] \tag{22}$$

$$b = .5\left[1 \pm \sqrt{1 + 4\beta\gamma}\right]/\beta. \tag{23}$$

Equation (23) is quadratic with one root greater than or equal to unity, and the other less than unity. Designate the larger of the two values for $b$ as $b^+$ and the smaller as $b^-$. Existence of the REE requires us to choose the smaller root; otherwise, $b^+ > 1$. The ordinary differential equation system implied by this mapping is

$$\frac{d}{d\tau}\begin{pmatrix} a \\ b \end{pmatrix} = T\begin{pmatrix} a \\ b \end{pmatrix} - \begin{pmatrix} a \\ b \end{pmatrix},$$

for which the associated $DT$ matrix is derived by differentiating $[T_a\ T_b]'$ with respect to $a$ and $b$:

$$DT = \begin{bmatrix} \beta(1+b) & a\beta \\ 0 & 2\beta b \end{bmatrix}.$$

The eigenvalues of $DT - I$ are

$$\psi_1 = 2\beta b - 1 \tag{24}$$

$$\psi_2 = \beta(1+b) - 1. \tag{25}$$

Satisfaction of the weak E-stability condition requires that both eigenvalues be negative.
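The chain from structural parameters to the E-stability eigenvalues can be traced numerically. The sketch below is illustrative only: the function names and the parameter values ($\kappa = \phi = \chi = 1$) are assumptions, and the code restricts attention to $\kappa > 0$, where the root of (23) taken with the minus sign is the smaller one.

```python
import math

def cagan_msv(kappa, phi, chi=1.0):
    """Return (a, b, psi1, psi2) at the smaller MSV root, following (22)-(25).

    Restricted to kappa > 0 (an assumption of this sketch), so that beta > 0
    and the minus-sign root of (23) is the smaller of the two.
    """
    if kappa <= 0.0:
        raise ValueError("this sketch covers kappa > 0 only")
    alpha = chi / (1.0 + kappa)
    beta = kappa / (1.0 + kappa)
    gamma = phi / (1.0 + kappa)
    disc = 1.0 + 4.0 * beta * gamma
    if disc < 0.0:
        raise ValueError("complex roots: outside the real case considered in the text")
    b = 0.5 * (1.0 - math.sqrt(disc)) / beta   # smaller root of (23)
    a = alpha / (1.0 - beta * (1.0 + b))       # eq. (22)
    psi1 = 2.0 * beta * b - 1.0                # eq. (24)
    psi2 = beta * (1.0 + b) - 1.0              # eq. (25)
    return a, b, psi1, psi2

def is_determinate(kappa, phi):
    """Saddle-point condition |beta - gamma| < 1 from the proof of Proposition 1."""
    return abs((kappa - phi) / (1.0 + kappa)) < 1.0

a, b, psi1, psi2 = cagan_msv(kappa=1.0, phi=1.0)
print(is_determinate(1.0, 1.0), psi1 < 0.0 and psi2 < 0.0)
```

With these illustrative values the model is determinate and both eigenvalues are negative, so the MSV solution is E-stable.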

Proposition 2 Determinate solutions of the Cagan model that are real are also E-stable.

Proof: Substitute the expression for $b^-$ into (24) to get $\psi_1 = -[1 + 4\kappa\phi/(1+\kappa)^2]^{1/2}$ and do the same for (25) to arrive at $\psi_2 = \kappa/(1+\kappa) - 1 + (1 + \psi_1)/2$. Substitute $\psi_1$ into $b^-$ to arrive at: $b^- = \frac{1+\kappa}{2\kappa}[1 + \psi_1]$. A necessary and sufficient condition for $\psi_1 < 0$, $\psi_1 \in \Re$, is (P1): $\phi > -(1+\kappa)^2/(4\kappa) \equiv S$. Proposition 1 imposes restrictions on $\phi$ as a function of $\kappa$ to ensure determinacy. For $\kappa > -1$, these are $\phi > -1$ and $\phi < 1 + 2\kappa$. Substituting these into the expression for $\psi_1$ gives $\psi_1 < [(1+3\kappa)/(1+\kappa)]^2$, which readily yields $\psi_1 < 0$ and is real whenever $|b^-| < 1$, provided the solution is real. Simple substitution of $\psi_1$ into $\psi_2$ shows that $\psi_2$ is also negative. For $\kappa < -1$, a similar proof applies. □

Having outlined the connection between $b$ and $\beta$ (or $\kappa$) for E-stability, let us now consider unstructured perturbations to the ALM. Let $X_t = [1\ p_t]'$. The reference ALM model is then written as $X_t = \Pi X_{t-1}$, where

$$\Pi = \begin{bmatrix} 1 & 0 \\ \alpha + \beta a(1+b) & \beta b^2 - \gamma \end{bmatrix}$$

is the model's transition matrix. For simplicity, let us focus on $b$ as the object of concern to policy makers, and let the policy maker apply structured perturbations to $\Pi$, scaled by the parameter, $\sigma_b$. The scaling parameter can be thought of as a standard deviation, but need not be. Letting $W_1 = [0\ \sigma_b]'$ and $W_2 = [0\ 1]$, write the perturbed matrix $\Pi$ as:

$$\Pi_\Delta = \begin{bmatrix} 1 & 0 \\ \alpha + \beta a(1+b) & \beta b^2 - \gamma + \Delta \end{bmatrix}.$$

As in (14), the relevant matrices are defined in complex space. Accordingly, let $z = e^{i\omega}$, $\omega \in [-\pi, \pi]$. To find the maximal allowable perturbation, write

$$G = \begin{bmatrix} z^{-1} - 1 & 0 \\ -\alpha - \beta a(1+b) & z^{-1} - \beta b^2 + \gamma \end{bmatrix} = I \cdot z^{-1} - \Pi,$$

which, with $W_1 = [0\ \sigma_b]'$ and $W_2 = [0\ 1]$ as defined above, is used to form $G_2$:

$$\begin{aligned} G_2 &= W_2 G^{-1} W_1 \\ &= \begin{bmatrix} 0 & 1 \end{bmatrix} \begin{bmatrix} \dfrac{z}{1-z} & 0 \\ \dfrac{(\alpha + \beta a(1+b))z^2}{(1-z)(1 - (\beta b^2 - \gamma)z)} & \dfrac{z}{1 - (\beta b^2 - \gamma)z} \end{bmatrix} \begin{bmatrix} 0 \\ \sigma_b \end{bmatrix} \\ &= \frac{\sigma_b z}{1 - (\beta b^2 - \gamma)z}. \end{aligned}$$

It is for this expression that we seek the smallest structured singular value, $\mu$, as indicated by (16).

In the multivariate case, the scaling parameter, $\sigma_b$, can be parameterized as the standard deviation of $b$ relative to $a$, although other methods of parameterization can be entertained. Doing so would reflect a concern for robustness of the decision maker and thus could also be thought of as a taste parameter. Since it is a relative term, it will turn out to be irrelevant in this scalar case, and so from here we set it to unity without loss of generality. The structured norm of $G_2$, equal to the absolute value of this last expression (see footnote 15), is $\mu$. It is also easily established that the maximum of $\mu$ over the frequency range $[-\pi, \pi]$, let us call it $\bar{\mu}$, is $\bar{\mu} = \|\Delta\|_\infty^{-1} = |G_2|$ and arises at frequency $\pi$. Also, since at frequency $\pi$, $z = -1$, it follows that $|G_2| = \bar{\mu} = \frac{1}{1+b}$, or equivalently, the allowable perturbation is:

$$\begin{aligned} \Delta = \frac{1}{\bar{\mu}} = 1 + b &= 1 + \beta b^2 - \gamma \\ &= 1 + \frac{1}{2\beta}\left[1 - \sqrt{1 + 4\beta\phi/(1+\kappa)}\right], \end{aligned} \tag{26}$$

which depends inversely on the policy parameter, $\phi$.^21 Note also that while we have derived this expression for $\Delta$ by applying perturbations to the ALM, we would have obtained exactly the same result by working with the PLM.

If $\Delta$ in equation (26) is the allowable perturbation, conditional on a given $\phi$, then we can define $\phi^*$ as the policy maker's optimal choice of $\phi$, where optimality is defined in the sense of choosing the largest possible perturbation to $b$, call it $\Delta^*$, such that the model will retain the property of E-stability. Let us call this the maximum allowable perturbation. It is the $\Delta^*$ and the associated $\phi^*$ that is at a boundary where $\Delta^*$ is just above $-1$:

$$\phi^* = 1 + 2\kappa - \varepsilon, \tag{27}$$

where $\phi < 1 + 2\kappa$ maintains stable convergence toward a REE and $\varepsilon$ is an arbitrarily small positive constant necessary to keep $b + \Delta$ off the unit circle.
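The closed form in (26) can be checked against a direct frequency sweep of the scalar transfer function. The sketch below is only illustrative: the function name and grid size are hypothetical, and the value of $b$ is the smaller MSV root implied by the assumed parameters $\kappa = \phi = 1$, namely $b = 1 - \sqrt{2}$. Because this $b$ is negative, the gain peaks at $\omega = \pi$, as in the text; for a positive $b$ the peak of the same expression would sit at $\omega = 0$ instead.

```python
import numpy as np

def cagan_gain_peak(b, sigma_b=1.0, n_freq=4001):
    """Grid approximation of max_w |G2(e^{iw})| for G2(z) = sigma_b z / (1 - b z).

    Uses the fixed-point property beta*b**2 - gamma = b of the MSV solution,
    so the ALM slope in the denominator is simply b.
    Returns the peak gain and the absolute frequency at which it occurs.
    """
    omegas = np.linspace(-np.pi, np.pi, n_freq)
    z = np.exp(1j * omegas)
    gains = np.abs(sigma_b * z / (1.0 - b * z))
    k = int(np.argmax(gains))
    return gains[k], abs(omegas[k])

b = 1.0 - np.sqrt(2.0)          # assumed example: kappa = phi = 1
peak, freq = cagan_gain_peak(b)
allowable = 1.0 / peak          # to be compared with 1 + b from eq. (26)
print(round(allowable, 6), round(1.0 + b, 6))
```

The two printed numbers agree, which is the scalar statement that the allowable perturbation equals the distance of the ALM slope from the unit circle.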
Note that this expression for $\phi^*$ indicates that the monetary authority will always respond more than one-for-one to deviations in lagged prices from steady state, with the extent of that over-response being a positive function of the slope of the money demand function. Substituting these expressions back into our perturbed transition matrix,

$$\begin{aligned} \Pi^*_\Delta &= \begin{bmatrix} 1 & 0 \\ \alpha + \beta a(1+b) & \beta b^2 - \gamma + \Delta \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 \\ \alpha + \beta a(1+b) & 1 + 2(\beta b^2 - \gamma) \end{bmatrix} \\ &= \begin{bmatrix} 1 & 0 \\ \alpha + \beta a(1+b) & 1 + \eta \end{bmatrix}, \end{aligned} \tag{28}$$

where $\eta$ is an arbitrarily small number, as determined by $\varepsilon$ in (27). The preceding confirms that the authority's policy is resilient to a perturbation in the learning model that pushes the transition matrix to the borderline of instability. In other words, setting a $\phi$ that allows for the maximal stable misspecification of the learning model is one that permits convergence to the REE.

^21 Note that at frequency $\pi$, $1 - G_2\Delta = 1 - \frac{\sigma_b}{(1+b)}\frac{(1+b)}{\sigma_b} = 0$, as required by the definition of $\mu$.

Figure 2 shows the regions of dynamic stability and learnability for the Cagan model as functions of the structural parameters: the absolute interest elasticity of money demand, $\kappa$, and the monetary policy feedback parameter, $\phi$. As noted in the legend to the right, the determinate regions of the structural model are the blue areas, to the northeast and southwest of the figure. Proposition 1 above warns the monetary authority to stay out of the indeterminate regions: the sliver of purple toward the southeast of the chart, which is possible for some $\kappa > 0$ when $\phi < -1$, and the larger region of purple to the west. Also to be avoided are the orange regions of explosive solutions to the north and south.

The E-stable region is the large area between the two dashed lines. The first thing to note is that E-stability does not imply determinacy: convergence in learning on indeterminate equilibria is possible in the area where both $-1 < \kappa < -1/2$ and $-1 < \phi < 0$, corroborating a point made by Evans and McGough [20] in a different context. In addition, learnability of unstable equilibria is also possible, as shown by the orange regions between the two dashed lines. Indeed, even if one were to accept a priori that $\kappa > 0$, as Cagan assumed, there are unstable equilibria that are learnable.
At the same time, the figure clearly shows what Proposition 2 noted: for this model, determinate models are always E-stable; the blue region is entirely within the area bordered by the two dashed lines. It follows that in the special case of the Cagan model, robustifying learnability is equivalent to maximizing the basin of attraction for the rational expectations equilibrium of the model. The locus of robust policies, $\phi^*$, conditional on values of $\kappa$, is shown by the thick diagonal line running from the southwest to the northeast of the chart, and marked $\phi^* = 1 + 2\kappa$. The line shows that, contrary to what unguided intuition might suggest, the robust policy does not choose a rule that is in the middle of the blue determinate and E-stable region, but rather chooses a policy that might be quite close to the boundary of indeterminacy for the REE. Doing so increases the region of E-stable ALMs, something that cannot be seen in the chart, and thereby enhances the prospects for convergence on an REE.

Figure 2: Regions of determinacy and E-stability in the Cagan model

3.2 The canonical New Keynesian model

We now turn to an analysis of the canonical New Keynesian business cycle model of Rotemberg and Woodford [39], Goodfriend and King [25] and others. Clarida, Gali, and Gertler [12] used this model to derive optimal discretionary as well as optimal commitment rules. Their version includes a specified process for exogenous natural output. Evans and Honkapohja [18] study this model to explore issues of determinacy and learnability for several optimal commitment rules. Bullard and Mitra [9] likewise use the Woodford model to examine determinacy and learnability of variants of the Taylor rule.

The behavior of the private sector is described by two equations. The aggregate demand (IS) equation is a log-linearized Euler equation derived from optimal consumer behavior,

$$x_t = E^*_t x_{t+1} - \sigma[r_t - E^*_t \pi_{t+1} - r^n_t], \tag{29}$$

and the aggregate supply (AS) equation, in effect the price setting rule for monopolistically competitive firms, is

$$\pi_t = \kappa x_t + \beta E^*_t \pi_{t+1}, \tag{30}$$

where $x$ is the log deviation of output from potential output, $\pi$ is inflation, $r$ is a short-term interest rate controlled by the central bank, and $r^n$ is the natural interest rate. For the application of Bullard and Mitra's [8] (BM) example, we assume that $r^n_t$ is driven by a first-order autoregressive process,

$$r^n_t = \rho_r r^n_{t-1} + \varepsilon_{r,t}, \tag{31}$$

with $0 \leq |\rho_r| < 1$ and $\varepsilon_{r,t} \sim iid(0, \sigma_r^2)$. This is essentially Woodford's [48] version of this model, which specifies that aggregate demand responds to the deviation of the real rate, $r_t - E_t \pi_{t+1}$, from the natural rate, $r^n_t$.

We need to close the model with an interest-rate feedback rule. We study three types of policy rules. In the first set of experiments described in Section 3.3, a central bank chooses an interest rate setting in each period as a reaction to observed events, such as inflation and the output gap, without explicitly attempting to improve some measure of welfare. Instead, the policy authority is mindful of the effect its policy has on the prospect of the economy reaching REE and designs its rule accordingly. Bullard and Mitra [8] study such rules for their properties in promoting learnable equilibria and consider that effort as prior to one of finding optimal policy rules consistent with REE.
We take this analysis further by seeking to find policy rules that maximize learnability of agents' models when policy influences the outcome.
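To make concrete what closing the model with a feedback rule involves, the following sketch checks determinacy of equations (29)-(30) under a simple non-inertial rule, $r_t = \phi_\pi \pi_t + \phi_x x_t$. This particular rule, the function name, and the trial coefficients are assumptions chosen for illustration; the default parameter values are the calibration the paper adopts in its results section. The sketch addresses determinacy only and does not test E-stability or compute a robustness radius.

```python
import numpy as np

def nk_determinate(phi_pi, phi_x, sigma=1 / 0.157, kappa=0.024, beta=0.99):
    """Determinacy of (29)-(30) under r_t = phi_pi*pi_t + phi_x*x_t (assumed rule).

    Substituting the rule into (29)-(30) gives y_t = B E_t y_{t+1} + (shock terms)
    with y = (x, pi)'. Both variables are non-predetermined, so determinacy
    requires both eigenvalues of B to lie strictly inside the unit circle.
    """
    d = 1.0 + sigma * (phi_x + kappa * phi_pi)
    B = np.array([
        [1.0, sigma * (1.0 - beta * phi_pi)],
        [kappa, kappa * sigma + beta * (1.0 + sigma * phi_x)],
    ]) / d
    return bool(np.all(np.abs(np.linalg.eigvals(B)) < 1.0))

print(nk_determinate(1.5, 0.5))   # coefficients of the classic Taylor rule
print(nk_determinate(0.5, 0.0))   # inflation response below one-for-one
```

Under these assumptions the first rule is determinate and the second is not, which is the familiar Taylor-principle pattern for this class of models.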

The information protocol in these experiments is as follows. The central bank knows the structural model and has access to the data. Economic agents see the data, which change over time, and formulate the perceived law of motion. Agents form expectations based on recursive (least-squares) estimation of a reduced form. The data are regenerated each period, subject to the authority having implemented its policy and agents having made investment and consumption decisions based on their newly formed expectations.

We assume that agents mistakenly specify a vector-autoregressive model in the endogenous and exogenous variables of the model. That means we assume the learning model to be overparameterized in comparison with the model implied by the MSV solution. The scaling factors used in $W_1$ to scale the perturbations to the PLM are the standard errors of the coefficients obtained from an initial run of a recursive least squares regression of such a VAR with data being updated by the true model, given an arbitrary but determinate parameterization of the policy rule being studied. As noted earlier, an alternative approach would be to revise the scalings with each trial policy, given that the VAR would likely change with each parameterization of policy. We leave this for a revision.

3.3 Simple interest-rate feedback rules

This section describes two versions of the Taylor rule analyzed by Bullard and Mitra [8]. The complete system comprises equations (29)-(32), and the exogenous variable, $r^n_t$. The policy instrument is the nominal interest rate, $r_t$. The first policy rule specifies that the interest rate responds to lagged inflation and the lagged output gap. In their paper, BM study the role of interest-rate inertia and so include a lagged interest rate term:

$$r_t = \phi_\pi \pi_{t-1} + \phi_x x_{t-1} + \phi_r r_{t-1}. \tag{32}$$

McCallum has advocated such a lagged data rule because of its implementability, given that contemporaneous data are generally not available in real time to policy makers.
Some research suggests that forward-looking rules perform well in theory (see, e.g., Evans and Honkapohja [18]) as well as in actual economies, such as Germany, Japan, and the US (see Clarida, Gali, and Gertler [11]). Accordingly, BM propose the rule

$$r_t = \phi_\pi E^*_t \pi_{t+1} + \phi_x E^*_t x_{t+1} + \phi_r r_{t-1}. \tag{33}$$

The expectations operator $E^*$ has an asterisk to indicate that expectations need not be rational. Finally, the most popular rules of this class are contemporaneous data rules, of which the following is our choice:

$$r_t = \phi_\pi \pi_t + \phi_x x_t + \phi_r r_{t-1} \tag{34}$$

where, as before, we allow the lagged federal funds rate to appear to capture instrument-smoothing behavior by uncertainty-averse decision makers.

3.4 Results

We adopt BM's calibration for the New Keynesian model's parameters, $\sigma = 1/.157$, $\kappa = .024$, $\beta = .99$, and $\rho_r = .35$, the same calibration as in Woodford [49]. We also set $\sigma_r = 0.01$. For reference purposes, it is useful to compare our results against those of rules that are not parameterized with robust learnability in mind. To facilitate this, we employ a standard quadratic loss function:

$$L_t = \frac{1000}{2} \sum_{j=0}^{\infty} \beta^j \left[(\pi_{t+j} - \pi^*)^2 + \lambda_x x_{t+j}^2 + \lambda_r (r_{t+j} - r^*)^2\right]. \tag{35}$$

Walsh [47] shows that with the values $\lambda_x = .077$ and $\lambda_r = .027$, equation (35) is the quadratic approximation to the social welfare function of the model.

Rules that are computed to maximize the prospect of convergence to REE under the greatest possible misspecification of the ALM model in the manner described above will be referred to as "robust" or "robust learnable" rules. A credible benchmark against which to compare these robust rules is what we shall refer to as optimized rules. These are rules that minimize (35) subject to (29), (30), (31) and one of (32), (33) or (34). Such rules can be optimized using a standard hill-climbing algorithm using methods well described in the appendix to Tetlow and von zur Muehlen [44], among other sources.

Let us consider the lagged-data rule first. BM find that the determinacy of a unique rational expectations equilibrium, as well as convergence toward that equilibrium when agents learn adaptively, is extremely sensitive to the policy parameters, $\phi_r$, $\phi_x$, and $\phi_\pi$. Without some degree of monetary policy inertia, ($\phi_r > 0$), this model is determinate and learnable, with the above calibrations, only if the Taylor principle holds, ($\phi_\pi > 1$), and the response to the output gap is modest, ($\phi_x \leq 0.5$). Insufficient or excessive responsiveness to either inflation or the output gap can in some instances lead to explosive instability or indeterminacy. Through simulation, BM establish the regions for the parameters that lead to determinacy as well as E-stability.

Table 1 shows our results. The table is broken into three panels. The upper panel, the rows marked (1) to (3), shows optimized rules. The second panel contains some results for the generic Taylor rule. Finally, the third panel shows our robust learnable rules. The next-to-last column of the table gives a measure of the total uncertainty that the PLM can tolerate under the cited policy. It is a measure of the maximal allowable deviation embodied in $1/\mu$.^22 The last column shows the loss as measured by (35).

Table 1: Standard and robust learnable rules

                                row   φ_x     φ_π     φ_r    radius^1   L^2
optimized rules:
  lagged data rule              (1)   0.052   0.993   1.13   1.07       3.679
  contemporaneous data rule     (2)   0.053   0.995   1.12   1.06       3.626
  forecast-based rule           (3)   0.286   0.999   1.32   0.88       3.628
standard rules:
  Taylor rule                   (4)   0.500   1.500   0      0.85       5.690
robust learnable rules:
  lagged data rule              (5)   0.065   0.40    1.10   1.16       3.712
  contemporaneous data rule     (6)   0.052   1.21    1.41   1.13       3.701
  forecast-based rule           (7)   0.040   2.80    0.10   2.32       4.434

1. Magnitude of the largest allowable perturbation, $r = \|W_1 \Delta W_2\|_\infty$.
2. Asymptotic loss, calculated according to eq. (35) in REE under the reference model.

Let us concentrate initially on our optimized rules along with the Taylor rule to provide some context for the robust learnable rules. The lagged data rule, shown in row (1), and the contemporaneous data rule, (2), are essentially the same. They both feature very small feedback on the output gap, and strong responses to inflation.
Moreover, they also feature funds rate persistence that amounts to a first-difference rule; that is, a rule where the dependent variable is $\Delta r$ rather than $r$. The forecast-based rule, in line (3), has much stronger feedback on the output gap, although proper interpretation of this requires noting that in equilibrium the expectation of future output gaps will always be smaller than actual gaps because of the absence of expected future shocks and the internalization of future policy in the formulation of that expectation. Thus, the response of the funds rate to the expected future gap will not be as large as the feedback coefficient alone might lead one to believe. These three rules confirm the received wisdom of monetary control in New Keynesian models, to wit: strong feedback on inflation, comparatively little on output, and strong persistence in funds rate setting. These rules are chosen to minimize the loss shown in the right-hand column of the table; the losses for all three are very similar, at a little over 3.6. The results for the Taylor rule demonstrate, indirectly, the oft-discussed advantages of persistence in funds rate setting for monetary control. Without such persistence, the Taylor rule produces losses that are substantially higher than those of the optimized rules.

Now let us turn to the robust learnable rules in the bottom panel of the table, concentrating for the moment on the lagged data and contemporaneous data rules shown in lines (5) and (6). The first thing to note is that the results confirm the efficacy of persistence in instrument setting. The robust learnable rules are at least as persistent, if persistence greater than unity is a meaningful concept, as the optimized rules. At the same time, while persistence is evidently useful for learnability, our results do not point to the hyper-persistence result, ($\phi_r \gg 1$), that BM hint at. To understand this outcome, it is important to realize that while our results are related to the BM results, there are conceptual differences. BM describe the range of policy-rule coefficients for which the model is learnable, taking as given the model. We are describing the range of policy coefficients that maximizes the range of models that are still learnable.

^22 For comparison of the trials with each other and also to give a sense of natural units related to the scalings we employed, the radius is calculated as the $H_\infty$ norm of the scaled perturbations to the PLM model: radius $= \|W_1 \Delta W_2\|_\infty$.
So while large values for $\phi_r$ are beneficial to learnability holding constant the model and its associated ALM, at some point, they come at a cost in terms of the perturbations that can be withstood in other dimensions.

Now let us look at the costs and benefits of these two rules in comparison with their optimized counterparts. We measure the benefits by comparing the radii of robustness from the column second from the right, for various rules. For the optimized, outcome-based rules, shown in the first two rows of the table, the radii are about 1.06 or so, while those of their robustified counterparts range from 1.13 to 1.16. Thus the improvement in robustness of learnability would appear to be moderate. Costs are inferred by comparing the losses shown in the right-hand column of the table. The results show that the cost of maximizing learnability measured in terms of foregone performance in the REE is very small. Evidently, learnability can be robustified, to some degree, without much of any concomitant loss in economic performance, at least in the canonical NKB model.

Before moving on to forecast-based rules, let us consider the classic Taylor rule shown in the fourth row. Recall that the Taylor rule has been advocated as a policy that is at least reasonably robust across a fairly wide range of models. Here, however, the radius associated with the Taylor rule is shown to be quite small at 0.85. At the same time, the performance of the rule in terms of loss is relatively weak. Thus, to the extent that we can take claims of the robustness of the Taylor rule with its original parameterization as applying to the issue of learnability, the rule would appear to come up a bit short.

Now let us examine the results for the forecast-based policy shown in the seventh row. Here the prescribed robust learnable policy is much different from the optimized rule shown in line (3). The robust rule essentially removes the policy persistence that the optimized policy calls for. The policy performance in the rational expectations equilibrium of the forecast-based robustly learnable rule is somewhat worse than its optimized counterpart, but notice that the radius of learnability is more than two and a half times that of the optimized rule (2.32 versus 0.88). While the superiority in terms of robustness of an (almost) non-inertial forecast-based rule is superficially at odds with Bullard and Mitra, the result really should not be all that surprising. Forecast-based rules leverage heavily the rational expectations aspects of the model, even more so than the contemporaneous and lagged data rules since there are rational expectations in the model itself and in the policy rule, and there is risk in leverage. The learnability of the economy is highly susceptible to misspecification in this area.
This is, of course, just a manifestation of the problem that Bernanke and Woodford [2] and others have warned about.

We can obtain a deeper understanding of the effects of a concern for robust learnability on policy design by examining how different calibrations of policy rules affect the allowable perturbations. The magnitude of perturbations that a given model can tolerate, conditional on a policy rule, is given by the radius. The radii for the rules shown in Table 1 are in the column second from the right. We can, however, provide a visualization of radii mapped against policy-rule coefficients and judge how policy affects robust learnability. Figure 3 provides one such visualization: contour maps of radii against the output-gap feedback coefficient, φ_x, and the inflation feedback coefficient, φ_π, in this case for the contemporaneous data rule. The third dimension of policy, the feedback on the lagged fed funds rate, φ_r, is being held constant in these charts, at zero in the upper panel and at unity in the lower. The colors of the chart index the radii of allowable perturbations for each rule, with the bar at the right-hand side showing the tolerance for misspecification. The area in deep blue, for example, represents policies with no tolerance for misspecification of the model or learning whatsoever, either because the rule fails to deliver E-stability in the first place, or because it is very fragile.

The sizable region of deep blue in the upper panel shows the area that violates the Taylor principle. To the right of the deep blue region, where φ_π > 1, we enter regions of green, where there is modest tolerance for misspecification that allows learnability. In general, with no interest-rate smoothing, there is little scope for misspecification.

Now let us look at the case where φ_r = 1 in the bottom panel. Now the region of deep blue is relegated to the very south-west of the chart, as is the region of green. To the north-east of those are expansive areas of higher tolerance for misspecification. Evidently, at least some measure of persistence in policy is useful for robustifying learnability. Notice how there is a deep burgundy sliver of fairly strong robustness in the north-east part of the panel.

Figure 4 continues the analysis for the contemporaneous data rule by showing contour charts for two more levels of φ_r. The upper panel shows the value for the rule that allows the maximum allowable perturbation as shown in line (6) of the table. With φ_r = 1.41 the burgundy region of highest robustness is at its largest and the policy rule shown in line (6) of the table is within that region. More generally, the areas of significant robustness, the redder regions, are collectively quite large. Finally, we go to the bottom panel of the figure, which shows the results for a relatively high level of φ_r. What has happened is that the regions shown in the top panel have rotated down and to the right as φ_r has risen. The burgundy region is now gone, and the red regions command much less space. Thus, while policy persistence is good for learnability, in terms of robustness of that result to misspecification, one can go too far.
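The link between the deep blue region in the upper panel of Figure 3 and the Taylor principle can be illustrated with a minimal sketch. It assumes a textbook New Keynesian model (an IS curve with interest sensitivity sigma and a Phillips curve with slope kappa and discount factor beta) closed with the contemporaneous data rule without smoothing (φ_r = 0), and it relies on the standard result that, under least-squares learning of the minimal-state-variable solution, E-stability requires the eigenvalues of the matrix multiplying expectations to have real parts below one. The calibration and all function names are hypothetical, and the model form may differ in detail from the one used in this paper. The margin computed here is only a crude indicator of E-stability; it is not the radius reported in the table, which comes from structured singular value analysis.

```python
import cmath

# Hypothetical calibration for illustration only (not taken from the paper).
BETA, KAPPA, SIGMA = 0.99, 0.024, 1.0 / 0.157


def expectations_matrix(phi_pi, phi_x, beta=BETA, kappa=KAPPA, sigma=SIGMA):
    """Matrix B in y_t = B E_t[y_{t+1}] + shocks, with y = (output gap, inflation).

    Assumed model: x_t  = E x_{t+1} - sigma * (i_t - E pi_{t+1})
                   pi_t = kappa * x_t + beta * E pi_{t+1}
                   i_t  = phi_pi * pi_t + phi_x * x_t   (no smoothing, phi_r = 0)
    """
    d = 1.0 + sigma * (phi_x + kappa * phi_pi)
    return [
        [1.0 / d, sigma * (1.0 - beta * phi_pi) / d],
        [kappa / d, (kappa * sigma + beta * (1.0 + sigma * phi_x)) / d],
    ]


def eigenvalues_2x2(m):
    """Eigenvalues of a 2x2 matrix from its trace and determinant."""
    trace = m[0][0] + m[1][1]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    root = cmath.sqrt(trace * trace - 4.0 * det)
    return (trace + root) / 2.0, (trace - root) / 2.0


def estability_margin(phi_pi, phi_x):
    """One minus the largest real part of eig(B); positive means E-stable."""
    eigs = eigenvalues_2x2(expectations_matrix(phi_pi, phi_x))
    return 1.0 - max(e.real for e in eigs)


def taylor_principle(phi_pi, phi_x, beta=BETA, kappa=KAPPA):
    """Long-run Taylor principle for the rule without smoothing."""
    return kappa * (phi_pi - 1.0) + (1.0 - beta) * phi_x > 0.0


if __name__ == "__main__":
    # Print the margin on a small grid of policy coefficients.
    for phi_pi in (0.75, 1.25, 1.75, 2.25):
        row = ["%+.3f" % estability_margin(phi_pi, phi_x) for phi_x in (0.0, 0.5, 1.0)]
        print("phi_pi=%.2f" % phi_pi, row)
```

In this sketch, for non-negative coefficients, the sign of the margin coincides with the condition κ(φ_π − 1) + (1 − β)φ_x > 0, which is consistent with the deep blue region lying largely to the left of φ_π = 1 when φ_r = 0. A positive margin says nothing about how much misspecification the rule can tolerate; that is what the radius measures.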

Figure 3: Contours of radii for the NKB model, contemporaneous data rule, selected φ_r

Figure 4: Contours of radii for the NKB model, contemporaneous data rule, selected φ_r

Figures 3 and 4 cover the case of the contemporaneous data rule. We turn now to forecast-based rules. The results here look quite different, but the underlying message is very much the same. As before, Figure 5 shows the results for low levels of persistence in policy setting. The upper panel shows the static forecast-based rule. The deep blue areas to the left of φ_π = 1 are areas of indeterminacy, as they were in Figure 3. There are, however, numerous blue "potholes" elsewhere in the panel. These are areas where the learnable equilibrium is feasible, but fragile.23 Notice, however, that these blue regions border very closely on burgundy regions where the allowable perturbations exceed 2; that is, the allowable perturbations are very large. The bottom panel shows contours covering the policy persistence level that is optimal, as shown in line (7) of Table 1. There are fewer potholes. The optimally robust policy is toward the top of this chart.

Finally, let us examine Figure 6. The top panel shows that a small increase in φ_r, from 0.10 to 0.12, reduces the number of potholes to nearly zero. The radii shown in the rest of the chart remain high, but the optimal policy is not in this region.24 The bottom panel of the chart shows the contours for a modest and conventional value of funds rate persistence, φ_r = 0.50. The potholes have now completely disappeared, but the large red region is less robust than the burgundy regions in the previous charts. Not shown in these charts are still higher levels of persistence. These involve still lower levels of robustness, with radii for φ_r > 1 that are less than half the magnitude of the maximum allowable perturbation for this rule.
Higher levels of persistence in policy setting are deleterious for robustification of model learnability in inflation-forecast-based policy rules.25

Of course these particular results are contingent on the relative weightings for perturbations, captured in W_1, and our selection is just one of many that could have been made. For the numerical experiments, the weightings were set equal to the standard deviations of the coefficients of a first-order VAR for the output gap, inflation, the interest rate, and the natural rate, estimated at the beginning of each experiment using recursive least squares. This approximates the private sector’s problem, and should give a rough idea of the relative uncertainties associated with the coefficients of the PLM. Whether using estimated standard deviations to scale the relative impact of Knightian model uncertainty on the learning mechanism is proper or desirable can be debated, of course. For now the salient point is that robustness of learning in the presence of model uncertainty is not the same thing as choosing the rule parameters for which the E-stable region of a given model is largest.

23 Since the degree of robustness is a function of the model’s eigenvalues, and those are non-linear functions of the parameters of the model and of the learning mechanism, it is not possible to identify the source of these potholes. That said, it isn’t necessary either. The idea behind the methods described in this paper is to avoid the pitfalls of nonparametric errors.

24 The presence of the "potholes" in the chart for φ_r = 0.10, wherein the optimally robust rule is found, and their near-absence in the chart for φ_r = 0.12 point to another concept of robustness. We assume the monetary authority knows the structural model. As a result, the economy cannot accidentally fall into one of the potholes shown in the figure. A worthwhile extension would be to allow the authority to have doubts about the structural parameters of the model, in addition to the learning mechanism. However, the current paper, the first in this area, is already ambitious enough and so we leave the issue for future research.

25 We tested φ_r up to nearly 20. What we found is that the radii fell as φ_r rose for intermediate levels, and then rose slowly again for φ_r ≫ 1. However, for no level of φ_r could we find radii that came anywhere close to the maximum allowable perturbation shown in row (7) of the table.

Figure 5: Contours of radii for the NKB model, forecast-based rule, selected φ_r

Figure 6: Contours of radii for the NKB model, forecast-based rule, selected φ_r

4 Concluding remarks

We have argued that model uncertainty is a serious issue in the design of monetary policy. On this score we are in good company. Many authors have advanced the view that minimizing a loss function subject to a given model presumed to be known with certainty is no longer best practice for monetary authorities. Central bankers must also take model uncertainty and learning into account. Where this paper differs from its predecessors is that we unify uncertainty about the learning mechanism used by private agents with the steps the monetary authority can take to address the problem. In particular, we examine a central bank that designs monetary policy to maximize the set of possible worlds in which ill-informed private agents need to learn about their particular world and still allow convergence on the rational expectations equilibrium. The motivation for this approach is straightforward: if economics as a profession cannot agree on what the true model of the economy is, it is a leap of faith to expect private agents to agree, coordinate, and find the REE by themselves.
Policy makers can play a role in facilitating (or frustrating) the process of learning the REE through the design of policy.

This paper begins from the premise that best practice for a monetary authority using simple instrument rules is to parameterize the rule so that it provides good performance not just in the steady state (that is, when convergence to REE has been achieved) but also in the out-of-equilibrium behavior of the system. We further argue that foremost among the considerations of what constitutes good out-of-equilibrium behavior of an uncertain system under learning should be the prospects for converging on an REE. In pursuit of this goal, this paper has married the literature on adaptive learning to that of structured robust control to examine what policy makers can do to facilitate learning. We have introduced some tools with which the questions that Bullard and Mitra [8] are asking can be broadened and generalized. More narrowly, we have also found that the warnings of Bernanke and Woodford [2] are well placed; inflation-forecast-based monetary policy rules do present dangers. We have also shown that the conclusion that Bullard and Mitra [8] point to is not as general as one might initially suppose.

Looking ahead, we see new research directions that broaden the scope of robustness by addressing a wider range of uncertainties against which a policy maker may wish to protect. Evans and McGough [21], for example, show how to compute Taylor-type rules that will converge on an REE in a variety of models, taking the learning mechanism as given, whereas we take the structural model as given and investigate the implications of misspecification of learning rules. A fusion of the two approaches would seem to be worth investigating.

References

[1] Batini, N. and Pearlman, J. (2002) "Too much too soon: instability and indeterminacy with forward-looking rules" unpublished manuscript, Bank of England.
[2] Bernanke, B. and Woodford, M. (1997) "Inflation forecasts and monetary policy" Journal of Money, Credit and Banking, 24: 653-684.
[3] Blanchard, O. and Kahn, C. (1980) "The solution of linear difference equations under rational expectations" Econometrica, 48: 1305-1311.
[4] Brainard, W. (1967) "Uncertainty and the effectiveness of monetary policy" American Economic Review, 57(2): 411-425.
[5] Bray, M. (1982) "Learning, estimation and the stability of rational expectations equilibria" Journal of Economic Theory, 26: 318-339.
[6] Bray, M. and Savin, N. (1986) "Rational expectations equilibria, learning and model specification" Econometrica, 54: 1129-1160.
[7] Bullard, J. and Eusepi, S. (2003) "Did the great inflation occur despite policymaker commitment to a Taylor rule?" Federal Reserve Bank of St. Louis working paper no. 2003-13.
[8] Bullard, J. and Mitra, K. (2003) "Determinacy, learnability and monetary policy inertia" Federal Reserve Bank of St. Louis working paper 2000-030A (revised version: March 2003).
[9] Bullard, J. and Mitra, K. (2002) "Learning about monetary policy rules" Journal of Monetary Economics, 49: 1105-1139.
[10] Cagan, P. (1956) "The monetary dynamics of hyperinflations" in M. Friedman (ed.) Studies in the Quantity Theory of Money (Chicago: University of Chicago Press).
[11] Clarida, Richard, Jordi Gali, and Mark Gertler (1998) "Monetary Policy Rules in Practice: Some International Evidence" European Economic Review, 42: 1033-1067.
[12] Clarida, Richard, Jordi Gali, and Mark Gertler (1999) "The Science of Monetary Policy: A New Keynesian Perspective" Journal of Economic Literature, 70: 807-824.
[13] Coenen, G. (2003) "Inflation persistence and robust monetary policy design" European Central Bank working paper no. 290 (November).
[14] Dahleh, M. and Diaz-Bobillo, I. (1995) Control of Uncertain Systems: A Linear Programming Approach (Englewood Cliffs, NJ: Prentice Hall).
[15] Doyle, J. "Analysis of feedback systems with structured uncertainties" IEEE Proceedings, 133 (part D, no. 2): 45-56.
[16] Ehrmann, M. and Smets, F. (2003) "Uncertain potential output: implications for monetary policy" Journal of Economic Dynamics and Control, 27: 1611-1638.
[17] Evans, G. and Honkapohja, S. (2001) Learning and Expectations in Macroeconomics (Princeton: Princeton University Press).
[18] Evans, G.W. and Honkapohja, S. (2002) "Monetary Policy, Expectations, and Commitment" unpublished manuscript, University of Oregon and Oregon State University (May).
[19] Evans, G. and Honkapohja, S. (2003) "Expectations and the stability for optimal monetary policies" Review of Economic Studies, 70: 807-824.
[20] Evans, G. and McGough, B. (2005a) "Monetary policy, indeterminacy and learning" Journal of Economic Dynamics & Control, 29: 1809-1840.
[21] Evans, G. and McGough, B. (2005b) "Optimal constrained monetary policy rules" unpublished manuscript, Department of Economics, University of Oregon. http://economics.uoregon.edu/papers/UO-2005-9_Evans_Optimal_Constrained.pdf
[22] Garratt, A. and Hall, S. (1997) "E-equilibria and adaptive expectations: output and inflation in the LBS model" Journal of Economic Dynamics and Control, 21: 87-96.
[23] Giordani, P. and Soderlind, P. (2004) "Solution of macromodels with Hansen-Sargent robust policies: some extensions" Journal of Economic Dynamics & Control, 28: 2367-2397.
[24] Giannoni, M.P. (2002) "Does model uncertainty justify caution?: model uncertainty in a forward-looking model" Macroeconomic Dynamics, 6(1): 111-144.
[25] Goodfriend, M. and King, R. (1997) "The new neo-classical synthesis and the role of monetary policy"
[26] Hansen, L. and Sargent, T. (2003) Misspecification in Recursive Macroeconomic Theory (unpublished monograph, November 2003).
[27] Hansen, L.P. and Sargent, T.J. (2003) "Robust control of forward-looking models" Journal of Monetary Economics, 50(3): 581-604.
[28] Levin, A.T., Wieland, V. and Williams, J.C. (1999) "Monetary policy rules under model uncertainty" in J.B. Taylor (ed.) Monetary Policy Rules (Chicago: University of Chicago Press).
[29] Levin, A.T., Wieland, V. and Williams, J.C. (1999) "The performance of forecast-based monetary policy rules under model uncertainty" American Economic Review, 93(2): 622-645.
[30] Lubik, T.A. and Schorfheide, F. (2003) "Computing sunspot equilibria in linear rational expectations models" Journal of Economic Dynamics and Control, 28(2): 273-285.
[31] Lubik, T.A. and Schorfheide, F. (2004) "Testing for indeterminacy: an application to U.S. monetary policy" American Economic Review, 94(1): 190-217.
[32] Marcet, A. and Nicolini, J.P. (2003) "Recurrent hyperinflations and learning" American Economic Review, 96: 1476-1498.
[33] Marcet, A. and Sargent, T.J. (1989) "Convergence of least squares learning mechanisms in self-referential linear stochastic models" Journal of Economic Theory, 48(2): 337-368.
[34] McCallum, B. (1983) "On non-uniqueness in rational expectations models: an attempt at perspective" Journal of Monetary Economics, 11: 134-168.
[35] Onatski, A. (2003) "Robust monetary policy under model uncertainty: incorporating rational expectations" unpublished manuscript, Columbia University.
[36] Onatski, A. and Stock, J. (2002) "Robust monetary policy under model uncertainty in a small model of the U.S. economy" Macroeconomic Dynamics
[37] Onatski, A. and Williams, N. (2003) "Modeling model uncertainty" Journal of the European Economic Association, 1: 1087-1122.
[38] Orphanides, A., Porter, R., Reifschneider, D., Tetlow, R., and Finan, F. (2000) "Errors in the measurement of the output gap and the design of monetary policy" Journal of Economics and Business, 52(1/2): 117-141.
[39] Rotemberg, J. and Woodford, M. (1997) "An optimization-based econometric framework for the evaluation of monetary policy" in B. Bernanke and J. Rotemberg (eds.) NBER Macroeconomics Annual (Cambridge, MA: MIT Press): 297-345.
[40] Sack, B. (1999) "Does the fed act gradually: a VAR analysis" Journal of Monetary Economics, 46(1): 229-256.
[41] Sargent, T. (1999) "Comment" in J.B. Taylor (ed.) Monetary Policy Rules (Chicago: University of Chicago Press): 144-154.
[42] Soderstrom, U. (2002) "Monetary policy with uncertain parameters" Scandinavian Journal of Economics, 104(1): 125-145.
[43] Taylor, J.B. (1993) "Discretion versus policy rules in practice" Carnegie-Rochester Conference Series on Public Policy, 39: 195-214.
[44] Tetlow, R. and von zur Muehlen, P. (2001) "Simplicity versus optimality: the choice of monetary policy rules when agents must learn" Journal of Economic Dynamics and Control, 25(1/2): 245-279.
[45] Tetlow, R. and von zur Muehlen, P. (2001) "Robust monetary policy with misspecified models: does model uncertainty always call for attenuated policy?" Journal of Economic Dynamics and Control, 25(6/7): 911-949.
[46] Tetlow, R. and von zur Muehlen, P. (2004) "Avoiding Nash Inflation: Bayesian and robust responses to model uncertainty" Review of Economic Dynamics, 7(4) (October): 869-899.
[47] Walsh, C.E. (2004) "Parametric misspecification and robust monetary policy rules" unpublished manuscript, University of California, Santa Cruz.
[48] Woodford, M. (1999) "Optimal Monetary Policy Inertia" NBER working paper no. 7261 (July 1999).
[49] Woodford, M. (2003) Interest and Prices: Foundations of a theory of monetary policy (Princeton: Princeton University Press).
[50] Zames, G. (1966) "On the input-output stability of nonlinear time-varying feedback systems, parts I and II" IEEE Transactions on Automatic Control, AC-11: 228, 465.
[51] Zhou, K., Doyle, J.C., and Glover, K. (1996) Robust and Optimal Control (Englewood Cliffs, NJ: Prentice-Hall).
[52] Zhou, K., with Doyle, J.C. (1998) Essentials of Robust Control (Englewood Cliffs, NJ: Prentice-Hall).

Cite this document

APA

Robert J. Tetlow and Peter von zur Muehlen (2005). Robustifying Learnability (FEDS 2005-58). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2005-58
