feds · March 31, 2006

Real-time Model Uncertainty in the United States: The Fed from 1996-2003

Abstract

We study 30 vintages of FRB/US, the principal macro model used by the Federal Reserve Board staff for forecasting and policy analysis. To do this, we exploit archives of the model code, coefficients, baseline databases and stochastic shock sets stored after each FOMC meeting from the model's inception in July 1996 until November 2003. The period of study was one of important changes in the U.S. economy with a productivity boom, a stock market boom and bust, a recession, the Asia crisis, the Russian debt default, and an abrupt change in fiscal policy. We document the surprisingly large and consequential changes in model properties that occurred during this period and compute optimal Taylor-type rules for each vintage. We compare these optimal rules against plausible alternatives. Model uncertainty is shown to be a substantial problem; the efficacy of purportedly optimal policy rules should not be taken on faith. We also find that previous findings that simple rules are robust to model uncertainty may be an overly sanguine conclusion.

Finance and Economics Discussion Series
Divisions of Research & Statistics and Monetary Affairs
Federal Reserve Board, Washington, D.C.

Robert J. Tetlow and Brian Ironside

2006-08

NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

Robert J. Tetlow and Brian Ironside, Federal Reserve Board. December 2005.

JEL Classifications: E37, E5, C5, C6.

Keywords: monetary policy, uncertainty, real-time analysis.

* Contact address: Robert Tetlow, Federal Reserve Board, Washington, D.C. 20551. Email: rtetlow@frb.gov. The authors acknowledge the helpful comments of Richard Dennis, Spencer Krane, Ed Nelson, Lucrezia Reichlin, David Romer, Glenn Rudebusch, Pierre Siklos, Ellis Tallman, Daniele Terlizzese, Simon van Norden, Peter von zur Muehlen, John C. Williams, Tony Yates and seminar participants at the Federal Reserve Board, the European Central Bank, the Bank of England, the FRB-SF and the Bank of Canada. Special thanks to Dave Reifschneider for thoughtful, detailed comments and much patience. We thank Flint Brayton for helping us interpret the origins of many of the model changes, and Douglas Battenberg for help getting the FRB/US model archives in working order. Part of this research was conducted while the first author was visiting the Bank of England and the San Francisco Fed; he thanks those institutions for their hospitality. All remaining errors are ours. The views expressed in this paper are those of the authors alone and do not represent those of the Federal Reserve Board or other members of its staff.

1. Introduction

Policy makers face a formidable problem. They must decide on a policy notwithstanding considerable ambiguity about the proper course of action. Monetary policy makers in particular need to make decisions on a timely basis in an environment where the data are rarely authoritative on the state of the world. For guidance, they turn to models, but models too have their foibles. Both in academia and within central banks, the models in use today differ substantially from those of yesteryear. The policy prescriptions that come from these models also differ, and often in ways that could have important consequences for economic outcomes.

This paper considers, measures and evaluates real-time model uncertainty in the United States. In particular, we study 30 vintages of the Board of Governors' workhorse macroeconomic model, FRB/US, that were used extensively for forecasting and policy analysis at the Fed from the model's inception in July 1996 until November 2003. To do this, we exploit archives of the model code, coefficients, databases and stochastic shock sets for each vintage. The choice of the model is not incidental: by working with the FRB/US model, we isolate the policy issues and forecast outcomes that actually affected the Fed staff's modeling decisions over time.

The period of study was one of remarkable change in the U.S. economy with a productivity boom, a stock market boom and bust, a recession, the Asia crisis, the Russian debt default, corporate governance scandals and an abrupt change in fiscal policy. There were also 23 changes in the intended federal funds rate, 7 increases and 16 decreases.

Armed with this archive, we do four things. First, we examine the real-time data. Second, we document the changes in the model properties (a surprisingly large and consequential set, it turns out) and identify the economic events that contributed to them. Third, we compute optimal Taylor-type rules for each vintage. And fourth, we compare the performance of these ex ante optimal rules against alternative rules, including an ex post optimal rule and the original Taylor (1993) specification. From this, we draw conclusions about model uncertainty and its implications for policy design. It turns out that model uncertainty is quantitatively large and important, even over the short period studied here. In this regard, our findings are consistent with those of Sargent, Williams and Zha (2005), although our approach is very different.

This exercise goes a number of steps beyond previous contributions to the literature. One technique often used to study model uncertainty is the rival models method, where, following the suggestion of McCallum (1988), a candidate policy rule is assessed for its performance across an entire set of rival models. The limitation is that it is far from obvious how to settle on the set of rival models. To date, the literature has used alternative models of relatively abstract economies compared in a laboratory environment. As a result, Levin et al. (1999), for example, were susceptible to the criticism of Christiano and Gust (1999) that their analysis of the robustness properties of simple Taylor-type rules was undermined by the similarity of the models they chose.1 Our real-time analysis avoids this problem by basing the rival models on the decisions of the Board of Governors' staff, conditional on the issues and questions that the staff faced. Thus the set is a plausible one.2

The current paper also goes beyond the literature on parameter uncertainty. That literature assumes that parameters are random but the model is fixed over time; misspecification is simply a matter of sampling error.3 Model uncertainty is a thornier problem, in large part because it often does not lend itself to statistical methods of analysis. We explicitly allow the models to change over time in response not just to the data but to the economic issues of the day.4 Lastly, and most important, the analysis we provide derives from models that were actually used to advise on monetary policy decisions. To the best of our knowledge, no one has ever done this before.

1 Levin et al. (1999) use four models as rivals, but all were New Keynesian linear rational expectations models with wage-price (or Phillips curve) mechanisms that run off of output gaps. Williams (2003) demonstrates that linear rational expectations models are very forgiving of perturbations in policy rules in the sense that a deviation from the optimized coefficients of a Taylor-type rule does not substantially change policy outcomes provided that the model under control is still stable, a property not shared by models with little or no rationality of expectations. This, plus the similarity of the monetary policy mechanisms in the four models, limits the applicability of Levin et al. (1999) to broader environments, as Levin and Williams (2003) show.

2 Robust control theory is also sometimes advocated; see, e.g., Hansen and Sargent (2005), Giannoni (2002), Onatski (2001) and Tetlow and von zur Muehlen (2001). In this instance, the policy maker seeks to protect against a worst-case outcome to misspecification in the neighborhood of a reference model. The difficulty in this instance is in specifying the neighborhood.

3 There is an extensive literature on the design of monetary policy under uncertainty. Most of it deals with data or parameter uncertainty. Techniques for handling these issues are now well known; see, e.g., Svensson and Woodford (2003) and references therein. In most instances, the optimal response is either certainty equivalence or attenuation of policy responses relative to the certainty equivalent policy reaction. A counterexample to the usual attenuation case is Söderström (2002).

4 There have been a number of valuable contributions to the real-time analysis of monetary policy issues. Most are associated with data and forecasting. See, in particular, the work of Croushore and Stark (2001) and a whole conference on the subject, details of which can be found at http://www.phil.frb.org/econ/conf/rtdconfpapers.html. An additional, deeper layer of real-time analysis considers revisions to unobservable state variables, such as potential output; on this see Orphanides et al. (2000) and Orphanides (2001). See also Giannone et al. (2005) for a sophisticated, real-time analysis of the history of FOMC behavior.

The rest of this paper proceeds as follows. The second section begins with a discussion of the FRB/US model in generic terms, and the model's historical archives. The third section compares model properties by vintage. To do this, we document changes in real-time "model multipliers" and compare them with their ex post counterparts. The succeeding section computes optimized Taylor-type rules and compares these to commonly accepted alternative policies in a stochastic environment. The fifth section examines the stochastic performance of candidate rules for two selected vintages, the February 1997 and November 2003 models. A sixth and final section sums up and concludes.

2. Thirty vintages of the FRB/US model and the data

2.1. The real-time data

In describing model uncertainty, it pays to start at the beginning; in present circumstances, the beginning is the data. It is the data, and the staff's view of those data, that determined how the first vintage of FRB/US was structured. And it is the surprises from those data and how they were interpreted as the series were revised and extended with each successive vintage that conditioned the model's evolution. To that end, in this subsection we examine key data series by vintage. We also provide some evidence on the model's forecast record during the period of interest. And we reflect on the events of the time, the shocks they engendered, and the revisions to the data. Our treatment of the subject is subjective (it comes, in part, from the archives of the FRB/US model) and incomplete. It is beyond the scope of this part of the paper to provide a comprehensive survey of data revisions over the period from 1996 to 2003; fortunately, however, Anderson and Kliesen (2005) provide just such a summary and we borrow in places from their work.

Figure 2.1 shows the four-quarter growth rate of the GDP price index, for selected vintages.
(Note that we show only real-time historical data because of rules forbidding the publication of FOMC-related data more recent than the last five years.) The inflation rate moves around some, but the various vintages for the most part are highly correlated. In any event, our reading of the literature is that data uncertainty, narrowly defined to include revisions of published data series, is not a first-order source of problems for monetary policy design; see, e.g., Croushore and Stark (2001).

Figure 2.1: Real-time 4-quarter GDP price inflation (selected vintages)

Figure 2.2 shows the more empirically important case of model measures of growth in potential non-farm business output.5 Unlike the case of inflation, potential output growth is a latent variable, the definition and interpretation of which depend on model concepts. What this means is that the historical measures of potential are themselves a part of the model, so we should expect significant revisions.6 Even so, the magnitudes of the revisions shown in Figure 2.2 are truly remarkable. The July 1996 vintage shows growth in potential output of about 2 percent. For the next several years, succeeding vintages show both higher potential output growth rates and more responsiveness to the economic cycle. By January 2001, growth in potential was estimated at over 5 percent for some dates, before subsequent changes resulted in a path that was lower and more variable.

5 More precisely, we adjust potential non-farm business output, where the adjustment is to exclude owner-occupied housing and to include oil imports. This makes output conformable with the model's production function, which includes oil as a factor of production. Henceforth it should be understood that all references to productivity or potential output are to the concept measured in terms of adjusted non-farm business output.

6 Defined in this way, data uncertainty does not include uncertainty in the measurement of latent variables, like potential output. The important conceptual distinction between the two is that eventually one knows what the final data series is (what "the truth" is) when dealing with data uncertainty. One never knows, even long after the fact, what the true values of latent variables are. Latent variables are more akin to parameter uncertainty than data uncertainty. On this, see Orphanides et al. (2000) and Orphanides (2001).

Figure 2.2: Real-time 4-quarter non-farm potential business output growth

Why might this be? Table 1 reminds us about how extraordinary the late 1990s were. The table shows selected FRB/US model forecasts for the four-quarter growth in real GDP, on the left-hand side of the table, and PCE price inflation, on the right-hand side, for the period for which public availability of the data is not restricted.7 The table shows the substantial underprediction of GDP growth over most of the period, together with errors in the PCE inflation forecasts that, in most years, ran in the opposite direction.

7 A record such as the one in the table was not unusual during this period; the Survey of Professional Forecasters similarly underpredicted output growth. Tulip (2005) documents how the official Greenbook forecast exhibited a similar pattern of forecast errors.

Table 1
Four-quarter growth in real GDP and PCE prices: selected FRB/US model forecasts

                        Real GDP                        PCE prices
Forecast date   forecast   data   data - forecast*   forecast   data   data - forecast*
July 1996          2.2      4.0         1.8             2.3      1.9        -0.4
July 1997          2.0      3.5         1.5             2.4      0.7        -1.6
Aug. 1998          1.7      4.1         2.4             1.5      1.6         0.1
Aug. 1999          3.2      5.3         2.1             2.2      2.5         0.3
Aug. 2000          4.5      0.8        -3.7             1.8      1.5        -0.3

* Four-quarter growth forecasts from the vintage of the year shown; e.g., for GDP in July 1996, forecast = 100*(GDP[1997:Q2]/GDP[1996:Q2] - 1), compared against the "first final" data contained in the database two forecasts hence. So for the same example, the first final is from the November 1997 model database.

The most recent historical measures shown in Figure 2.2 are for the August 2002 vintage, where the path for potential output growth differs in two important ways from the others. The first way is that it is the only series shown that is less optimistic than earlier ones. In part, this reflects the onset of the 2001 recession. The second way the series differs is in its volatility over time. This is a manifestation of the ongoing evolution of the model in response to emerging economic conditions. In its early vintages, the modeling of potential output in FRB/US was traditional for large-scale econometric models, in that trend labor productivity and trend labor input were based on exogenous split time trends. In essence, the model took the typical Keynesian view that nearly all shocks affecting aggregate output were demand-side phenomena. Then, as underpredictions of GDP growth were experienced, without concomitant underpredictions in inflation, these priors were updated. The staff began adding model code to allow the supply side of the model to respond to output surprises by projecting forward revised profiles for productivity growth.
What had been an essentially deterministic view of potential output was evolving into a stochastic one.8

8 Some details on this evolution of thought are provided in the Appendix.

Further insight on the origins and persistence of these forecast errors can be gleaned from Figure 2.3 below, which focuses attention on a single year, 1996, and shows forecasts and "actual" four-quarter GDP growth, non-farm business potential output growth, and PCE inflation for that year. Each date on the horizontal axis corresponds with a database, so that the first observation on the far left of the black line is what the FRB/US model database for the 1996:Q3 (July) vintage showed for four-quarter GDP growth for 1996. (The black line is broken over the first two observations to indicate that some observations for 1996 were forecast data at the time; after the receipt of the advance release of the NIPA for 1996:Q4 on January 31, 1997, the figures are treated as data.) Similarly, the last observation of the same black line shows what the 2005:Q4 database has for historical GDP growth in 1996, given current concepts and measures. The black line shows that the model predicted GDP growth of 2.2 percent for 1996 as of July 1996; when the first final data for 1996:Q4 were released on January 31, 1997, GDP growth for the year was 3.1 percent, a sizable forecast error of 0.8 percentage points. It would get worse. The black line shows that GDP growth was revised up in small steps and large jumps right up until late in 2003 and now stands at 4.4 percent; so by the (unfair) metric of current data, the forecast error from the July 1996 projection is a whopping 2.2 percentage points.

Given the long climb of the black line, the revisions to potential output growth shown by the red line seem explicable, at least until about 2000. After that point, the emerging recession resulted in wholesale revisions of potential output growth going well back into history. The blue line shows that there was a revision in PCE inflation that coincided with substantial changes in both actual GDP and potential, in 1998:Q3. This reflects the annual revision of the NIPA data and with it some updates in source data.9 Comparing the black line, which represents real GDP growth, with the red line, which measures potential output growth, shows clearly the powerful influence that data revisions had on the FRB/US measures of potential.
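The arithmetic behind Table 1 and the 1996 example can be sketched in a few lines. This is only an illustration: the function names are ours, the levels passed to the growth function would be hypothetical, and the forecast and data values are simply the real GDP columns of Table 1 as printed (so the recomputed errors are subject to the same rounding as the table).

```python
def four_quarter_growth(level_now, level_year_ago):
    """Four-quarter growth rate in percent: 100 * (X[t] / X[t-4] - 1)."""
    return 100.0 * (level_now / level_year_ago - 1.0)


def forecast_error(data, forecast):
    """Forecast error as 'data - forecast', in percentage points."""
    return data - forecast


# Real GDP rows of Table 1: (forecast date, forecast, first-final data).
TABLE1_GDP = [
    ("July 1996", 2.2, 4.0),
    ("July 1997", 2.0, 3.5),
    ("Aug. 1998", 1.7, 4.1),
    ("Aug. 1999", 3.2, 5.3),
    ("Aug. 2000", 4.5, 0.8),
]

# Recompute the 'data - forecast' column, rounded to one decimal place.
gdp_errors = {date: round(forecast_error(data, fcst), 1)
              for date, fcst, data in TABLE1_GDP}

# Number of vintages in which GDP growth was underpredicted (error > 0).
n_underpredicted = sum(1 for err in gdp_errors.values() if err > 0)
```

A positive error means the forecast was too low; in the table as printed, that is the case for every real GDP row except the August 2000 forecast.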
Despite the volatility of potential output growth, the resulting output gaps, shown in Figure 2.4, show considerable covariation, albeit with non-trivial revisions. This observation underscores the sometimes underappreciated fact that resource utilization (that is, output gaps or unemployment) is not the sole driver of fluctuations in inflation; other forces are also at work, including trend productivity, which affects unit labor costs, and relative price shocks such as those affecting food, energy and non-oil import prices.

9 There were methodological changes to expenditures and prices of cars and trucks; improved estimates of consumer expenditures on services; new methods of computing changes in business inventories; and some expenditures on software by businesses were removed from business fixed investment and reclassified as expenses.

Figure 2.3: 4-quarter growth in 1996 for selected variables by vintage (series: real GDP, NFB potential, PCE inflation; horizontal axis: database vintage, 1996 to 2005)

Figure 2.4: Real-time GDP output gaps (selected vintages)

2.2. Description of the FRB/US model

The FRB/US model came into production in July 1996 as a replacement for the venerable MIT-Penn-SSRC (MPS) model that had been in use at the Board of Governors for many years. The main objectives guiding the development of the model were that it be useful for both forecasting and policy analysis; that expectations be explicit; that important equations represent the decision rules of optimizing agents; that the model be estimated and have satisfactory statistical properties; and that the full-model simulation properties match the "established rules of thumb regarding economic relationships under appropriate circumstances" as Brayton and Tinsley (1996, p. 2) put it.

To address these challenges, the staff included within the FRB/US model a specific expectations block, and with it, a fundamental distinction between intrinsic model dynamics (dynamics that are immutable to policy) and expectational dynamics (which policy can affect). In most instances, the intrinsic dynamics of the model were designed around representative agents choosing optimal paths for decision variables facing adjustment costs.10 Ignoring asset pricing equations, for which adjustment costs were assumed to be negligible, a generic model equation would look something like:

$$\Delta x = \alpha(L)\Delta x + E_t \beta(F)\Delta x^{*} + c(x_{t-1} - x^{*}_{t-1}) + u_t \qquad (1)$$

where $\alpha(L)$ is a polynomial in the lag operator, i.e., $\alpha(L)z = a_0 + a_1 z_{t-1} + a_2 z_{t-2} + \ldots$, and $\beta(F)$ is a polynomial in the lead operator. The term $\Delta x^{*}$ is the expected change in the target level of the generic decision variable, $x$; $c(\cdot)$ is an error-correction term; and $u$ is a residual. In general, the theory behind the model will involve cross-parameter restrictions on $\alpha(L)$, $\beta(F)$ and $c$. The point to be taken from equation (1) is that decisions today for the variable, $x$, will depend in part on past values and expected future values, with an eye on bringing $x$ toward its desired value, $x^{*}$, over time.

10 The model introduced the notion of polynomial adjustment costs, a straightforward generalization of the well-known quadratic adjustment costs, which allowed, for example, the flow of investment to be costly to adjust, and not just the capital stock. This idea, controversial at the time, has recently been adopted in the broader academic community; see, e.g., Christiano, Eichenbaum and Evans (2005).

From the outset, FRB/US has been a significantly smaller model than was MPS, but it is still quite large. At inception, it contained some 300 equations and identities, of which perhaps 50 were behavioral. About half of the behavioral equations in the first vintage of the model were modeled using formal specifications of optimizing behavior.11 Among the identities are the expectations equations. Two versions of expectations formation were envisioned: VAR-based expectations and perfect foresight. The concept of perfect foresight is well understood, but VAR-based expectations probably require some explanation. In part, the story has the flavor of the Phelps-Lucas "island paradigm": agents live on different islands where they have access to a limited set of core macroeconomic variables, knowledge they share with everyone in the economy. The core macroeconomic variables are the output gap, the inflation rate and the federal funds rate, as well as beliefs on the long-run target rate of inflation and what the equilibrium real rate of interest will be in the long run. These variables comprise the model's core VAR expectations block. In addition, agents have information that is germane to their island, or sector. Consumers, for example, augment their core VAR model with information about potential output growth and the ratio of household income to GDP, which forms the consumer's auxiliary VAR. Two important features of this set-up are worth noting. First, the set of variables agents are assumed to use in formulating forecasts is restricted to a set that is much smaller than under rational expectations. Second, agents are allowed to update their beliefs, but only in a restricted way.
In particular, for any given vintage, the coefficients of the VARs are taken as fixed over time, while agents' perceptions of long-run values for the inflation target and the equilibrium real interest rate are continually updated using simple learning rules.12 By definition, under perfect-foresight expectations, the information set is broadened to include all the states in the model with all the cross-equation restrictions implied by the model.

11 That is, polynomial adjustment costs in price and volume decision rules. In financial markets, intrinsic adjustment costs were assumed to be zero.

12 This idea has been articulated and extended in a series of papers by Kozicki and Tinsley. See, e.g., their (2001) article.
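To make the error-correction logic of the generic equation (1) concrete, the following is a minimal sketch, not the FRB/US specification. It assumes a single lag in the lag polynomial, switches off the expectational term (the lead polynomial is set to zero), holds the target fixed, sets the residual to zero, and uses illustrative coefficient values; all names and numbers are ours. With the gap written as the variable minus its target, the error-correction coefficient must be negative for the gap to close.

```python
def simulate_error_correction(x0, x_star, a1=0.3, c=-0.2, periods=40):
    """Simulate dx_t = a1*dx_{t-1} + c*(x_{t-1} - x_star), with u_t = 0.

    Simplifying assumptions (not from the paper): one lag in alpha(L),
    beta(F) = 0 (no expectational term), a constant target x_star, and
    illustrative values for a1 and c. With c < 0 the gap closes over time.
    """
    path = [x0]
    dx_prev = 0.0
    for _ in range(periods):
        gap = path[-1] - x_star          # x_{t-1} - x*_{t-1}
        dx = a1 * dx_prev + c * gap      # intrinsic dynamics + error correction
        path.append(path[-1] + dx)
        dx_prev = dx
    return path


# Start one unit below target and let the variable adjust.
path = simulate_error_correction(x0=0.0, x_star=1.0)
```

Under these assumed coefficients the variable approaches its target gradually rather than jumping to it, which is the sense in which adjustment costs generate intrinsic dynamics.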

In this paper, we will be working exclusively with the VAR-based expectations version of the model. Typically it is the multipliers of this version of the model that are reported to Board members when they ask "what if" questions. This is the version that is used for forecasting and most policy analysis by the Fed staff, including, as Svensson and Tetlow (2005) demonstrate, policy optimization experiments.13 Thus, the pertinence of using this version of the model for the question at hand is unquestionable. What might be questioned, on standard Lucas-critique grounds, is the validity of the Taylor-rule optimizations carried out below. However, the period under study is one entirely under the leadership of a single Chairman, and we are aware of no evidence to suggest that there was a change in regime during this period. So as Sims and Zha (2004) have argued, it seems likely that the perturbations to policies encompassed by the range of policies studied below are not large enough to induce a change in expectations formation. Moreover, in an environment such as the one under study, where changes in the non-monetary part of the economy are likely to dwarf the monetary-policy perturbations, it seems safe to assume that private agents were no more rational with regard to their anticipations of policy than the Fed staff was about private-sector decision making.14 In their study of the evolution of the Fed's beliefs over a longer period of time, Romer and Romer (2002) ascribe no role to the idea of rational expectations. Finally, what matters for this real-time study is that it is certainly the case that the Fed staff believed that expectations formation, as captured in the model's VAR-expectations block, could be taken as given and thus policy analyses not unlike those studied here were carried out. Later on we will have more to say about the implications of assuming VAR-based expectations for our results and those in the rest of the literature.
There is not the space here for a complete description of the model, a problem that is exacerbated by the fact that the model is a moving target. Readers interested in detailed descriptions of the model are invited to consult papers on the subject, including Brayton and Tinsley (1996), Brayton, Levin, Tryon and Williams (1997), and Reifschneider, Tetlow and Williams (1999). However, before leaving this section it is important to note that the structure of macroeconomic models at the Fed has always responded to economic events and the different questions that those events evoke, even before FRB/US. Brayton, Levin, Tryon and Williams (1997) note, for example, how the presence of financial market regulations meant that for years a substantial portion of the MPS model dealt specifically with mortgage credit and financial markets more broadly. The repeal of Regulation Q induced the elimination of much of that detailed model code. Earlier, the oil price shocks of the 1970s and the collapse of Bretton Woods gave the model a more international flavor than it had previously. We shall see that this responsiveness of models to economic conditions and questions continued with the FRB/US model in the 1990s.

13 More recently, the model section has added to its repertoire optimal control policy experiments conducted on a version of the model with rational expectations in asset prices.

14 Stochastic simulation and optimization of a large-scale non-linear rational expectations model is a Herculean task. In any case, a complete set of archives of the perfect-foresight version of the model is not available.

The key features influencing the monetary policy transmission mechanism in the FRB/US model are the effects of changes in the funds rate on asset prices and from there to expenditures. Philosophically, the model has not changed much in this area: all vintages of the model have had expectations of future economic conditions in general, and the federal funds rate in particular, affecting long-term interest rates and inflation. From this, real interest rates are determined and this in turn affects stock prices and exchange rates, and from there, real expenditures. Similarly, the model has always had a wage-price block, with the same basic features: sticky wages and prices, expected future excess demand in the goods and labor markets influencing price and wage setting, and a channel through which productivity affects real and nominal wages. That said, as we shall see, there have been substantial changes over time in both (what we may call) the interest elasticity of aggregate demand and the effect of excess demand on inflation.

Over the years, equations have come and gone in reflection of the needs, and data, of the day.
The model began with an automotive sector but this block was later dropped. Business fixed investment was originally disaggregated into just non-residential structures and producers' durable equipment, but the latter is now disaggregated into high-tech equipment and "other". The key consumer decision rules and wage-price block have undergone frequent modification over the period. On the other hand, the model has always had equations for consumer non-durables and services, consumer durables expenditures, and housing. There has always been a trade block, with aggregate exports and non-oil and oil imports, and equations for foreign variables. The model has always had a three-factor, constant-returns-to-scale Cobb-Douglas production function with capital, labor hours and energy as factor inputs.

2.3. The archive and the data

Since its inception in July 1996, the FRB/US model code, the equation coefficients, the baseline forecast database, and the list of stochastic shocks with which the model would be stochastically simulated have all been stored for each of the eight forecasts the Board staff conducts every year. It is releases of National Income and Product Accounts (NIPA) data that typically induce reassessments of the model, so we elected to use four archives per year, or 30 in total, the ones immediately following NIPA preliminary releases.15

In what follows, we experiment with each vintage of the model, comparing their properties in selected experiments. Consistent with the real-time philosophy of this endeavor, the experiments we choose are typical of those used to assess models by policy institutions in general and the Federal Reserve Board in particular. They fall into two broad classes. One set of experiments, model multipliers, attempts to isolate the behavior of particular parts of the model. A multiplier is the response of a key endogenous variable to an exogenous shock after a fixed period of time. An example is the response of the unemployment rate after eight quarters to a persistent increase in the federal funds rate. We shall examine several such multipliers. The other set of experiments judges the stochastic performance of the model and is designed to capture the full-model properties under fairly general conditions.
So, for example, we will compute by stochastic simulation the optimal coefficients of a Taylor rule, conditional on a model vintage, a baseline database, and a set of stochastic shocks.16 We will then compare these optimal rules with other alternative rules and indeed other alternative worlds defined by the set of our model vintages.

15 Nothing of importance is lost from the analysis by excluding every second vintage from consideration. The archives are listed by the precise date of the FOMC meeting in which the forecasts were discussed. For our purposes, we do not need to be so precise so we shall describe them by month and year. Thus, the 30 vintages we use are, in 1996: July and November; in 1997: February, May, July, and November; in 1998 through 2000: February, May, August and November; and in 2001 through 2003: January, May, August and November.

16 Each vintage has a list of variables that are shocked using bootstrap methods for stochastic simulations. The list of shocks is a subset of the model's complete set of residuals since other residuals are treated not as shocks but rather as measurement error. The precise nature of the shocks will vary according to data construction and the period over which the shocks are drawn.
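As a rough illustration of what a multiplier experiment involves, the sketch below computes the eight-quarter response of unemployment to a persistent 100-basis-point increase in the funds rate. The one-equation law of motion and every parameter value (`rho`, `kappa`, the neutral rate, the starting values) are assumptions made up for this example; they are not taken from FRB/US.

```python
def simulate_unemployment(funds_rate_path, rho=0.8, kappa=0.05,
                          u_init=5.0, u_nat=5.0, r_neutral=4.0):
    """Toy law of motion (an assumption, not FRB/US): the unemployment gap
    decays at rate rho and rises with last quarter's funds-rate gap."""
    path = [u_init]
    for r in funds_rate_path:
        path.append(u_nat + rho * (path[-1] - u_nat) + kappa * (r - r_neutral))
    return path


def funds_rate_multiplier(horizon=8, shock=1.0, baseline_rate=4.0):
    """Unemployment response, `horizon` quarters out, to a persistent
    funds-rate shock, measured as the shocked path minus the baseline path."""
    baseline = simulate_unemployment([baseline_rate] * horizon)
    shocked = simulate_unemployment([baseline_rate + shock] * horizon)
    return shocked[horizon] - baseline[horizon]


# Response after eight quarters to a persistent 100-basis-point increase.
multiplier = funds_rate_multiplier()
```

Because this toy model is linear, the multiplier does not depend on the baseline; in a nonlinear model the same experiment started from different initial conditions would generally give different answers.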

Model multipliers have been routinely reported to and used by members of the FOMC. Indeed, the model's sacrifice ratio (about which we will have more to say below) was used in the very first FOMC meeting following the model's introduction.17 Similarly, model simulations of alternative policies have been carried out and reported to the FOMC in a number of memos and official FOMC documents.18

The archives document model changes and provide a unique record of model uncertainty. As we shall see, the answers to questions a policy maker might ask differ depending on the vintage of the model. The seemingly generic issue of the output cost of bringing down inflation, for example, can be subdivided into several more precise questions, including: (i) what would the model say is the output cost of bringing down inflation today?; (ii) what would the model of today say the output cost of bringing down inflation would have been in February 1997?; and (iii) what would the model have said in February 1997 was the output cost of disinflation at that time? These questions introduce a time dependency to the issue that rarely appears in other contexts.

The answers to these and other related questions depend on the model vintage. Here, however, the model vintage means more than just the model alone. Depending on the question, the answer can depend on the baseline; that is, on the initial conditions from which a given experiment is carried out. It can also depend on the way an experiment is carried out, and in particular on the policy rule that is in force. And since models are evaluated in terms of their stochastic performance, it can depend on the stochastic shocks to which the model is subjected to judge the appropriate policy and to assess performance. So in the most general case, model uncertainty in our context comes from four interrelated sources: model, policy rule, baseline and shocks.
How much model variability can there be over a period of just eight years? The answer is a surprisingly large amount. But to provide a specific answer, let us begin with the data.

17 The transcript of the July 2-3, 1996 FOMC meeting (p. 42) quotes then Fed Governor Janet Yellen: "The sacrifice ratio in our new FRB-US model, without credibility effects, is 2.5..." Based on this figure and other arguments, Gov. Yellen spearheaded a discussion of what long-run target rate of inflation the FOMC might wish to achieve. Yellen is now President of the Federal Reserve Bank of San Francisco.

18 The Board staff present their analysis of recent history, the staff forecast and alternative simulations, the latter using the FRB/US model, in the Greenbook. The FOMC also receives detailed analysis of policy options in the Bluebook. Alternative policy simulations are typically carried out using the FRB/US model. In addition, for the FOMC's semi-annual two-day meetings, detailed reports are often prepared by the staff and these reports frequently involve the FRB/US model. Go to http://www.federalreserve.gov/fomc/transcripts/ for transcripts of FOMC meetings as well as the presentations of the senior staff to the FOMC. See Svensson and Tetlow (2005) for a related discussion.

It is

ultimately what is gleaned from the data that elicits changes in the model, changes in the stochastic shocks, and changes in policy rules. In summary, the FRB/US model archives show considerable change in equations and the data by vintage. The next section examines the extent to which these differences manifest themselves in different model properties. The following section then examines how these differences, together with their associated stochastic shock sets, imply different optimal monetary policy rules.

3. Model multipliers in real time and ex post

In this section, we consider the variation in real time of selected model multipliers. In most instances, we are interested in the response of unemployment, after eight quarters, to a given shock (although our first experiment is an exception to this rule). We choose unemployment as our response variable because it is one of the key real variables that the Fed has concerned itself with over the years; in principle, we could have used the output gap instead, but its definition has changed over time. The horizon of eight quarters is a typical one for exercises such as this as conducted at the Fed and other policy institutions. Except where otherwise noted, we hold the nominal federal funds rate at baseline for each of these experiments.

It is easiest to show the results graphically. But before turning to specific results, it is useful to outline how these figures are constructed and how they should be interpreted. In all cases, we show two lines. The black solid line is the real-time multiplier by vintage. Each point on the line represents the outcome of the same experiment, conducted on the model vintage of that date, using the baseline database at that point in history. So at each point shown by the black line, the model, its coefficients and the baseline all differ. The red dashed line shows what we call the ex post multiplier.
The ex post multiplier is computed using the most recent model vintage for each date; the only thing that changes for each point on the dashed red line is the initial conditions under which the experiment is conducted. Differences over time in the red line reveal the extent to which the model is nonlinear, because the multipliers for linear models are independent of initial conditions. Comparing the two allows us to identify one of the four sources of model uncertainty, the

baseline, that we described above.19

Figure 3.1: Sacrifice ratio by vintage

Now let us look at Figure 3.1, which shows the 5-year employment sacrifice ratio; that is, the cost, in terms of cumulative annualized forgone employment, that a one-percentage-point reduction in the inflation rate would entail after five years.20 When computed over a reasonably lengthy horizon such as this one, the sacrifice ratio is essentially a measure of the slope of the Phillips curve. Let us focus on the red dashed line first. It shows that for the November 2003 model, the sacrifice ratio is essentially constant over time. So if the model group were asked to assess the sacrifice ratio, or what the sacrifice ratio would have been in, say, February 1997, the answer based on the November 2003 model would be the same: about 4-1/4, meaning that it would take that many percentage-point-years of unemployment to bring down inflation by one percentage point.

19 Another way of examining the same thing would be to initiate each of the ex ante multiplier experiments at the same date in history and compare these with the black line in each figure. Such an experiment is not completely clean, however, because each model is only conformable with its own baseline database and these baselines have different conditions for every given date, as Figures 2.1 through 2.4 demonstrated. Nonetheless, the results of such an exercise are available from the corresponding author on request.

20 More precisely, the experiment is conducted by simulation, setting the target rate of inflation in a Taylor rule to one percentage point below its baseline level. The sacrifice ratio is the cumulative annualized change in the unemployment rate, undiscounted, relative to baseline, divided by the change in PCE inflation after 5 years. Other rules would produce different sacrifice ratios but the same profile over time.

Now,

however, look at the black solid line. Since each point on the line represents a different model, and the last point on the far right of the line is the November 2003 model, the red dashed line and the black solid line must meet at the right-hand side in this and all other figures in this section. But notice how much the real-time sacrifice ratio has changed over the 8-year period of study. Had the model builders been asked in February 1997 what the sacrifice ratio was, the answer based on the February 1997 model would have been about 2-1/4, or approximately half the November 2003 answer. The black line undulates a bit, but cutting through the wiggles, there is a general upward creep over time, and a fairly discrete jump in the sacrifice ratio in late 2001.21

The climb in the model sacrifice ratio is striking, particularly as it was incurred over such a short period of time among model vintages with substantial overlap in their estimation periods. One might be forgiven for thinking that this phenomenon is idiosyncratic to the model under study. On this, two facts should be noted. First, even if it were idiosyncratic, such a reaction misses the point. The point here is that this is the principal model that was used by the Fed staff and it was constructed with all due diligence to address the sort of questions asked here. Second, other work shows that this result is not a fluke.22 The history of the FRB/US model supports the belief that the slope of the Phillips curve lessened, much as in Atkeson and Ohanian (2001). At the same time, as we have already noted, the model builders did incorporate shifts in the NAIRU (and in potential output), but found that leaning exclusively on this one story for macroeconomic dynamics in the late 1990s was insufficient.
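The sacrifice ratio discussed above can be computed from a pair of simulated paths along the lines of footnote 20. The sketch below is only illustrative: the function name, the division by four to convert quarterly deviations into percentage-point-years, and all of the input paths are assumptions made for this example, not output from any FRB/US vintage.

```python
def sacrifice_ratio(u_sim, u_base, pi_sim, pi_base, quarters=20):
    """Cumulative, undiscounted unemployment cost per point of disinflation.

    Inputs are quarterly paths in percent. Quarterly deviations from baseline
    are summed and divided by four to express them in percentage-point-years
    (an assumed reading of "annualized"); the result is divided by the fall
    in inflation relative to baseline at the end of the horizon.
    """
    u_dev = [s - b for s, b in zip(u_sim[:quarters], u_base[:quarters])]
    point_years = sum(u_dev) / 4.0
    disinflation = pi_base[quarters - 1] - pi_sim[quarters - 1]
    return point_years / disinflation


# Made-up paths: unemployment runs 0.45 point above baseline for five years
# while inflation drifts down to one point below baseline.
u_base = [5.0] * 20
u_sim = [5.45] * 20
pi_base = [3.0] * 20
pi_sim = [3.0 - 0.05 * (q + 1) for q in range(20)]

ratio = sacrifice_ratio(u_sim, u_base, pi_sim, pi_base)  # roughly 2.25
```

Running the same calculation on each vintage's own model and baseline would trace out a real-time line, while running it on the latest model with each vintage's baseline would trace out an ex post line.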
Thus, the revealed view of the model builders contrasts with the idea advanced by Staiger, Stock and Watson (2001), among others, that changes in the Phillips curve are best accounted for entirely by shifts in the NAIRU.

21 The sizable jump in the sacrifice ratio in late 2001 is associated with a shift to estimating the model's principal wage and price equations simultaneously together with other equations to represent the rest of the economy, including a Taylor rule for policy. Among other things, this allowed expectations formation in wage and price setting decisions to reflect more recent Fed behavior than the full core VAR equations that are used in the rest of the model. See the Appendix for more details.

22 In particular, the same phenomenon occurs to varying degrees in simple single-equation Phillips curves of various specifications using both real-time and ex post data; see Tetlow (2005b). Roberts (2004) shows how greater discipline in monetary policy may have contributed to the reduction in economic volatility in the period since the Volcker disinflation. Cogley and Sargent (2004) use Bayesian techniques to estimate three Phillips curves and an aggregate supply curve simultaneously, asking why the Fed did not choose an inflation stabilizing policy before the Volcker disinflation. They too find time variation in the (reduced-form) output cost of disinflation. See, as well, Sargent, Williams and Zha (2005).

Figure 3.2 shows the funds-rate multiplier; that is, the increase in the unemployment rate after

eight quarters in response to a persistent 100-basis-point increase in the funds rate.

Figure 3.2: Funds rate multiplier by model vintage

This time, the red dashed line shows important time variation: the ex post funds rate multiplier varies with initial conditions; it is highest at a bit over 1 percentage point in late 2000, and lowest at the beginning and at the end of the period. The nonlinearity stems entirely from the specification of the model's stock market equation. In this vintage of the model, the equation is written in levels, rather than in logs, which makes the interest elasticity of aggregate demand an increasing function of the ratio of stock market wealth to total wealth. The mechanism is that an increase in the funds rate raises long-term bond rates, which in turn bring about a drop in stock market valuation operating through the arbitrage relationship between expected risk-adjusted bond and equity returns. The larger the stock market, the stronger the effect.23

The real-time multiplier, shown by the solid black line, is harder to characterize. Two observations stand out. The first is the sheer volatility of the multiplier.

23 The levels relationship of the stock market equation means that the wealth effect of the stock market on consumption can be measured in the familiar "cents per dollar" form (of incremental stock market wealth).

In a large-scale model such

as the FRB/US model, where the transmission of monetary policy operates through a number of channels, time variation in the interest elasticity of aggregate demand depends on a large variety of parameters. Second, the real-time multiplier is almost always lower than the ex post multiplier. The gap between the two is particularly marked in 2000, when the business cycle reached a peak, as did stock prices. At the time, concerns about possible stock market bubbles were rampant. One aspect of the debate between proponents and detractors of the active approach to stock market bubbles concerns the feasibility of policy prescriptions in a world of model uncertainty.24 And in fact, there were three increases in the federal funds rate during 2000, totalling 100 basis points.25 The considerable difference between the real-time and ex post multipliers during this period demonstrates the difficulty in carrying out historical analyses of the role of monetary policy; today's assessment of the strength of those monetary policy actions can differ substantially from what the staff thought at the time.

Figure 3.3 shows the government expenditure multiplier, the effect on the unemployment rate of a persistent increase in government spending of 1 percent of GDP. Noting that the sign on this multiplier is negative, one aspect of this figure is the same as the previous one: the real-time multiplier is nearly always smaller (in absolute terms) than the ex post multiplier. If we take the ex post multiplier as correct, this says that policy advice based on the real-time FRB/US estimates through recent history would have routinely understated the extent to which perturbations in fiscal policy would oblige an offsetting monetary policy response. Given that the period of study involved a substantial change in the stance of fiscal policy, this is an important observation.
A second aspect of the figure is the near-term reduction in the ex post multiplier, from about -0.9 in the 1990s, to about -0.75 in this decade.

24 The "active approach" to the presence of stock market bubbles argues that monetary policy should specifically respond to bubbles. See, e.g., Cecchetti et al. (2000). The passive approach argues that bubbles should affect monetary policy only insofar as they affect the forecast for inflation and possibly output. They should not be a special object of policy. See Bernanke and Gertler (1999, 2001).

25 The intended federal funds rate was raised 25 basis points on February 2, 2000, to 5-3/4 percent; by a further 25 basis points on March 21, and by 50 basis points on May 16, to 6-1/2 percent.

To summarize this section, real-time multipliers show substantial variation over time, and differ considerably from what one would say ex post the multipliers would be. Moreover, the discrepancies between the two multiplier concepts have often been large at critical junctures in recent economic

history. It follows that real-time model uncertainty is an important problem for policy makers. The next section quantifies this point by characterizing optimal policy, and its time variation, conditional on these model vintages.

Figure 3.3: Government expenditure multiplier by vintage

4. Monetary policy in real time

4.1. Optimized Taylor rules

One way to quantify the importance of model uncertainty for monetary policy is to examine how policy advice would differ depending on the model. A popular device for providing policy advice is the prescribed path for interest rates from simple monetary policy rules, like the rule proposed by Taylor (1993) and Henderson and McKibbin (1993). A straightforward way to do this is to compute optimized Taylor (1993) rules. Many central banks use simple rules of one sort or another in the assessment of monetary policy and for formulating policy advice. Because they react to only those variables that would be key in a wide set of models, simple rules are often claimed

to be robust to model misspecification. In addition, Giannone et al. (2005) show that the good fit of simple two-argument Taylor-type rules can be attributed to the small number of fundamental factors driving the U.S. economy; that is, the two arguments that appear in Taylor rules encompass all that one needs to know to summarize monetary policy in history. Thus, optimized Taylor rules would appear to be an ideal vehicle for study.

Formally, a Taylor rule is optimized by choosing the parameters of the rule, $\Phi = \{\alpha_Y, \alpha_\Pi\}$, to minimize a loss function subject to a given model, $x = f(\cdot)$, and a given set of stochastic shocks, $\Sigma$:

$$\min_{\Phi} \sum_{i=0}^{T} \beta^{i} \left[ (\pi_{t+i} - \pi^{*}_{t+i})^{2} + \lambda_{Y} (u_{t+i} - u^{*}_{t+i})^{2} + \lambda_{\Delta R} (\Delta r_{t+i})^{2} \right] \tag{2}$$

subject to:

$$x_t = f(x_t, \ldots, x_{t-j}, z_t, \ldots, z_{t-k}, r_t, \ldots, r_{t-m}) + v_t, \qquad j, k, m > 0 \tag{3}$$

and

$$r_t = rr^{*}_t + \tilde{\pi}_t + \alpha_Y (y_t - y^{*}_t) + \alpha_\Pi (\tilde{\pi}_t - \pi^{*}_t) \tag{4}$$

and

$$\Sigma_u = v'v \tag{5}$$

where $x$ is a vector of endogenous variables, and $z$ a vector of exogenous variables, both in logs, except for those variables measured in rates; $\pi$ is the inflation rate; $\tilde{\pi}_t = \sum_{i=0}^{3} \pi_{t-i}/4$ is the four-quarter moving average of inflation; $\pi^{*}$ is the target rate of inflation; $y$ is (the log of) output; $y^{*}$ is potential output; $u$ is the civilian unemployment rate; $u^{*}$ is the natural rate of unemployment; and $r$ is the federal funds rate.
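To make the notation concrete, the following sketch evaluates the policy rule in equation (4) and the loss in equation (2) for given paths. It is a minimal illustration under stated assumptions: the function names are ours, the discount factor `beta = 0.99` is an arbitrary placeholder (the text does not report one), the targets are treated as constants, and the first-period change in the funds rate is set to zero for lack of a prior value.

```python
def four_quarter_average(pi):
    """Four-quarter moving average of inflation, defined from the fourth
    observation onward."""
    return [sum(pi[t - 3:t + 1]) / 4.0 for t in range(3, len(pi))]


def taylor_rate(rr_star, pi_avg, gap, pi_target, alpha_y, alpha_pi):
    """Funds rate prescribed by a rule of the form in equation (4)."""
    return rr_star + pi_avg + alpha_y * gap + alpha_pi * (pi_avg - pi_target)


def loss(pi, pi_target, u, u_star, r, beta=0.99, lam_y=1.0, lam_dr=1.0):
    """Discounted quadratic loss of the form in equation (2), with constant
    targets and the first-period change in the funds rate assumed to be zero."""
    total = 0.0
    for i in range(len(pi)):
        dr = r[i] - r[i - 1] if i > 0 else 0.0
        total += beta ** i * ((pi[i] - pi_target) ** 2
                              + lam_y * (u[i] - u_star) ** 2
                              + lam_dr * dr ** 2)
    return total


# With inflation at target and a closed output gap, the rule returns the
# equilibrium real rate plus inflation.
neutral_rate = taylor_rate(rr_star=2.0, pi_avg=2.0, gap=0.0,
                           pi_target=2.0, alpha_y=0.5, alpha_pi=0.5)
```

Setting `lam_y = lam_dr = 1.0` corresponds to equal weights on the three arguments of the loss function.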
Trivially, it is true that: $\pi, \pi^{*}, u, u^{*}, y^{*}, \Delta r \in x$.26

26 The intercept used in the model's Taylor rule, designated $rr^{*}$, is a medium-term proxy for the equilibrium real interest rate. It is an endogenous variable in the model. In particular, $rr^{*}_t = (1-\gamma)\, rr^{*}_{t-1} + \gamma (r_t - \pi_t)$, where $r$ is the federal funds rate, and $\gamma = 0.05$. As a robustness check, we experimented with adding a constant in the optimized rules in addition to $rr^{*}$ and found that this term was virtually zero for every model vintage. Note that relative to the classic version of the Taylor rule where $rr^{*}$ is fixed, this alteration biases results in favor of good performance by this class of rules.

In principle, the loss function, (2), could have been derived as the quadratic approximation to the true social

welfare function for the FRB/US model. However, doing so is technically infeasible for a model the size of FRB/US. That said, with the possible exception of the term penalizing the change in the federal funds rate, the arguments to (2) are standard. The penalty on the change in the funds rate may be thought of as representing either a hedge against model uncertainty in order to reduce the likelihood of the fed funds rate entering ranges beyond those for which the model was estimated, or as a pure preference of the Committee. Whatever the reason for its presence, the literature confirms that some penalty is needed to explain the historical persistence of monetary policy; see, e.g., Sack and Wieland (2000) and Rudebusch (2001).

The optimal coefficients of a given rule are a function of the model's stochastic shocks, as equation (5) indicates.27 The optimized coefficient on the output gap, for example, represents not only the fact that unemployment-rate stabilization (and hence, indirectly, output-gap stabilization) is an objective of monetary policy, but also that in economies where demand shocks play a significant role, the output gap will statistically lead changes in inflation in the data; so the output gap will appear because of its role in forecasting future inflation. However, if the shocks for which the rule is optimized turn out not to be representative of those that the economy will ultimately bear, performance will suffer. As we shall see, this dependence will turn out to be significant for our results.28

Solving a problem like this is easily done for linear models. However, FRB/US is a non-linear model. We therefore compute the optimized rule by stochastic simulation. Specifically, each vintage of the model is subjected to bootstrapped shocks from its stochastic shock archive.
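The general idea of optimizing a simple rule by stochastic simulation can be sketched as follows. Everything here is a stand-in chosen for illustration: the two-equation backward-looking model, its parameters, the synthetic "archived" residuals, and the small numbers of draws and periods are all assumptions. The output gap replaces the unemployment gap in the loss, contemporaneous inflation replaces the four-quarter average in the rule, and a coarse grid search stands in for a simplex search purely to keep the example free of dependencies.

```python
import random


def simulate_loss(alpha_y, alpha_pi, shocks, pi_target=2.0, rr_star=2.0,
                  rho=0.8, sigma=0.2, kappa=0.1, beta=0.99):
    """Discounted loss along one shock path in a toy backward-looking model
    (an assumption, not FRB/US), with equal weights on the three terms."""
    gap, pi = 0.0, pi_target
    r_prev = rr_star + pi_target
    total = 0.0
    for i, (e_demand, e_supply) in enumerate(shocks):
        # Demand responds to last period's real-rate gap; inflation is
        # accelerationist in last period's output gap.
        new_gap = rho * gap - sigma * (r_prev - pi - rr_star) + e_demand
        new_pi = pi + kappa * gap + e_supply
        # Taylor-type rule with feedback on the gap and on inflation.
        r = rr_star + new_pi + alpha_y * new_gap + alpha_pi * (new_pi - pi_target)
        total += beta ** i * ((new_pi - pi_target) ** 2 + new_gap ** 2
                              + (r - r_prev) ** 2)
        gap, pi, r_prev = new_gap, new_pi, r
    return total


def bootstrap_paths(residuals, n_draws, n_periods, rng):
    """Resample whole residual rows with replacement, which preserves the
    contemporaneous correlation across equations."""
    return [[rng.choice(residuals) for _ in range(n_periods)]
            for _ in range(n_draws)]


def expected_loss(alpha_y, alpha_pi, paths):
    """Average loss across bootstrapped paths for one candidate rule."""
    return sum(simulate_loss(alpha_y, alpha_pi, p) for p in paths) / len(paths)


def optimize_rule(paths, grid):
    """Pick the grid point with the lowest average loss. The same paths are
    reused for every candidate so that candidates are compared like for like."""
    candidates = [(a_y, a_pi) for a_y in grid for a_pi in grid]
    return min(candidates, key=lambda c: expected_loss(c[0], c[1], paths))


rng = random.Random(0)
# Synthetic stand-in for a vintage's archived (demand, supply) residuals.
residuals = [(rng.gauss(0.0, 0.5), rng.gauss(0.0, 0.3)) for _ in range(120)]
paths = bootstrap_paths(residuals, n_draws=50, n_periods=40, rng=rng)
grid = [0.0, 0.25, 0.5, 0.75, 1.0, 1.5]
best_alpha_y, best_alpha_pi = optimize_rule(paths, grid)
```

Repeating this with a different residual archive changes the optimized coefficients, which is the dependence on the shock set that the text emphasizes.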
Historical shocks from the estimation period of the key behavioral equations are drawn.29 In all, 400 draws of 80 periods each are used for each vintage to evaluate candidate parameterizations, with a simplex method used to determine the search direction.

27 Our rules will be optimal in the class of Taylor-type rules of the form in equation (4), conditional on the stochastic shock set, (5), under anticipated utility as defined by Kreps (1996).

28 The fact that the policy rule depends on the variance-covariance matrix of stochastic shocks means that the rule is not certainty equivalent. This is the case for two reasons. One is the non-linearity of the model. The other is the fact that the rule is a simple one: it does not include all the states of the model.

29 The number of shocks used for stochastic simulations has varied with the vintage, and generally has grown. For the first vintage, 43 shocks were used, while for the November 2003 vintage, 75 were used.

The target rate of inflation is taken to be two percent as measured by the annualized rate of change of the personal consumption expenditure

price index.30 This is obviously a very computationally intensive exercise and so we are limited in the range of preferences we can investigate. Accordingly, we discuss only results for one set of preferences: equal weights on output, inflation and the change in the federal funds rate in the loss function. The choice is arbitrary but does have the virtue of matching the preferences that have been used in policy optimization experiments carried out for the FOMC; see Svensson and Tetlow (2005).

The results of this exercise can be summarized graphically. In Figure 4.1, the green solid line is the optimized coefficient on inflation, $\alpha_\Pi$, while the blue dashed line is the feedback coefficient on the output gap, $\alpha_Y$. The response to inflation is universally low, never reaching the 0.5 of the traditional Taylor (1993) rule.31 By and large, there is relatively little time variation in the inflation response coefficient. The output gap coefficient is another story. It too starts out low with the first vintage in July 1996 at about 0.2, but then rises almost steadily thereafter, reaching a peak of nearly 1 with the last vintage in November 2003. There is also a sharp jump in the gap coefficient over the first two quarters of 2001. One might be tempted to think that this is related to the jump in the sacrifice ratio, shown in Figure 3.1. In fact, the increase in the optimized gap coefficient precedes the jump in the sacrifice ratio.
The increase in the gap coefficient coincided with the inclusion of a new investment block in the model, which in conjunction with changes to the supply block, tightened the relationship between supply-side disturbances and subsequent effects on aggregate demand, particularly over the longer term.32 The new investment block, in turn, was driven by two factors: the addition by the Bureau of Economic Analysis a year earlier of software in the definition of equipment spending and the capital stock, and an associated new appreciation on the part of the staff of the importance of the ongoing productivity and investment boom. In any case, while the upward jump in the gap coefficient stands out, it bears recognizing that the rise in the gap coefficient was a continual process.

30 For these experiments any reasonable target will suffice since the stochastic simulations effectively randomize over initial conditions.

31 That said, the measure of inflation differs here. In keeping with the tradition of inflation targeting countries, we use the rate of change in the PCE price index as the inflation rate of interest. Taylor (1993) used the GDP price deflator.

32 In essence, the linkage between a disturbance to total factor productivity and the desired capital stock in the future was clarified and strengthened so that an increase in TFP that may produce excess supply in the very short run can be expected to produce an investment-led period of excess demand later on.

Further

discussion of some of the forces behind model changes can be found in the appendix.

Figure 4.1: Optimized Taylor rule coefficients by model vintage

Before leaving this subsection it is worth noting that similar results were obtained for Taylor rules that are extended to allow for a lagged endogenous variable as a third optimized coefficient. In particular, the coefficient on the lagged fed funds rate was about 0.2 regardless of the vintage, and the coefficients on inflation and the output gap were slightly lower than in Figure 4.1, about enough to result in the same long-run elasticity.33

4.2. Ex post optimal policies

We have tried to emphasize the four ingredients of model uncertainty in the real-time context: the model itself, the baseline, the policy rule, and the stochastic shocks. We also noted that these ingredients are jointly determined; in particular, Figure 4.1 showed rule coefficients that were optimal given the shocks as measured by each vintage's bootstrapped residuals.

33 This result is consistent with the finding of Rudebusch (2001) for the Rudebusch-Svensson model, but differs from that of Williams (2003) for a linearized rational expectations version of the FRB/US model. The reason is that without rational expectations, the efficacy of "promising" future settings of the funds rate through instrument smoothing is impaired.

But these shocks

are themselves conditional on model design and specification decisions that were taken by the model builders. So uncertainty about the shocks one might face is also an issue, and indeed the ex post assessment of these shocks can be a driver of model respecifications. At the same time, as much as model properties and optimal policies have changed, the performance of the U.S. economy during this period was remarkably good. Four-quarter PCE inflation averaged 1-3/4 percent from 1996:Q3 to 2003:Q4 while the unemployment rate averaged 4.9 percent, according to the latest data.

In this subsection, we investigate the role of the shock sets in the determination of the results in Figure 4.1. In doing so, we also (indirectly) explore one possible reason for the extraordinarily good performance of the U.S. economy during this period, namely that the FOMC may have understood the shocks as they occurred in real time better than the model could have.

To do this, we reconsider the optimized Taylor rules of Figure 4.1, but assume this time that the Fed knows in advance the precise sequence of shocks as they occurred. So whereas the coefficients in Figure 4.1 were chosen to minimize the loss function, (2), over bootstrapped draws of the residuals, here we use just the one sequence of draws that was actually experienced. In this way, Figure 4.1 can be thought of as the ex ante optimal coefficients, so called because those coefficients are optimal given that the Fed does not know the precise sequence of shocks, and here we will look at ex post optimal coefficients.

Obviously, the idea of an ex post optimal rule is an artificial concept. It assumes information that no one could have. Moreover, if one did have such information (and knew the model with certainty as well), it would not be reasonable to restrict oneself to a simple rule like the Taylor rule. Instead, one would choose precise values of the funds rate, period by period, to minimize the loss function. Our goal here is diagnostic, not prescriptive.
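The distinction between the two sets of coefficients can be written compactly. The notation below is introduced only for illustration and does not appear in the original equations: $L(\Phi; v)$ stands for the loss (2) evaluated along a shock sequence $v$, $v^{(n)}$ for the $n$-th of $N$ bootstrapped draws, and $v^{\text{hist}}$ for the one sequence actually experienced. Treating the ex ante criterion as a simple average across draws is an assumption of this sketch.

```latex
\Phi^{\text{ex ante}} = \arg\min_{\Phi} \; \frac{1}{N} \sum_{n=1}^{N} L\bigl(\Phi;\, v^{(n)}\bigr),
\qquad
\Phi^{\text{ex post}} = \arg\min_{\Phi} \; L\bigl(\Phi;\, v^{\text{hist}}\bigr)
```

Since both problems search over the same class of rules, $L(\Phi^{\text{ex post}}; v^{\text{hist}}) \le L(\Phi^{\text{ex ante}}; v^{\text{hist}})$ by construction, and the size of that gap is one way to gauge what advance knowledge of the shocks would be worth.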
We are attempting to illustrate and later quantify the benefits of better real-time information. Later on, we shall look at the other side of the coin by examining the costs of the hubris of believing too much.

Before we look at the results, it is worth noting that since the ex post optimal rules are conditional on just a single "draw" of shocks, they will tend to be sensitive to relatively small changes in specification or shocks and will vary a great deal from vintage to vintage. For that reason, our

comparisons with the ex ante optimal rules will be broad brush.

The results are shown in Figure 4.2, which can be compared with those in Figure 4.1. It is worthwhile to divide the results into two parts, demarcated by vintage: the 1990s and the new century. Volatility aside, in the 1990s the ex post optimal output-gap coefficients are mostly lower, and the inflation coefficients are mostly higher, than their ex ante counterparts. Smoothing through the wiggles, the ex post policy prescription for the late 1990s is almost one of pure inflation targeting; that is, without feedback on the output gap. We already emphasized that the late 1990s was a period dominated by persistent productivity shocks. One effect of a spate of productivity shocks, more persistent and larger than in the historical data, is to confound the usual lead-lag relationship between output fluctuations and movements in inflation, because productivity affects unit labor costs and then inflation without the necessity of changing the output gap. In these circumstances, stabilizing output becomes a less complementary device to the goal of controlling inflation than otherwise would be the case. The appropriate policy response in this instance is to focus more directly on controlling inflation and de-emphasize output stabilization. This prescription echoes that of Orphanides (2001), but operates through a different channel. Whereas Orphanides (2001) was interested in the effect of mismeasurement of the output gap, we emphasize uncertainty in stochastic shocks.

The situation in the new century is quite different. By this time, the high-tech bubble had burst and the stock market swooned (both traditional demand-side phenomena), and so the ex ante and ex post optimal coefficients look quite similar. The ex post shocks were representative of the "normal" pattern of shocks.

Also of interest given the recent literature on the subject is the response to the output gap.
Figure 4.3 shows the real-time output gap coefficients for the ex ante optimal coefficients (the green solid line) and the ex post optimal (the blue dashed line). In broad terms, the two lines share some features. Both are low (on average) in the early period; both climb steeply at the turn of the century, and both continue to climb thereafter, albeit more slowly. But there are interesting differences as well, with 1999 being a particularly noteworthy period. This was a period where

Figure 4.2: Ex post optimal Taylor rule coe¢ cients by vintage critics of the Fed argued that policy was too easy. The context was the three 25-basis-point cuts in thefundsrateundertakenin1998inresponsetotheAsiacrisisandtheRussiandebtdefault. Atthe time, a sharp increase in investor perceptions of risk coupled with deterioration in global (cid:133)nancial conditions raised fears of an imminent global credit crunch, concerns that played an important role in Fed decision making. By 1999, however, these factors had abated and so the FOMC starting "taking back" the previous decreases. To some, including Cecchetti et al. (2000) the easier stance undertaken in late 1998 and into 1999 exacerbated the speculative stock market boom of that time and may have ampli(cid:133)ed the ensuing recession. The ex post optimal feedback on the output gap, shown by the blue dashed line, was volatile. For the 1999 models, and given the particular shocks over the period shown in the picture, the optimal response to the gap was zero; but within months, it rose to about 0.4. In contrast, the ex ante optimal coe¢ cients were essentially unchanged over thesameperiod,aswerethemoreimportantmultipliers,whichindicatesthatchangesintheshocks were critical. Given that the shock sets in 1999 and 2000 overlap, this is a noteworthy change. To 27

Figure 4.3: Comparison of output gap coe¢ cients us, the important point to take from this is not the proper stance of policy at that point in history, but rather that it is so dependent on seemingly small changes. Our analysis also hints at some advantages of discretion: the willingness to respond to the speci(cid:133)c shocks of the day(cid:150)if one is able to discern them. We shall have more to say about this a bit later. 5. Performance To this point, we have compared model properties and the policies that those properties prescribe but have had nothing directly to say about performance. This section (cid:133)lls this void. In the (cid:133)rst subsection, we investigate how useful prior information about the sequence of shocks might be for policy and hence welfare. Speci(cid:133)cally, we conduct counterfactual experiments on the single sequence of shocks immediately preceding each model vintage. Thus, this subsection is the performance counterpart to the design subsection of optimal ex post policies. It tells us the bene(cid:133)t of being right about the shock sequence underlying the ex post optimal policy. Then in subsection 28

5.2 , we consider the performance, on average of the model economies under stochastic simulation. The exercise in subsection 5.2 is a counterpart to the ex ante optimized policy rules in Figure 4.1. Among other things, it will tell us about the cost of being wrong in our beliefs about knowledge of the shocks. 5.1. Performance in retrospect: counterfactual experiments Iftheex post optimalrulereallywouldhavebeenoptimalforeachvintageofthemodel(cid:150)conditional, of course, on that model(cid:150)how much better would it have been than, say, the ex ante optimal rule? In other words, how valuable is that kind of information for the design of policy? We answer this questionwithacounterfactualsimulationonselectedmodelvintages. Tofacilitatecomparisonwith the next subsection and still keep the size of the problem manageable, we restrict our attention to just two of our 30 model vintages, the February 1997 and the November 2003 vintages. These were chosen because they were far apart in time, thereby re(cid:135)ecting as di⁄erent views of the world as this environment allows, and because their properties are the most di⁄erent of any in the set. In particular, the February 1997 model has the lowest sacri(cid:133)ce ratio of all vintages considered, and the November 2003 model has the highest. It follows that these two models should more-or-less encompass the results of other vintages. The details of our simulation are straightforward: each simulation is initialized with the conditions as of 20 years and two quarters before the vintage itself, as measured by that model vintage and ends two quarters before the vintage date. The Fed controls the funds rate with the policy rule in question. The model is subjected to those shocks that the economy bore over the period, as measured by the relevant model vintage. 
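The mechanics of such an experiment can be illustrated with a minimal sketch in Python. It is emphatically not FRB/US: the two-equation backward-looking model, its coefficients (`kappa`, `rho`, `sigma`), the neutral real rate `R_STAR`, the form of the Taylor-type rule, the equal-weight quadratic loss and the shock sequence are all assumptions made up for illustration. Only the two-percent inflation target and the rule coefficients (taken from the February 1997 panel of Table 2) come from the paper. The point is simply that one fixed sequence of shocks is fed through the model under each candidate rule, and the resulting losses are compared.

```python
from typing import Sequence, Tuple

PI_STAR = 2.0  # inflation target in percent, as stated in the text
R_STAR = 2.5   # neutral real funds rate in percent: an assumption


def counterfactual_loss(alpha_pi: float, alpha_y: float,
                        shocks: Sequence[Tuple[float, float]]) -> float:
    """Cumulate a quadratic loss for one fixed sequence of shocks.

    Toy model (hypothetical, for illustration only):
        inflation_t = inflation_{t-1} + kappa * gap_{t-1} + e_pi_t
        gap_t       = rho * gap_{t-1} - sigma * (real rate gap)_{t-1} + e_y_t
    Assumed rule:
        funds = R_STAR + inflation + alpha_pi * (inflation - PI_STAR)
                + alpha_y * gap
    """
    kappa, rho, sigma = 0.1, 0.8, 0.2  # hypothetical coefficients
    inflation, gap = PI_STAR, 0.0      # start at target with a closed gap
    funds_prev = R_STAR + inflation    # value of the rule at the starting point
    loss = 0.0
    for e_pi, e_y in shocks:
        real_rate_gap = funds_prev - inflation - R_STAR
        new_gap = rho * gap - sigma * real_rate_gap + e_y
        new_inflation = inflation + kappa * gap + e_pi
        inflation, gap = new_inflation, new_gap
        funds = (R_STAR + inflation
                 + alpha_pi * (inflation - PI_STAR) + alpha_y * gap)
        # Assumed loss: equal weights on the three squared terms.
        loss += ((inflation - PI_STAR) ** 2 + gap ** 2
                 + (funds - funds_prev) ** 2)
        funds_prev = funds
    return loss


# One made-up sequence of (inflation shock, demand shock), reused for every rule.
shocks = [(0.5, 0.0), (0.0, -0.5), (0.3, 0.2), (0.0, 0.0)] * 5

# (alpha_pi, alpha_y) pairs borrowed from the February 1997 panel of Table 2.
rules = {"Ex post optimal": (0.94, 0.33),
         "Ex ante optimal": (0.18, 0.25),
         "Taylor rule": (0.50, 0.50)}
for name, (a_pi, a_y) in rules.items():
    print(f"{name:16s} loss = {counterfactual_loss(a_pi, a_y, shocks):.2f}")
```

Because the model here is a toy, the losses it prints say nothing about the rankings reported in the paper; the sketch only shows how a single shock history can be replayed under alternative rule coefficients.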
The loss in each instance is measured using the same loss function as in the optimization exercises, equation (2), and, as before, the target rate of inflation is set to two percent.34 The losses are then normalized such that the historical path represents a loss of unity. All other losses can be interpreted in terms of percentage deviations from the baseline loss. The results are shown in Table 2 below.

34 In a discussion of inflation targeting at the FOMC meeting in July 1996–the same date as our first model vintage–most members of the FOMC appeared to have agreed that 2 percent would have been a reasonable target rate of inflation. The thorny issues of settling on a particular index and the assessment of, and correction for, measurement error in price indexes remained unsettled. See http://www.federalreserve.gov/fomc/transcripts/1996/19960703Meeting.PDF, especially at pp. 63-65.

Let us focus for the time being on the left-hand panel with the results for the February 1997 model. To aid in the interpretation of the results, the policy rule's coefficients are shown, where applicable. According to the model, the ex post optimal policy would have been superior to the historical policy. This is perhaps not all that surprising, since the ex post optimal policy has the benefit of "seeing" the shocks before they occur, although this advantage is mitigated by the constraint–not faced by the Fed–that the ex post optimal policy responds only to the output gap and the inflation rate. For this vintage, knowing the shocks turns out to be very useful indeed: the ex post policy does almost twice as well as the historical policy.35 However, the traditional Taylor rule also outperforms the historical policy. By contrast, the ex ante optimal policy does a fair amount worse. What both the Taylor rule and the ex post optimal policy share is stronger responses in general, and to inflation in particular, than the ex ante optimal policy. Evidently, the average sequence of shocks that conditions the ex ante optimal policy was less inflationary than the actual sequence.

The right-hand panel shows the results for the November 2003 vintage of the model. Here the results are much different, and surprising.

Table 2
Normalized model performance in counterfactual simulation*

                     February 1997 vintage      November 2003 vintage
                     α_π     α_y     L          α_π     α_y     L
Historical policy    -       -       1          -       -       1
Ex post optimal      0.94    0.33    0.56       0.78    1.31    2.25
Ex ante optimal      0.18    0.25    1.80       0.30    1.07    4.17
Taylor rule          0.50    0.50    0.74       0.50    0.50    10.79

* Selected rules and model vintages. Using the estimated shocks over 20 years.
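Since the historical policy's loss is normalized to unity, each entry in the L columns of Table 2 converts directly into a percentage deviation from the baseline, and the ratio of two entries gives the relative performance of two rules. The short Python sketch below does that arithmetic on the numbers in Table 2; the helper names `pct_deviation` and `relative_performance` are ours, introduced only for illustration.

```python
# Normalized losses from Table 2 (historical policy = 1 by construction).
table2 = {
    "February 1997": {"Historical policy": 1.0, "Ex post optimal": 0.56,
                      "Ex ante optimal": 1.80, "Taylor rule": 0.74},
    "November 2003": {"Historical policy": 1.0, "Ex post optimal": 2.25,
                      "Ex ante optimal": 4.17, "Taylor rule": 10.79},
}


def pct_deviation(normalized_loss: float) -> float:
    """Percentage deviation of a loss from the baseline loss of unity."""
    return 100.0 * (normalized_loss - 1.0)


def relative_performance(loss_a: float, loss_b: float) -> float:
    """How many times larger loss_b is than loss_a."""
    return loss_b / loss_a


for vintage, losses in table2.items():
    for rule, loss in losses.items():
        print(f"{vintage:14s} {rule:18s} {pct_deviation(loss):+6.0f}%")

# "Almost twice as well": historical loss relative to the ex post optimal loss.
feb97 = table2["February 1997"]
print(round(relative_performance(feb97["Ex post optimal"],
                                 feb97["Historical policy"]), 2))
```

For the February 1997 vintage, for example, a normalized loss of 0.56 is a loss 44 percent below the baseline, so the historical loss is roughly 1.8 times the ex post optimal loss.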
The historical policy is substantially better than any of the alternative candidates. The fact that knowledge of the shocks is an insufficient advantage to design an effective Taylor rule suggests that responding to just two variables is not enough for the shocks borne during this period. If the best two coefficients of the ex post optimal policy were less than ideal, the basic Taylor rule and the ex ante policy should do worse, and indeed they do: much worse. The lower the feedback on the output gap in these scenarios, the poorer the performance. With a bit of reflection, the reasons for this should not be surprising: the shocks during this period included shocks to the growth rate of potential output, as outlined in Figure 2.2 above. Such shocks manifest themselves in more variables than just the output gap and inflation. Indeed, the short-run impact of an increase in productivity is to reduce inflation and raise output, leading to offsetting effects on policy. However, as time goes by, the higher growth rate of productivity raises the desired capital stock, thereby increasing the equilibrium real interest rate. The Taylor rule and its cousins are ill designed to handle such phenomena.

35 That said, as we noted before, the performance comparison assumes preferences that may not match the FOMC's preferences, although they are arguably very reasonable preferences.

5.2. Performance on average: stochastic simulations

Another way that we can assess candidate policies is by conducting stochastic simulations of the various model vintages under the control of the candidate rules and evaluating the loss function. We do this here. We subject both of these models to the same set of stochastic shocks as in the ex ante optimization exercise. Under these circumstances, the ex ante optimal rule must perform the best. Accordingly, in this case, we normalize the loss under the ex ante optimal policy to unity. The results are shown in Table 3. For the moment, let us focus on the left-hand panel, with the results for the February 1997 model; once again, we show the coefficients of the candidate rules for easy reference.

Table 3
Normalized model performance under stochastic simulation*

                     February 1997 vintage      November 2003 vintage
                     α_π     α_y     L          α_π     α_y     L
Ex ante optimal      0.18    0.25    1          0.30    1.07    1
Ex post optimal      0.94    0.33    1.76       0.78    1.31    4.19
Taylor rule          0.50    0.50    1.33       0.50    0.50    1.49

* Selected rules and model vintages. 400 draws of 80 periods each.
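The following Python sketch illustrates the kind of calculation that lies behind a table like Table 3: shocks are redrawn at random from a fixed pool, each draw is simulated under every candidate rule, and the average losses are normalized by the best rule's loss. Only the 400 draws of 80 periods, the two-percent target and the rule coefficients come from the paper. The toy model, its coefficients, the equal-weight loss, the artificial residual pool and the choice to resample inflation and demand shocks jointly are all assumptions for illustration, not a description of FRB/US or of the authors' procedure.

```python
import random
from statistics import mean

PI_STAR, R_STAR = 2.0, 2.5  # target from the text; neutral rate is assumed


def loss_for_draw(alpha_pi, alpha_y, shocks):
    """Quadratic loss of a toy backward-looking model for one shock draw."""
    kappa, rho, sigma = 0.1, 0.8, 0.2  # hypothetical coefficients
    inflation, gap = PI_STAR, 0.0
    funds_prev = R_STAR + inflation
    loss = 0.0
    for e_pi, e_y in shocks:
        new_gap = rho * gap - sigma * (funds_prev - inflation - R_STAR) + e_y
        new_inflation = inflation + kappa * gap + e_pi
        inflation, gap = new_inflation, new_gap
        funds = (R_STAR + inflation
                 + alpha_pi * (inflation - PI_STAR) + alpha_y * gap)
        loss += ((inflation - PI_STAR) ** 2 + gap ** 2
                 + (funds - funds_prev) ** 2)
        funds_prev = funds
    return loss


def average_loss(alpha_pi, alpha_y, residual_pool,
                 n_draws=400, horizon=80, seed=0):
    """Average loss over bootstrap draws taken from one pool of residuals."""
    rng = random.Random(seed)  # same seed, so every rule faces the same draws
    losses = []
    for _ in range(n_draws):
        draw = [rng.choice(residual_pool) for _ in range(horizon)]
        losses.append(loss_for_draw(alpha_pi, alpha_y, draw))
    return mean(losses)


# Artificial residual pool standing in for estimated model residuals.
pool_rng = random.Random(1)
residual_pool = [(pool_rng.gauss(0.0, 0.3), pool_rng.gauss(0.0, 0.5))
                 for _ in range(120)]

# (alpha_pi, alpha_y) pairs from the February 1997 panel of Table 3.
rules = {"Ex ante optimal": (0.18, 0.25),
         "Ex post optimal": (0.94, 0.33),
         "Taylor rule": (0.50, 0.50)}
avg = {name: average_loss(a, b, residual_pool) for name, (a, b) in rules.items()}

# In the toy model nothing guarantees which rule is best, so normalize by
# the smallest average loss rather than by a named rule.
best = min(avg.values())
normalized = {name: value / best for name, value in avg.items()}
for name, value in normalized.items():
    print(f"{name:16s} {value:.2f}")
```

Holding the seed fixed across rules means that differences in average loss reflect the rule coefficients rather than sampling noise in the shock draws.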
The ex ante optimal coefficients are both low, at about 0.2. The ex post optimal coefficients are higher, particularly for inflation. However, the table shows that applying the policy that was optimal for the particular sequence of shocks to the average sequence, selected from the same set of shocks, would have been somewhat injurious to policy performance, with a loss that is 76 percent higher. The Taylor rule prescribes stronger feedback on output, but weaker feedback on inflation, than the ex post optimal policy. The fact that the loss under the Taylor rule is approximately midway between that of the ex ante and ex post rules suggests that it is the response to inflation that is the key to performance for this model vintage and the corresponding shock set. Still, in broad terms, none of the rules considered here performs too badly for this vintage.

The results for the November 2003 vintage, shown in the right-hand panel, are in some ways more interesting. Recall that in Table 2 we showed that the ex post optimal rule performed approximately twice as well as the ex ante optimal rule for the particular sequence of shocks studied. Here it is shown that this same ex post optimal rule–that is, the rule that is optimal for the specific shocks in the particular order of the period immediately before the vintage–performs very poorly for the same shocks on average. The reasons are clear from our prior examinations. The period ending in mid-2003 contained a number of important, correlated shocks; namely, the productivity boom and the stock market boom. The episodic nature of these disturbances makes them special. With knowledge of these shocks, including the order of their arrival, a policy–even a policy constrained to respond to just two objects, inflation and the output gap–can be devised to do a reasonable job. But with randomization over these shocks, so that one knows their nature but not the specific order, the best policy is very different.
This tells us about the cost of hubris: a policy maker who thinks he knows a lot about the economy, and acts on that belief, may pay a substantial price if the world turns out to be different than he expected. This impression is amplified by the Taylor rule, which shows performance that, while inferior to that of the ex ante optimal rule–as it must be–is not too bad.

One might wonder why the November 2003 model is so much more sensitive to policy settings than the February 1997 model. Earlier, we noted that performance in general is jointly determined by initial conditions (that is, the baseline), the stochastic shocks, the model and the policy rule. All of these factors are in play in these results. However, as we indicated in the previous subsection, the nature of the shocks is an important factor. The shocks for the February 1997 model come from the relatively placid period of the late 1960s to the mid-1990s, whereas the shocks to the November 2003 model contain the disturbances from the mid-1990s. We tested the importance of these shocks by repeating the experiment in this subsection using the November 2003 model but restricting the shocks to the same range used for the February 1997 vintage. Performance was markedly better regardless of the policy rule. Moreover, there was less variation in performance across policy rule specifications. Since, however, the stochastic shocks come from the same data that render the model respecifications, this just emphasizes the importance of model uncertainty in general, and designing monetary policy to respond to seemingly unusual events in particular.

5.3. Discussion

In two important papers, Levin et al. (1999, 2003) lay out a case for judiciously parameterized simple rules as hedges against model uncertainty. In particular, they identify persistence in policy setting–a Taylor rule with a lagged fed funds rate term bearing a coefficient of unity–as being the key for robustness. At the end of subsection 4.1, we noted that none of our vintages favored a large coefficient on the lagged fed funds rate. This is surprising, and particularly so since one of the models that Levin et al. (1999, 2003) used in their rival models analysis was a version of the FRB/US model. What explains the apparent contradiction? The answer is rational expectations. All four of the models that they used were linear rational expectations models where future output gaps are a key determinant of inflation. An implication of this is that missettings of the current-period funds rate have few implications for overall economic performance so long as private agents believe the policy rule guarantees a unique stable rational expectations equilibrium.36 The VAR-based expectations models used in this paper are not so forgiving. Policy errors today imply a train of events in the future that must be countered with future policy settings. The self-correcting properties in rational expectations models of agents' beliefs are not operational. We would argue that, given the premise of the literature–that is, that policy makers do not understand the model they are attempting to stabilize–the efficacy of maintaining the rational expectations assumption for private agents is open to question.

36 The evidence for this is contained in Levin and Williams (2003), where it is shown that policy makers face a more difficult choice in finding a robust policy if one of the rival models is a linear rational expectations model and another is a "backward-looking" model. Cogley and Sargent (2004) point to a similar issue in their work explaining the runaway inflation of the 1970s.

6. Concluding remarks

This paper has provided the first examination of real-time model uncertainty, and has done so using the archive of vintages of the FRB/US model of the macro economy since the model's inception as the Board of Governors' macroeconometric model in 1996. We examined how the model properties have changed over time and how the optimal policies for those vintages have changed alongside. We found that the time variation in model properties is surprisingly substantial. Surprising because the period under study, at eight years, is short; substantial because the differences in model properties over time imply large differences in optimized policy coefficients.

We also compared different policies by model vintage, doing so in two different ways. In one rendition, we compared policies conditional on bootstrapped model residuals; in the other, we conducted counterfactual simulations examining performance over approximately the same period over which the model vintage was estimated. Besides finding that our optimized rules differ by vintage, we also found that plausible alternatives to the optimized policy result in significant incremental losses.

Our results suggest that policy makers and researchers should not be sanguine about simple policy rules. The kind of rules promulgated by Levin et al. (1999) do not work very well in these models, the models used by the Fed staff to help inform FOMC members in their policy deliberations. We also found that knowledge, in real time, of the disturbances the economy is bearing can, under some circumstances, be critical for good policy. In the late 1990s, a time when it is generally agreed that policy was very good, there was no Taylor rule parameterization that performed particularly well.
While the subject warrants further study, the findings to date point in the direction of discretionary policy, with considerable attention to discerning the nature of shocks in real time, as an alternative, or complement, to the use of policy rules as guides for policy.

References

Anderson, Richard G. and Kevin L. Kliesen (2005) "Productivity measurement and monetary policymaking during the 1990s" Federal Reserve Bank of St. Louis working paper no. 2005-067A (October).
Atkeson, Andrew and Lee E. Ohanian (2001) "Are Phillips Curves Useful for Forecasting?" Federal Reserve Bank of Minneapolis Quarterly Review, 25(1): 2-11.
Bernanke, Ben S. and Mark Gertler (1999) "Monetary Policy and Asset Price Volatility" in New Challenges for Monetary Policy (Kansas City: Federal Reserve Bank of Kansas City): 77-128.
Bernanke, Ben S. and Mark Gertler (2001) "Should Central Banks Respond to Movements in Asset Prices?" American Economic Review, 91: 253-257.
Brainard, William (1967) "Uncertainty and the Effectiveness of Monetary Policy" American Economic Review, 57: 411-425.
Brayton, Flint and Peter Tinsley (eds.) (1996) "A Guide to FRB/US–a Macroeconomic Model of the United States" Finance and Economics Discussion Series paper no. 1996-42, Board of Governors of the Federal Reserve System.
Brayton, Flint, Eileen Mauskopf, David L. Reifschneider, Peter Tinsley and John C. Williams (1997) "The Role of Expectations in the FRB/US Macroeconomic Model" Federal Reserve Bulletin: 227-245.
Brayton, Flint, Andrew Levin, Ralph Tryon and John C. Williams (1997) "The Evolution of Macro Models at the Federal Reserve Board" Carnegie-Rochester Conference Series on Public Policy, 47: 227-245.
Cecchetti, Stephen, Hans Genberg, John Lipsky and Sushil Wadhwani (2000) Asset Prices and Central Bank Policy, The Geneva Report on the World Economy, vol. 2 (London: Center for Economic Policy Research).
Christiano, Lawrence, Martin Eichenbaum and Charles Evans (2005) "Nominal Rigidities and the Dynamic Effects of a Shock to Monetary Policy" Journal of Political Economy, 113: 1-45.
Christiano, Lawrence and Christopher Gust (1999) "Comment" in J.B. Taylor (ed.) Monetary Policy Rules (Chicago: University of Chicago Press): 299-318.
Cogley, Timothy and Thomas J. Sargent (2004) "The Conquest of U.S. Inflation: Learning and Robustness to Model Uncertainty" unpublished manuscript, University of California at Davis and New York University (October).
Croushore, Dean and Thomas Stark (2001) "A Real-time Data Set for Macroeconomists" Journal of Econometrics, 105: 111-130.
Giannoni, Marc P. (2002) "Does Model Uncertainty Justify Caution?: robust optimal monetary policy in a forward-looking model" Macroeconomic Dynamics, 6: 111-144.
Giannone, Domenico, Lucrezia Reichlin and Luca Sala (2005) "Monetary Policy in Real Time" CEPR working paper no. 4981.

Hansen, Lars P. and Thomas J. Sargent (2005) Robustness (Princeton: Princeton University Press) forthcoming.
Henderson, Dale W. and Warwick J. McKibbin (1993) "A Comparison of Some Basic Monetary Policy Regimes for Open Economies: implications of different degrees of instrument adjustment and wage persistence" Carnegie-Rochester Conference Series on Public Policy: 221-317.
Kozicki, Sharon and Peter Tinsley (2001) "Shifting Endpoints in the Term Structure of Interest Rates" Journal of Monetary Economics, 43: 613-652.
Kreps, David (1998) "Anticipated Utility and Dynamic Choice" in Frontiers of Research in Economic Theory: The Nancy L. Schwartz Memorial Lectures (Cambridge: Cambridge University Press).
Levin, Andrew and John C. Williams (2003) "Robust Monetary Policy with Competing Reference Models" Journal of Monetary Economics, 50: 945-975.
Levin, Andrew, Volker Wieland and John C. Williams (1999) "Monetary Policy Rules under Model Uncertainty" in J.B. Taylor (ed.) Monetary Policy Rules (Chicago: University of Chicago Press): 263-299.
Levin, Andrew, Volker Wieland and John C. Williams (2003) "The Performance of Forecast-based Monetary Policy Rules under Model Uncertainty" American Economic Review, 93: 622-645.
McCallum, Bennett (1988) "Robustness Properties of a Rule for Monetary Policy" Carnegie-Rochester Conference Series on Public Policy, 39: 173-204.
Oliner, Stephen and Daniel Sichel (1994) "Computers and output growth revisited: how big is the puzzle?" Brookings Papers on Economic Activity, 2: 273-317.
Oliner, Stephen and Daniel Sichel (2002) "The Resurgence of Growth in the Late 1990s: is information technology the story?" Journal of Economic Perspectives, 14: 3-32.
Onatski, Alexei (2003) "Robust Monetary Policy under Model Uncertainty: incorporating rational expectations" unpublished manuscript, Columbia University.
Onatski, Alexei and Noah Williams (2003) "Modeling Model Uncertainty" Journal of the European Economic Association, 1: 1087-1122.
Orphanides, Athanasios (2001) "Monetary Policy Based on Real-time Data" American Economic Review, 91: 964-985.
Orphanides, Athanasios, Richard Porter, David Reifschneider, Robert Tetlow and Frederico Finan (2000) "Errors in the Measurement of the Output Gap and the Design of Monetary Policy" Journal of Economics and Business, 52: 117-141.
Reifschneider, David L., Robert J. Tetlow and John C. Williams (1999) "Aggregate Disturbances, Monetary Policy and the Macroeconomy: the FRB/US Perspective" Federal Reserve Bulletin (January): 1-19.
Roberts, John M. (2004) "Monetary Policy and Inflation Dynamics" Finance and Economics Discussion Series paper no. 2004-62, Board of Governors of the Federal Reserve System.

Romer, Christina and David Romer (2002) "The Evolution of Economic Understanding and Postwar Stabilization Policy" in Rethinking Stabilization Policy (Kansas City: Federal Reserve Bank of Kansas City): 11-78.
Rudebusch, Glenn (2001) "Is the Fed Too Timid?: monetary policy in an uncertain world" Review of Economics and Statistics, 83: 203-217.
Sack, Brian and Volker Wieland (2000) "Interest-rate Smoothing and Optimal Monetary Policy: a review of recent empirical evidence" Journal of Economics and Business, 52: 205-228.
Sargent, Thomas, Noah Williams and Tao Zha (2005) "Shocks and Government Beliefs: the rise and fall of American Inflation" forthcoming in American Economic Review.
Sims, Christopher and Tao Zha (2004) "Were There Regime Shifts in U.S. Monetary Policy?" Federal Reserve Bank of Atlanta working paper no. 2004-14 (June).
Söderström, Ulf (2002) "Monetary Policy with Uncertain Parameters" Scandinavian Journal of Economics, 54: 125-145.
Staiger, Douglas, James H. Stock and Mark Watson (2001) "Prices, Wages and the U.S. NAIRU in the 1990s" NBER working paper no. 8320 (June).
Svensson, Lars E.O. (1999) "Inflation Targeting as a Monetary Policy Rule" Journal of Monetary Economics, 43: 607-654.
Svensson, Lars E.O. (2002) "Inflation Targeting: should it be modeled as an instrument rule or a targeting rule?" European Economic Review, 46(4/5): 771-780.
Svensson, Lars E.O. and Robert Tetlow (2005) "Optimum Policy Projections" National Bureau of Economic Research working paper no. 11392. Forthcoming in International Journal of Central Banking.
Svensson, Lars E.O. and Michael Woodford (2003) "Optimal Indicators for Monetary Policy" Monetary Economics, 46: 229-256.
Taylor, John B. (1993) "Discretion Versus Policy Rules in Practice" Carnegie-Rochester Conference Series on Public Policy, 39: 195-214.
Tetlow, Robert (2005) "Time variation in the U.S. sacrifice ratio in real time and ex post" unpublished manuscript in progress, Division of Research and Statistics, Board of Governors of the Federal Reserve System.
Tetlow, Robert and Peter von zur Muehlen (2001) "Robust Monetary Policy with Misspecified Models: does model uncertainty always call for attenuated policy?" Journal of Economic Dynamics and Control, 25: 911-949.
Tulip, Peter (2005) "Has Output Become More Predictable?: changes in Greenbook forecast accuracy" Finance and Economics Discussion Series paper no. 2005-31, Board of Governors of the Federal Reserve System.
Williams, John C. (2003) "Simple Rules for Monetary Policy" Federal Reserve Bank of San Francisco Economic Review: 1-13.

A. Appendix

This appendix documents changes to the FRB/US model over the period from July 1996 to November 2003. The first section is fairly general, discussing the broad aspects of the model. In reflection of the importance of the productivity shock of the late 1990s on economic thought and on modeling at the Board of Governors, the second section focuses more narrowly on the model's supply block.

A.1 Model Changes by Vintage

Figures A1.a and A1.b–which are really one figure spread over two pages–provide a helicopter tour of the model's changes over time, along with reminders of some of the events of that era. The chart across the top shows two things: the total number of equation changes by vintage (the red bars, measured off the left-hand scale), and the total number of model equations, including identities (the blue line and the right-hand scale). Three facts immediately arise from the picture. First, there have been flurries of numerous changes in the model. Second, the number of changes has tended to decrease over time.37 And third, the number of equations has increased, particularly in the period from 2000 to 2002. The fact that many model changes were undertaken early in the model's history but without adding to the size of the model, while fewer changes were adopted later on that nonetheless added to the model's size, suggests that the early period was one of model shakedown while the latter period was one of revision. Indeed, during the period from about 1998 to 2002, the range of questions that the model was expected to address increased, and the staff's view of the economy became more complicated.

37 A "model change" is the non-trivial addition, deletion or change in specification of a "significant" model equation from the vintage immediately preceding. Re-estimation of a given equation does not count as a model change. Rewriting an equation in a mathematically equivalent way also does not count. In a fully articulated model with a large number of identities, changes in structural equations can oblige corresponding changes in a large number of associated identities. As a result, the count of model changes mounts rapidly.

Figure A-1a: Model changes by vintage, 1996-1999

Figure A-1b: Model changes by vintage, 2000-2003

The bottom part of the two pages is a table divided into two columns. The right-hand column of each page documents some of the more important model changes incurred over the period. The entries shown are marked with a letter (in red) with a corresponding entry appearing in the appropriate place, and in the same color, in the chart.

The left-hand column identifies some noteworthy economic events of the era. Some, but not all, of these events directly influenced subsequent model changes; the various NIPA revisions are stark examples of this. Other entries appearing in the left-hand column may have had a more indirect effect on model changes, or the timing of changes; some represent shocks to the model forecast that obscured, for a time, the emerging productivity boom. The Y2K phenomenon and its transitory influence on the boom in high-tech business investment in the period prior to January 1, 2000 is an example. Still others appear as reminders of the economic forces that were at work during the period, which in some instances influenced the questions asked of the model. For example, the long swings in the federal budget position and the exchange value of the dollar were among the factors that changed the nature of the questions asked of the model from shorter-term forecasting issues to medium-term policy-analysis and counterfactual-simulation issues. Analogous to the lettered entries in the right-hand column, the entries in the left-hand columns are marked by a number, with a corresponding entry appearing in the chart.

The stock market was already booming in July 1996, when the model was brought into service. By the end of the year, the model's stock market equation and the consumption and housing equations that stock market wealth affect had been changed. The most significant changes came, however, as the lasting implications of the productivity boom became prominent.
In late 1999, as a part of the comprehensive revisions to the National Income and Product Accounts, software was added to the measurement of the capital stock.38 Investment expenditures–particularly expenditures on information technology–boomed over the same period, as did stock market valuations. By late 1999, it became clear that machinery and equipment expenditures would have to be disaggregated into high-tech and "other" because of the sharp divergence in the movements of their relative prices. The boom also engendered other questions: what is the effect of an acceleration in productivity on the equilibrium real interest rate and on the savings rate? What are the implications of persistent differences in the productivity of the high-technology and other sectors of the economy? These and other questions resulted in a reformulation of the model's supply side.

38 Prior to that time, expenditures on software were regarded as an intermediate input; they had no direct effect on GDP.

New data, new questions and new specifications interacted in complex ways. The ascent of new economic views arose in a mixture of gradual accumulation of new data, together with spurts of marked revisions to historical data. The latter came as changes in definition and concept for the NIPA data played important roles throughout this period. The revisions and conceptual changes were not exogenous events, of course, but rather reflected, in part, the changes that were going on in the economy. Table A1 below summarizes the more important statistical revisions. The table shows, first, that the revisions changed the historical "backcast" of the data in substantial ways, and second, that they contributed to the pro-cyclical nature of the model's revisions to potential output.

The unifying theme of the questions of the time was a reorientation toward longer-run or lower-frequency questions than had previously been the case. The introduction of chain-weighted data in late 1996 made modeling these low-frequency trends feasible in a way that had not been the case before.39 The point is that changes to the model were not always a reflection of the model underperforming at the tasks it was originally built to do; in many instances, they were an outcome of an expansion of the tasks to which the model was assigned.

39 In the absence of chain-weighting, trends in relative prices, like the relative price of high-tech capital goods, could not be modeled well. The inability to account for weight shifts in expenditure bundles, which was merely a nuisance over short horizons, was a substantial barrier for the analysis of longer-term phenomena.

Table A1: Major NIPA revisions and their effects, 1996-2003*

| date | revision | major aspects of revision | estimated magnitude of revision |
|---|---|---|---|
| Jan. 1996 | comprehensive | Adoption of chain-weighted data; new definition of government investment; new methodology for calculating capital depreciation. | Real GDP growth revised up by 0.2 percentage point, on average, from 1959 to 1984, but down by 0.1 percentage point from 1987 to 1994. |
| July 1998 | annual | New source data. Methodological changes for expenditures on cars and trucks; improved estimates on consumer services; new method of computing business inventories; some software moved from investment to business expenses. | Raised real GDP growth from 1994:Q4 to 1998:Q1 by 0.3 percentage points, mostly through higher business investment. |
| Oct. 1999 | comprehensive | Switch to geometric weights to be consistent with earlier CPI redefinition; software included in business investment and capital stocks; new census data and 1992 benchmark input-output accounts. | Raised estimates of real GDP growth from 1987 to 1998 by an average of 0.4 percentage points. |
| July 2001 | annual | New source data. New price index for communications equipment; conversion from SIC to NAICS industry classification system. | Reduced estimates of real GDP growth from 1998:Q1 to 2001:Q1 by 0.3 percentage points, on average. |
| July 2002 | annual | New source data. New methodology taken on for computing wages and salaries; new price index for PCE services. | Real GDP growth revised down from 1999:Q1 to 2002:Q1 by 0.4 percentage points, on average. |

* Source: based on Anderson and Kliesen (2005), Table 2.

A.2 FRB/US aggregate supply block in real time

This section provides a summary of the evolution of the supply block of the FRB/US model. In particular, we outline the changes in the definition and behavior of potential output and its determinants over time.
As noted in the main text, changes in the model's supply side were initially driven by the lessons of the data, and in particular by a sequence of underpredictions of output with coinciding overpredictions of inflation.40 At first, the underpredictions were met with shifts in the deterministic paths of latent variables like the NAIRU and trend labor productivity. Stochastic elements of determinants of aggregate supply made their introduction in the August 1998 vintage. The first change was a relatively modest one, allowing stochastic trends in the labor force participation rate. More stochastic trends were to follow. Beginning with the May 2001 vintage, a production function accounting approach was adopted which allowed capital services to play a direct role in the evolution of potential, with stochastic trends in the average work week, the participation rate and in trend total factor productivity. The evolution from a nearly deterministic view of potential output to a stochastic view was complete. Among other things, this change in view manifests itself in more volatile measures of potential growth, and more ex post correlation between potential and actual output growth, just as the path for the August 2002 vintage shows.

The model's supply side is fairly detailed. In order to keep the exposition as short and transparent as possible, we simplify in describing certain aspects of the determinants of some variables where the simplification will not mislead the reader.41 Table A2 facilitates the exposition by explaining the mnemonics of the equations.

Table A2: Appendix equation mnemonics

| mnemonic | meaning |
|---|---|
| * (superscript) | desired, target or equilibrium value |
| y | output |
| q | labor productivity |
| n | employment |
| ng | government employment |
| lf | labor force |
| h | employment hours |
| ww | average work week |
| nq | labor quality |
| u | civilian unemployment rate |
| z | total factor productivity |
| k | capital services or stock |
| e | energy input |
| s | wedge between payroll and establishment surveys |
| t{j} | time trend, commencing at date j |
| d{k} | shift dummy, equals zero before k and unity thereafter |
| mave | moving average operator |

40 Svensson and Tetlow [2005] document the change in the Board staff's view between 1997 and 1999. Tulip [2005] summarizes the forecast record of the Board staff over this period and others. In both of these papers, it is the staff economic projection (a judgmental forecast) rather than the model forecast that is discussed, but the records of the two forecasts were quite similar.

41 For example, below we describe trend labor productivity as being a geometric weighted sum of lagged capital-to-output and energy-to-output ratios. This is true, but these actual ratios are then modeled as a function of desired ratios, which in turn are a function of the ratio of output price to user cost.

A.2.1 Aggregate supply in the July 1996 vintage

In the model's first vintage, potential output in the (adjusted) non-farm business sector, $y^*$, was the product of potential employment, $n^*$, trend labor productivity, $q^*$, and the trend average work week, $ww^*$, as shown by equation (A1). Equation (A2) shows that potential employment was given by the trend labor force, $lf^*$, adjusted for the NAIRU, $u^*$, and the trend in the wedge between the household and payroll surveys of employment, $s^*$, less a moving average of government employment. The trend labor force is just the civilian population over the age of 16 multiplied by some time trends and shift dummies. Trend labor productivity, $q^*$, is given by total factor productivity, $z^*$, and a long moving average of past capital-output and energy-output ratios, multiplied by their factor shares (and divided by labor's share). Target hours, $h^*$, was also modeled as a split time trend, as was the trend component of the wedge.

The NAIRU, $u^*$, was set at 6 percent and trend total factor productivity, $z^*$, was assumed to have grown exogenously at an annual rate of 2.3 percent until 1972, and to have slowed to a 1.2 percent pace thereafter.

$$y^* = n^* \, q^* \, ww^* \tag{A1}$$

$$n^* = lf^* \cdot [(1 - u^*) - s^*] - \mathrm{mave}(ng) \tag{A2}$$

$$lf^* = n16 \cdot (b_0 + b_1\, t47 + b_2\, t901 + b_3\, d90 + b_4\, d94) \tag{A3}$$

$$q^* = z^* \, \mathrm{mave}(k/y)^{0.257} \, \mathrm{mave}(e/y)^{0.075} \tag{A4}$$

$$ww^* = 1/\exp(a_0 + a_1\, t47 + a_2\, t801) \tag{A5}$$

The historical record aside, the point to take from the above is that potential output was modeled as essentially deterministic. The primitives underlying the evolution of $y^*$ over time were time trends, shift dummies and slow moving averages of variables that do not fluctuate a great deal with perturbations to the data.
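To make the structure of equations (A1) to (A5) concrete, the following Python sketch chains them into a single calculation. It is not FRB/US code: the function names, the treatment of time trends and dummies as plain numbers, and every numerical input are hypothetical and chosen only to exercise the formulas. Only the functional forms and the exponents 0.257 and 0.075 are taken from the equations above.

```python
import math


def trend_labor_force(n16, b, t47, t901, d90, d94):
    """(A3): population aged 16 and over times split time trends and shift dummies."""
    b0, b1, b2, b3, b4 = b
    return n16 * (b0 + b1 * t47 + b2 * t901 + b3 * d90 + b4 * d94)


def potential_employment(lf_star, u_star, s_star, mave_ng):
    """(A2): trend labor force adjusted for the NAIRU and the survey wedge,
    less a moving average of government employment."""
    return lf_star * ((1.0 - u_star) - s_star) - mave_ng


def trend_labor_productivity(z_star, mave_ky, mave_ey):
    """(A4): trend TFP times moving averages of the capital-output and
    energy-output ratios, raised to the exponents given in the text."""
    return z_star * mave_ky ** 0.257 * mave_ey ** 0.075


def trend_work_week(a, t47, t801):
    """(A5): inverse exponential of a split time trend."""
    a0, a1, a2 = a
    return 1.0 / math.exp(a0 + a1 * t47 + a2 * t801)


def potential_output(n_star, q_star, ww_star):
    """(A1): potential output as the product of its three trend components."""
    return n_star * q_star * ww_star


# Hypothetical inputs (not model values), used only to show the data flow.
lf_star = trend_labor_force(
    n16=200.0, b=(0.60, 0.001, 0.0, 0.0, 0.0), t47=50.0, t901=0.0, d90=0.0, d94=0.0
)
n_star = potential_employment(lf_star, u_star=0.06, s_star=0.01, mave_ng=15.0)
q_star = trend_labor_productivity(z_star=1.5, mave_ky=1.0, mave_ey=1.0)
ww_star = trend_work_week(a=(-math.log(35.0), 0.0, 0.0), t47=50.0, t801=0.0)
y_star = potential_output(n_star, q_star, ww_star)
print(round(y_star, 2))
```

Every input to `potential_output` here is either a fixed number or a deterministic function of time, which is the sense in which the first vintage treated potential output as essentially deterministic.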
The underlying view behind potential output at the outset of the model was a distinctly Keynesian one wherein the vast majority of fluctuations in output arose from demand-side factors.42

Two other aspects of the state of the modeling world in 1996 are worth noting. First, in response to the introduction early in 1996 of chain-weighting of the National Income and Product Accounts (NIPA) data, the model section considered adding chain-weighted code to the model. However, the idea was shelved for the time being because of the volume and complexity of the necessary additional code. Second, the economic importance of computers (and associated equipment and software) was beginning to attract the attention of some of the Board staff. In particular, Oliner and Sichel [1994] studied the implications of the penetration of computers in the workplace for the measurement of capital stocks and labor productivity. At the time they regarded the stock of computers as too small to be of quantitative importance for productivity measurement. For this reason, and because the relatively new and rapidly growing high-tech sector was difficult to model, the model section opted not to split out computers (or high-tech equipment) from producers' durable equipment. Both of these decisions would be revisited, and for related reasons, as we shall see.

A.2.2 The October 1997 vintage

As the first row of Table 1 in the main text shows, by October 1997, it was apparent that the staff were about to record a large error in the forecast of GDP growth in the four quarters ending 1997:Q3. The July 1996 forecast was for growth in real GDP of 2.2 percent, while the data would eventually come in at 4.8 percent. And yet this underestimate of output growth was arising without concomitant increases in inflation as the demand-side view of the world would predict.
The model builders responded by revising the model's NAIRU, raising it in the 1980s to about 6.3 percent (measured on a demographically adjusted basis), and then allowing a gradual reduction over the five years from 1989 to 1993 to about 5.5 percent, or one-half percentage point below the previous estimate for this period. As before, however, $u^*$ and other supply-side variables were extrapolated into the forecast period as exogenous trends; that is, there was no stochastic element to their revision and projection.

42 Perhaps more accurately it could be said that persistent supply shocks were infrequent enough that they could be disregarded as improbable, ex ante, and large enough that they could be identified in real time.

A.2.3 The April 1998 vintage

Instead of modeling trend labor productivity, $q^*$, as a combination of three slow-moving pieces and allowing a residual, the model section decided to enforce the identity connecting $q^*$ and $z^*$, eliminating the residual in the equation. Equation (A4) above still describes how trend labor productivity evolves in forecasting and simulation; in the historical data, however, $q^*$ was constructed with a kinked time trend, with $z^*$ backed out. As a consequence, looked at in isolation, trend total factor productivity showed significant time variation. Mathematically, the change was of trivial importance; however, the measure of trend total factor productivity that was implied by the choice of trend labor productivity was now a variable that could be reviewed and checked for its plausibility.

A.2.4 The August 1998 vintage

The shift in the NAIRU in October 1997 aside, potential output determination remained essentially the same until the August 1998 vintage, other than some tinkering in the number and dates of breaks in trend in the $h^*$ equation. At that point, the economy was booming and more workers were being elicited to offer their employment services than the staff had previously anticipated. The model builders decided to replace the split time trend in the desired labor force, $lf^*$, with a Hodrick-Prescott filter of the actual labor force participation rate in history and then extrapolate that trend exogenously into the forecast period.

$$lf^* = lfpr^* \cdot n16 \tag{A6}$$

$$lfpr^* = \mathrm{hp}(lfpr) \tag{A7}$$

The H-P filter, even though it is a two-sided filter, responds to fluctuations in the data in a way that time trends (kinked or otherwise) do not. Thus, the idea that the supply side of the economy had its (persistent) stochastic elements was introduced into the model.
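The hp(.) operator in (A7) can be illustrated with a minimal pure-Python sketch of the standard Hodrick-Prescott filter, which chooses the trend that minimizes squared deviations from the data plus a penalty of lambda times the squared second differences of the trend. The function names, the smoothing parameter of 1600 (the conventional quarterly value) and the toy series are assumptions made for illustration; the source does not report the settings the model section used.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            if factor != 0.0:
                for c in range(col, n + 1):
                    m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def hp_filter(series, lam=1600.0):
    """Return the H-P trend: the solution of (I + lam * K'K) trend = series,
    where K is the second-difference operator."""
    t = len(series)
    if t < 3:
        return [float(v) for v in series]
    a = [[1.0 if i == j else 0.0 for j in range(t)] for i in range(t)]
    stencil = (1.0, -2.0, 1.0)
    for i in range(t - 2):  # accumulate lam * K'K one row of K at a time
        for p in range(3):
            for q in range(3):
                a[i + p][i + q] += lam * stencil[p] * stencil[q]
    return solve_linear_system(a, [float(v) for v in series])


# Toy participation-rate-like series: flat for 20 periods, then rising.
flat = [100.0] * 20
rising = [100.0 + step for step in range(1, 11)]

first_pass = hp_filter(flat)           # trend estimated with 20 observations
revised = hp_filter(flat + rising)     # trend re-estimated with 30 observations
print(round(first_pass[19], 3), round(revised[19], 3))
```

Comparing `first_pass[19]` with `revised[19]` shows the two-sided property: the trend estimate for a given date changes once later observations arrive, unlike a fixed kinked time trend.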
At about the same time, the incipient productivity boom was raising new questions of a lower frequency (or longer-run) nature than the sorts of questions the model had originally been envisioned as answering. In particular, the model section was asking (and being asked) about the implications of a sustained increase in productivity capacity on wage determination, on stock market valuation, and on the equilibrium real interest rate. In addition, with the fiscal position of the federal government rapidly improving, questions regarding the determination of bond rates and the current account were coming to the forefront of discussion. These questions required longer-run simulations and more carefully modeled steady-state conditions than before. Approximations that had been deemed acceptable in model code for earlier vintages of the model were coming under the strain of the new demands on the model.

A.2.5 The August 1999 vintage

With the model section conducting more and more long-term simulations (simulations of, say, more than 20 years in length), the limitations of some of the approximations that had been used in place of chain-weighting in the model code were becoming apparent. What was "close enough" over a 12-quarter horizon was not close enough when approximation errors were allowed to essentially cumulate over an 80-quarter horizon. Accordingly, the section adopted chain-aggregated equations for the first time. The pertinence of this for the present discussion of the supply side of the model is that the long-duration productivity shocks, and other persistent supply shocks, that are now routinely carried out with the model could not have been done properly without chain-aggregation code.

A.2.6 The June 2000 vintage

The modeling of $ww^*$ with split time trends disappeared in favor of a Hodrick-Prescott filter. Out of sample, the trend was projected exogenously. The model section's use of stochastic trends was expanding.

$$y^* = n^* \, q^* \, ww^* \tag{A8}$$

$$ww^* = \mathrm{hp}(ww) \tag{A9}$$

Also around this time, Steve Oliner and Dan Sichel were completing their work (subsequently published in 2002) on the contribution of computers and other high-tech investments to the capital stock and consequently on measured productivity. Their work would eventually allow the model group to accurately measure capital services for the first time.
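The cumulation of approximation error described in A.2.5 can be illustrated with a small Python sketch that compares a fixed-weight quantity index (valued at base-period prices) with a chained Fisher index, the chain-type formula used in the NIPA. The two-good economy, its growth rates and the function names are hypothetical; the example is not the approximation the model code actually used, only a demonstration of why a gap that is small at 12 quarters can become large at 80.

```python
import math


def fixed_weight_index(prices, quantities, t):
    """Real output at date t valued at date-0 prices, relative to date 0."""
    base = sum(p * q for p, q in zip(prices[0], quantities[0]))
    current = sum(p * q for p, q in zip(prices[0], quantities[t]))
    return current / base


def chained_fisher_index(prices, quantities, t):
    """Cumulate period-by-period Fisher links (geometric mean of the
    Laspeyres and Paasche quantity relatives) from date 0 to date t."""
    index = 1.0
    for s in range(1, t + 1):
        p0, p1 = prices[s - 1], prices[s]
        q0, q1 = quantities[s - 1], quantities[s]
        laspeyres = sum(p * q for p, q in zip(p0, q1)) / sum(p * q for p, q in zip(p0, q0))
        paasche = sum(p * q for p, q in zip(p1, q1)) / sum(p * q for p, q in zip(p1, q0))
        index *= math.sqrt(laspeyres * paasche)
    return index


def overstatement(prices, quantities, t):
    """Proportional gap between the fixed-weight and chained measures at date t."""
    return fixed_weight_index(prices, quantities, t) / chained_fisher_index(prices, quantities, t) - 1.0


# Hypothetical economy: an "other" good with a stable price and slow growth,
# and a "high-tech" good whose price falls 4 percent per quarter while its
# quantity grows 6 percent per quarter.
QUARTERS = 80
prices = [(1.0, 0.96 ** t) for t in range(QUARTERS + 1)]
quantities = [(90.0 * 1.005 ** t, 10.0 * 1.06 ** t) for t in range(QUARTERS + 1)]

gap_12 = overstatement(prices, quantities, 12)
gap_80 = overstatement(prices, quantities, 80)
print(round(gap_12, 4), round(gap_80, 4))
```

Because the fixed-weight measure keeps valuing the fast-growing good at its old, high relative price, the gap widens with the horizon, which is the pattern the text attributes to long simulations.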

A.2.7 The May 2001 vintage

May 2001 featured large-scale changes to the model's supply side. First, and most important, a full production function approach was adopted:

$$y^* = (n^* \, ww^* \, nq^*)^{0.700} \, k^{0.265} \, \mathrm{mave}(e/y)^{0.0350} \, z^* / (1 - 0.0350) \tag{A10}$$

Second, investment in equipment and software was broken into two categories, high-tech and "other". And third, trend total factor productivity was modeled using an H-P filter, $z^* = \mathrm{hp}(z)$, with $z$ defined as the Solow residual of the equation immediately above evaluated with $y$ instead of $y^*$ on the left-hand side. At this point, H-P filters were figuring in the construction of potential output in three places: the trend work week, the trend labor force participation rate, and trend total factor productivity.

A number of events conspired to bring about these changes (or facilitated their adoption), most of which have already been mentioned. These include the ongoing productivity boom; the adoption of chain-aggregation model code; an acceleration in the decline in computer prices in 1999; the increase of computers and other high-tech equipment as a share of expenditures on machinery and equipment; and the BEA's comprehensive revision in December 1999 and January 2000 which added software to the definition of capital services.

The economic boom in the second half of the 1990s had made the kinked-time-trend view of trend labor productivity untenable; in the March 2001 vintage, for example, there were four breaks in trend for $q^*$, including two as recent and as close together as 1995:Q3 and 1998:Q1. As noted above, it also changed the nature of the questions that were asked of the model, turning them more toward longer-term issues, which changed the demands on the model. Finally, the events in high-tech production and investment made the avoidance of disaggregating expenditures on machinery and equipment too costly to bear.
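The Solow-residual step described for the May 2001 vintage can be sketched as follows. The exponents 0.700, 0.265 and 0.0350 come from equation (A10); the placement of the 1/(1 - 0.0350) term follows a literal reading of the extracted equation and should be treated as an assumption, as should the function names and all numerical inputs, which are hypothetical.

```python
LABOR_EXPONENT = 0.700
CAPITAL_EXPONENT = 0.265
ENERGY_EXPONENT = 0.0350


def production(n, ww, nq, k, mave_ey, z):
    """Output implied by the production function in (A10), read literally."""
    labor_input = n * ww * nq
    return (
        labor_input ** LABOR_EXPONENT
        * k ** CAPITAL_EXPONENT
        * mave_ey ** ENERGY_EXPONENT
        * z
        / (1.0 - ENERGY_EXPONENT)
    )


def solow_residual(y, n, ww, nq, k, mave_ey):
    """Back out z as the value that makes the production function reproduce
    observed output y, i.e. the equation evaluated with y on the left."""
    return y / production(n, ww, nq, k, mave_ey, z=1.0)


# Hypothetical inputs, for illustration only.
inputs = dict(n=120.0, ww=34.0, nq=1.1, k=9000.0, mave_ey=0.04)
observed_output = production(z=2.5, **inputs)
implied_z = solow_residual(observed_output, **inputs)
print(round(implied_z, 6))
```

In the vintage described above, a series of such residuals would then be smoothed with the H-P filter to obtain trend total factor productivity.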
A.2.8 The December 2001 vintage

Concern with the two-sided nature of the H-P filter had been building for some time within the model section. If one interpreted capital put in place and labor supply as the outcome of rational, optimizing agents, then the stock of capital and the level of potential output should reflect the beliefs of firms and workers over time. It followed that "trend" variables should not use information that was not available at the time decisions are made; that is, two-sided filters should be avoided.

The first step in the section's reconsideration of this was to replace the H-P filter of the trend labor force participation rate with a (one-sided) Kalman filter estimate. The Kalman filter model allowed the change in the log of the growth rate of the labor force participation rate to be a stochastic (drift) process. The model also allowed for the influence of the unemployment rate and unidentified stationary shocks on the participation rate. With this change, a distinction was introduced between the actual growth rate of potential output, $\Delta y^*$, and the trend growth rate, $g^*$, with the latter being interpreted as agents' beliefs about potential growth going ahead.

The model had always had an expectations block, but much of the model's expectations code was concerned with short-term expectations of stationary or "gap" variables. There were, however, important exceptions to this including the expected long-run inflation rate and the expected long-run real interest rate, as well as certain levels or shares of personal income. The former two variables were based on survey and financial market data, respectively, and could reasonably be said to represent private-sector expectations. The expected income variables were ad hoc autoregressive specifications that did not allow for changes in expected growth rates. The new view of trend labor force participation was the first formal step toward broadening the pre-existing modeling convention and reflected an increased appreciation of expectations of trend growth rates. This new view would eventually have substantial effects on measures of certain latent variables such as potential output growth, as a comparison of the data shown in Figure 2 of the main text makes clear.43

A.2.9 The March 2002 vintage

The H-P filter for the average work week is replaced by the drift component of a Kalman filter model for that variable.
For the expected trend growth rate of potential, $g^*$, a measure of the expected growth rate of trend total factor productivity is introduced. Here too, a Kalman filter is used and an I(2) drift term is extracted.

43 This distinction is the main reason why the growth rate of potential for the August 2002 vintage shown in Figure 2 looks so much more volatile than its predecessors. The expected growth rate upon which some of the model's agents base their decisions at any given date in history was smoother. At this stage, however, with just the labor force participation rate modeled using the Kalman filter, the distinction was not all that large.
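The one-sided character of a Kalman filter estimate can be seen in a minimal Python sketch. The source does not spell out the staff's state-space specification, so the sketch assumes a textbook local linear trend model (a level that cumulates a random-walk drift, observed with noise), which yields an I(2) trend; the function name, the variance settings and the diffuse prior are all hypothetical.

```python
def kalman_trend(series, obs_var=1.0, level_var=0.01, drift_var=0.001, prior_var=1.0e6):
    """Filtered (level, drift) estimates from a local linear trend model:
        level[t] = level[t-1] + drift[t-1] + level shock
        drift[t] = drift[t-1] + drift shock
        y[t]     = level[t] + observation noise
    Each estimate uses only observations up to and including date t."""
    level, drift = float(series[0]), 0.0
    p00, p01, p10, p11 = prior_var, 0.0, 0.0, prior_var  # diffuse-style prior
    estimates = []
    for y in series:
        # Prediction step: propagate the state and its covariance.
        level_pred = level + drift
        drift_pred = drift
        q00 = p00 + p01 + p10 + p11 + level_var
        q01 = p01 + p11
        q10 = p10 + p11
        q11 = p11 + drift_var
        # Update step: correct the prediction with the new observation.
        innovation = y - level_pred
        innovation_var = q00 + obs_var
        gain_level = q00 / innovation_var
        gain_drift = q10 / innovation_var
        level = level_pred + gain_level * innovation
        drift = drift_pred + gain_drift * innovation
        p00 = (1.0 - gain_level) * q00
        p01 = (1.0 - gain_level) * q01
        p10 = q10 - gain_drift * q00
        p11 = q11 - gain_drift * q01
        estimates.append((level, drift))
    return estimates


# Toy series: a trend of 0.3 per period plus a small deterministic wiggle.
data = [0.3 * t + 0.1 * ((7 * t) % 5 - 2) for t in range(30)]

early = kalman_trend(data[:20])   # estimates made with 20 observations
later = kalman_trend(data)        # estimates made with all 30 observations
print(early[-1] == later[19])
```

Here the estimate for date 19 is the same whether or not the later observations are included, which is the property that makes filtered drift terms interpretable as beliefs held at the time.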

Cite this document
APA
Robert J. Tetlow and Brian Ironside (2006). Real-time Model Uncertainty in the United States: The Fed from 1996-2003 (FEDS 2006-08). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2006-08
BibTeX
@techreport{wtfs_feds_2006_08,
  author = {Robert J. Tetlow and Brian Ironside},
  title = {Real-time Model Uncertainty in the United States: The Fed from 1996-2003},
  type = {Finance and Economics Discussion Series},
  number = {2006-08},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2006},
  url = {https://whenthefedspeaks.com/doc/feds_2006-08},
  abstract = {We study 30 vintages of FRB/US, the principal macro model used by the Federal Reserve Board staff for forecasting and policy analysis. To do this, we exploit archives of the model code, coefficients, baseline databases and stochastic shock sets stored after each FOMC meeting from the model's inception in July 1996 until November 2003. The period of study was one of important changes in the U.S. economy with a productivity boom, a stock market boom and bust, a recession, the Asia crisis, the Russian debt default, and an abrupt change in fiscal policy. We document the surprisingly large and consequential changes in model properties that occurred during this period and compute optimal Taylor-type rules for each vintage. We compare these optimal rules against plausible alternatives. Model uncertainty is shown to be a substantial problem; the efficacy of purportedly optimal policy rules should not be taken on faith. We also find that previous findings that simple rules are robust to model uncertainty may be an overly sanguine conclusion.},
}