feds · August 12, 2018

A Shadow Rate or a Quadratic Policy Rule? The Best Way to Enforce the Zero Lower Bound in the United States

Abstract

We study whether it is better to enforce the zero lower bound (ZLB) in models of U.S. Treasury yields using a shadow rate model or a quadratic term structure model. We show that the models achieve a similar in-sample fit and perform comparably in matching conditional expectations of future yields. However, when the recent ZLB period is included in the sample, the models' ability to match conditional expectations away from the ZLB deteriorates because the time-series dynamics of the pricing factors change. In addition, neither model provides a reasonable description of conditional volatilities when yields are away from the ZLB. Accessible materials (.zip)

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. A Shadow Rate or a Quadratic Policy Rule? The Best Way to Enforce the Zero Lower Bound in the United States Martin M. Andreasen and Andrew Meldrum 2018-056 Please cite this paper as: Andreasen,MartinM.,andAndrewMeldrum(2018). “AShadowRateoraQuadraticPolicy Rule? The Best Way to Enforce the Zero Lower Bound in the United States,” Finance and Economics Discussion Series 2018-056. Washington: Board of Governors of the Federal Reserve System, https://doi.org/10.17016/FEDS.2018.056. NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

A Shadow Rate or a Quadratic Policy Rule? The Best Way to Enforce the Zero Lower Bound in the United States Martin Andreasen and Andrew Meldrum (cid:3) Abstract We study whether it is better to enforce the zero lower bound (ZLB) in models of U.S. Treasury yields using a shadow rate model or a quadratic term structure model. We show that the models achieve a similar in-sample (cid:133)t and perform comparably in matching conditional expectations of future yields. However, when the recent ZLB period is included in the sample, the models(cid:146) ability to match conditional expectations away from the ZLB deteriorates because the time-series dynamics of the pricing factors change. In addition, neither model provides a reasonable description of conditional volatilities when yields are away from the ZLB. Keywords: Quadratic term structure models, Sequential regression approach, Shadow rate models, Zero lower bound. JEL: E43, E47, G12 Andreasen, mandreasen@econ.au.dk, Aarhus University, CREATES, and Danish Finance Institute; Meldrum, (cid:3) andrew.c.meldrum@frb.gov, BoardofGovernorsoftheFederalReserveSystem. WegivespecialthankstoHendrik Bessembinder (the editor of the Journal of Financial and Quantitative Analysis, where this paper is forthcoming) and an anonymous referee for many helpful suggestions. We thank Jens Christensen, Michiel De Pooter, Hans Dewachter, Gregory R. Du⁄ee, Tom Engsted, Peter H(cid:246)rdahl, Scott Joslin, Don Kim, Donna Lormand, Thomas Pedersen, Jean-Paul Renne, Glenn Rudebusch, Oreste Tristani, and Chris Young for helpful comments, as well as seminarparticipantsatthe2015SoFieConference,theFederalReserveBankofSanFrancisco,theEuropeanCentral Bank, and the Bank of England. Andreasen acknowledges (cid:133)nancial support from the Danish e-Infrastructure Cooperation (DeIC) and (cid:133)nancial support to CREATES (Center for Research in Econometric Analysis of Time Series; DNRF78) from the Danish National Research Foundation. Meldrum acknowledges the Bank of England, where he worked during the preparation of an early draft of this article (Bank of England Sta⁄ Working Paper No. 550, September 2015). The analysis and conclusions are those of the authors and do not indicate concurrence by the Bank of England, the Board of Governors of the Federal Reserve System, or other members of the research sta⁄of the Board.

I. Introduction The Gaussian a¢ ne term structure model (ATSM) has been one of the most popular dynamic term structure models (DTSMs) of the past two decades. However, a well-known shortcoming of the Gaussian ATSM is its inability to ensure non-negative nominal bond yields. The fact that U.S. yields have reached historically low levels during the past several years has therefore generated substantial interest in alternative DTSMs that are consistent with the zero lower bound (ZLB). Several studies of U.S. yields have extended the ATSM by truncating the short rate using the maximum function, which gives rise to the shadow rate model (SRM) proposed by Black (1995) (see e.g. Kim and Singleton (2012), Christensen and Rudebusch (2015), Bauer and Rudebusch (2016), and Wu and Xia (2016)). The SRM is appealing because it enforces the ZLB while preserving an approximately linear relationship between yields and pricing factors when yields are away from the ZLB. However, it is not the only way of enforcing the ZLB in DTSMs. Another possibility is to let the short rate be a restricted quadratic function of pricing factors, which leads to the quadratic term structure model (QTSM) studied by Ahn, Dittmar, and Gallant (2002), Leippold and Wu (2002), and Realdon (2006) among others.1 An obvious advantage of the QTSM relative to the SRM is that the QTSM attains closed-form expressions for bond prices, which makes it computationally much more tractable than the (multifactor) SRM, where bond prices are unavailable in closed form. Although both the SRM and QTSM enforce the ZLB, it is unclear which model performs best when estimated using U.S. data. Unlike the SRM, the QTSM implies a nonlinear relationship between yields and pricing factors when yields are away from the ZLB. Thus, the two models may have quite di⁄erent implications for the conditional moments of yields. The aim of this article is to increase our understanding of ZLB-consistent DTSMs by analyzing the ability of the SRM and QTSM to match these moments of U.S. yields. 1The ZLB may also be enforced in ATSMs with square-root processes, as in Cox, Ingersoll, and Ross (1985) and Dai and Singleton (2000). Alternative ways to account for the ZLB have recently been suggested by Feunou, Fontaine, Le, and Lundblad (2015), Filipovic, Larsson, and Trolle (2017), and Monfort, Pegoraro, Renne, and Roussellet (2017). 1

We highlight the following three main conclusions. First, there is little to distinguish between the ability of the SRM and QTSM in matching conditional expectations of yields. The two models display similar abilities to (cid:133)t yields in-sample, i.e. to match the conditional expectations of current yields, both away from and at the ZLB. When it comes to matching the conditional expectations of future yields, the SRM and QTSM also display similar performance, e.g. they have similar abilities to match short-rate expectations from surveys and to forecast yields out of sample. A standard 3-factor SRM does appear to o⁄er some small advantages relative to a 3-factor QTSM in satisfying standard tests for conditional expectations of future yields, i.e. Campbell-Shiller regressions, risk-adjusted Campbell-Shiller regressions, and Mincer-Zarnowitz regressions for realized excess returns. However, the di⁄erences are not statistically signi(cid:133)cant at standard con(cid:133)dence levels and are largely eliminated by the addition of a fourth factor to the QTSM. Thus, it is not clear that any small bene(cid:133)ts of using the SRM rather than the QTSM are su¢ cient to outweigh the greater computational complexity involved with estimating the SRM. Perhaps more noteworthy than the small di⁄erences between the SRM and QTSM are their common failings. Indeed, our second main conclusion is that neither the SRM nor the QTSM appears to fully capture the change in the dynamics of U.S. yields that occurred when the short rate reached the ZLB. When the SRM and QTSM are estimated using a sample that ends before the recent ZLB period, both models replicate the well-known ability of the ATSM to satisfy the standard tests referred to above. However, when the sample is extended to cover the ZLB period, neither model continues to satisfy these tests when yields are away from the ZLB (the same is true for the ATSM). The problem seems to be that the time-series dynamics of the pricing factors change when the recent period of low yields is included in the sample, which in turn a⁄ects model-implied expectations away from the ZLB. We also show that extending the standard 3-factor version of the SRM and QTSM with a fourth or (cid:133)fth pricing factor does not resolve this shortcoming. Our third main conclusion is that neither the SRM nor the QTSM has the (cid:135)exibility to provide a realistic description of the conditional volatilities of U.S. yields. Both models generate 2

a compression in the conditional volatilities of short-term yields at the ZLB, similar to what we observe in the data. However, they have counterfactual implications for conditional volatilities when yields are away from the ZLB because the SRM generates conditional volatilities that become approximately constant, while the QTSM generates a tight positive relationship between conditional volatilities and the level of yields. This study is most closely related to Kim and Singleton (2012), who examined the performance of 2-factor models estimated using Japanese yields from 1995 to 2008. They found that both the SRM and QTSM achieve a reasonable (cid:133)t to yields and are able to match movements in observed conditional volatilities close to the ZLB. In addition, their results suggest that the two models have similar implications for bond risk premia. However, we should be cautious about extrapolating their results to other countries, including the United States, because Japanese short rates were close to the ZLB for most of their sample period, whereas most studies of DTSMs in the United States consider samples where the short rate is also away from zero for an extended period. This di⁄erence makes the comparison of ZLB-consistent models more complicated when using U.S. yields, because we not only need to consider the performance of DTSMs at the ZLB, but also whether they preserve the desirable properties of the ATSM away from the ZLB.2 The rest of this article is organized as follows: In Section II, we outline the DTSMs and how they are estimated. Sections III, IV, and V respectively explore the models(cid:146)ability to match the cross section of yields, the conditional expectations of future yields, and the conditional volatilities of future yields. In Section VI, we investigate the implications of the models for Sharpe ratios and return predictability through the lens of the "robust properties" of Sharpe ratios and return predictability documented for ATSMs by Du⁄ee (2010). Here, we show that the SRM and QTSM reproduce these robust properties when yields are away from the ZLB, but that there are some di⁄erences between model-implied Sharpe ratios when short rates are close to the ZLB. In Section VII, we discuss our conclusions. 2Realdon (2016) studies ZLB-consistent models estimated using euro-area sovereign yields, which may provide a more relevant comparison with U.S. yields. However, the version of the SRM he estimates di⁄ers from the speci(cid:133)cation considered previously in the literature. 3

II. Models, Estimation Method, and Data In this section we present the ATSM, SRM, and QTSM and explain how we estimate them using U.S. Treasury yields. Although the ATSM does not enforce the ZLB, we include it because it serves as a benchmark for assessing the performance of the ZLB-consistent models. A. Dynamic Term Structure Models The ATSM, SRM, and QTSM that we consider are standard models. They all specify yields as functions of a small number of pricing factors, which follow (cid:133)rst-order Vector Autoregressions (VARs) under both the real-world and risk-neutral probability measures. In Section II.A.1, we present the common features of the models, while in Section II.A.2 we discuss how the functional forms for the short rate di⁄er between the models, and how we compute bond yields. 1. Pricing Factors In all three models, yields are driven by an n 1 vector of unobserved pricing factors x x t (cid:2) that follow a (cid:133)rst-order VAR under the physical measure P, i.e. (1) x = h +h x +(cid:6)" ; t+1 0 x t t+1 where " NID(0;I), h is an n 1 vector, and h and (cid:6) are n n matrices. In the t+1 0 x x x x (cid:24) (cid:2) (cid:2) absence of arbitrage, the time-t price of a j-period zero-coupon bond is P =EQ[exp( r )P ]. Here, r denotes the (risk-free) short rate and expectations are t;j t t t+1;j 1 t (cid:0) (cid:0) formed with respect to the risk-neutral probability measure Q. Following Du⁄ee (2002), the factors also follow a (cid:133)rst-order VAR under Q, i.e. (2) x = (cid:8)(cid:22)+(I (cid:8))x +(cid:6)"Q ; t+1 (cid:0) t t+1 where "Q NID(0;I), (cid:22) is an n 1 vector, and (cid:8) is an n n matrix. t+1 (cid:24) x (cid:2) x (cid:2) x 4

We impose standard restrictions on equations (1) and (2) to identify the models. In all of the models, we require (cid:6) to be lower triangular and let (cid:8) be a Jordan matrix with diagonal elements (cid:30) (cid:30) ::: (cid:30) .3 In addition, we require (cid:22) = 0 in the ATSM and SRM, 11 (cid:20) 22 (cid:20) (cid:20) nxnx whereas it is su¢ cient in the QTSM to restrict (cid:8)(cid:22) 0 (as shown by Realdon (2016)). The (cid:21) parameters in h and h are unrestricted in all three models. 0 x As is standard in most recent studies, we mainly focus on models with 3 pricing factors. However, some previous studies of ATSMs have argued that the inclusion of a fourth or (cid:133)fth pricing factor can help to predict future yields, even though these additional factors have little explanatory power for the current cross section of yields (see e.g. Cochrane and Piazzesi (2008), Du⁄ee (2011b), and Adrian, Crump, and Moench (2013)). We therefore also consider whether ZLB-consistent models with 4 or 5 factors deliver substantially di⁄erent results than the benchmark 3-factor model. As far as we are aware, this issue has been neglected in previous studies of ZLB-consistent DTSMs. 2. Short Rate Equations The models are closed by di⁄erent functional forms for the short rate. Here, we adopt the general speci(cid:133)cation that r = f (s ), where f ( ) is some function and s is a "short rate factor" t t t (cid:1) that is a¢ ne in the pricing factors, i.e. s = (cid:11)+(cid:12) x , where (cid:11) is a scalar and (cid:12) is an n 1 t 0 t x (cid:2) vector. In the remainder of this section, we discuss the di⁄erent functional forms for f ( ). (cid:1) ATSM The short rate in the ATSM is simply given by the short rate factor, i.e. r = s . We t t follow Joslin et al. (2011) and impose the identifying condition that (cid:12) = 1 (in addition to the restrictions on the Q dynamics mentioned in Section II.A.1). The standard 3-factor ATSM therefore has 22 free parameters (three in h , nine in h , six in (cid:6), three in (cid:8), and one in (cid:11)), 0 x with 4- and 5-factor ATSMs having 35 and 51 free parameters, respectively. The yield on a j-period zero-coupon bond is a¢ ne in the pricing factors, i.e. yATSM = (1=j) A +B x , t;j (cid:0) j 0j t where the recursive formulae for A and B are easily derived.4 (cid:0) (cid:1) j j 3We restrict (cid:8) to have real eigenvalues. Strictly speaking, a maximally-(cid:135)exible model would allow for complex eigenvalues (see Joslin, Singleton, and Zhu (2011)). 4An appendix providing further details on bond pricing in all three models is available on request. 5

SRM The short-rate in the SRM is given by r = max 0;s , which ensures that r cannot be t t t f g negative. We apply the same restriction that (cid:12) = 1 as in the ATSM, meaning that the SRM has the same number of free parameters as the ATSM. Closed-form expressions for yields are not available for the SRM with multiple factors, and we therefore approximate yields using the second-order approximation of Priebsch (2013). QTSM The short rate in the QTSM is given by r = s2, which also ensures that r cannot be t t t negative. We impose the parameter restrictions that (cid:11) = 0 and (cid:12) = 1, leaving a 3-factor QTSM with 24 free parameters (three in h , nine in h , six in (cid:6), three in (cid:22), and three in (cid:8)), with 4- 0 x and 5-factor QTSMs having 38 and 55 free parameters, respectively. The yield on a j-period zero-coupon bond is quadratic in the pricing factors, i.e. yQTSM = (1=j) A +B x +x C x , where the recursive formulae for A , B , and C are t;j (cid:0) j 0j t 0t j t j j j (cid:16) (cid:17) derived in Realdon (2006). The existence of closed-form bond prices means that the QTSM is e e e e e e computationally more tractable than the SRM, e.g. with 3 pricing factors and 1 period corresponding to 1 month, it takes around 1,000 times longer to compute yields with maturities up to 10 years in the SRM than in the QTSM.5 We should stress that this version of the QTSM is not maximally-(cid:135)exible. To understand why this is the case, note (cid:133)rst that the short rate in a maximally-(cid:135)exible QTSM is given by (3) r = (cid:14) +(cid:14) x +x (cid:1) x ; t 0 0x t 0t xx t where (cid:14) is a scalar, (cid:14) is an n 1 vector, and (cid:1) is an n n matrix. Together with the 0 x x xx x x (cid:2) (cid:2) restrictions given in Section II.A.1, identi(cid:133)cation is achieved when (cid:14) = 0 and (cid:1) is symmetric x xx with diagonal elements equal to 1 (see Ahn et al. (2002) and Realdon (2006)). The ZLB may then be enforced by imposing the additional restrictions that (cid:14) = 0 and (cid:1) is positive 0 xx semi-de(cid:133)nite. The version of the QTSM that we consider imposes both the normalizing and ZLB 5For this comparison we use Matlab 2017 code running on a PC with a 3:40 GHz Intel Core i7-6700 processor and 16 GB RAM. It takes about 0:01 seconds to compute yields for a single time period in the SRM and about 10 5 seconds in the QTSM. (cid:0) 6

restrictions. However, it also imposes that the o⁄-diagonal elements of (cid:1) are equal to 1 xx (because (cid:1) = (cid:12)(cid:12) = 1 ). These further restrictions on (cid:1) are convenient, because they xx 0 nx nx xx (cid:2) imply that the short rate in the QTSM can also be written as a function of a linear combination of the pricing factors, as in the ATSM and SRM. Moreover, unreported results show that this convenient feature of our QTSM is obtained with hardly any loss of (cid:135)exibility when (cid:133)tting yields, because the o⁄-diagonal elements in (cid:1) are essentially equal to 1 if we freely estimate xx them using our sample of U.S. yields from 1990(cid:150)2016. We therefore only report results from our over-identi(cid:133)ed QTSM throughout this article, with one exception discussed in Section V. To understand why the o⁄-diagonal elements in (cid:1) turn out to be close to 1 when freely xx estimated, consider the decomposition (cid:1) = ADA, where A is a lower triangular matrix with xx 0 diagonal elements equal to 1 and D is a diagonal matrix.6 We can then rotate the factors to x = Ax , which implies that the short rate can be re-written as t 0 t r = x2 +d x2 +:::+d x2 . As pointed out by Kim and Singleton (2012), if (cid:1) is e t 1;t 22 2;t nxnx nx;t xx positive de(cid:133)nite (i.e. d >0 for all i) then all of the rotated factors must be simultaneously equal ii e e e to 0 if the short rate is equal to 0, and thus longer-term yields are constant when the short rate is at the ZLB, which is counter to empirical evidence. If instead (cid:1) = 1 , such that the xx nx nx (cid:2) rank of (cid:1) is equal to 1, then only 1 eigenvalue of (cid:1) is di⁄erent from 0, and hence only the xx xx (cid:133)rst rotated factor appears in the short rate. The remaining factors are therefore free to match longer-term yields even when the short rate is at the ZLB.7 B. Estimation Method and Data 1. The Sequential Regression Approach In this section, we provide a brief description of how we estimate the models using the sequential regression (SR) approach of Andreasen and Christensen (2015). Previous studies have 6The just-identifying restrictions imply that the diagonal elements of D are given by d ii =1 (cid:0) i j(cid:0)= 1 1 a2 ij d jj for i=1;2;:::;n . Note that Kim and Singleton (2012) use a di⁄erent (but invariant) normalization scheme, in which x P they restrict (cid:6)=0:1 I but allow the diagonal elements of (cid:9) (and hence D) to be free parameters. (cid:2) 7ThisfollowsfromthefactthattheQdynamicsoftherotatedfactorsarex t+1 =A 0 (cid:8)(cid:22)+ I A 0 (cid:8)(A 0 )(cid:0) 1 x t + (cid:0) A va 0 l (cid:6) ue "Q t o + f 1 x . 1 S ;t in u c n e d A er i Q slo a w nd er t t h r e ia re n f g o u re la l r o , n I g (cid:0) er- A te 0 r (cid:8) m (A yi 0 e ) l (cid:0) d 1 s i b s e u c p au p s e e r r tr t ia = ng x u 2 1; l t ar w , h a ee n n d (cid:1) he x n x ce = a 1 ll n f x a (cid:2) c n t(cid:16) x o . rsa⁄ecttheexp(cid:17)ec e ted e e 7

typically estimated non-linear DTSMs by quasi-maximum likelihood using a non-linear extension of the Kalman (cid:133)lter. However, the asymptotic properties of this estimator are unknown, and the joint optimization across many parameters makes the estimation computationally challenging, even for 3-factor models. The SR approach overcomes these limitations because it provides consistent and asymptotically normal estimates and is computationally straightforward to implement. This computational simplicity greatly facilitates our comparison of DTSMs with up to 5 pricing factors. Before we describe the SR approach, it is convenient to de(cid:133)ne two vectors containing partly overlapping sub-sets of parameters. First, (cid:18) denotes the "risk-neutral parameters" that 1 determine the relationship between the factors and yields, while (cid:18) denotes the "time-series 2 parameters" that determine the P dynamics in equation (1). Because (cid:6) appears in both (cid:18) 1 and 0 (cid:18) , it is convenient to further partition these vectors as (cid:18) (cid:18) vech((cid:6)) and 2 1 (cid:17) 011 0 (cid:20) (cid:21) 0 (cid:18) (cid:18) vech((cid:6)) . The vector (cid:18) is given by (cid:11) diag((cid:8)) in the ATSM and SRM, 2 (cid:17) 022 0 11 0 (cid:20) (cid:21) (cid:20) (cid:21) 0 0 and by (cid:22) diag((cid:8)) in the QTSM. Finally, the vector (cid:18) = h vec(h ) in all three 0 0 22 00 x 0 (cid:20) (cid:21) (cid:20) (cid:21) models. Suppose in period t that we observe n yields with maturities m ;m ;:::;m . The y;t 1 2 ny;t observed yield with maturity m at time t is given by y = g (x ;(cid:18) )+v , where j t;mj mj t 1 t;mj g (x ;(cid:18) ) is the model-speci(cid:133)c function that relates the pricing factors to the cross section of mj t 1 yields and v is a measurement error. We assume that these measurement errors have means t;mj equal to 0 and (cid:133)nite, positive-de(cid:133)nite covariance matrices. The SR approach has three steps. At Step 1, we jointly estimate (cid:18) and the factors using 1 cross-sectional regressions. For a given value of (cid:18) , we can estimate the factors in period t as 1 1 ny;t 2 x^ ((cid:18) ) = arg min y g (x ;(cid:18) ) : t 1 xt Rnx2n y;t t;mj (cid:0) mj t 1 2 j=1 X(cid:0) (cid:1) To estimate (cid:18) we pool the squared residuals from these regressions and minimize their sum with 1 8

respect to (cid:18) , i.e. 1 1 T ny;t (4) (cid:18) ^step1 = arg min y g (x^ ((cid:18) );(cid:18) ) 2 : 1 (cid:18)1 (cid:2)1 2N t;mj (cid:0) mj t 1 1 2 t=1 j=1 XX(cid:0) (cid:1) Here, (cid:18) ^step1 denotes the Step 1 estimate of (cid:18) , N T n , and (cid:2) is the feasible domain of 1 1 (cid:17) t=1 y;t 1 ^step1 (cid:18) . Given standard regularity conditions, (cid:18) is coPnsistent and asymptotically normal when 1 1 n for all t.8 y;t ! 1 At Step 2 of the SR approach, we estimate (cid:18) using the estimated factors 2 x^ (cid:18) ^step1 T . As shown in Andreasen and Christensen (2015), when (cid:18) is unrestricted, we t 1 2 t=1 n (cid:16) (cid:17)o can estimate equation (1) simply by running a modi(cid:133)ed regression with all second moments corrected for estimation uncertainty in the factors. At Step 3 of the SR approach, we combine the estimates of (cid:6) from Step 1 and Step 2 ((cid:6)^step1 and (cid:6)^step2, respectively) optimally and re-estimate (cid:18) conditional on the optimal 11 estimate of (cid:6). Preliminary analysis revealed that (cid:6)^step1 tends to be estimated very inaccurately compared to (cid:6)^step2, meaning that the time-series estimate (cid:6)^step2 cannot be improved by adding information from the cross section of yields.9 We therefore simply condition on (cid:6)^step2 and re-estimate (cid:18) as 11 (cid:18) ^step3 = arg min T ny;t y g x^ (cid:18) ;(cid:6)^step2 ;(cid:18) ;(cid:6)^step2 2 ; 11 (cid:18)11 (cid:2)11 t;mj (cid:0) mj t 11 11 2 X t=1 X j=1 (cid:16) (cid:16) (cid:16) (cid:17) (cid:17)(cid:17) ^step3 where (cid:18) denotes the Step 3 estimate of (cid:18) and (cid:2) is the feasible domain of (cid:18) . We (cid:133)nally 11 11 11 11 update our estimate of (cid:18) by re-running Step 2 using the estimated factors 2 x^ (cid:18) ^step3 ;(cid:6)^step2 T . t 11 t=1 n (cid:16) (cid:17)o 8These regularity conditions are stated by Andreasen and Christensen (2015). No further assumptions are imposed on the measurement errors, i.e. they may be correlated across both maturity and time. 9This (cid:133)nding is consistent with the results of Joslin et al. (2011) for ATSMs, because their estimates of (cid:6) from the time-series dynamics of their (observed) factors hardly change when taking account of the cross section of yields. 9

2. Data Our data set consists of end-month U.S. nominal zero-coupon Treasury yields computed using the method of Fama and Bliss (1987). Our sample starts in Jan. 1990 and ends in Dec. 2016. The starting point of this sample is broadly representative of recent studies of SRMs using U.S. data and is consistent with the (cid:133)ndings of Rudebusch and Wu (2007), who argue there was a structural break in U.S. yields during the middle or late 1980s.10 The SR approach is constructed for settings with large cross sections, and we therefore include more yields than are typically used when estimating DTSMs. Speci(cid:133)cally, we represent the yield curve by 27 points, using the 1-month yield, yields in the 3-month to 3-year range at 3-month intervals, and yields in the 3- to 10-year range at 6-month intervals. III. Matching Conditional Expectations of Current Yields This section explores how well the models match the cross section of yields, i.e. the conditional expectations of current yields. In Section III.A, we show that the 3-factor models give a very similar average in-sample (cid:133)t. In Section III.B, we explain this result by showing that it is possible to rotate the factors in the three models such that they are approximately the same and have broadly similar loadings on yields. That said, the loadings in the SRM and QTSM do vary over time, which has potential consequences for the ability of the models to match the conditional expectations and volatilities of future yields, which we explore in Sections IV and V. Finally, in Section III.C, we show that adding a fourth and (cid:133)fth factor makes only a modest di⁄erence to the in-sample (cid:133)t of the models. A. In-Sample Fit We start by considering the (cid:133)t of the 3-factor models to the short rate (i.e. the 1-month yield). Figure 1 shows the plot of the model-implied short rate factor (s ) along the horizontal t 10In an appendix that is available on request, we assess the robustness of our main conclusions to extending the sample back to June 1961. We (cid:133)nd that most of our main conclusions continue to hold, although the inclusion of the ZLB period in the sample has a smaller e⁄ect than when the models are estimated using post-1990 data. 10

axes against both the model-implied short rate (black dots) and the observed 1-month yield in the data (gray stars). The (cid:133)gure illustrates the di⁄erent assumptions regarding the functional form of the short rate, i.e. a linear mapping in the ATSM, the so-called "hockey stick" of the maximum function in the SRM, and a smooth quadratic function in the QTSM. However, the di⁄erences between the models(cid:146)ability to (cid:133)t the data are most apparent if we focus on the ZLB period from 2009(cid:150)2016, as shown in the right-hand column in Figure 1.11 The top, right chart highlights occasionally negative (cid:133)tted short rates in the ATSM. The middle, right chart shows that the SRM avoids negative short rates by construction but produces (cid:133)tted short rates that are occasionally too low. This is particularly evident when the shadow rate is negative and the (cid:133)tted short rate is exactly 0%, whereas observed 1-month rates actually remain slightly positive throughout most of our sample. This means that the 3-factor QTSM (shown in the bottom, right chart) achieves a slightly closer (cid:133)t to the short rate during the ZLB period than the SRM. Nonetheless, as reported in Table 1, the di⁄erences between the three models are small. The root mean squared error (RMSE) for the short rate implied by the ATSM is 8.8 basis points over the ZLB period, compared with 7.1 basis points by the SRM and 5.6 basis points in the QTSM. The di⁄erences in (cid:133)t are also small for longer-maturity yields. Table 1 also shows RMSEs for yields with selected maturities, in addition to the overall RMSE computed across all 27 considered maturities. The 3-factor models achieve similar average (cid:133)t away from the ZLB and at the ZLB, e.g. the di⁄erences between the RMSEs implied by the SRM and QTSM are less than 2 basis points at almost all maturities (including those not reported in the table). B. Factor Loadings To explain why the 3-factor models give broadly the same in-sample (cid:133)t, we next show that their factors may be rotated such that they are approximately the same in each of the three models and have broadly similar loadings on yields. More speci(cid:133)cally, we apply invariant linear 11Throughout this paper, we refer to a "pre-ZLB period" ending in Dec. 2007 and a "ZLB period" from Jan. 2009(cid:150)Dec. 2016. Although the target for the federal funds rate remained at 4.25% at the end of 2007 and did not fall to a target range of 0(cid:150)0.25% until Dec. 2008, ending the pre-ZLB period in 2007 is consistent with the sample periods chosen by some recent studies of Gaussian ATSMs to avoid near-zero yields (for example, Bauer, Rudebusch, and Wu (2012)). Although the target range for the federal funds rate did rise to 0.25(cid:150)0.5% in Dec. 2015, it stayed at this low level until Dec. 2016 (i.e. the (cid:133)nal month in our sample), when it rose to 0.5(cid:150)0.75%. 11

Figure 1: In-Sample Fit of the Short Rate This(cid:133)gureplots theshortratefactor(s )againstthe shortratesimpliedby 3-factor models andobserved t 1-month yields. Charts to the left cover the full sample from Jan. 1990(cid:150)Dec. 2016, while charts to the right focus on the ZLB period from Jan. 2009(cid:150)Dec. 2016. Percent ATSM: 1990-2016 Percent ATSM: 2009-2016 10 0.8 5 0.3 0 -0.2 -0.008 -0.004 0.000 -0.0080 -0.0075 -0.0070 s s t t Percent SRM: 1990-2016 Percent SRM: 2009-2016 10 0.8 5 0.3 0 -0.2 -0.014 -0.007 0.000 -0.014 -0.010 -0.006 s s t t Percent QTSM: 1990-2016 Percent QTSM: 2009-2016 10 0.8 5 0.3 0 -0.2 -0.050 0.025 0.100 -0.02 0.01 0.04 s s t t Model-implied Data transformations to obtain rotated factors that may be interpreted as representing the level, slope, and curvature of the yield curve. To obtain these rotated factors, we consider a vector y = g(x ;(cid:18) ) of model-implied t t 1 yields with maturities from 6 months to 10 years at 6-month intervals. To a (cid:133)rst-order approximation, the unconditional covariance matrix of these model-implied yields is given by (cid:10) var[y ] = g (x(cid:22);(cid:18) )var[x ;(cid:18) ]g (x(cid:22);(cid:18) ) ; (cid:25) t x 1 t 2 x 1 0 where g ( ) denotes the (cid:133)rst derivative of g( ) evaluated at the unconditional mean of the x (cid:1) (cid:1) factors x(cid:22) and var[x ;(cid:18) ] is the variance of x . We can decompose this covariance matrix as t 2 t (cid:10) = VDV, where V is an n n matrix of eigenvectors and D is an n n diagonal matrix 0 y x x x (cid:2) (cid:2) of eigenvalues. The rotated factors z = Vg (x(cid:22);(cid:18) )x are then approximately equal to the (cid:133)rst t 0 x 1 t 12

Table 1: In-Sample Fit of Bond Yields Across Maturity Thistablereportstherootmeansquarederrors(RMSEs)inannualizedbasispointsbetweenactualyields at selected maturities and the (cid:133)tted values from the models with 3, 4, and 5 factors estimated using data from Jan. 1990(cid:150)Dec. 2016. Panel refers to the pre-ZLB period from Jan. 1990(cid:150)Dec. 2007, while Panel A refers to the ZLB period from Jan. 2009(cid:150)Dec. 2016. The (cid:133)nal column of each panel reports the RMSE B computed across all 27 considered maturities. : 1990(cid:150)2007 : 2009(cid:150)2016 A B Maturity (months): 1 6 12 24 60 120 All 1 6 12 24 60 120 All (a) ATSM n =3 9:3 6:4 7:5 3:0 4:0 5:3 4:9 8:8 2:7 7:0 4:1 7:6 13:0 6:4 x n =4 4:6 5:3 3:3 3:1 2:4 4:3 3:4 3:6 3:3 3:6 2:6 3:7 7:4 4:0 x n =5 3:6 4:9 2:7 3:6 2:4 5:3 3:5 2:4 3:5 2:2 2:9 2:9 6:5 3:7 x (b) SRM n =3 8:7 6:5 7:0 2:9 3:7 5:2 4:7 7:1 4:6 5:0 3:3 6:0 11:1 5:3 x n =4 4:5 5:1 2:8 3:0 2:5 4:4 3:3 2:9 3:4 3:2 2:3 3:7 6:2 3:8 x n =5 3:2 4:4 2:6 2:5 2:5 2:5 2:8 2:9 3:1 2:5 2:5 3:4 5:0 3:4 x (c) QTSM n =3 8:8 6:3 6:9 2:9 3:7 4:9 4:7 5:6 3:3 5:8 3:3 5:6 9:7 5:0 x n =4 4:0 4:4 2:7 2:7 2:5 4:5 3:2 2:6 3:2 3:3 2:3 3:7 7:3 3:9 x n =5 2:4 3:5 3:1 2:3 2:4 3:5 2:8 1:4 2:8 2:4 2:4 3:2 5:1 3:5 x n principal components of model-implied yields in the SRM and QTSM (the decomposition is x exact in the ATSM). We can then compute yields as a function of the rotated factors, i.e. 1 y t = f (z t ) g (V 0 g x (x(cid:22);(cid:18) 1 ))(cid:0) z t ;(cid:18) 1 ; (cid:17) (cid:16) (cid:17) and obtain approximate factor loadings for the rotated factors. The (cid:133)rst row of Figure 2 shows the rotated factors normalized to have means equal to 0 and standard deviations equal to 1. As we might expect, the correlations between these factors across the three models are high, particularly for the (cid:133)rst two factors, which explain most of the variation in yields. The remaining rows in Figure 2 show the rotated factor loadings in Mar. 1990, which is the month with the highest observed short rate, and July 2012, when the short rate is at the ZLB and yields are low at all maturities. We (cid:133)rst consider the ATSM (which has time-invariant 13

Figure 2: Factor Loadings This (cid:133)gure plots the rotated factors and the associated factor loadings from 3-factor models estimated using a sample from Jan. 1990(cid:150)Dec. 2016. For each model, we compute a (cid:133)rst-order approximation to the unconditional covariance matrix of yields when the unrotated pricing factors are equal to their unconditional mean. We then decompose this covariance matrix, which allows us to rotate the factors to be approximately equal to the principal components of model-implied yields. The (cid:133)rst row reports the rotated factors, normalized to have means equal to 0 and standard deviations equal to 1. The second and third rows report loadings of yields on the normalized rotated factors in Mar. 1990 and July 2012, respectively. First factor Second factor Third factor 4 4 5 2 2 0 0 0 -2 -2 -4 -4 -5 1990 2000 2010 1990 2000 2010 1990 2000 2010 Loading: Mar. 1990 Loading: Mar. 1990 Loading: Mar. 1990 4 1.5 0.8 2 0.5 0.3 0 -0.5 -0.2 0 2 4 6 8 10 0 2 4 6 8 10 0 2 4 6 8 10 Maturity (years) Maturity (years) Maturity (years) Loading: Jul. 2012 Loading: Jul. 2012 Loading: Jul. 2012 4 1.5 0.8 2 0.5 0.3 0 -0.5 -0.2 0 2 4 6 8 10 0 2 4 6 8 10 0 2 4 6 8 10 Maturity (years) Maturity (years) Maturity (years) ATSM SRM QTSM 14

loadings). The (cid:133)rst rotated factor has nearly constant loadings with maturity, meaning that it has the usual "level" interpretation. The loadings on the second and third rotated factors display the usual "slope" and "curvature" interpretations. We see broadly the same pattern for the SRM and QTSM at medium and long-term maturities. The fact that we can rotate the factors to be roughly equal and have similar loadings for medium and long-term yields explains why the three models display roughtly similar in-sample (cid:133)t at these maturities. However, there are some more notable di⁄erences between the rotated factor loadings at short maturities. When yields are far from the ZLB, the factor loadings in the SRM are (unsuprisingly) close to those in the ATSM. However, at the ZLB, the loadings in the SRM for short-term yields on the (cid:133)rst and second factor decline. The QTSM produces a similar compression in these loadings at the ZLB, but it also generates loadings for the level and slope factors that are substantially larger than in the ATSM and SRM when yields are high. The fact that the factor loadings in the ZLB-consistent models vary depending on the level of yields means that those models imply a trade-o⁄between matching current yields and matching conditional expectations of future yields. To understand why, recall that we essentially estimate the P dynamics of the models by minimizing the squared di⁄erences between 1-month-ahead expectations of the estimated factors and subsequent realizations. In the ATSM, this is essentially equivalent to achieving the best possible 1-step-ahead predictions of yields because there is a linear mapping between factors and yields. In the ZLB-consistent models, however, the conditional expectations of yields also depend on the volatilities of the pricing factors. Thus, the estimated time-series dynamics of the factors in the ZLB-consistent models do not necessarily provide the best possible 1-step-ahead predictions of yields, which may hinder the ability of those models to match conditional expectations of future yields. This concern seems likely to be more acute for the QTSM, which implies strong non-linearities even when yields are far from the ZLB, unlike the SRM. In Section IV, we therefore compare the ability of the three models to match the conditional expectations of future yields. The variation in the factor loadings in the SRM and QTSM also imply that the conditional volatilities of yields vary over time, even with pricing factors that have constant 15

conditional volatilities. We therefore also explore the implications of the models for conditional volatilities in Section V. C. Increasing the Number of Factors We conclude our analysis of the in-sample (cid:133)t of the models by considering whether it can be materially improved by moving beyond the standard 3 pricing factors. The results reported in Table 1 show that the addition of a fourth factor does indeed reduce the average (cid:133)tting errors, with the largest gains coming for the 1-month yield. However, the improvements are not economically signi(cid:133)cant. For example, during the ZLB period, the RMSE for the 1-month rate falls nearly 4 basis points in the SRM and about 3 basis points in the QTSM with the addition of a fourth factor. Adding a (cid:133)fth factor results in even smaller reductions in (cid:133)tting errors, and unreported results show that repeating the exercise in Section III.B with a 5-factor model produces loadings for the (cid:133)fth factor that are negligible. These results are consistent with the (cid:133)ndings of Du⁄ee (2010), who shows that the fourth and (cid:133)fth factors in ATSMs estimated using pre-ZLB data have only modest e⁄ects on the cross section of yields. However, he also shows that these factors may signi(cid:133)cantly a⁄ect expectations of future yields, and we therefore consider this possibility in the following section. IV. Matching Conditional Expectations of Future Yields We now turn to the models(cid:146)ability to match conditional expectations of future yields. Because those conditional expectations cannot be observed directly, we use four alternative approaches to evaluate the models(cid:146)performance. In Section IV.A, we consider the two "linear projections of yields" (LPY) tests of Dai and Singleton (2002), which examine whether the models can replicate the desired slope coe¢ cients from standard and risk-adjusted Campbell-Shiller regressions. In Section IV.B, we report Mincer-Zarnowitz regressions of observed excess returns on model-implied predicted excess returns. In Section IV.C, we consider how well model-implied short-rate expectations match the corresponding expectations implied by 16

surveys of professional forecasters. Finally, in Section IV.D, we evaluate the out-of-sample forecasting performance of the models. A. Campbell-Shiller Regressions 1. The LPY Tests of Dai and Singleton (2002) The two LPY tests proposed by Dai and Singleton (2002) provide a standard approach for testing the ability of DTSMs to match the conditional mean of future yields. The (cid:133)rst test (LPY(i)) examines whether DTSMs imply population slope coe¢ cients from Campbell and Shiller (1991) regressions that match those estimated in the data, i.e. the loadings (cid:30) in j m (5) y y = (cid:14) +(cid:30) (y y )+u ; t+m;j (cid:0) m (cid:0) t;j j jj m t;j (cid:0) t;m t;j (cid:0) with u IID(0;var(u )) for j = m+1;m+2;:::;K. The second test (LPY(ii)) examines t;j t;j (cid:24) whether yields within the observed sample obey the expectations hypothesis once adjusted for model-implied term premia. This corresponds to testing whether the loadings (cid:30)Q are equal to 1 j in the risk-adjusted version of equation (5) m m (6) y y (c c )+ (cid:18) = (cid:14)Q +(cid:30)Q (y y )+uQ : t+m;j (cid:0) m (cid:0) t;j (cid:0) t+m;j (cid:0) m (cid:0) t;j (cid:0) m j m t;j (cid:0) m j j j m t;j (cid:0) t;m t;j (cid:0) (cid:0) j 1 Here, c y (1=j) (cid:0) E [r ] is the term premium in the j-period yield, t;j t;j t t+i (cid:17) (cid:0) i=0 (cid:18) f E [r ] is thXe term premium in the forward rate f log(P =P ), and t;j t;j t t+j t;j t;j+1 t;j (cid:17) (cid:0) (cid:17) (cid:0) uQ IID 0;var uQ . t;j (cid:24) t;j Th(cid:0)e poss(cid:0)ibilit(cid:1)y(cid:1)that the linear relationships in equations (5) and (6) may change as yields approach the ZLB is of particular relevance for this study. One option would be to examine the models(cid:146)ability to produce the desired loadings both in periods away from the ZLB and in periods where the ZLB is binding. However, the slope coe¢ cients in the data for the LPY(i) test are estimated imprecisely when only using the short ZLB period, which means that such a comparison is not particularly informative. Moreover, while there is no uncertainty attached to 17

the desired slope coe¢ cients of 1 in the LPY(ii) tests, the model-implied coe¢ cients during the short ZLB period will be imprecisely estimated and hence also not particularly informative. We therefore employ an alternative procedure. We (cid:133)rst evaluate the models(cid:146)ability to satisfy the LPY tests when yields are away from the ZLB and the models are estimated using only pre-ZLB data. Next, we consider whether extending the sample for the model estimation to include the ZLB period a⁄ects the models(cid:146)ability to match the LPY tests when yields are away from the ZLB. If a model speci(cid:133)es the ZLB correctly, including yields at the ZLB in the model estimation should not adversely a⁄ect the properties of the model when yields are away from the ZLB. 2. LPY(i) Test Results For the LPY(i) test the details of our two-step procedure are as follows. In the (cid:133)rst step, we carry out the LPY(i) test for models estimated using a pre-ZLB sample, conditioning on yields being away from the ZLB. Speci(cid:133)cally, we estimate the models using a sample ending in Dec. 2007. Given these estimates, we then simulate 1,000 samples of the same length as in the data (216 months), discarding any simulated paths where any yield is below 1% in any period. For each of the simulated sample paths, we estimate equation (5) using a horizon (m) of 6 months and compute the mean estimate of (cid:30) . We compare these model-implied loadings j conditional on yields being above 1% with the loadings in the data for the period ending in Dec. 2007. In the second step, we carry out the same test for models estimated on a sample that includes the ZLB period. Speci(cid:133)cally, we estimate the models using a sample ending in Dec. 2016. Given these estimates, we repeat the above simulation procedure, conditioning on yields being above 1%. We again compare the resulting model-implied loadings conditional on yields being above 1% to the loadings in the data for the period ending in Dec. 2007. We start by analyzing the performance of the standard 3-factor models. The left column in Figure 3 shows results for the models estimated using a sample ending in 2007. The heavy black lines indicate the estimated Campbell-Shiller loadings ((cid:30) ) in the data. The loadings j 18

implied by the ATSM, SRM, and QTSM with 3 factors (shown using triangular markers) generally match the Campbell-Shiller loadings in the data closely, which is consistent with previous (cid:133)ndings for Gaussian ATSMs. However, extending the sample for estimating the the model estimation to 2016 distorts the ability of the 3-factor models to pass the LPY(i) test when yields are away from the ZLB. This is indicated by the charts in the right column in Figure 3, which show the loadings implied by models estimated using the sample ending in 2016. All of the 3-factor models now produce Campbell-Shiller loadings conditional on yields being away from the ZLB that fall too quickly with maturity, with the loadings at long maturities outside the 95% con(cid:133)dence interval for the estimates of (cid:30) in the data. j To examine the source of this deterioration against the LPY(i) test once the ZLB period is included in the sample, we take the estimated risk-neutral parameters (cid:18) and (cid:6)^step2 from the 11 3-factor models estimated using the sample ending in 2016, but exclude the period after 2007 b when estimating the P dynamics at Step 3 of the SR approach. Unreported results show that the resulting model-implied LPY(i) loadings are similar to those that we obtain when we estimate (cid:18) 11 and (cid:6)^step2 using the sample ending in 2007. This result suggests that the observed deterioration b against the LPY(i) test when the ZLB period is included is due to changes in the P dynamics (i.e. h 0 and h x ) at the ZLB, rather than changes in the short rate equation or the Q dynamics. We also note that the estimated Campbell-Shiller loadings in the data are lower for long-term yields when the sample is extended to 2016, e.g. the coe¢ cients for the 10-year bond are -1.2 and -1.6 for the samples ending in 2007 and 2016, respectively. During the ZLB period, bond returns of a given size are therefore associated with a smaller slope of the yield curve at the start of the holding period, perhaps because the short rate cannot fall below the ZLB. While the inclusion of yields at the ZLB should not a⁄ect the ability of a correctly-speci(cid:133)ed model to match the desired Campbell-Shiller loadings when yields are away from the ZLB, in the considered models the relationship between the slope of the yield curve and subsequent bond returns during the ZLB period appears to have a material e⁄ect on conditional expectations of future yields away from the ZLB. 19

Figure 3: Campbell-Shiller Loadings Away from the ZLB This (cid:133)gure reports Campbell-Shiller loadings implied by the models and the data. The loadings in the data are estimated using a sample from Jan. 1990(cid:150)Dec. 2007. The 95% con(cid:133)dence intervals for these estimates are computed using a block bootstrap applied jointly to the regressand and the regressor in the Campbell-Shiller regressions (in the data) with a block length of 189 months and 5,000 repetitions. The model-implied loadings are the mean loadings from running 1,000 Campbell-Shiller regressions on simulated samples of 216 months, conditional on all yields being above 1% at all points in the simulated sample. In the left column, all models are estimated using a sample from Jan. 1990(cid:150)Dec. 2007. In the right column, all models are estimated using a sample from Jan. 1990(cid:150)Dec. 2016. ATSM: 1990-2007 ATSM: 1990-2016 j2 j2 0 0 -2 -2 -4 -4 -6 -6 -8 -8 0 2 4 6 8 10 0 2 4 6 8 10 Maturity (years) Maturity (years) SRM: 1990-2007 SRM: 1990-2016 j2 j2 0 0 -2 -2 -4 -4 -6 -6 -8 -8 0 2 4 6 8 10 0 2 4 6 8 10 Maturity (years) Maturity (years) QTSM: 1990-2007 QTSM: 1990-2016 j2 j2 0 0 -2 -2 -4 -4 -6 -6 -8 -8 0 2 4 6 8 10 0 2 4 6 8 10 Maturity (years) Maturity (years) 3-factor 4-factor 5-factor Data 95% confidence interval (data) Figure 3 also shows the equivalent results for models with 4 and 5 factors. In short, the conclusions for the 3-factor models are broadly robust to the inclusion of additional factors. Adding a fourth or even (cid:133)fth factor makes only small di⁄erences when the models are estimated over the pre-ZLB period. When the sample is extended to 2016, the 4-factor QTSM actually 20

performs a little better than the 3-factor version (with loadings that are just within the 95% con(cid:133)dence interval for the loadings in the data), although the 5-factor QTSM performs slightly worse. In contrast, adding factors to the SRM results in a substantial deterioration in performance against the LPY(i) test. 3. LPY(ii) Test Results When implementing the LPY(ii) test we again adopt a two-step procedure. In the (cid:133)rst step, we estimate the models using data from 1990(cid:150)2007 and estimate equation (6) using model-implied term premia and a horizon (m) of 6 months. In the second step, we reestimate the models using the full sample ending in 2016 but again estimate equation (6) for 1990(cid:150)2007, i.e. before reaching the ZLB. The left column of Figure 4 shows the risk-adjusted Campbell-Shiller loadings implied by the 3-factor models estimated using a sample ending in 2007 (shown in triangular markers), with the dotted lines indicating the 95% con(cid:133)dence interval for these estimates. The estimated slope coe¢ cients implied by the 3-factor ATSM and SRM are close to the desired value of 1 at all maturities. While the point estimates from the 3-factor QTSM are slightly above 1, the desired value nevertheless falls within the con(cid:133)dence interval at all maturities. We therefore conclude that all of the 3-factor models satisfy the LPY(ii) test when they are estimated on a sample of yields that is away from the ZLB. However, the inclusion of the ZLB period in the sample to estimate the model parameters causes problems for all of the 3-factor models in matching the LPY(ii) test away from the ZLB, although the wide con(cid:133)dence intervals around the model-implied estimates mean that it is di¢ cult to be conclusive. This is shown in the right column of Figure 4, which presents estimates of (cid:30)Q for 1990(cid:150)2007 when the model parameters are estimated using the full sample from j 1990(cid:150)2016. The 3-factor ATSM and SRM continue to perform well for short- and medium-term yields, but display somewhat larger deviations from 1 for long-term yields. For the 3-factor QTSM, we see notable deviations from 1 at shorter maturities and even larger departures from 1 for long-term yields, although these di⁄erences are in general not statistically signi(cid:133)cant. 21

Figure 4: Risk-Adjusted Campbell-Shiller Loadings Away from the ZLB This (cid:133)gure reports risk-adjusted Campbell-Shiller loadings from Jan. 1990(cid:150)Dec. 2007, where term premia are obtained from models estimated using data from Jan. 1990(cid:150)Dec. 2007 (left column) and from Jan. 1990(cid:150)Dec. 2016 (right column). A well-speci(cid:133)ed model should return loadings equal to 1, which is highlighted using the heavy solid line. Conditional on the model estimates of term premia, the 95% con(cid:133)dence intervals for the risk-adjusted Campbell-Shiller loadings from the 3-factor models are computed using a block bootstrap applied jointly to the regressand and the regressor in the risk-adjusted Campbell-Shiller regressions with a block length of 189 months and 5,000 repetitions. Q ATSM: 1990-2007 Q ATSM: 1990-2016 j 3 j 3 2 2 1 1 0 0 -1 -1 0 2 4 6 8 10 0 2 4 6 8 10 Maturity (years) Maturity (years) Q SRM: 1990-2007 Q SRM: 1990-2016 j 3 j 3 2 2 1 1 0 0 -1 -1 0 2 4 6 8 10 0 2 4 6 8 10 Maturity (years) Maturity (years) Q QTSM: 1990-2007 Q QTSM: 1990-2016 j 3 j 3 2 2 1 1 0 0 -1 -1 0 2 4 6 8 10 0 2 4 6 8 10 Maturity (years) Maturity (years) 3-factor 4-factor 5-factor 95% confidence interval (3-factor) 22

To shed light on what is causing the relatively large change in the loadings for the 3-factor QTSM, we repeat the additional exercise reported for the LPY(i) test. We take the estimated risk-neutral parameters (cid:18) and (cid:6)^step2 for the QTSM estimated using a sample ending 11 in 2016, but exclude the period after 2007 when estimating the P dynamics at Step 3 of the SR b approach. Unreported results show that the model-implied LPY(ii) loadings for the 1990(cid:150)2007 period are similar to those that we obtain when we estimate (cid:18) and (cid:6)^step2 using the sample 11 ending in 2007. This result suggests that the deterioration in the performance against the b LPY(ii) test is due to a change in the P dynamics (i.e. h 0 and h x ) at the ZLB, rather than a change in the short-rate equation or the Q dynamics. The addition of a fourth pricing factor again improves the performance of the QTSM, such that it achieves risk-adjusted Campbell-Shiller loadings close to the desired value of 1 even when the model is estimated using the full sample period. In contrast, adding a fourth factor to the SRM or a (cid:133)fth factor to either the SRM or the QTSM again results in a deterioration in performance, as it does for the LPY(i) test. B. Mincer-Zarnowitz Regressions for Excess Bond Returns Another test of a DTSM(cid:146)s ability to match conditional expectations uses Mincer and Zarnowitz (1969) regressions of realized m-period excess returns on model-implied expectations, i.e. (7) rx = (cid:11) +(cid:11) E [rx ]+u ; t+m;m;j 0;j 1;j t t;m;j t+m;m;j where rx (j m)y +jy my . For a correctly speci(cid:133)ed model, the t+m;m;j t+m;j m t;j t;m (cid:17) (cid:0) (cid:0) (cid:0) (cid:0) intercept (cid:11) should equal zero and the slope coe¢ cient (cid:11) should equal 1. Because the results 0;j 1;j for the LPY tests raise the question of whether the models(cid:146)ability to match the loadings in equation (7) depends on whether the ZLB period is included in the sample for estimating the model, we again adopt a two-step procedure. In the (cid:133)rst step, we estimate the three models using a sample ending in 2007 and estimate equation (7) for model-implied expected returns 23

with a horizon (m) of 6 months. In the second step, we reestimate the model parameters using a sample ending in 2016, but again estimate equation (7) for excess returns from 1990(cid:150)2007. The charts in the left column of Figure 5 show the slope coe¢ cients from equation (7) when the models are estimated using a sample ending in 2007, while the dotted lines report the associated 95% con(cid:133)dence intervals. Although the 3-factor models (shown with triangular markers) imply slope coe¢ cients that are slightly larger than the desired value of 1, the desired value generally falls well within the con(cid:133)dence intervals. We therefore conclude that the 3-factor models can broadly satisfy the test posed by Mincer-Zarnowitz regressions when these models are estimated using a sample of yields that are away from the ZLB. However, the performance of the 3-factor models again deteriorates when they are estimated using a sample that includes the ZLB period. The right column of Figure 5 shows estimates of the slope coe¢ cients from equation (7), with the sample for estimating the models extended to 2016. The 3-factor models now imply slope coe¢ cients for the pre-ZLB period that are above 1 at short maturies and below 1 at long maturities.12 We can understand what generates this pattern in the slope coe¢ cients by examining the volatilities of excess returns in the data. The volatilities of excess returns on short-term yields are compressed during the ZLB period (as we discuss further in Section V). The ATSM implies that conditional volatilities are constant, meaning that it is unable to capture this volatility compression. When the sample is extended to include the ZLB period, the overall best (cid:133)t is therefore obtained by lowering the volatilities of excess returns on short-term bonds in all periods, which leaves the volatilities of excess returns during the pre-ZLB period too low, and hence generates values of (cid:11) above 1 for short maturities. However, at longer maturities the 1;j volatilities of returns do not fall by as much in the ZLB period. Instead, the pattern observed for the Campbell-Shiller loadings in Section IV.A.1 dominates, meaning that the relationship between the slope of the yield curve and bond returns becomes more negative at the ZLB. As we show in Section IV.A.1, the ATSM is unable to capture this e⁄ect, and the best (cid:133)t is obtained by 12Unreported results reveal that all the 3-factor models give estimates of (cid:11) in equation (7) that are close to 0;j and not signi(cid:133)cantly di⁄erent from 0 when the models are estimated using a sample ending in Dec. 2007. The same generally also applies when the sample for estimating the models is extended to Dec. 2016, except for a few short maturities, where the intercepts in the SRM are close to but signi(cid:133)cantly di⁄erent from 0. 24

Figure 5: Mincer-Zarnowitz Regression Slopes for Excess Returns Away from the ZLB This (cid:133)gure reports slope coe¢ cients from Mincer-Zarnowitz regressions of excess returns on an intercept and model-implied expected excess returns. Model-implied excess returns are computed by drawing 1,000 times from the conditional distributions of returns at each point in time. The point estimates and 95% con(cid:133)dence intervals for the 3-factor models are obtained using a block bootstrap procedure, with a block length of 189 months and 5,000 repetitions. The left column reports results when the model parameters are estimated using data from Jan. 1990(cid:150)Dec. 2007. The right column reports results when the sample for estimating the model parameters is extended to Dec. 2016. The Mincer-Zarnowitz regressions are in both cases estimated using a sample from Jan. 1990(cid:150)Dec. 2007. A well-speci(cid:133)ed model should return a slope coe¢ cient equal to 1, which is highlighted using the heavy solid line. ATSM: 1990-2007 ATSM: 1990-2016 1,j 2 1,j 2 1.5 1.5 1 1 0.5 0.5 0 0 0 2 4 6 8 10 0 2 4 6 8 10 Maturity (years) Maturity (years) SRM: 1990-2007 SRM: 1990-2016 1,j 2 1,j 2 1.5 1.5 1 1 0.5 0.5 0 0 0 2 4 6 8 10 0 2 4 6 8 10 Maturity (years) Maturity (years) QTSM: 1990-2007 QTSM: 1990-2016 1,j 2 1,j 2 1.5 1.5 1 1 0.5 0.5 0 0 0 2 4 6 8 10 0 2 4 6 8 10 Maturity (years) Maturity (years) 3-factor 4-factor 5-factor 95% confidence interval (3-factor) 25

increasing the overall correlation between the slope of the yield curve and bond returns for all periods. This implies that excess returns from 1990(cid:150)2007 become too volatile, and we therefore see values of (cid:11) below 1 at long maturities. 1;j Given that the 3-factor QTSM and SRM are unable to improve upon the ability of the ATSM to match the Campbell-Shiller loadings in Section IV.A, it is perhaps not surprising that these models also display values of (cid:11) that are too low for long maturities when using the full 1;j sample for the model estimation. As we will show in Section V, the QTSM is marginally better than the SRM at matching the volatility compression of the short-term yields at the ZLB, which is why the QTSM generates values of (cid:11) at short maturities that are closer to 1 than the SRM 1;j when using the full sample for model estimation. These results are broadly robust to the inclusion of additional pricing factors, although the picture is less clear-cut than for the LPY tests. The addition of a fourth factor generally worsens the performance of the models slightly, both close to and away from the ZLB. The 5-factor ATSM and SRM (but not the QTSM) generally perform a little better than the corresponding 3-factor models when estimated using pre-ZLB data. When including the ZLB period in the model estimation, the 5-factor ATSM and SRM imply slope coe¢ cients that are closer to 1 at short maturities, but their performance at longer maturities is substantially worsened. However, the 5-factor QTSM implies slope coe¢ cients that are below 1 at all maturities. In short, based on Mincer-Zarnowitz regressions, it is di¢ cult to make a compelling case for increasing the number of pricing factors beyond the standard 3. C. Matching Survey Expectations A further test of the models(cid:146)ability to match conditional expectations of future yields is whether they can match expectations implied by surveys of professional forecasters. We therefore construct a survey-based measure of short-rate expectations using the mean of responses to Blue Chip Financial Forecasts surveys of federal funds rate expectations. Because survey respondents are asked to report their expectations of the average federal funds rate over speci(cid:133)c calendar periods, we linearly interpolate to compute measures of expectations at 26

constant horizons of 6, 12, 24, and 36 months. These measures are available monthly at the 6and 12-month horizons and semi-annually at the 24- and 36-month horizons. In this section, we only report results for models estimated using the full sample from 1990(cid:150)2016, but compare model-implied expectations with surveys separately for the pre-ZLB period from 1990(cid:150)2007 and the ZLB period from 2009(cid:150)2016. Table 2 reports the RMSEs between the survey-based measure and expected model-implied short rates. The 3-factor models provide an almost identical average (cid:133)t during the pre-ZLB period. During the ZLB period, more substantial di⁄erences emerge between the models, as the SRM consistently outperforms the QTSM. However, the di⁄erences between squared forecast errors are not statistically signi(cid:133)cant according to unreported Diebold-Mariano tests, perhaps because the ZLB period is relatively short. Table 2: Matching Short-Rate Expectations from Surveys This table reports the root mean squared errors in annualized percentage points between the model-implied expected short rate and the expected federal funds rate derived from Blue Chip Financial Forecasts surveys. The survey-based measure of the expected short rate is computed as the mean response across all survey participants. These expectations are linearly interpolated to compute expectations at the reported constant horizons. The model-implied short rate expectations are derived from models estimated using data from Jan. 1990(cid:150)Dec. 2016. 3-Factor Models 4-Factor Models 5-Factor Models 1990(cid:150)2007 2009(cid:150)2016 1990(cid:150)2007 2009(cid:150)2016 1990(cid:150)2007 2009(cid:150)2016 (a) 6 months ahead ATSM 0:47 0:24 0:48 0:23 0:48 0:29 SRM 0:47 0:15 0:44 0:16 0:42 0:16 QTSM 0:46 0:18 0:50 0:14 0:47 0:24 (b) 12 months ahead ATSM 0:80 0:39 0:79 0:35 0:85 0:63 SRM 0:80 0:21 0:72 0:26 0:70 0:25 QTSM 0:79 0:29 0:80 0:21 0:79 0:40 (c) 24 months ahead ATSM 1:66 1:48 1:65 1:50 1:79 2:00 SRM 1:68 1:27 1:56 1:22 1:53 1:35 QTSM 1:70 1:46 1:66 1:33 1:80 1:56 (d) 36 months ahead ATSM 2:04 2:14 2:01 2:19 2:20 2:73 SRM 2:04 1:97 1:92 1:91 1:88 2:08 QTSM 2:06 2:25 2:02 2:04 2:25 2:31 27

Consistent with the results from the LPY tests, the gap between the performance of the models is narrowed by the addition of a fourth pricing factor to the QTSM (a fourth factor also generally improves the performance of the SRM, but by less). The case for adding a (cid:133)fth factor to both models again appears weak, because the 5-factor models do less well at matching survey expectations during the ZLB period. D. Out-of-Sample Forecasts Finally, we evaluate the models(cid:146)ability to match conditional expectations of bond yields by exploring how well they predict future yields in a recursive out-of-sample forecasting exercise. Speci(cid:133)cally, we (cid:133)rst estimate all the models using a sample from Jan. 1990(cid:150)Jan. 2005 and produce forecasts of yields at horizons up to 12 months ahead. We then repeat the process recursively, adding 1 month of data to the estimation sample at each iteration. The (cid:133)nal estimation sample ends in Dec. 2015 because we reserve the (cid:133)nal 12 months of data for forecast evaluation. This forecast period seems challenging for the models, because it contains yields (i) far from zero, (ii) when hitting the ZLB, and (iii) a prolonged period at the ZLB. The left column of Figure 6 shows the root mean squared prediction errors (RMSPEs) by maturity for 6- and 12-month forecast horizons. The 3-factor SRM and QTSM both out-perform the 3-factor ATSM, particularly at the shorter forecast horizon. However, there is very little di⁄erence between the average forecasts from the two ZLB-consistent models and unreported Diebold-Mariano tests show that the di⁄erences are not signi(cid:133)cant at a 95% con(cid:133)dence level. We also (cid:133)nd that adding a fourth factor to the SRM and QTSM (shown using the gray markers) generally worsens the forecast performance of these models, particularly for medium- and long-term yields. Thus, in contrast to results elsewhere in this section, the inclusion of a fourth factor in the QTSM does not improve its forecasting performance. This result suggests that while a 4-factor model performs better in matching other measures of conditional expectations, its larger number of parameters harms its ability to forecast well out-of-sample. Unreported results show that 5-factor versions of the models perform extremely poorly in forecasting out-of-sample, with RMSPEs that are o⁄the scales of the charts reported in Figure 6. 28

Overall, none of these benchmark DTSMs are particularly compelling when it comes to forecasting yields, as they all struggle to beat a random walk forecast, which is shown by the heavy black lines. This is a common (cid:133)nding for DTSMs with fully-(cid:135)exible time-series dynamics, and previous studies have found that restricting those dynamics can substantially improve forecasts in ATSMs when yields are away from the ZLB. Diebold and Li (2006) and Christensen, Diebold, and Rudebusch (2011) obtain better forecasts when h is a diagonal matrix, while x Du⁄ee (2011a) (cid:133)nds a similar result when setting the (cid:133)rst eigenvalue of h equal to 1. x To explore whether a similar result also holds for the SRM and QTSM, we next consider speci(cid:133)cations where we restrict the P dynamics by letting h x be a diagonal matrix with h x;11 =1. The right column of Figure 6 shows the RMSPEs from these restricted models. The restricted 3-factor SRM and the restricted 3- and 4-factor QTSM now outperform a random walk in forecasting short-maturity yields, particularly at the 12-month horizon. The di⁄erences between the ZLB-consistent models remain fairly small, although the SRM does achieve slightly smaller RMSPEs at the 12-month horizon.13 E. Summary Despite the potential for non-linear terms to cause problems for the ZLB-consistent models (and particularly the QTSM) when it comes to matching conditional expectations of future yields, our results suggest that there is actually little to choose between the models in this respect. While a 3-factor SRM does o⁄er some small advantages relative to a 3-factor QTSM in matching some of the tests considered in this section, the di⁄erences are not statistically signi(cid:133)cant and in most cases are eliminated with the addition of a fourth factor to the QTSM. In contrast, adding a fourth factor to the SRM or a (cid:133)fth factor to either model results in a deteriotation in performance. In the remainder of this article, we therefore focus on the standard 3-factor models, although we also report results for the 4-factor QTSM when these are substantively di⁄erent. 13Unreported results show that the ATSM and SRM with 4 and 5 factors do not bene(cid:133)t from letting h be x diagonal. The same holds for the QTSM with 5 factors. This is because these additional factors increase the probabilityofgettingnearidenticaleigenvaluesin(cid:8),andhenceverysimilarloadingsforsomeofthefactors,which then generates strong factor cross-correlation under the P measure that is not captured by letting h x be diagonal. 29

Figure 6: Out-Of-Sample Forecasting Performance This(cid:133)gurereportsrootmeansquaredpredictionerrorsbetweenactualyieldsandmodel-impliedforecasts. The forecasts are for Jan. 2006(cid:150)Dec. 2015. They are computed by recursively estimating each of the models, starting with a sample ending in Dec. 2005. The forecasted yields in the SRM are computed by drawing 10,000 times from the conditional distributions of yields. The (cid:133)gure shows results for 3-factor versions of all the models, as well as the 4-factor QTSM (denoted QTSM(4)) and SRM (denoted SRM(4); only for the unrestricted case). Random walk forecasts are constructed by assuming that yields do not change from their values at the end of the relevant sample period. Basis Basis points Unrestricted: 6-month ahead points Restricted: 6-month ahead 90 90 80 80 70 70 60 60 50 50 0 2 4 6 8 10 0 2 4 6 8 10 Maturity (years) Maturity (years) Basis Basis points Unrestricted: 12-month ahead points Restricted: 12-month ahead 150 150 100 100 50 50 0 2 4 6 8 10 0 2 4 6 8 10 Maturity (years) Maturity (years) ATSM SRM QTSM QTSM(4) SRM(4) Random walk Perhaps more noteworthy than the small di⁄erences between the models are their common failings. In particular, when the SRM and QTSM are estimated on pre-ZLB data, both models replicate the satisfying ability of the ATSM to match the LPY tests and the desired coe¢ cients from Mincer-Zarnowitz regressions. However, when the models are estimated on a sample that includes the ZLB period, both models are unable to match the LPY tests and the Mincer-Zarnowitz regressions when yields are away from the ZLB. Thus, we conclude that neither model fully captures the change in the dynamics of yields that occurred when the short rate reached the ZLB. 30

V. Matching Conditional Volatilities We now turn to the second moment of yields. As discussed in Section III.B, the time-varying factor loadings for the SRM and QTSM imply that these models generate time-variation in the conditional volatilities of yields, in contrast to the ATSM. In Section V.A, we show that although the SRM and QTSM can generate a compression in the volatilities of short-term yields at the ZLB, similar to what we observe in the data, both models imply a link between volatilities and the level of yields that is too tight when yields are away from the ZLB. In Section V.B we consider whether a more (cid:135)exible version of the QTSM can (cid:133)t conditional volatilities more closely when yields are away from the ZLB, without implying a materially worse (cid:133)t to the (cid:133)rst moments of yields. Our results suggest that the QTSM is simply unable to match the (cid:133)rst two conditional moments of yields simultaneously.14 A. The Tight Link Between the Level of Yields and Volatility To estimate model-implied conditional volatilities, we use a local linearization of the relationship between the yields and pricing factors, i.e. we approximate the model-implied one-period-ahead conditional volatility of y as t+1;j (8) (cid:27) (x^ ;(cid:18) ) g (x^ ;(cid:18) ) (cid:6)(cid:6)g (x^ ;(cid:18) ); t+1;j t 1 j;x t 1 0 0 j;x t 1 (cid:25) q b where g (x^ ;(cid:18) ) denotes the (cid:133)rst derivative of g (x ;(cid:18) ) with respect to the pricing factors, j;x t 1 j t 1 evaluated at x^ .15 Because we do not observe conditional volatilities in the data, we approximate t them using the generalized autoregressive conditional heteroscedasticity (GARCH) model of Bollerslev (1986) applied separately to each bond yield. The (cid:133)rst row in Figure 7 shows plots of the (cid:133)tted 1-year yields on the horizontal axes against their conditional volatilities from the 3-factor QTSM and SRM on the vertical axes (the 14We do not consider the SRM in this context because near-constant conditional volatilities away from the ZLB are an intrinsic feature of the SRM. 15For the ATSM and QTSM it is straightforward to compute analytical expressions for these derivatives. For the SRM, we evaluate the derivatives numerically. 31

black markers). We compare these results with the corresponding 1-year yields and conditional volatilities in the data (the gray markers). The bottom chart shows the same data points using a time-series plot. Figure 7: Conditional Volatility and the Level of One-Year Bond Yields The top row of charts plots one-year yields y on the horizontal axes against their conditional volatilt+1;12 ities (cid:27) (y ) on the vertical axes, both in the data and implied by the 3-factor models estimated t t+1;12 using data from Jan. 1990(cid:150)Dec. 2016. Yields and conditional volatilities are expressed as annualized percentages. The model-implied conditional volatilities are computed using a local linearization of the relationship between yields and the pricing factors, evaluated at the estimated factor values. The conditional volatilities in the data are estimated using a univariate GARCH(1,1) model applied to the changes in the yield. The bottom chart plots the conditional volatilities from the QTSM and SRM over time, together with the GARCH(1,1) estimates. (Percent) (Percent) t+1,12 SRM t+1,12 QTSM 0.6 0.6 0.3 0.3 0 0 0 5 10 0 5 10 y (Percent) y (Percent) t,12 t,12 (Percent) t+1,12 1 0.8 0.6 0.4 0.2 0 1990 1995 2000 2005 2010 2015 SRM QTSM GARCH(1,1) As suggested by the time-varying factor loadings reported in Section III.B, both the SRM and QTSM imply a compression in conditional volatilities as yields approach the ZLB, with the QTSM achieving a slightly better (cid:133)t to the GARCH estimates. Thus, close to the ZLB, the models have realistic implications for conditional volatilities, consistent with the results of Kim and Singleton (2012) for Japanese yields. Away from the ZLB, however, neither the SRM nor the QTSM can provide a realistic 32

description of volatilities in the data. In the SRM, conditional volatilities are by construction approximately constant when yields are su¢ ciently far from the ZLB. In the QTSM there is a stronger positive relationship between conditional volatilities and yields even far from the ZLB. In contrast, in the data volatilities vary over time (unlike in the SRM) but are only weakly correlated with the level of yields (unlike in the QTSM). Finally, unreported results show that there are no substantive di⁄erences to the results discussed in this section when we add additional pricing factors to either model. This failure to match conditional volatilities when yields are away from the ZLB is consistent with the (cid:133)ndings of some previous studies. In particular, Kim (2007) (cid:133)nds that a 2-factor QTSM with only positive eigenvalues in (cid:1) (to xx enforce the ZLB, as discussed in Section II.A.2) does not produce a good (cid:133)t to conditional volatilities in the data. B. Improving the Fit to Conditional Volatilities As discussed above, it is no surprise that the SRM generates essentially constant conditional volatilities when yields are away from the ZLB. To understand why our benchmark QTSM implies an extremely tight relationship between the level of yields and volatilities, recall that we impose that the short rate is the square of a linear combination of the pricing factors. As discussed in Section II.A.2, this gives the model the (cid:135)exibility to match variation in long-term yields when the short rate is at the ZLB. However, this feature comes at the cost of generating an extremely tight link between the level of the short rate and its volatility, because var [r ] 4(cid:12) (cid:6)(cid:6)(cid:12) r , i.e. the conditional volatility of the short rate is approximately t t+1 0 t (cid:25) (cid:2) proportional to the short rate itself. The results reported in Section V.A suggest that there is a similar tight link between volatilities and the level of yields at longer maturities. To explore whether this tight link can be relaxed without materially harming the model (cid:133)t to the (cid:133)rst moments of yields, we reestimate the 3-factor QTSM but with two di⁄erences relative to the QTSM discussed elsewhere in this article. First, to give the model the best possible chance of matching yields and volatilities simultaneously, we allow the o⁄-diagonal elements of (cid:1) in equation (3) to be free parameters (subject to (cid:1) remaining symmetric and xx xx 33

positive semi-de(cid:133)nite to ensure identi(cid:133)cation and to enforce the ZLB, respectively, as explained in Section II.A.2). Second, we include the GARCH estimates of 1-month-ahead conditional volatilities of the 1-, 5- and 10-year yields as "observables", along with the same set of yields used to estimate the benchmark model. Speci(cid:133)cally, at Step 1 of the SR approach, we modify equation (4) such that the estimated parameters are given by 1 T ny;t nv;t (9) (cid:18) ^step1 = arg min y g (x ;(cid:18) ) 2 + (cid:27)GARCH (cid:27) (x ;(cid:18) ) 2 ; 1 (cid:18)1 (cid:2)1 2N " t;mj (cid:0) mj t 1 t+1;j (cid:0) t+1;j t 1 # 2 t=1 j=1 j=1 X X(cid:0) (cid:1) X(cid:0) (cid:1) b where (cid:27) (x ;(cid:18) ) is the approximate model-implied volatility computed according to equation t+1;j t 1 (8), (cid:27)GARCH is the corresponding GARCH estimate, n is the number of observed volatilities in t+b1;j v;t period t, and N T (n +n ).16 Below we report results for (cid:133)tted yields and volatilities (cid:17) t=1 y;t v;t from Step 1 of the PSR approach, although in principle we could also make a similar modi(cid:133)cation to Step 3 in the SR approach. Figure 8 shows that this modi(cid:133)ed version of the QTSM does indeed achieve a closer (cid:133)t to the conditional volatility of the 1-year yield when compared to the benchmark QTSM. However, the modi(cid:133)ed QTSM has three important de(cid:133)ciencies. First, the conditional volatilities of the 5and 10-year yields are still not anywhere close to the GARCH-based estimates. Second, the (cid:133)t to yields deteriorates, e.g. the RMSEs for the 1- and 10-year yields increase to 13 and 16 basis points, respectively, compared with just 8 basis points at both maturities in the QTSM estimated using only data on yields. Third, the extracted factors imply estimates of h that x induce explosive factor dynamics under the P measure. Unreported results show that adding a fourth factor to the QTSM does not substantively change these results. Thus, we conclude that the ZLB-consistent QTSM is simply unable to match the (cid:133)rst and second moments of yields simultaneously. 16This version of the SR approach is closely related to the estimator of Andersen, Fusari, and Todorov (2015). 34

Figure 8: Conditional Volatility of Bond Yields in a QTSM with Observed Volatilities This (cid:133)gure plots conditional volatilities ((cid:27) (y )) of the 1-, 5- and 10-year yields in the data and t t+1;j implied by the 3-factor models estimated using data from Jan. 1990(cid:150)Dec. 2016. Conditional volatilities are expressed as annualized percentages. The model-implied conditional volatilities are computed using a (cid:133)rst-order linearization of the relationship between bond yields and the pricing factors, evaluated at the estimated factor values. The conditional volatilities in the data are estimated using a univariate GARCH(1,1) model applied to the change in a given yield. Percent 1-year 1 0.5 0 1990 1995 2000 2005 2010 2015 Percent 5-year 1 0.5 0 1990 1995 2000 2005 2010 2015 Percent 10-year 0.6 0.4 0.2 0 1990 1995 2000 2005 2010 2015 GARCH(1,1) Model VI. Sharpe Ratios and Return Predictability The fact that the SRM and QTSM have di⁄erent implications for conditional volatilities means that they are also likely to have di⁄erent implications for other important aspects of bond yields. In this section, we therefore consider whether the SRM and QTSM have plausible implications for model-implied Sharpe ratios (i.e. the ratio of the (cid:133)rst conditional moment of excess returns to the second) and return predictability (i.e. the ratio of the variance of model-implied expected excess returns to the variance of observed excess returns). Speci(cid:133)cally, we consider whether the models can replicate the three "robust properties" reported by Du⁄ee (2010) for ATSMs estimated using pre-ZLB data. We largely con(cid:133)rm that these robust properties 35

also hold for the SRM and QTSM when yields are away from the ZLB but that the models have di⁄erent implications for model-implied Sharpe ratios as the short rate approaches the ZLB. A. Average Conditional Sharpe Ratios Du⁄ee(cid:146)s (cid:133)rst robust property of ATSMs is an inverse relationship between a bond(cid:146)s average Sharpe ratio and its maturity. The conditional Sharpe ratio on a j-period bond is de(cid:133)ned as E [R R ] t t+1;j t+1;1 (10) S = (cid:0) ; t;j var [R R ] t t+1;j t+1;1 (cid:0) p where R P =P . Following Du⁄ee (2010), we report average population conditional t+1;j t+1;j 1 t;j (cid:17) (cid:0) Sharpe ratios, although we additionally condition on whether the short rate is above or below 1%. To estimate the Sharpe ratios, we simulate a single time series from each of the models until we have at least 1,000 periods in which the short rate is below 1% and 1,000 periods in which it is 1% or higher. For each simulated observation we make 1,000 draws from the model-implied conditional return distributions and compute the conditional Sharpe ratios according to equation (10). Table 3 reports the means of these conditional Sharpe ratios for the 3-factor models when the short rate is above or below 1%. In periods in which the short rate is above 1%, all three models generate an inverse relationship between the average Sharpe ratio and maturity, con(cid:133)rming Du⁄ee(cid:146)s (cid:133)rst robust property of ATSMs also holds in the SRM and QTSM when yields are away from the ZLB. However, the patterns change when the short rate is below 1%. To understand why, consider (cid:133)rst the ATSM. Recall that the conditional variance of returns is constant, so conditional Sharpe ratios only depend on the level of the short rate through its e⁄ect on expected excess returns. On the one hand, the fact that the model is stationary means that the further yields fall below their unconditional mean, the more bond prices tend to be expected to 36

fall, and the lower the expected return tends to be, lowers Sharpe ratios. On the other hand, if the short rate is low, the cost of (cid:133)nancing a long-term bond position is also very low, which increases Sharpe ratios. At short maturities, the former e⁄ect dominates and the Sharpe ratio of a 1-year bond is loewr when the short rate is below 1% than when it is above 1%. However, because the yield curve tends to be steeply upward sloping when the short rate is low, long-term bonds o⁄er abnormally higher expected excess returns over the 1-month rate, and their Sharpe ratio rises slightly. Thus, when the short rate is low we observe a (cid:135)atter average relationship between the Sharpe ratio and maturity in the ATSM. The QTSM and SRM introduce an additional complication, because the conditional variance of returns (i.e. the denominator in equation (10)) also falls as the short rate approaches the ZLB. At long maturities, this compression is on average relatively unimportant, such that the average Sharpe ratio of a 10-year bond is higher when the short rate is below 1% than when it is above 1%, as in the ATSM. At shorter maturities, the e⁄ect on Sharpe ratios depends on the relative speed with which the (cid:133)rst and second conditional moments of returns approach zero. In the QTSM, the volatility compression dominates and the average Sharpe ratio of a 1-year bond is higher when the short rate is below 1%, while the reverse is true in the SRM. Finally, adding a fourth factor to the QTSM raises the conditional Sharpe ratios for short-maturity bonds somewhat, to about 0.3 when the short rate is above 1% and to 0.8 when it is below 1%. In summary, we conclude that Du⁄ee(cid:146)s (cid:133)rst robust property of ATSMs also holds in the SRM and QTSM when the short rate is away from the ZLB, although the patterns do change when the short rate approaches the ZLB and with the addition of a fourth factor to the QTSM. B. Return Predictability Du⁄ee(cid:146)s second robust property of ATSMs is that they imply that 15(cid:150)20% of annual excess bond returns are predictable. The fourth and (cid:133)fth columns of Table 3 therefore report the population ratio of the variance of expected excess returns implied by the 3-factor models to the variance of actual excess returns in our simulated samples, again conditioning on whether the short rate is above or below 1%. 37

Table 3: Sharpe Ratios and the Predictability of Excess Returns This table reports mean conditional Sharpe ratios, the predictable proportion of excess returns at di⁄erent horizons, and the fraction of expected returns at di⁄erent horizons explained by the (cid:133)rst principal component of monthly expected returns at di⁄erent maturities. All results are for models estimated using data from Jan. 1990(cid:150)Dec. 2016. The results are reported separately for periods in which the short rate is below 1% and in which it is 1% or higher. To obtain these model-implied properties, we (cid:133)rst simulate from each of the models until we have at least 1,000 periods in which the short rate is below 1% and 1,000 periods in which it is 1% or higher. For each simulated sample period we then obtain 1,000 draws from the 1-month- and 1-year-ahead conditional excess return distributions to compute various model-implied moments. The (cid:133)rst three columns show the resulting estimates of mean conditional monthly Sharpe ratios. The fourth and (cid:133)fth columns report the predictable portion of excess returns at a monthly and annual horizon, de(cid:133)ned as the ratio of the variance of model-implied expected excess returns on a 10-year bond to the variance of subsequent excess returns. The (cid:133)nal two columns show the R2 from unrestricted linear regressions of monthly and annual expected excess returns on the (cid:133)rst principal component of expected 1-month excess returns on bonds with maturities of 2, 3, 4, 5, 6, 7, 8, 9, and 10 years. All models have 3 factors except QTSM(4), which denotes the 4-factor QTSM. Mean Conditional Predictable Proportion Fraction of Expected Sharpe Ratio of Excess Returns on a Returns Explained by a (Maturity in Years) 10-year Bond "Monthly Return Factor" 1 5 10 Monthly Annual Monthly Annual ATSM r 1% 0:31 0:20 0:16 0:02 0:18 1:00 0:94 t (cid:21) r <1% 0:19 0:21 0:21 0:02 0:24 1:00 0:94 t SRM r 1% 0:25 0:19 0:15 0:02 0:22 1:00 0:92 t (cid:21) r <1% 0:15 0:23 0:20 0:02 0:27 0:99 0:92 t QTSM r 1% 0:26 0:14 0:12 0:03 0:24 0:97 0:88 t (cid:21) r <1% 0:33 0:24 0:21 0:02 0:25 0:97 0:91 t QTSM(4) r 1% 0:32 0:16 0:11 0:03 0:22 0:97 0:81 t (cid:21) r <1% 0:76 0:21 0:14 0:03 0:29 0:97 0:89 t Focusing (cid:133)rst on periods in which the short rate is above 1%, we (cid:133)nd that 2(cid:150)3% of monthly returns on a 10-year bond are predictable in all the models, only slightly lower than the 5% reported by Du⁄ee (2010) for a 3-factor ATSM estimated using pre-ZLB data. For annual excess returns, this proportion increases to 18% in the ATSM, close to the 19% reported by Du⁄ee (2010). The proportions are slightly higher in the QTSM and SRM (but not materially so) at 24% and 22% respectively. When the short rate is below 1%, return predictability rises slightly at an annual horizon in all three models (although the di⁄erences are still not particularly large), whereas return predictability is little changed at a monthly horizon. Finally, adding a fourth factor to the QTSM increases return predictability at an annual horizon a little further, although it remains below 30% irrespective of whether the short rate is above or below 1%. In summary, we interpret these results as evidence that the second robust property of Du⁄ee (2010) also holds 38

approximately in the QTSM and SRM. Finally, Du⁄ee(cid:146)s third robust property is that the variation in excess returns on di⁄erent maturity bonds over a holding period of 1 month are almost exclusively explained by a single factor, but the same factor explains less of the variation in excess results for an annual holding period. Following his approach, we construct a "monthly return factor" by taking the (cid:133)rst principal component of model-implied 1-month expected excess returns on bonds with maturities from 2 to 10 years at 1-year intervals during our simulated sample, again conditioning on whether the short rate is above or below 1%. The results reported in the (cid:133)nal two columns of Table 3 show that we can also con(cid:133)rm the third robust property for all of the models. The monthly return factor explains almost all of the monthly expected excess returns on the 10-year bond, irrespective of whether the short rate is above or below 1%. However, if we regress expected annual excess returns on this factor, then the R2 is somewhat lower, again irrespective of whether the short rate is above or below 1%. VII. Conclusion This article examines the performance of the SRM and QTSM when estimated using U.S. Treasury yields. While the SRM has received more attention in recent empirical work, the two models actually perform remarkably similarly against a number of criteria. While the SRM achieves marginally superior performance in some respects, the small and generally statistically-insigni(cid:133)cant di⁄erences between the models do not appear su¢ ciently large to support the clear preference that has recently emerged in the literature in favor of the SRM, particularly if we add a fourth factor to the QTSM and take into account its greater computational tractability. Perhaps more noteworthy than the modest di⁄erences between the models are their common failings. Although both the SRM and the QTSM outperform the ATSM when yields are close to the ZLB, neither of the two ZLB-consistent models appear to fully capture the change in the yield dynamics at the ZLB. The problem seems to be that the time-series dynamics of yields during the ZLB period change, and simply modifying the functional form of 39

the short rate does not fully capture this change. A potential solution to this problem might be to modify the factor dynamics of the P measure as the short rate approaches the ZLB. In addition, neither of the models has the (cid:135)exibility to provide a good description of conditional volatilities of yields when yields are away from the ZLB, suggesting that it may be necessary to incorporate some form of unspanned stochastic volatility into the models. We leave these and other extensions of the models to future research. 40

References Adrian, T., R. K. Crump, and E. Moench. (cid:147)Pricing the Term Structure with Linear Regressions.(cid:148)Journal of Financial Economics, 110 (2013), 110(cid:150)138. Ahn, D.-H., R. F. Dittmar, and A. R. Gallant. (cid:147)Quadratic Term Structure Models: Theory and Evidence.(cid:148)Review of Financial Studies, 15 (2002), 243(cid:150)288. Andersen, T. G., N. Fusari, and V. Todorov. (cid:147)Parametric Inference and Dynamic State Recovery from Option Panels.(cid:148)Econometrica, 83 (2015), 1081(cid:150)1145. Andreasen, M. M. and B. J. Christensen. (cid:147)The SR Approach: A New Estimation Procedure for Non-Linear and Non-Gaussian Dynamic Term Structure Models.(cid:148)Journal of Econometrics, 184 (2015), 420(cid:150)451. Bauer, M. D. and G. D. Rudebusch. (cid:147)Monetary Policy Expectations at the Zero Lower Bound.(cid:148) Journal of Money, Credit and Banking, 48 (2016), 1440(cid:150)1465. Bauer, M. D., G. D. Rudebusch, and J. C. Wu. (cid:147)Correcting Estimation Bias in Dynamic Term Structure Models.(cid:148)Journal of Business and Economic Statistics, 30 (2012), 454(cid:150)467. Black, F. (cid:147)Interest Rates as Options.(cid:148)Journal of Finance, 50 (1995), 1371(cid:150)1376. Bollerslev, T. (cid:147)Generalized autoregressive conditional heteroskedasticity.(cid:148)Journal of Econometrics, 31 (1986), 307(cid:150)327. Campbell, J. Y. and R. J. Shiller. (cid:147)Yield Spread and Interest Rate Movements: A Bird(cid:146)s Eye View.(cid:148)Review of Economic Studies, 58 (1991), 495(cid:150)514. Christensen, J. H. E., F. X. Diebold, and G. D. Rudebusch. (cid:147)The A¢ ne Arbitrage-Free Class of Nelson-Siegel Term Structure Models.(cid:148)Journal of Econometrics, 164 (2011), 4(cid:150)20. Christensen, J. H. E. and G. D. Rudebusch. (cid:147)Estimating Shadow-Rate Term Structure Models with Near-Zero Yields.(cid:148)Journal of Financial Econometrics, 13 (2015), 226(cid:150)259. 41

Cochrane, J. H. and M. Piazzesi. (cid:147)Decomposing the yield curve.(cid:148)Working Paper, University of Chicago (2008). Cox, J. C., J. E. Ingersoll, and S. A. Ross. (cid:147)A Theory of the Term Structure of Interest Rates.(cid:148) Econometrica, 53 (1985), 385(cid:150)407. Dai, Q. and K. J. Singleton. (cid:147)Speci(cid:133)cation Analysis of A¢ ne Term Structure Models.(cid:148)Journal of Finance, 55 (2000), 1946(cid:150)1978. Dai, Q. and K. J. Singleton. (cid:147)Expectation Puzzles, Time-Varying Risk Premia and A¢ ne Models of the Term Structure.(cid:148)Journal of Financial Economics, 63 (2002), 415(cid:150)441. Diebold, F. X. and C. Li. (cid:147)Forecasting the Term Structure of Government Bond Yields.(cid:148) Journal of Econometrics, 130 (2006), 337(cid:150)364. Du⁄ee, G. R. (cid:147)Term Premia and Interest Rate Forecasts in A¢ ne Models.(cid:148)Journal of Finance, 57 (2002), 405(cid:150)443. Du⁄ee, G. R. (cid:147)Sharpe Ratios in Term Structure Models.(cid:148)Working Paper, Johns Hopkins University (2010). Du⁄ee, G. R. (cid:147)Forecasting with the Term Structure: The Role of No-Arbitrage Restrictions.(cid:148) Working Paper, Johns Hopkins University (2011a). Du⁄ee, G. R. (cid:147)Information in (and not in) the Term Structure.(cid:148)Review of Financial Studies, 24 (2011b), 2895(cid:150)2934. Fama, E. F. and R. R. Bliss. (cid:147)The Information in Long-Maturity Forward Rates.(cid:148)American Economic Review, 77 (1987), 680(cid:150)692. Feunou, B., J.-S. Fontaine, A. Le, and C. Lundblad. (cid:147)Tractable Term-Structure Models and the Zero Lower Bound.(cid:148)Working Paper, Bank of Canada (2015). Filipovic, D., M. Larsson, and A. B. Trolle. (cid:147)Linear-Rational Term Structure Models.(cid:148)Journal of Finance, 72 (2017), 655(cid:150)704. 42

Joslin, S., K. J. Singleton, and H. Zhu. (cid:147)A New Perspective on Gaussian Dynamic Term Structure Models.(cid:148)Review of Financial Studies, 24 (2011), 926(cid:150)970. Kim, D. H. (cid:147)Spanned stochastic volatility in bond markets: A reexamination of the relative pricing between bonds and bond options.(cid:148)Working Paper, BIS (2007). Kim, D. H. and K. J. Singleton. (cid:147)Term Structure Models and the Zero Bound: An Empirical Investigation of Japanese Yields.(cid:148)Journal of Econometrics, 170 (2012), 32(cid:150)49. Leippold, M. and L. Wu. (cid:147)Asset Pricing under the Quadratic Class.(cid:148)Journal of Financial and Quantitative Analysis, 37 (2002), 271(cid:150)295. Mincer, J. A. and V. Zarnowitz. The Evaluation of Economic Forecasts, NBER, chap. 1, 3(cid:150)46 (1969). Monfort, A., F. Pegoraro, J.-P. Renne, and G. Roussellet. (cid:147)Staying at Zero with A¢ ne Processes: A New Dynamic Term Structure Model.(cid:148)Journal of Econometrics, 201 (2017), 348(cid:150)366. Priebsch, M. A. (cid:147)Computing Arbitrage-Free Yields in Multi-factor Gaussian Shadow-Rate Term Structure Models.(cid:148)Finance and Economics Discussion Series, Federal Reserve Board (2013). Realdon, M. (cid:147)Quadratic Term Structure Models in Discrete Time.(cid:148)Finance Research Letters, 3 (2006), 277(cid:150)289. Realdon, M. (cid:147)Gaussian Models for Euro High Grade Government Yields.(cid:148)European Journal of Finance, 23 (2016), 1(cid:150)44. Rudebusch, G. D. and T. Wu. (cid:147)Accounting for a Shift in Term Structure Behavior with No-Arbitrage and Macro-Finance Models.(cid:148)Journal of Money, Credit and Banking, 39 (2007), 395(cid:150)422. Wu, J. C. and F. D. Xia. (cid:147)Measuring the Macroeconomic Impact of Monetary Policy at the Zero Lower Bound.(cid:148)Journal of Money Credit and Banking, 48 (2016), 253(cid:150)291. 43

Cite this document

APA

Martin M. Andreasen and Andrew Meldrum (2018). A Shadow Rate or a Quadratic Policy Rule? The Best Way to Enforce the Zero Lower Bound in the United States (FEDS 2018-056). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2018-056

BibTeX

@techreport{wtfs_feds_2018_056,
  author = {Martin M. Andreasen and Andrew Meldrum},
  title = {A Shadow Rate or a Quadratic Policy Rule? The Best Way to Enforce the Zero Lower Bound in the United States},
  type = {Finance and Economics Discussion Series},
  number = {2018-056},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2018},
  url = {https://whenthefedspeaks.com/doc/feds_2018-056},
  abstract = {We study whether it is better to enforce the zero lower bound (ZLB) in models of U.S. Treasury yields using a shadow rate model or a quadratic term structure model. We show that the models achieve a similar in-sample fit and perform comparably in matching conditional expectations of future yields. However, when the recent ZLB period is included in the sample, the models' ability to match conditional expectations away from the ZLB deteriorates because the time-series dynamics of the pricing factors change. In addition, neither model provides a reasonable description of conditional volatilities when yields are away from the ZLB. Accessible materials (.zip)},
}