feds · August 31, 2014

What Drives the Cross-Section of Credit Spreads?: A Variance Decomposition Approach

Abstract

I decompose the cross-sectional variation of the credit spreads for corporate bonds into changing expected returns and changing expectation of credit losses with a model-free method. Using a log-linearized pricing identity and a vector autoregression applied to micro-level data from 1973 to 2011, I find that the expected credit loss component and the excess return component each explains about half of the variance of the credit spreads. Unlike the market-level findings in Gilchrist and Zakrajsek (2012), at the firm level, the expected credit loss is volatile and affects the firms' investment decision more than the expected excess returns.

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. What Drives the Cross-Section of Credit Spreads?: A Variance Decomposition Approach Yoshio Nozawa 2014-62 NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

What Drives the Cross-Section of Credit Spreads?: A Variance Decomposition Approach (cid:3) Yoshio Nozawa y August 2, 2014 Abstract I decompose the cross-sectional variation of the credit spreads for corporate bonds into changing expected returns and changing expectation of credit losses with a modelfreemethod. Usingalog-linearizedpricingidentityandavectorautoregressionapplied to micro-level data from 1973 to 2011, I (cid:133)nd that the expected credit loss component and the excess return component each explains about half of the variance of the credit spreads. Unlike the market-level (cid:133)ndings in Gilchrist and Zakraj(cid:154)ek (2012), at the (cid:133)rm level, the expected credit loss is volatile and a⁄ects the (cid:133)rms(cid:146)investment decision more than the expected excess returns. I am grateful for comments and suggestions from John Cochrane, Simon Gilchrist, Lars Hansen, Don (cid:3) Kim, Arvind Krishnamurthy, Marcelo Ochoa, Bin Wei and participants in workshops at Chicago Booth, Chicago Fed, FRB and New York Fed. The views expressed herein are the author(cid:146)s and do not necessarily re(cid:135)ect those of the Board of Governors of the Federal Reserve System. Division of Monetary A⁄airs, Board of Governors of the Federal Reserve System, Mail Stop 165, 20th y Street and Constitution Avenue NW, Washington, DC 20551. Email: yoshio.nozawa@frb.gov. 1

1 Introduction What drives the cross-sectional variation in credit spreads? Credit spreads are higher when the issuer of a corporate bond faces a higher risk of default and when the rate at which the corporate bond(cid:146)s cash (cid:135)ows are discounted rises. Since the expected default and expected returns are unobservable, past research often relies on structural models of debt, such as Merton (1974) model, to decompose credit spreads. However, there is little agreement on the best measures of expected default loss and expected returns. In this article, I take advantage of a large panel dataset of the US corporate bond prices and estimate the market expectation without relying on a particular model of default. Based on these model-free estimates, I quantify the contributions from the default component and the discount rate component by decomposing the variance of the credit spreads. I apply the variance decomposition approach of Campbell and Shiller (1988a and 1988b) to the credit spread. In the decomposition, the credit spread plays the role of the dividendpriceratioforstocks, whilecreditlossplaystheroleofdividendgrowth. Thisdecomposition framework relates the current credit spread to the sum of expected excess returns and credit losses over the long run. This relationship implies that, if the credit spread varies, then either long-run expected excess returns or long-run expected credit loss must vary. The log-linear identity that expresses credit spreads as a linear function of log excess returns and credit loss allows me to infer the long-run expected excess returns and the expected credit loss from monthly VARs. Thus, I do not have to take a stand on how (cid:133)rms make their decisions about their capital structure and defaults, or on what factors drive the (cid:133)rm value. I estimate a VAR involving credit spreads, excess returns, and distance to default of the corporate bonds. Since default occurs infrequently, estimating the expected credit loss and expected returns by running forecasting regressions requires a large dataset. To overcome this issue, I collect corporate bond prices from the Lehman Brothers Fixed Income Database, the Mergent FISD/NAIC Database, TRACE and DataStream, which cover most 2

of the publicly traded corporate bonds from 1973 to 2011. In addition, I use Moody(cid:146)s Default Risk Service to make sure that the price observations upon default are complete, and thus my credit loss measure does not miss bond defaults that occur during the period. Based on the estimated VAR, I compute the ratio of volatility of the implied long-run expected credit loss to the volatility of credit spreads and (cid:133)nd that the ratio is 0.51. In the world where the credit spreads are driven solely by the expected default, the volatility ratio for the expected credit loss would be one. In the data, I (cid:133)nd that the estimated volatility ratio of 0.51 is 3.5 standard errors from one. Instead, about half of the volatility of credit spreads comes from changing expected excess returns. The (cid:133)rst main empirical (cid:133)nding of the paper is that the expected excess return component is about as volatile as the cash (cid:135)ow component for corporate bonds. Since the distribution of credit loss is highly skewed, there is a concern about the nonlinearity in the relationship between credit loss and credit spreads. To address the issue of nonlinearity, I estimate the VAR rating-by-rating and allow the slope coe¢ cients to di⁄er across credit ratings. This subsample analysis shows that 97 percent of the variation in credit spreads within investment grade bonds is associated with the variation in risk premium. In contrast, for the subsample of junk bonds, only 31 percent of the credit spread variationcorrespondstothediscountratevariation, andtherestisduetotheexpectedcredit loss. Thus, the relationship between credit loss and credit spreads is nonlinear. However, once I aggregate the rating-based subsample estimates for expected credit loss and compute the volatility ratios using all the bonds, the volatility of expected credit loss is still similar to the volatility of the expected excess returns. Therefore, the (cid:133)rst main result remains mostly intact after accounting for nonlinearity. I run additional VARs with di⁄erent times to maturity, multiple lags and additional variables such as leverage and equity volatilities, and (cid:133)nd that the estimated volatility ratio does not change signi(cid:133)cantly from the basic VAR speci(cid:133)cation. These results are robust to 3

the approximation errors, small sample biases, the state tax e⁄ect, the e⁄ect of call option premium and the inclusion of industry (cid:133)xed e⁄ects. In addition, the return forecasting regressions perform reasonably well out of sample. The second main result of the paper is the investment forecasting regressions based on the two components of the credit spreads identi(cid:133)ed in the previous analysis. Using the (cid:133)rm-level panel data, I regress the ratio of investment to capital this year on the expected credit loss and excess returns at the end of the previous year. When the expected credit loss and excess returns are used separately as regressors, both components forecast a decrease in the investment rates. However, when I include both components in a multivariate regression controlling for other determinants of investment, only the expected credit loss predicts investments negatively, and the expected excess returns are statistically insigni(cid:133)cant. These results are in stark contrast with the (cid:133)ndings of Gilchrist and Zakraj(cid:154)ek (2012), who (cid:133)nd that at the aggregate market level, the risk premium component is more important than the default risk component in forecasting macroeconomic activities. Decomposing credit spreads is important for at least three reasons: First, the variation in thecreditspreadsislarge,andlinkedtotheshockstothe(cid:133)rm-levelandmacro-leveleconomic activities. As shown by Philippon (2009), Gilchrist, Yankov and Zakraj(cid:154)ek (2009), and Gilchrist, Sim and Zakraj(cid:154)ek (2013), at the aggregate economy level, credit spreads forecast economic fundamentals, such as output, consumption, in(cid:135)ation and investments, above and beyond stock prices do. The variation in credit spreads was even more prominent during the (cid:133)nancial crisis in 2008. The di⁄erence in yields between Baa and Aaa corporate bonds rose to 260 basis points in October 2008, more than three times as high as the yield di⁄erence a year earlier. Due to such a large variation in spreads both over time and across bonds, understanding why the spreads are changing at the individual bond level and market level is important. In this paper, I show that the drivers for the market-wide variation in credit spreads are quite di⁄erent from the drivers of individual bonds. The di⁄erence arises due to diversi(cid:133)cation e⁄ects: The default shocks are more idiosyncratic than the expected return 4

shocks, and thus the expected credit loss component is more important at the individual bond level than at the aggregate market level. Second, understanding the information in credit spreads is important for a dynamic portfolio choice problem, since part of the variation in credit spreads signals the variation in expected returns. The decomposition is also important for credit risk management, as one might use credit spreads to measure default risk. My analysis shows that credit spreads forecast both excess returns and default in the future, and thus provide a useful signal for portfolio management. Third, the examination of the contribution of variation in expected returns on corporate bond prices serves as an out-of-sample test for the excess volatility found in stock prices (e.g., Campbell and Shiller, (1988a and 1998b), Campbell, (1991), Vuolteenaho, (2002), and Cochrane, (2008 and 2011)). In recent studies, Chen (2009) and Chen, Da and Zhao (2013) challenge the previous equity decomposition results by adopting di⁄erent measures of cash (cid:135)ownewsanddiscountratenews. Thus, anout-of-sampletestofthevariancedecomposition using the securities closest to stocks, namely corporate bonds, contributes to the discussion of stock price volatility. In fact, I (cid:133)nd that the variance decomposition results for junk bonds are reasonably close to those for stocks in Vuolteenaho (2002). Related Literature The papers closest to mine are Bongaerts (2010) and Elton, Gruber, Agrawal and Mann (2001). The idea of applying a variance decomposition approach to corporate bonds starts in Bongaerts (2010), who decomposes variance of the returns on the corporate bond indices. This article is a complement to Bongaerts (2010), as I use micro-level data to study the bond level variation of credit spreads, and decompose the returns on corporate bonds in excess of matching Treasury bond returns to remove shocks to Treasury yield curves. Elton, Gruber, Agrawal and Mann (2001) explain the level of the average credit spreads for AA, A and BBB bonds based on the average probability of default and loss given default. In 5

contrast, this article decomposes the variance of credit spreads allowing for the time-varying probability of default and risk premia. By studying the variance of the credit spreads, I show a link in movements between the di⁄erent components of the credit spreads and the issuers(cid:146)investment. By examining credit spreads, rather than returns, this article relates to the literature which tries to decompose and explain credit spreads on corporate bonds. Collin-Dufresne, Goldstein and Martin (2001) show that changes in credit spreads cannot be explained by changes in the inputs to the Merton (1974) model, such as leverage and volatility. In addition, numerous papers attempt to explain the credit spread using structural models of debt (e.g., Leland (1994), Collin-Dufresne and Goldstein (2001), Chen, Collin-Dufresne and Goldstein (2009), Bharmra, Kuehn and Strebulaev (2010), Chen (2010), and Huang and Huang (2012)), reduced-form models (Du⁄ee (1999) and Driessen (2005)) or the credit default swap spreads (Longsta⁄, Mithal and Neis (2005)). This article di⁄ers from the literature as I do not rely on particular models of default. Instead, by applying Campbell- Shiller (1988a) style decomposition to the credit spread, I estimate the expected credit loss and risk premium components via VARs. Finally, this paper adds to the literature which studies the information content in the price ratios of a variety of assets. For Treasury bonds, Fama and Bliss (1988) and Cochrane and Piazzezi (2005) (cid:133)nd that forward rates forecast bond returns, not future short rates. For foreign exchange, Hansen and Hodrick (1980), Fama (1984), and Lustig and Verdelhan (2007) show that uncovered interest rate parity does not hold in the data. The di⁄erence in interest rates between a home country and a foreign country forecasts returns on the foreign currency, instead of changes in the exchange rate. Beber (2006), McAndrews (2008), Taylor (2009)andSchwartz(2013)decomposetheyieldspreadsinthesovereignandmoneymarkets. This article complements the literature by quantifying the variation in risk premia in the corporate bond market. 6

The rest of the article is organized as follows: Section 2 shows the decomposition of the credit spread of corporate bonds. I describe the data and show the empirical results in Section 3. Section 4 examines how the risk premium and expected credit loss a⁄ect (cid:133)rms(cid:146) investment decisions, and Section 5 provides concluding remarks. 2 Decomposition of the Credit Spreads of Corporate Bonds 2.1 A Simple Example To illustrate the idea, I start with the simple case of a one-period zero-coupon corporate bond. Suppose there is a one-period corporate bond and a Treasury bond whose face values are normalized to one. At time 0, I observe the log price of the corporate bond, p , and 0 the log price of the Treasury bond, pf. Let s(cid:28) be the time 0 credit spread, de(cid:133)ned as 0 0 s(cid:28) pf p . At time 1, the corporate bond either matures or defaults. The negative log 0 (cid:17) 0 (cid:0) 0 payo⁄from the corporate bond at time 1 is given by l > 0 if defaults, l = 1 8 > 0 otherwise, < > : while the log payo⁄ from the Treasury bond is always zero. Then, the log returns on the corporate bond and the Treasury bond are r = l p , 1 1 0 (cid:0) (cid:0) rf = pf: 1 (cid:0) 0 7

Now let us de(cid:133)ne a log excess return on the corporate bond as re r rf = l +s(cid:28) . (1) 1 (cid:17) 1 (cid:0) 1 (cid:0) 1 0 Rearranging the terms in (1), I obtain s(cid:28) = re +l . (2) 0 1 1 Equation (2) is only a de(cid:133)nition of a log excess return and has no economic content. However, this equation provides a useful framework to study the information in the credit spread of a corporate bond. From (2), we can see a simple rule: If the current credit spread is higher, then either the excess return or default loss in the next period must be higher. If s(cid:28) varies, either over time or across securities, then s(cid:28) must forecast either excess returns 0 0 or defaults. As (2) holds for any realization of random variables at time 1, the equality also holds under the time 0 conditional expectation: s(cid:28) = E[re ]+E[l ], (3) 0 1jF 0 1 jF 0 where is the economic agent(cid:146)s information set. This identity under conditional expec- 0 F tation implies that we can decompose the variation of s(cid:28) into the expected excess return 0 component and the expected default component. With equation (3), we can quantify how much of the variation of s(cid:28) comes from the risk premium or expected defaults. 0 I decompose the credit spread using forecasting regressions. If one regresses re and l on 1 1 a set of variables in time 0 information set, the (cid:133)tted values of re and l are the conditional 1 1 expectation. This regression-based approach does not rely on a particular model of default. Instead, this methodology uses weighted averages of realized returns and credit losses to estimate the conditional expectation. Combining regression estimates with the identity (3), 8

I can cleanly separate the risk premium component of the credit spread from the expected default component without leaving unexplained residuals. In estimating the expected credit loss, I do not rely on the probability of default estimated by the rating agencies, which is held constant over time. Instead, I allow the expected credit loss and risk premium to vary over time. The other way to decompose the credit spread is to directly model E[l ]. For 1 0 jF example, based on the Merton (1974) model, we can estimate the expected default loss by the model-implied probability of default, multiplied by the loss given default. Once we have E[l ], we can back out E[re ] by s(cid:28) E[l ]. However, if the model is 1 jF 0 1jF 0 0 (cid:0) 1 jF 0 misspeci(cid:133)ed, this model-based approach produces a biased decomposition. Thus, in this article, I adopt a model-free approach to decompose credit spreads. Unlike this simple example, most corporate bonds have a long time to maturity and pay coupons. Furthermore, their expected excess returns and expected defaults may vary over time. Therefore, in the subsection that follows, I develop a more general framework that relates the credit spread of corporate bonds to their excess returns and credit losses under a multi-period setup. 2.2 Log-linear Approximation of Bond Excess Returns I log-linearize excess returns on a corporate bond to obtain a linear relationship among log excess returns, credit spreads and credit loss. I consider the strategy where an investor buys and holds an individual corporate bond i until it matures or defaults. If the bond defaults, the investor sells the defaulted bond and buys the Treasury bond with the same coupon rate and remaining time to maturity as the defaulting bond. Let P be the price per one dollar face value for corporate bond i at time t including i;t accrued interest, and C be the coupon rate. Then, the return on the bond is i;t 9

P +C i;t+1 i;t+1 R = : i;t+1 P i;t Suppose that there is a matching Treasury bond for corporate bond i, such that the matching Treasury bond has an identical coupon rate and repayment schedule as corporate bond i. Let Pf and Cf be the price and coupon rate for such a Treasury bond. Then, the i;t i;t return on the matching Treasury bond is Pf +Cf Rf = i;t+1 i;t+1 . i;t+1 Pf i;t As I do not have the data for the loss upon default for coupon payments, I assume that therateofcreditloss(de(cid:133)nedbelow)forthecouponsisthesameastheratefortheprincipal. Under this assumption, following Campbell and Shiller (1988a), I log-linearize both R i;t+1 and Rf using the same expansion point, (cid:26) (0;1). i;t+1 2 The log return on corporate bond i, in excess of the log return on the matching Treasury bond, can then be approximated as re logR logRf (cid:26)s(cid:28) +s(cid:28) l +const; (4) i;t+1 (cid:17) i;t+1 (cid:0) i;t+1 (cid:25) (cid:0) i;t+1 i;t (cid:0) i;t+1 where Pf log i;t if t < t ; s(cid:28) Pi;t D (5) i;t 8 (cid:17) > 0 otherwise. < > Pf : log i;t if t = t ; l Pi;t D (6) i;t 8 (cid:17) > 0 otherwise, < > : where t is the time of default. The variable s(cid:28) measures credit spreads while l measures D i;t i;t thecredit loss upondefault. Equation(4) impliesthattheexcess returnoncorporatebondi 10

is low due to either widening credit spreads or defaults. In Appendix A, I show the detailed derivation of (4). The credit spread measure, s(cid:28) , is the price spreads rather than the yield spreads. i;t The price spreads have important advantages over the yield spreads: The price spread has a de(cid:133)nitionbasedonthesimpleformula, whileyieldspreadscanonlybecomputednumerically for a bond that pays coupons. As s(cid:28) has a simple analytic form, s(cid:28) can be approximated i;t i;t using a linear function of s(cid:28) ;re and l in (4) without inducing large approximation i;t+1 i;t+1 i;t+1 errors. In contrast, yield spreads for coupon bearing bonds can only be de(cid:133)ned implicitly and computed numerically, which makes it hard to express the bond returns using a linear functionofyieldspreads. However, thepricespread, s(cid:28) , iscloselyrelatedtothecommonly i;t used yield spread. For coupon bearing bonds, a price change can be approximated by a change in yields multiplied by duration. The average cross-sectional correlation between the price spreads and the yield spreads in my sample is 0.82, while the correlation between the price spreads and the yield spreads times duration is 0.97. Thus, both spreads are, conceptually and empirically, closely tied together, and the analysis on the price spreads is useful in understanding the information content in the yield spreads. The credit loss measure, l , encodes the information about both the incidence of default i;t and the loss given default. The loss given default is measured using the market price of the corporate bonds upon default. As such, this measure of loss given default is the loss for an investor who invests in corporate bonds. This measure of credit loss is consistent with the way in which Moody(cid:146)s estimates the loss given default1, which is widely used in pricing credit derivatives. However, my measure of credit loss, l , is not the loss accrued to the i;t economy due to default, as I do not use the ultimate recovery after the bankruptcy court settlements. 1For example, Moody(cid:146)s (1999) reports "One methodology for calculating recovery rates would track all paymentsmadeonadefaulteddebtinstrument,discountthembacktothedateofdefault,andpresentthem as a percentage of the par value of the security. However, this methodology, while not infeasible, presents a number of calculation problems and relies on a variety of assumptions.... For there reasons, we use the trading price of the defaulted instrument as a proxy for the present value of the ultimate recovery." 11

To determine if a bond is in default, I follow Moody(cid:146)s (2011) de(cid:133)nition of defaults. A bondisindefaultifthereis(a)missedordelayedrepayments, (b)abankruptcy(cid:133)lingorlegal receivership that will likely cause a miss or delay in repayments, (c) a distressed exchange or (d) a change in payment terms that results in a diminished (cid:133)nancial obligations for the borrower. My de(cid:133)nition does not include so-called technical defaults, such as temporary violations of the covenants regarding (cid:133)nancial ratios, and slightly delayed payments due to technical or administrative errors. In the empirical work below, I set (cid:26) = exp(pc)=(1+exp(pc)), where pc is equal to the sample mean of logP =C . Speci(cid:133)cally, I use (cid:26) = 0:993. The di⁄erence equation (4) i;t i;t approximates log excess returns using the (cid:133)rst-order Taylor expansion. I show below that the approximation error is small and does not a⁄ect my empirical results. None of the variables on the right-hand side of (4) depend on the coupon payments. Since the corporate bond and the Treasury bond have the same coupon rates, the coupons cancel with each other. As a result, there is no seasonality in these variables, enabling one to use monthly returns for the decomposition. Chen (2009) points out that the use of annual horizon makes it necessary to make an assumption about how the cash (cid:135)ows paid out in the middle of a year are reinvested by investors, and the variance decomposition results are sensitive to such assumptions. Since the coupon payments from the corporate bond and the Treasury bond o⁄set with each other, the variance decomposition in this article does not rely on the assumption about cash (cid:135)ow reinvestments. Now I iterate the di⁄erence equation forward up to the maturity of the bond, T. That is, T t T t (cid:0) (cid:0) s(cid:28) (cid:26)j 1re + (cid:26)j 1l +const: (7) i;t (cid:25) (cid:0) i;t+j (cid:0) i;t+j j=1 j=1 X X If the bond defaults at t < T, the investor adjusts the position such that re = l = 0 D i;t i;t for t > t . Therefore, I can still iterate the di⁄erence equation forward up to T with no D consequences. 12

The equation (7) shows that the credit spread of corporate bonds has a discount rate component and a credit loss component. The basic idea behind this decomposition is the same as that behind the decomposition of the price-dividend ratio for a stock. Since corporate bonds have (cid:133)xed cash (cid:135)ows, the only source of shocks to cash (cid:135)ows is credit loss. Thus, the term l plays a role analogous to dividend growth for equities. In the case of i;t corporate bonds, however, we have s(cid:28) = 0 by construction. As a result, I do not have to i;T impose the condition in which (cid:26)js(cid:28) tends to zero, as j goes to in(cid:133)nity. i;t+j Since (7) holds path-by-path, the approximate equality holds under expectation. Taking the time t conditional expectation of the both sides of (7), we have T t T t (cid:0) (cid:0) s(cid:28) E (cid:26)j 1re +E (cid:26)j 1l +const; i;t (cid:25) " (cid:0) i;t+j (cid:12)F t # " (cid:0) i;t+j (cid:12)F t # X j=1 (cid:12) X j=1 (cid:12) (cid:12) (cid:12) (cid:12) (cid:12) (cid:12) (cid:12) where is the information set of economic agents. t F Let us de(cid:133)ne the expected credit loss as T t (cid:0) s(cid:28)l E (cid:26)j 1l . i;t (cid:17) " (cid:0) i;t+j (cid:12)F t # X j=1 (cid:12) (cid:12) (cid:12) (cid:12) We can then measure how much the volatility of s(cid:28) corresponds to the volatility of i;t the expected credit loss by the ratio (cid:27) s(cid:28)l =(cid:27)(s(cid:28) ). To evaluate the magnitude of i;t i;t (cid:27) s(cid:28)l =(cid:27)(s(cid:28) ), it is useful to set a ben(cid:0)chm(cid:1)ark case, in which all volatility in the credit i;t i;t sp(cid:0)read(cid:1)is associated with the expected credit loss. De(cid:133)nition. The expected credit loss hypothesis holds if a change in the credit spread only re(cid:135)ects the news about the expected credit loss. That is, s(cid:28) = s(cid:28)l +const; i;t i;t holds. 13

Under the expected credit loss hypothesis, (cid:27) s(cid:28)l =(cid:27)(s(cid:28) ) = 1 holds. Therefore, using i;t i;t the hypothesis as a benchmark, we can ask how(cid:0)far (cid:1)from one the estimated volatility ratio is. The expected credit loss hypothesis also implies that s(cid:28)r E T t(cid:26)j 1re is i;t (cid:17) j= (cid:0) 1 (cid:0) i;t+j F t h (cid:12) i a constant. Under this hypothesis, the long-run excess returns on coPrporate bonds a(cid:12)re not (cid:12) forecastable. The expected credit loss hypothesis is the corporate bond counterpart of the expectation hypothesis for interest rates and of uncovered interest rate parity for foreign exchange rates. These hypotheses share the same basic idea that the current scaled price should re(cid:135)ect the future fundamentals in an unbiased way. If these hypotheses fail, either due to time-varying risk premia or irrational expectations, then the excess returns are forecastable using the scaled price. 2.3 Estimation by a VAR I set up the empirical framework to measure the volatility ratio, based on a VAR. To focus on the cross-sectional variation, I subtract the cross-sectional mean at time t from the state variables, and denote them with tilde. In the basic setup, I use a vector of state variables, 0 X = re s(cid:28) (cid:28) z , i;t i;t i;t i;t i;t (cid:18) (cid:19) e e g where (cid:28) is the bond(cid:146)s duration and z is a vector of state variables other than re and i;t i;t i;t s(cid:28) . i;t e e The dynamics of the state variables is given by X = AX +BW . i;t+1 i;t i;t+1 The VAR coe¢ cient matrices A and B are assumed to be constant, both over time and across bonds. This VAR speci(cid:133)cation implies that ex-ante, a bond is expected to behave 14

similarly to other bonds with the same values of the state variables. I also assume that W i;t is independent over time but can be correlated across bonds. Since many structural models of debt (e.g., Merton (1974)) or reduced form models (e.g., Du¢ eandSingleton(1999))implythattheexpectedreturnsandtheriskofacorporatebond depend on its time to maturity, the state variables z are scaled by the bond(cid:146)s duration. i;t The price spread, s(cid:28) , has a convenient feature in that it tends to shrink with its duration: i;t Since a price spread is roughly equal to a yield spread times the bond(cid:146)s duration, holding e yield spreads constant, s(cid:28) tends to zero as the bond approaches maturity. Thus, we do not i;t have to scale s(cid:28) with duration. Although I do not scale re with duration, I add another i;t e i;t variable (cid:28) re later as a robustness check, and show that adding (cid:28) re in the VAR does not i;t i;te e i;t i;t change the results. g g Let e ;i = 1;2 be unit vectors whose i th entry is one while the other entries are zero. i (cid:0) Then, the long-run expected loss implied by the VAR is T t (cid:0) E (cid:26)j 1l X = e GX ; (8) (cid:0) t+j i;t L i;t " (cid:12) # X j=1 (cid:12) (cid:12) e (cid:12) (cid:12) where G A(I (cid:26)A) 1 I ((cid:26)A)T t and e = (cid:26)e +e A 1 e . To obtain (8), I use (cid:0) (cid:0) L 2 2 (cid:0) 1 (cid:17) (cid:0) (cid:0) (cid:0) (cid:0) (cid:16) (cid:17) the one-period identity in (4). Solving for l and taking the conditional expectation, we i;t+1 have e E l X = E[ (cid:26)e X +e X e X X ]; i;t+j i;t 2 i;t+j 2 i;t+j 1 1 i;t+j i;t (cid:0) (cid:0) (cid:0) j h (cid:12) i e (cid:12) (cid:12) = e L AjX i;t . Plugging E l X into E (cid:26)j 1l X yields (8). i;t+j i;t (cid:0) i;t+j i;t h (cid:12) i h (cid:12) i (cid:12) P (cid:12) In order toediagnose the estimated voelatility ratio, it is also useful to consider the implied (cid:12) (cid:12) 15

long-run return forecasting regressions: T t (cid:0) E (cid:26)j 1re X = e GX : (cid:0) i;t+j i;t 1 i;t " (cid:12) # X j=1 (cid:12) (cid:12) e (cid:12) (cid:12) Then, by identity (7), e G+e G = 0 1 0 ::: 0 (9) 1 L (cid:18) (cid:19) holds. Moreover, the expected credit loss hypothesis implies e G = 0 0 0 ::: 0 ; 1 (cid:18) (cid:19) e G = 0 1 0 ::: 0 ; L (cid:18) (cid:19) musthold. Iftheestimatedvolatilityratioisnotone,wecanexaminethelong-runregression coe¢ cients to identify the source of the deviation. For statistical inference, I compute the standard errors of the VAR-implied long-run coe¢ cients and volatility ratios by the delta method. To this end, I numerically calculate the derivative of the long-run coe¢ cients and volatility ratios with respect to the VAR parameters. 3 Empirical Results 3.1 Data I construct the panel data of corporate bond prices from the Lehman Brothers Fixed Income Database, the Mergent FISD/NAIC Database, TRACE and DataStream. Appendix B provides a detailed description of these databases. When there are overlaps among the four databases, I prioritize in the following order: the Lehman Brothers Fixed Income Database, TRACE, Mergent FISD/NAIC and DataStream. I check whether the main result is robust 16

to the change in orders in Appendix B. If the observation is missing in the databases above, I use Moody(cid:146)s Default Risk Service to complement the price upon default. CRSP and Compustat provide the stock prices and accounting information. I remove bonds with (cid:135)oating rates and with option features other than callable bonds. Untilthelate1980s, veryfewbondswerenoncallable. Thus, removingcallablebondswould signi(cid:133)cantly reduce the length of the sample period, and for this reason I include callable bonds in my sample. As the callable bond price re(cid:135)ects the discount due to the call option value, the yields on these bonds are not exactly comparable to the yields on non callable bonds. Crabbe (1991) estimates that call options contribute nine basis points to the bond spread, on average, for investment grade bonds. Therefore, the e⁄ect of call options does not seem large enough to signi(cid:133)cantly a⁄ect my results. To show the robustness of the results, I include (cid:133)xed e⁄ects for callable bonds, repeat the main exercise in Appendix B, and show that callability does not drive the main results. I apply three (cid:133)lters to remove the observations that are likely to be subject to erroneous recording. First, I remove the price observations that are higher than matching Treasury bond prices. Second, I drop the price observations below one cent per dollar. These two (cid:133)lters are applied to the prices to buy in. Third, I remove the return observations that show a large bounceback. Speci(cid:133)cally, I compute the product of the adjacent return observations and remove both observations if the product is less than 0:04. That is, if the same bond (cid:0) jumps up more than 20 percent in one month and comes down more than 20 percent in the following month, I assume that the price observation in the middle is recorded with errors. To compute excess returns and credit spreads, I need to construct the prices of the synthetic Treasury bonds that match the corporate bonds. To this end, I use the Federal Reserve(cid:146)s constant-maturity yields data. First, I interpolate the Treasury yield curve using cubic splines and construct Treasury zero-coupon curves by bootstrapping. At each month and for each corporate bond in the data set, I construct the future cash (cid:135)ow schedule for the 17

couponandprincipalpayments. ThenImultiplyeachcash(cid:135)owbythezero-couponTreasury bondpricewiththecorrespondingtimetomaturity. Iaddallofthediscountedcash(cid:135)owsto obtain the synthetic Treasury bond price that matches the corporate bond. I do this process for all corporate bonds at each month to obtain the panel data of matching Treasury bond prices. With this method, the credit spread measure is, in principle, una⁄ected by changes in the Treasury yield curve. 3.2 Main Results In this section, I estimate the VAR in the previous section and quantify the contribution of the volatility of expected credit loss to the changes in credit spreads. I start from the simple case in which the state vector includes only re , s(cid:28) and distance to default times i;t i;t duration, (cid:28) DD . I use re instead of l , as l in the right-hand side of the regression (cid:0) i;t i;t i;t i;t i;t e e is mostly zero. I include distance to default because it is known to forecast default (e.g., g e e e Gropp, Lo-Duca and Vesala (2006) and Harada, Ito and Takahashi (2010)), and Gilchrist and Zakraj(cid:154)ek (2012) use distance to default to decompose their measure of credit spreads. I use negative distance to default, so a greater value corresponds to a higher probability of default. Appendix C provides the computational details of distance to default. As I have to take a stand on the maturity of the bonds to use the present-value identity (7), I start by analyzing the sample of (cid:133)ve-year bonds. I identify the bonds with the remaining time to maturity of (cid:133)ve years, and use their history of data until they drop out of the dataset. For example, if there is a bond whose time to maturity is 20 years at issuance, I use the observations for the bond only after its remaining time to maturity becomes (cid:133)ve years. I discard all bonds whose time to maturity at issuance is less than (cid:133)ve years. Below, I show that the results are robust to the choice of maturity. I run pooled OLS regressions using demeaned state variables to forecast credit loss and estimate the VARs. To account for the cross-sectional correlation in error terms, I cluster 18

Table 1: Summary Statistics of the Variables: Monthly from 1973 to 2011 Variable Mean Std. 5%-pct 25%-pct Median 75%-pct 95%-pct Panel A: Descriptive Statistics, Basic Data re 0.14 3.37 -2.69 -0.47 0.14 0.74 3.21 i;t s(cid:28) 7.73 16.31 0.25 1.26 3.30 8.80 25.03 i;t (cid:28) DD 0.20 0.14 0.03 0.10 0.17 0.27 0.47 i;t i;t l 0.08 3.82 0.00 0.00 0.00 0.00 0.00 i;t Panel B: Descriptive Statistics, Demeaned Data re 0 3.25 -2.60 -0.51 -0.04 0.47 3.03 i;t s(cid:28) 0 15.48 -10.96 -4.72 -2.13 1.10 14.29 i;t e (cid:28) DD 0 0.13 -0.17 -0.09 -0.02 0.07 0.25 i;t fi;t l 0 3.81 -0.43 -0.06 0.00 0.00 0.00 g i;t Means, staendard deviations and percentiles (5, 25, 50, 75, and 95 percent) are estimated using the monthly panel data of (cid:133)ve-year bonds from January 1973 to December 2011. All the variables except (cid:28) DD are i;t i;t shown in percentage. re is the log return on the corporate bonds in excess of the matching Treasury bond, i;t l isthecreditloss,s(cid:28) isthecreditspreadofthecorporatebondsand (cid:28) DD isthedistancetodefault i;t i;t i;t i;t times the bond(cid:146)s duration. Panel A reports the statistics for the raw data, while in Panel B the variables are market-adjusted by subtracting the cross-sectional average each month. The number of observations is 197,206 bond months. standard errors by time. Later I compare the clustered standard errors with the standard errors from bootstrapping which con(cid:133)rms the reliability of the statistical inference. Table 1 shows the summary statistics of the variables used in regressions. The statistics are computed using the panel data of (cid:133)ve-year bonds. Panel A shows the raw data before demeaning. The excess returns and distance to default are distributed symmetrically, while the credit spreads and credit loss are right-skewed. Panel B shows the demeaned data, in which the cross-sectional mean is subtracted from each observation. To estimate the VAR below, I use the demeaned data. Demeaning does not signi(cid:133)cantly reduce the volatility of the variables, while it somewhat reduces the skewness of credit loss. The (cid:133)rst panel of Table 2 shows the estimated credit loss forecasting regressions. In order to see how approximation error a⁄ects the regression results, I use both l and its i;t+1 log-linear approximation based on (4), l (cid:26)s(cid:28) +s(cid:28) re , for the left-hand side i0;t+1 (cid:17) (cid:0) i;t+1 i;t (cid:0) i;t+1 e 19 e e e

variables. Table 2 shows that the credit loss measure, l , is forecastable based on the credit i;t+1 spread, with a slope coe¢ cient of 2.69. The bond is more likely to default when s(cid:28) is e i;t high (i.e., the price of the corporate bond is relatively lower than the price of the matching e Treasury bond). Past excess returns do not forecast the loss next period, while distance to default helps forecast default. When the issuer is closer to default (high (cid:28) DD ), the (cid:0) i;t i;t bond is more likely to default, which is consistent with the Merton (1974) model. g ThesecondpanelofTable2presentstheestimatedVARcoe¢ cients. Excessreturnstend to be higher when past excess returns are high, credit spreads are high, or the issuer is far from default. Statistically, only the credit spread is signi(cid:133)cant, with a coe¢ cient of 2.15 and a standard error of 0.98. Though R-squared is low, the return predictability is economically signi(cid:133)cant, with a standard deviation of expected excess returns of 0.21 percent per month. The variation in expected returns is very large compared with the variation found in the previous literature. For example, Gebhardt, Hvidkjaer and Swaminathan (2005) (cid:133)nd that the di⁄erence in average excess returns between di⁄erent credit ratings is 0.07 percent per month and the di⁄erence between di⁄erent durations is 0.04 percent. The VAR coe¢ cients for the credit spreads and distance to default show that these two variables are fairly autonomous and are forecastable mostly by their own past values. Due to the identity (7), lower prices of corporate bonds must correspond to either higher excessreturnsorhighercreditloss. Tomeasurethecontributionfromthesetwocomponents, I compute the VAR-implied long-run forecasting coe¢ cients, e G and e G, shown in the L 1 third panel of Table 2. Holding everything else constant, when credit spreads go up by one percent, the expected long-run credit loss only goes up by 0.52 percent. Under the benchmark case of the expected credit loss hypothesis, the slope coe¢ cient on credit spreads must be one. In the data, the estimated long-run credit loss forecasting coe¢ cient is more than three standard errors from one. 20

Table 2: Estimated VARs, Implied Long-run Regression Coe¢ cients and Volatility Ratios Explanatory variable Joint sigre s(cid:28) (cid:28) DD R2 ni(cid:133)cance (cid:27)(E [y ]) i;t i;t (cid:0) i;t i;t t i;t+1 Regression of credit loss on information: l -5e.63 f2.69 g1.85 0.03 [0.000] 0.30 i;t+1 (3.88) (0.91) (0.85) el -5.19 2.38 1.86 0.02 [0.018] 0.26 i0;t+1 (3.92) (0.92) (0.86) e VAR estimates: A 100 (cid:2) re 1.05 2.15 -1.79 0.01 [0.000] 0.21 i;t+1 (3.30) (0.98) (1.24) s(cid:28) 4.17 96.14 -0.07 0.90 [0.000] ei;t+1 (5.15) (1.25) (1.50) (cid:0) (cid:28) i;t+1 DfD i;t+1 -0.16 0.05 98.22 0.99 [0.000] (0.03) (0.01) (0.29) g Long-run regression coe¢ cients: (cid:26)j 1l -0.03 0.52 0.76 4.76 (cid:0) i;t+j (0.02) (0.16) (0.34) P (cid:26)j 1ree 0.03 0.48 -0.76 4.80 (cid:0) i;t+j (0.02) (0.16) (0.34) P e Implications of VAR estimates: (cid:27)(s(cid:28)l)=(cid:27)(s(cid:28)) (cid:27)(s(cid:28)r)=(cid:27)(s(cid:28)) corr(s(cid:28)l;s(cid:28)) corr(s(cid:28)r;s(cid:28)) corr(s(cid:28)l;s(cid:28)r) Estimates 0.51 0.51 0.900 0.900 0.919 (0.16) (0.15) (0.012) (0.025) (0.068) The sample period is monthly from 1973 to 2011. re is the log return on the corporate bond i in excess of i;t the matching Treasury bond, l is the credit loss on bond i, l is the credit loss implied from re ;s(cid:28) i;t i0;t i;t i;t 1 and s(cid:28) based on (4); s(cid:28) is the credit spread,eDD is the distance to default, and (cid:28) is the bond(cid:0)(cid:146)s i;t i;t i;t i;t duration. Thevariabless(cid:28)l i;t eands(cid:28)r i;t arethesumofexpecteedlong-rundiscountedcreditlossan e dth f esum of exfpected long-run discfounted excess returns, de(cid:133)ned by s(cid:28)l i;t = e L A(I (cid:0) (cid:26)A)(cid:0) 1 I (cid:0) ((cid:26)A)T (cid:0) t X i;t and s(cid:28)r i;t =e 1 A(I (cid:0) (cid:26)A)(cid:0) 1 I (cid:0) ((cid:26)A)T (cid:0) t X i;t . Thecolumn(cid:27)(E t [y t+1 ])showsthesamp(cid:16)lestandardde(cid:17)viationof (cid:133)tted values of the left-(cid:16)hand side vari(cid:17)ables. Standard errors, reported in parentheses under each coe¢ cient, are clustered by time, and p-values are reported in brackets. The matrix A and the associated standard errors are multiplied by 100. 21

Since the long-run forecasting coe¢ cients on the credit spreads add up to one, the excess return forecasting coe¢ cient is 1-0.52=0.48, which is three standard errors from zero. Economically, the return forecastability is as large as the credit loss forecastability. When the credit spread increases, expected excess returns go up about as much as expected credit loss does. Though lower corporate bond prices signal higher defaults in the future, the credit loss predictability is not large enough to eliminate the excess return predictability. As a result, rising credit spreads signals both increasing default loss and excess returns in the future. Statistically, the evidence for excess return forecastability is more clear in the implied long-run coe¢ cients than in the one-period coe¢ cients. Since the long-run coe¢ cients are estimated more precisely, they are highly statistically signi(cid:133)cant, while the one-period coe¢ cients are only marginally signi(cid:133)cant. By the identity (9), the slope coe¢ cients of long-run excess returns and long-run credit lossonre and (cid:28) DD adduptozero. Inorderforvariablesotherthanthecreditspreads i;t (cid:0) i;t i;t to forecast long-run excess returns, these variables have to help credit spreads forecast longg e run credit loss. In Table 2, past excess returns do not forecast either long-run excess returns or credit loss, while distance to default has a marginal forecasting power. The bottom panel shows the ratio of the volatility of expected credit loss to the credit spreads. This volatility ratio answers the question of how much of the volatility of observed credit spreads is driven by changes in expected credit loss. The estimated volatility ratio is 0.51. Since past excess returns and distance to default are economically insigni(cid:133)cant forecasters of credit loss, the volatility ratio nearly coincides with the long-run credit loss forecasting coe¢ cient on credit spreads. The ratio of the volatility of long-run expected excess returns to the volatility of credit spreads is 0.51, which is the same as the ratio for the expected credit loss. The volatility ratio for expected excess returns is more than three standard errors from zero, a benchmark value predicted by the expected credit loss hypothesis. Since credit spreads are the only economically signi(cid:133)cant regressors, longrun expected credit loss and excess returns are highly correlated with credit spreads, with 22

correlation coe¢ cients of 0.90 and 0.90, respectively. In general, the volatility ratios for expected credit loss and expected excess returns do not have to add up to one. If there are signi(cid:133)cant forecasters of long-run credit loss other than credit spreads, which in turn forecast long-run excess returns, then the volatility ratios do not add up to one due to the covariance between expected credit loss and expected excess returns. In this VAR speci(cid:133)cation, the predictive power of past excess returns and distance to default is weak, which makes it possible to cleanly separate the volatility of credit spreads due to expected credit loss from expected excess returns. The log-linear approximation does not signi(cid:133)cantly a⁄ect the regression results. The regressions coe¢ cients for l and l are similar and within one standard error. When I run i;t t0 the VAR using re (cid:26)s(cid:28) +s(cid:28) l instead of re , the long-run excess return i;0t+1 (cid:17) (cid:0) e i;t+1 e i;t (cid:0) i;t+1 i;t+1 forecasting coe¢ cient becomes 0.53, instead of 0.48 as in Table 2. Thus, the approximation e e e e e errordoes not signi(cid:133)cantlya⁄ect the results, foreitherthe one-periodorlong-runforecasting coe¢ cients. There is no particular reason for using (cid:133)ve-year bonds in the analysis. To determine whetherchangingthetimetomaturitymatters, IrepeattheVARestimationusingthe3-, 7-, 10-, 15- and 20-year bonds. The results based on the di⁄erent times to maturity are shown in Table 3. The decomposition result does not change signi(cid:133)cantly across maturities. The volatilities of expected long-run credit loss and excess returns peak at seven-year bonds. For shorter maturities, the variation of state variables shrinks due to shorter durations, leading to smaller (cid:27) s(cid:28)l and (cid:27)(s(cid:28)r). For longer maturities, the sample size becomes limited and slightly biase(cid:0)d fo(cid:1)r high quality bonds, and thus the volatilities for s(cid:28)l and s(cid:28)r become i;t i;t smaller. The volatility ratio, however, is very stable; the ratio is slightly below 0.5 for expected credit loss and a bit more than 0.5 for expected excess returns. The last two columns of Table 3 report the VARs for (cid:133)ve-year bonds with more lags and more state variables. For more lags, I report the VAR with three lags as an example. The 23

Table 3: VARs with Various Maturities, Lags and Variables VAR(1) with X = re s(cid:28) (cid:28) DD More Lags More Variables i;t i;t i;t i;t i;t Years 3 5 7 (cid:16) 10 15 2(cid:17)0 5 5 (cid:27)(s(cid:28)l) 3.77 4.64 4.99 e4.04f 3.99g 3.71 3.86 4.99 (cid:27)(s(cid:28)l)=(cid:27)(s(cid:28)) 0.50 0.50 0.46 0.41 0.43 0.43 0.43 0.53 (0.17) (0.15) (0.10) (0.10) (0.12) (0.12) (0.13) (0.19) (cid:27)(s(cid:28)r) 4.32 5.27 6.12 5.96 5.34 5.00 5.18 4.40 (cid:27)(s(cid:28)r)=(cid:27)(s(cid:28)) 0.57 0.56 0.56 0.60 0.57 0.58 0.57 0.47 (0.13) (0.13) (0.10) (0.10) (0.12) (0.12) (0.13) (0.18) The sample period is monthly from 1973 to 2011. The table shows the estimated volatility ratio based on the VARs, X = AX +BW . The values 3,...,20 mean that the VARs are estimated using i;t+1 i;t i;t+1 the sample of 3,...,20-year bonds. The column (cid:145)More Lags(cid:146)shows the VAR with three lags for (cid:133)ve-year bonds, while (cid:145)More Variables(cid:146)shows the estimates for (cid:133)ve-year bonds based on the VAR(1) with the state vector X = re s(cid:28) (cid:28) re (cid:28) Lev (cid:28) Evol 0. The long-run expected credit loss is de(cid:133)ned i;t i;t i;t i;t i;t i;t i;t i;t i;t by s(cid:28)l i;t = e L A(cid:16)( e I (cid:0) (cid:26) f A)(cid:0) 1A g I (cid:0) ((cid:26)A) g T (cid:0) t X i;t an g d the lo(cid:17)ng-run expected returns are de(cid:133)ned by s(cid:28)r i;t = e 1 A(I (cid:26)A)(cid:0) 1 I ((cid:26)A)T (cid:0) t(cid:16)X i;t . Stand(cid:17)ard errors are clustered by time, and reported in parentheses (cid:0) (cid:0) under each coe¢(cid:16)cient. (cid:17) volatility ratios for credit loss and excess returns are 0.43 and 0.57, which are similar to the case of one lag. The (cid:145)More Variables(cid:146)speci(cid:133)cation extends the state vector to include (cid:133)ve variables: 0 X = re s(cid:28) (cid:28) re (cid:28) Lev (cid:28) Evol , i;t i;t i;t i;t i;t i;t i;t i;t i;t (cid:18) (cid:19) where Lev is the market leeveraege de(cid:133)gned by gthe total b g ook value of the issuer(cid:146)s debt i;t divided by the sum of the book value of the debt and the market value of equity, and Evol i;t is the issuer(cid:146)s equity volatility estimated from the daily stock returns over the last year. As past excess returns can a⁄ect the risk and expected returns on bonds di⁄erently for various maturities, I include excess returns scaled by time as an additional state variable. Since distance to default is essentially a nonlinear function of leverage and equity volatility, I use both variables instead of distance to default to allow for more (cid:135)exible dependence of default loss on state variables. With more variables, I run the VAR with one lag and (cid:133)nd that the results do not change signi(cid:133)cantly. The volatility ratio for expected long-run credit loss is 0.53, while it is 0.47 for excess returns. The decomposition results are not sensitive to time to maturity of corporate bonds, the lags, or the additional state variables in VARs. 24

3.3 Subsamples Based on Credit Ratings and Nonlinearity In this subsection, I estimate the volatility ratio separately for bonds with di⁄erent credit ratings. HuangandHuang(2012)(cid:133)ndthatdefaultcomponentsimpliedbystructuralmodels explain larger fractions of the level of credit spreads for speculative grade (junk) bonds than for investment grade (IG) bonds. Although I focus on the variation of credit spreads rather than the level of credit spreads, it is still interesting to see how my results di⁄er for IG bonds and junk bonds. In addition, the subsample analysis by credit ratings provides an e⁄ective way to address the nonlinearity in the state variables. Nonlinearity can be an issue if extremely high values of credit spreads are more informative about defaults than lower level of spreads. As the credit spread varies across credit ratings, estimating long-run forecasting coe¢ cients separately for each credit rating can handle the potential issues with nonlinearity. I use the bonds with (cid:133)ve years to maturity and split the sample based on the credit ratings when each bond has the time to maturity of (cid:133)ve years. As the present value formula in (7) holds for the life of a bond, I keep the bond in the same rating-based subsample even if its credit rating later changes. I use the bonds with the remaining time to maturity of (cid:133)ve years, as there are more observations for (cid:133)ve-year bonds than bonds with a longer time to maturity. Using the subset of bonds, I estimate the VAR(1) including re ;s(cid:28) and (cid:28) DD as i;t i;t (cid:0) i;t i;t state variables. Table 4 shows the standard deviation of s(cid:28)l and s(cid:28)r and the volatility g e e ratios for each rating category. The volatility of expected long-run credit loss increases monotonically as credit ratings fall. The volatility is essentially zero for AAA/AA bonds, while it is 20.2 percent for CCC/C bonds. The volatility ratio for expected long-run credit loss also rises from 0.02 to 0.91. In contrast, the volatility of expected long-run excess returns is nearly constant across credit ratings, except for CCC/C bonds. As a result, the volatility ratio for expected excess returns tends to be lower for bonds with a high credit 25

Table 4: Subsamples Based on Credit Ratings AAA/AA A BBB BB B CCC/C IG Junk (cid:27)(s(cid:28) ) 0.06 0.09 1.83 6.32 9.56 20.21 0.23 9.17 l (cid:27)(s(cid:28) )=(cid:27)(s(cid:28)) 0.02 0.02 0.32 0.71 0.76 0.91 0.04 0.69 l (0.04) (0.01) (0.15) (0.31) (0.36) (0.24) (0.02) (0.17) (cid:27)(s(cid:28) ) 2.95 3.75 3.98 3.76 3.11 15.99 6.18 4.12 r (cid:27)(s(cid:28) )=(cid:27)(s(cid:28)) 0.98 0.98 0.69 0.42 0.25 0.72 0.97 0.31 r (0.02) (0.01) (0.14) (0.21) (0.36) (0.17) (0.02) (0.17) Gathering rating-based estimates into one bin (cid:27)(s(cid:28)l) (cid:27)(s(cid:28)l) (cid:27)(s(cid:28)r) (cid:27)(s(cid:28)r) corr(s(cid:28)l;s(cid:28)r) (cid:27)(s(cid:28)) (cid:27)(s(cid:28)) Estimates 5.11 0.53 4.13 0.43 0.44 Thesampleperiodismonthlyfrom1973to2011. Thetoppanelshowstheestimatedvolatilityratiosbased on the VARs, X =AX +BW , estimated separately for each credit rating. The results are based i;t+1 i;t i;t+1 on (cid:133)ve-year bonds and the bonds are sorted into subsamples based on their credit ratings when they are (cid:133)ve years to maturity. The long-run expected credit loss is de(cid:133)ned by s(cid:28)l = e GX and the long-run i;t L i;t expected returns are de(cid:133)ned by s(cid:28)r = e GX . IG is the investment grade, which includes AAA/AA, A i;t 1 i;t and BBB, while Junk includes BB, B and CCC/C. Standard errors, reported in parentheses, are clustered by time. The number of observations is 21,797 for AAA/AA, 52,524 for A, 60,960 for BBB, 28,507 for BB, 28,923 for B, 4,495 for CCC/C bonds. The bottom panel collects the separately estimated s(cid:28)l and s(cid:28)r i;t i;t for AAA/AA to CCC/C into one sample, and compute their summary statistics. risk. FortheCCC/Cbonds, distancetodefaultisvolatileandbecomesaverystrongforecaster of long-run credit loss (and thus long-run excess returns). As a result, both expected longrun credit loss and excess returns are highly volatile compared with credit spreads, and the two volatility ratios do not add up to one. For IG bonds, 97 percent of the variation in credit spreads corresponds to discount rate news. For Treasury bonds, the ratio (cid:27)(s(cid:28)r)=(cid:27)(s(cid:28)) must be one, as there is no shock to their cash (cid:135)ow. Thus, IG bonds have a volatility ratio similar to Treasury bonds. In contrast, for junk bonds, cash (cid:135)ow news is roughly twice as volatile as the discount rate news. The standard deviation of the expected long-run credit loss for junk bonds is 9.2 percent, which is slightly more than double the volatility for long-run discount rates (4.1 percent). Vuolteenaho (2002) examines the individual stock returns and (cid:133)nds that the ratio is roughly the same: The cash (cid:135)ow news standard deviation is about twice as high as that of the discount rate news. Thus, the behavior of the price of junk bonds is consistent with 26

the previous literature on stocks, while the movement of the IG bond prices is quite similar to that of Treasury bonds. To examine the e⁄ect of the potential nonlinearity, I plot the long-run credit loss and excess return forecasting coe¢ cients on credit spreads, e Ge and e Ge in Figure 1. I L 02 1 02 determine the range of credit spreads for each credit rating by setting the border in which the two neighboring histograms of credit spreads overlap with each other. Within each range, I draw two lines with the slope equal to e Ge and e Ge , while the intercepts are set L 02 1 02 so that there are no jumps across credit ratings. Figure 1 visualizes how the slope di⁄ers across credit ratings and thereby shows the degree of nonlinearity in the long-run VAR. Figure 1: Long-Run Forecasting Coe¢ cients By Credit Ratings Long Run Forecasting Coefficients 50 45 AA+ A BBB BB B CCC 40 35 30 25 20 15 10 5 Expected Credit Loss Expected Excess Returns 0 0 10 20 30 40 50 Demeaned Price Spread (%) The x-axis is the demeaned credit spreads, with the left end set by the 1st percentile of credit spreads and the right end set by the 99th percentile. The y-axis is the long-run expected credit loss, E (cid:26)j 1l , t (cid:0) i;t+j andexcessreturns,E (cid:26)j 1re ,estimatedfromthecreditrating-basedsubsamples. Dashhedlinesdenotie t (cid:0) i;t P e +/- standard-error bounds. The borders between credit ratings are set where the histogram of the credit (cid:2)P (cid:3) spreads for the credit rating overlaps with the histogram for the neighboring credit ratings. e WithintherangeofIGratings,theexpectedcreditlossforecastingcoe¢ cientsarecloseto zero, andthusthelineisrather(cid:135)at. Incontrast, theexcessreturnforecastingcoe¢ cientsare closetoone, leadingtothesteepline. Thisimpliesthatthevariationincreditspreadswithin 27

the IG ratings corresponds mostly to the variation in expected excess returns. However, as the credit spread increases, the line for expected credit loss starts to steepen, while the line for expected excess returns begins to (cid:135)atten out. Thus, looking across all ratings, nonlinearity certainly exists, but the ranges of the distribution of the expected credit loss and excess returns are very similar to each other. In the bottom panel of Table 4, I gather the estimated long-run expected credit loss, s(cid:28)l E (cid:26)j 1l , and the estimated long-run expected excess returns, s(cid:28)r i;t (cid:17) t j (cid:0) i;t+j i;t (cid:17) h i E (cid:26)j 1rPe , from the rating-based subsamples, and compute volatility across all (cid:133)vet j (cid:0) i;t+j e h i yearPbonds. This way, I allow the VAR coe¢ cients to be di⁄erent across the six credit e rating categories while computing the volatility ratio using all the rating-based subsamples. Even when I allow the long-run coe¢ cients to di⁄er across ratings, I obtain (cid:27) s(cid:28)l = 5:1 i;t percent and (cid:27) s(cid:28)r = 4:1 percent, which are similar to each other. The ra(cid:0)tios o(cid:1)f these i;t volatilities to t(cid:0)he vo(cid:1)latility of credit spreads are also similar: 0.53 for credit loss and 0.43 for the expected returns. Therefore, even after allowing the forecasting coe¢ cients to di⁄er across credit ratings and accounting for nonlinearity, the conclusion that the volatility of the cash (cid:135)ow and discount rate shocks is similar still holds in the data. The subsample analysis based on credit ratings shows that, for an investor who can invest only in IG bonds for institutional reasons, the expected excess returns are the main drivers of the variation in credit spreads. However, for an investor who can invest in bonds with all ranges of credit spreads, to say that the contributions from the expected credit loss component and excess return component are roughly the same is an acceptable description of the overall market. One result that did change fromthe simple VAR in Table 2 is the correlation between the expected credit loss and excess returns. When we impose linearity, these two components appear highly correlated, with a correlation coe¢ cient of 0.92. However, this is misleading: Once we account for nonlinearity using the rating-based subsamples as in Table 4, then 28

correlation goes down to 0.44. To correctly estimate the correlation between the expected defaultandexcessreturns, itisessentialtoaccountforthenonlinearitybetweenthelong-run credit loss and credit spreads. Several readers worry about the (cid:147)peso problem(cid:148)in my analysis. If the sample size is too small, then we may not see any defaults for investment grade bonds in the sample, as the probability of default for these bonds is very low. However, because I construct a large panel dataset from the various sources and work on the long-horizon regressions, I still observe defaults in my sample. No AAA bonds jump to default in a month, but since I wait for (cid:133)ve years since the bond is rated, the number of defaults in the sample is not zero. In particular, there are two default observations in the AAA/AA subsample and nine default observations in the A subsample. For the A bonds, the volatility of the long-run expected default is statistically signi(cid:133)cantly di⁄erent from zero. Thus, the sample size is large enough for variance decomposition, even for investment grade bonds. 3.4 Small Sample Biases and Out-of-Sample Predictability This section provides several robustness checks. First, the point estimates and standard errors in Table 2 might be a⁄ected by a small sample bias, such as the bias pointed out by Stambaugh (1999). The variable, s(cid:28) , is persistent and contemporaneously negatively i;t correlated with excess returns, which may result in potentially spurious return predictabile ity. To address this concern, I run bootstrap simulations, generating 10,000 paths of the state variables under the null that the (cid:133)rst row of the estimated matrix A in Table 2 is zero. To bootstrap, I resample months with replacement to generate panel data, allowing heteroskedasticity and cross-sectional correlation in error terms. By running bootstrap simulations instead of Monte Carlo simulations, I allow non-normality in error terms and avoid estimating the variance-covariance matrix of error terms. Table 5 reports the average point estimates for the key coe¢ cients and test statistics over 29

Table 5: Bootstrap Simulations LHV re (cid:26)j 1re i;t+1 (cid:0) i;t+j RHV re s(cid:28) (cid:28) DD re s(cid:28) (cid:28) DD (cid:27)(s(cid:28)l)=(cid:27)(s(cid:28)) (cid:27)(s(cid:28)r)=(cid:27)(s(cid:28)) i;t i;t (cid:0) i;t i;t i;tP i;t (cid:0) i;t i;t Data: e e Estimates 1e.05 f2.15 g-1.79 0e.03 f0.48 g-0.76 0.51 0.51 Asym. pv [0.375] [0.014] [0.074] [0.090] [0.001] [0.012] [0.001] [0.000] Simulation results: Null 0 0 0 0 0 0 1 0 Mean -0.29 0.02 0.06 0.00 -0.05 -0.05 1.05 0.21 Simulated pv [0.355] [0.012] [0.067] [0.149] [0.002] [0.100] [0.000] [0.056] 10,000 simulated paths are generated under the null that A(1; ) = 0. Estimates and asymptotic p-values (cid:1) are taken from Table 2. The Mean shows the average of the point estimates over 10,000 paths. Simulated p-values are the fraction of the simulated paths in which the estimates based on the path are below (for (cid:27)(s(cid:28)l)=(cid:27)(s(cid:28))) or above (for other variables) the point estimates from the data. 10,000 paths, as well as the fraction of the 10,000 paths that exceed the point estimates in Table 2. The coe¢ cients on s(cid:28) for the one-period return forecasting regression show a i;t small upward bias of 0.02. However, the p-values based on the simulated probability density e function are about the same as the asymptotic p-values. In addition, the long-run return forecasting coe¢ cient on s(cid:28) is slightly downward biased. Thus, there is little evidence i;t that the contribution of the expected long-run excess return to credit spreads in Table 2 is e overestimated. Thevolatilityratioforexpectedlong-runcreditlossaveragedoversimulatedpathsis1.05, slightly upward biased compared with the null of one. Compared with the point estimate of 0.51, the magnitude of the bias is economically small. In contrast, the volatility ratio for expected long-run excess returns seems to be more upward biased: The average over the simulatedpathsis0.21andthep-valuebasedonthesimulateddistributionisabove5percent. However, this upward bias does not contradict the claim that substantial variation in credit spreads corresponds to changing expected returns. The null that expected long-run returns are constant implies that (cid:27)(s(cid:28)r)=(cid:27)(s(cid:28)) = 0 and corr(s(cid:28)r;s(cid:28)) = 0. In the data, I (cid:133)nd that (cid:27)(s(cid:28)r)=(cid:27)(s(cid:28)) > 0 and corr(s(cid:28)r;s(cid:28)) > 0: Thus, to judge the statistical signi(cid:133)cance of the contribution of the discount rate component, one has to check the probability of observing 30

(cid:27)(s(cid:28)r)=(cid:27)(s(cid:28)) > 0 and corr(s(cid:28)r;s(cid:28)) > 0 at the same time, under the null that long-run returns are not predictable. Figure 2 plots the volatility ratios and correlations for each simulated path. Most of the paths that give (cid:27)(s(cid:28)r)=(cid:27)(s(cid:28)) greater than the sample estimate (0.51) lie in the region of negative correlation, which is counterfactual. On the contrary, to produce a correlation coe¢ cient as large as that in the data, the volatility ratio has to be low. Thus, under the null of constant expected excess returns, the probability that both a large volatility ratio and a high (positive) correlation are drawn is less than one percent. The signi(cid:133)cance of the expected long-run excess return component in credit spreads is not a spurious result due to small sample bias. Figure 2: Volatility Ratio and Correlation Based on Bootstrap Simulations 2.5 2 1.5 ) ts ( s/)r 1 0.06 0.00 ts ( s 0.5 0 0.61 0.33 0.5 1 0.5 0 0.5 1 corr(st ,st ) r The (cid:133)gure plots the volatility ratio, (cid:27)(s(cid:28)r)=(cid:27)(s(cid:28)), and correlation corr(s(cid:28)r;s(cid:28)) over the simulated paths. The large dot at (0.9, 0.5) gives the sample estimates. Values are the fraction of 10,000 simulations that fall in the indicated quadrants. Second, I examine the out-of-sample return predictability. The fact that the expected return variation is a signi(cid:133)cant driver of the credit spreads suggests that an investor may be able to take advantage of the return predictability in the cross section. In particular, 31

by taking a long position in the bonds with high credit spreads and a short position in the bonds with low credit spreads, the investor should be able to earn higher excess returns, on average. However, as Goyal and Welch (2008) show, many equity return predictors that work in sample do not perform well out of sample. In principle, the out-of-sample test does not validate or invalidate the variance decomposition results. To what extent the price variation comes from changing risk premium is a somewhat di⁄erent question from the usefulness of return predictability for real-time trading. In fact, Cochrane (2008) shows that even if 100% of the variation in the dividend price ratio of stocks corresponds to the risk premium variation, the out-of-sample return predictability can be rather poor. However, the out-of-sample test can provide a useful diagnostic as to the stability of the forecasting coe¢ cients, and help us detect whether a few outliers drive the entire results. To examine the out-of-sample return predictability in the cross section, I follow Lewellen (2014) and run the cross-sectional regressions every month re = (cid:13) E re +(cid:17) ; i = 1; ;N , i;t+1 t t i;t+1 i;t+1 (cid:1)(cid:1)(cid:1) t (cid:2) (cid:3) c e e where E re is the expected excess returns estimated from the same VAR(1) as in Table t i;t+1 2, but usi(cid:2)ng on(cid:3)ly the information up to month t. If E re is the true expected return, c e t i;t+1 then the slope coe¢ cient (cid:13) must be one, while if E re (cid:2) is(cid:3)just noise and not associated t t ci;t+e1 with true expected returns, then we have (cid:13) = 0. Of(cid:2)course(cid:3), even if the VAR(1) is correctly t c e speci(cid:133)ed, the estimation errors in E re are likely to bring down (cid:13) towards zero due t i;t+1 t to attenuation bias. In this exercise,(cid:2)I exam(cid:3) ine to what extent the VAR provides a useful c e measure of the expected returns for real-time trading. In such exercise, investors who try to pro(cid:133)t from this trading strategy also su⁄er from the attenuation biases. Thus, from the investors(cid:146)perspective, the biased estimate of (cid:13) is the relevant measure to evaluate the t performance of the strategies. In addition, the attenuation bias works against my argument that the forecasting regression performs well out of sample. Thus, I only correct standard 32

errorsfortheestimationerrorsinE re , andleavethecoe¢ cientestimates uncorrected. t i;t+1 (cid:2) (cid:3) I run both rolling window foreccasteing regressions and cumulative forecasting regressions, with the window size of 120 months and 240 months. I repeat the exercise with the full sample and the subsamples for IG bonds and junk bonds. I also estimate VARs separately for AAA/AA, A, BBB, BB, B and CCC/C bonds; collect the estimated E re for all t i;t+1 bonds; and run a single series of monthly cross-sectional regressions to account(cid:2)for p(cid:3)otential c e nonlinearity. Table 6 shows the average (cid:13) and R-squared of the out-of-sample forecasting regressions. t The estimated expected returns based on the VARs indeed forecast returns out of sample. For all cases except the rolling window regression (120 months) with all bonds, average (cid:13) is t statisticallysigni(cid:133)cantlydi⁄erentfromzero. For10of16cases, theaverage(cid:13) isstatistically t indistinguishable from one. The average R-squared ranges from 9% to 18%. Comparing the results for IG bonds and junk bonds, the predictability is pervasive across credit ratings, though the forecasting regression slightly underpredicts the variation in expected returns for the IG bonds and overpredicts the variation for the junk bonds. All told, at least with longer rolling windows (240 months) or cumulative estimation, the forecasting regressions indeed provide a measure of expected returns that forecasts returns out of sample. In the online appendix2, I show that the variation in expected excess returns can be explained by exposures to systematic risks. Also, as a further robustness check, I show in the online appendix that the variance decomposition results in this article are not a⁄ected by the state tax e⁄ects pointed out by Elton, Gruber, Agrawal and Mann (2001). Though the state tax can a⁄ect the level of the state variables, it does not change their movements. Finally, in the online appendix, I show that the main results in Table 2 are robust, even if I include industry (cid:133)xed e⁄ects in estimating the VAR to account for the di⁄erence in credit spreads across industries. 2Theonlineappendixcanbefoundat: https://sites.google.com/site/yoshio(cid:133)nancialeconomics/home/research 33

Table 6: Return Forecasting Regressions: Out-Of-Sample Performance Window size 120 months 240 months Estimates (cid:13) s:e:((cid:13) ) R2 (cid:13) s:e:((cid:13) ) R2 t t t t Rolling window regressions: All 0.34 (0.24) 11.17 0.75 (0.38) 12.84 IG 1.14 (0.13) 15.80 1.18 (0.11) 17.71 Junk 0.70 (0.16) 12.43 0.75 (0.22) 9.94 Gathering AAA to CCC into one group 0.59 (0.08) 9.01 0.68 (0.11) 9.01 Cumulative regressions: All 0.90 (0.22) 13.07 0.96 (0.33) 13.29 IG 0.96 (0.09) 16.60 1.11 (0.11) 17.77 Junk 0.86 (0.17) 11.83 0.69 (0.23) 9.92 Gathering AAA to CCC into one group 0.67 (0.08) 9.78 0.69 (0.11) 9.19 Tablereportsthetime-seriesaverageoftheparameterestimatesfromthemonthlycross-sectionalregressions of the form re = (cid:13) E re +(cid:17) ;i = 1; ;N , where E re is the expected excess returns i;t+1 t t i;t+1 i;t+1 (cid:1)(cid:1)(cid:1) t t i;t+1 estimated using only the information up to t. Rolling window regressions show the estimates in which (cid:2) (cid:3) (cid:2) (cid:3) I estimate E et r i e ;t+1 uscing e the rolling window of 120 or 240 mocnth e s. Cumulative regressions show the forecasting results 120 or 240 months after the sample begins, where I use all the data available up to time (cid:2) (cid:3) t to estimatce E et r i e ;t+1 . The row (cid:147)All(cid:148)shows the results using all bonds to run a single VAR to estimate E re ,whiletherows(cid:147)IG(cid:148)and(cid:147)Junk(cid:148)showtheresultsusingthesubsamplesofinvestmentgradebonds t i;t+1 (cid:2) (cid:3) (AAA/AA, AcandeBBB) and junk bonds (BB,B and CCC/C). The row (cid:147)Gathering AAA to CCC into one (cid:2) (cid:3) gcrou e p(cid:148)shows the results in which I run a separate VAR for AAA/AA, A, BBB, BB, B and CCC/C to estimate E re , and gather all the estimates to run a single series of forecasting regressions. R2 is the t i;t+1 (unadjusted) R-squared multiplied by 100. Standard errors, reported in parentheses, are corrected for the (cid:2) (cid:3) estimationcererors in E t r i e ;t+1 using GMM and adjusted for Newey-West 3 lags. (cid:2) (cid:3) c e 34

4 Aggregate Credit Spread Dynamics and the E⁄ects on Investment 4.1 VARs with Aggregate Variables Campbell and Shiller (1988a and 1988b) and Cochrane (2008 and 2011) emphasize the importanceoftime-varyingriskpremiainunderstandingthepriceofthestockmarketportfolio. In contrast, Vuolteenaho (2002) (cid:133)nds that cash (cid:135)ow shocks are more important for individual stocks. Thus far, I (cid:133)nd that the expected default component is just as important as the expected excess return component for individual corporate bonds. However, given the evidence in the stock market, these results may be di⁄erent for the aggregate corporate bond market portfolio. To examine the di⁄erence for the aggregate market, I take the equal-weighted average of individual variables in each month to obtain the macro variables, and denote them with subscripts EW. For example, the equal-weighted market portfolio returns are computed by 1 Nt re re ; EW;t (cid:17) N i;t t i=1 X where N is the number of bonds in month t. These equal-weighted average returns and t credit spreads are an approximation to the logarithm of the market returns and spreads, as the average of the logarithm is not, in general, equal to the logarithm of the averages. Using these macro variables, I run a restricted VAR with a state vector: X = re s(cid:28) (cid:28) DD re s(cid:28) (cid:28)DD ; i;t i;t i;t (cid:0) i;t i;t EW;t EW;t (cid:0) EW;t (cid:18) (cid:19) which follows the dynamics X = AX +BW : i;t+1 i;t i;t+1 I restrict the three-by-three entries at the lower left corner of the matrix A to be zero, 35

so that the current individual variables do not forecast the future macro variables. By including the macro variables, I can exploit the cross-sectional variation of individual bonds without demeaning. Thus, based on this VAR with macro variables, the cross-sectional average of the estimated expected credit loss and excess returns will not be zero, making it possible to examine the variation in the average expected credit loss and excess returns over time. Table 7 shows the estimatedVARandits long-runimplications usingthe (cid:133)ve-yearbonds. The volatility of expected credit loss is 5.03%, while the volatility of expected excess returns is 6.77%. Thus, these volatility ratios are still comparable to each other. However, the volatility (over time) of the equal-weighted average of expected credit loss is only 0.95%, which is much lower than 4.05% for the equal-weighted average expected excess returns. The reason for the gap between individual bond results and the aggregate market results is the diversi(cid:133)cation e⁄ect. Following Vuolteenaho (2002), I compute the diversi(cid:133)cation factor: (cid:27)2 s(cid:28)u EW;t Diversi(cid:133)cation Factor = u l;r ; (cid:27)(cid:22)2 s(cid:28)u 2 f g (cid:0) i;t (cid:1) where (cid:27)(cid:22)2 s(cid:28)u 1 N (cid:27)2 s(cid:28)u . The divers (cid:0) i(cid:133)cat (cid:1) ion factor compares the variance of the i;t (cid:17) N i=1 i i;t market va(cid:0)riabl(cid:1)e withPthe aver(cid:0)age o(cid:1)f the variance of the individual variable. If the variation of the individual variable is idiosyncratic, then the diversi(cid:133)cation factor becomes close to zero. In contrast, if much of the variation of the individual variable comes froma systematic shock, then the diversi(cid:133)cation factor becomes larger. Table 7 shows that the diversi(cid:133)cation factor is 0.10 for the expected credit loss while it is 1.19 for the expected excess returns. The diversi(cid:133)cation factors show that much of the variation in individual bonds(cid:146)expected credit loss is due to idiosyncratic shocks, while much of the variation in expected excess returns is from systematic shocks. Thus, my (cid:133)ndings are consistent with the previous (cid:133)ndings in stocks, in which the risk premium variation dominates the aggregate dynamics, while the cash (cid:135)ow variation is signi(cid:133)cant for 36

the individual securities. I repeat the VAR with macro variables using the subsamples based on credit ratings, as shown in Table 8. As before, using the subsamples and allowing nonlinearity do not a⁄ect the relative importance of the expected credit loss and excess returns. The overall volatility from the combined estimates of the six subsamples is 6.96% for the expected credit loss and 7.91% for the expected excess returns. The diversi(cid:133)cation factor is 0.04 for the expected credit loss and 1.24 for the expected excess returns. Only the correlation between expected excess returns and credit loss changes signi(cid:133)cantly from Table 7. After accounting for nonlinearity, the correlation is close to zero, much lower than 0.599 in Table 7. 4.2 E⁄ects of Credit Spreads on Investment In this section, I examine how the two components of credit spreads a⁄ect (cid:133)rms(cid:146)investment in the future. Gilchrist and Zakraj(cid:154)ek (2012) decompose credit spreads based on the Merton (1974) model, and (cid:133)nd that many macro economic variables are forecastable mainly by the (cid:147)excess bond premium(cid:148), or the residuals of credit spreads unexplained by the Merton (1974) model, not by the default risk implied by the model. Gilchrist and Zakraj(cid:154)ek (2012) focus on the aggregate credit spreads, rather than the cross section of individual (cid:133)rms. Thus, it is interesting to see how the di⁄erent components of the credit spreads forecast economic activities in the cross section. In this section, I use the decomposition results of the (cid:133)ve-year bonds using the credit rating-based subsamples in section 4.1. I use the results based on the rating-based subsamples because the correlation between the expected credit loss and expected excess returns after accounting for nonlinearity is more accurately measured than the full sample results with the linearity assumption. I use the estimates based on the VAR, including macro variables, to contrast the results at the individual (cid:133)rm level and aggregate level. I take the average of all the bonds issued by a (cid:133)rm to estimate the (cid:133)rm-level expected 37

Table 7: VAR with Aggregate Variables Explanatory variable Joint sigre s(cid:28) (cid:28) DD re s(cid:28) (cid:28)DD R2 ni(cid:133)cance (cid:27)(E [y ]) i;t i;t (cid:0) i;t i;t EW;t EW;t (cid:0) EW;t t t+1 VAR estimates: A 100 (cid:2) re 1.63 2.15 -2.39 12.09 0.81 -0.65 0.01 [0.000] 0.33 i;t+1 (3.35) (0.95) (1.11) (5.51) (1.06) (4.87) s(cid:28) 3.48 96.17 0.69 -14.69 0.09 1.34 0.91 [0.000] i;t+1 (5.17) (1.22) (1.31) (5.83) (1.10) (4.93) (cid:28) DD -0.15 0.05 98.16 1.11 -0.46 -2.16 0.99 [0.000] i;t+1 i;t+1 (cid:0) (0.04) (0.01) (0.29) (0.40) (0.11) (0.61) re 0 0 0 16.70 3.69 -0.66 0.05 [0.000] EW;t+1 (10.35) (2.52) (9.46) s(cid:28) 0 0 0 -22.28 99.95 7.60 0.93 [0.000] EW;t+1 (11.00) (2.61) (10.03) (cid:28)DD 0 0 0 2.37 -0.95 93.67 0.97 [0.000] EW;t+1 (cid:0) (0.93) (0.28) (1.50) Long-run regression coe¢ cients: (cid:26)j 1l -0.03 0.52 0.85 0.05 -0.47 -0.77 5.03 (cid:0) i;t+j (0.02) (0.16) (0.34) (0.07) (0.36) (0.70) P (cid:26)j 1re 0.03 0.48 -0.85 -0.05 0.47 0.77 6.77 (cid:0) i;t+j (0.02) (0.16) (0.34) (0.07) (0.36) (0.70) P Implications of VAR estimates: (cid:27)(s(cid:28)l) (cid:27)(s(cid:28)r) corr s(cid:28)l;s(cid:28) corr(s(cid:28)r;s(cid:28)) corr s(cid:28)l;s(cid:28)r (cid:27)(s(cid:28)) (cid:27)(s(cid:28)) Estimates 0.48 0.64 0.859 0.925 0.599 (cid:0) (cid:1) (cid:0) (cid:1) (0.16) (0.10) (0.137) (0.102) (0.404) Diversi(cid:133)cation e⁄ects: s(cid:28)l s(cid:28)r EW EW Std. 0.95 4.05 Diversi(cid:133)cation factor 0.10 1.19 The sample period is monthly from 1973 to 2011. re is the log return on the corporate bond i in excess of i;t the matching Treasury bond, l is the credit loss on bond i, l is the credit loss implied from re ;s(cid:28) i;t i0;t i;t i;t 1 and s(cid:28) based on (4); s(cid:28) is the credit spread,eDD is the distance to default, and (cid:28) is the bond(cid:0)(cid:146)s i;t i;t i;t i;t duration. Thevariabless(cid:28)l i;t eands(cid:28)r i;t arethesumofexpecteedlong-rundiscountedcreditlossan e dth f esum of exfpected long-run discfounted excess returns, de(cid:133)ned by s(cid:28)l i;t = e L A(I (cid:0) (cid:26)A)(cid:0) 1 I (cid:0) ((cid:26)A)T (cid:0) t X i;t and s(cid:28)r i;t =e 1 A(I (cid:0) (cid:26)A)(cid:0) 1 I (cid:0) ((cid:26)A)T (cid:0) t X i;t . Thecolumn(cid:27)(E t [y t+1 ])showsthesamp(cid:16)lestandardde(cid:17)viationof (cid:133)tted values of the left-(cid:16)hand side vari(cid:17)ables. Standard errors, reported in parentheses under each coe¢ cient, are clustered by time, and p-values are reported in brackets. The matrix A and the associated standard errors are multiplied by 100. The variables with subscript EW are the equal-weighted average over bonds, computedeverymonth. Thediversi(cid:133)cationfactoris(cid:27)2 s(cid:28)u =(cid:27)(cid:22)2 s(cid:28)u ;where(cid:27)(cid:22)2 s(cid:28)u isthevariance EW;t i;t i;t of s(cid:28)u over t averaged across bonds. i;t (cid:0) (cid:1) (cid:0) (cid:1) (cid:0) (cid:1) 38

Table 8: Subsamples Based on Credit Ratings with Macro Variables AAA/AA A BBB BB B CCC/C IG Junk (cid:27)(s(cid:28)l) 0.07 0.14 1.93 6.49 10.08 15.18 1.12 8.92 (cid:27)(s(cid:28)l)=(cid:27)(s(cid:28)) 0.02 0.03 0.29 0.60 0.69 0.58 0.20 0.61 (0.03) (0.03) (0.15) (0.28) (0.33) (0.20) (0.12) (0.19) (cid:27)(s(cid:28)r) 3.34 4.20 5.50 6.35 5.66 14.08 4.75 6.88 (cid:27)(s(cid:28)r)=(cid:27)(s(cid:28)) 0.98 0.97 0.83 0.58 0.39 0.54 0.86 0.47 (0.02) (0.01) (0.10) (0.17) (0.23) (0.15) (0.07) (0.14) Gathering rating-based estimates of s(cid:28)l and s(cid:28)r into one bin (cid:27)() (cid:27)()=(cid:27)(s(cid:28)) corr(;s(cid:28)r) (cid:27)() Div. factor (cid:1) (cid:1) (cid:1) (cid:1) s(cid:28)l 6.96 0.63 0.01 s(cid:28)l 0.72 0.04 EW s(cid:28)r 7.91 0.71 1 s(cid:28)r 3.74 1.24 EW The sample period is monthly from 1973 to 2011. The top panel shows the estimated volatility ratios based on the VARs with macro variables, X = AX +BW , estimated separately for each credit i;t+1 i;t i;t+1 rating. The results are based on (cid:133)ve-year bonds and the bonds are sorted into subsamples based on their credit ratings when they are (cid:133)ve years to maturity. The long-run expected credit loss is de(cid:133)ned by s(cid:28)l = i;t e GX and the long-run expected returns are de(cid:133)ned by s(cid:28)r = e GX . IG is the investment grade, L i;t i;t 1 i;t which includes AAA/AA, A and BBB, while Junk includes BB, B and CCC/C. Standard errors, reported in parentheses, are clustered by time. The number of observations is 21,797 for AAA/AA, 52,524 for A, 60,960 for BBB, 28,507 for BB, 28,923 for B, and 4,495 for CCC/C bonds. The bottom panel collects the separately estimated s(cid:28)l and s(cid:28)r for AAA/AA to CCC/C into one sample, and computes its summary i;t i;t statistics. The variables s(cid:28)l and s(cid:28)r are the equal-weighted average of s(cid:28)r and s(cid:28)r over i. Div. EW EW i;t i;t factor is the diversi(cid:133)cation factor de(cid:133)ned by (cid:27)2(s(cid:28)u )=(cid:27)(cid:22)2 s(cid:28)u , where (cid:27)(cid:22)2 s(cid:28)u is the average variance EW i;t i;t over bond i. (cid:0) (cid:1) (cid:0) (cid:1) 39

excess returns and credit loss. I forecast the investment rate, measured by the ratio of the capital expenditures this (cid:133)scal year to the capital (measured by property, plant and equipment) at the end of the previous year, using the variables as of the end of the previous year. Since (cid:133)rms(cid:146)(cid:133)scal years end in di⁄erent months, I use monthly bond data to (cid:133)nd the exact (cid:133)scal year end. I also use the book-to-market ratio, pro(cid:133)tability, sales-to-capital ratio, idiosyncratic volatility and lagged investment rate of the (cid:133)rm as a control, following Gilchrist, Sim and Zakraj(cid:154)ek (2013). For this exercise, I exclude (cid:133)nancial (cid:133)rms (SIC codes from 6000 to 6800), as the nature of investment and capital is di⁄erent for the (cid:133)nancial and non(cid:133)nancial industries. First, I focus on the individual (cid:133)rm-level variation and run pooled OLS regressions. I demean all variables using the cross-sectional average every month, and exploit the crosssectional variation. Panel A of Table 9 shows the results of the forecasting regressions. The (cid:133)rst two columns show the forecasting regressions using each component of credit spreads separately, controlling only for the lagged value of the investment rate. Both the expected credit loss and excess return components negatively forecast the investment next period. Larger expected credit loss and excess returns lead to lower investment next period. I include all the other control variables in the next two columns. The forecasting power of the expected credit loss increases with more controls, while it decreases for the expected excess returns. When the expected credit loss rises by 1 percent, the investment rate falls by 0.33 percent next year. In contrast, a 1-percent rise in the expected excess returns leads to a 0.09 percent decrease in the investment rate, which is statistically insigni(cid:133)cant. The results are similar when I include both the expected credit loss and excess returns in one regression, as shown in the last column. Thus, at the (cid:133)rm level, the expected default component plays a major role in a⁄ecting individual (cid:133)rms(cid:146)investment decisions. To square with the market-level results in Gilchrist and Zakraj(cid:154)ek (2012), I also take the equal-weighted average of all variables every year to obtain the market-level variable and run 40

time-seriesregressions. PanelBofTable9showstheestimatedcoe¢ cientsoftheforecasting regressions at the market level. When the expected credit loss and excess returns are used separately, but withthe laggedinvestment rate as acontrol, only the expectedexcess returns predict a decrease in the investment rate next year. In contrast, the expected credit loss is economically and statistically insigni(cid:133)cant. When put together with other control variables, the expected return component becomes an even better predictor of investment, while the expected credit loss component remains insigni(cid:133)cant. Figure 3 shows the time series of the aggregate investment rate, expected credit loss and expected excess returns. The negative correlation between the investment rate and the expected excess return component of credit spreads (forwarded one year) is evident throughout the sample period. In contrast, the expected credit loss component moves little over time and is unrelated to the investment rate. Gilchrist and Zakraj(cid:154)ek (2012) interpret the credit spreads as (cid:147)a crucial gauge of the degree of strains in the (cid:133)nancial system.(cid:148) They argue that (cid:147)A reduction in the supply of credit(cid:151)an increase in the excess bond premium(cid:151)causes a drop in asset prices and a contractionineconomicactivitythroughthe(cid:133)nancialacceleratormechanisms....(cid:148) Although it is reasonable to conjecture that much of the risk premium variation in corporate bonds comes from the shocks to (cid:133)nancial intermediaries, it is not obvious that (cid:133)rms should react more to the risk premium variation than to the expected cash (cid:135)ow variation. Tobin(cid:146)s q theory does not discriminate between the risk premium variation and cash (cid:135)ow variation as a determinant of investment. A (cid:133)rm should change its investment in response to a changing market value of its assets, regardless of whether the change comes from the risk premium or cash (cid:135)ow shocks. Thus, in an e¢ cient market with no frictions, investment must respond to both types of shocks to asset prices in the same way. However, there are several reasons why the decomposition of shocks to bond prices can show di⁄erent results in the data. First, the Black-Scholes (1973) and Merton (1974) model implies that the price of a corporate bond is a nonlinear function of the price of the underlying assets. As the option 41

delta changes with the asset values, the price of a corporate bond is an increasing concave function of the underlying assets. This relationship implies that Tobin(cid:146)s average q is a convex function of the credit spreads. Phillipon (2009) con(cid:133)rms this intuition based on a structural model of debt (see his Figure 1). The convexity implies that, for a (cid:133)rm with low spread bonds, a change in the credit spread corresponds to a large change in Tobin(cid:146)s q. In contrast, for a (cid:133)rm with junk bonds, Tobin(cid:146)s q is relatively insensitive to the change in the credit spread. Recall from Table 4 and Table 8 that much of the variation in IG bonds(cid:146) spread corresponds to the risk premium variation, while the expected default component is more volatile for junk bonds. Taking these pieces of evidence together, we might expect that the risk premium variation, which dominates the credit spread variation for IG bonds, should be more informative about the future investment than the cash (cid:135)ow variation. Second, Stein(1996)arguesthat, ifinvestorsareirrationalanda(cid:133)rmmanagerisrational, then the manager whose (cid:133)rm is not (cid:133)nancially constrained should ignore the risk premium variation in making an investment decision to maximize the long-run (cid:133)rm value. In the market with irrational investors, the risk premium variation is simply a re(cid:135)ection of timevarying mispricing, which gets corrected over time. Thus, a rational manager should not set thehurdlerateforaninvestmentprojectbasedontheshort-term(cid:135)uctuationofmarketprices and should instead focus on the properties of the expected cash (cid:135)ow from the project. If (cid:133)rm managers follow this advice in reality, then the variation in the expected excess returns shouldnotpredictinvestments, whiletheexpectedcreditlosscomponentshould. Consistent with this view of the (cid:133)nancial market, Greenwood and Hanson (2013) (cid:133)nd some evidence for time-varying mispricing in the corporate bond market. Third, the expected default component can a⁄ect (cid:133)rms(cid:146)investment decisions due to managerialfrictionsandmarketsegmentations. Forexample, debtoverhangofMyers(1977) suggests that a (cid:133)rm with too much debt may reduce its investment suboptimally, forgoing safe but pro(cid:133)table projects. Debt overhang works even if there is no risk premium, and the debt issued by the (cid:133)rm is fairly priced. If the (cid:133)rm is close to default, the con(cid:135)ict between 42

bond holders and equity holders intensi(cid:133)es, which a⁄ects the (cid:133)rm(cid:146)s investment decisions. Thus, a rise in the expected default component can reduce investment through an additional channel due to managerial frictions. As the three explanations work in opposite directions, which part of the credit spreads better forecasts investment is unclear based on the existing theories. Thus, I have to let the data tell which e⁄ects seem to dominate the others. The empirical results above show a strikingdi⁄erenceintheinformationcontentofthecreditspreadsatthe(cid:133)rmlevel andatthe market level. The shocks to the risk premium are mostly systematic, and these systematic shocks a⁄ect (cid:133)rms(cid:146)collective investment decision. In the aggregate, there is little evidence that the variation in bond mispricing and debt overhang problem a⁄ect investment. In contrast, the variation in default risk is the key to understanding the (cid:133)rm-level variation in investment. Thus, my (cid:133)ndings are consistent with the interpretation that much of the bond mispricing and frictions among the (cid:133)rm(cid:146)s stake holders, if they exist, are (cid:133)rm-speci(cid:133)c, and a⁄ect only the individual (cid:133)rms(cid:146)investment decision, not the aggregate investment. Figure 3: Equal-Weighted Average Investment Ratio, Expected Credit Loss and Expected Excess Returns 0.15 0.3 0.2 0.1 0.1 0.05 0 0.1 0 0.2 0.05 Expected Credit Loss (Left) 0.3 ExpectedExcess Returns (Left) Investment (Right) 0.1 0.4 1975 1980 1985 1990 1995 2000 2005 2010 The(cid:133)gureplotstheequal-weightedaverageinvestmentratio, logI=K , expectedcreditloss, s(cid:28)l , and EW;t EW;t expected excess returns, s(cid:28)r . The expected credit loss and excess returns are moved forward by one EW;t year, so I plot s(cid:28)l and s(cid:28)r in t+1. All variables are demeaned. EW;t EW;t 43

Table 9: Investment Forecasting Regressions: Firm-Level Annual Data From 1973 to 2012 Panel A: Individual Firms Panel B: Equal-Weighted Market Portfolios Left-hand side variable: logI=K Left-hand side variable: logI=K k;t+12 EW;t+12 E [ (cid:26)j 1l ] -0.25 -0.33 -0.37 0.13 2.24 -1.37 t j (cid:0) k;t+j (0.11) (0.12) (0.15) (2.34) (1.97) (1.29) P E [ (cid:26)j 1re ] -0.37 -0.09 -0.17 -2.45 -4.09 -4.19 t (cid:0) k;t+j (0.12) (0.12) (0.14) (0.32) (0.39) (0.46) P logB=M -0.13 -0.13 -0.13 -0.15 -0.13 -0.11 k;t (0.02) (0.02) (0.02) (0.04) (0.02) (0.03) log(cid:5)=K 0.03 0.03 0.03 5.30 -0.50 -0.54 k;t (0.12) (0.12) (0.12) (1.46) (0.66) (0.66) logY=K 0.06 0.06 0.06 0.49 0.58 0.58 k;t (0.01) (0.01) (0.01) (0.15) (0.10) (0.11) log(cid:27)IV 0.02 0.02 0.03 -0.02 0.05 0.06 k;t (0.02) (0.02) (0.02) (0.03) (0.03) (0.03) logI=K 0.72 0.71 0.64 0.64 0.64 0.55 0.38 0.37 0.09 0.07 k;t (0.02) (0.02) (0.02) (0.02) (0.02) (0.11) (0.09) (0.09) (0.05) (0.05) R(cid:22)2 0.519 0.519 0.551 0.551 0.551 0.220 0.590 0.457 0.845 0.843 Panel A shows the result of the forecasting regression of the log investment rate for (cid:133)rm k over the period betweentandt+12,logI=K ,usingthecomponentsofcreditspreadsinmontht. Theforecastingcoefk;t+12 (cid:133)cients are estimated using pooled OLS regressions. The variables E [ (cid:26)j 1l ] and E [ (cid:26)j 1re ] t j (cid:0) k;t+j t j (cid:0) k;t+j are the long-run expected credit loss and excess returns for the (cid:133)ve-year bonds estimated using the credit P P rating-basedsubsamples,asinTable4. Thecomponentsofcreditspreadsfor(cid:133)rmk arecomputedbytaking the average of all bonds issued by (cid:133)rm k each month. The variable logB=M is the log book-to-market k;t ratiocomputedfollowingFamaandFrench(1993),log(cid:5)=K isthelogpro(cid:133)tability(operatingpro(cid:133)tdivided k;t by property, plant and equipment), logY=K is the log ratio of sales to capital, log(cid:27)IV is the log idiosynk;t k;t cratic volatility computed following Ang, Hodrick, Xing and Zhang (2006), and logI=K is the lagged log k;t investment rate for (cid:133)rm k in the (cid:133)scal year ending in month t. R(cid:22)2 is an adjusted R-squared. Standard errors, reported in parentheses, are clustered by time and adjusted for autocorrelation with Newey-West 12 lags. Allvariablesaredemeanedusingtheequal-weightedaverageeverymonth,andwinsorizedatthe0.1th and 99.9th percentile. The number of observations is 7,300 (cid:133)rm years. Panel B shows the result of the forecasting regression of the equal-weighted average of the log investment rate from t to t+12; logI=K . All explanatory variables are also the equal-weighted average of the EW;t+12 individual(cid:133)rms. Standarderrors,reportedinparentheses,areadjustedforautocorrelationwithNewey-West 3 lags. The number of observations is 39 years. 44

5 Conclusion I showthat thecredit spreads of corporatebonds canbedecomposedintoanexpectedexcess return component and an expected credit loss component without relying on a particular model of default. Applying the Campbell-Shiller (1988a) style decomposition, I relate the credit spread of corporate bonds to the sum of discounted excess returns and credit loss in the future. Since the relationship among these variables can be log-linearized, I can use VARs to obtain the long-horizon forecasting coe¢ cients and volatility ratios. I show that about half of the cross-sectional variation of the credit spreads corresponds to changes in the risk premium, and its volatility is as large as that of the expected credit loss. By estimating the VARs including market-level variables, I contrast the (cid:133)rm- or bondlevel results with the aggregate market results. Though the expected credit loss is as important as the expected excess returns at the individual bond level, the risk premium component is the dominating factor in the aggregate credit spread dynamics. Since much of the expected default loss at the security level is idiosyncratic, the credit loss components are mostly diversi(cid:133)ed away in the aggregate market, and their aggregate volatility is small. Consistent with Gilchrist and Zakraj(cid:154)ek (2012), at the market-level, the predictability of investment activities based on the credit spreads comes mostly from the risk premium component. However, at the (cid:133)rm level, the results are the opposite. The expected credit loss component of credit spreads a⁄ects individual (cid:133)rm(cid:146)s investment decisions more than the risk premium component does. At the (cid:133)rm-level, the expected credit loss component does not only vary more, but also carry useful information in forecasting future investment activities than at the market level. One analysis left for the future project is to explore the role of illiquidity in corporate bonds within the variance decomposition framework. If an investor expects the corporate bond will become illiquid when she has to sell in the future, then she might discount the valuation of the bond today, leading to a variation in credit spreads. In Appendix D, I 45

propose a three-way decomposition of credit spreads in which the spreads are driven by the expected credit loss, excess returns and illiquidity. I show that the model-free approach presented in this paper can be easily extended to account for illiquidity. However, since the illiquidity measures typically require high frequency price observations and/or trading volume data only available from Mergent FISD or TRACE, the sample size will become too small to apply the model-free method. Thus, to decompose the credit spreads taking illiquidity concern into account, we will have to resort to proxies for the default component, such as distance to default or the CDS spreads, or wait until the TRACE data accumulates long enough to cover several credit cycles. 46

References [1] Amihud, Yakov, 2002, Illiquidity and Stock Returns: Cross-Section and Time-Series E⁄ects, Journal of Financial Markets 5, 31(cid:150)56. [2] Ang, Andrew, Robert J. Hodrick, Yuhang Xing and Xiaoyan Zhang, 2006, The Cross- Section of Volatility and Expected Returns, Journal of Finance 61, 259-299. [3] Bao, Jack, Jun Pan and Jiang Wang, 2011, The Illiquidity of Corporate Bonds, Journal of Finance 66, 911-946. [4] Beber, Alessandro, Brandt, Michael W. and Kavajecz, Kenneth A., 2009, Flight-to- Quality or Flight-to-Liquidity? Evidence from the Euro-Area Bond Market, Review of Financial Studies 22: 925-957. [5] Bhamra, Harjoat S., Lars-Alexander Kuehn and Ilya A. Strebulaev 2010, The Levered Equity Risk Premium and Credit Spreads: A Uni(cid:133)ed Framework, Review of Financial Studies 23, 2, 645-703. [6] Bongaerts, Dion, 2010, Overrated Credit Risk: Three Essays on Credit Risk in Turbulent Times, Working Paper. [7] Campbell, John Y., 1991, A Variance Decomposition for Stock Returns, Economic Journal 101, 405, 157-179. [8] Campbell, John Y., and Robert J. Shiller, 1988a, The dividend-price ratio and expectations of future dividends and discount factors, Review of Financial Studies 1, 195-228. [9] Campbell, John Y., and Robert J. Shiller, 1988b, Stock Prices, Earnings, and Expected Dividends, Journal of Finance 43, 3, 661-676. [10] Chen, Hui, 2010, Macroeconomic Conditions and the Puzzles of Credit Spreads and Capital Structure, Journal of Finance 65, 6, 2171-2212. [11] Chen, Long, David A. Lesmond, Jason Wei, 2007, Corporate Yield Spreads and Bond Liquidity, Journal of Finance 62, 1, 119-149. [12] Chen, Long, 2009, On the reversal of return and dividend growth predictability: A tale of two periods, Journal of Financial Economics 92, 128-151. [13] Chen, Long, Pierre Collin-Dufresne and Robert S. Goldstein, 2009, On the Relation BetweentheCreditSpreadandtheEquityPremiumPuzzle,Reviewof Financial Studies 22 (9), 3367-3409. [14] Chen, Nai-Fu, Richard Roll and Stephan A. Ross, 1986, Economic Forces and the Stock Market, Journal of Business 59, 383-403. [15] Cochrane, John H., 2008, The dog that did not bark: A defense of return predictability, Review of Financial Studies 21, 1533-1575. 47

[16] Cochrane, John H., 2011, Discount rates, Journal of Finance 66, 1047-1108. [17] Collin-Dufresne, Pierre, Robert S. Goldstein, 2001, Do credit spreads re(cid:135)ect stationary leverage ratios?, Journal of Finance 56, 1929-1957. [18] Collin-Dufresne, Pierre, Robert S. Goldstein and J. Spencer Martin, 2001, The Determinants of Credit Spread Changes, Journal of Finance 56, 6, 2177-2207. [19] Crabbe, Leland E., 1991, Callable corporate bonds: A vanishing breed, FEDS working paper #155, Board of Governors of the Federal Reserve System. [20] Crabbe, Leland E. and Jean Helwege, 1994, Alternative Tests of Agency Theories of Callable Corporate Bonds, Financial Management 23, 4, 3-20. [21] Dick-Nielsen, Jens, Peter Feldh(cid:252)tter and David Lando, 2012, Corporate Bond Liquidity Before and After the Onset of the Subprime Crisis, Journal of Financial Economics 103, 471-492. [22] Driessen, Joost, 2005, Is Default Event Risk Priced in Corporate Bonds?, Review of Financial Studies 18, 1, 165-195. [23] Du⁄ee, Gregory R., 1999, Estimating the Price of Default Risk, Review of Financial Studies 12, 197-226. [24] Du¢ e,DarrellandKennethJ.Singleton,1999,ModelingTermStructuresofDefaultable Bonds, Review of Financial Studies 12, 687-720. [25] Elton, Edwin J., Martin J. Gruber, Deepak Agrawal, Christopher Mann, 2001, Explaining the Rate Spread on Corporate Bonds, Journal of Finance 56, 1, 247-277. [26] Edwards, Amy K., Lawrence E. Harris and Michael S Piwowar, 2007, Corporate Bond Market Transaction Costs and Transparency, Journal of Finance 62, 1421-1451. [27] Fama, Eugene F., 1984, Forward and Spot Exchange Rates, Journal of Monetary Economics 14, 319(cid:150)38. [28] Fama, EugeneF.andRobertR,Bliss, 1987, TheInformationinLong-MaturityForward Rates, American Economic Review 77, 4, 680-692. [29] Feldh(cid:252)tter,Peter,2012,TheSameBondatDi⁄erentPrices: IdentifyingSearchFrictions and Selling Pressures, Review of Financial Studies 25, 1155-1206. [30] Giesecke, Kay, Francis A. Longsta⁄, Stephen Schaefer and Ilya Strebulaev, 2011, Corporate bond default risk: A 150-year perspective, Journal of Financial Economics 102, 233-250. [31] Gilchrist, Simon, Vladimir Yankov and Egon Zakraj(cid:154)ek, 2009, Credit market shocks and economic (cid:135)uctuations: Evidence from corporate bond and stock markets, Journal of Monetary Economics 56, 471-493. 48

[32] Gilchrist, Simon, and Egon Zakraj(cid:154)ek, 2012, Credit Spreads and Business Cycle Fluctuations, American Economic Review 102, 4, 1692-1720. [33] Gilchrist,Simon,JaeW.SimandEgonZakraj(cid:154)ek,2013,Uncertainty,FinancialFrictions and Irreversible Investment, Working Paper. [34] Goyal, AmitandIvoWelch, 2008, AComprehensiveLookattheEmpiricalPerformance of Equity Premium Prediction, Review of Financial Studies 21, 1455-1508. [35] Greenwood, Robin and Samuel G. Hanson, 2013, Issuer Quality and Corporate Bond Returns, Review of Financial Studies 26, 1483(cid:150)1525. [36] Gropp, Reint, JukkaVesalaandGiuseppeVulpes, 2006, Equityandbondmarketsignals as leading indicators of bank fragility, Journal of Money, Credit and Banking 38, 399- 428. [37] Harada, Kimie, Takatoshi Ito and Shuhei Takahashi, 2010, Is the distance to default a good measure in predicting bank failures? Case studies, Working paper. [38] Hansen, Lars.P.andRobertJ.Hodrick, 1980, ForwardExchangeRatesasOptimalPredictors of Future Spot Rates: An Econometric Analysis, Journal of Political Economy 88, 829(cid:150)53. [39] Huang, Jingzhi and Ming Huang, 2012, How Much of the Corporate-Treasury Yield Spread is Due to Credit Risk?, Review of Asset Pricing Studies 2, 153-202. [40] Leland, Hayne E., 1994, Corporate Debt Value, Bond Covenants, and Optimal Capital Structure, Journal of Finance 49, 4, 1213-1252. [41] Lewellen, Jonathan W., 2014, The cross section of expected stock returns, Critical Finance Review, forthcoming. [42] Longsta⁄, Francis, Sanjay Mithal and Eric Neis, 2005, Corporate yield spreads: Default riskorliquidity? Newevidencefromthecredit-defaultswapmarket, Journal of Finance 60, 2213-2253. [43] Lin, Hai, Junbo Wang and Chunchi Wu, 2011, Liquidity risk and expected corporate bond returns, Journal of Financial Economics 99, 628-650. [44] Lustig, Hanno, and Adrien Verdelhan, 2007, The Cross-section of Foreign Currency Risk Premia and Consumption Growth Risk, American Economic Review 97, 89(cid:150)117. [45] McAndrews, James, Asani Sarkar and Zhenyu Wang, 2008, The E⁄ect of the Term Auction Facility on the London Inter-Bank O⁄ered Rate, Federal Reserve Bank of New York Sta⁄ Report 335. [46] Merton, Robert C., 1974, On the Pricing of Corporate Debt: The Risk Structure of Interest Rates, Journal of Finance 29, 449-470. [47] Moody(cid:146)s, 1999, Historical Default Rates of Corporate Bond Issuers, 1920-1998. 49

[48] Moody(cid:146)s, 2011, Corporate Default and Recovery Rates, 1920-2010. [49] Myers, Stewart C., 1977, Determinants of Corporate Borrowing, Journal of Financial Economics 5, 147(cid:150)175. , [50] PÆstor, Lubo(cid:154)and Robert F. Stambaugh, 2003, Liquidity Risk and Expected Stock Returns, Journal of Political Economy 111, 3, 642-685. [51] Roll, Richard, 1984, A Simple Implicit Measure of the E⁄ective Bid-Ask Spread in an E¢ cient Market, Journal of Finance 39, 1127-1139. [52] Schwartz, Krista, 2013, Mind the Gap: Disentangling Credit and Liquidity in Risk Spreads, Working Paper. [53] Stambaugh, Robert F., 1999, Predictive Regressions, Journal of Financial Economics 54, 375-421. [54] Stein, Jeremy C., 1996, Rational capital budgeting in an irrational world, Journal of Business 69, 429-455. [55] Taylor, John B. and John C. Williams, 2009, A Black Swan in the Money Market, American Economic Journal, Macroeconomics, 1: 58-83. [56] Vuolteenaho,Tuomo,2002,WhatDrivesFirm-LevelStockReturns?,JournalofFinance 57, 1, 233-264. [57] Warga, Arthur and Ivo Welch, 1993, Bondholder Losses in Leveraged Buyouts, Review of Financial Studies 6, 959-982. 50

A Derivation of the Credit Spread Decomposition In this appendix, I show the detailed derivation of (4). First, I assume that the recovery rate for the coupon upon default is the same as that of the principal. Formally, I assume Cf i;t = exp(l ). (10) i;t C i;t Furthermore, I make the technical assumption that after a default occurs, the investor buys the Treasury bond with the coupon rate equal to the original coupon rate, C , and short the i same bond so that the credit spreads and excess returns are always zero. I log-linearize returns on corporate bond i such that r (cid:26)(cid:14) (cid:14) +(cid:1)c +const, (11) i;t+1 i;t+1 i;t i;t+1 (cid:25) (cid:0) where (cid:14) logP =C and (cid:1)c logC =C . i;t i;t i;t i;t+1 i;t+1 i;t (cid:17) (cid:17) Similarly, I log-linearize returns on the matching Treasury bonds using the same expansion point, (cid:26): rf (cid:26)(cid:14)f (cid:14)f +(cid:1)cf +const, (12) i;t+1 (cid:25) i;t+1 (cid:0) i;t i;t+1 where (cid:14) logPf =Cf and (cid:1)cf logCf =Cf . i;t (cid:17) i;t i;t i;t+1 (cid:17) i;t+1 i;t Subtracting (12) from (11) yields r rf (cid:26) (cid:14)f (cid:14) + (cid:14)f (cid:14) (cid:1)cf (cid:1)c +const. (13) i;t+1 (cid:0) i;t+1 (cid:25) (cid:0) i;t+1 (cid:0) i;t+1 i;t (cid:0) i;t (cid:0) i;t+1 (cid:0) i;t+1 (cid:16) (cid:17) (cid:16) (cid:17) (cid:16) (cid:17) 51

The second term of (13) can be written as Pf C (cid:14)f (cid:14) = log i;t i;t ; i;t (cid:0) i;t P i;t C i f ;t ! Pf log i;t if t = t = Pi;t 6 D ; 8 (cid:18) (cid:19) > 0 if t = t < D = s(cid:28) : (14) >i;t : In the second equality, I use the fact that the matching Treasury bond has the same coupon rate as the corporate bond, as well as the de(cid:133)nition of l in (6) and the assumption in (10). i;t The last term of (13) is Cf C (cid:1)cf (cid:1)c = log i;t+1 i;t : i;t+1 (cid:0) i;t+1 C i;t+1 C i f ;t ! This term can be thought of separately for the three cases: (i) When t = t and t+1 = t , D D 6 6 we have Cf =C = Cf =C = 1 as the matching Treasury bond has the same coupon i;t+1 i;t+1 i;t i;t rate. (ii) When t = t and t+1 = t , we have Cf =C = exp(l ) by assumption 6 D D i;t+1 i;t+1 i;t+1 (10), and Cf =C = 1. (iii) When t = t and t + 1 = t , we have C =Cf = exp( l ). i;t i;t D 6 D i;t i;t (cid:0) i;t However, as I assume that right after the default (time t+), the investor buys the bond with the coupon rate equal to C , we have Cf = C = Cf = C = C , so that i i;t+1 i;t+1 i;t+ i;t+ i (cid:1)cf (cid:1)c = log C i f ;t+1Ci;t+ = 0. Combining the three cases, we have i;t+1 (cid:0) i;t+1 Ci;t+1Cf (cid:18) i;t+(cid:19) (cid:1)cf (cid:1)c = l . (15) i;t+1 (cid:0) i;t+1 i;t+1 Plugging (14) and (15) into (13) leads to the one-period pricing identity in (4). In the decomposition of the credit spread in (4), there are no terms involving coupon rates, C or Cf . Since I work on excess returns rather than returns, the coupons from i;t i;t corporate bonds tend to o⁄set the coupons from the matching Treasury bonds. In addition, 52

I make the assumption in (10), and thus I completely eliminate the coupon payment from the approximated log excess returns. This feature of the excess returns is convenient as I work on monthly returns. Otherwise, the strong seasonality of coupon payments would make it necessary to use the annual frequency rather than the monthly frequency. Due to the o⁄setting nature of the excess returns over matching Treasury bonds, I can work on monthly series without adjusting for seasonality. B Data B.1 Corporate Bond Database In this section, I provide a more detailed description of the panel data of corporate bond prices. I obtain monthly price observations of senior unsecured corporate bonds from the followingfourdatasources. First, fortheperiodfrom1973to1997, IusetheLehmanBrothers Fixed Income Database, which provides month-end bid prices. Since Lehman Brothers used these prices to construct the Lehman Brothers bond index while simultaneously trading it, the traders at Lehman Brothers had an incentive to provide correct quotes. Thus, although the prices in the Lehman Brothers Fixed Income Database are quote-based, they are considered reliable. In the Lehman Brothers Fixed Income Database, some observations are dealers(cid:146)quotes while others are matrix prices. Matrix prices are set using algorithms based on the quoted prices of other bonds with similar characteristics. Though matrix prices are less reliable than actual dealer quotes (Warga and Welch (1993)), I choose to include matrix prices in our main result to maximize the power of the test. However, I also repeat the main exercise below and show that the results are robust to the exclusion of matrix prices. Second, fortheperiodfrom1994to2011, I usetheMergent FISD/NAICDatabase. This database consists of actual transaction prices reported by insurance companies. Third, for 53

the period from 2002 to 2011, I use TRACE data, which provides actual transaction prices. TRACE covers more than 99 percent of the OTC activities in U.S. corporate bond markets after 2005. The data from Mergent FISD/NAIC and TRACE are transaction-based data, and therefore the observations are not exactly at the end of months. Thus, I use only the observations that are in the last (cid:133)ve days of each month. If there are multiple observations in the last (cid:133)ve days, I use the latest one and treat it as a month-end observation. Lastly, I use the DataStream database, which provides month-end price quotes from 1990 to 2011. TRACE includes some observations from the trades that are eventually cancelled or corrected. I drop all cancelled observations, and use the corrected prices for the trades that are corrected. I also drop all the price observations that include dealer commissions, as the commission is not re(cid:135)ecting the value of the bond, and these prices are not comparable to the prices without commissions. Since there are some overlaps among the four databases, I prioritize in the following order: the Lehman Brothers Fixed Income Database, TRACE, Mergent FISD/NAIC and DataStream. The number of overlaps is not large relative to the total size of the data set, with the largest overlaps between TRACE and Mergent FISD making up 3.3% of the nonoverlapping observations. To check the data consistency, I examine the e⁄ect of priority orderingbyreversingthepriority, andthee⁄ectofthepricedi⁄erenceontheempiricalresult below. To classify the bonds based on credit ratings, I use the ratings of Standard & Poor(cid:146)s when available, and use Moody(cid:146)s ratings when Standard & Poor(cid:146)s rating is not available. To identify defaults in the data, I use Moody(cid:146)s Default Risk Service, which provides a historical record of bond defaults from 1970 onwards. The same source also provides the secondary-market value of the defaulted bond one month after the incident. If the price observation in the month when a bond defaults is missing in the corporate bond database, I add the Moody(cid:146)s secondary-market price to my data set in order to include all default 54

observations in the sample. B.2 The E⁄ects of Matrix Prices I repeat the main VAR in Table 2 excluding the observations based on matrix prices. By removing matrix prices, the number of observations decrease to 386,697 bond months from 546,815 bond months in the original dataset. The results without matrix prices are reported in Table 10. The resulting VARcoe¢ cients and volatility ratios are similar to those in Table 2. B.3 Comparing Overlapping Data Sources Table 11 compares the summary statistics of the monthly returns of corporate bonds in my sample (Panel A) with the alternative database, which uses the reverse priority (Panel B). Namely, in constructing the alternative database, I prioritize in the following order: DataStream, Mergent FISD/NAIC, TRACE and the Lehman Brothers Fixed Income Database. To see a detailed picture, I tabulate the returns based on credit ratings and time periods. I split the sample into two periods: January 1973 to March 1998 and April 1998 to December 2011. I choose the cuto⁄of March 1998 because the Lehman Brothers Fixed Income Database is available up to March 1998. As there are more duplicate observations after April 1998, the latter period may show a greater di⁄erences between the two priority orders. Comparing the distribution of bond returns in Panel A with that in Panel B, there is very little di⁄erence at any rating category or in any time period. The greatest discrepancy is found in junk bonds from January 1973 to March 1998. The mean for the sample used in this paper is 1.35 percent with the standard deviation of 51.42 percent, while they are 1.20 percent and 35.10 percent in the alternative sample. As the most of the percentiles coincide between the two distributions, the di⁄erence comes from the maximum of the distribution. 55

Table 10: The Dataset without Matrix Prices Explanatory variable Joint sigre s(cid:28) (cid:28) DD R2 ni(cid:133)cance (cid:27)(E [y ]) i;t i;t (cid:0) i;t i;t t t+1 Regression of credit loss on information: l -7e.62 f3.28 g2.61 0.03 [0.000] 0.40 i;t+1 (4.88) (1.10) (1.17) el -6.70 2.71 2.32 0.03 [0.009] 0.34 i0;t+1 (4.62) (0.98) (1.04) e VAR estimates: A 100 (cid:2) re 4.93 1.72 -1.91 0.01 [0.000] 0.21 i;t+1 (3.55) (1.02) (1.47) s(cid:28) 1.79 96.25 -0.42 0.90 [0.000] ei;t+1 (5.98) (1.34) (1.80) (cid:0) (cid:28) i;t+1 DfD i;t+1 -0.10 0.04 98.11 0.99 [0.000] (0.03) (0.01) (0.35) g Long-run regression coe¢ cients: (cid:26)j 1l -0.06 0.60 0.85 6.01 (cid:0) i;t+j (0.03) (0.18) (0.41) P (cid:26)j 1ree 0.06 0.40 -0.85 4.54 (cid:0) i;t+j (0.03) (0.18) (0.41) P e Implications of VAR estimates: (cid:27)(s(cid:28) )=(cid:27)(s(cid:28)) (cid:27)(s(cid:28) )=(cid:27)(s(cid:28)) corr(s(cid:28) ;s(cid:28)) corr(s(cid:28) ;s(cid:28)) corr(s(cid:28) ;s(cid:28) ) l r l r l r Estimates 0.58 0.44 0.915 0.903 0.909 (0.18) (0.16) (0.012) (0.041) (0.092) The sample period is monthly from 1973 to 2011. re is the log return on the corporate bond i in excess of i;t the matching Treasury bond, l is the credit loss on bond i, l is the credit loss implied from re ;s(cid:28) i;t i0;t i;t i;t 1 and s(cid:28) based on (4); s(cid:28) is the credit spread,eDD is the distance to default, and (cid:28) is the bond(cid:0)(cid:146)s i;t i;t i;t i;t duration. Thevariabless(cid:28)l i;t eands(cid:28)r i;t arethesumofexpecteedlong-rundiscountedcreditlossan e dth f esum of exfpected long-run discfounted excess returns, de(cid:133)ned by s(cid:28)l i;t = e L A(I (cid:0) (cid:26)A)(cid:0) 1 I (cid:0) ((cid:26)A)T (cid:0) t X i;t and s(cid:28)r i;t =e 1 A(I (cid:0) (cid:26)A)(cid:0) 1 I (cid:0) ((cid:26)A)T (cid:0) t X i;t . Thecolumn(cid:27)(E t [y t+1 ])showsthesamp(cid:16)lestandardde(cid:17)viationof (cid:133)tted values of the left-(cid:16)hand side vari(cid:17)ables. Standard errors, reported in parentheses under each coe¢ cient, are clustered by time, and p-values are reported in brackets. The matrix A and the associated standard errors are multiplied by 100. 56

Table 11: Comparing Monthly Corporate Bond Returns (Percent) Percentile Period Rating Mean Median Std. 1 5 10 25 75 90 95 99 Panel A: Priority Order = Lehman Brothers, TRACE, Mergent FISD, DataStream 1973/1 AAA/AA 0.75 0.59 7.47 -7.87 -3.88 -2.38 -0.53 1.84 3.68 5.19 10.06 to A 0.71 0.71 2.59 -6.30 -3.55 -2.26 -0.43 1.85 3.50 4.89 7.92 1998/3 BBB 0.82 0.77 2.64 -5.99 -3.46 -2.15 -0.33 1.96 3.68 5.07 8.15 junk 1.35 0.95 51.42 -11.82 -4.76 -2.89 -0.21 2.33 4.92 6.90 13.38 Subtotal 0.88 0.76 23.64 -7.76 -3.86 -2.37 -0.39 1.97 3.87 5.47 9.89 1998/4 AAA/AA 0.57 0.59 2.26 -6.24 -2.71 -1.49 -0.06 1.11 2.62 3.89 7.88 to A 0.63 0.60 2.73 -7.06 -2.92 -1.67 -0.17 1.34 2.98 4.38 8.98 2011/12 BBB 0.66 0.59 14.71 -9.22 -3.26 -1.79 -0.24 1.49 3.18 4.73 10.12 junk 0.79 0.69 9.04 -14.41 -3.95 -1.70 0.39 1.19 3.29 5.59 15.90 Subtotal 0.71 0.64 10.33 -10.43 -3.38 -1.71 -0.01 1.33 3.15 4.91 12.04 Panel B: Priority Order = DataStream, Mergent FISD, TRACE, Lehman Brothers 1973/1 AAA/AA 0.74 0.59 7.45 -7.87 -3.87 -2.38 -0.53 1.84 3.67 5.18 10.06 to A 0.71 0.71 2.59 -6.31 -3.54 -2.25 -0.42 1.84 3.49 4.88 7.93 1998/3 BBB 0.82 0.78 2.64 -6.01 -3.45 -2.13 -0.32 1.95 3.66 5.05 8.16 junk 1.20 0.95 35.10 -11.82 -4.75 -2.85 -0.21 2.33 4.89 6.89 13.43 Subtotal 0.85 0.76 16.40 -7.78 -3.85 -2.35 -0.39 1.97 3.85 5.46 9.89 1998/4 AAA/AA 0.57 0.59 2.33 -6.61 -2.71 -1.45 -0.03 1.08 2.56 3.86 8.20 to A 0.68 0.59 16.29 -7.66 -2.84 -1.60 -0.11 1.29 2.89 4.32 9.49 2011/12 BBB 0.72 0.59 22.11 -9.28 -3.12 -1.66 -0.17 1.44 3.07 4.57 9.99 junk 0.77 0.69 5.29 -14.18 -3.79 -1.57 0.43 1.15 3.19 5.45 15.73 Subtotal 0.73 0.64 15.25 -10.49 -3.26 -1.60 0.04 1.28 3.05 4.79 12.09 The top panel reports the summary statistics of the (gross) corporate bond returns used in the paper. The bottom panel reports the summary statistics of the data where the priority across the database is reversed (DataStream, Mergent FISD, TRACE, Lehman Brothers). 57

To examine how the choices among duplicate data points may a⁄ect the (cid:133)nal results, I repeat the exercise in Table 2 using an alternative dataset constructed from the reverse priority order. Table 12 reports the estimates of the VAR as well as the test results. The test results are essentially the same as the results in Table 2. Therefore, I conclude that the choice among di⁄erent datasets does not signi(cid:133)cantly a⁄ect the conclusion of the paper. B.4 The E⁄ect of Callability Finally, I show that the main result in Table 2 is robust to the inclusion of the (cid:133)xed e⁄ects of callable bonds. To this end, I demeaned all variables in the state vector X using the crossi;t sectional averages in each month separately for callable bonds and noncallable bonds. By demeaning separately, callable bonds are allowed to have di⁄erent means than noncallable bonds. Afteraccountingforcallability, IrepeattheestimationprocessinTable2. TheVAR estimated using the separately demeaned data is shown in Table 13. The resulting VAR coe¢ cients and volatility ratios are nearly identical to the estimates in Table 2. Therefore, the di⁄erence between callable and noncallable bonds is not driving the main result. C Computation of Distance to Default To construct distance to default, I use an implication of the Merton (1974) model. The value of the assets of a (cid:133)rm, A , follows a geometric Brownian motion: t dA t = (cid:22)dt+(cid:27) dW : (16) A t A t Let D be the book value of the debt of the (cid:133)rm at time t. If the value of the (cid:133)rm(cid:146)s t assets is less than the book value of the debt at the maturity date, then it cannot repay the debt and defaults. When in default, the bondholders immediately take over the (cid:133)rm, 58

Table 12: VARs Based on the Alternative Dataset Explanatory variable Joint sigre s(cid:28) (cid:28) DD R2 ni(cid:133)cance (cid:27)(E [y ]) i;t i;t (cid:0) i;t i;t t t+1 Regression of credit loss on information: l -5e.79 f2.74 g1.87 0.03 [0.000] 0.30 i;t+1 (3.89) (0.93) (0.87) el -5.41 2.44 1.88 0.02 [0.016] 0.27 i0;t+1 (3.94) (0.94) (0.87) e VAR estimates: A 100 (cid:2) re 2.96 2.25 -1.33 0.01 [0.000] 0.22 i;t+1 (3.46) (0.97) (1.31) s(cid:28) 2.47 95.98 -0.56 0.90 [0.000] ei;t+1 (5.52) (1.25) (1.56) (cid:0) (cid:28) i;t+1 DfD i;t+1 -0.15 0.05 98.22 0.99 [0.000] (0.03) (0.01) (0.29) g Long-run regression coe¢ cients: (cid:26)j 1l -0.04 0.51 0.67 4.65 (cid:0) I;t+j (0.02) (0.16) (0.33) P (cid:26)j 1ree 0.04 0.49 -0.67 4.86 (cid:0) I;t+j (0.02) (0.16) (0.33) P e Implications of VAR estimates: (cid:27)(s(cid:28)l)=(cid:27)(s(cid:28)) (cid:27)(s(cid:28)r)=(cid:27)(s(cid:28)) corr(s(cid:28)l;s(cid:28)) corr(s(cid:28)r;s(cid:28)) corr(s(cid:28)l;s(cid:28)r) Estimates 0.50 0.52 0.902 0.903 0.936 (0.15) (0.15) (0.012) (0.020) (0.060) The sample period is monthly from 1973 to 2011. re is the log return on the corporate bond i in excess of i;t the matching Treasury bond, l is the credit loss on bond i, l is the credit loss implied from re ;s(cid:28) i;t i0;t i;t i;t 1 and s(cid:28) based on (4); s(cid:28) is the credit spread,eDD is the distance to default, and (cid:28) is the bond(cid:0)(cid:146)s i;t i;t i;t i;t duration. Thevariabless(cid:28)l i;t eands(cid:28)r i;t arethesumofexpecteedlong-rundiscountedcreditlossan e dth f esum of exfpected long-run discfounted excess returns, de(cid:133)ned by s(cid:28)l i;t = e L A(I (cid:0) (cid:26)A)(cid:0) 1 I (cid:0) ((cid:26)A)T (cid:0) t X i;t and s(cid:28)r i;t =e 1 A(I (cid:0) (cid:26)A)(cid:0) 1 I (cid:0) ((cid:26)A)T (cid:0) t X i;t . Thecolumn(cid:27)(E t [y t+1 ])showsthesamp(cid:16)lestandardde(cid:17)viationof (cid:133)tted values of the left-(cid:16)hand side vari(cid:17)ables. Standard errors, reported in parentheses under each coe¢ cient, are clustered by time, and p-values are reported in brackets. The matrix A and the associated standard errors are multiplied by 100. 59

Table 13: Accounting for Call Fixed E⁄ects Explanatory variable Joint sigre s(cid:28) (cid:28) DD R2 ni(cid:133)cance (cid:27)(E [y ]) i;t i;t (cid:0) i;t i;t t t+1 Regression of credit loss on information: l -5e.64 f2.76 g2.00 0.03 [0.000] 0.30 i;t+1 (3.90) (0.93) (0.91) el -5.19 2.45 2.00 0.02 [0.019] 0.27 i0;t+1 (3.94) (0.94) (0.91) e VAR estimates: A 100 (cid:2) re 0.98 2.18 -1.54 0.01 [0.000] 0.21 i;t+1 (3.32) (1.00) (1.27) s(cid:28) 4.24 96.04 -0.46 0.90 [0.000] ei;t+1 (5.18) (1.29) (1.55) (cid:0) (cid:28) i;t+1 DfD i;t+1 -0.16 0.05 98.22 0.99 [0.000] (0.03) (0.01) (0.29) g Long-run regression coe¢ cients: (cid:26)j 1l -0.03 0.52 0.73 4.73 (cid:0) i;t+j (0.02) (0.16) (0.33) P (cid:26)j 1ree 0.03 0.48 -0.73 4.71 (cid:0) i;t+j (0.02) (0.16) (0.33) P e Implications of VAR estimates: (cid:27)(s(cid:28)l)=(cid:27)(s(cid:28)) (cid:27)(s(cid:28)r)=(cid:27)(s(cid:28)) corr(s(cid:28)l;s(cid:28)) corr(s(cid:28)r;s(cid:28)) corr(s(cid:28)l;s(cid:28)r) Estimates 0.51 0.51 0.890 0.890 0.923 (0.16) (0.15) (0.012) (0.024) (0.066) The sample period is monthly from 1973 to 2011. re is the log return on the corporate bond i in excess of i;t the matching Treasury bond, l is the credit loss on bond i, l is the credit loss implied from re ;s(cid:28) i;t i0;t i;t i;t 1 and s(cid:28) based on (4); s(cid:28) is the credit spread,eDD is the distance to default, and (cid:28) is the bond(cid:0)(cid:146)s i;t i;t i;t i;t duration. Thevariabless(cid:28)l i;t eands(cid:28)r i;t arethesumofexpecteedlong-rundiscountedcreditlossan e dth f esum of exfpected long-run discfounted excess returns, de(cid:133)ned by s(cid:28)l i;t = e L A(I (cid:0) (cid:26)A)(cid:0) 1 I (cid:0) ((cid:26)A)T (cid:0) t X i;t and s(cid:28)r i;t =e 1 A(I (cid:0) (cid:26)A)(cid:0) 1 I (cid:0) ((cid:26)A)T (cid:0) t X i;t . Thecolumn(cid:27)(E t [y t+1 ])showsthesamp(cid:16)lestandardde(cid:17)viationof (cid:133)tted values of the left-(cid:16)hand side vari(cid:17)ables. Standard errors, reported in parentheses under each coe¢ cient, are clustered by time, and p-values are reported in brackets. The matrix A and the associated standard errors are multiplied by 100. All state variables are demeaned every month using the means separately estimated for callable and non callable bonds. 60

and the equity holders receive zero. If the assets exceed the debt, then the equity holders receive the di⁄erence between A and D . This way, the market value of equity, S , can be t t t considered the price of a call option. The equity value is given by the Black-Scholes formula for a call option: S = A (cid:8)(d ) D (cid:8)(d ), (17) t t 1 t 2 (cid:0) where log(A =D )+ r+ 1(cid:27)2 d = t t 2 A , 1 (cid:27) A(cid:0) (cid:1) log(A =D )+ r 1(cid:27)2 d = t t (cid:0) 2 A , 2 (cid:27) A(cid:0) (cid:1) r is a risk-free rate and (cid:8) is a cumulative density function of a standard normal distribution. We cannot directly observe the market value of the asset, A , and its volatility, (cid:27) . t A Instead, we can observe the market value of equity, S , and its volatility, (cid:27) . By applying t S Ito(cid:146)s lemma to the equity and imposing a no-arbitrage condition, we have the risk-neutral dynamics of equity, S : t @S t dS = rS dt+ A (cid:27) dW . t t t A t @A t By matching the standard deviation of the dynamics, we obtain @S A t t (cid:27) = (cid:27) S A @A S t t A t =(cid:8)(d ) (cid:27) . (18) 1 A S t Equations (17) and (18) give a system of two equations with two unknowns: A and (cid:27) . t A 61

Since they are nonlinear equations, I solve them numerically using a KNITRO solver. The distance to default is then obtained by log(A =D )+ r 1(cid:27)2 DD d = t t (cid:0) 2 A . t 2 (cid:0) (cid:17) (cid:0) (cid:0) (cid:27) A(cid:0) (cid:1) D Decomposition into Three Components, Including Liquidity The decomposition of the credit spread in (4) can easily be extended to include liquidity. Suppose that the bond market is illiquid and the investor can only buy a bond at the frictionless price times H 1. When she sells a bond, she only receives 1=H for each i;t i;t (cid:21) dollar of the frictionless price. Let us de(cid:133)ne a liquidity-adjusted return as P =H +C i;t+1 i;t+1 i;t+1 R = . i(cid:3);t+1 P H i;t i;t We can think of H as one plus the fraction of the bond value that needs to be paid for i;t the purchase of the bonds, due to a bid-ask spread. For simplicity, I assume that there is no liquidity concern for Treasury bonds. Then, applying the same log-linear approximation to R , I obtain the one-period identity for a log liquidity-adjusted excess return as i(cid:3);t+1 re logR logRf (cid:26)s(cid:28) +s(cid:28) l h +const; i;(cid:3)t+1 (cid:17) i(cid:3);t+1 (cid:0) i;t+1 (cid:25) (cid:0) i;t+1 i;t (cid:0) i;t+1 (cid:0) i;t+1 where h is a illiquidity measure de(cid:133)ned by h (cid:26)logH + logH . Iterating i;t+1 i;t+1 i;t+1 i;t (cid:17) forward, I can obtain the three-way decomposition of the credit spread: T t T t T t (cid:0) (cid:0) (cid:0) s(cid:28) (cid:26)j 1re + (cid:26)j 1l + (cid:26)j 1h +const. (19) i;t (cid:25) (cid:0) i;(cid:3)t+j (cid:0) i;t+j (cid:0) i;t+j j=1 j=1 j=1 X X X 62

This identity says that, holding discount rates and credit loss constant, higher illiquidity in the future leads to higher current credit spreads (lower current prices) for corporate bonds. Taking the conditional expectation yields s(cid:28) s(cid:28)r +s(cid:28)l +s(cid:28)h ; i;t (cid:25) i;t i;t i;t where T t (cid:0) s(cid:28)h E (cid:26)j 1h : i;t (cid:17) " (cid:0) i;t+j (cid:12)F t # X j=1 (cid:12) (cid:12) (cid:12) Therefore, we can decompose the variation of cre(cid:12)dit spreads into three components: changes in expected excess returns, expected credit loss and expected illiquidity. The question is how to measure h . There are variety of measures of illiquidity in the existing i;t literature. For example, I could follow Bao, Pan and Wang (2011) in constructing the Roll (1984) measure to estimate the transaction cost. Speci(cid:133)cally, for bond i, I compute Cov ((cid:1)p ;(cid:1)p ), t i;t;d 1 i;t;d (cid:0) (cid:0) q where (cid:1)p is the log price change between day d 1 and day d in month t. Few investors i;t;d (cid:0) trade bonds every month. Indeed many of them hold the bonds until their maturity. To account for trading frequency, I can use Cov ((cid:1)p ;(cid:1)p ) times the bond(cid:146)s t i;t;d 1 i;t;d (cid:0) (cid:0) turnover rate (monthly trading volume dividepd by the face value of the bond) to obtain the illiquidity measure logH . Roll (1984) shows that the e⁄ective bid-ask spreads are i;t 2 Cov ((cid:1)p ;(cid:1)p ), when the fundamental value follows a random walk. Thus, t i;t;d 1 i;t;d (cid:0) (cid:0) lopgH measuresthee⁄ectivetransactioncostsforaninvestorwhoseportfoliohastheaverage i;t turnover rate. Once I compute the illiquidity measure, h , I can estimate the VAR with the state i;t 0 vector X = re s(cid:28) h (cid:28) DD and infer the implied long-run forecasting i;t i;(cid:3)t i;t i;t (cid:0) i;t i;t (cid:18) (cid:19) 63

coe¢ cients. By comparing the volatility of s(cid:28)h with the credit spread volatility, I can i;t in principle quantify the contribution from the liquidity variation in explaining the credit spreads, controlling for risk premium and expected default. Though this extension of the decomposition to include illiquidity is conceptually simple, it is not easy to empirically implement this three-way decomposition, due to the limited data availability. To my knowledge, the construction of any illiquidity measure requires the daily priceand/ortradingvolumeinformation. SincethisinformationisavailableonlyinMergent FISD/NAIC and TRACE, the sample period is limited to 1994 onwards. Even after 1994, the fraction of the bonds in these two databases is limited. This is problematic, as the model-free decomposition approach in this paper crucially depends on the large sample size. Since default occurs relatively infrequently, the time series dimension of the data must be su¢ ciently large so it covers at least several credit cycles. Also, the cross section must be large so it covers wider range of credit ratings. Thus, it is not feasible to reliably estimate the VAR with the subsample only from Mergent FISD/NAIC and TRACE. Another issue in estimating the liquidity e⁄ect is a bias due to the sample selection. As I can only compute the liquidity measure for the bonds that are traded relatively frequently, the subsample based on Mergent FISD and TRACE is biased toward more liquid bonds. Thus, the resulting decomposition can only provide the lower bound for the role of liquidity. 64

Cite this document

APA

Yoshio Nozawa (2014). What Drives the Cross-Section of Credit Spreads?: A Variance Decomposition Approach (FEDS 2014-62). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2014-62

BibTeX

@techreport{wtfs_feds_2014_62,
  author = {Yoshio Nozawa},
  title = {What Drives the Cross-Section of Credit Spreads?: A Variance Decomposition Approach},
  type = {Finance and Economics Discussion Series},
  number = {2014-62},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2014},
  url = {https://whenthefedspeaks.com/doc/feds_2014-62},
  abstract = {I decompose the cross-sectional variation of the credit spreads for corporate bonds into changing expected returns and changing expectation of credit losses with a model-free method. Using a log-linearized pricing identity and a vector autoregression applied to micro-level data from 1973 to 2011, I find that the expected credit loss component and the excess return component each explains about half of the variance of the credit spreads. Unlike the market-level findings in Gilchrist and Zakrajsek (2012), at the firm level, the expected credit loss is volatile and affects the firms' investment decision more than the expected excess returns.},
}