ifdp · July 31, 2011

Firm Default and Aggregate Fluctuations

Abstract

This paper studies the relationship between macroeconomic fluctuations and corporate defaults while conditioning on industry affiliation and an extensive set of firm-specific factors. By using a panel data set for virtually all incorporated Swedish businesses over 1990-2009, a period which includes a full-scale banking crisis, we find strong evidence for a substantial and stable impact from aggregate fluctuations on business defaults. A standard logit model with financial ratios augmented with macroeconomic factors can account surprisingly well for the outburst in business defaults during the banking crisis, as well as the subsequent fluctuations in default frequencies. Moreover, the effects of macroeconomic variables differ across industries in an economically intuitive way. Out-of-sample evaluations show that our approach is superior to models that exclude macro information and standard well-fitting time-series models. Our analysis shows that firm-specific factors are useful in ranking firms' relative riskiness, but that macroeconomic factors are necessary to understand fluctuations in the absolute risk level.

Board of Governors of the Federal Reserve System International Finance Discussion Papers Number 1029 August 2011 Firm Default and Aggregate Fluctuations Tor Jacobson, Jesper Lindé, and Kasper Roszbach NOTE: International Finance Discussion Papers are preliminary materials circulated to stimulate discussion and critical comment. References in publications to International Finance Discussion Papers (other than an acknowledgment that the writer has had access to unpublished material) should be cleared with the author or authors. Recent IFDPs are available on the Web at ww.federalreserve.gov/pubs/ifdp/. This paper can be downloaded without charge from Social Science Research Network electronic library at http://www.sssrn.com.

Firm Default and Aggregate Fluctuations ∗ Tor Jacobson Jesper Lindé Kasper Roszbach August 22, 2011 Abstract This paper studies the relationship between macroeconomic fluctuations and corporate defaultswhileconditioningonindustryaffiliationandanextensivesetoffirm-specificfactors. By using a panel data set for virtually all incorporated Swedish businesses over 1990-2009, a period which includes a full-scale banking crisis, we find strong evidence for a substantial and stable impact from aggregate fluctuations on business defaults. A standard logit model with financial ratios augmented with macroeconomic factors can account surprisingly well for the outburst in business defaults during the banking crisis, as well as the subsequent fluctuations in default frequencies. Moreover, the effects of macroeconomic variables differ across industries in an economically intuitive way. Out-of-sample evaluations show that our approach is superior to models that exclude macro information and standard well-fitting time-series models. Our analysis shows that firm-specific factors are useful in ranking firms’ relative riskiness,but that macroeconomic factors are necessary to understand fluctuations in the absolute risk level. Keywords: Default, default-risk model, business cycles, aggregate fluctuations, microdata, logit, firm-specific variables, macroeconomic variables JEL: C35, C52, E44, G33. Jacobson and Roszbach: Research Division, Sveriges Riksbank, email: firstname.lastname@riksbank.se. ∗ Lindé: Federal Reserve Board, email: jesper.l.linde@frb.gov. We would like to thank Rikard Kindell, who coauthored the working paper version of this paper on a shorter data sample for outstanding contributions to this project. Discussions with and suggestions from Franklin Allen, Ed Altman, Mitch Berlin, Mark Carey, Ines Drumond, Xavier Freixas, Bob Hunt, Wenli Li, Leonard Nakamura, Dragon Tang, Cees Ullersma, and Kostas Tsatsaronishavebeenveryhelpfulinimprovinguponearlierdrafts. WearealsogratefulforcommentsfromseminarparticipantsattheBankofAustria,theBankofHungary,theEinaudiInstituteforEconomicsandFinance, theBankofEngland,theBankofFinland,theFederalReserveBankofPhiladelphia,theFederalReserveBankof NewYork,UppsalaUniversity,EARIE,theC.R.E.D.I.T.2008conference,theEEA-ESEMmeetingsinBudapest (2008) and Barcelona (2009), the 2009 BIS Research task force workshop, the 2008 ASSA meetings, the DNB conferenceonFinancialStabilityandFinancialCrises,andtheBIS.EricaReisman,ErikvonSchedvinandIngvar Strid provided outstanding research assistance. The views expressed in this paper are solely the responsibility of the authors and should not be interpreted as reflecting the views of the Executive Board of Sveriges Riksbank, theBoardofGovernorsoftheFederalReserveSystemorofanyotherpersonassociatedwiththeFederalReserve System.

1 Introduction The failure of a business is an event of fundamental importance in economic life. Our understanding of the determinants of business defaults, despite long-studying, is far from complete, particularly with respect to the influences of broader economic conditions. Recent economic events, namely a global financial crisis shifting into a recession of exceptional depth, highlight the importance of understanding and predicting this crucial aspect of the economy, not least for timely and appropriate policy measures. The aim of this paper is to shed light on the dynamics of business defaults. In particular, we seek to understand the interactions between macroeconomic fluctuations and firms’ individual likelihoods to default, as well as the relationship between macro variables and the aggregate rate of default. To this end we have compiled a new panel data set with detailed firm-level information on all incorporated Swedish businesses over the period 19901 20092. The − panel comprises more than 16 million firm-date observations, with an average of more than 200000 firms per point in time. The length and width of this panel allow for several extensions of previous research; we are able to, among other things, carefully evaluate industry-specific effects of macroeconomic fluctuations. Since our data set includes virtually all incorporated Swedish firms, our findings provide insight into the significance of aggregate fluctuations for both listed and privately held firms; the latter group is typically responsible for over half of GDP in developed economies. This feature is important because Merton-like models of default, being based on equity-price information, are in practice limited to listed firms. Econometric studies of business defaults started in the 1960s with work by Altman and coauthors (1968197119731984). These influential papers focus on explaining bankruptcies of publiclyquotedbusinessesinacross-sectionalcontextusingasmallsetoffirm-specificvariables. Later work by Shumway (2001) attempts to account for the dynamic nature of defaults for publicly listed firms. Bharath and Shumway (2008) evaluate the out-of-sample accuracy of the Merton (1974) model and find that the distance-to-default measure is not a sufficient statistic for the probability of default. Over time the average default frequency and individual default probabilities display comovement with macroeconomic and financial variables in a way that suggests that aggregate shocks might be an important driver of default. The seminal work of Bernanke, Gertler and Gilchrist (1999) provides a theoretical framework whereby both firm-specific factors and macro shocks affect the default outcome of individual firms. In the BGG framework, firm default is 1

affected by firm-specific productivity shocks and aggregate shocks (e.g., an aggregate productivity shock). Hence, it follows that an empirical model of default should feature variables that proxy for the underlying firm-specific productivity, as well as variables that proxy for unobserved aggregate shocks. The recent paper by Hackbarth, Miao and Morellec (2006) provides an additional mechanism through which macroeconomic conditions affect default risk. They argue that, when cash flows depend on economic conditions, firms’ optimal default thresholds will be affected by aggregate shocks. Hence, aggregate shocks can trigger simultaneous defaults. These theoretical insights have recently been explored in the empirical literature on default modelling, and there is a small but growing number of papers investigating the importance of macroeconomic fluctuations on business defaults. Recent work by Duffie, Saita and Wang (2007), Pesaran, Schuerman, Treutler, and Weiner (2006), Bonfim (2009), Lando and Nielsen (2010), and Tang and Yan (2010) provides empirical evidence that firm-specific factors alone appear unable to fully explain the variation in corporate default rates and credit spreads. Using aggregatetimesseriesanddataonpubliclylistedindustrialfirms,theseauthorsfindthatmacrofinancial covariates have significant explanatory power for credit losses, spreads and corporate default rates using structural and reduced-form approaches. In this paper, we adopt a standard econometric specification and estimate multi-period logistic regressions on firm-level default data. Shumway (2001) shows that, under some mild restrictions, this model is equivalent to a discrete-time hazard model, and hence not prone to the bias and inconsistency of the static model hitherto used in bankruptcy modelling. In addition to an extensive set of financial statement variables and payment remarks reflecting a firm’s financial track record, we include four standard macroeconomic variables. The default risk models are estimated both at an economy-wide level and for 10 industries on a sub-sample covering 19901 19994. − Our large panel data set enables us to make several contributions to the above-mentioned literature. First, we show that a simple logit model with constant parameters is able to account for the outburst in default rates during the Swedish banking crisis, as well as the historically low rates occurring in subsequent recovery periods. The included macroeconomic variables are of key importance for explaining the time-varying likelihood of default. Firm-specific variables prove insufficient for explaining variation in the level of default risk over time, but are very useful for ranking firmsaccording totheir relative riskiness. Second, havingaccess to a very rich set of firm-specific controls we can credibly reject the possibility that the empirical significance 2

of macroeconomic variables is merely (or partially) an artifact of a shortage of firm-specific controls. Third, the length of our panel enables us to do extensive out-of-sample performance testing 20001 20092. The results suggest that the default risk models with both macro- − economic and firm-specific variables included perform very well out-of-sample. This holds both in the cross-sectional and the time-series dimension. Fourth, the width of our panel permits us to investigate the relationship between aggregate fluctuations and business defaults across industries. By isolating and comparing industry-specific effects from macroeconomic variables we get an additional measure of the robustness of their impact on business defaults. Our results are quite intuitive and suggest that the effects are stronger in sectors like construction and real estate, which on a priori grounds can be deemed to be more cyclical since they involve production of more durable goods. Fifth, the combined width and length of our panel allow us to look into the stability over time of cross-sectionally estimated parameters. We show that models estimated on cross-sectional data are likely to suffer from substantial parameter instability over time, andtherefore will be unable toaccount for variationsinthe average default frequencyover time. Finally, our analysis also suggests that considering only macroeconomic variables while ignoring relevant firm-specific information leads to a substantial loss of out-of-sample prediction accuracy. According to our analysis, the two key macroeconomic factors affecting business defaults are the nominal interest rate and the output gap. In the current situation, where economic activity and the output gap in many countries have dropped at the fastest pace since the Great Depression, our results suggest that central banks can reduce the likelihood of an outburst in defaults rates and the associated spike in credit losses for banks by aggressively cutting nominal interestrates. Interestingly, thisisexactlywhatmanycentralbankshavedoneduringtherecent crisis. The remainder of this paper is structured as follows. The next section presents the micro and macro data sets. The regression results are presented in Section 3 along with an assessment of the in-sample fit. In Section 4, we undertake a thorough out-of-sample investigation of the estimated models along several dimensions. Finally, Section 5 concludes. 3

2 Data The firm data set is a panel consisting of 16928521 quarterly observations on the population of Swedish aktiebolag, or firms, between January 1, 1990, and June 30, 2009. Aktiebolag are by approximation the Swedish equivalent of US corporations and UK limited liability businesses. SwedishlawrequireseveryaktiebolagtohaveatleastSEK100000(approximatelyUS$13000) of equity to be eligible for registration at Bolagsverket, the Swedish Companies Registration Office(SCRO).SwedishcorporationsarealsorequiredtosubmitanannualreporttotheSCRO. The firm data have been obtained from Upplysningscentralen AB (UC), the leading credit bureau in Sweden, independently operated but jointly owned by the Swedish commercial banks. The UC data come from two general sources. Annual balance sheet and income statement data comefromfirms’compulsoryannualreportssubmittedtotheSCRO.Thesedatacovertheperiod January 1, 1989, through June 30, 2009, and the format follows European Union standards. We converttheannualreportdataintoquarterlyobservationsbylinearinterpolation,i.e.,weassume that the variables remain constant over the quarters in a given reporting period. The second information source is atypical in the existing default literature and is somewhat unique for Sweden. The credit bureau systematically collects information about events related to firms’ payment behavior from all relevant sources, e.g., the Swedish retail banks, the Swedish tax authorities, and the institutions that deal with the legal formalities in firms’ bankruptcy processes. The credit bureau thus has a register of more than 60 different payment remarks concerning primarily credit and tax-related events, but also records of various steps in the legal procedures leading up to formal bankruptcy. The information in the register includes a flag for the occurrence of an event in the form of a date and the amount of due payment (if applicable). Some examples of registered events are delays in tax payments, the repossession of delivered goods, the seizure of property, the restructuring of loans, and actual bankruptcy. Payment remarks turn out to be powerful predictors of default and are essentially available in real-time. Admittedly, to allow for comparability with other studies, one might prefer excluding payment remarks, as these are not generally available outside Sweden. However, we prefer to include the payment remarks in our analysis in order to have a comprehensive set of firm-specific control variables. This way, we seek to eliminate the possibility of macroeconomic variables spuriously proxying for omitted firm-specific controls. In Appendix B, however, it is shown that the role of neithertheaccountingvariablesnorthemacroeconomicfactorsismuchaffectedbytheinclusion of the remark variables that we consider. 4

The population of existing firms in quarter  is defined as including those firms that have issuedafinancial statementcovering thatquarterand areclassifiedas“active,”i.e., thefirmhas reported total sales and total assets in excess of 1000 SEK (roughly US$ 130). However, since there are firms that neglect to fulfill their reporting obligation, a behavior typically associated withdistress,wewouldmissanimportantsegmentoffirmsbyonlyconsideringthosethatsubmit annual reports regularly. For this reason, we will add firms that, according to the data set with payment remarks, are classified as defaulted firms in quarter . Many firms that default choose not to submit their compulsory annual reports in that year or even for a number of years prior to default. Hence, often the only records of their existence that we have come from the payment remark registers. We adopt the following definition of a default: a firm has default status if any of the following events has occurred: the firm is declared legally bankrupt, has suspended payments, has negotiated a debt composition settlement, is undergoing a re-construction, or is distraint without assets. More details on the construction of the default variable are provided in Appendix A. For the selection of which financial ratios to use in the default models, we evaluated a large number of frequently used ratios in the literature on bankruptcy risk and on the balance sheet channel.1 Many papers employ measures of liquidity, profitability and efficiency, and solvency or leverage, while some also make use of a size variable. In this paper the six selected financial ratios are: earnings before interest, depreciation, taxes and amortization (EBITDA) over total assets (TA) (earnings ratio); interest payments (IP) over the sum of interest payments and earnings before interest, depreciation, taxes and amortization (interest coverage ratio); total liabilities (TL) over total assets (leverage ratio); the log of total liabilities over total sales (TS) (debt ratio); liquid assets (LA) in relation to total liabilities (quick ratio); and inventories (I) over total sales (inventory turnover ratio). Details on the selection of financial ratios along with agraphicalexpositionofthedataareprovidedinAppendixA.2. Itisimportanttonotethatthe non-linear feature of some financial ratios does not imply that these variables are uninformative for default risk when entered linearly in the logit model. The reason for this is that the covariation between the financial ratios in the cross-section is substantial, which makes each of these variables contribute substantially to predicting default events in the joint linear empirical model. Moreover, the accounting data provides information on whether a firm has paid out 1 Table A.1 in Appendix A.3 provides an account of the variables considered in Altman (1968), Altman, Frydman, and Kao (1985), Shumway (2001), Pesaran et al (2006), Duffie et al (2007), Bonfim (2009), and Bharath and Shumway (2008). 5

dividends to shareholders or not, which we enter as a dummy variable (PAYDIV) in the models. As mentioned previously, some firms classified as active, or defaulted, fail to submit a financial report in every period, leading to a missing-observation problem. For the purpose of using an aggregate default series that closely corresponds to the official default frequency series as computed in the official statistics for Sweden and to ensure unbiased macro coefficients in the econometricmodel, we decided toretainfirms with missing variables in the sample, by replacing missing values by imputation rather than excluding such firms from the sample.2 In order to capture the relationship between not submitting a financial statement and subsequent default, we include a dummy variable, denoted TTLFS. In line with actual reporting lags, this dummy equals unity at time  if a firm has not issued a financial statement in the one and a half years priortothecurrentquarter,andequalszerootherwise.3 TTLFSattemptstocapturethesignal that many firms (deliberately) choose not to file a financial report when in financial distress, and thus are more likely to default. In Appendix B.2 we document that our results are robust w.r.t. the approach that we use to deal with missing observations by reporting results where we only include firms for which data on all the financial ratios are available. For the remark variables, we use simple dummy variables by setting them to unity if certain remarks existed for the firm during the year prior to quarter , and 0 otherwise. An intuitively reasonable starting point was to find remark events that (i) lead default in time as much as possible and (ii) are highly correlated with default. As it turns out, many payment remark variables are either contemporaneously correlated with default or lack a significant correlation with default behavior. For our final model, we constructed the PAYREMARK variable as a composite dummy of four events: “a bankruptcy petition,” “the issuance of a court order because of absence during the court hearing - to pay a debt,” “the seizure of property,” and “having a non-performing loan.” The TAXARREARS variable reflects whether the firm is in various tax arrears. It should be emphasized, however, that these two remark variables do not imply a subsequent default incident. The shares of defaulted firms that have received payment remarks, or are in tax arrears, are about 0.15 and 0.41, respectively. Corresponding shares for non-defaulted firms are 0.00 and 0.03. Hence, there are no tautological issues involved in using these variables to predict default events. Table A.2 in Appendix A.3 provides descriptive 2 Imputationisimplementedinasequentialprocedureforagivenvariableaccordingto: (i)abackwardsearch forthe last available observation forthat particular firm in the past, (ii) a forward search for future observations for the firm, (iii) if these measures fail, a randomized draw from the data is done conditional on industry and default status. 3 See Appendix A.2 for more details about TTLFS. 6

statistics for all firm-specific variables that are used in the subsequent analyses. We make use of four aggregate variables: the output gap (i.e., the deviation of GDP from its trend value), the yearly inflation rate (measured as the fourth difference of the GDP deflator), the repo interest rate (a short-term nominal interest rate, set by the Riksbank), and the real exchange rate. The output gap series is computed by HP-filtering GDP, where the smoothing coefficientissettothestandardvalueof1600. Therealexchangerateismeasuredasthetrade weighted nominal exchange rate times trade weighted foreign price level (CPI deflators) divided by the domestic CPI deflator. During the sample period, the real exchange rate is characterized by an upward (depreciating) trend and it is therefore detrended with the HP-filter to achieve stationarity. Appendix A.2 provides a figure with the macro data and Appendix B.3 verifies that the results presented below are robust with respect to our detrending procedure of the real exchange rate. 3 The default-risk models: Estimation and in-sample fit In this section, we examine if default risk at the firm level is affected by aggregate fluctuations over and above the set of firm-specific information that we have at our disposal. We study the in-sample gains of estimating separate models for each industry and assess the role of aggregate fluctuations for improving the models’ fit. The in-sample period is chosen to be 19901 19994. For this period, we have a total of 8106176 observations of which − 105605aredefaults. Theout-of-sampleperiod,20001 20092,issavedtoallowforextensive − model-evaluation exercises. and comprisesa total of 8822345 observations of which 55945 are defaults. Thus, the average default frequency per quarter equals about 1 percent in our dataset for the full sample period. This is somewhat higher than the 0.75 percent per quarter business failure rate reported for the US by, among others, Bernanke, Gertler and Gilchrist (1999), but if we exclude the banking crisis and instead consider the period 19951 20092, then − the average unconditional default frequency essentially equals the value reported by Bernanke, Gertler, and Gilchrist. Analyses of industry effects will be conducted at the one-digit level to ensure sufficiently many default observations in each industry in both the cross-sectional and thetimeseriesdimensions. In addition, weestimatethemodel for allfirmsjointly, andwill refer to this model as the “economy wide” model. 7

3.1 The default-risk models The reduced-form statistical model that we employ for estimating probabilities of default for all Swedish incorporated firms is similar to the multiperiod logit approach used in Shumway (2001) and Campbell, Hilscher, and Szilagyi (2008). Using a reduced-form model both avoids the problem that the Merton (1974) model has, namely that it cannot be implemented for privately held companies without very strong assumptions, and enables us to use a unified approach for all businesses, both privately and publicly held. As discussed in the introduction, there is also a recent theoretical literature - including papers by Bernanke, Gertler and Gilchrist (1999), Hackbarth, Miao and Morellec (2006) and Tang and Yan (2010) - which argues that both firmspecific and aggregate shocks can trigger simultaneous defaults. Thus we propose to estimate the following model:  =  +  +      where 1 if  + + 0 (firm defaults)   = 0 if   +  +  ≥  0 (firm stays in business)     ½ undertheassumptionthatthevectoroffirm-specificregressors(i.e., )andthemacroeconomic  variables we consider (collected in the vector  ) are stochastically independent with respect to  theerrorterm . Thisapproachalsoallowsustocontrolforthecompetingriskofexitingfirms  duetootherreasonsthandefault(Allison,1995). Wealsomaketheadditionalassumptionsthat, conditionalontheextensivesetoffirm-specificandmacroeconomicsetofcovariatesweconsider, the errors are independent between both firms and time points, i.e., (  )=( )( )     for  = and (  )=( )( ) for =0. These assumptions are rejected by Das,  +  + 6 6 Duffie, Kapadia and Saita (2007) on US data. Lando and Nielsen (2010), however, revisit the relation between contagion through covariates and conditional dependence addressed in Das et al. and find that the assumption of conditional independence can no longer be rejected when the set of firm-specific and macroeconomic controls is slightly altered and expanded. Hence, conditional on an appropriate set of covariates, Lando and Nielson find no evidence that the default of a firm causes default intensities of other firms to increase, providing support for our assumptions for the error term. Moreover, for a similar model estimated on a subset of the data used in this paper, Carling, Roszbach and Rönnegård (2004) find that the estimated parameters arerobustwhenthecorrelationbetweenresidualsistakenintoaccount. Theseresults,alongwith the excellent out-of-sample performance of the model in the time and cross-section dimension 8

documented in Section 4, provide support of our assumptions for the error term. Our theoretical basis for selecting the set of macroeconomic variables is that they should span both aggregate “demand” and “supply” type of shocks that hit the economy. The output gap is intended as an indicator of demand conditions, i.e., increased demand in the economy is expected to reduce default risk. We also include the nominal interest rate in  because credit  conditionsfacingfirms,inparticularfirmsindistress,arelikelytobetightlylinkedtotheinterest rate. Asonecanplausiblyarguethattherealinterestrate, rather thanthenominal one, iswhat shouldaffectthe defaultfrequency, wealsoincludetheinflation rate in . Moreover, apartfrom  capturing the effects of supply shocks, higher inflation obviously implies higher nominal income for firms, which furthermore should tend to reduce default risk. However, it is also conceivable that higher inflation rates are associated with less certainty about correct relative prices and thus may lead to increased default risks. For these reasons, the sign of the inflation coefficient is unclear, and will depend on the relative strength of the underlying sources of macroeconomic fluctuations. Furthermore, given that the export-to-GDP ratio in Sweden was around 040 during the sample period, the real exchange rate is potentially an important variable, since a depreciation renders improved competitiveness to Swedish firms in the export sector.4 3.2 Estimation results To document how aggregate variables contribute to the default risk models, we present estimationresultsfortwospecifications: onewithandonewithoutmacroeconomicvariables. Moreover, results are presented for ten industry-specific models and an economy-wide model where firms in all industries are jointly modelled. Table 1 contains estimation results for a model with firm-specific determinants of default risk only (i.e., the six financial ratios augmented with the dummy variables PAYDIV, TTLFS, PAYREMARK, and TAXARREARS), while Table 2 shows results with the macroeconomic variables added. The regressors have not been re-scaled to have the same mean, and therefore one cannot directly judge the importance of a particular variable from the size of its coefficient. However, by suitably transforming each variable, its marginal impact is calculated in Appendix B.4, verifyingthatsuchcalculationsyieldsimilarrankingsofimportanceasstandard-statistics. Hence, the importance of each variable is below approximated with the size of its -statistic. 4 Inadditiontothesefourvariables,wehavealsoexperimentedwithafewothervariablessuchasrealhousing prices, taken as deviations from linear/HP-trend; and a measure of the spread between the interest rate charged to non-financial firms and the policy rate set by the central bank. For our sample period, these variables are largely redundant given the set of included variables in the benchmark specification. 9

Since the firms’ annual financial reports are typically submitted with a significant time lag, it cannot in general be assumed that accounting data for year  are available during, or even at the end of, year  and enable forecasted default risks for year  +1. To account for this, all accounting data is lagged by four quarters in the estimations. For most firms, which report balance-sheet and income-statement data over calendar years, this means that data for year  are assumed to have been available in the first quarter of year  +1. It should be emphasized that our decision to lag the accounting data four quarters in the estimation in order to make the model “operational” in real time has minor implications for the estimated coefficients. When re-estimating the model using contemporaneous data instead, the estimation results were found to be very similar to the ones reported in Tables 1 and 2. TheresultsinTable1showthatthefirm-specificinformationweconsiderisindeedimportant for explaining default behavior in both the industry-specific models and in the economy-wide model. In particular, the indicator variable TTLFS (which takes a value of 1 if a firm has not filed an annual report on time, and 0 otherwise) and the variables for remarks on firms’ payment records are very powerful predictors of default. Among the financial ratios we find the earnings ratio EBITDA/TA, leverage ratio TL/TAand the debt ratio TL/TS to be quite useful. However, the roles played by financial ratios in the various industry models differ substantially; while accounting data are less important in the financial services (bank, finance and insurance) sector, it is more important in the manufacturing industry. The coefficients for the payment remarks and the indicator variable TTLFS are quite similar across industries. So to the extent that these variables are the more important ones for explaining firm default behavior, there is no clear gain at the firm-specific level from conditioning on industry.5 Turning to the results in Table 2, we find that the coefficients for the firm-specific variables in Table1 donotchangemuch when the model isaugmented withthemacroeconomicvariables. Moreover, and despite the robustness of the firm-specific coefficients, we find that all coefficients forthemacroeconomicvariablesaresignificantintheeconomy-widemodelandhavetheexpected signs. The possible exception is inflation, but for reasons discussed in Section 3.1, it is hard to have a strong view a priori on the sign of the inflation coefficient. The notion that it is importanttoconditiononmacroeconomicvariablesindefaultriskmodelingisfurthersupported by the industry-specific model results. Table 2 shows that the impact of the macroeconomic 5 Noticethatbydefiningadefaulteventatthequarterlyfrequency,andbytransformingyearlystatementsto quarterly ones, we could potentially underestimate the effects of the accounting variables. As a robustness check we therefore estimated the default-risk models on annual data and found that the coefficients for the accounting variables are quite similar to those reported in Table 1, see results in Appendix B.3. 10

factors is estimated to be more important in industries that are arguably more cyclical. For instance, the output gap is more important in the construction and the real estate sectors in comparison with other industries, and as expected the nominal interest rate is found to be very important for the financial services and the real estate sectors. The remaining macroeconomic variables, inflation and the real exchange rate, appear less important overall. However, it is reassuringtofindthatadepreciatingrealexchangerate(i.e.,increasingvalues)isassociatedwith a significantly lower default risk in the manufacturing sector, which is the most export-oriented industry. Thecoefficientfortherealexchangerateisalsolargeforthefinancialservicesindustry, possibly reflecting that Swedish credit conditions, which were very tight in the resolvement of the banking crises, subsequently eased when the krona was allowed to depreciate in November 1992. Regarding inflation, we can reject the view that it is only the real interest rate that matters for default risk at the firm level, with the possible exception of the financial services industry.6 Finally, we would like to emphasize that the gain in using firm-specific data for default-risk modeling is substantial. OLS estimation (TSLS with lagged variables as instruments yielded very similar results) for a model of the average quarterly default rate on average financial ratios and the four macroeconomic variables yields: EBITDA TL LA  = 945 027 + 017 0003 ...  −(1067) −(018) µ TA ¶ (012) µ TA ¶ − (002) µ TL ¶ I TL IP 040 002 + 016  −(017) µ TS ¶ −(004) µ TS ¶ (016) µ IP+EBITDA ¶ 012 0007 0005 002 +ˆ  (1)      −(006) − (004) − (004) −(001) 2 =088 DW=198 Sample: 19901 19994 ( =40). − If we compare the point estimates for the financial ratios in (1) with the economy-wide model in Table 2, we see that they differ substantially and the ratios I/TS and TL/TS appear with counterintuitive signs. The coefficients for the macroeconomic variables are more robust, with the exception of the nominal interest rate which has a counterintuitive sign. Since the average financial ratios are quite smooth over time, it is not surprising that we obtain spurious results when the firm-specific information is aggregated. Moreover, we notice that some explanatory 6 As a robustness check, we examined a model allowing for non-linear relationships between default and the financial ratios and found that the macroeconomic variables are still highly significant and quantitatively important. WeusedthecumulateddistributionsdepictedinFigureA.1inAppendixAtocategorizethevariables (3 categories for each variable). For instance, we classified EBITDA/TA into the decile-based categories 0 10, − 10 90,90 100,whereasTL/TAwasclassifiedintothecategories0 75,75 90,90 100. Thiscategorization res − ultedina − nincreaseinpseudo2from034to042intheeconomy-w − ideTab − le2mod − el,butthemacroeconomic variables still enter highly significantly and with coefficients very close to those reported in Table 2. 11

power is lost by aggregating data; the model in (1) yields an 2 of 088, which can be directly compared with the aggregated fit (2 = 095) of the corresponding model in Table 2. The reduction in fit is primarily driven by the inability to take advantage of the dummy variables for payment remarks, dividends and failure to submit a financial statement in regressions at the aggregate level. 3.3 Assessing the models’ in-sample fit The last rows in Tables 1 and 2 report on the number of observations, the mean log-likelihood and the pseudo-2. The latter measures the ability of the estimated models to explain default at the firm level and is computed using the method of McFadden (1974). Another important and interesting feature of the models is their aggregate performance over time, i.e., how well the models account for the average default frequency. Hence, we report what we label as “industry” or “aggregate” 2s. These are calculated by aggregating all the fitted firm default probabilities inaparticularindustrymodelforeachquarter19901 19994andthenusingtheresulting40 − time-series observations to compute the implied aggregate 2. To assess the gain in estimating separateindustry-specificmodels, wealsoreportthepseudo-andindustry-2 valuesconditional on the economy-wide model coefficients instead of the industry model coefficients. BycomparingTables1and2,weseethatthepseudo-2 isnotmuchaffectedbyconditioning on macroeconomic factors in any of the industries, merely 1-2 percentage points. Tang and Yan (2010) find a somewhat larger role for macro factors: about 6 percentage points. However, the industry-2 is doubled and sometimes even more than doubled by the introduction of macroeconomic variables. Thus, the firm-specific variables account for the cross-section of the default distribution, while the macroeconomic variables in the model play the role of shifting the mean of the default distribution in each period. This also implies that the model with firm-specific information cannot capture the upturns and downturns in the average default rate over time. This is visualized in Figure 1, where we plot the average default rate over time against the fitted values from the economy-wide models with (Table 2) and without (Table 1) macroeconomic variables. The results to the right-hand side of the vertical line pertain to out-of-sample results and will be discussed in Section 4.1. According to Figure 1, the model with both microeconomic andmacroeconomicvariablesincludedindeedappearsabletoreplicatetheextremedefaultrates during the deep recession/banking crisis in the beginning of the 1990s, as well as the downturn to moderate default rates towards the end of the in-sample period. This finding is very inter- 12

esting, because it suggests that the extreme default rates recorded during the banking crisis in the early 1990s were not exceptional events that are uninformative in a model context. Rather, they appear to be consequences of unusually bad economic outcomes. An additional feature of interest in Tables 1 and 2 is that the fall in pseudo-2 values associated with conditioning on the economy-wide model coefficients is distinct but limited, whereas the corresponding reduction in industry 2 is often quite substantial. In three cases (the agricultural, the bank, finance & insurance, and the non-classified sectors) the industry-2 are negative conditional on the economy-wide model coefficients. At first sight this may seem strange, given that the industryspecific coefficients in Table 2 are not very different from the economy-wide model coefficients. However, these seemingly inconsistent results are driven by the unreported intercept, which is larger in the economy-wide model compared with the sector models. Therefore, it induces a systematic over-prediction of default risk in these sectors. A conceivable objection to our claim of the importance of conditioning on macroeconomic factorsin default risk modelsisthatthesignificance of these variables simply reflects achanging impact of the firm-specific variables over time. Accordingly, if one were to continuously reestimate the coefficients of the models in Table 1 using the most recent quarterly information only, then the fit of the models in terms of aggregate 2 would increase dramatically and make the macroeconomic variables redundant. Figure 2 displays the estimated coefficients for the financial ratios in the economy-wide model for such a set of separate cross-sectional models.7 These results are computed for the economy-wide model only, because there are not enough defaults available in each quarter to estimate industry-specific models. As can be seen from Figure 2, the coefficients for most ratios are highly unstable and some even switch sign over time. Accordingly, any out-of-sample forecasts beyond the very short horizon generated by any of these 40 models would be deficient. To be convincing, a model of firm-level defaults with time-varying coefficients for firm-specific variables would require an understanding of how the time-variation in its coefficients comes about. The irregular and economically implausible patterns in Figure 2, make such a model seem highly improbable. To further understand the role of macroeconomic variables for default risk, let us approach the issue from an opposite angle and study the importance of firm-specific variables in the 7 We have also conducted these cross-sectional regressions when imposing the restriction that the constant equals the estimated intercept in the economy-wide model in Table 1 and subtracting the panel mean from each regressor. SoinsteadofrunningtheregressionsunderlyingFigure2,i.e.  = +  + foreachquarter,   0   we estimated  =¯+  ¯  +  This alternative estimation procedure yielded very similar results to  0− 0   those reported in Figure 2.   13

models. One way to demonstrate the information loss due to omitting the microdata is to regress the average default frequency exclusively on the macroeconomic variables. When doing so we obtain the following result:  = 040 020 +0007 + 010 003  +ˆ        (011)−(003) (002) (002) −(0009) 2 =081 DW=143 Sample: 19901 19994 ( =40). (2) − When comparing this regression with the economy-wide model in Table 2, we see that exclusion of the financial statement variables is associated with a loss of close to 15 percentage points of explanatorypower. Moreover, omittingthe firm-specificinformationintroducesmisspecification problems in (2) as indicated by the DW-statistic, in contrast to the results in (1), which has a DW-statistic around 2 and hence displays no signs of autocorrelation. A simple -test reveals that the loss of fit in (2) relative to (1) is significant at the 5-percent level (using asymptotic critical values). The autocorrelation problem in (2) turns out to induce further problems with out-of-samplestabilityforthemodelin(2), asdocumentedinSection4(seeTable3). Ourinterpretation is thatomitting firm-specific variables when modeling default risk attributes too much of the variation in default risk to the macroeconomic factors in-sample. The model therefore doesn’t perform as well out-of-sample. 4 Out-of-sample performance of the estimated model In this section we investigate the robustness of the results in the previous section by examining the out-of-sample performance of the models of Tables 1 and 2 for the period 20001 20092. − We evaluate the models along two dimensions. First, we study the models’ performance at the industry and aggregate level, i.e., we assess their ability to predict future average default rates. The predictions we consider are static one-step-ahead forecasts because we do not have a complete model for all the regressors. Although there are no major fluctuations in the aggregate default rate during the out-of-sample period (see Figure 1), this period is nevertheless very informative about the out-of-sample performance of our models since there is still substantial variation in the macroeconomic variables, as displayed in Figure A.2. Second, we look into the models’ properties in predicting future default events at the firm level. To this end, there is a substantial amount of information (55945 default observations) to assess the stability of the models. 14

4.1 Evaluating the models at the aggregate and the industry level In Figure 1, the results to the right-hand side of the vertical bar show the one-step-ahead, out-of-sample performance at the aggregate level for the economy-wide model. Overall, the outof-sample fit is remarkably good, although there is a tendency for the model to under-predict during the years 2006 and 2007. Interestingly, the model is able to pick up the emerging spike in the default frequency in the recent recession through the strong fall in the output gap from 4 to -4 percent. In Table 3, we report the root mean squared one-step-ahead prediction errors (RMSEs), for the estimated models of Tables 1 and 2. As a reference, we also show results for three reference timeseries models: arandom walk, a4-quarter moving-average model, and the model estimated on only macroeconomic data (eq. 2, denoted “Industry OLS macroregression”). The results in Table3pertaintodefaultriskmodelsthathavebeenre-estimatedusingmacroeconomicvariables that are lagged one quarter. This ensures that all models in Table 3 have been estimated on the same information set, thereby allowing for a fair comparison between the logit and the timeseries models. In the “Industry OLS macroregression” models an additional dummy for the third quarter is included. In order to assess to which extent the forecasting errors are quantitatively different from a statistical point of view, we perform the Diebold-Mariano test on the forecast errors underlying thecomputedRMSEdifferentialsinthelowerpanelofTable3. InthetabletheRMSE-ratiosthat are bolded (in italics) indicate that the forecasting performance is significantly better (worse) relative to the corresponding models in Table 2.8 Finally, it is imperative to notice that the RMSEs are shown in percent, i.e., the actual and fitted default frequencies have been multiplied with a factor of 100 before the prediction errors are calculated. From inspection of Table 3, it is evident from the first row in the lower panel that the effect on forecasting performance from conditioning on both macro and firm-specific information is considerable. The largest gain is found for the economy-wide model where forecast precision increases by a factor of 36 when we include macroeconomic variables. Disregarding the not classified residual industry for which the model with macro variables is associated with a significant loss of out-of-sample accuracy, the corresponding factors for the industry-specific models 8 Our application of the test suggested by Diebold and Mariano (1995) examines the null hypothesis:    =   , where  is the squared loss-function of the one-step-ahead forecast errors  +1 |  +1 |  +1 |  for  m  odels   and .Diebold and Mariano show that a test-statistic based on the loss differential   , and suitably normalized by the asymptotic variance of  , is asymptotically standard normal.  15

range between 1.2 and 3.6 and constitute a significant improvement in 8 out of 9 cases. Moreover, the second row in the lower panel shows that the industry-specific models often generate significantly lower RMSEs compared with the industry models conditional on coefficients from the economy-wide model in Table 2, except for the manufacturing, retail and hotel & restaurant sectors where they are slightly, but not significantly, higher. By and large, the above findings constitute evidence that the industry-specific models are not over-parameterized with respect to the macroeconomic variables. Therefore it will typically be worthwhile to work with an industry-specific model if the focus is on understanding default behaviorinaparticularindustry. However,ifinterestliesinmodelingaggregatedefaultbehavior only, then the economy-wide default model appears to suffice. This tentative conclusion can be drawn from the two right-most columns of the second row in Table 3. In the second to last column,theforecastscomputedwiththeindustry-specificmodelshavebeenweighted(according toindustry-size)toaforecastfortheaggregatedefaultfrequency. Thisresultsinaslightlylower RMSE (00716) in comparison with the RMSE (00802) for the economy-wide model. Although this difference in RMSE is moderate in comparison with the other models in absolute terms, it is still significant in favor of the industry models according to the D-M test. Comparing the industry models in Table 3 with the time series models, we see that while the random walk model is doing significantly better in 3 out of 10 sectors, and the 4-quarter moving-average specification is better 6 out of 10 times at the industry level, they are both significantly inferior at the aggregated industry level. This implies that they are also inferior in terms of RMSE fit to the economy-wide model specification in Table 2 (which conditions on aggregate fluctuations). The models that are based on OLS regressions for average industry default frequencies on the macroeconomic variables only are often associated with a significant increase in RMSE (7 out of 10 industries) in comparison with the Table 2 models. To sum up, we have found strong evidence that the favorable fit in-sample of the estimated industry (and aggregate) models, conditional on macroeconomic variables, is preserved out-ofsample at the industry and aggregate level. This is reassuring for the hypothesis that aggregate variables matter because the in-sample and the out-of sample periods taken together cover several upturns and downturns in the Swedish economy. Finally, we have also documented that there are relatively small gains in terms of forecasting accuracy to be made by using industryspecific models rather than simply an economy-wide model, as long as an appropriate set of macroeconomic variables is included. 16

4.2 Evaluating the models at the firm and the industry level We now turn to an evaluation of the ability of the models to both rank firms according to their relative riskiness and to determine firms’ absolute risk level for the out-of-sample period. In addition, we report the industry-specific pseudo-2 conditional on the industry-specific model coefficients of Table 2, as well as the pseudo-2 calculated conditional on the economy-wide model coefficients. The results are displayed in Table 4. First, starting with the pseudo-2 for the models with industry-specific coefficients and comparing the in-sample and out-of-sample results reported in Tables 2 and 4, respectively, we seethattheexplanatorypowerout-of-sampleisinfacteitherhigherthanin-sampleorunchanged in eight out of ten industries. Next we turn to the pseudo-2 for the predictions based on the economy-wide model coefficients. The lower panel of Table 4 shows that the average explanatory power has increased substantially, from 034 in-sample to 038 out-of-sample. We also see, relative to Table 2, that the explanatory power has increased in all industries except for agriculture, real-estate and the construction sectors. Moreover, by comparing the pseudo-2 values generated when using industry-specific coefficients with those obtained using economy-wide model coefficients (i.e. the values in upper and lower panels of Table 4), we find that they are larger/smaller in four industries and similar otherwise. This implies that pseudo-2 at the aggregate level is about the same (038) for the economy-wide model compared with an aggregation of pseudo-2 over the industry-specific models (denoted Industry Aggregate in Table 4). These results provide support for two important conclusions. First, the industry models are not over-parameterized. Second, the reduced-form coefficients appear to be stable over time and the regressions thus reflect relationships that hold out-of-sample. Moving on to measures of relative risk, we follow Shumway (2001) and evaluate the models’ ability to rank firms according to their riskiness in terms of ex post default frequencies. At first glance, we see from Table 4 that the estimated models classify roughly 75 80 percent of the − defaulting firms in the riskiest decile. These numbers are about the same as those reported insample by Shumway for a data set that was substantially smaller and included only listed firms. Our models cover the entire population of Swedish incorporated businesses, of which only a very small subset is publicly listed on the stock exchange (slightly less than 500 out of 260000). We therefore conclude that our models are quite successful in ranking firms according to their level of default risk, and support our conclusion that the role of macroeconomic variables in our 17

models of default risk is not driven by the omission of key microeconomic variables. Table 4 also reveals that the quality of the risk rankings does not depend on whether we condition on industry-specific coefficients or coefficients from the economy-wide model. This contrasts with our findings in the previous subsection, where we found that conditioning on industry-specific parameters improved the models’ empirical performance at the industry level. The explanation for these seemingly inconsistent results lies in the fact that the most important difference between the economy-wide and industry-specific models is due to the varying impact of the aggregate factors. Those factors have little impact on the firms’ relative risk ranking and hence their inclusion or omission has little impact on the models’ ability to risk-rank firms. Finally, we assess the out-of-sample properties of the models at the microeconomic level in an absolute sense, as opposed to the relative appraisal in Table 4. We do so by sorting all estimated default probabilities according to size and calculate the average probability of default in each percentile. We then compare the average probabilities of default with the actual default experience of the firms for each of these percentiles. In Figure 3, we plot the result where we have used both the industry-specific and the economy-wide model coefficients in Tables 1 and 2 to compute the estimated default probabilities for each firm. On the x-axis, we have the estimated default frequency in a given percentile, and on the y-axis, we have the actual default frequency in each percentile. In the figure, each dot represents a percentile. In order to make the results easier to interpret, a logarithmic scale is used for both the estimated and the actual series. If the estimated models could perfectly predict the absolute riskiness of the firms within each percentile, all dots would line up along the 45-degree line drawn in the figures, corresponding to a slope-coefficient of unity and an intercept equal to zero. As can be seen in Figure 3, this is not the case for either model, but the dots are generally very close to the line, suggesting that the absolute riskiness ranking is very accurate. In particular, the results show that both the industry-specific models and the economy-wide model with macroeconomic variablesincludedpassourtestinthecross-sectionaldimension,sincetheyarenotsystematically below of above the 45-degree line, whereas the models without macroeconomic variables tend to overestimate default risk. These findings provide further support for the main theme of this paper; macroeconomic variables are key for getting the absolute risk level right, but are not important for ranking firms according to their relative riskiness in a given period. 18

5 Conclusions In this paper, we studied the interaction between macroeconomic fluctuations and default risk at the firm level using reduced-form methods; we present five main findings. First, we provide insightintothesignificanceofaggregatefluctuationsfordefaultsamongbothlistedandprivately held firms. This is important, since privately held businesses typically account for over half of GDP in developed economies. Second, a nearly exhaustive set of firm-specific background variables permits us to investigate the importance of and interaction between firm-specific variables and macroeconomic information - an area that so far has received little attention. Third, we document that a standard logit approach to model default at the firm level, using both firmspecific and macroeconomic variables, can explain the extreme default frequencies during the Swedish banking crisis of the early 1990s as well as the considerably lower default frequencies in the late 1990s. Fourth, the estimated models are shown to be very robust and successful out-of-sample, suggesting that aggregate fluctuations play an important role in understanding the absolute level of firm default risk. Fifth and finally, the width of our panel permits us to investigate the relationship between aggregate fluctuations and firm defaults across industries. This shows that macroeconomic variables have a robust impact on business defaults. We want to stress that we do not interpret our results to imply that aggregate fluctuations are the most important source of default risk at the firm level. Rather, we argue that the results suggest that macroeconomic factors shift the mean of the default risk distribution over time and thereby are the most important determinants of the average level of default. In view of these results, we conclude by providing some suggestions as to why aggregate fluctuations have an important impact on firm default behavior, over and above the effects of firm-specific variables, which themselves move in response to macroeconomic fluctuations. We have in this paper relied on the work of, among others, Bernanke, Gertler and Gilchrist (1999) and Hackbarth, Miao and Morellec (2006) to motivate why firm default should be affected by aggregate shocks. In addition, one could imagine several additional channels whereby aggregate variables might contain predictive information for firm-default risk over and above firm-specific information. One such explanation is related to the costliness of monitoring. If monitoring borrowersiscostlyforbanks,thenbanksmayuseaggregateinformationtoassesstheprobability of getting repayment on loans granted. That is, banks may form their credit-granting policies on the basis of macroeconomic forecasts and decide to not extend new lines of credit to firms with a given set of performance indicators in one particular phase of the business cycle, but 19

readily do so in another phase. The tightening and loosening of banks’ credit standards over various phases of the business cycle captures such behavior.9 Yet another argument follows a similar line of reasoning. If entrepreneurs have imperfect information about their own future business prospects, they may resort to using aggregate conditions as a basis for their decision to either invest more effort in a firm or declare bankruptcy. A final possibility is that firms may be inclined to adjust their yearly accounts, e.g., to smooth profit over time in order to please banks’ monitoring efforts, and thereby reduce the predictive power of firm-level information. We believe that further work on the theory of how macroeconomic variables affect firm defaults and assessing the empirical relevance of the arguments above are important issues for future research. References Allison, Paul D., (1995), Survival Analysis Using SAS. A Practical Guide, SAS Publishing. Altman, Edward I., (1968), Financial ratios, discriminant analysis and the prediction of corporate bankruptcy, Journal of Finance, 23(4), pp. 589-611. Altman, Edward I., (1971), Railroad bankruptcy propensity, Journal of Finance, 26(2), pp. 333-345. Altman, Edward I., (1973), Predicting railroad bankruptcies in America, Bell Journal of Economics, 4(1), pp. 184-211. Altman, Edward I., (1984), The success of business failure prediction models: An international survey , Journal of Banking and Finance, 8(2), pp. 171-198. Altman, Edward I., and Anthony Saunders, (1997), Credit risk measurement: developments over the last twenty years, Journal of Banking and Finance, 21(11-12), pp. 1721-42. Altman, Edward, Halina Frydman, and Duen-Li Kao, (1985), Introducing recursive partitioning for financial classification: the case of financial distress, Journal of Finance, XL(1), pp. 269-291. 9 TheempiricalresultsinLownandMorgan(2006)andJiménez,Ongena,PeydróandSaurina(2011)support this reasoning. 20

Bernanke, Ben S., Mark Gertler, and Simon Gilchrist (1999), ”The Financial Accelerator in a Quantitative Business Cycle Framework.” Chapter 21 in Handbook of Macroeconomics, Volume 1, edited by J.B. Taylor and M. Woodford. Amsterdam, Elsevier. Bharath, SreedharT., andTylerShumway,(2008),ForecastingdefaultwiththeMertondistance to default model, Review of Financial Studies 21(3), pp. 1339-1369. Bonfim, Diana, (2009), Credit risk drivers: Evaluating the contribution of firm level information and of macroeconomic dynamics , Journal of Banking and Finance, 33, pp. 281—299. Campbell, John, Jens Hilscher and Jan Szilagyi, (2008), In Search of Distress Risk, Journal of Finance, 63(6), pp. 2899-2939. Carling,Kenneth,KasperRoszbachandLarsRönnegard(2004),IsFirmInterdependencewithin Industries Important for Portfolio Credit Risk?, Sveriges Riksbank Working Paper Series, No 168. Das, S., D. Duffie, N. Kapadia, and L. Saita (2007). Common failings: How corporate defaults are correlated,.Journal of Finance, 62, 93—117 Diebold, Francis, and Roberto Mariano, (1995), Comparing Predictive Accuracy, Journal of Business and Economic Statistics, 13, pp. 253-265. Duffie, Darrell, Leandro Saita and Ke Wang, (2007), Multi-period corporate default prediction, Journal of Financial Economics, 83(3), pp. 635-665. Hackbarth, Dirk, Jianjun Miao and Erwan Morellec, (2006), Capital structure, credit risk and macroeconomic conditions, Journal of Financial Economics, 82(3), pp. 519-550. Jiménez,Gabriel,StevenOngena,JoséLuisPeydróandJesúsSaurina(2011),CreditSupplyand Monetary Policy: Identifying the Bank Balance-Sheet Channel with Loan Applications, American Economic Review, forthcoming. Lando, David, and Mads Stenbo Nielsen, (2010), Correlation in Corporate Defaults: Contagion or Conditional Dependence?, Journal of Financial Intermediation, 19, pp. 355-372. Lown, Cara and Donald Morgan, (2006), The Credit Cycle and the Business Cycle: New FindingsUsingthe Loan Officer Opinion Survey, Journal of Money, Credit andBanking, 38(6), pp. 1575-1597. 21

McFadden, Daniel, (1974), The measurement of urban travel demand, Journal of Public Economics, 3(4), pp. 303-328. Merton, Robert C., (1974), On the pricing of corporate debt: the risk structure of interest rates, Journal of Finance, 29, pp. 449-470. Pesaran, M. Hashem, Til Schuermann, Björn-Jakob Treutler, and Scott M. Weiner, (2006), Macroeconomic dynamics and credit risk: a global perspective, Journal of Money, Credit and Banking, 38(5), pp. 1211-1261. Shumway, Tyler, (2001), Forecasting bankruptcy more accurately: asimple hazard model, Journal of Business, 74(1), pp. 101-124. Tang, Dragon Yongjun, and Hong Yan, (2010), Market conditions, default risk and credit spreads, Journal of Banking and Finance, 24, pp. 743-753. 22

Table 1: Regression results 1990Q1-1999Q4 for the default risk model estimated with only firm-specific variables Manu- Hotel & Bank. Finance Real Consulting & Not Economy Agriculture facturing Construction Retail Restaurant Transport & Insurance Estate Rental Classified Wide Firm-specific variablesa EBITDA/TA -1.004 -1.123 -1.195 -0.825 -0.599 -1.031 -0.329 -0.755 -0.806 -0.758 -0.837 (0.094) (0.038) (0.044) (0.021) (0.034) (0.055) (0.068) (0.059) (0.026) (0.026) (0.011) TL/TA 0.693 0.835 0.452 0.471 0.240 0.840 0.138 0.472 0.292 0.244 0.395 (0.061) (0.028) (0.032) (0.013) (0.023) (0.046) (0.036) (0.028) (0.021) (0.020) (0.007) LA/TL -0.254 -0.390 -0.307 -0.272 -0.203 -0.154 -0.085 -0.089 -0.182 -0.117 -0.207 (0.067) (0.029) (0.028) (0.014) (0.033) (0.032) (0.028) (0.020) (0.012) (0.010) (0.006) I/TS 0.047 0.177 -0.064 0.131 0.241 0.472 -0.062 0.034 0.239 0.207 0.057 (0.034) (0.030) (0.027) (0.014) (0.237) (0.153) (0.032) (0.009) (0.031) (0.020) (0.005) TL/TS 0.118 0.092 0.187 0.086 0.058 0.015 0.081 0.086 0.118 0.215 0.108 (0.021) (0.005) (0.007) (0.004) (0.011) (0.010) (0.015) (0.007) (0.006) (0.006) (0.002) IP/(IP+EBITDA) 0.102 0.098 0.060 0.047 0.014 0.154 0.044 0.162 0.042 0.066 0.066 (0.032) (0.011) (0.013) (0.006) (0.017) (0.024) (0.043) (0.019) (0.012) (0.013) (0.004) PAYREMARK 1.336 1.431 1.685 1.557 1.568 1.687 2.456 1.787 1.826 2.583 1.726 (0.111) (0.041) (0.043) (0.027) (0.058) (0.063) (0.148) (0.060) (0.038) (0.051) (0.014) TAXARREARS 2.905 2.308 2.492 2.483 2.410 2.804 2.913 2.543 2.845 2.479 2.565 (0.072) (0.026) (0.027) (0.017) (0.039) (0.041) (0.102) (0.039) (0.025) (0.029) (0.009) PAYDIV -3.471 -2.912 -3.021 -3.268 -2.458 -3.406 -3.690 -3.187 -2.878 -3.578 -3.173 (0.709) (0.168) (0.190) (0.133) (0.335) (0.410) (1.004) (0.355) (0.159) (0.260) (0.071) TTLFS 4.013 3.513 3.931 3.543 3.326 3.913 3.618 3.665 3.773 3.906 3.694 (0.066) (0.024) (0.027) (0.015) (0.038) (0.040) (0.085) (0.032) (0.022) (0.025) (0.008) Mean log-likelihood -0.024 -0.042 -0.045 -0.052 -0.071 -0.036 -0.025 -0.051 -0.033 -0.076 -0.046 Pseudo R2 0.38 0.28 0.35 0.31 0.30 0.38 0.38 0.33 0.37 0.42 0.33 Pseudo R2 | agg.par. b 0.36 0.27 0.34 0.31 0.30 0.37 0.30 0.33 0.37 0.41 0.33 Industry R2 0.32 0.48 0.46 0.49 0.47 0.44 0.20 0.59 0.45 0.68 0.42 Industry R2 | agg.par. b -4.88 0.46 0.39 0.46 0.40 0.29 -4.22 0.59 0.14 -0.82 0.51 Number of obs 260,941 1,242,335 914,337 2,260,316 263,648 502,158 120,995 444,609 1,611,700 485,137 8,106,176 Notes: Standard errors in parentheses. The variables are not scaled, so the importance of a variable cannot be interpreted directly from the size of the parameter estimate. a See Section 2 for definitions of these variables. b Pseudo R². | agg.par is the Pseudo R² value calculated for each industry using the estimated coefficients in the economy-wide model (i.e., the coefficients in the last column in the table above). The pseudo R² values are calculated according to McFadden (1974). In addition to the coefficients reported above, three more variables were included (but not reported). First, an industry-specific intercept. Second, since the bankruptcy rate is systematically lower in the third quarter (most likely due to Swedish courts' summer holiday period in July-August), a seasonal dummy is included to capture this phenomenon. Third, because no data on the payment records of firms (i.e., the dummy variables PAYREMARK and TAXARREARS) exist prior to 1992Q3 for legal storage reasons, the models also include one additional variable common to all i firms that is constructed to be an estimate of the average value of the sum of the payment record variables PAYREMARK and TAXARREARS for the quarters 1990Q1-1992Q2. This variable was constructed by estimating a logit model for the event of either of the dummy variables PAYREMARK and TAXARREARS taking on the value 0 or 1 for the period 1992Q3-1999Q2, using all the other variables in the model in Table 1 as regressors (except PAYREMARK and TAXARREARS, of course). The imputed average value for this variable for the period 1990Q1-1992Q2 (after 1992Q2, it is set to nil) was then constructed as the average estimated probability for each firm and period.

Table 2: Regression results 1990Q1-1999Q4 for the default risk model estimated with both firm-specific and aggregate variables Bank, Manu- Hotel & Finance & Real Consulting & Not Economy Agriculture facturing Construction Retail Restaurant Transport Insurance Estate Rental Classified Wide Firm-specific variablesa EBITDA/TA -1.005 -1.092 -1.119 -0.794 -0.584 -1.016 -0.312 -0.675 -0.811 -0.753 -0.822 (0.093) (0.038) (0.043) (0.021) (0.034) (0.055) (0.068) (0.058) (0.027) (0.026) (0.011) TL/TA 0.685 0.817 0.432 0.476 0.244 0.811 0.124 0.488 0.264 0.230 0.383 (0.062) (0.028) (0.033) (0.013) (0.023) (0.046) (0.037) (0.029) (0.021) (0.020) (0.007) LA/TL -0.270 -0.391 -0.303 -0.278 -0.208 -0.158 -0.083 -0.102 -0.182 -0.116 -0.204 (0.067) (0.029) (0.028) (0.014) (0.033) (0.033) (0.028) (0.021) (0.012) (0.010) (0.006) I/TS 0.022 0.174 -0.075 0.119 0.245 0.468 -0.075 0.024 0.189 0.184 0.044 (0.035) (0.030) (0.027) (0.014) (0.236) (0.157) (0.032) (0.009) (0.032) (0.020) (0.005) TL/TS 0.111 0.084 0.182 0.073 0.040 0.004 0.087 0.073 0.110 0.210 0.101 (0.021) (0.005) (0.007) (0.004) (0.011) (0.011) (0.015) (0.007) (0.006) (0.006) (0.002) IP/(IP+EBITDA) 0.095 0.086 0.052 0.039 0.006 0.130 0.044 0.139 0.036 0.060 0.056 (0.031) (0.011) (0.012) (0.006) (0.017) (0.023) (0.042) (0.019) (0.011) (0.013) (0.004) PAYREMARK 1.449 1.558 1.841 1.676 1.668 1.828 2.539 1.940 1.950 2.688 1.850 (0.113) (0.042) (0.043) (0.027) (0.058) (0.064) (0.150) (0.061) (0.038) (0.051) (0.015) TAXARREARS 2.983 2.423 2.647 2.602 2.494 2.904 3.014 2.639 2.989 2.587 2.684 (0.073) (0.027) (0.028) (0.017) (0.040) (0.042) (0.104) (0.041) (0.026) (0.030) (0.009) PAYDIV -3.352 -2.728 -2.806 -3.095 -2.299 -3.232 -3.509 -2.968 -2.714 -3.459 -3.007 (0.709) (0.168) (0.190) (0.134) (0.335) (0.410) (1.003) (0.355) (0.159) (0.261) (0.071) TTLFS 3.954 3.421 3.822 3.446 3.228 3.848 3.598 3.500 3.700 3.849 3.608 (0.067) (0.025) (0.027) (0.016) (0.039) (0.040) (0.086) (0.033) (0.022) (0.025) (0.008) Aggregate variablesb Output gap -0.130 -0.131 -0.200 -0.126 -0.141 -0.138 -0.137 -0.171 -0.133 -0.070 -0.133 (0.021) (0.007) (0.008) (0.005) (0.011) (0.012) (0.031) (0.011) (0.007) (0.008) (0.003) Nominal interest rate 0.066 0.062 0.083 0.070 0.048 0.060 0.096 0.115 0.079 0.060 0.073 (0.013) (0.004) (0.005) (0.003) (0.007) (0.007) (0.018) (0.007) (0.004) (0.005) (0.002) GDP inflation -0.043 0.016 -0.020 0.020 0.037 0.020 -0.084 0.000 0.013 0.027 0.012 (0.018) (0.006) (0.007) (0.004) (0.010) (0.011) (0.027) (0.009) (0.006) (0.007) (0.002) Real exchange rate -0.021 -0.034 -0.030 -0.023 -0.021 -0.031 -0.034 -0.027 -0.031 -0.021 -0.026 (0.006) (0.002) (0.002) (0.001) (0.003) (0.004) (0.009) (0.003) (0.002) (0.002) (0.001) Mean log-likelihood -0.024 -0.042 -0.044 -0.051 -0.070 -0.035 -0.025 -0.050 -0.032 -0.076 -0.046 Pseudo R². 0.38 0.28 0.36 0.32 0.30 0.38 0.38 0.35 0.38 0.43 0.34 Pseudo R². | agg.coeffs. c 0.38 0.28 0.36 0.32 0.31 0.38 0.32 0.34 0.38 0.41 0.34 Industry R2 0.75 0.92 0.91 0.95 0.89 0.86 0.71 0.87 0.93 0.93 0.95 Industry R2 | agg.coeffs. c -4.95 0.90 0.85 0.92 0.72 0.66 -4.86 0.83 0.47 -0.48 0.95 Number of obs 260,941 1,242,335 914,337 2,260,316 263,648 502,158 120,995 444,609 1,617,700 485,137 8,106,176 Notes: Standard errors in parentheses. The variables are not scaled, so the importance of a variable cannot be interpreted directly from the size of the parameter estimate. a See Section 2 for definitions of these variables. b See Section 2.2 for definitions. c Pseudo R². | agg.coeffs. is the Pseudo R² value calculated for each industry using the estimated coefficients in the economy-wide model (i.e., the coefficients in the last column in the table above). The pseudo R² values are calculated according to McFadden (1974). In addition to the coefficients reported above, three more variables were included (but not reported). First, an industry-specific intercept. Second, since the bankruptcy rate is systematically lower in the third quarter (most likely due to Swedish courts' summer holiday period in July-August), a seasonal dummy is included to capture this phenomenon. Third, because no data on the payment records of firms (i.e., the dummy variables PAYREMARK and TAXARREARS) exist prior to 1992Q3 for legal storage reasons, the models also include one additional variable common to all i firms that is constructed to be an estimate of the average value of the sum of the payment record variables PAYREMARK and TAXARREARS for the quarters 1990Q1-1992Q2. This variable was constructed by estimating a logit model for the event of either of the dummy variables PAYREMARK and TAXARREARS taking on the value 0 or 1 for the period 1992Q3-1999Q2, using all the other variables in the model in Table 1 as regressors (except PAYREMARK and TAXARREARS, of course). The imputed average value for this variable for the period 1990Q1-1992Q2 (after 1992Q2, it is set to nil) was then constructed as the average estimated probability for each firm and period.

Table 3: Out-of-Sample Root Mean Squared Error (RMSE) for various models Model RMSE (in percent) b Bank, Manu- Hotel & Finance & Consulting Not Industry Economy Absolute RMSE for Model ja Agriculture facturing Construction Retail Restaurant Transport Insurance Real- Estate & Rental Classified aggregate Wide Firm-specific and macro 0.0821 0.1013 0.1145 0.1148 0.3110 0.0735 0.1200 0.1322 0.0664 0.7843 0.0716 0.0802 Only firm-specific variables 0.1848 0.2986 0.3725 0.4103 0.7034 0.2569 0.1456 0.4516 0.2342 0.4411 0.2856 0.2902 Economy-wide coefficients 0.2516 0.0874 0.1173 0.1058 0.2740 0.0830 0.1863 0.2732 0.1419 1.5866 0.0802 0.0802 dc Time series random walk 0.0877 0.1297 0.1361 0.1418 0.2102 0.1489 0.1467 0.0734 0.1013 0.6084 0.1365 0.1371 dc Industry OLS macroregression 0.1514 0.1124 0.1879 0.2107 0.4808 0.1582 0.1263 0.3708 0.1084 0.7933 0.1320 0.1308 dc 4 quarter moving average 0.0703 0.1241 0.1035 0.1068 0.1598 0.1143 0.1289 0.0674 0.0858 0.4921 0.1031 0.1033 RMSE model j / RMSE Table 2 model c Only firm-specific variables 2.2509 2.9477 3.2533 3.5740 2.2617 3.4952 1.2133 3.4160 3.5271 0.5624 3.9888 3.6185 Economy-wide coefficients 3.0646 0.8628 1.0245 0.9216 0.8810 1.1293 1.5525 2.0666 2.1370 2.0230 1.1201 1.0000 dc Time series random walk 1.0682 1.2804 1.1886 1.2352 0.6759 2.0259 1.2225 0.5552 1.5256 0.7757 1.9064 1.7095 dc Industry OLS macroregression 1.8441 1.1096 1.6410 1.8354 1.5460 2.1524 1.0525 2.8048 1.6325 1.0115 1.8436 1.6309 dc 4-quarter moving average 0.8563 1.2251 0.9039 0.9303 0.5138 1.5551 1.0742 0.5098 1.2922 0.6274 1.4399 1.2880 Notes: The RMSEs have been computed as one-step-ahead forecasts for the period 2000Q1-2009Q2 The RMSE ratios have been computed relative to the first row in the upper panel, i.e., the industry-specific models. All models were estimated for the period 1990Q1-1999Q4. Industry aggregate RMSEs have been computed by summing the default frequency probabilities implied by each industry model quarterly. a Note that the macro variables in these forecasting models are lagged one quarter, so that all models are based on the same information set. b Notice that the RMSE numbers are expressed in percent, i.e., c fitted and actual default numbers are multiplied by 100 before the RMSE numbers are computed. Values in bold (italics) style indicate that the RMSE-ratios are significantly better (worse) according to the dc Diebold-Mariano test at the 5 percent level (two-sided test). The Diebold-Mariano test is computed under the assumption of no serial correlation in the RMSE differentials. These models are estimated on industry or economy-wide data, and not at the firm level.

Table 4: Out-of-sample Pseudo R2 and decile tests at the industry level Industry-specific model coefficients Hotel & Bank. Finance & Consulting & Not Industry Agriculture Manufacturing Construction Retail Restaurant Transport Insurance Real-Estate Rental Classified aggregate Pseudo R² 0.35 0.29 0.41 0.33 0.35 0.39 0.44 0.26 0.40 0.46 0.38 Decile 1 0.79 0.74 0.85 0.76 0.80 0.82 0.81 0.77 0.81 0.79 0.81 2 0.09 0.11 0.06 0.09 0.07 0.08 0.04 0.06 0.07 0.08 0.06 3 0.05 0.06 0.03 0.05 0.03 0.04 0.04 0.04 0.04 0.04 0.04 4 0.02 0.04 0.02 0.04 0.02 0.02 0.02 0.03 0.03 0.03 0.03 5 0.02 0.02 0.01 0.02 0.02 0.01 0.05 0.03 0.02 0.02 0.02 6 - 10 0.04 0.04 0.03 0.04 0.06 0.03 0.04 0.08 0.03 0.04 0.04 Sum 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 Economy-wide model coefficients Hotel & Bank. Finance & Consulting & Not Agriculture Manufacturing Construction Retail Restaurant Transport Insurance Real-Estate Rental Classified Aggregate Pseudo R² 0.28 0.30 0.41 0.35 0.34 0.39 0.37 0.28 0.37 0.47 0.38 Decile 1 0.79 0.72 0.85 0.75 0.79 0.81 0.81 0.76 0.81 0.80 0.78 2 0.08 0.10 0.06 0.08 0.08 0.07 0.04 0.05 0.07 0.08 0.07 3 0.05 0.07 0.03 0.06 0.03 0.03 0.04 0.04 0.04 0.04 0.05 4 0.02 0.04 0.02 0.04 0.03 0.03 0.03 0.03 0.03 0.03 0.03 5 0.02 0.02 0.01 0.02 0.02 0.02 0.03 0.03 0.02 0.02 0.02 6 – 10 0.04 0.04 0.03 0.04 0.06 0.04 0.04 0.09 0.03 0.04 0.04 Sum 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 Notes: The out-of-sample period is 2000Q1-2009Q2, and the total number of firms in the panel for this period is 8,822,345. The decile test outcomes in the table are obtained by sorting the estimated default probabilities, in descending order, and by computing observed default frequencies in the different deciles of the sorted data. The coefficients used for calculating the default probabilities are the ones presented in Table 3. Industry aggregate numbers are obtained by generating the estimated default probabilities using industry-specific model coefficients, then aggregating observations over the various industries to form a single data set, to which, finally, the procedure outlined above is applied in order to compute default frequencies for various deciles.

0,035 0,03 0,025 0,02 0,015 0,01 0,005 0 Figure 1: Actual (solid) and projected (dashed, dotted) aggregate default frequency rates 1990Q1-2009Q2. The projected rates are constructed using the estimated economy-wide models in Table 1 (dotted) and Table 2 (dashed). The models are estimated on data until 1999Q4. The projections shown to the right of the vertical line are out-of-sample. etaR tluafeD Actual Firm Specific Firm Specific and Macro

Figure 2: Time-varying coefficients for the financial ratios in the economy-wide model Notes: Coefficients have been calculated by estimating a separate economy-wide model for each quarter up to 1999Q4. The macroeconomic variables have been excluded from the model. Intercepts are allowed to vary by quarter. Solid lines are coefficient estimates, dashed lines are 95-percent confidence bands based on the standard errors from each quarterly regression. The solid (red) horizontal lines correspond to the estimated coefficients in the economy-wide model in Table 1.

Figure 3: Sorted estimated default percentiles versus actual default frequencies for both economy-wide and industry-specific parameters. Left panel: Only firm-specific variables included (Table 1 models); Right panel: Macrovariables included (Table 2 models).

Appendix A Data A.1 De(cid:133)nition of default AsdescribedinSection2, thedefaultde(cid:133)nitionweadoptisthefollowing: a(cid:133)rmisconsideredto be in default whenever one of the following events occurs: the (cid:133)rm is declared legally bankrupt; has suspended payments; has negotiated a debt composition settlement; is undergoing a reconstruction; or is distraint without assets. The data on Swedish public and private (cid:133)rms that we use to construct the default variable have been provided by Upplysningscentralen AB (UC), the main Swedish credit bureau, jointly owned by the Swedish banks. UC taps its information from Tingsr(cid:228)tten, the District Courts, Bolagsverket, the Swedish Companies Registration O¢ ce (SCRO), and Kronofogdemyndigheten, the Swedish Enforcement Authority, i.e. the institutions that deal with the legal formalities in (cid:133)rms· bankruptcy processes.A.1 UC stores information on (cid:133)rms minor and major distress events in two databases, AM and JP. In the (cid:133)rst database, AM, variables are constructed by giving each type of event a label AMTYPXX, a Swedish acronym for remark type, and an integer number su¢ x. For example, AMTYP12 is a dummy variable indicating if a (cid:133)rm has suspended its payments, or not. The second database, JP, contains 27 variables in total, on various milestones and stages for a broader category of major (mostly, but not exclusively) distress events that may occur for incorporated Swedish (cid:133)rms. From this database we take variables that are related to legal bankruptcy: "bankruptcy procedures started," "bankruptcy procedures concluded," "bankruptcy procedures concluded with a surplus," "bankruptcy procedures continued," and "declaredbankrupt."Moreover, wealsoinclude: "negotiationsonadebtcompositionsettlement started," and "negotiations on a debt composition settlement concluded." If any of the above distress-event dummy variables equals one at some point in our sample period, the (cid:133)rm in question is considered to be in default in that particular quarter. In the following quarter, we let the (cid:133)rm exit our data set. If more than one of these distress events are observed for a speci(cid:133)c (cid:133)rm over our sample period, we assume the (cid:133)rm in question defaulted in the (cid:133)rst instance. An additional variable we use from the second data set indicates if a "bankruptcy [was] cancelled" by a court. Over the whole sample period (i.e., in-sample and out-of-sample) this occurs 24 times, and 16 of these 24 events concern (cid:133)rms that subsequently end up in default. We treat (cid:133)rms for which the bankruptcy status was cancelled by the District A.1 Currently, i.e., on July 9, 2010, there are 462 publicly listed (cid:133)rms out of roughly 250,000 active, limited liability (cid:133)rms. 30

Court as healthy until the data indicate otherwise. Moreover, we let (cid:133)rms that default but reemerge from their default status exit the data set after the quarter in which default takes place; they re-enter in the quarter in which UC registered that the default status had been "removed." Our decision to let (cid:133)rms that default exit the data set in the subsequent quarter is based on the following statistics: out of 161,550 defaults in the entire data set, 148,874 are terminal in the sensethatsubsequentlynonewinformationonthe(cid:133)rmsappearsinanyofthedatabases.A.2 The remaining 12,676 observations concern 6,638 (cid:133)rms that default twice within the sample period. Thus, 5,999 (cid:133)rms end up in terminal default at the second occurrence, while 339 re-emerge even after the second default. No (cid:133)rm defaults more than twice in the sample period. Out of the 148,874 (cid:133)rst-time-is-terminal defaults, an overwhelming majority of 145,684 are due to legal bankruptcy declarations. Roughly 60 percent of these (cid:133)rms experience a second default-triggering distress event simultaneously, i.e., in the same quarter. In almost all cases (98 percent) these are due to the event "bankruptcy proceedings initiated." In most (88 percent) of the remaining terminal defaults, i.e., those that are not legal bankruptcies, are associated with "distraint, no assets." The remaining distress events account for less than 1 percent of the (cid:133)rst-time-is-terminal defaults. For the (cid:133)rms that re-emerge after a default, the (cid:133)rst default involves a legal bankruptcy in less than 2 percent of all cases and "distraint, no assets" in close to 90 percent. At their second default, these percentages are 97 percent due to legal bankruptcy, and 6 percent to (cid:147)distraint, no assets,(cid:148)in the cases of terminal defaults. Among the (cid:133)rms that experience a second but non-terminal default, 62 percent has the status of the "distraint, no assets." A.2 Balance sheet data By means of a two-step procedure, the six (cid:133)nancial ratios were selected as: earnings before interest, depreciation, taxes and amortization (EBITDA) over total assets (TA) (earnings ratio); interest payments (IP) over the sum of interest payments and earnings before interest, depreciation, taxes and amortization (interest coverage ratio); total liabilities (TL) over total assets (leverage ratio); total liabilities over total sales (TS) (debt ratio); liquid assets (LA) in relation to total liabilities (quick ratio); and inventories (I) over total sales (inventory turnover ratio).A.3 A.2 Firms that are declared bankrupt at some point do not disappear from the databases that UC maintain. Firm identi(cid:133)ers (organisationsnummer) are unique and are never re-cycled by Swedish tax authorities. A.3 It should be noted that the log-level of debt, in addition to the leverage ratio (TL /TA ) for (cid:133)rm i in i;t i;t period t, contains predictive power for default. We therefore decided to include TL as a separate variable, but i;t scaled it with average total sales in period t to obtain a stationary ratio. Thus, the debt-to-sales ratio is de(cid:133)ned as log(TL =TS );where TS denotes average total sales in period t: i;t t t 31

Table A.1 in Appendix A.3 provides an account of the variables considered by other well-known studies in the literature. First, theunivariaterelationshipbetweenaratioanddefaultriskwasinvestigated. Byvisual inspection, ratios that are largely uncorrelated with default risk were eliminated from the set of candidate explanatory variables. Figure A.1 illustrates this for the six selected ratios by comparing default rates (jagged line) and the cumulative distributions of the variables (smooth line)forallobservationsintheperiod1990Q1 2009Q2. Thedefaultrateforagivenobservation (cid:0) of a ratio is calculated as an average over the interval of +/- 5000 adjacent observations in the empirical distribution of the ratio at hand. The cumulative distribution at any point X on the 0 X-axis gives the share of defaulted (cid:133)rms for which the (cid:133)nancial ratio is smaller than X . Given 0 the density of the observations, there is a positive relationship between default and the leverage, interest coverage and turnover ratios, while there is a negative relationship for both the earnings and the liquidity ratios. Moreover, Figure A.1 suggests that the relationships between default and the earnings ratio, total liability over total sales ratio and interest costs over the sum of interest costs and earnings are all non-linear. For instance, for the interest coverage variable, this relationship is quite intuitive. The ratio turns highly negative if earnings are negative and slightlylargerthaninterestpaymentsinabsolutevalue,whichisassociatedwithanincreasedrisk of default. On the other tack, large interest payments and low earnings will also make the ratio large, positively, which is likewise associated with an increased default risk. Similar reasoning can be be applied to the other ratios. In the second step in the selection procedure, variables that did not enter signi(cid:133)cantly were subsequently dropped one by one to get the (cid:133)nal set of variables. For instance, standard variables like size (proxied by total sales) and age (proxied by the number of periods in the panel) were dropped in this second step as they were found to be insigni(cid:133)cant in the full model. Regarding the de(cid:133)nition of the dummy variable for (cid:133)rms that have not submitted a (cid:133)nancial statement, TTLFS, there are three points worth noting. First, this information is assumed to be available with a 6-quarter time lag, since (cid:133)nancial statements for year (cid:28) are typically available in the third quarter of year (cid:28) +1. By letting the dummy variable equal unity with a 6-quarter time-lag, we account for the real-time delay. Second, given the way we de(cid:133)ne the population of existing (cid:133)rms, recently registered (cid:133)rms entering the panel would automatically be assigned TTLFS = 1 in the third quarter of their existence, since they have not, of course, issued any (cid:133)nancial statement prior to entering. For these new (cid:133)rms, TTLFS has been set to 0 and the 32

accounting data variables have been taken from their (cid:133)rst yearly balance sheet and income statement. Third, for defaulting (cid:133)rms that are in the panel but have on no occasion submitted an annual report, we also set TTLFS equal to 0. This is the case for about 20 percent of the 161;550 defaulting (cid:133)rms in the panel. So, although TTLFS turns out to be very important in the default-risk models, this feature is down-played rather than exaggerated in the construction of the variable. A.3 Descriptive statistics for winsorized data In Table A.2, we report the means and standard deviations for a set of accounting ratios, payment remarks, and a variable that measures the average elapsed time since the latest (cid:133)ling of a (cid:133)nancial statement for the (cid:133)nal data set that is used in the estimations in Section 3. The table distinguishes between defaulted and non-defaulted (cid:133)rms, at the aggregate as well as the industry level, for the in-sample period 1990Q1 1999Q4, that is, the sub-sample pe- (cid:0) riod for which we will specify and estimate all subsequent models. For this period, we have a total of 8;106;176 observations of which 105;605 are defaults. Analyses of industry e⁄ects will be conducted at the one-digit level to ensure su¢ ciently many default observations in each industry in both the cross-sectional and the time series dimensions. The ten industries are; agriculture, manufacturing, construction, retail, hotel and restaurant, transportation, banking, (cid:133)nance and insurance, real-estate, consulting and rental, and (cid:133)nally a residual industry labeled (cid:147)not classi(cid:133)ed(cid:148). Because of the varying availability of data, the statistics in Table A.2 are calculated based on slightly di⁄erent numbers of observations for the variables in a given industry. Dealing with microdata sets of this size invariably involves dealing with outliers. These observations would distort the estimation results if they were to be included in the logit model and therefore, we have winsorized the top and bottom 1 percent observations for the accounting variables in each industry.A.4 Given the large number of observations in our data set, this approach is practically more or less equivalent to simply deleting 1 percent of the observations that have accounting data that fall outside a certain region. Note that we choose to winsorize the observations in each industry separately, rather than at the aggregate level, thereby implicitly allowing for dispersion and di⁄erent means in di⁄erent industries. Table A.2 shows the descriptive statistics for the A.4 Winsorization is quite common in the literature using (cid:133)nancial ratios to avoid outliers that are created by near-zero denominators. Shumway (2001) winsorizes the top and bottom 1 percent of all observations. It should be emphasized that the results are robust to varying the winsorization rate between 0:5 and 2 percent. 33

winsorized microdata set.A.5 A.4 Macro data The aggregate time series are depicted in Figure A.2. They are: the output gap (i.e., the deviation of GDP from its trend value), the yearly in(cid:135)ation rate (measured as the fourth di⁄erence of the GDP de(cid:135)ator), the repo interest rate (a short-term nominal interest rate, set by the Riksbank), and the real exchange rate. The repo rate was extremely high in the third quarter of 1992 due to the Riksbank having raised the so-called marginal interest rate to 500 percent, unexpectedly and temporarily, in an attempt to defend the (cid:133)xed exchange rate. If the repo rate is not adjusted for this exceptional event, the estimation procedure would lead to underestimation of the importance of (cid:133)nancial costs for default behavior. We therefore adjust the repo rate series in the third quarter of 1992 by means of a simple of a simple regression R = b +b D923+b R +" . The estimated t 1 2 3 t 1 t (cid:0) dummy coe¢ cient ^b equals 28:2, and we therefore adjusted the repo rate 1992Q3 to equal 9:8 2 percent instead of 38 percent. The output gap series is computed by HP-(cid:133)ltering GDP, where the smoothing coe¢ cient (cid:21) is set to the standard value of 1;600. The real exchange rate is measured as the nominal TCW-weighted (TCW = trade competitive weights) exchange rate times the TCW-weighted foreign price level (CPI de(cid:135)ators) divided by the domestic CPI de(cid:135)ator. Note that a larger value for the real exchange rate implies a depreciation; hence a negative estimated coe¢ cient for this variable implies that a depreciation on average reduces the risk of default at a given point in time. During the sample period, the real exchange rate is characterized by an upward trend (i.e.,a tendency of gradual depreciation) and it is therefore detrended with the HP-(cid:133)lter as well to achieve stationarity. Appendix B.3 veri(cid:133)es that the results are robust with respect to our detrending procedure of the real exchange rate. In the output-gap series in Figure A.2 there is clear evidence of the deep recession in the beginningofthe1990swithanegativeoutputgapofmorethan4percentin1993. Theeconomic rebound of 1994-1995 is also evident, as is the IT-boom bust cycle in the late 1990. Finally, A.5 Comparisonofthedescriptivestatisticsforunwinsorizeddatamakesitclearthatdefaulted(cid:133)rmsaredisproportionally more a⁄ected when winsorizing all observations jointly. Since the PAYREMARK, TAXARREARS, PAYDIV and TTLFS are dummy variables that are una⁄ected by choice of winsorization procedure, a joint one couldleadtounderestimationoftheimportanceoftheaccountingdatavariablesinthedefaultriskmodelrelative tothesedummyvariables. Tochecktherobustnessofourchosenapproach,weusedanalternativeapproachwhere we truncated the healthy and defaulted (cid:133)rms separately. As expected, the estimation results of the default-risk modelwiththisalternativewinsorizationsuggestasomewhatlargerrolefortheaccountingratios,buttheoverall picture remains the same. 34

Figure A.2 documents the exceptionally sharp downturn in the economy during late 2008 and the beginning of 2009. 35

Table A.1: Firm-specific explanatory variables used in papers focusing on the development of models of default risk RHS variables Paper Year Sample Firms Model LHS variable Liquidity Profitability and efficiency Solvency and leverage Size Other Altman 1968 66 Listed Discriminant Bankruptcy (CA-CL)/TA RE/TA, EBIT/TA, TS/TA MVE/TC Frydman et al. 1985 200 Listed Discriminant Bankruptcy LA/TA, CA/TA, LA/CL, CA/CL CF/TD, EBIT/TA, NI / TA MVE/TC, Ln(IC) Ln(TA) Shumway 2001 39,745 Listed Panel Logit Bankruptcy (CA-CL)/TA CA/CL RE/TA, EBIT/TA, NI / TA, TS/TA MVE/TL, TL/TA Ln(MVE/market) r E-r M σE-σM Pesaran et al. 2006 [2,219] Listed Merton Default r E MVE TE/TA σE CR Duffie et al. 2007 392,404 Listed Hazard Default r E DtD Bonfim 2008 113,119 Mixed Probit/Hazard Loan default LA/CL NI/TA TS/TA TL/TA, TE/TA IR TSG Bharath, Shumway 2008 1.016m Listed Merton Distance to default NI/TA MVE TD σE "" " Hazard Time to default NI/TA MVE TD PD Notes Table A.1: Lists of variables reflect the main model presented in each paper. In cases where no preferred model" was presented, the list reflects variables with significant variables in any model.The number of observations in a paper can vary dependent on model specification. CA = Current Assets, CL = Current Liabilities, TA = Total Assets, RE = Retained Earnings, EBIT = Earnings Before Interest and Taxes, TS = Total Sales, MVE = Market Value of Equity, TC = Total Credit, LA = Liquid Assets, CF = Cash Flow, TD = Total Debt, NI = Net Income, IC = Interest Coverage, r E = Return on Equity, r M = Market Return on Equity, σE = Volatility of Stock Returns, σM = Volatility of Market Stock Returns, TE = Total Equity, CR = credit Rating, DtD = Distance to Default, IR = Investment Rate, TSG = Total Sales Growth, PD = Probability of default. The term quick assets has been named liquid assets in this table because no consensus on the exact difference between liquid and quick assets. Some sources include inventories in liquid assets, but exclude them from quick assets.

Table A.2: Descriptive statistics for truncated firm-specific micro data 1990Q1-1999Q4 Hotel & Bank, Finance Consulting & Agriculture Manufacturing Construction Retail Resturant Transport & Insurance Real- Estate Rental Not Classified Total Defaulted Number Obs 1643 13052 11843 32350 5457 5147 811 6470 14619 14213 105605 EBITDA/TA -0.04 -0.01 0.01 -0.06 -0.12 0..01 -0.09 0.01 -0.03 -0.07 -0.04 (0.35) (0.29) (0.29) (0.35) (0.55) (0.33) (0.66) (0.27) (0.45) (0.45) (0.38) TL/TA 1.09 0.97 0.96 1.04 1.20 1.01 1.12 1.02 0.94 0.99 1.01 (0.55) (0.39) (0.40) (0.56) (0.82) (0.44) (1.25) (0.55) (0.59) (0.67) (0.57) LA/TL 0.14 0.13 0.18 0.17 0.20 0.19 0.43 0.18 0.34 0.36 0.22 (0.73) (0.57) (0.57) (0.70) (0.64) (0.74) (1.86) (0.85) (1.22) (1.41) (0.91) I/TS 0.38 0.20 0.15 0.27 0.04 0.01 0.20 0.35 0.08 0.16 0.19 (0.84) (0.31) (0.42) (0.45) (0.06) (0.04) (1.33) (1.54) (0.25) (0.48) (0.56) TL/TS -1.12 -2.64 -1.64 -2.46 -1.72 -2.34 -2.56 -0.62 -1.69 -1.54 -1.98 (1.71) (1.78) (1.78) (1.69) (1.61) (1.70) (2.56) (2.19) (1.88) (2.18) (1.93) IP/(IP+EBITDA) 0.25 0.24 0.18 0.23 0.19 0.23 0.23 0.41 0.15 0.20 0.22 (0.96) (1.02) (0.87) (1.17) (1.00) (0.76) (1.05) (0.83) (0.86) (0.92) (1.00) PAYDIV 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 (0.03) (0.05) (0.05) (0.04) (0.04) (0.03) (0.04) (0.04) (0.05) (0.03) (0.04) TTLFS 0.41 0.30 0.37 0.36 0.35 0.40 0.51 0.44 0.44 0.50 0.39 (0.49) (0.46) (0.48) (0.48) (0.48) (0.49) (0.50) (0.50) (0.50) (0.50) (0.49) REMARK1 0.15 0.12 0.15 0.14 0.16 0.16 0.21 0.14 0.17 0.17 0.15 (0.36) (0.32) (0.36) (0.35) (0.37) (0.37) (0.41) (0.35) (0.37) (0.37) (0.36) REMARK2 0.47 0.36 0.46 0.39 0.46 0.49 0.42 0.34 0.44 0.38 0.41 (0.50) (0.48) (0.50) (0.49) (0.50) (0.50) (0.49) (0.47) (0.50) (0.49) (0.49) Non-defaulted Number Obs 259298 1229283 902494 2227966 258191 497011 120184 438139 1597081 470924 8000571 EBITDA/TA 0.14 0.12 0.12 0.08 0.08 0.17 0.08 0.09 0.14 0.13 0.11 (0.25) (0.21) (0.21) (0.24) (0.35) (0.23) (0.42) (0.20) (0.28) (0.34) (0.25) TL/TA 0.70 0.69 0.71 0.74 0.87 0.72 0.68 0.80 0.63 0.65 0.71 (0.34) (0.30) (0.29) (0.36) (0.50) (0.30) (0.70) (0.39) (0.36) (0.43) (0.37) LA/TL 0.48 0.43 0.44 0.43 0.38 0.46 1.42 0.42 0.90 0.99 0.58 (1.12) (1.01) (0.79) (1.03) (0.77) (0.97) (4.67) (1.31) (1.82) (2.03) (1.42) I/TS 0.24 0.14 0.11 0.19 0.03 0.01 0.36 0.19 0.04 0.07 0.12 (0.57) (0.24) (0.31) (0.34) (0.06) (0.05) (1.80) (1.11) (0.18) (0.34) (0.44) TL/TS 0.66 0.50 0.58 0.33 0.51 0.48 4.33 5.09 0.64 0.50 0.80 (1.30) (1.85) (1.81) (0.90) (1.25) (1.74) (26.14) (20.15) (2.26) (1.58) (6.03) IP/(IP+EBITDA) 0.15 0.15 0.12 0.18 0.17 0.14 0.18 0.29 0.10 0.11 0.15 (0.71) (0.72) (0.72) (0.87) (0.79) (0.54) (0.94) (0.68) (0.75) (0.75) (0.77) PAYDIV 0.14 0.15 0.13 0.13 0.06 0.13 0.13 0.08 0.16 0.12 0.13 (0.35) (0.36) (0.34) (0.33) (0.24) (0.34) (0.33) (0.28) (0.36) (0.32) (0.34) TTLFS 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 (0.03) (0.03) (0.03) (0.03) (0.03) (0.03) (0.04) (0.03) (0.03) (0.03) (0.03) REMARK1 0.00 0.00 0.00 0.00 0.01 0.00 0.00 0.00 0.00 0.00 0.00 (0.05) (0.06) (0.06) (0.06) (0.08) (0.05) (0.05) (0.07) (0.05) (0.05) (0.06) REMARK2 0.03 0.03 0.04 0.03 0.06 0.04 0.02 0.03 0.03 0.03 0.03 (0.16) (0.18) (0.20) (0.18) (0.24) (0.18) (0.14) (0.16) (0.17) (0.16) (0.18) Notes: The definition of variables are: EBITDA = earnings before taxes, interest payments and depreciations; TA = total assets; TL = total liabilities; LA = liquid assets; I = inventories; TS = total sales; IP = sum of net interest payments on debt and extra-ordinary net income; PAYDIV = a dummy variable equal 1 if the firm has paid out dividends during the accounting period and 0 otherwise; TTLFS = a dummy variable equal to 1 if the firm has not submitted an annual report in the previous year, and 0 otherwise; REMARK1 = a dummy variable taking the value of 1 if the firm has a payment remark due to one or more of the following events in the preceding four quarters; (i) a "non-performing loan" at a bank, or (ii) a bankruptcy petition, or (iii) issuance of a court order to pay a debt, or (iv) seizure of property; REMARK2 = a dummy variable taking the value of 1 if the firm is in various tax arrears.

8 6 4 2 0 −1.8 −1.4 −1 −0.6 −0.2 0.2 0.6 1 etaR tluafeD 10 8 6 4 2 0 −1.8 −1.4 −1 −0.6 −0.2 0.2 0.6 1 0 0.4 0.8 1.2 1.6 2 2.4 2.8 EBITDA/TA etaR tluafeD 0 0.4 0.8 1.2 1.6 2 2.4 2.8 TL/TA 3.5 3 2.5 2 1.5 1 0.5 0 0 0.6 1.2 1.8 2.4 3 etaR tluafeD 2.5 2 1.5 1 0.5 0 0 0.6 1.2 1.8 2.4 3 −2 1 4 7 10 I/TS etaR tluafeD −2 1 4 7 10 LA/TL 2.5 2 1.5 1 0.5 0 0 2 4 6 8 10 etaR tluafeD 4 3 2 1 0 0 2 4 6 8 10 −5 −4 −3 −2 −1 0 1 2 3 4 5 TL/TS etaR tluafeD −5 −4 −3 −2 −1 0 1 2 3 4 5 IP/(IP+EBITDA)

Output Gap (in Percent) Nominal Interest Rate (APR) 5 18 4 16 3 14 2 12 1 0 10 -1 8 -2 6 -3 4 -4 2 -5 -6 0 90 92 94 96 98 00 02 04 06 08 90 92 94 96 98 00 02 04 06 08 Yearly Inflation Rate (in Percent) Real Exchange Rate (in Percent) 12 15 10 10 8 5 6 0 4 -5 2 -10 0 -2 -15 90 92 94 96 98 00 02 04 06 08 90 92 94 96 98 00 02 04 06 08 Figure A.2: Swedish Macro data 1990Q1-2009Q2 used in the estimated panel logit models

Appendix B Robustness analysis The purpose of this appendix is to demonstrate that the results for the estimated models in Tables 1 and 2 are robust with respect to a number of perturbations. To keep the analysis tractable, we will restrict the analysis to the economy-wide models. B.1 Role of remark data Panel A in Table B.1 reports estimation results for the economy-wide models in Tables 1 and 2 when the PAYREMARK and TAXARREARS dummies have been dropped. For the sake of comparison, we also report the estimation results when the remarks variables are included in Panel A. As can be seen from comparing the results, the estimated coe¢ cients for the accounting variables are similar irrespective of whether the remarks variables are included in the model or not. The coe¢ cients for TL/TA and LA/TL increase when the remark variables are excluded, but the coe¢ cient for EBITDA/TA is reduced. The remaining coe¢ cients are roughly una⁄ected. Not in any case does exclusion of the Remark variables change the sign of the coe¢ cients for the (cid:133)nancial ratios. Thus, our estimation results for the (cid:133)nancial ratios are not crucially a⁄ected by the inclusion of the remark variables. Nor do they diverge from the previous literature. However, it is clear that omitting the remark variables reduces the pseudo-R2 measures, and thus reduces the ability of the model to rank the relative riskiness of (cid:133)rms. Turning to models where we include the macroeconomic variables in the regressions ( c.f., Table 2), we again see that the coe¢ cients are quite similar. An exception is the output gap coe¢ cient, which turns out to be somewhat smaller in the model without remark variables. However, the coe¢ cientsfortheoutputgapandthenominalinterestratearestillofkeyimportance, andthus the overall roles of macroeconomic variables are not a⁄ected by the presence of remark data. Once again, we (cid:133)nd that extending the set of (cid:133)rm-speci(cid:133)c factors with the remark variables allows us to increase the pseudo-R2 measures substantially. Weconcludefromthisanalysisthatinclusionofremarkdataisnotofkeyimportanceforthe estimated impact of macroeconomic factors. Accordingly, our (cid:133)ndings regarding the impact of macroeconomic factors would therefore be expected to hold in other countries, where payment remark data is not available. Nonetheless, the models(cid:146)exceptional risk-ranking performance (as documented in Section 4.2) is clearly partly driven by the possibility to include remark data; without these data the out-of-sample risk-ranking performance would be worse than the in- 40

sample results reported by Shumway (2001) for publicly listed (cid:133)rms. But, given that the (cid:133)rms in our data set (i.e., the entire population of Swedish (cid:133)rms) are very heterogeneous, it is still a remarkable that the risk-ranking performance of the models is quite acceptable even without the remark data. B.2 Imputation of missing (cid:133)nancial ratios Panel B in Table B.1 reports estimation results for the economy-wide model in Tables 1 and 2 for regressions when only (cid:133)rms that have submitted complete (cid:133)nancial statements for all sample periods are selected. In this case, obviously no imputation of missing (cid:133)nancial ratios is carried out and the interaction dummy TTLFS is dropped since it will equal 0 for all (cid:133)rms. In addition, wedropall(cid:133)rmsthathavebeenactiveforatooshorttimeperiodtomeetthe(cid:133)nancial statementrequirement. ComparingtheresultsinPanelBwiththebenchmarkresultsinPanelA, we see that the coe¢ cients are little a⁄ected by our imputation procedure. The absolute values for the coe¢ cients for EBITDA/TA and for LA/TL increase somewhat, while the remaining (cid:133)nancial ratios are more or less the same. The coe¢ cients for the remark variables, and for the DIVIDEND-variable are roughly the same, as are the coe¢ cients for the macroeconomic variables. One interesting conclusion from this robustness analysis is that TTLFS is even more important than the remark data for the (cid:133)t of the model at the (cid:133)rm level (pseudo-R2). In the full benchmark model (III) in Panel A, the pseudo-R2 is 0:34. In the model without remark datathepseudo-R2 fallsto0:24. ExcludingTTLFS(i.e., informationonwhetherthe(cid:133)rmissued a (cid:133)nancial report in due time or not) leads to an even larger fall in pseudo-R2 to about 0:17, suggesting that this indicator variable is the single most important predictor of default. This result has a lot of intuitive appeal, one would think that failure to complile a (cid:133)nancial report shouldbeaveryserioussignalthata(cid:133)rmisinthesortoftroublethatcouldleadtoapermanent exit. B.3 Data frequency, sample period, and the real exchange rate Firstweconsiderthee⁄ectsofexcludingtherealexchangerateintheTable2model. Inaddition, we report results when the real exchange rate is calculated as the percentage deviation around a constant mean (q = (Q Q(cid:22))=Q(cid:22), where Q(cid:22) = 1 T Q for the period 1990Q1 2009Q2) t t (cid:0) T t=1 t (cid:0) instead of being HP-(cid:133)ltered. The results of these exPperiments are reported in Panel C in Table B.1. By and large, our results are not much a⁄ected by the choice of procedure for detrending 41

the real exchange rate. Di⁄erences in estimated coe¢ cients for the (cid:133)nancial ratios occur at the fourth decimal and are really miniscule. They are somewhat larger, though still small, for the set of indicator variables. For the other three macro variables we (cid:133)nd that using the un(cid:133)ltered real exchange rate reduces their coe¢ cients marginally, while the real exchange rate coe¢ cient is slightly increased.We conclude that using the (cid:133)ltered or un(cid:133)ltered real exchange rate is of little consequence for the results in this paper. By a balanced regression argument we prefer the (cid:133)ltered real exchange rate as the benchmark, and we note in Section 2 that the results are robust w.r.t. to the detrending procedure. In Panel D, we report results when we have estimated the model in Table 2 on an annual frequency instead of the quarterly frequency used in the paper. Again, we conclude that, on the whole, it is immaterial for the estimated parameters whichever choice of frequency is made. Noticeable exceptions are the coe¢ cients for the debt ratio, TL/TA, and the output-gap, which both turn out stronger, while the coe¢ cient for the real exchange rate is reduced, and that of in(cid:135)atation (imprecisely estimated) switches sign. Therefore, a quarterly model seems more appealling since it allows for more detailed forecasting and more interesting interpretations. It is nevertheless reassuring that our disaggregation procedure does not seem to introduce unwarranted biases. Finally, to examine the stability of the coe¢ cients for the Economy Wide models of Table 1 and 2, we have re-estimated them using the full sample period, 1990Q1 2009Q2, and present (cid:0) the results in Panel E in Table B.1. Comparing with the coe¢ cients Panel E with those in Tables 1 and 2, we (cid:133)nd only minor di⁄erences. The estimates for both the (cid:133)rm-speci(cid:133)c and aggregate regressors are remarkably stable, consistent with the favorable out-of-sample results in Section 4 in both the cross-section and time series dimension. B.4 Marginal e⁄ects The explanatory variables in this paper have not been re-scaled to have the same mean, and therefore one cannot judge the importance of a particular variable from the size of its coe¢ cient. The discussion about relative importance of explanatory variables in this paper is based on the relative sizes of the estimated t-statistics, since coe¢ cient size is not su¢ cient for such inference. Alternatively,onecancalculatethemarginalcontribution,ore⁄ect,fromavariableatthemean, or the median, of the variable. Table B.2 report on such marginal e⁄ects and the calculations yield similar rankings of importance as the standard t-statistics, e.g., the output-gap and the 42

nominal interest rate are the more in(cid:135)uential macroeconomic variables. 43

Table B.1: Sensitivity analysis for the economy-wide default risk models in Tables 1 and 2 1990Q1-1999Q4 Panel A: Panel B: Panel C: Panel D: Panel E: Robustness w.r.t. Remarks Robustness w.r.t. TTLFSa Robustness w.r.t. real ex. rate Robustness w.r.t. freq Robustness w.r.t. Sample Only include firms who have always Use alternative RER and omit RER Quarterly vs. Annual Data Original sample (as in Tables 1 and 2) submitted all FS data Benchmark Alt. QD Omit QD Quarterly Annual Full Sample (90Q1 - 09Q2) I II III IV I II III IV I II III I II I II Micro Variables EBITDA/TA -0,8371 -0,7594 -0,8221 -0,7477 -0,9929 -0,9082 -0,9747 -0,8919 -0,8221 -0,8222 -0,8259 -0,8221 -0,9411 -0,7139 -0,7547 0,0106 0,0098 0,0107 0,0098 0,0125 0,0117 0,0125 0,0118 0,0107 0,0107 0,0107 0,0107 0,012 0,0080 0,0080 TL/TA 0,3948 0,6982 0,3832 0,6976 0,3968 0,6926 0,3857 0,6977 0,3832 0,3834 0,3752 0,3832 0,497 0,3200 0,2982 0,0071 0,0063 0,0072 0,0064 0,0083 0,0074 0,0084 0,0075 0,0072 0,0072 0,0072 0,0072 0,008 0,0053 0,0054 LA/TL -0,2067 -0,3159 -0,2045 -0,3142 -0,4073 -0,6328 -0,3915 -0,6206 -0,2045 -0,2044 -0,2060 -0,2045 -0,194 -0,1517 -0,1425 0,0061 0,0067 0,0061 0,0067 0,0121 0,0135 0,0121 0,0135 0,0061 0,0061 0,0061 0,0061 0,006 0,0037 0,0037 I/TS 0,0573 0,0682 0,0444 0,0607 0,0622 0,0718 0,0490 0,0633 0,0444 0,0442 0,0427 0,0444 0,047 0,0875 0,0655 0,0052 0,0046 0,0053 0,0046 0,0059 0,0052 0,0060 0,0052 0,0053 0,0053 0,0053 0,0053 0,006 0,0043 0,0045 TL/TS 0,1078 0,0892 0,1009 0,0852 0,0913 0,0740 0,0880 0,0724 0,1009 0,1009 0,1015 0,1009 0,104 0,0823 0,0726 0,0020 0,0018 0,0020 0,0018 0,0024 0,0023 0,0025 0,0024 0,0020 0,0020 0,0020 0,0020 0,002 0,0016 0,0016 IP/(IP+EBITDA) 0,0655 0,0737 0,0558 0,0674 0,0595 0,0638 0,0500 0,0567 0,0558 0,0557 0,0553 0,0558 0,067 0,0651 0,0515 0,0040 0,0039 0,0039 0,0038 0,0046 0,0046 0,0044 0,0045 0,0039 0,0039 0,0039 0,0039 0,005 0,0036 0,0035 PAYREMARK 1,7256 1,8497 1,6866 1,8043 1,8497 1,8527 1,8467 1,8497 1,834 1,8976 1,9523 0,0145 0,0147 0,0165 0,0168 0,0147 0,0147 0,0147 0,0147 0,017 0,0102 0,0102 TAXARREARS 2,5654 2,6839 2,3323 2,4713 2,6839 2,6857 2,6593 2,6839 2,282 2,6088 2,7241 0,0092 0,0094 0,0108 0,0111 0,0094 0,0094 0,0094 0,0094 0,010 0,0070 0,0071 Dividend -3,1728 -3,5374 -3,0066 -3,4573 -3,1324 -3,4098 -2,9531 -3,3082 -3,0066 -3,0046 -3,0436 -3,0066 -3,221 -3,0896 -2,9452 0,0708 0,0706 0,0709 0,0706 0,0736 0,0736 0,0736 0,0736 0,0709 0,0709 0,0708 0,0709 0,068 0,0456 0,0456 TTLFS 3,6937 3,7990 3,6076 3,7532 3,6076 3,6133 3,6164 3,6076 3,082 3,9342 3,7527 0,0084 0,0074 0,0085 0,0075 0,0085 0,0085 0,0085 0,0085 0,010 0,0073 0,0074 Aggr. Variables Output gap -0,1327 -0,0782 -0,1460 -0,1061 -0,1327 -0,1205 -0,1076 -0,1327 -0,210 -0,1119 0,0026 0,0024 0,0030 0,0029 0,0026 0,0025 0,0025 0,0026 0,004 0,0018 Nominal interest rate 0,0731 0,0501 0,0594 0,0351 0,0731 0,0522 0,0971 0,0731 0,065 0,0703 0,0016 0,0015 0,0018 0,0018 0,0016 0,0020 0,0014 0,0016 0,002 0,0012 GDP inflation 0,0116 -0,0173 0,0363 0,0149 0,0116 0,0075 -0,0231 0,0116 -0,061 0,0158 0,0023 0,0022 0,0026 0,0026 0,0023 0,0022 0,0020 0,0023 0,004 0,0020 Real exchange rate -0,0258 -0,0071 -0,0341 -0,0198 -0,0258 -0,0272 -0,0258 -0,014 -0,0214 0,0008 0,0003 0,0009 0,0009 0,0008 0,0008 0,0008 0,001 0,0007 Mean log-likelihood -0,046 -0,053 -0,046 -0,053 -0,039 -0,043 -0,039 -0,043 -0,046 -0,046 -0,046 -0,046 -0,125 -0,034 -0,034 Pseudo R2 0,33 0,23 0,34 0,24 0,16 0,08 0,17 0,08 0,34 0,34 0,34 0,34 0,35 0,36 0,37 Number of obs 8,106,176 8,106,1768,106,1768,106,176 7,949,015 7,949,0157,949,015 7,949,015 8,106,176 8,106,176 8,106,176 8,106,176 2,207,382 16 928 521 16 928 521 Notes: Coefficient estimates in bold style, standard errors in italics. The variables have not been scaled, so the importance of a variable cannot be interpreted directly from the size of the parameter estimate. The pseudo R² values have been calculated according to McFadden (1974). a The number of observations in these estimations are reduced by 157,161 as we are only including firms for which financial statement reports are available. This means that we only include firms for which TTFLS = 0, and omit all defaulting firms that have never submitted financial accounting data. For reasons explained in more detail in Appendix A.2 in the paper, there are a large number of firms (27,492) that have never reported accounting data before they default, and these firms are all assigned TTLFS=0 in our analysis. Also these firms are excluded In the sub-sample considered here, so the number of defaults are only 64,189 in these estimations (as opposed to the 105,605 defaulting firms in Tables 1 and 2).

Table B.2: Marginal effects for the economy-wide default risk model Average of individual marginal effects Marginal effects at the mean Marginal effects at the median I II III IV I II III IV I II III IV Micro Variables EBITDA/TA -0,0083 -0,0084 -0,0081 -0,0083 -0,0030 -0,0037 -0,0020 -0,0025 -0,0037 -0,0063 -0,0023 -0,0040 TL/TA 0,0039 0,0077 0,0038 0,0077 0,0014 0,0034 0,0009 0,0024 0,0017 0,0058 0,0011 0,0038 LA/TL -0,0020 -0,0035 -0,0020 -0,0035 -0,0007 -0,0015 -0,0005 -0,0011 -0,0009 -0,0026 -0,0006 -0,0017 I/TS 0,0006 0,0008 0,0004 0,0007 0,0002 0,0003 0,0001 0,0002 0,0003 0,0006 0,0001 0,0003 TL/TS -2,2790 -2,2790 -2,2790 -2,2790 0,0004 0,0004 0,0003 0,0003 0,0005 0,0007 0,0003 0,0005 IP/(IP+EBITDA) 0,1507 0,1507 0,1507 0,1507 0,0002 0,0004 0,0001 0,0002 0,0003 0,0006 0,0002 0,0004 PAYREMARK 0,0171 0,0181 0,0062 0,0046 0,0076 0,0053 TAXARREARS 0,0254 0,0263 0,0092 0,0067 0,0113 0,0076 Dividend -0,0314 -0,0392 -0,0295 -0,0382 -0,0113 -0,0170 -0,0075 -0,0117 -0,0139 -0,0293 -0,0085 -0,0187 TTLFS 0,0365 0,0421 0,0353 0,0415 0,0132 0,0183 0,0089 0,0128 0,0162 0,0314 0,0102 0,0203 Aggr. Variables Output gap -0,0013 -0,0009 -0,0003 -0,0003 -0,0004 -0,0004 Nominal interest rate 0,0007 0,0006 0,0002 0,0002 0,0002 0,0003 GDP inflation 0,0001 -0,0002 0,0000 -0,0001 0,0000 -0,0001 Real exchange rate -0,0003 -0,0001 -0,0001 0,0000 -0,0001 0,0000 Notes: Marginal effects for the explanatory variables in the economy-wide model. No standard errors are shown: statistical significance of the marginal effects is at the same level as for the parameter estimates in Table 3. Model III corresponds to the economy-wide model in Table 3, while the marginal effects in the columns marked I, II and IV refer to the models estimated for robustness purposes and displayed in Table B1.

Cite this document

APA

Tor Jacobson, Jesper Lindé, & and Kasper Roszbach (2011). Firm Default and Aggregate Fluctuations (IFDP 2011-1029). Board of Governors of the Federal Reserve System, International Finance Discussion Papers. https://whenthefedspeaks.com/doc/ifdp_2011-1029

BibTeX

@techreport{wtfs_ifdp_2011_1029,
  author = {Tor Jacobson and Jesper Lindé and and Kasper Roszbach},
  title = {Firm Default and Aggregate Fluctuations},
  type = {International Finance Discussion Papers},
  number = {2011-1029},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2011},
  url = {https://whenthefedspeaks.com/doc/ifdp_2011-1029},
  abstract = {This paper studies the relationship between macroeconomic fluctuations and corporate defaults while conditioning on industry affiliation and an extensive set of firm-specific factors. By using a panel data set for virtually all incorporated Swedish businesses over 1990-2009, a period which includes a full-scale banking crisis, we find strong evidence for a substantial and stable impact from aggregate fluctuations on business defaults. A standard logit model with financial ratios augmented with macroeconomic factors can account surprisingly well for the outburst in business defaults during the banking crisis, as well as the subsequent fluctuations in default frequencies. Moreover, the effects of macroeconomic variables differ across industries in an economically intuitive way. Out-of-sample evaluations show that our approach is superior to models that exclude macro information and standard well-fitting time-series models. Our analysis shows that firm-specific factors are useful in ranking firms' relative riskiness, but that macroeconomic factors are necessary to understand fluctuations in the absolute risk level.},
}