Transaction Costs and Consumption
Abstract
The Rational Expectations Permanent Income Hypothesis (RE-PIH) fails to explain several features of consumption behavior documented by previous researchers. First, the marginal propensity to consume (MPC) out of unanticipated income shocks tends to decrease as the size of the shocks becomes larger. Second, the MPC out of small income shocks is well above what the RE-PIH predicts. Third, consumption responds to small anticipated income changes, but not to large ones. This paper argues that these findings can be reconciled within a RE-PIH framework that includes a cash-in-advance constraint. In the model, the representative agent is assumed to be fully rational with perfect information and is able to borrow against future income. The agent can hold cash and interest-bearing assets, but has to pay a fixed transaction cost to transfer wealth between cash and assets. I show that the agent follows an s-S rule with respect to cash holdings in making wealth-transfer decisions. The MPC within the no-transfer band is higher than that out of the band. It can be lower than or exactly equal to 1. The model also predicts that agents smooth consumption in response to news of large future income changes but not to small ones. Furthermore the model sheds light on the demand for liquid assets and the equity premium puzzle.
Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. Transaction Costs and Consumption Geng Li 2007-38 NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.
Transaction Costs and Consumption Geng Li ∗ Federal Reserve Board May 2007 Abstract The Rational Expectations Permanent Income Hypothesis (RE–PIH) fails to explain several features of consumption behavior documented by previous researchers. First, the marginal propensity to consume (MPC) out of unanticipated income shocks tends to decrease as the size of the shocks becomes larger. Second, the MPC out of small income shocks is well above what the RE–PIH predicts. Third, consumption responds to small anticipated income changes, but not to large ones. This paper argues that these findings can be reconciled within a RE-PIH framework that includes a cash-in-advance constraint. In the model, the representative agent is assumed to be fully rational with perfectinformationandisabletoborrowagainstfutureincome. Theagentcanholdcash and interest-bearing assets, but has to pay a fixed transaction cost to transfer wealth between cash and assets. I show that the agent follows an s-S rule with respect to cash holdings in making wealth-transfer decisions. The MPC within the no-transfer band is higher than that out of the band. It can be lower than or exactly equal to 1. The model also predicts that agents smooth consumption in response to news of large future income changes but not to small ones. Furthermore the model sheds light on the demand for liquid assets and the equity premium puzzle. ∗Iamdeeplyindebtedtomydissertationcommitteemembers–RobertBarsky, MatthewShapiro, Dmitriy Stolyarov and Tyler Shumway – for their superb guidance and encouragement. I thank Susanto Basu, James Feigenbaum,MilesKimball,manyofmycolleaguesattheFederalReserveBoard,andtheparticipantsofseminars and conferences where this paper was presented at various stages for helpful discussions and comments. In particular, I thank Karen Dynan for the meticulously careful comments she gives to the paper. The views presented in the paper are those of the author and are not necessarily those of the Federal Reserve Board or its staff.
1 Introduction The Rational Expectations-Permanent Income Hypothesis (RE–PIH) makes two important predictions about how consumption should respond to income changes. First, under some auxiliary assumptions, the magnitude of consumption changes in response to unanticipated and transitory income shocks should be equal to the annuity value of the shock. Second, in response to anticipated income changes, all of the adjustments in consumption should have been carried out upon the arrival of the news about anticipated future income changes, and no further adjustments are needed. Consumption should be a random walk, and its growth cannot be predicted. Testing the validity of the RE-PIH has been an important part of economic research since the advent of the theory. After several decades of research, the literature has yielded mixed findings. Bodkin (1959) finds that the marginal propensity to consumer (MPC) out of an unanticipated transitory (“windfall”) income shock can be as high as 0.9, which is well above any reasonable annuity value. In contrast, Kreinin (1961) finds that the MPC out of a windfall incomeshockisaround 0.15, whichis closeto whatthe RE-PIHpredicts. Bodkin’sexperiment involved small income shocks, whereas Kreinin’s involved much larger ones. Taken together the findings suggest that the MPC decreases as the size of the income shock becomes larger, a relationship that cannot be explained by the RE-PIH. Hall (1978) and Flavin (1981) challenged the random walk prediction of the RE–PIH, finding that aggregate consumption growth can be predicted by stock market returns and current income information. However, detailed household-level studies suggest that consumption responds only to small anticipated income changes; little excess sensitivity has been detected when the income shocks are large. Various explanations have been offered for a large MPC out of windfall income and for an excess sensitivity in response to anticipated income changes. However, explanations for the differentresponsetolargeandsmallincomeshocksarelacking. Thepresentpaperintroducesa model predicting (1) small windfall income shocks induce a large MPC, (2) the MPC decreases as the shock becomes larger, and (3) consumption exhibits excess sensitivity in response to 1
anticipated income changes but only when the changes are small. The logic of the model is as follows: In a cash-in-advance economy, a consumer can hold cashandilliquidassets(hereafterassets). Thenominalinterestrateiszerooncashandpositive on assets. Only cash can be used to purchase consumption goods. The consumer has to pay a transaction cost to transfer wealth between cash and assets. Assume that the transaction cost is a fixed fee that does not depend on the amount of the transaction. The consumer will choose to pay the transaction cost only if income shocks have made the consumer’s cash balance too low or too high relative to the individual’s unconstrained optimal consumption level. A cash balance that is too low will severely constrain consumption and will cause the consumer to withdraw some money from assets to finance consumption. Likewise, a balance of (non-interest-bearing) cash that is too high will cause the consumer to transfer most of the extra cash into the asset account. Conditional on expected future labor income and current asset holdings, there is an s-S band of cash holdings within which the consumer will choose not to transfer wealth. Optimal consumption – as a function of cash balance holdings – will not be differentiable globally. Nor is it differentiable within the no-transaction band. The consumption function is not even continuous. In particular, there are two segments of the optimal consumption function within the band. To the left of the band, the consumer has a cash holding level that is lower than the unconstrained optimal consumption level. Hence, within this region of cash holding, the MPC out of one additional dollar will be exactly equal to one. When the cash balance has increased to such a level that the consumer is no longer cash constrained, the MPC within the band is still higher than that out of the band. The reason is that if the consumer does not spend all of the cash in hand, the consumer will not receive any return on the cash saved, and may even suffer inflation loss. Therefore, by holding cash in hand, current consumption will be higher, and the profile of consumption growth will be flatter. Overall, the average MPC within the band is higher than that out of the band. As I will show later in the paper, the probability of being within the band is quite large. Only a large income shock will push the consumer out of the band. In other words, the model predicts that a small windfall income shock may be associated with a very high MPC, and that the MPC decreases as the income 2
shocks increase. If the consumer knows that income will be higher next period and the shock is not too large, a cash-constrained consumer will not pay the transaction cost to withdraw money from the asset account to increase current consumption. However, when the future income change is sufficiently large, it is always optimal for the consumer to pay the transaction cost and change current consumption. Therefore, an econometrician who uses cross-sectional data to test excess sensitivity will find that the Euler equation does not hold when the anticipated income changes are small, but that the Euler equation does hold when the income changes are large. It is important to distinguish the cash constraint the consumer faces here from liquidity constraints as typically characterized in the literature. The cash constraint is endogenous in the model of this paper and the consumer optimally chooses to be constrained under some states and not under others. The liquidity constraint is exogenous, and the consumer cannot choose whether to be constrained. In my model the consumer faces a single lifetime budget constraint and is allowed to have a negative level of asset holding, whereas a negative asset balance is not permitted for a liquidity constrained consumer. Finally, being cash constrained for the current period has little predictive power for whether the consumer will be constrained next period, whereas the liquidity constraint is typically persistent. The model is rich: It explains not only the stylized facts mentioned above but also yields some other findings that challenge the predictions of conventional models. For example, it sheds some light on the demand for liquid assets – in my model, cash. When the consumer transfers wealth between cash and illiquid assets, the consumer still wants to hold some cash into the next period even though cash does not accrue any interest. By choosing the optimum amount of cash to hold over periods, the consumer can minimize the probability of having to pay the transaction cost again in the next period. Moreover, holding some extra cash will also reduce the probability of being cash constrained when the consumer chooses not to transfer. I will call this type of demand for cash the precautionary demand for cash. If the model is extended to allow for stochastic asset returns, the consumer will have two types of income shocks: one from labor income and the other one from the asset market. I 3
show that this feature weakens the correlation between contemporaneous consumption growth and asset returns and thus helps explain the equity return premium puzzle; it also produces a correlation between consumption growth and lagged asset returns that is consistent with the findings by Hall (1978). Infrequent adjustment of consumption with respect to asset return shocksisalsopredictedbythemodelsofGabaixandLaibson(2001)andReis(2006). However, themostimportantdistinctionbetweenthemodelinthispaperandtheirsisthatconsumption adjustment is state dependent in the current model, whereas it is time dependent in Gabaix and Laibson (2001) and Reis (2006). The paper will proceed as follows: Section 2 reviews the studies of the MPC out of windfall incomeshocksandexcesssensitivities. Ialsodiscusswhythemodelsofliquidityconstraintsare unable to provide complete explanations for the findings. Section 3 discusses the transaction cost in more detail and sets up the model with only labor income risk. Section 4 provides numerical solutions for the model with the benchmark parameters. In Section 5, I use the simulation results to show that the current model can replicate the stylized facts mentioned above. I also show simulation results describing the precautionary demand for cash, and I explore the sensitivity of the model by varying the parameters. Section 6 extends the model by allowing for risky asset return and discusses whether and to what extent it helps explain the equity premium puzzle. Section 7 concludes and sets up a future research agenda. 2 Evidence on the MPC and Excess Sensitivities Iwillfirstreviewtheevidencethat(1)whentheincomeshockissmall, theMPCismuchlarger than what the RE-PIH predicts, and (2) the MPC tends to increase as the size of the windfall income shock decreases. Next I review the evidence of excess sensitivity in consumption in response to anticipated income changes. The literature suggests that such excess sensitivity can be detected only when the anticipated income changes are small. Finally, I briefly review the literature on liquidity constraints and argue that liquidity constraints cannot completely reconcile all of these contradictions. The literature reviewed in this section is summarized in table 1. 4
2.1 The MPC out of Windfall Income I begin with replicating the results of the baseline RE-PIH model. In this model, the representative consumer maximizes lifetime utility (cid:88)∞ maxE βtU(C ), (1) 0 t Ct t=0 subject to the intertemporal budget constraint A = (1+r)×(A +Y −C ), (2) t t−1 t t where E is the expectation condition on all the information available at time 0, U(C ) is the 0 t intratemporal utility function, r is the real interest rate and β is the subjective discount rate. 1 I assume that β = . The first-order condition (FOC) can be summarized as the Euler 1+r equation, U(cid:48)(C ) = β(1+r)E [U(cid:48)(C )] = E [U(cid:48)(C )], (3) t t t+1 t t+1 with a transversality condition, A t lim ≥ 0. (4) t→∞ (1+r)t If we further assume that the utility function is quadratic, then the solution for the consumption level under a stochastic future income process is equivalent to the solution to the above problem without uncertainty. The optimal consumption level depends only on the discounted expected income, a condition known as the certainty equivalence solution, expressed as (cid:34) (cid:35) (cid:88)∞ r Y CCEQ = E t +A . (5) 1+r 0 (1+r)t 0 t=0 Consumption in each period is at the same level and is equal to the annuity value of the lifetime sum of human and nonhuman wealth. If Y unexpectedly increases by $1, CCEQ will 0 r increaseby dollar. ThevalueisexactlytheMPCoutofwindfallincomethatispredicted 1+r 5
by the RE-PIH and is equal to the annuity value of the shock. For a reasonable level of r, r should not be greater than 10 percent. 1+r Bodkin (1959) examines some U.S. veterans who, in the 1950s, received unanticipated payments from the National Service Life Insurance Fund. Bodkin estimates that the MPC from these payments ranged from 0.7 to 0.9.1 This result is viewed as a challenge to the PIH as no realistic value of r can generate an annuity as high as that. In contrast, Kreinin (1961) finds that for the Israelis who received reparations payments from Germany in 1957, also an unanticipated income shock, the MPC was around 0.15; that number is easier to reconcile with the RE-PIH predictions. Finally, Hall and Mishkin (1982) estimate that the MPC out of transitory income relative to the MPC out of lifetime income is 0.29. For a sufficiently large MPC out of lifetime income, Hall and Mishkin’s estimate of the MPC out of transitory income is also very large and hard to be reconciled with the PIH predictions. One of the key differences between the Bodkin and Kreinin studies is that the former involved payments that were less than 10 precent of the consumer’s annual income, whereas thelatterinvolvedamuchlargerpaymentthatwasequaltoaboutoneyear’sincome. Although Kreinin (1961) does not find direct evidence that the size of the shock was the reason for such a big discrepancy in estimated MPC, a later study by Landsberger (1966) lends strong support to this possibility. Landsberger divides the sample of the Israelis who received the windfall income into quintiles according to the ratio of the windfall income to their regular income. He finds that the MPC decreases consistently from the group with the lowest ratio to the group with the highest ratio. He also reports that for the group with the lowest ratio, the estimated MPC is extremely high. Bodkin (1966) did the similar estimation with the veteran dataset and cannot reject Landsberger’s findings.2 1TheestimationprocedurethatBodkinandlaterstudiesadoptedmightleadtoanupward-biasedestimation of MPC. These studies typically estimate an equation of the form C = α + βY + γN + (cid:178) , where C is i i i i consumption,Y isregular incomeotherthanthewindfallincome,N isthewindfallincome,andγ istheMPC out of the windfall income. If the consumer has received windfall income other than N, in the specification above,econometricianswilltreatitaspartoftheregularincome. ThusMPCoutofregularincomeisdownward biased and MPC out of windfall income is upward biased. 2See Mayer (1972) for a summary of the studies. 6
2.2 Excess Sensitivities to Anticipated Income Changes Consider the Euler equation (3). A key implication is that current consumption growth should not be correlated with any information that is available before the current period. This condition, which holds for all utility functions, is Hall’s celebrated random walk hypothesis. It is also known as the orthogonality condition. In particular, consumption growth should not be correlated with expected income growth. Many authors, however, have found that it does not hold unconditionally in the data. Hall (1978) and Flavin (1981) challenge the orthogonality condition with aggregate data. Similarly, Wilcox (1989), using aggregate data, shows that the announced increases of social security benefits have significant effects on consumption at the monthly frequency because the benefits increases have been announced at least six weeks prior to payments. As better survey data have become available, there has been an increasing volume of studies that test the orthogonality condition in natural experiments that use household–level data. Souleles (1999) showed that household consumption is excessively sensitive to tax refunds, an anticipated income shock. He found significantly higher consumption for households who received tax refund in the refund quarter, compared with the consumption of those who received their refunds in the previous quarter. Souleles (2002) studies how households’ consumption responded to the second phase of the so-called “Reagan tax cuts”. These tax cuts were implemented in August 1981 but were announced long before this time. Again, consumption was found to be excessively sensitive to the tax cuts at the time they went into effect. Parker (1999) shows that consumption is excessively sensitive to another source of anticipated income changes – regular changes in Social Security tax withholding. After an individual’s Social Security tax withholding amount reaches a specified limit in a given calendar year, take-home pay increases. Parker finds that consumption responds to this income change significantly, even though the income change is expected. One common characteristic of the anticipated income changes just discussed is that they are relatively small. In Souleles (1999) the sample mean of tax refund is $874 and the median is $561. In Souleles (2002), the mean change in tax withholding is about $117 per quarter, and the median is $95. In Parker (1999), the Social Security tax rate had been between 6.13 7
percent and 7.5 percent over various years in his sample. The findings about excessive sensitivity are different when income changes are large. Hsieh (2003) uses the Consumer Expenditure Survey to show that the consumption of the Alaskan residents is not responsive to the receipt of oil fund royalties. These payments are anticipated and large: the average is well above $2,000 for each household. Similarly, Souleles (2000) finds that the consumption in autumn of parents who send their children to college are not affected by the tuition payments, which are also anticipated and large. 2.3 Can Liquidity Constraints Explain Everything? One often-proposed explanation for the findings reviewed above is that not all consumers can borrow freely, that is they face liquidity constraints. If the consumer cannot borrow, or at least cannot borrow as much as she wants, the Euler equation may not hold at all times. Typically, when the consumer is liquidity constrained, consumption will be lower than the unconstrained optimal level. Therefore, windfall income will increase consumption by more than what the PIH predicts, and news about higher future income will not lead to a higher current consumption level. Empiricaltestsoftheliquidityconstrainthypothesishavelentmixedsupporttothetheory. Souleles (1999) finds evidence supporting the liquidity constraint model, but Souleles (2002) and Parker (1999) find no evidence that liquidity constraints can help explain the excess sensitivity. Using the Panel Study of Income Dynamics (PSID) data, Zeldes (1989a) finds that, for some criteria for identifying liquidity constraints, the Euler equation does not hold for the constrained consumers, but for other criteria, the estimates are not significant. Runkle (1990) uses the same data source and finds no evidence for liquidity constraints. Another study that does not support the liquidity constraint model is Shapiro and Slemrod (2003). They survey households’ plans to spend their 2001 tax rebate. They find that for households with income between zero and $20,000, 17.6 percent of them said that they would spend the tax rebate, whereas for households with income above $75,000, 24 percent said that they would spend the tax rebate. Higher income households’ greater propensity to spend the same windfall amount that the lower-income households received is not explained by RE-PIH, 8
nor is it explained by the liquidity constraints model. The evidence suggests some limits to the ability of liquidity constraints in explaining such findings. First, to explain the observed excess sensitivity to predictable income changes, note that such income changes must be predictable not only by the consumers but also by the econometrician. It is then not unrealistic to assume that the creditor is also able to predict the income changes of the consumer who is applying for a loan. If, indeed, the creditor can predict that consumer’s income of next period will increase, it is likely that the consumer will get the loan and not be constrained. Finally, in the case of a windfall income shock, liquidity constraints do not imply very high MPCs, unless the consumer is constrained only for the current period. Zeldes (1989a) notes that the Euler equation might still hold for the liquidity constrained consumer, as she should want to be equally distant from the optimal consumption level in all of the constrained periods and thus will still try to smooth consumption. As a result, as long as the consumer realizes thattheconstraintwilllastsomesufficientlylongtime, theMPCoutofwindfallincomeshould not be too large. 2.4 Can Behavioral Models Explain the Puzzles? There are certainly many good behavioral models that shed light on the consumption puzzles. Forinstance, toproduceanincreaseinMPCwhenthesizeofwindfallincomeshocksdecreases, the so-called mental accounting models do a good job (for example, Thaler, 1992). Costly self-control also provides a good explanation. If self-control is costly, it is worthwhile to exert self-control only when the temptation to consume (windfall income) is large (see Benhabib and Bisin 2002). However, I do not pursue these behavioral approaches in this paper. 3 A Model of Optimization with Transaction Costs 3.1 The Environment I assume that the consumer lives in a cash-in-advance economy. The consumer can hold either cash or assets. The nominal interest rate for cash is zero. For the time being, I assume that 9
the asset bears a risk free real interest rate, r. The nominal interest rate, i, is equal to r+π, where π is the inflation rate. Interest is assumed to be accrued from the end of one period to the beginning of the next. In the Section 6, I will allow for stochastic rates of return. All labor income comes in the form of cash. After receiving labor income, the consumer simultaneously decides whether to transfer wealth between cash and assets and how much to consume. The consumer wants to make a transfer in either of two cases. First, because the cash-in-advance assumption requires that consumption in any period not be greater than cash holding in that period, if the desired level of consumption is greater than the cash holding, the consumer will withdrawmoneyfromtheassetaccount. Second, ifsubstantialcashisleftinhandafterpaying for consumption, the consumer will save the extra cash in asset account. The key assumption of the model is that the consumer has to pay a transaction fee for any transfer between cash and assets. When transferring wealth after receipt of labor income, the consumer also has to decide how much cash to hold after the transfer.3 Figure 1 illustrates the timing of the model. 3.2 Discussion of the Transaction Cost The categories of cash and assets broadly correspond to accounts in the real world. What I refertoascashinmymodelincludescash, checkingaccountbalances, savingaccountbalances, other account balances that can be used to purchase goods directly, and other assets that can be converted to cash with very low cost. The asset account in my model includes stocks, mutual funds, pensions, and other assets that are rather costly to convert to cash. Some of the instruments that I label as cash do bear some interest, but the rate of return is not as high as those I label as assets. If there is no transaction cost, theory predicts that the portfolio should be adjusted continuously, and the first-best portfolio is always achieved. In real life, however, each investor trades only sporadically. Odean (1999) reports that the users of some discounted brokerage accounts trade, on average, 1.44 times per year. He also reports that this trading frequency, after taking into account of the brokerage fee the consumer has to pay for each trade, leads to suboptimal returns. Therefore, the optimal trading frequency must be even lower than 3Aswill be discussed later, thecash balanceafter a transferis typicallygreaterthan the optimalconsumption level. 10
1.44 times per year. Agnew, Balduzzi and Sunden (2003) find that some 401(K) participants reshuffle their portfolio at an even lower average frequency – 0.26 times per year. Barber and Odean(2000) calculate the theoretically optimal trading frequency for reasonable levels of transaction costs, and they find that the number of trades per year should be within the range of 0.17 to 0.5 when the transaction cost varies from 0.01 percent to 0.1 percent of the portfolio value. What are these transaction costs? The most observable and direct cost is the brokerage fee. Typically, the individual investor pays the brokerage firm $15 to $30 per transaction. The fee is by no means the only component of the transaction cost, as evidenced by people trading very infrequently in their 401(K) accounts, which charge no explicit fee. Implicit costs may include the time an investor needs to spend on market research and the effort of placing the order. The imperfect liquidity of some financial vehicles can also be associated with a transaction cost. For instance, long-term saving accounts have higher interest rates. They do, however, carry a large financial penalty for withdrawal. Another example is a house, which can be viewed as an asset with extremely high transaction costs to convert to cash. 3.3 Setup of the Model The following notations are used in the model: C is the consumption in period t t θ is a binary variable that indicates whether the consumer pays the transaction cost t θ = 1 if the consumer does and θ = 0 if the consumer does not t t M is the real cash holding before receiving the labor income t A is the real balance of the asset account t ∆ is the real value of labor income t ρ is the coefficient of relative risk aversion (CRRA) i is the nominal interest rate π is the inflation rate r = i−π is the real interest rate ψ is the transaction cost 11
The consumer has to solve the optimization problem as follows, (cid:88)∞ max E βtU(C ), (6) 0 t {θt,Ct,Mt+1} 0 subject to (cid:40) (1+i−π)A if θ = 0, t t A = (7) t+1 (1+i−π)(A +M +∆ −C − Mt+1 −ψ) if θ = 1. t t t t 1−π t (cid:40) (1−π)(M +∆ −C ) if θ = 0, t t t t M = (8) t+1 M to be determined by the consumer if θ = 1. t+1 t C ≤ M +∆ if θ = 0, (9) t t t t A t lim ≥ 0. (10) t→∞ (1+r)t Because the three control variables are jointly determined at time t, I write the maximization problem with respect to the vector of [C ,θ ,M ]. I assume a CRRA type of utility t t t+1 C1−ρ −1 function, U(C) = . For brevity and computational simplicity, this optimization 1−ρ problem is represented as the following Bellman equations, V(A ,M ,∆ ) = max[VNT(A ,M ,∆ ),VTR(A ,M ,∆ )], (11) t t t t t t t t t θt where VNT(A ,M ,∆ ) = maxU(C )+βE V(A ,M ,∆ ), (12) t t t Ct t ∆t+1 t+1 t+1 t+1 and VTR(A ,M ,∆ ) = max U(C )+βE V(A ,M ,∆ ). (13) t t t Ct,Mt+1 t ∆t+1 t+1 t+1 t+1 In this system, we have three value functions:V, VNT, and VTR. V is the solution to the whole optimization problem, VNT is the solution to the problem when the consumer is not 12
allowed to transfer wealth between cash and assets in the current period, and VTR is the solution to the problem when the consumer is forced to pay the transaction cost and make a transfer in the current period ∀A,M and ∆ regardless of optimality.4 In the regime in which the consumer is not allowed to transfer wealth, the control variable is simply C , whereas in t the regime of a forced transfer the consumer has to simultaneously choose C and M . The t t+1 consumption policy functions for equations (12) and (13) are denoted as CNT(A ,M ,∆ ) and t t t CTR(A ,M ,∆ ), respectively. The value function, V, is the upper contour of VTR and VNT. t t t The consumer decides θ as t (cid:40) 1 if VTR(A ,M ,∆ ) > VNT(A ,M ,∆ ), t t t t t t θ = (14) t 0 otherwise, which is obvious. The motion of state variables A and M follows equations (7) and (8). Equation (7) shows t t that when θ = 0, asset A is intact in period t, and real interest, i−π, accrues between period t t t and t+1. A is simply equal to (1+i−π)A . Equation (8) shows that after making the t+1 t consumption expenditure, the consumer will carry a real cash balance (1−π)(M +∆ −C ) t t t into the period t+1. When θ = 1, consumer will either withdraw cash or save cash. If the t consumer chooses to consume the amount C and to carry a real cash balance, M , into the t t+1 next period, A + M + ∆ − C − Mt+1 − ψ will be left in the asset account. In this case, t t t t 1−π M becomes a control variable that has to be pinned down by the consumer. Equation (9) is t+1 the cash-in-advance constraint. If θ = 0, consumption in period t will be constrained by the t amount of cash the consumer holds after receiving labor income, which is equal to M +∆ .5 t t 3.4 Characterizations of the Solution It is well known that for the models of dynamic consumption optimization with stochastic income, the analytical solutions are not attainable except for very few situations, such as 4I suppress the arguments of the functions when it does not cause confusion. 5The setup of the model is similar to that of the consumer sector in Alvarez, Atkeson and Kehoe (2002). Their study focuses on certain money market puzzles, such as why persistent injections of money into the economy decrease nominal interest rates and steepen the yield curve. My model focuses on consumption and its implications for the demand for liquid assets and risky assets returns. Also, the model in Alvarez, Atkeson and Kehoe (2002) is a general equilibrium model, whereas the model in this paper is a partial equilibrium model. 13
when the utility function is quadratic. Because of the nonlinearity in my model, even the assumption of quadratic utility function does not lead to a tidy closed form solution. In the next section I will solve the model using a numerical technique. In this section, I will characterize the solution. Proposition 1 The solutions to the Bellman equations (11), (12), and (13) exist. Proof: Because β < 1, the existence of the solution is established by Theorem 9.6 in Lucas and Stokey (1989). Assume that the value functions are differentiable, so the first-order condition (FOC) for equation (12) is ∂ U(cid:48)(C )−β(1−π) E [V(A , M , ∆ )] ≥ 0. (15) t ∂M ∆t+1 t+1 t+1 t+1 t+1 This FOC suggests that, if prevented from transferring wealth, the consumer should set the marginal utility from current consumption derived from cash equal to the discounted future marginal value derived from cash holdings. Without the ability to transfer wealth, the consumer might be cash constrained. In this case, the inequality holds. For equation (13), the FOC of C is t ∂ U(cid:48)(C )−β(1+i−π) E [V(A ,M ,∆ )] = 0. (16) t ∂A ∆t+1 t+1 t+1 t+1 t+1 And the FOC of the real cash balance at the beginning of next period, M , is t+1 ∂ ∂ (1+i−π) E [V(A ,M ,∆ ] = (1−π) E [V(A ,M ,∆ )]. (17) ∂A ∆t+1 t+1 t+1 t+1 ∂M ∆t+1 t+1 t+1 t+1 t+1 t+1 FOC equation (16) says that when paying a transaction cost, the consumer should, at the margin, be indifferent between consuming another dollar or saving it distributed optimally between assets, A , and cash, M . FOC equation (17) specifies the optimal way of transt+1 t+1 ferring wealth between cash and assets. The consumer must be indifferent between putting the last unspent dollar in the asset account and the cash account. 14
In comparing the Euler equations (15) and (16), we find that the marginal value of saved cash in period t + 1 is multiplied by (1 − π) in the regime of no wealth transfer, whereas in the regime of a forced wealth transfer, the marginal value of saved asset is multiplied by (1+i−π). Therefore,thefirstcaseleadstoastrongerincentivetoincreasecurrentconsumption and reduce future consumption growth. Now I characterize the value functions. We are interested in the characteristics of both the value functions V(A ,M ,∆ ) and E [V(A ,M ,∆ )]. Labor income arrives in the form t t t ∆t t t t of cash only. After receiving that labor income, the consumer no longer has to distinguish between ∆ and M . Therefore, we can rewrite the value functions as V(A ,B ), VNT(A ,B ) t t t t t t and VTR(A ,B ), where B = M +∆ denotes cash in hand. Moreover, if paying a transaction t t t t t cost, the consumer is indifferent to placing income in cash or assets, and we can define W = t A +M +∆ −ψ asthewealthinhandafterthetransaction. Wehavethefollowingproposition: t t t Proposition 2 The consumer makes decisions on transferring wealth between cash and assets following an so-called s-S rule. For any given level of A , such that A > ψ, ∃ B and B such t t that, ∀B > B, the consumer pays the transaction cost and saves cash into the asset account. t ∀B < B the consumer pays the transaction cost and withdraws money from the asset account. t ∀ B such that B ≤ B ≤ B, the consumer does not pay the transaction cost and consumes t t only out of cash. Sketch of Proof: Figure 2 illustrates the typical value functions V, VNT, and VTR. We can show that VNTandVTR cross twice if A > ψ. First consider a consumer with wealth W who is exemptedfrompayingthetransactioncost. Calltheconsumer’soptimalconsumptionlevelC∗. C∗ is the unconstrained optimal consumption; hence it is the first best choice. Then consider another consumer who has to pay the transaction cost to transfer wealth. Let the cash balance B = C∗ and the asset balance A = W −C∗. The consumer should be happy to consume C∗ because it is the first best optimal consumption level should the consumer be exempted from paying the transaction cost and because the consumer’s cash balance can exactly afford that consumption level. Of course, in this case VNT > VTR. Holding A = W −C∗ constant, we can always find a very high level of cash holding, B , and a very low level of cash holding, B , 1 2 15
that make VNT(W −C∗,B ) < VTR(W −C∗,B ) and VNT(W −C∗,B ) < VTR(W −C∗,B ). 1 1 2 2 Because both VNT and VTR are increasing functions, we must have VNT and VTR cross twice, as shown in figure 2. The intersection points B and B are the thresholds of the s-S band.6 We can write B = φ (A) and B = φ (A). For the A < ψ case, there still exists the saving 1 2 t threshold B, but there is no withdrawal threshold B because the consumer has no assets left after paying the transaction cost. For such cases, the s-S rule degenerates to a single-direction rule. Also, see the appendix for a discussion of the differentiability and concavity of the value function V. Now I characterize the policy function C(A ,B ). I focus on the property of the policy t t function for a given A and varying B . t t Proposition 3 The optimal consumption function is (cid:40) CTR(A ,B ) if B ≤ φ (A ) or B ≥ φ (A ), t t t 1 t t 2 t C(A ,B ) = (18) t t CNT(A ,B ) if φ (A ) < B < φ (A ), t t 1 t t 2 t where CTR(W ) and CNT(A ,B ) are defined corresponding to VTR and VNT. Further more, t t t For each given A > ψ, ∃ φ (A) < B∗(A ) < φ (A ) such that C(A ,B ) = B , ∀B ≤ B∗(A ). t 1 t 2 t t t t t t Proposition 3 holds by definition and optimality. It highlights the existence of the region within which the MPC = 1. Finally, I characterize the policy function for M . Recall that with transaction costs, the t+1 consumer has to choose the optimum amount of cash to carry into the next period to minimize the probability of having to pay the transaction cost again. Suppose that after paying for the transaction cost and the cost of consumption in period t, the consumer has W left. The t consumer has to decide how much of W to hold in assets and how much in cash. Suppose t that the consumer has a very small W relative to the expected labor income of next period. t Then it is very likely that the consumer has to pay the transaction cost next period no matter how much cash is being carried from period t into t+1. The reason for that is the consumer’s 6Strictly speaking, I have shown only that VNT and VTR will cross at least twice. To have the s-S band, we need them to cross exactly twice. I would assume the value functions have the property that they cross at most twice. 16
level of savings is far below the target level of savings, which is discussed in the literature on precautionary saving (for example, Carroll, 1992 and 1997). The consumer has to save more in subsequent periods to restore the target level of savings. In such a situation the consumer should put all of W into the asset account. Similarly, W is very large and it is higher than t t the target level of savings, the consumer should run down savings and increase consumption in the following period. Hence it is highly probable that consumption will exceed labor income in the next period. To avoid paying the transaction cost again next period, the consumer has to carry an amount of cash that is close to the difference between consumption and income of next period. However, if this difference is too large, holding such amount of cash will not be optimal because the interest loss is too large (even larger than ψ). In this case, the consumer should hold all of W in assets and make a transfer next period. Therefore, M = 0 if W is t t+1 t very small or very large. Otherwise, M increases with W .7 t+1 t 4 Numerical Solution to the Model Since I want to isolate the effects of the transaction cost from the effects of the liquidity constraint, in the baseline model, I assume a stochastic income process that leaves the consumer withnofutureincomeagainstwhichtoborrow. Iassumethatthedistributionoflaborincome, ∆ , has a positive probability of equalling to zero, ∀ t. As the income in all future periods t can be equal to zero (even with a very small probability), the discounted sum of minimum future income is zero. The model will be calibrated quarterly because most micro level empirical studies use quarterly data. For simplicity, I assume that labor income is i.i.d. over time. Table 2 summarizes the baseline values of the parameters. I assume that r < 1−β to prevent the consumer from accumulating assets without bound. This restriction is what Deaton (1992) and Carroll (1997) term the impatience condition in a simplified environment. Setting r at 0.015 gives us a 6 percent annual rate, and setting β at 0.975 gives us a discount rate of 10 percent annually. The probability of getting zero income, 7Depending on the distribution of the income process, there may be a range of W such that M > 0 t t+1 even if W is small. This outcome would obtain if the income has a positive probability of being very low (for t instance,zero). However,wecanexcludesuchpossibilitiesbyassumingsufficientlylarge∆,suchthat∆ ≥∆, t ∀t. 17
p, is set at 1 percent. Taking ψ = 0.01 follows the calibration in Aiyagari and Gertler (1991). I assume no inflation in the baseline model. I compute the solution using the extensively employed grid searching method.8 I iterate the Bellman equations (11, 12, and 13) until the average gap between the policy functions, C(A,B), oftwoconsecutiveiterationsbecomessmallenough(<0.0001). Wethencancompute the consumption and wealth transfer policy function. Figure 3 is a typical consumption function, projected onto the hyperplane A = 0.5, which is equal to half of the mean of quarterly income. The most striking feature of the consumption function is that it is neither continuous nor monotonic. For comparison, I plot a consumption function of the model with no transaction cost in the same figure. We can see that consumption monotonically increases from slightly higher than 0.4 when B = 0 to 0.895 when B = 0.815. Over this region, consumption is consistently above cash balances B, suggesting that the consumer has paid the transaction cost and has withdrawn money from the asset account. Then there is a sharp discontinuous decrease of consumption to 0.82. The discontinuous decrease indicates that the consumer has entered the no-wealth-transferring region. Within the left part of the region, when B increases from 0.82 to 0.94, consumption increases at a one-for-one rate with B. This region is where the consumer is cash constrained and has MPC=1. Although staying cash constrained violates the Euler equation, within this region the utility loss from imperfect intertemporal substitution is smaller than the utility loss from the income reduction due to payments of the transaction cost. When B increases above 0.94, the consumer is no longer cash constrained. However, the cash left after paying for consumption is insufficient for the consumer to be willing to pay the transaction cost and transfer the cash left into the asset account. Notice that the slope of the consumption function with transaction cost is steeper than the one with no transaction cost, for the reasons discussed before. There is another discrete decrease when B increases to above 1.25. At this point, consumption is 1.005. Cash t left after paying for consumption is 0.245. The interest the consumer can get if the extra cash is saved is 0.0037, which is smaller than the transaction cost paid. Why does the consumer still want to pay the transaction cost? Recall the FOC equations (15) and (16). The intertemporal 8For example, see, Zeldes 1989b. 18
preference when the consumer does not pay the transaction cost is different from the one when the consumer does pay it. The consumer still wants to make the transfer – even when the gain from interest does not compensate for the transaction cost – because the consumer can now set consumption according to a new intertemporal preference that will lead to higher lifetime utility. In other words, in this situation the intertemporal substitution effect dominates the income effect at the current cash balances. Figure 4 illustrates the optimal level of M as a function of W when the consumer t+1 t chooses to pay the transaction cost. If we ignore the very low levels of W , we find that for t low W , M is zero. M starts to increase with W when W is greater than 0.625. Notice t t+1 t+1 t t that M increases at a slower rate than W does. The figure shows that M is a concave t+1 t t+1 function of W .9 t 5 Simulation Results 5.1 Simulated Assets and Cash Distribution The next step is to compute the stationary distribution of the state variables A and M because I want to compute the cross section average MPC based upon such a distribution. Clarida (1987) proves that the distribution of asset holding will converge to some steady-state distribution in an economy in which assets are the only state variable. I cannot prove that result for an economy in which there is more than one state variable. However, simulation results show that the distributions of A and M indeed become stationary after a few rounds t t of iteration starting with any reasonable distributions of A and M . The top panel of figure t t 5 shows the histogram of the converged distribution of asset holdings. The bottom panel shows the converged distribution of cash holdings before receipt of labor income. Table 3 summarizes the characteristics of the converged assets, cash, and wealth distribution of the baseline parameterization. The mean of the asset distribution is about 80 percent of the mean of quarterly income, and the standard deviation is around 0.2, which is larger than the dispersion of the income distribution. The distribution of M has a large spike at M = 0. t t 9Icannotprovetheconcavity. Asdiscussedbefore,whenW becomesverylarge, M jumpsbacktozero. t t+1 However, because of the limited range of my simulation, this effect is not illustrated in the figure. 19
The spike suggests that more than 20 percent of consumers either have been cash constrained or have paid the transaction cost and have chosen to carry no cash into the next period. The mean of the cash holdings distribution is 0.1, and the standard deviation is 0.09. The demand for cash has two component – one is active and the other is passive. The active demand for cash is based on precautionary motivation, as discussed before. The passive demand for cash comes from the consumers who have B > C but have chosen not to pay the transaction cost. t t They simply carry the residual cash, B −C , into the next period. Finally, the probability of t t paying the transaction costs is about 9 percent. To compare our results with the model with no transaction costs, I plot the histogram of the stationary wealth distribution of the model without transaction costs as in Carroll(1997) (upper panel of figure 6). It is well approximated by a normal distribution. The mean of wealth distribution is about 0.88, and the standard deviation is 0.23. The counterpart for wealth in our model with transaction costs is A +M . I plot the histogram of A +M in the t t t t lower panel of figure 6. Although neither M nor A have a normal distribution, A + M is t t t t similar to a normal distribution. The mean of A +M is 0.89, and the standard deviation is t t 0.21. These values suggest that the overall saving behavior has not been changed substantially by the transaction cost. 5.2 Simulated Average MPC from Unanticipated Shocks Now we can compute the MPC for windfall shocks of various sizes. As mentioned earlier, if the population asset and cash holdings are distributed as the converged distribution, the probability of having to pay a transaction cost is about 9 percent. I simulated 20,000 households using the converged distribution of A and M . The top two rows of table 4 summarizes the t t consumption response to various income changes. I compute the average MPC for a shock equal to 5 percent of the mean of quarterly income and the average MPC for a shock equal to 50 percent of the mean of quarterly income. I refer to the model without transaction cost as the standard model. As table 4 shows, in the standard model, for the 5 percent shock (small shock), MPC = 0.13; for the 50 percent shock (big shock), MPC = 0.10. The diminishing MPC in the standard model comes from the concavity of the consumption function, as Carroll 20
and Kimball (1996) have pointed out. However, the MPC is still substantially lower than the estimates of Bodkin(1959) and Hall and Mishkin(1982). In the model with transaction costs, a 5 percent (small) income shock induces an MPC = 0.23, which is 10 percent larger than the MPC in standard model and is close to the estimate by Hall and Mishkin (1982). For a 50 percent (big) shock, MPC = 0.13, which is still larger than the MPC in the standard model. Although the simulated average MPC out of small windfall income in the model with baseline parameters is still substantially lower than that in Bodkin(1959), changing several parameters can cause the MPC to increase. First, inflation will make holding cash more costly, and consumers will want to increase current consumption by even more than in the baseline model. Second, lower parameters of ρ will make cash constraint less costly and consumers will bemorelikelytochoosetobecashconstrainedand, inturn, haveveryhighMPC.Third, larger transaction costs lead to a wider s-S band and higher average MPC. Later in the paper, I will explore to what extent these alternative parameters help to increase the MPC. Finally, there could be some aggregated shock that systematically makes a larger subset of the consumers cash constrained. The more consumers are constrained, the higher the average MPC will be. 5.3 Simulated Average Consumption Growth in Response to Anticipated Shocks To understand why people are sometimes excessively sensitive to anticipated income shocks, we consider the following problem. Suppose at time t, the consumer gets the news that next period’s income will increase by δ . By how much should today’s consumption increase? t+1 Furthermore, if an econometrician tests the orthogonality condition by regressing C − C t+1 t on δ , under what circumstance would he get a positive coefficient? Would the standard t+1 model and the model of transaction costs provide different results? To pin down the optimal consumption in time t, given that future income will increase by δ , the consumer solves t+1 max U(C )+βE V(A ,M(cid:48) ,∆ ) , (19) t ∆t+1 t+1 t+1 t+1 Ct,Mt+1,θt subject to equation (2.7), and 21
(cid:40) (1−π)(M +∆ −C )+δ if θ = 0, M(cid:48) = t t t t+1 t (20) t+1 M +δ M to be chosen if θ = 1. t+1 t+1 1 t The only difference between equation (21) and equation (6) is that I changed V(A ,M ,∆ ) to V(A ,M(cid:48) ,∆ ), where M(cid:48) is defined in equation (22) as the t+1 t+1 t+1 t+1 t+1 t+1 t+1 real cash balance carried from period t plus the anticipated income change δ . In period t+1 t + 1, the consumer will solve the regular optimization problem (equation 11, 12, and 13) again. Using the computed expected value function, E [V(A ,M ,∆ )], we can compute the ∆t t t t optimal C and C . We also compute the optimal C and C for the standard model with t t+1 t t+1 no transaction cost. The bottom two rows of table 4 report the response of consumption to small (5 percent) and big (50 percent) anticipated income shocks in a model with and without transaction costs. We find that for both small and big income shocks, the standard model that has no transaction costs predicts the average of C −C < 0. This prediction is consistent with the t+1 t PIH because the consumer in the model is assumed to be impatient, in the sense that r < β. The model with transaction costs predicts that for small anticipated changes in income, the cross section average of C − C = 0.0015 is larger than zero. This fact suggests excess t+1 t sensitivity to anticipated income change. For large income changes, the cross section average of C −C = −0.0011 is negative and suggests no or very little excess sensitivity. The results t+1 t are further evidence that people respond to small anticipated income shocks more than the theory predicted. Why does the consumer respond only to small anticipated shocks with excess sensitivity? The reason for consumption smoothing is found in the concavity of the utility function and intertemporal substitution. With transaction costs and a gain from consumption smoothing that is not as large as the utility loss from the payment of a transaction cost, the consumer will not adjust consumption immediately after the arrival of the news of the shock. The consumer will be better off if she waits until the receipt of the income. 22
5.4 Simulation Results for Alternative Parameters The above exercises have been repeated for various alternations of the baseline parameters. Table 5 extends table 3 and table 6 extends table 4 to show the effects of parameter alternations. First, I change the magnitude of the transaction costs. I try both a smaller and a larger transaction cost than in the baseline model. In table 5, under the column of ψ = 0.005, the transaction costs are set equal to 0.5 percent of the mean of quarterly income, half of the baseline magnitude. Not surprisingly, because of a lower transaction cost, the consumer no longer worries about having to pay the costs as much as in the baseline model. This lack of concernisreflectedinalowermeanofthedistributionofcashholdingsoverperiods: compared with the baseline model, the mean of cash holdings decreases from 0.1 to 0.07. This decrease in cash holding is partly offset in an increase in asset holdings, the mean of which increases from 0.80 to 0.82. Consequently, total wealth saved decreased slightly relative to the baseline case. Because of the smaller transaction cost, the s-S band is narrower, and the contraction implies that the consumer pays the cost more often (the probability of paying the transaction cost increases to 18.8 percent), and the induced MPC out of both small and big unanticipated income shock is smaller than in the baseline case. As shown in table 6 under the column of ψ = 0.005, the MPC for the 5 percent (small) income shock is 0.202, decreasing from 0.236 in the baseline model, and for the 50 percent (big) income shock, the MPC is 0.117, comparing with 0.132 in the baseline model. We can see that the MPC is still larger than in the case in which there are no transaction costs (standard model in table 4). In addition, the model still implies an excess sensitivity in response to anticipated income shocks when the shocks are small (in this case C −C = 0.001, greater than zero). t+1 t For larger transaction costs (ψ equal to 2 percent of mean income) the findings are more economically significant than in the baseline case. As we can we see in table 5, under the column of ψ = 0.02, the mean of the precautionary cash holding increases to 0.12, and the mean of total saving increases to 0.92. In table 6 the corresponding column suggests that the increased transaction cost also significantly increased MPC out of both small and big unanticipated income shocks, to 0.249 and 0.168, respectively, because of the wider s-S band 23
(the probability of people paying the transaction cost is only 7.7 percent). For anticipated income changes, we get excess sensitivity for both 5 and 50 percent anticipated changes in income, that is C − C > 0. However, when the size of the anticipated income change is t+1 t increased to 100 percent of the mean of quarterly labor income, not shown in the table, the excess sensitivity fades out. I then study the effects of adding inflation to the model. I consider both mild inflation and high inflation. The results are show in table 5 and table 6 under the column of π = 0.01 and π = 0.05. Inflation makes holding cash more costly, so if the consumer decides not to pay the transaction cost, the MPC out of cash will be higher. This result is shown in the Euler equation(15). Higher π will reduce the current period’s marginal utility and increase future marginal utility; therefore, consumption growth decreases. However, because holding cash costs more, the s-S is narrower (the probability of paying a transaction cost is as high as 23.7 percent), and people pay the transaction costs more frequently. The overall effect of inflation is ambiguous without seeing the numerical results. Simulation results suggest that the higher probability of paying the transaction cost is offset by the increase in MPC within the s-S band. When π = 0.01, the MPC is 0.257 for small income shocks and 0.123 for large income shocks. When π = 0.05, the MPC becomes 0.304 and 0.121 for small shocks and large shocks, respectively. Figure 7 shows the optimal consumption as a function of cash holdings after receiving the labor income for a given level of A=0.5. When we compare figure 7 with figure 3, we see that the s-S band in figure 7 is narrower; but that within the band, more people are in the MPC=1 region in Figure 7. Money thus burns a hole in the high inflation episodes if people hold cash. The pattern of excess sensitivity for anticipated income is similar to that in the baseline model. Finally, I show how different intertemporal elasticities of substitution (IES) would affect the results. I consider the case of log preference (ρ = 1) and ρ = 7. The results are presented in table 5 and table 6 under the columns of ρ = 1 and ρ = 7, respectively. Lower ρ yields a higher IES which makes consumers more willing to substitute consumption between periods. For a high IES, we find significantly lower wealth accumulation as the consumer has a weaker precautionarymotivationtosave. ThisfindingisconsistentwithCarroll(1997). Theconsumer 24
also has a weaker precautionary demand for cash. The mean demand for cash is only 6.6 percent of mean labor income. The consumer does not care that much about intertemporal substitution or being cash-constrained. Therefore, the probability of paying the transaction cost is low (0.645). The MPC from unanticipated shocks is very high, and excess sensitivity to anticipated income shocks is robust even for very large shocks. In the case of a low IES, we have the opposite results. Specifically, wealth accumulation is very high, the MPC from an unanticipated income shock is low, and the excess sensitivity in response to an anticipated income shock cannot be detected even for small income shocks. 6 On the Equity Premium Puzzle Assume the CRRA type of utility functions. The standard consumption-based capital asset pricing model (C-CAPM) leads to the following well-known relationship (cid:183) (cid:181) (cid:182) (cid:184) C ρ t+1 E β (1+r )−1 = 0. (21) t t C t ˆ Testsofthetheorywilltypicallyusemacrodatatofitequation(21)withρˆandβ. Theresearch that has been done using this approach typically reports a very high ρˆ, while studies using other approaches suggest that the CRRA coefficient cannot be that high. This discrepancy is the celebrated equity premium puzzle. Even if the high CRRA coefficient is accepted, the puzzle cannot be resolved. Although a high CRRA coefficient implies a large mean of excess returns of risky assets over the risk-free asset, it also implies an unreasonably high risk-free rate at a level that we never see in the data. The estimated high value of ρˆcan be consistent with a low actual level of ρ estimated from other approaches with various extensions to the model. Gabaix and Laibson (2001) pointed out that if consumers adjust their consumption plan only infrequently, say, every D periods, and aggregated consumption growth is then used to fit equation (23), the estimated ρˆ can be upward biased by a factor of up to 6D. They call this effect the 6D bias. Reis (2006) assumed that the consumers get information infrequently, and during the time that they are not updating their information they are “inattentive.” His model also predicts an upward bias to ρˆ. 25
In a similar spirit, I extend the model in Section 3 to allow for labor income risks as well as asset return risks. This model is state dependent, and does not rely on assumptions like infrequent update of information. The model, however, can lead to results that are related to those in Gabaix and Laibson(2001) and Reis(2006). The real return on assets is assumed to follow some stochastic process and r is observed after the consumer has made decisions on t [C ,M ,θ ]. Therefore, the realization of r has an effect on decisions made only in period t t+1 t t t+1 and later. Mechanically, in the model equation (6) to (14), the only equation that needs modification is equation (7). If we take into account the stochastic asset return, the motion equation of state variable A is (cid:40) (1+r )A if θ = 0, t t t A = (22) t+1 (1+r )(A +M +∆ −C − Mt+1 −ψ) if θ = 1. t t t t t 1−π t The consumer will enter period t with asset holding A , which is determined by the realization t of r . This model has an s-S band for A , (A ,A ), such that for each given level of M +∆ , t−1 t t t t t θ = 0 if and only if A ≤ A ≤ A . Within the s-S band for A , there is a subset (A1,A2) ⊂ t t t t t t t (A ,A ) such that C(A ,M ) is constant if A ∈ (A1,A2) for a given level of M +∆ . Such an t t t t t t t t t s-S band can be represented in terms of r . Consumption does not vary with asset returns t−1 if the return is between a certain range. Consider a situation in which the consumer holds some A0 and M0 in period t - 1 after paying for the consumption expenditure but before r is realized. For any realization of ∆∗, t−1 t there always exists one realization of r∗ such that M0 + ∆∗ = C∗[(1 + r∗ )A0,M0 + ∆∗], t−1 t t−1 t where C∗ is the optimal consumption level in an economy where there is no transaction cost. Such a combination of r and ∆ should leave the consumer in a first best situation. 1−t t Now imagine that the consumer still receives the same realization, ∆∗, but the realized t r is only slightly higher than the predicted r . In an economy without transaction cost, t−1 t−1 consumption should increase in response to the higher asset return. However, in the current model, the higher consumption is not optimal. The consumer’s current cash holding can accommodate consumption of only C∗[(1+r∗ )A0,M0+∆∗]; beyond that the consumer has t−1 t to withdraw money from the asset account. When the change in r is not very large, the t−1 consumer still consume C∗[(1 + r∗ )A0,M0 + ∆∗],and remain fairly close to the first best t−1 t 26
situation; hence the consumer should not pay the transaction cost. Thus, for a certain range of r the correlation between consumption growth and r is literally zero for some given t−1 t−1 ∆∗. If we compute the average correlation across the distribution of r and ∆, the correlation t is positive but lower than in the economy without transaction costs. How important will this effect be quantitatively? The answer depends on how many consumers are on the linear part of the consumption function. Only those consumers who are cash constrained will not vary their consumption with the fluctuations in the equity market. The fraction of cash-constrained consumers, in turn, depends on the size of transaction cost, ψ. The larger ψ is, the wider the s − S band is, and the more cash-constrained consumers there are. However, for the magnitude of the transaction cost assumed in this paper, or any value close to that, this model cannot reduce the correlation between consumption growth and asset returns to a level that is consistent with a plausible value of ρ. Unlike the powerful 6D model, which lowers the required ρ by a factor of 24, the transaction cost model lowers the required ρ only mildly. Conversely, the value of ψ that can lower ρ to the neighborhood of 4 or 5 is unrealistically large. It is well known that the equity premium puzzle is very hard to be resolved by the introduction of one new assumption. The transaction cost model may help explain the puzzle better when the model is integrated with other assumptions that have been partly successful in explaining the equity premium puzzle. 7 Conclusion In this paper, I construct a model with transaction costs in a cash-in-advance economy. The model helps to explain the observation that the MPC out of small unanticipated income shocks is in general higher than the MPC out of large such shocks. It also helps explain why econometricians find excess sensitivity to anticipated income changes only when the changes are relatively small. These results do not rely on assumptions about liquidity constraints or any other behavioral assumptions. Furthermore, the model predicts a demand for cash, or in a broader sense, a demand for liquid assets that arises from a precautionary motivation. Finally, an extension of the model serves as a state-dependent paradigm of how consumption 27
responds to stochastic asset returns, and can help to solve the equity premium puzzle. Two caveats apply to the current model. First, the income process adopted in this model is primitive. To make the computation less expensive, I did not allow for any persistent income process or income growth. Also, the model is partial equilibrium. For shocks at the aggregate level, general equilibrium effects will need to be taken into account. The magnitude of the transaction cost plays an important role in determining the s-S band width. Therefore, future research is needed to uncover solid empirical estimates of the magnitude. Recent developments in the research of limited participation in the equity market provide some estimates of the magnitude of related frictions. For example, Vissing-Jørgensen (2002)estimatesthataper-periodstockmarketparticipationcostofjust50dollarsissufficient to explain the choices of half of stock market nonparticipants. The simulated results of the model have also yielded some testable implications such as that the MPC from windfall income is higher when inflation is high, and that the correlation between asset return and consumption growth is higher when labor income is more volatile. Future research will seek to verify these implications. 28
Appendix: Differentiability and Concavity of the Value Functions In FOC equations (15) and (16), I applied operator ∂ and ∂ on the expected value ∂At+1 ∂Mt+1 function. However, I have not proved that the expected value function is differentiable. Define ˜ V(A ,M ) = E [V(A ,M ,∆ )]. If we assume that the probability density function (PDF) of t t ∆t t t t ∆ distribution is f(∆) and, without loss of generality, that M < φ (A ), the following hold: t 1 t 10 (cid:90) ∞ V ˜ (A ,M ) = VTR(A ,B )f(B −M )dB + (23) t t t t t t t (cid:90) Mt φ1(At) [VNT(A ,B )−VTR(A ,B )]f(B −M )dB . t t t t t t t φ1(At) ˜ To make V differentiable with respect to A and M , we need f(.), φ (A ), and φ (A ) to be t t 1 t 2 t ˜ differentiable. Now we want to know under what circumstances we have a concave V. If we ˜ take the second order derivative of V(A ,M ) with respect to M , and assume that f(0) = 0, t t t we have (24) (cid:90) ∂2V ˜ (A ,M ) ∞ t t = −VTR(A ,0)f(cid:48)(0)− VTR(A ,M )f(cid:48)(cid:48)(B −M )dB + ∂M2 t t t t t t t (cid:90) Mt φ2(At) [VNT(A ,B )−VTR(A ,B )]f(cid:48)(cid:48)(B −M )dB . t t t t t t t φ1(At) For very general continuous distributions with support R+, equation (19) will be negative ˜ for relevant values of A and M . Therefore, for a given level of A , V is concave in M . Simit t t t larly, we can show V ˜ is also concave in A for any given M under rather general conditions.11 t t If V ˜ (A ,M ) is concave in A and M , it is easy to show that VTR(A ,B ) and VNT(A ,B ) are t t t t t t t t concave in A and B respectively when we fix the other argument.12 Finally, V(A ,B ) will t t t t be piecewise concave because it is the upper-contour of two concave functions. 10If M >φ (A ), the consumer will never withdraw money from the asset account. t 1 t 11I do not have detailed rigorous proof on concavity and the more concise conditions that guarantee concavities, but numerical simulations further support the claim of concavity. 12Another conjecture that I have not been able to prove is that the value functions are concave jointly in A andM . t t 29
References [1] Agnew, Julie, Pierluigi Balduzzi, and Annika Sunden (2003) “Portfolio Choice and Trading in a Large 401(K) Plan,” American Economic Review, vol. 93 (March), pp. 193-215. [2] Aiyagari, Rao, and Mark Gertler (1991), “Asset Returns with Transactions Costs and Uninsured Individual Risk,’ Journal of Monetary Economics, vol. 27 (June), pp. 311-31. [3] Alvarez, Fernando, Andrew Atkeson, and Patrick Kehoe (2002), “Money, Interest Rates, and Exchange Rates in Endogenously Segmented Market,” Journal of Political Economy, vol. 110 (February), no. 1, pp 73-112. [4] Barber, Brad and Terrance Odean (2000) “Trading is Hazadous to Your Wealth: The Common Stock Investment Performance of Individual Investors,” Journal of Finance, vol. 55 (April), pp. 773-806. [5] Benhabib, Jess, and Alberto Bisin (2002), “The Psychology of Self-Control and Consumption-Saving Decisions,” working paper, New York University. [6] Bodkin, Ronald (1959), “Windfall Income and Consumption,” American Economic Review, vol. 49 (September), pp. 602-14. [7] Bodkin, Ronald (1966), “Windfall Income and Consumption: Reply,” American Economic Review, vol. 56 (June), pp. 540-6. [8] Carroll, Chris D. (1992), “The Buffer-Stock Theory of Saving: Some Macroeconomic Evidence,” Brookings Papers on Economic Activity, vol. 1992, pp. 61-156. [9] Carroll, Chris D. (1997), “Buffer Stock Saving and the Life Cycle/Permanent Income Hypothesis,” Quarterly Journal of Economics vol 112 (Feburary), pp. 156. [10] Carroll, Chris D., and Miles S. Kimball (1996), “On the Concavity of the Consumption Function,” Econometrica, vol. 64 (July), pp. 981-92. 30
[11] Clarida, Richard (1987), “Consumption, Liquidity Constraints and Asset Accumulation in the Presence of Random Income Fluctuations,” International Economics Review, vol. 28 (June), pp. 339-51. [12] Deaton, Angus(1992), UnderstandingConsumption, NewYork: OxfordUniversityPress. [13] Flavin, Marjorie (1981), “The Adjustment of Consumption to Changing Expectations About Future Income,” Journal of Political Economy, vol. 89 (October), pp. 974-1009. [14] Gabaix, Xavier, and David Laibson (2001), “6D Bias and Equity Premium Puzzle,” NBER Macroeconomics Annual, vol. 16, pp. 257-312. [15] Hall, Robert (1978), “Stochastic Implications of the Life Cycle-Permanent Income Hypothesis: Theory and Evidence,” Journal of Political Economy, vol. 86 (December), pp. 971-987. [16] Hall, Robert E., and Frederic S. Mishkin(1982), “The Sensitivity of Consumption to Transitory Income: Estimates from Panel Data on Households”, Econometrica, vol. 50 (March), pp. 461-82. [17] Hsieh, Chang-Tai (2003), “Do Consumers Smooth Anticipated Income Changes? Evidence from the Alaska Permanent Fund,” American Economic Review, vol. 93 (March), pp. 397-405. [18] Kreinin, Mordechai. E. (1961), “Windfall Income and Consumption: Additional Evidence,” American Economic Review, vol. 51 (June), pp. 388-90. [19] Landsberger, Michael(1966), “WindfallIncomeandConsumption: Comment,” American Economic Review, vol. 56 (June), pp. 534-40. [20] Lucas, Robert E. Jr., and Nancy Stokey (1989), Recursive Methods in Economic Dynamics, Cambridge: Harvard University Press. [21] Mayer, Thomas (1972), Permanent Income, Wealth, and Consumption, Berkeley: University of California Press. 31
[22] Odean, Terrance (1999” “Do Investors Trade Too Much,” American Economic Review, vol. 89 (December), pp.1279-98 [23] Parker, Jonathan (1999), “The Reaction of Household Consumption to Predictable Changes in Social Security Taxes,” American Economic Review, vol. 89 (September), pp. 959-73. [24] Reis, Ricardo (2006), “Inattentive Consumers,” Journal of Monetary Economics, vol 53 (November), pp. 1761-800. [25] Runkle, David (1991), “Liquidity Constraints and the Permanent Income Hypothesis,” Journal of Monetary Economics, vol. 27 (February), pp. 73-98. [26] ShapiroMatthew,andJoelSlemrod(2003),“ConsumerResponsetoTaxRebates,”American Economic Review, vol. 93 (March), pp. 381-96. [27] Souleles, Nicholas S. (1999), “The Response of Household Consumption to Income Tax Refunds,” American Economic Review, vol. 89 (September),pp. 947-58. [28] Souleles, Nicholas S. (2000), “College Tuition and Household Savings and Consumption,” Journal of Public Economics, vol. 77 (August), pp. 185-207. [29] Souleles, Nicholas S. (2002), “Consumer Response to the Reagan Tax Cuts,” Journal of Public Economics, vol. 85 (July), pp. 99-120. [30] Thaler, Richard (1992), “Savings and Mental Accounting,” in George Lowenstein and Jon Elster eds., Choices over Time, New York: Russell Sage Foundation. [31] Vissing-Jørgensen, Annette (2002), “Towards an Explanation of Household Portfolio Choice Heterogeneity: Nonfinancial Income and Participation Cost Structures,” NBER Working Paper [32] Wilcox, David (1989), “Social Security Benefits, Consumption Expenditure, and the Life Cycle Hypothesis,” Journal of Political Econom, vol. 97 (April), pp. 288-304 32
[33] Zeldes, Stephen P. (1989a) “Consumption and Liquidity Constraints: An Empirical Investigation,” Journal of Political Economy, vol. 97 (April), pp. 305-41. [34] Zeldes, Stephen P. (1989b) “Optimal Consumption with Stochastic Income: Deviations fromCertaintyEquivalence,” QuarterlyJournalofEconomics, vol.104(May), pp.275-98. 33
weiveR erutaretiL 1 elbaT sgnidniF egnahC llamS ro egnahC giB secruoS ataD dna stnemirepxE ydutS egnahC emocnI fo epyT detceted ytivitisnes ssecxE A/N atad orcaM )1891(nivalF detceted ytivitisnes ssecxE A/N atad orcaM )2891( llaH detceted ytivitisnes ssecxE egnahc llamS XEC ni dnufer xaT )9991(seleluoS detceted ytivitisnes ssecxE egnahc llamS XEC ni segnahc gnidlohhtiw xaT )9991(rekraP egnahC detapicitnA detceted ytivitisnes ssecxE egnahc llamS XEC ni stuc xat nagaeR owt esahP )2002(seleluoS detceted ton ytivitisnes ssecxE egnahc giB XEC ni stnemyap noitiut egelloC )1002(seleluoS detceted ton ytivitisnes ssecxE egnahc giB XEC ni stnemyap dnuF euneveR liO aksalA )3002(heisH 9.0 dna 7.0 neewteb CPM egnahc llamS stnemyap ecnarusnI efiL ecivreS lanoitaN )9591(nikdoB 61.0 dnuora CPM egnahc giB silearsI ot stnemyap snoitarapeR )1691(ninierK nehw CPM gnisaerced fo ecnedive gnortS regral semoceb emocni llafdniw egnahc gib dna llamS silearsI ot stnemyap snoitarapeR )6691(regrebsdnaL egnahC detapicitnanU fo ezis eht neewteb pihsnoitaler eht tcejer tonnaC CPM dna kcohs emocni llafdniw egnahc llamS stnemyap ecnarusnI efiL ecivreS lanoitaN )6691(nikdoB 92.0 dnuora CPM A/N atad DISP )2891(nikhsiM dna llaH ledom stniartsnoc ytidiuqil eht stroppuS ataD XEC ni dnufer xaT )9991(seleluoS stniartsnoC ytidiuqiL ledom stniartsnoc ytidiuqil eht stroppuS atad DISP )a9891(sedleZ ledom eht troppus ton seoD atad XEC ni segnahc gnidlohhtiw xaT )9991(rekraP ledom eht troppus ton seoD atad XEC ni stuc xat nagaeR owt esahP )2002(seleluoS ledom eht troppus ton seoD atad DISP )1991(elknuR ledom eht troppus ton seoD atad yevruS )3002(dormelS dna oripahS
Table 2 Baseline Parameters real interest rate r 0.015 discount rate β 0.025 prob of very low income p 0.01 transaction cost ψ 0.01 mean of income µ 1 ∆ standard deviation of income σ 0.1 ∆ risk aversion coefficient ρ 2 inflation rate π 0 35
Table 3 Converged Distributions under the Baseline Parameterization Mean (SD) of the Converged Asset Distribution 0.79 ( 0.19) Mean (SD) of the Converged Cash Distribution 0.10 (0.09) Mean (SD) of the Converged Wealth Distribution 0.89 ( 0.21 ) Probability of Paying the Transaction Costs 8.8% Notes: The mean and standard deviation are relative to the mean of income. Table 4 MPC out of Various Income Changes Standard Model Baseline Model Unanticipated Small (5%) income shock 0.135 0.236 Big (50%) income shock 0.109 0.132 Anticipated Small (5%) income shock -0.002 0.002 Big (50%) income shock -0.004 -0.001 Notes: The standard model refers to the model with no transaction costs, the benchmark model refers to the model with transaction costs, both with parameters defined in table 2. 36
noitaziretemaraP evitanretlA eht rednu snoitubirtsiD degrevnoC 5 elbaT 7 = r ) C (goL 1 = r 50.0 = p 10.0 = p 20.0 = y 500.0 = y 21.2 35.0 09.0 48.0 08.0 28.0 )63.0 ( )71.0( )22.0 ( ) 02.0 ( ) 12.0 ( ) 12.0 ( noitubirtsid stessa degrevnoc eht fo )DS( naeM 41.0 70.0 20.0 70.0 21.0 70.0 )01.0 ( )70.0( )30.0 ( ) 70.0 ( ) 01.0 ( ) 70.0 ( noitubirtsid hsac degrevnoc eht fo )DS( naeM 52.2 95.0 29.0 09.0 29.0 98.0 )83.0 ( )71.0( ) 22.0 ( ) 12.0 ( ) 12.0 ( ) 22.0 ( noitubirtsid htlaew degrevnoc eht fo )DS( naeM %7.63 %54.6 %7.32 %4.31 %7.7 %8.81 stsoc noitcasnart eht gniyap fo ytilibaborP .emocni fo naem eht ot evitaler era noitaived dradnats dna naem ehT :setoN noitaziretemaraP evitanretlA rednu segnahC emocnI suoiraV fo tuo CPM 6 elbaT 7 = r ) C (goL 1 = r 50.0 = p 10.0 = p 20.0 = y 500.0 = y detapicitnanU 870.0 273.0 403.0 752.0 942.0 202.0 kcohs emocni )%5( llamS 860.0 902.0 121.0 321.0 861.0 711.0 kcohs emocni )%05( giB detapicitnA 350.0- 410.0 600.0 200.0 300.0 100.0 kcohs emocni )%5( llamS 121.0- 350.0 600.0- 400.0- 610.0 500.0kcohs emocni )%05( giB 37
ledoM eht fo gnimiT 1 erugiF 38
snoitcnuF eulaV 2 erugiF noitcnuF eulaV TV V NV B 0 t nehw noitcnuf eulav eht si NV saerehw ,syawla tsoc noitcasnart eht yap ot decrof si remusnoc eht nehw noitcnuf eulav eht si TV :etoN .TV dna NV fo ruotnoc reppu eht si V noitcnuf eulav ehT .srefsnart yna ekam tonnac remusnoc eht 39
Figure 3 Optimal Consumption with and without Transaction Costs With Transaction Cost Without Transaction Cost Consumption 1.2 1.1 1 0.9 0.8 0.7 0.6 0.5 0.4 0 0.5 1 1.5 2 2.5 3 3.5 Cash on Hand 40
Figure 4 Optimal Cash Holdings Cash on Hand 41
Figure 5 Simulated Distribution of Assets and Cash Asset Holding Cash Holdings Probability Probability 42
Figure 6 Simulated Distribution of Wealth Wealth Holdings Assets Plus Cash Holdings Probability Probability 43
Figure 7 Optimal Consumption in A High Inflation Environment 44
Cite this document
Geng Li (2007). Transaction Costs and Consumption (FEDS 2007-38). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2007-38
@techreport{wtfs_feds_2007_38,
author = {Geng Li},
title = {Transaction Costs and Consumption},
type = {Finance and Economics Discussion Series},
number = {2007-38},
institution = {Board of Governors of the Federal Reserve System},
year = {2007},
url = {https://whenthefedspeaks.com/doc/feds_2007-38},
abstract = {The Rational Expectations Permanent Income Hypothesis (RE-PIH) fails to explain several features of consumption behavior documented by previous researchers. First, the marginal propensity to consume (MPC) out of unanticipated income shocks tends to decrease as the size of the shocks becomes larger. Second, the MPC out of small income shocks is well above what the RE-PIH predicts. Third, consumption responds to small anticipated income changes, but not to large ones. This paper argues that these findings can be reconciled within a RE-PIH framework that includes a cash-in-advance constraint. In the model, the representative agent is assumed to be fully rational with perfect information and is able to borrow against future income. The agent can hold cash and interest-bearing assets, but has to pay a fixed transaction cost to transfer wealth between cash and assets. I show that the agent follows an s-S rule with respect to cash holdings in making wealth-transfer decisions. The MPC within the no-transfer band is higher than that out of the band. It can be lower than or exactly equal to 1. The model also predicts that agents smooth consumption in response to news of large future income changes but not to small ones. Furthermore the model sheds light on the demand for liquid assets and the equity premium puzzle.},
}