feds · December 31, 2009

The Rigidity of Labor: Processing Savings and Work Decisions Through Shannon's Channels

Abstract

This paper argues that constraining people to choose consumption and labor under finite Shannon capacity produces results in line with U.S. business cycle data. My model has a simple partial equilibrium setting in which risk averse consumers keep high labor supply and low consumption profile at early stage of life to hedge against wealth fluctuations. They rationally choose to keep consumption and labor unchanged until they collect enough information. I find that at high frequency consumption appears to be more sluggish than labor supply. However, when people decide to change consumption they do so by a large amount. This combination leads to higher variance of consumption with respect to labor supply. My model also finds high persistence and strong comovement of consumption and employment and delayed response of consumption and labor with respect to wealth. Furthermore, my framework generates endogenously a wedge between marginal rate of substituition and marginal rate of transformation or wages. Such wedge is bigger and more volatile the lower information flow. These findings suggest that rational inattention offers a promising avenue to bridge the gap between theory and U.S. business cycle data.

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. The Rigidity of Labor: Processing Savings and Work Decisions Through Shannon’s Channels Antonella Tutino 2010-02 NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

The Rigidity of Labor Processing savings and work decisions through Shannon(cid:146)s channels Antonella Tutino (cid:3) First version: May 2008 ; This version: February 2009 Abstract This paper argues that constraining people to choose consumption and labor under (cid:133)nite Shannon capacity produces results in line with U.S. business cycle data. My model has a simple partial equilibrium setting in which risk averse consumers keep high labor supply and low consumption pro(cid:133)le at early stage of life to hedge against wealth (cid:135)uctuations. They rationally choose to keep consumption and labor unchanged until they collect enough information. I (cid:133)nd that at high frequency consumption appears to be more sluggish than labor supply. However, when people decide to change consumption they do so by a large amount. This combination leads to higher variance of consumption with respect to labor supply. My model also (cid:133)nds high persistence and strong comovement of consumption and employmentanddelayedresponseofconsumptionandlaborwithrespecttowealth. Furthermore,myframeworkgeneratesendogenouslyawedgebetweenmarginalrate ofsubstituitionandmarginalrateoftransformationorwages. Suchwedgeisbigger and more volatile the lower information (cid:135)ow. These (cid:133)ndings suggest that rational inattention o⁄ers a promising avenue to bridge the gap between theory and U.S. business cycle data. E-mail: Antonella.Tutino@frb.gov. The views in this paper are solely the responsibility of the (cid:3) author and should not be interpreted as re(cid:135)ecting the views of the Federal Reserve Board or any other person associated with the Federal Reserve System. 1

1 Introduction Existing macroeconomic theories have trouble (cid:133)tting empirical regularities in aggregate behavior. Existing models match some U.S. business cycle facts at the expense of others. Despite the progress in the (cid:133)eld and rich modelling tools, one peculiar dimension of the business cycle that still troubles the literature is the labor market. As a matter of theory, starting from a framework in which consumers and (cid:133)rm optimize, market clears and no (cid:133)scal distortion, equilibrium in the labor in equilibrium the marginal rate of substitution (MRS) between consumption and labor ought to be equal to the marginal product of labor (MPL) given by wages. As a matter of fact, the data shows a discrepancy between the two, called the labor wedge. The challenge for macroeconomic theories has been to match magnitude and volatility of such a wedge (Hall 1997). Over the years, a number of explanations and modelling devices have been proposed to address the movement in the labor wedge over the business cycle. The principal suspects are di⁄erent sorts of shocks to either the demand or the supply side of the labor market or both. Whether the shocks hit wage mark-ups,1 disutility of labor,2 taxes 3 or productivity as in the search and matching literature,4 the common denominator seems to be the close connection between ad hoc modelling technologies and results. Moreover and most importantly, models based on exogenous shocks do not only fall short in microfounded justi(cid:133)cation but also fail in providing empirically plausible predictions along other dimensions.5 This paper suggests an explanation of labor wedge and, more generally, business cycle facts, based on rational inattention.6 My choice to model people constrained by (cid:133)nite information processing capacity agrees with intuition and, as the paper shows, points towards the right direction to capture empirical evidence. In particular, my model is able to generate endogenously a wedge between MRS and MPL in an optimizing framework without the need of exogenous shocks. In my model, rational households consume, work and pay attention to their wealth to maximize their lifetime utility. In the baseline model, I assume that consumers are risk adverse and have a constant disutility of labor. Under rational inattention, they cannot know the exact value of their wealth in each period due to information-processing frictions. Each period they choose information about wealth they can attend to and decide on the basis of that information how much to consume and work. Realized consumption and employment are then used to update rationally their knowledge of wealth. In my setting wages follows a Markov process and its distribution is known before consumers make their work and consumption decision. Thus, (cid:135)uctuations in wealth are due to 1cfr. e.g., Gal(cid:236), Gertler and Lopez-Salido (2007), Smets and Wouters (2007). 2cfr., e.g., Rotemberg and Woodford (1987), Smets and Wouters (2003), Gal(cid:236) and Rabanal (2004). 3cfr. e.g., Uhlig (2003), McGrattan and Prescott (2007). 4cfr. e.g., Pissarides (1985), Mortensen and Pissarides (1994), Shimer (1995), Hall (1995) and Gal(cid:236) and Trigari (2006). 5cfr. Pissarides (2007) and Shimer (2008). 6cfr., Shannon (1948), Sims (1988, 1998, 2003, 2006), Tutino (2008). 2

movements in labor and consumption as well as movements in wages. Consumers keep track of wealth by processing information in the limits of their Shannon capacity. Having a bound on information processing rate suits the observation that people do not check their account on a daily basis, nor they are likely to keep track of the incidence of their expenses and hours worked on their lifetime wealth at high frequency. Using Shannon channels as modelling device has also the nice feature of providing a natural bound for information (cid:135)ow which depends only on the distribution of the variables that are passed through the channel, regardless of the speci(cid:133)city of the channel. This in turns makes the choice of this kind of information friction free from unexplained assumptions on individual characteristics. More importantly, the predictions of the model agree with U.S. business cycle data along the dimensions analyzed. I show that even in a simple partial equilibrium with focus on the supply side of the labor market, a model in which people choose labor and consumption under information processing constraint (cid:224) la Sims is able to explain several features of U.S. business cycle. In particular, my setting delivers four results: 1. ratio of standard deviation of consumption over hours worked bigger than one; 2. persistence in consumption and labor supply; 3. comovement of consumption, labor and wealth; 4. endogenously derived labor wedge with positive variance. To understand the mechanism behind these results, consider what happens in an equivalent model with full information. When wages are deterministic and interest rates are high enough to promote savings, a consumer aware of his wealth with log utility and constantdisutilityoflaboraccumulateswealththroughhighlaborsupplywhich, inturns, increase period-by-period income while keeping consumption smooth. In such a context (cid:135)uctuations in wealth are mostly due to (cid:135)uctuations in labor supply. With information processing constraints, consumers cannot know the exact value of their wealth even with deterministic wages. They keep track of their wealth imperfectly by choosing signals as informative about wealth as channel capacity allows them to. Log utility in consumption and linear disutility of labor together with low information (cid:135)ow make household work hard and save at early stage of their life. With low information (cid:135)ow and, as a result, low informativeness of the signal each period, households keep savings and labor supply high to make sure they can sustain their consumption. As wealth accumulates, the signal on high values of wealth becomes sharper, calling for a major adjustment in behavior. The size of the adjustment is bigger the lower their processing capacity. This result is intuitive: the longer consumers wait to modify their behavior, the bigger is the variation in consumption and labor once they acknowledge the change in wealth through information processed. Furthermore, in my model, consumption is more sluggish than labor supply. The rationale for this (cid:133)nding lies in the preferences of consumers who dislike having to change their consumption frequently while having constant disutility of labor. Such preferences imply that when people review their consumption plan by realizing that they have saved too much or too little with respect to their lifetime possibilities, they do so by large amounts. This in turns leads to higher volatility of realized consumption with respect to labor supply and provides a rationale for the (cid:133)rst result. The e⁄ect is stronger the lower the information (cid:135)ow. 3

The autocorrelation of consumption and employment derive from a similar logic. With signals that bring about low information, changes in behavior are slow at high frequency: the news about wealth are not enough to modify yesterday(cid:146)s consumption and labor supply. This implies that consumers are (cid:146)stuck(cid:146)with the same choices and keep accumulating information until evidence of changes in wealth suggests otherwise. As for the third (cid:133)nding, my model predicts stronger comovement of contemporaneous consumption, labor and wealth the higher the information (cid:135)ow. If information capacity is low, then contemporaneous consumption and labor commove strongly with lagged values of wealth. Contemporaneous consumption, labor and wealth are strongly linked viathebudgetconstraint. Moreover, thebudgetconstrainta⁄ectsthechoiceofthepolicy function -optimal joint distribution between wealth and behavior. With high information (cid:135)ow, consumers(cid:146)optimal policy commands to make wealth and behavior as related as possible so that the outcome from consumption and labor supply are very informative about wealth. If information (cid:135)ow is low, consumption and labor are strongly correlated with past values of wealth. This result is driven by the rational (Bayesian) update of consumers(cid:146)information. Each period the household gets information about wealth and observes consumption and labor choices. Low information (cid:135)ows makes the signal on wealth imprecise forcing the household to rely mostly on the information content of his consumption and labor. This translates into periods of inertial behavior until the information collected signals enough variation in wealth to justify a change in the choices. This observation together with Bayesian updating explain why, in presence of (cid:133)nite rate of information, consumers delay their reaction to changes in wealth. These resultsarerobusttohigherspeci(cid:133)cationofriskaversionandforthecaseof(cid:133)niteelasticity of substitution of leisure in di⁄erent periods. The last result is that my model derives endogenously a wedge between MRS and MPL. Moreover, a reasonable calibration of the model leads to magnitude and variance of this wedge in line with U.S. data 7. The key point is that people react to (cid:135)uctuations of wealth according to the information they have chosen. As a result, consumption and labor are function of the optimal signal that consumers select. The lower the information (cid:135)ow, the less informative the signal, the more uncertain consumers are about their wealth. Consumers react to such uncertainty by modifying their consumption and labor on the basis of their perception of the wealth given the signal. This mechanism generates a discrepancy between the actual wage (MPL) and the marginal rate of substitution between consumption and labor (MRS) based on consumers(cid:146)information. Together with the contribution to the macroeconomic labor literature, this paper is closely related to the literature of rational inattention, with particular reference to Sims (2003, 2006), Tutino (2008) and Mackowiak and Wiederholt (2008a, 2008b). This paper departs from Sims (2003) and Mackowiak and Wiederholt (2008a, 2008b) since the exante characterization of uncertainty is not limited to the Gaussian distribution nor the framework is constrained to be linear quadratic. Instead, as in Sims (2006) and Tutino (2008), this paper presents a fully endogenous choice of distribution of uncertainty in 7cfr. Shimer (2008). 4

a dynamic context and allows for risk aversion in the speci(cid:133)cation of agents(cid:146)preferences. The present framework extends Tutino (2008) by augmenting the choice space of people to consumption and labor as opposite to only consumption. This extension generates endogenously an allocation of attention, and, in turn, a di⁄erent degree of persistence between consumption and labor on the basis of the stochastic properties of the joint distribution of decisions and wealth chosen by the consumers. An example might help clarifying the intuition behind this result. Suppose that a person works the same amount of hours everyday. Given this behavior, the person learns nothing about wealth through his labor supply. In this case, (cid:135)uctuations of wealth are acknowledged mostly through the information content of consumption realizations and the optimal distribution of uncertainty is similar to the one derived in Tutino (2008). The household might have a better understanding of his (cid:133)nancial possibilities by varying either consumption or leisure or both and thereby improve on both consumption and leisure. If wages are relatively stable, it might be optimal to keep labor supply (cid:133)xed and o⁄set (cid:135)uctuations in wealth with changes in consumption. On the other hand, if wages change signi(cid:133)cantly, it might pay o⁄to vary the amount of labor supply and maintain a smooth consumption pro(cid:133)le. Amountanddirectionsofthesechangesinbehaviordependontherelativecostof changing consumption with respect to labor supply, the relative bene(cid:133)ts of being better informed about wealth through either source and households(cid:146)preferences implied by the curvature of the utility function. Thus, in this framework, predominance of income vs. substitution e⁄ect does not depend only on people(cid:146)s utility as standard macro literature delivers, but also on the relative attention people pay to current consumption and hour worked as source of information about wealth. Therestofthepaperisstructuredasfollows: Thenextsectionpresentsthemodeland its main assumptions while section 3 illustrates the computation strategy. Section 4 is the core of the paper. It analyzes the main (cid:133)ndings and contrasts them with US business cycle data. Section 5 provides statistics and predictions of di⁄erent speci(cid:133)cations of the model. Section 6 concludes. 2 The Model The model is a one sector partial equilibrium discrete time problem. The economy is populated by numerous households who want to maximize the expected discounted value of theirutilityde(cid:133)nedoveraconsumptiongoodandleisure. Iassumethattheconsumers have in(cid:133)nite elasticity of substitution between leisure in di⁄erent periods.8 Their wealth evolves according to their previous period savings augmented by a (cid:133)xed and exogenous interest rate and their income is given by the hours of work they decide to supply in exchange for a salary. The households cannot process all the information available about their wealth due to limited processing capacity. Instead, they have a belief about it. The assumption that 8The main results of the paper are not a⁄ected if one assumes (cid:133)nite elasticity of substitution. This is shown in Section 5. 5

wealth is not known is meant to capture the complexity associated with mapping current value of wealth (e.g., the number that people see at the end of their personal income statement ) with their current and future possibility of consumption and leisure. The bound on information (cid:135)ow prevents households to perform this task immediately and with in(cid:133)nite precision. The model re(cid:135)ects households(cid:146)initial uncertainty about wealth before processing any information through the assumption that people enter their life with a prior on wealth.9 Householdsdecidewhichsignalsaboutwealthtochooseinordertomaptheir(cid:133)nancial possibilities to consumption and work. They can freely choose the characteristics of their signals under the condition that the information between wealth and behavior provided by the signal (cid:135)ows at (cid:133)nite rate. I assume that the households in the economy share the same characteristics in terms of preferences, endowment and their capacity of processing information. This allows me to focus on a representative agent(cid:146)s economy. 2.1 Timing Before the formal description, I provide a chronological view of the events to guide through the model. A person enters his life with a belief on wealth, b(W). He decides on a signal that conveys information jointly on his wealth (W) and decision (A C;L ) of (cid:17) f g consumption (C) and work (L): Let p ( ; ) denote the joint probability of wealth and (cid:3) w a (cid:1) (cid:1) decisions implied by the optimal choice of the signal. Note that the signal can provide informationaboutanydimensionofbehavior-A-andwealth-W-thatthepersonwants, with the restriction that the informativeness of the signal cannot exceed his processing capacity. Such a signal provides him with a rule of conduct for consumption and labor choices. In period (cid:28) = 1, the household draw from the optimal choice p ( ; ) his (cid:3) w a (cid:1) (cid:1) consumption pro(cid:133)le (c ), and labor supply (l ). (cid:3) (cid:3) Figure 0: A typical day of a rationally inattentive person. Then, he observes the outcomes of his choices and use the observation to update rationally his knowledge of wealth (b(w )). This complete his day. The day after he 0 ja follows the same routine starting with b(w ) as his new prior. Figure 0 describes the 0 ja events. 9Cfr.: Sims (2003, 2006) and Tutino (2008). 6

2.2 Statement and Recursive Formulation of Consumers(cid:146)Problem I discuss each element of the model in turn, starting from the constraints. First I present the budget constraint and discuss its role in updating the knowledge of wealth for an information constrained consumers. Next I turn to the information-(cid:135)ow constraint, key of the model. Finally I present the objective function and set up the full problem. Thestructureof theeconomyfollowscloselytheoneof Tutino(2008)towhichI refers for the mathematical details. For completeness, Appendix A proves rigorously that the problem admits a recursive formulation and that the resulting Bellman equation is a contraction. 2.2.1 Budget Constraint and Update Consumers maximize their lifetime utility function, de(cid:133)ned over a consumption good and leisure. Let C denote the consumption good and L be labor. I collect the actions at time t in the set At (cid:17) f C s ;L s gt s . (cid:20) (cid:20)1 Consumers are limited in their choices by a budget constraint W = R (W C )+s L (1) t+1 t t t t t (cid:0) where R = R is the (constant) interest rate on savings, (W C ), s is the wage the t t t t (cid:0) agent receive in exchange of L units of labor. The process characterizing the wages is t Markovian with constant mean E(s ) = s(cid:22). Consumers cannot observe the exact value of t wealth due to information-processing constraints. I assume that households make their consumption and labor choices knowing that the mean of the wages is (cid:133)xed at s(cid:22). This knowledge is embedded in a prior, g(w ); over the possible realizations of wealth. As t consumers go through life, they update rationally this belief with signals on wealth and the observation of their past behaviors. Let a c ;l be a particular behavior of consumer at time t where c is a speci(cid:133)c t t t t (cid:17) f g outcome of the random variable C at time t and similarly l is a speci(cid:133)c outcome of the t random variable L. Note that a is drawn from the optimal choice p ( ; ). (cid:3) w a (cid:1) (cid:1) The posterior of wealth conditional on observing a particular a = a~ follows by Bayes(cid:146) law : g (w ) = T (w ;w;a~) p(w a~)dw (2) 0 0 ja~ 0 (cid:1) j Zw where T (w ;:;:) is the transition function commanded by the dynamic of wealth (1) 10 0 and p(w a~) takes into account the potential noise in the current observation of the state. j This noisy observation is carried over one period ahead to infer next period(cid:146)s state. 10Actually, the operator T (w ;w;a~) assigns probabilities to w conditional on the value of a~ and w. 0 0 Sincecurrentvaluesofw arenotobservable,theoperatorT (:)appliesanexpectationovertheunknown 7

2.2.2 Choice Variable Before processing any information about wealth (W), consumption (C) and labor (L) are random variables from the perspective of the consumers. To see why, (cid:133)rst consider that the household cannot choose C and L optimally without relating wealth and behavior (C and L). Coming into the world with a probabilistic knowledge of W, mapping W into C and L translates into (cid:133)nding a joint relation among wealth, consumption and labor, that makes the information about wealth as related as possible to consumption and labor. The selection of information about wealth useful to consumption and labor, that is, the joint probability distribution of wealth, consumption and labor, p( ; ; ), is key in the w c l (cid:1) (cid:1) (cid:1) optimization of the consumers since it a⁄ects current beliefs and their updates.11 To clarify this point, suppose that information (cid:135)ows at in(cid:133)nite rate. In this case, the optimal p( ; ; ) will be degenerate assigning to each value of w one value for w c l (cid:1) (cid:1) (cid:1) c (w) and l (w). By contrast, suppose that processing information about wealth is too (cid:3) (cid:3) di¢ cult for the consumer, then the consumer will be better o⁄ processing very limited amount of information about wealth. This is equivalent to choosing the minimal amount of information about wealth that allow the consumer to set c and l constant for each values of w without breaking his budget. Such a behavior implies that consumption and labor will be almost independent on wealth. When the information-processing e⁄ort lies in between this two extreme cases, optimizing consumers aim at setting p( ; ; ) such w c l (cid:1) (cid:1) (cid:1) that the conditional probability of wealth given consumption and labor is as close to wealth as possible given the information constraint and the preference of the consumers. To understand how consumers(cid:146)preference are necessary to determine p( ; ; ), conw c l (cid:1) (cid:1) (cid:1) sider a risk averse person. A risk averse individual would be more interested in knowing when his account is close to be overdrawn than a risk-neutral one. It follows that the risk averse type wants to process more information about low values of wealth which in turns implies low values of consumption and via the e⁄ect that income has on wealth (eq.1), also about low values of labor. If the consumer (cid:133)nds it extremely costly to process information, he watches out for those values of wealth, consumption and labor that reduce his utility the most by collecting information along this particular dimension. A consumer with relative better abilities of processing information can allocate his attention to news about several combinations of wealth, consumption and labor, ranked according to his utility.12 w. For a particular realization of a~= C =c~;L=~l , the operators is de(cid:133)ned as: n o T (w ;w;a~) E(W )=R (w c~)dw+s(cid:22)~l 0 0 (cid:17) (cid:0) Zw 11Alternatively, one can think of the choice of p( ; ; ) as equivalent for the consumers to choosing w c l (cid:1) (cid:1) (cid:1) a signal about wealth and behavior. The consumer decides the scope of the signal according to his preferences but the overall informativeness of the signal is constrained by his information-processing skills. 12Exploring the interaction between information processing constraints and risk aversion is relatively noveltotheliteratureinrationalinattentionwhichhasfocussedmainlyontheLinearQuadraticGaussian 8

From the above discussion, it follows that an optimal policy function for a rationally inattentive individual involves choosing p( ; ; ). Consumption and labor are drawn w c l (cid:1) (cid:1) (cid:1) from the optimal policy p ( ; ; ). (cid:3) w c l (cid:1) (cid:1) (cid:1) 2.2.3 Information Constraint As brie(cid:135)y explained before, people with limited processing capacity, select optimally information about wealth and behavior within their cognitive possibilities. I model the restrictionthatthesecognitivepossibilitiesare(cid:133)niteusingShannon(cid:146)smutualinformation (Shannon (1954), Sims (2003, 2006)) between the random variables W and A. This technology measures the maximum reduction in uncertainty associated to a system as di⁄erence between the initial uncertainty (entropy of W) and the knowledge of variable W provided by the observation of A (conditional entropy of W given A). Since mutual information depends only on the joint distribution of W and A for a given belief, this way of modelling residual uncertainty is applicable without additional restrictions on the nature and characteristics of the channel. In my setting, Shannon capacity captures the ability of consumers to interpret news about their wealth, thereby regulating the speed of reaction of their behavior to these news. An intuitive analogy to Shannon capacity is the activity of a person(cid:146)s brain. Suppose the person reads about the drop in prices in the housing market. According to the time he has and his knowledge of real estate, his reading of the news can lead him to make signi(cid:133)cant steps towards buying an house right away. If the person does not have enough time to absorb the information about real estate and loan market and to map this information into his current and future net worth, he might want to push forward his decision to buy until he has (cid:133)gured out this link. I model people(cid:146)s ability to map information about wealth into consumption and labor decisions by assuming a constant and exogenous shadow cost on the informationprocessing constraint -mutual information between W and A-. In the model, such a cost is denoted by (cid:18). This assumption has the interpretation that mapping each bit of information about wealth into consumption and labor decisions costs the same processing e⁄ort to the consumers. Di⁄erent from Sims (2003) and Mackowiak and Wiederholt (2008a) where the capacity is (cid:133)xed and exhausted every period, (cid:133)xing the shadow cost of processing information has the appealing property that consumers can e⁄ectively choose the amount of uncertainty they want to reduce each period according to their (perceived) (cid:133)nancial conditions and their preferences.13 For instance, a person who (cid:133)nds it extremely costly to process information about wealth -i.e., high (cid:18)-, might choose to pay attention to wealth only after he observes that his consumption/leisure pro(cid:133)le has changed signi(cid:133)cantly over time. By contrast, a person with relative better abilities to (LQG) framework (Sims 2003, Mackowiak and Wiederholt (2008a, 2008b). Cfr. Tutino (2008) for a discussion of the advantages of moving into a fully endogenous choice of signal with respect to the LQG framework. 13The approach of (cid:133)xing the shadow cost of information instead of the capacity is adopted in Sims (2006) and Tutino(2008). 9

process information -i.e., low (cid:18)-, might (cid:133)nd it optimal to keep close track of his wealth in order to enjoy combination of consumption and leisure that maximize his utility. Formally, let (p( ; )) be the mutual information implied by the choice of the joint w a I (cid:1) (cid:1) distributionofW andA, (p( ; )). Theconstraintthatlimitstheamountofprocessable w a (cid:1) (cid:1) information at each time t is given by : p(w ;a ) t t (cid:20) = (p( ; )) = p(w ;a )log dw da (3) t t w a t t t t I (cid:1) (cid:1) p(w~ ;a )dw~ g(w ) Z t t t t ! (cid:0)R (cid:1) The expression in (3) says that the maximum uncertainty that the consumer can reduce about his wealth, through observation of consumption and labor supply is at most (cid:20) bits per unit of time. Mapping formulae into the same intuition gathered in the previous section, had the consumer had in(cid:133)nite processing capacity, he would be able to choose a signal which makes each of his actions informative about a particular value of wealth. This results in a policy function for consumption, labor and wealth that depends on the -now observable- value of wealth. On the other extreme, with no processing capacity, the best one can do is to assign all the probability to a particular value of A. This makes the variables W and A independent of each other, ( (p( ; )) 0). Every w a I (cid:1) (cid:1) ! day, such a person spends the same amount of cash in consumption and the same amount of time working, regardless of his (cid:133)nancial possibilities. In the intermediate case, if the person can process a (cid:133)nite amount of information, he attends to information that make his saving and labor decisions as close as possible to wealth. 2.2.4 Objective Household(cid:146)s problem is to maximize the in(cid:133)nite horizon expected utility of consumption and leisure discounted at factor (cid:12). Let (cid:18) be the ((cid:133)xed and exogenous) shadow cost of processing information (cid:20) in (3) below. The control for their maximization is a signal p(w;a) that solves : 14 V (g(w)) = max u(c;l)p(w;a) (cid:18)(cid:20)+(cid:12) (V (g (w )))p(w;a) (4) p(w;a) (cid:0) 0 0 ja ! 2D w a w a XX XX subject to p(w;a) (cid:20) = (p( ; )) = p(w;a)log (5) w a I (cid:1) (cid:1) ( p(w~;a))g(w) w a (cid:18) w~ (cid:19) XX P and (2) and the requirement that p(w;a) where is the set of all distribution 2 D D that satis(cid:133)es p(w~;a) = g(w~) (6) a X 14For a formal prove that the in(cid:133)nite problem of the household can be written as Bellman equation see Appendix A. 10

0 p( ; ) 1, w;a (7) w a (cid:20) (cid:1) (cid:1) (cid:20) 8 p(w;a) = 1: (8) w a XX Taking (cid:133)rst order condition with respect to p(w;a) results in 15 @p(w;a) : 0 = u(c;l)+(cid:12)V ( )+(cid:18) log p(w;a) p(w;a) @ w00 w00; (cid:1) a (cid:15)ja (cid:18) w00 p(w 00 ; (cid:1) a ) (cid:19) (cid:0) w a (cid:0)P @(p(w (cid:0) ;a)) (cid:1)(cid:1)! XX P @ V g w @g(w ) +(cid:12) 0 0 jat 0 ja p(w;a) (9) @g w @p(w;a) w a " (cid:0) (cid:0) (cid:0)0 jat (cid:1)(cid:1)(cid:1) # ! XX where (cid:0) (cid:1) @g w @( T (w w;a)p(w a)) 0 jat = w 0 j j @p(w;a) @p(w;a) (cid:0) (cid:1) P T (w w;a) @T (w w;a) 0 0 = j + j p(w a): w a00 p( (cid:1) w ;a 00 )p~(a) w @p(w;a) j X X P De(cid:133)ne the solution to the optimization problem of the consumers as the distribution p ( ; ). Summing over all the possible realizations of W and C leads to the marginal (cid:3) w a (cid:1) (cid:1) distribution of labor p (L) p ( ; ) (10) (cid:3) (cid:3) w a (cid:17) (cid:1) (cid:1) w a A L X 2Xf n g and likewise summing over w and l, delivers the marginal distribution of consumption: p (C) p ( ; ) (11) (cid:3) (cid:3) w a (cid:17) (cid:1) (cid:1) w a A C X 2Xf n g where the notation s S X indicate that the possible values s takes in S exclude 2 f n g the subset X S. The realized outcomes c ;l 16 are then drawn from the optimal (cid:26) It tI joint p ( ; ; ). Appendix B derives the solution of a static, low-dimensional version of (cid:3) w c l (cid:1) (cid:1) (cid:1) (cid:8) (cid:9) problem (4)-(8) that admits a quasi analytical solution. 15Notethatthe(cid:133)rstorderconditionin(9)isvalidfor(cid:18) >0. If(cid:18) =0, thentheprobabilitiesg(w)and p( ; ) are degenerate. In this case, (p( ; )) = 0, so using Fano(cid:146)s inequality (Thomas and Cover w a w a (cid:1) (cid:1) I (cid:1) (cid:1) 1991), c( (p( ; ))) = c(w) w a I (cid:1) (cid:1) l( (p( ; ))) = l(w) w a I (cid:1) (cid:1) which makes the (cid:133)rst order conditions for this case the full information solutions. 16Thesuperscript denotesthedependencyofsuchoutcometotheoptimaldistributionofuncertainty, I p ( ; ). (cid:3) w a (cid:1) (cid:1) 11

3 Solution Strategy To appreciate how consumers(cid:146)preferences interact with the information processing constraint, I consider the following speci(cid:133)cations for the utility function: c1 (cid:0) (cid:13) (cid:11)l 1 (cid:13) (cid:0) u(c;l) = lo(cid:0)gc (cid:11)l : 8 (cid:0) logc (cid:11) l(cid:17)+1 < (cid:0) (cid:17)+1 where (cid:13) is the coe¢ cient of risk aversio:n on the utility of consumption, (cid:17) is the inverse of Frisch elasticity of labor supply, ", and (cid:11) [0;1] is a constant disutility associated to 2 labor. The computation methodology follows closely Tutino(2008) to which I refer for technical details. However, there is a computational di⁄erence that I shall highlight. The di⁄erence concerns the construction of the simplex. This paper uses a uniform random grid to generate the simplex. Such a method is more e¢ cient in terms of computational time than a non-uniform random grid and it requires less point to span the simplex. To sketch the methodology, we start with n as the number of possible values that w can assume. Then, each point of the simplex, (cid:1), is an n array each of whose rows contains m random values which belong to the interval [0;1] and are subject to the condition that their sum is 1. The distribution of values is uniform in the sense that it has the conditional probability distribution of a uniform distribution over the whole m-cube, given that the sum per row is 1: The algorithm randomly determines the placement of random points in the n 1 dimensional simplexes. (cid:0) To map the (cid:133)ner state space into Matlab possibilities, I interpolate the value function with the new values of (2) using a kernel regression of V ( ) into g (w ): I use an (cid:1) 0 0 ja Epanechnikov kernel with smoothing parameter h = 3.17 Table 1 reports the benchmark 17The reason behind h = 3 comes from experimental trials for h [2;5] with increments of 0:5 2 (i:e:; h=2; h=2:5;::). While the results do not change substatially as I vary h in this range, the computational time is lower when I set h=3. 12

parameter values and the grids.18 The value iteration converges in about 180 iterations. Benchmark Values Discretization Core State Space W [1 : 1 : 10] Consumption Space C 8 [1 : 1 : 10] 15 Labor Space L [1 : 0:5556 : 6] Joint Distribution per simplex point 100 10 (cid:2) Marginal C 10 1 (cid:2) Marginal L 10 1 (cid:2) Marginal W 10 1 (cid:2) s(cid:22) 1.2 (cid:11) 0.02 (cid:13) 3 (cid:17) 0 R 1.03 Discount Factor, (cid:12) 0.9966 Table 1 I shall point out another caveat for the model calibration, concerning the ratio of the interest rate, R, and the inverse of the discount factor, (cid:12). The calibration in Table 1 implies that the annualized rate of return induces consumers to save until wealth reaches its maximum. Under full information and with utility u(c;l) = log(c) (cid:11) l, (cid:0) (cid:3) such an high interest rate associated to constant disutility of labor makes the consumer accumulate wealth by varying labor supply more than consumption. Under rational inattention, infrequency and sharpness of the adjustments of consumption and labor makes the variance of consumption higher than the one of labor. To highlight the sharp di⁄erences between full information models and rational inattention ones, I maintain the assumption 1 < R(cid:12) throughout the numerical simulations. The parametrization of the model is summarized in Table 1. 18The strategy used to reduce the dimensionality of the choice space is to pair c ;l per each row i j f g and associate to them a value of wealth. In a 2 states, 2 choices of l and 2 choices of c case, that results into: w w 1 2 c ;l p p 1 1 1 2 f g c ;l p p : 2 1 3 4 f g c ;l p p 1 2 5 6 f g c ;l p p 2 2 7 8 f g The corresponding marginals are then evaluated as e.g., p(l )=(p +p )+(p +p ): 1 1 2 3 4 13

4 Predictions of the Model and the Business Cycle. The goal of this section is to provide suggestive evidence on how the model performs when compared to U.S. data. The data that I use for consumption are non durable goods from the Bureau of Economic Analysis while data for average hours worked are available from the Bureau of Labor Statistics. All the series have quarterly frequency from 1964.I to 2007.II.. I construct (cid:133)gures and statistics by detrending the data with the HP (cid:133)lter, using a value of (cid:21) = 1600. I then used the detrended data to compute mean, standard deviation and the correlations. I compare the results of the model with the detrended series under the observation that processing information through a Shannon channel (cid:133)lters out high frequency component of the variable(s) of interest (see Guo et al. (2005), Verdø (1996, 1999) and Sims (1998, 2003). The choice of HP (cid:133)lter over other (cid:133)lter is simply to ease the comparison with the business cycle literature. As for the simulated series, I de(cid:133)ne a model as a set M = (cid:18);(cid:13);(cid:17) and I compare the f g business cycle facts to speci(cid:133)cations of the model that assume (cid:18) = 2;0:2;0:02; (cid:13) = 1 and (cid:17) = 0. I choose three values of (cid:18) as a proxy for three types of individuals that face three di⁄erent shadow costs of processing information ranging from low ((cid:18) = 0:02) to medium ((cid:18) = 0:2) to high ((cid:18) = 2). As for these particular numerical values, I verify empirically that given the discretization of core states and core decisions and the baseline model with log utility of consumption and linear disutility in labor, a value of (cid:18) between 1 and 3 leads to the same quantitative results in terms of choice of distribution. Thus, I pick the middle value in the set (cid:18) [1;3) for the high costs. The optimal choice of the joint 2 distribution p(w;a) is similar for value of (cid:18) (0:1;0:6]. Again, I pick the middle value 2 in the interval for the second choice of (cid:18). Similar reasonings conduct to (cid:18) = 0:02 as lower value of (cid:18) when (cid:18) takes up values in (0:01;0:05). Notice that for values of (cid:18) above 5, households acquire very little information about wealth and set consumption and labor basically constant. Also, for values of (cid:18) below 0:05 deliver a solution very close to the full information case. To get a quantitative assessment of what these costs mean, I compute the average di⁄erence in lifetime utility between the in(cid:133)nite capacity case and each of the (cid:18)-cases considered. Thatis, Isetu(c;l) = log (cid:11)l andIcomputeE u(c;l) u c ;l ;(cid:18) where I I (cid:0) (cid:0) E(u(c;l)) is average lifetime utility under in(cid:133)nite capacity case while E u c ;l ;(cid:18) is I I (cid:0) (cid:0) (cid:1)(cid:1) the utility under a particular value of (cid:18) when (cid:18) takes values in ( 2 ; 0:2 ; 0:02 ). With f g f g(cid:0)f(cid:0) g (cid:1)(cid:1) an average value of E(u(c;l)) 1:5 under full information, (cid:18) = 2 is associated to a loss ’ of about 20% in lifetime utility, (cid:18) = 0:2 implies a loss of about 11% while (cid:18) = 0:02 delivers a loss of about 8%. The values used for this computation can be found in Table 10a 10c in appendix B. I assume that an economy is populated evenly by those three (cid:0) types. 19 Thecomputationsareasfollows. Foreach(cid:18),mean,standarddeviationandcorrelation 19Asitispossibletoseefromthestatisticsforeach(cid:18)inTable10a-10einAppendixB,themainresults do not change signi(cid:133)cantly if instead of having an average of the results across (cid:18), I assume an average value of (cid:18). 14

of the simulated series are calculated after I take averages of 7;000 Monte Carlo runs and 10! simplex points. The statistics for each of this series are in Tables 10a 10c. In (cid:0) tables 10d 10e the same methodology produces the results for the cases M = 0:2;3;0 (cid:0) f g and M = 0:2;1;1 respectively. The results for the simulated series in Tables 2-4 and f g Table 6 are calculated by computing an arithmetic average of the series just described for (cid:18) = 2;0:2;0:02. I do not (cid:133)lter out low frequency variations in the solution paths generated by my model since there are no exogenous shocks at high frequency. I refer to the business cycle facts occurring between 1964.I and 2007.II as BC and to the rational inattention predictions with consumption and labor as RC which stands for Rigidity of Choice. 4.1 Business Cycle Facts through Shannon(cid:146)s lenses. I focus on a set of business cycle facts regarding consumption, labor and their volatility, persistence and comovement. Then I derive the labor wedge20 that the model produces. Finally I calibrate the model to match the (cid:133)rst moment of the labor wedge from the data and see what costs of information processing it implies. 4.1.1 BC Fact 1: The ratio of standard deviations of consumption and hours 1.31 My model predicts excess volatility of consumption with respect to labor supply. This results depends on the bound of information-processing capacity and its interplay with risk aversion and disutility of labor. Consider a consumer with log utility in consumption and constant disutility of labor. Had the household had in(cid:133)nite capacity, he would have chosen to smooth consumption by varying labor supply. With positive and (cid:133)nite information-processing capacity, he does not track wealth perfectly at high frequency. He selects a signal about wealth and changes labor and consumption according to the informativeness of the signal. Willing to smooth consumption, the household rationally chooses to save and work hard while he accumulates information about his (cid:133)nancial possibilities. Once he realizes that he is rich, he increases consumption. However, to maintain a relatively high consumption pro(cid:133)le for prolonged time, he keeps working hard. If he has accumulated too much savings -due to low informativeness of period by period signals- when the variation in consumption occurs is sizeable. Furthermore, such variation would be bigger than the one of labor to avoid taking risks on future wealth. Table2comparesthemodelM ((cid:18);1;0) 1 (M (2;1;0)+M (0:2;1;0)+M (0:02;1;0)) (cid:17) 3 20cfr. Prescott (2004) and Shimer (2005). 15

to the U.S. data. The mean of the wages is constant throughout the simulation at 1:2. US Business Cycle : Quarterly Data (1964.I-2007.II),HP (cid:133)lter,,% Std.Dev Non Dur. Consumption(C) 0.84 Hours (L) 0.64 Simulated Data,average across models Std.Dev Consumption (c) 1.13 Labor (l) 0.98 Table 2: Statisticalproperties ofUS Business Cycle and Model The explanation that my model provides to BC 1 can be summarized by the following result: RC Finding 1. For the model M ((cid:18);1;0), the standard deviation of consumption over labor is 1.15. The volatility is higher the higher the information costs. For a given degree of risk aversion, higher information costs make consumption and labor more sluggish with sudden adjustments following accumulation of wealth. This behavior of consumption and labor generates higher variance. 4.1.2 BC Fact 2: Non durable consumption and hours have 1st order serial correlation higher than 80% In my model, consumption and labor are more persistent the lower the information (cid:135)ow. Inthelattercase, itoccursalsothatcontemporaneousconsumptionandlaborlagwealth. Wealth, Consumption and Labor:q=0.2,g=1,h =0 8 7 Wealth 6 5 Consumption 4 3 Labor 2 1 0 10 20 30 40 50 60 70 80 Figure 1: Sample Path ofMacroeconomic variables,Average across 10,000 monte Carlo,M= 0:2;1;0 : f g 16

The intuition for these (cid:133)ndings lies on the mechanism through which consumers update their knowledge of wealth, expressed in (2). Each period they choose a signal on wealth, decide consumption and labor based on the information from the signal and, given their choices, update their knowledge of wealth. The higher the processing cost, the less informative the signal. This in turn means that most of the update derives from the observations of past values of consumption and labor. As wealth accumulates, the signal consistently reports high values of wealth which trigger a reaction in behavior. This process is re(cid:135)ected in both delayed response to (cid:135)uctuations of wealth and strong autocorrelation between current and past values of consumption and labor. The comparison between models and (cid:133)ndings is in Table 3. US Business Cycle : Quarterly Data (1964.I-2007.II),HP (cid:133)lter,% Std.Dev Autocorr Non Durable Goods(C) 0.84 93 Average Hours (L) 0.54 88 Simulated Data,average across models Std.Dev Autocorr Consumption (c) 1.13 92 Labor (l) 0.98 87 Table 3: Statisticalproperties ofUS Business Cycle and Model The explanation for BC Fact 2 can be summarized as: RC Finding 2. For the model M ((cid:18);1;0), the autocorrelation of consumption and labor are above 80%. Moreover, contemporaneous consumption and labor lag wealth by one period. 4.1.3 BC Fact 3: The contemporaneous correlation of consumption and hours is 78%. US Business Cycle : Cross-Correlation,(1964.I-2007.II),HP,% C( 1) C L( 1) L (cid:0) (cid:0) C 0.93 1 0.77 0.78 L 0.66 0.78 0.88 1 Table 4:BC Fact 3 My model (cid:133)nds a strong comovement of labor and consumption. It also predicts a strong correlation between contemporaneous consumption and wealth when information (cid:135)ow is high. The reason for a strong relationship among contemporaneous values of wealth, labor and consumption in my model is twofold. First, the variables are related through the budget constraint (1) which is used by the consumers to update their prior on wealth. Second, the optimal policy of the consumer is the stationary joint distribution between choices (consumption and labor) and state (wealth). 17

Cross-Correlation w( 1) w (cid:0) c 0.71 0.61 l 0.74 0.69 w 0.91 1 Table 5: Cross correlation ofwealth;average across M (cid:18);(cid:13)=1;(cid:17)=1 f g For a given shadow cost and disutility of labor, the higher the coe¢ cient of risk aversion the higher the correlation between lagged and contemporaneous consumption and wealth. The same is true for labor and wealth. This results arises from the observation that risk averse individuals, say, (cid:13) = 3 pay more attention to wealth than individuals with (cid:13) = 1 for a given cost of processing. Controlling for the cost of information and the degree of risk aversion, people with increasing disutility of labor, (cid:17) = 1, pay more attention to wealth than people with (cid:17) = 0. However, this higher information is used to both consumption smoothing and increasing leisure. Even though the substitution e⁄ect prevails over the income e⁄ect also for (cid:17) = 1, the resulting impact on the correlation between wealth and consumption is lower than for the case (cid:17) = 0. Consumption and wealth for different utilities,q=0.2 7 Wealth, g=3,h =0 6 5 4 3 Consumption, g=1,h =1 2 1 0 10 20 30 40 50 60 70 80 Figure 2: Consumption and wealth,(cid:18)=0:2.Solid line:(cid:13)=3;(cid:17)=0;Dotted Line: (cid:13)=1;(cid:17)=1. The reason why lagged value of wealth are highly correlated with contemporaneous consumptioncomesfromtheinteractionbetweenthecurvatureoftheutilityfunctionand the information costs. High coe¢ cient of risk aversion together with high information cost trigger a conservative consumption pro(cid:133)le and a consistent increase in consumption when the signal conveys information about high value of wealth (cfr. RC Finding 1). With low elasticity of labor supply, the strong and positive comovement of consumption 18

and labor (RC Finding 2) makes labor react in a way similar to consumption. When the elasticity of labor supply increases, people try to balance consumption smoothing and increase in leisure with the information available on wealth. The result is a weaker positive correlation of behavior and current and lagged values of wealth . Simulated Data: Cross-Correlation c( 1) c l( 1) l (cid:0) (cid:0) c 0.90 1 0.83 0.90 l 0.71 0.90 0.84 1 Table 6: Cross Correlation ofUS business cycle and averages across models The explanation of the third business cycle fact from the model is: RC Finding 3. For the model M ((cid:18);1;0) the contemporaneous cross correlation between consumption and labor is 0.90%. 4.1.4 BC Fact 4: U.S. labor wedge has mean 0.4 and standard deviation =0.11 (cid:24) US Business Cycle : Laborwedge 1994-2006,% mean std. dev. (cid:9) 0.40 0.11 source: Shimer (2008) Table7: Shimer(cid:146)s BC Fact 4 In absence of information-processing frictions ((cid:18) = 0), and assuming an household with utility u(c;l) = log(c) (cid:11)~ l the labor wedge for the model is21 (cid:0) (cid:3) (cid:9) s (cid:11)~ c (12) (cid:17) (cid:0) (cid:3) where the wage, s, represents the marginal productivity of labor (MPL) and U =U = l c (cid:11)~ c is the marginal rate of substitution (MRS). Prescott (2004) and Shimer (2008) (cid:0) (cid:3) estimates the average wedge (cid:9) to be 0:4 in U.S. When information (cid:135)ows at (cid:133)nite rate ((cid:18) > 0), consumers never observe wages directly but only through the impact wages have on signals about wealth. Thus, the MRS for information-constrained consumers has to be equal to their perception of the wages given the signal, p ( ; ). De(cid:133)ne such perception of wages s~( (p( ; ))). Then, the wedge (cid:3) w a w a (cid:1) (cid:1) I (cid:1) (cid:1) in my model is de(cid:133)ned as s~( (p( ; ))) s or: w a r (cid:17) j I (cid:1) (cid:1) (cid:0) j = s (cid:11)~ c( (p ( ; ))) (13) (cid:3) w a r (cid:0) (cid:3) I (cid:1) (cid:1) MPL MRS which increases with (cid:18): |{z} | {z } 21cfr. e.g., Rotemberg and Woodford (1991, 1999), Hall(1997), Shimer (2008). 19

To get a sense of the magnitude involved, to obtain an average wedge 0:4, I set r ’ (cid:11)~ = 0:24 and (cid:18) = 2. This speci(cid:133)cation implies the statistics in Table 8. (cid:18) = 2; (cid:11)~ = 0:24 Mean St.Dev (cid:26)(x ;x ) t t 1 (cid:0) Consumption (c) 3.20 1.08 0.93 Labor (l) 2.28 0.98 0.89 Wealth (w) 6.04 1.19 0.74 Information Flow ((cid:20)) 0.93 0.71 Table 8: Sample modelstatistics. Average 7,000 Monte Carlo In particular, the average consumption is c( (p ( ; ))) = 3:20 and the consumption (cid:3) w a I (cid:1) (cid:1) loss from not being well informed is about 16% of households(cid:146)average lifetime consumption. 22 Even though the model cannot address jointly the supply and demand sides of the labormarket23 andit is limitedtotheintensive margin, I thinkit o⁄ers apromisingangle to explain the discrepancy between the marginal rate of substitution and the marginal product of labor. Several studies have been trying to match theory and data to explain this wedge 24: cyclical (cid:135)uctuation of hours driven by variation in taxes25, time varying disutility of labor and worker(cid:146)s market power26 and search and matching models27. Ratherthanexaminingtheoriginsandbargainingofwagecontracts, thesuggestionof my model is to look more closely at consumers(cid:146)behavior. In particular, my results show that adding a bound on information processing generates endogenously a wedge between the wage that prevails on the market and the perception of the workers of such wages, s~( (p( ; ))) s . In my model this labor wedge depends on the informationw a r (cid:17) j I (cid:1) (cid:1) (cid:0) j r processing capacity of the consumers. Furthermore, the variance of depends on their r choice of the informativeness of the signal. To see why, suppose that a person knows that he is rich in a given period of his life. Since information-processing is costly, he may decide to care a little about his current wealth and, as a result, MRS > s. By contrast, a person who doesn(cid:146)t have a lot of wealth wants to pay more attention to his (cid:133)nancial activities which leads to a better monitoring of the wealth, or MRS < s. Note also that the choice of processing di⁄erent (cid:135)ows of information according to consumers(cid:146)needs and preferences make the variance of s~( (p( ; ))) positive. w a I (cid:1) (cid:1) 22To see why, I compute the loss as the information rent, E(c c( (p ( ; )))), where E(c ) is (cid:3) (cid:3) w a (cid:3) (cid:0) I (cid:1) (cid:1) theaverageconsumptionpath when (cid:18) =0whichisequalto3:78andtheexpectationE(:)istakenwith respect to time. This means that the cost of being uninformed is about 16% of household(cid:146)s average lifetime consumption. 23cfr. Gertler and Trigari (2006). 24cfr. Shimer (2008) and Pissarides (2007). 25cfr. Uhlig (2003) ,Chen, Imrohorouglu and Imrohorouglu (2007). 26cfr. Smets and Wouters (2007), Gali, Gertler and Lopez-Salido (2007). 27cfr. Pissarides (1994), Shimer (2005), Hall(2005) 20

US Business Cycle : laborwedge 1994-2006,% mean std. dev. (cid:9) 0.40 0.11 Simulated data: laborwedge mean std. dev. 0.40 0.41 r Table 9: Laborwedge in data and model This last result are summarized by the last (cid:133)ndings. RC Finding 4. The labor wedge has mean 0.40 and standard deviation 0.41. The statistical properties of the wedge derive from those of the endogenously chosen distribution p ( ; ). (cid:3) w a (cid:1) (cid:1) 5 A closer look at Model Statistics The results of the previous section are robust to di⁄erent characterizations of M ((cid:18)). Once the value function for each model converged, I simulate the path of consumption, labor, wealth and information (cid:135)ow for T = 75 averaging the results along the dimension of the simplex as well as 7;000 di⁄erent Monte Carlo runs. To get a sense on how the shadow cost of information a⁄ects consumption and labor behavior, consider M ((cid:18)) (cid:18);((cid:13) 1;(cid:17) = 3) where (cid:18) = 2;0:2; 0:02: Table 10a 10e (cid:17) f ! g (cid:0) shows the statistics for these and (cid:13) = 3; (cid:17) = 1 cases. All tables are collected in Appendix B. The (cid:133)rst observation is that as the information costs increases, average consumption, labor and information (cid:135)ow decrease, while the standard deviations of these series increases. This is also true for wealth. These results are intuitive. Under full information, thecharacteristicsoftheutilityfunction((cid:13) 1)commandaconsumptionpro(cid:133)lesmooth ! throughout the life-cycle. Moreover, with constant disutility of labor, ((cid:17) = 0) labor supply adjusts according to wealth (cid:135)uctuations to accommodate consumption smoothing. When information (cid:135)ows at (cid:133)nite rate, rational households choose signal about wealth with the same purpose. If processing-information has low cost, (cid:18) = 0:02, consumers can choose a signal about wealth so informative to allow them to use labor supply to smooth (cid:135)uctuations in wealth and, in turn, consumption. On the other hand, if information is costly, (cid:18) = 2, consumers keep track of wealth slowly and, as a consequence, do not modify consumption and labor often. When they do change their behavior, they do so by a signi(cid:133)cant amount. The resulting path for wealth inherits the higher variance of consumption and labor and, on average, has higher mean than in the previous case due to the increase in savings in periods of inertial behavior. A sample path of consumption under di⁄erent (cid:18)-scenarios is in Figure 3. 21

Comparing consumption foru(c;l)=log(c) (cid:11)l (cid:0) Consumption Profile withg=1,h =0, differentq's 5 q=0.02 4.5 4 q=0.2 3.5 no itpm u sno c 3 q=2 2.5 2 1.5 0 10 20 30 40 50 60 70 time Figure 3: Sample path ofconsumption fordi⁄erent (cid:18)(cid:146)s. Figure 4 con(cid:133)rms the intuition that consumption is smoother the lower information costs. Consumers with (cid:18) = 0:02 save at the beginning of the simulated period to enjoy high level of consumption later on. By contrast, consumers with (cid:18) = 2 track with di¢ culties their wealth and this is re(cid:135)ected in a prolonged period of savings while processing information about wealth. This results into slow and consistent adjustments of consumption during the simulated period. One point worth attention is the existence of precautionary savings generated by information (cid:135)ow constraints. Individuals with less processing capacity ((cid:18) = 2) push forwards an increase in consumption more than the other people ((cid:18) = 0:2 and (cid:18) = 0:02). Types (cid:18) = 2 acknowledge the accumulation of wealth due to the additional savings later in the simulation. This forces them to increase their consumption for a short period of time at the end of the simulation period. Given the strong correlation between consumption and labor and the preferences of the individuals, it is not surprising that people with (cid:18) = 0:02 work harder at the beginning of the simulation to (cid:133)nance their good purchases, though they manage to 22

enjoy some leisure at the end of the simulation (see Figure 5). Comparing consumption and laborforu(c;l)=log(c) (cid:11)l (cid:0) q =2 q =0.2 6 6 n n o itpm4 4 rob o itpm 4 4 rob u sn o C fo se ire s em 2 2 3 aL fo se ire s em iT u sn o C fo se ire s em 2 2 3 aL fo se ire s em iT iT iT 0 1 0 1 00 2200 4400 6600 8800 00 2200 4400 6600 8800 time time q =0.02 5 n5 o itpm 4 rob u sn o C fo se ire s em 2 3 4 2 3 aL fo se ire s em iT iT 1 1 00 2200 4400 6600 8800 time Figure 4: Time path consumption and labor,various (cid:18) Correlationbetweenconsumptionandlaborishigherthehighertheinformationcosts. The intuition for this result is that the reaction of both consumption and labor behaviors to accumulation of wealth are delayed by individual(cid:146)s capacity of processing information. As they have better knowledge of how much wealth they have, they review both plans. People actions are mirrored in wealth accumulation. Individuals with (cid:18) = 0:02 build up savings at the beginning of the period to dissave gradually later on. This is akin to 23

consumption smoothing under full information. Comparing consumption(blu) and wealth (red) foru(c;l)=log(c) (cid:11)l (cid:0) q =2 q =0.2 6 10 5 n 8 n 8 o itp m u 4 h tla e W o itp m u 4 h tla e W sn o 6 fo sn o 6 fo C fo se ire s e 2 4 se ire s e m iT C fo se ire s e 2 3 4 se ire s e m iT m m iT iT 0 2 1 2 00 1100 2200 3300 4400 5500 6600 7700 00 1100 2200 3300 4400 5500 6600 7700 time time q =0.02 10 5 n o itp m4 8 h tla e u W sn o 6 fo C fo se ire s e 2 3 4 se ire s e m iT m iT 1 2 00 1100 2200 3300 4400 5500 6600 7700 time Figure 5: Time path wealth and consumption,log-lin utility,various (cid:18) People who are more constrained in their choice of the signal, adjust with delays consumptionto(cid:135)uctuationsinwealth. Suchdelayssmoothconsumptionwhileconsumers are processing information but at the same time, calls for major adjustments afterwards. Note also how consumption and labor lag wealth for (cid:18) = 2: The cross-correlation coe¢ cients between lagged wealth and current consumption is 0:65 while the contemporaneous correlation is only 0:47. A similar result holds for labor and lagged values of wealth. This (cid:133)nding is also consistent with intuition. Every period households receive little information about their wealth and rely on past values of consumption and labor to update their knowledge. While waiting, wealth accumulates and so does information until the consumers are convinced to change their behavior. This mechanism implies that behavioral response to movement in wealth is lagged. Finally, Figures 3-5 illustrate also the high persistence of the series documented in table 12a 12d. Not surprisingly (cid:0) the persistence is higher the higher the information cost . For completeness, it is worth mentioning how the model behaves with higher coe¢ cient of risk aversion and positive and (cid:133)nite elasticity of substitution. Figure 6 displays the stationary marginal distribution of consumption and wealth when (cid:13) = 1 and (cid:17) = 0. These marginal distributions are computed from the joint distribution to which the value iteration converges as average over initial beliefs about wealth. 24

For low values of (cid:18), households choose to be better informed about extreme values of consumption so that they can enjoy a balanced consumption throughout their lifecycle. This is, in essence, consumption smoothing. Comparing marginaldistributions ofconsumption and laborwhen u(c;l)=log(c) (cid:11)l (cid:0) Marginal distribution of consumption, Different Information Costs 0.35 Comparing marginal distribution of labor 0.25 q=0.02 q=0.2 q=0.02 0.3 q=2 q=0.2 q=2 0.2 0.25 0.2 0.15 0.15 0.1 0.1 0.05 0.05 0 0 0.53 1.07 1.6 2.13 2.67 3.2 3.73 4.27 4.8 5.33 1 1.56 2.11 2.67 3.22 3.78 4.33 4.89 5.44 6 Figure 6: MarginalDistribution ofconsumption and labor various (cid:18) Theoptimalmarginalprobabilityoflaborassignshighprobabilityonlowlaborsupply when (cid:18) is low: as the capacity increases, consumers wants to smooth consumption and sacri(cid:133)ce their leisure to do so (recall, labor e⁄ort has constant disutility in this case, (cid:17) = 0). If information is costly to process, consumers will keep cannot process enough information, they will keep constant their consumption and labor e⁄ort. This in turn implies working less When the signal about wealth is less informative on a period-by-period basis (high (cid:18)), income e⁄ect takes over the substitution e⁄ect: consumers are less certain that their wealth is high enough to enjoy leisure and as a result increase the keep constant their consumption and labor e⁄ort. The marginal probability distribution of labor re(cid:135)ects this pattern by allocating higher probabilities on lower values of labor (and consumption) the higher (cid:18) is. Tables 10d 10e show the relationship between risk aversion, (cid:13); Frisch elasticity of (cid:0) substitution, 1=(cid:17), and shadow cost of information, (cid:18). For a given (cid:18) and elasticity of substitution, thehigherthecoe¢ cientofriskaversion, thehigherthemeanandthelowerthe variance of consumption. This (cid:133)nding makes intuitive sense since a risk averse household would save a lot during the early stages of life to enjoy high consumption throughout later on due to the accumulated savings. Savings come from both low consumption and high labor supply at the beginning of the simulation triggered by the fear of running out of wealth. Once consumers realize they have built a su¢ cient bu⁄er to cover for consumption and leisure expenses, they increase consumption and reduce -though by a lower extent- labor supply. The peak in consumption for these types of households occur later in the simulation while labor supply is higher at the beginning than it is later on. Thus, information costs enhance precautionary savings. 25

Finally, for a given (cid:18) and (cid:13), a lower the Frisch elasticity of substitution (from (cid:17) = 0 to (cid:17) = 1) generates lower mean and lower variances for both consumption and labor. Keeping the degree of risk aversion (cid:133)xed, a low elasticity of substitution for labor supply increase the income e⁄ect over the substitution e⁄ect. However, the presence of information processing constraint still favors substitution e⁄ect mitigating the income e⁄ect. If the signal on wealth is very noisy, consumers supply more labor than they would do in case of perfect information and (cid:17) = 1, since they are not certain that their wealth is actually decreasing. When the information collected signals that the wealth has been increased, labor supply suddenly decreased. The opposite occurs when consumers receive more and more information about a decrease in wealth. 6 Conclusions I presented a model in which rational households optimize their lifetime utility under information-processing constraint (cid:224) la Sims. I show that such a model, even in its simplicity, is able to replicate many empirical regularities of U.S. business cycle data: higher volatility of consumption with respect to labor, persistence and strong comovement of consumption and employment, lagged response of consumption and labor with respect to wealth and an endogenous labor wedge. In particular, the last prediction has been a challenge for macroeconomists. Existing macroeconomic theories have been explaining the labor wedge using exogenous shocks and ad-hoc market frictions, without microfounded justi(cid:133)cations and, more importantly, without a correspondence in the data. In my setting the di⁄erence between marginal rate of substitution and marginal product of labor, i.e., wage, occurs as a result of consumers(cid:146)endogenous choice of information. When households face information-processing constraints, they select signals on their wealth and make consumption and labor decisions based on those signals. Each period, wealth evolves because of savings which depends on consumption and income. The latter in turn depends on labor supply and an exogenous stochastic wage whose distribution is (cid:133)xed and known. Since movements in income a⁄ect the growth of wealth and consumers keep track of it by signals, the less informative the signals, the more persistent their choices. Once wealth accumulates and households acknowledge this growth through information collected, they change their behavior consistently. This mechanisms results in a discrepancy between the marginal rate of substitution of consumption and labor and wages. Furthermore, the magnitude and the variance of the endogenous wedge are in line with the data. The same mechanism is able to capture persistence and lags of the main macroeconomic variables over U.S. business cycle. The (cid:133)ndings of this paper suggest that making a leap to a rational inattention setting is worth the computational e⁄ort. For it gets us closer to understand and interpret empirical regularities in U.S. data than the current theoretical macro literature. Acknowledgement 1 I am deeply indebted to Chris Sims whose countless suggestions, shaping in(cid:135)uence and guidance were essential to improve the quality of this paper. I thank Ricardo Reis for valuable advice, enduring and enthusiastic support. I am grateful to Per 26

Krusell for his insightful advice and stimulating discussions. I would also like to thank Nobu Kiyotaki, Philippe-Emanuel Petalas, Charles Roddie and Sam Schulhofer-Wohl. Any remaning errors are my own. 27

References [1] Aguiar, Mark, and Erik Hurst, 2005. (cid:147)Consumption versus Expenditure.(cid:148)Journal of Political Economy. 113 (5): 919(cid:150)948. [2] Barro, RobertJ., 1977.(cid:147)Long-termcontracting, stickyprices, andmonetarypolicy.(cid:148) Journal of Monetary Economics. 3 (3): 305(cid:150)316. [3] Becker,GaryS.,1965.(cid:147)ATheoryoftheAllocationofTime.(cid:148)TheEconomicJournal. 75 (299): 493(cid:150)517. [4] Blanchard, Olivier J., and Jordi Gal‹(cid:146), 2006. (cid:147)A New Keynesian Model with Unemployment.(cid:148)MIT Mimeo. [5] Chari, V.V., Patrick J. Kehoe, and Ellen R. McGrattan, 2007. (cid:147)Business Cycle Accounting.(cid:148)Econometrica. 75 (3): 781(cid:150)836. [6] Chen, Kaiji, Ayse Imrohoroglu, and Selahattin Imrohoroglu, 2007. (cid:147)A Quantitative Assessment of the Decline in the U.S. Savings Rate.(cid:148)University of Southern California Mimeo. [7] Erceg, Christopher J., Dale W. Henderson, and Andrew T. Levin, 2000. (cid:147)Optimal Monetary Policy with Staggered Wage and Price Contracts.(cid:148)Journal of Monetary Economics. 46 (2): 281(cid:150)313. [8] Gali(cid:146), Jordi, Mark Gertler, and J. David Lopez-Salido, 2007. (cid:147)Markups, Gaps, and theWelfareCostsofBusinessFluctuations.(cid:148)TheReviewofEconomicsandStatistics. 89 (1): 44(cid:150)59. [9] Gal‹(cid:146), Jordi, and Paul Rabanal, 2004. (cid:147)Technology Shocks and Aggregate Fluctuations: How Well Does the RBC Model Fit Postwar US Data?.(cid:148)NBER Macroeconomics Annual. 19: 225(cid:150)318. [10] Gertler, Mark, and Antonella Trigari, 2006. (cid:147)Unemployment Fluctuations With Staggered Nash Wage Bargaining.(cid:148)NBER Working Paper 12498. [11] Guo, D., S. Shamai , and S. Verdœ, 2005 (cid:147)Mutual information and minimum meansquare error in Gaussian channels,(cid:148)IEEE Trans. Inf. Theory, vol. 51, no. 4, pp. 1261(cid:150)1283, Apr. 2005 [12] Hall, Robert E., 1997. (cid:147)Macroeconomic Fluctuations and the Allocation of Time.(cid:148) Journal of Labor Economics. 15 (1): 223(cid:150)250. [13] Hall, Robert E., 2005. (cid:147)Employment Fluctuations with Equilibrium Wage Stickiness.(cid:148)American Economic Review. 95 (1): 50(cid:150)65. [14] Hansen, Gary D., 1985. (cid:147)Indivisible Labor and the Business Cycle.(cid:148)Journal of Monetary Economics. 16 (3): 309(cid:150)327. 28

[15] Imai, Susumu, and Michael P. Keane, 2004. (cid:147)Intertemporal Labor Supply and Human Capital Accumulation.(cid:148)International Economic Review. 45 (2): 601(cid:150)641. [16] Kaldor, Nicholas, 1957. (cid:147)A Model of Economic Growth.(cid:148)The Economic Journal. 67 (268): 591(cid:150)624. [17] Kalai, Gil. A subexponential randomized simplex algorithm (extended abstract).In Proceedings of the twenty-fourth annual ACM symposium on Theory of computing, pages 475(cid:150)482. ACM Press, 1992. [18] Kalai, Gil. Linear programming, the simplex algorithm and simple polytopes. Math. Programming, 79(1-3,Ser. B):217(cid:150)233, 1997. [19] Krusell, Per, and Anthony A. Smith, Jr., 1998. (cid:147)Income and Wealth Heterogeneity in the Macroeconomy.(cid:148)Journal of Political Economy. 106 (5): 867. [20] Lovejoy, W. Computationally feasible bound for partially observable Markov Decision problem, Operations Research, 39:162:175, [21] Lucas, Robert E. Jr., and Leonard A. Rapping, 1969. (cid:147)Real Wages, Employment, and In(cid:135)ation.(cid:148)The Journal of Political Economy. 77 (5): 721(cid:150)754. [22] Mackowiak, B., and M. Wiederholt, 2008a, "Optimal Sticky Prices under Rational Inattention", American Economic Review, forthcoming [23] Mackowiak, B., and M. Wiederholt, 2008b, "Business Cycle Dynamics under Rational Inattention", Mimeo. [24] MaCurdy, Thomas E., 1981. (cid:147)An Empirical Model of Labor Supply in a Life-Cycle Setting.(cid:148)Journal of Political Economy. 89 (6): 1059(cid:150)1085. [25] Mankiw, N. Gregory, 1989. (cid:147)Real Business Cycles: A New Keynesian Perspective.(cid:148) Journal of Economic Perspectives. 3 (3): 79(cid:150)90. [26] McGrattan, Ellen R., and Edward C. Prescott, 2007. (cid:147)Technical Appendix: Unmeasured Investment and the Puzzling U.S. Boom in the 1990s.(cid:148)Federal Reserve Bank ofMinneapolis Sta⁄Report 395. [27] Mertens, Karel, and Morten O. Ravn, 2008. (cid:147)The Aggregate E⁄ects of Anticipated and Unanticipated US Tax Policy Shocks: Theory and Empirical Evidence.(cid:148)Cornell University Mimeo. [28] Modigliani, Franco, 1977. (cid:147)The Monetarist Controversy or, Should We Forsake Stabilization Policies?.(cid:148)American Economic Review. 67 (2): 1(cid:150)19. [29] Mortensen, Dale T., and Christopher A. Pissarides, 1994. (cid:147)Job Creation and Job Destruction in the Theory of Unemployment.(cid:148)The Review of Economic Studies. 61 (3): 397(cid:150)415. 29

[30] Mulligan,CaseyB.,2002.(cid:147)ACenturyofLabor-LeisureDistortions.(cid:148)NBERWorking Paper 8774. [31] Parkin, Michael, 1988.(cid:147)AMethodforDeterminingWhetherParametersinAggregativeModelsAreStructural.(cid:148)Carnegie-RochesterConferenceSeriesonPublicPolicy. 29: 215(cid:150)252. [32] Pissarides, Christopher A., 1985. (cid:147)Short-Run Equilibrium Dynamics of Unemployment, Vacancies, and Real Wages.(cid:148)The American Economic Review. 75 (4): 676(cid:150) 690. [33] Pissarides, Christopher A., 2007. (cid:147)The Unemployment Volatility Puzzle: Is Wage Stickiness the Answer?.(cid:148)LSE Mimeo. [34] Prescott, Edward C., 2004. (cid:147)Why Do Americans Work So Much More Than Europeans?.(cid:148)Federal Reserve Bank of Minneapolis Quarterly Review. 28 (1): 2(cid:150)13. [35] Ragan,KellyS.,2006.(cid:147)Taxes,Transfers,andTimeUse: FiscalPolicyinaHousehold Production Model.(cid:148)Ph.D. thesis, University of Chicago. [36] Rogerson, Richard, 1988. (cid:147)Indivisible Labor, Lotteries and Eequilibrium.(cid:148)Journal of Monetary Economics. 21 (1): 3(cid:150)16. [37] Rogerson, Richard, 2007. (cid:147)Taxation and Market Work: Is Scandinavia an Outlier?.(cid:148) Economic Theory. 32 (1): 59(cid:150)85. [38] Rogerson, Richard, and Johanna Wallenius, 2007. (cid:147)Micro and Macro Elasticities in a Life Cycle Model with Taxes.(cid:148)NBER Working Paper 13017. [39] Romer, Christina D., and David H. Romer, 2007. (cid:147)A Narrative Analysis of Postwar Tax Changes.(cid:148)University of California, Berkeley Mimeo. [40] Rotemberg, Julio J., and Michael Woodford, 1991. (cid:147)Markups and the Business Cycle.(cid:148)NBER Macroeconomics Annual. pp. 63(cid:150)129. [41] Rotemberg, Julio J., and Michael Woodford, 1997. (cid:147)An Optimization-Based Econometric Framework for the Evaluation of Monetary Policy.(cid:148)NBER Macroeconomics Annual. 12: 297(cid:150)346. [42] Sims, Christopher A., 1998, "Stickiness, Carnegie-rochester Conference Series On Public Policy, 49(1), 317(cid:150)356. [43] Sims, Christopher A., 2003, "Implications of Rational Inattention", Journal of Monetary Economics, 50(3), 665(cid:150)690. [44] Sims, Christopher A., 2006 "Rational Inattention: A research Agenda", American Economic Review Paper and Proceedings. [45] Tutino, Antonella, 2008 "The Rigidity of Choices, Lifetime savings under information processing constraints". Princeton Universtiy, Mimeo. 30

[46] Uhlig, Harald, 2003. (cid:147)How Well do we Understand Business Cycles and Growth? Examining the Data with a Real Business Cycle Model.(cid:148)inWolfgang Franz, Hans J¤urgen. Ramser, and Manfred Stadler (ed.), Empirische Wirtschaftsforschung: Methoden und Anwendungen, vol. 32 of Wirtschaftswissenschaftliches Seminar Ottobeuren, pp. 295(cid:150)319. [47] S. Verdœ, 1996 "The Exponential Distribution in Information Theory," Invited Paper, Problemy Peredachi Informatsii, vol. 32, no.1, pp.100-111, Jan.-Mar. 1996 (In Russian). English version in Problems of Information Transmission, vol. 32, no. 1, pp. 86-95, Jan.-Mar. 1996. [48] S. Verdœ, 1999 , Information Theory: Fifty Years of Discovery, IEEE Press, Piscataway, NJ, S. Verdu and S. McLaughlin, Eds. 31

7 Appendix A: Bellman Recursion and its properties Thisappendixfollowcloselytheworkof Tutino(2008). Itestablishesthemainproperties of the Bellman recursion in the discrete Rational Inattention consumption-labor model. 7.1 The Bellman Recursion is a Contraction Mapping. Proposition 1. For the discrete Rational Inattention consumption labor value recursion H and two given functions V and U, it holds that HV HU (cid:12) V U ; jj (cid:0) jj (cid:20) jj (cid:0) jj with 0 (cid:12) < 1 and : the supreme norm. That is, the value recursion H is a (cid:20) jj jj contraction mapping. Proof. The H mapping displays: HV (g) = maxHpV (g); p with HpV (g) = u(c;l)p(a w) g(w) (cid:18)(cid:20)+(cid:12) (V (g ( )))p(a w)g(w) : j (cid:0) a0 (cid:1) j " ! # w W a A w W a A X2 X2 X2 X2 Suppose that HV HU is the maximumat point g. Let p denote the optimal control 1 jj (cid:0) jj for HV under g and p the optimal one for HU 2 HV (g) = Hp1V (g); HU (g) = Hp2U (g): Then it holds HV (g) HU (g) = Hp1V (g) Hp2U (g): jj (cid:0) jj (cid:0) Suppose WLOG that HV (g) HU (g): Since p maximizes HV at g , I get 1 (cid:20) Hp2V (g) Hp1V (g): (cid:20) Hence, HV HU = jj (cid:0) jj HV (g) HU (g) = jj (cid:0) jj Hp1V (g) Hp2U (g) (cid:0) (cid:20) Hp2V (g) Hp2U (g) = (cid:0) (cid:12) [(Vp2(g ( ))) (Up2(g ( )))]p g(w) a0 (cid:1) (cid:0) a0 (cid:1) 2 (cid:20) w Wa A X2 X2 (cid:12) ( V U )p g(w) 2 jj (cid:0) jj (cid:20) w Wa A X2 X2 (cid:12) V U : jj (cid:0) jj Recalling that 0 (cid:12) < 1 completes the proof. (cid:20) 32

7.2 The Bellman Recursion is an Isotonic Mapping Corollary For the discrete Rational Inattention consumption-laving value recursion H and two given functions V and U, it holds that V U = HV HU (cid:20) ) (cid:20) that is the value recursion H is an isotonic mapping. Proof. Let p denote the optimal control for HV under g and p the optimal one for 1 2 HU HV (g) = Hp1V (g); HU (g) = Hp2U (g): By de(cid:133)nition, Hp1U (g) Hp2U (g): (cid:20) Fromagiveng, itispossibletocompute g ( ) foranarbitraryc andthenthefollowing a0 (cid:1) jp1 will hold V U = (cid:20) ) g(w);c; 8 V g ( ) U g ( ) = c0 (cid:1) jp1 (cid:20) c0 (cid:1) jp1 ) (cid:16) (cid:17) (cid:16) (cid:17) V g ( ) p g U g ( ) p g = a0 (cid:1) jp1 (cid:1) 1 (cid:20) a0 (cid:1) jp1 (cid:1) 1 ) X a 2 A (cid:16) (cid:17) X a 2 A (cid:16) (cid:17) u(c;l)p g(w)+(cid:12) V g ( ) p g 1 a0 (cid:1) jp1 (cid:1) 1 ! w X2 W X a 2 A X a 2 A (cid:16) (cid:17) u(c;l)p = 1 (cid:20) ) ! w W a A X2 X2 Hp1V (g) Hp1U (g) = (cid:20) ) Hp1V (g) Hp2U (g) = (cid:20) ) HV (g) HU (g) = (cid:20) ) HV HU: (cid:20) Note that g was chosen arbitrarily and, from it, g ( ) completes the argument that a0 (cid:1) jp1 the value function is isotone. 33

7.3 The Optimal Value Function is Piecewise Linear Proposition 2. If the utility is CRRA or LOG with a parameter of risk aversion (cid:13) 2 (0;+ ) and inverse of Frisch elasticity of labor supply (cid:17) [0;+ ) and if Pr 1 2 1 (a ;w ) satis(cid:133)es (5)-(8), then the optimal n step value function V (g) can be j i n (cid:0) expressed as: V (g) = max (cid:11) (w )g(w ) n n i i (cid:11)i f ngi i X where the (cid:11) vectors, (cid:11) : W R, are W dimensional hyperplanes. (cid:0) ! j j(cid:0) Proof. The proof is done via induction. I assume that all the operations are wellde(cid:133)ned in their corresponding spaces. Let (cid:0) be the set that contains constraints (5)-(8) .For planning horizon n = 0, I have only to take into account the immediate expected rewards and thus I have that: V (g) = max u(c;l)p g(w) (14) 0 p (cid:0) " ! # 2 w W a A X2 X2 and therefore if I de(cid:133)ne the vectors (cid:11)i (w) u(c;l)p (15) 0 i (cid:17) ! (cid:8) (cid:9) X a 2 A p 2 (cid:0) I have the desired V (g) = max (cid:11)i;g (16) 0 0 (cid:11)i(w) f 0 gi (cid:10) (cid:11) where :;: denotes the inner product (cid:11)i;g (cid:11)i (w);g(w). For the general case, h i h 0 i (cid:17) 0 w W using equation (??): X2 u(c;l)p(c;l w) g(w)+ j V n (g) = m p a (cid:0) x2 w X2 W X a 2 A ! 3 (17) 2 6 6 +(cid:12) (V n (cid:0) 1 (g a0 ( (cid:1) ) a ))p(c;l j w)g(w) 7 7 6 w X2 W X a 2 A 7 4 5 by the induction hypothesis V (g( ) ) = max (cid:11)i ;g ( ) (18) n (cid:0) 1 (cid:1) ja (cid:11)i n (cid:0) 1 a0 (cid:1) f n (cid:0) 1gi (cid:10) (cid:11) Plugging into the above equation (??) and by de(cid:133)nition of :;: , h i Pr(w;c;l) V (g ( )) = max (cid:11)i (w ) T ( ;w;c;l) (19) n (cid:0) 1 a0 (cid:1) (cid:11)i n (cid:0) 1 0 (cid:1) Pr(c;l) ! f n (cid:0) 1gi w X02 W w X2 W X a 2 A 34

With the above: u(c;l)p g(w)+ 2 ! 3 V n (g) = max w X2 W X a 2 A p (cid:0) 2 6 +(cid:12)max (cid:11)i (w ) T(;w;c;l) p g(w) 7 6 6 6 f (cid:11)i n (cid:0) 1gi w X02 W n (cid:0) 1 0 w X2 W X a 2 A P (cid:1) r(c;l) (cid:1) ! ! 7 7 7 4 5 1 = max u(c;l) p; g(w) +(cid:12) max (cid:11)i (w )T ( ;w;c;l) p; g p (cid:0) " h (cid:1) i Pr(c;l) (cid:11)i * n (cid:0) 1 0 (cid:1) (cid:1) +# 2 X a 2 A f n (cid:0) 1gi w X02 W (20) At this point, it is possible to de(cid:133)ne (cid:11)j (w) = (cid:11)i (w )T ( : w;c;l) p: (21) p;a n 1 0 (cid:1) (cid:1) (cid:0) w W X02 Note that these hyperplanes are independent on the prior g for which I am computing V : Thus, the value function amounts to n 1 V (g) = max u(c;l) p; g +(cid:12) max (cid:11)j ;g ; (22) n p 2 (cid:0) " h (cid:1) i X a 2 A Pr(c;l) f (cid:11)j p;a gj (cid:10) p;a (cid:11) # and de(cid:133)ne: (cid:11) = arg max (cid:11)j ;g : (23) p;a;g p;a (cid:11)j p;a f gj (cid:10) (cid:11) Note that (cid:11) is a subset of (cid:11)j and using this subset results into p;a;g p;a 1 V (g) = max u(c;l) p; g +(cid:12) (cid:11) ;g n p;a;g p (cid:0) " h (cid:1) i Pr(c;l) h i # 2 a A X2 1 = max u(c;l) +(cid:12) (cid:11) ; g : (24) p;a;g p (cid:0) * (cid:1) Pr(c;l) + 2 a A X2 Now 1 (cid:11)i = u(c;l) p+(cid:12) (cid:11) (25) n i (cid:1) Pr(c;l) p;a;g ( ) (cid:8) (cid:9) [8 g X a 2 A p 2 (cid:0) is a (cid:133)nite set of linear function parametrized in the action set. 7.4 .. and Convex (PCWL) Proposition 3. Assuming the CRRA or LOG utility function and the conditions of Proposition 1, let V be an initial value function that is piecewise linear and convex. 0 Then the ith value function obtained after a (cid:133)nite number of update steps for a rational inattention consumption-saving problem is also (cid:133)nite, piecewise linear and convex (PCWL). 35

Proof. The (cid:133)rst task is to prove that (cid:11)i sets are discrete for all n. The proof f ngi proceeds via induction. Assuming CRRA/LOG utility and since the optimal policy belongs to (cid:0), it is straightforward to see that through (15), the set of vectors (cid:11)i , f 0gi c1 (cid:13) l1+(cid:17) (cid:11)i (cid:0) (cid:11) p(c;l w) g(w) 0 i (cid:17) 1 (cid:13) (cid:0) 1+(cid:17) j ! ! (cid:8) (cid:9) w X2 W X a 2 A(cid:18) (cid:0) (cid:19) p 2 (cid:0) is discrete. For the general case, observe that for discrete controls and assuming M = (cid:11)j , the sets (cid:11)j are discrete, for a given action p and consumption c, I can only n 1 p;c gener(cid:0)ate (cid:11)j vectors. Now, (cid:133)xing p it is possible to select one of the M (cid:11)j vectors for (cid:12)(cid:8) (cid:9)(cid:12) p;c(cid:0) (cid:8) (cid:9) p;c(cid:0) e(cid:12)ach one(cid:12)of the observed consumption c and, thus, (cid:11)j is a discrete set. The previous f ngi proposition, shows the value function to be convex. The piecewise-linear component of the properties comes from the fact that (cid:11)j set is of (cid:133)nite cardinality. It follows that f ngi V is de(cid:133)ned as a (cid:133)nite set of linear functions. n 8 Appendix B : Model Statistics and Graphs 8.1 Tables (cid:18) = 2; (cid:13) = 1;(cid:17) = 0 f g Mean St.Dev (cid:26)(x ;x ) t t 1 (cid:0) Consumption (c) 3.55 1.29 0.92 Labor (l) 2.75 1.12 0.88 Wealth (w) 5.73 2.11 0.94 Information Flow ((cid:20)) 0.53 0.91 Cross-Correlation c( 1) c l( 1) l w( 1) w (cid:0) (cid:0) (cid:0) (c) 0.92 1 0.86 0.89 0.75 0.47 (l) 0.88 0.89 0.88 1 0.72 0.41 (w) 0.30 0.47 0.37 0.41 0.94 1 Table 10a: Statisticalproperties ofthe Model,M(2;1;0) (cid:18) = 0:2; (cid:13) = 1;(cid:17) = 0 f g Mean St.Dev (cid:26)(x ;x ) t t 1 (cid:0) Consumption (c) 4.05 1.11 0.95 Labor (l) 2.98 0.97 0.94 Wealth (w) 5.84 1.86 0.93 Information Flow ((cid:20)) 0.98 0.63 Cross-Correlation c( 1) c l( 1) l w( 1) w (cid:0) (cid:0) (cid:0) (c) 0.95 1 0.90 0.90 0.77 0.84 (l) 0.87 0.90 0.94 1 0.82 0.79 (w) 0.87 0.84 0.75 0.79 0.93 1 Table 10b: Statisticalproperties ofthe Model,M(0:2;1;0) 36

(cid:18) = 0:02; (cid:13) = 1;(cid:17) = 0 f g Mean St.Dev (cid:26)(x ;x ) t t 1 (cid:0) Consumption (c) 4.26 0.75 0.97 Labor (l) 3.76 0.73 0.77 Wealth (w) 6.06 1.21 0.89 Information Flow ((cid:20)) 1.52 0.34 Cross-Correlation c( 1) c l( 1) l w( 1) w (cid:0) (cid:0) (cid:0) (c) 0.97 1 0.86 0.94 0.48 0.88 (l) 0.86 0.94 0.77 1 0.76 0.87 (w) 0.88 0.78 0.77 0.87 0.89 1 Table 10c: Statisticalproperties ofthe Model,M(0:02;1;0) (cid:18) = 0:2; (cid:13) = 3;(cid:17) = 0 f g Mean St.Dev (cid:26)(x ;x ) t t 1 (cid:0) Consumption (c) 4.11 1.31 0.93 Labor (l) 3.90 0.92 0.91 Wealth (w) 5.88 1.84 0.90 Information Flow ((cid:20)) 1.10 0.45 Cross-Correlation c( 1) c l( 1) l w( 1) w (cid:0) (cid:0) (cid:0) (c) 0.93 1 0.90 0.85 0.64 0.72 (l) 0.76 0.85 0.91 1 0.66 0.75 (w) 0.50 0.62 0.58 0.55 0.90 1 Table 10d: Statisticalproperties ofthe ModelM(0:2;3;0) (cid:18) = 0:2; (cid:13) = 1;(cid:17) = 1 f g Mean St.Dev (cid:26)(x ;x ) t t 1 (cid:0) Consumption (c) 4.09 0.99 0.94 Labor (l) 2.90 0.74 0.95 Wealth (w) 5.07 1.31 0.97 Information Flow ((cid:20)) 1.01 0.59 Cross-Correlation c( 1) c l( 1) l w( 1) w (cid:0) (cid:0) (cid:0) (c) 0.94 1 0.88 0.90 0.73 0.67 (l) 0.95 0.90 0.95 1 0.44 0.54 (w) 0.56 0.67 0.64 0.64 0.97 1 Table 10e: Statisticalproperties ofthe ModelM(0:2;1;1) 37

8.2 Figures Comparing consumption fordi⁄erent utilities,(cid:18)=0:2 q =0.2 5 g=3,h =0 4.5 4 g=1,h =1 g=1,h =0 3.5 3 2.5 2 1.5 0 10 20 30 40 50 60 70 Figure 7 :Blue: (cid:13)=1;(cid:17)=0;Green: (cid:13)=1;(cid:17)=1;Violet:(cid:13)=3;(cid:17)=0. Comparing laborand wealth fordi⁄erent utilities,(cid:18)=0:2 q =0.2 8 Wealth, g=1,h =1 7 6 Wealth, g=3.h =0 5 4 3 2 Labor, g=1,h =0 1 0 10 20 30 40 50 60 70 80 Figure 8 :Blue: (cid:13)=1;(cid:17)=0;Green: (cid:13)=1;(cid:17)=1;Violet:(cid:13)=3;(cid:17)=0. 38

Comparing savings fordi⁄erent utilities,(cid:18)=0:2 q =0.2 3.5 g=3,h =0 3 g=1,h =0 2.5 2 1.5 1 g=1,h =1 0.5 0 0 10 20 30 40 50 60 70 Figure 9 :Blue: (cid:13)=1;(cid:17)=0;Green: (cid:13)=1;(cid:17)=1;Violet:(cid:13)=3;(cid:17)=0. Comparing wealth,consumption,savings and income fordi⁄erent (cid:18)0s,u(c;l)=log(c) (cid:11)l (cid:0) 10 5 8 4 no htlaew 6 itpm u sno 3 c 4 2 2 1 0 10 20 30 40 50 60 70 0 10 20 30 40 50 60 70 5 5 4 4 3 em o cn i 3 sgn iva s2 2 1 1 0 0 10 20 30 40 50 60 70 0 10 20 30 40 50 60 70 Figure 10 :Blue: (cid:18)=2;Violet:(cid:18)=0:2;Green: (cid:18)=0:02. 39

9 Appendix C: Rigidity of Labor and Consumption Choices This section builds up a low-dimension intuition for the solution strategy of the model in the section 2 before turning to the formal solution and its (cid:133)ndings. Consider a consumer who can choose to consume a quantity in the set (cid:10) c c . Each period, he c low high (cid:17) decides whether to work (cid:10) ( l = 0 l = 1 ) in exchange for a salary s. Assume for l (cid:17) f g_f g (cid:0) (cid:1) simplicity no asset but a (cid:133)xed initial endowment w(cid:22) = 2: The budget constraint is: c w(cid:22) +s1 (26) l=1 (cid:20) f g where 1 indicates whether the consumer works. l=1 f g Letu(c;l) logc (cid:11)l denotestheutilityoftheconsumer. Moreover, tomakematters (cid:17) (cid:0) concrete, let c = 2, c = 4 and (cid:11) = 0:3 and s = 2 with probability p and s = 1 with low high probability (1 p) (cid:0) Under full information capacity and no uncertainty, the agent will work i⁄: (cid:11)c s (27) (cid:20) The solution for this problemis clearly(c ;l = 1) i⁄p 0:2 and (c ;l = 0) if.p > 0:2: high low (cid:20) Now assume that it is prohibitively costly for the agent to know the probability of the outcomes for s. In this case, it is optimal for the consumer to choose (c ;l = 0). low Under rational inattention, the agent can reduce his uncertainty up to an amount given by his ability of processing information. Such a constraint, expressed in terms of change in entropy is the Shannon channel. The reduction in uncertainty is obtained by choosing the distribution of a signal informative about the underlying state (salary) as muchastheShannonchannelallowsit. Inparticular, thereare3possiblechoicestheconsumercanmakeandthatsatisfy(26),i.e., (c ;l = 1); (c ;l = 1); (c ;l = 0) : high low low With the constraint that the joint distribution p( c;l ;s) delivers as marginal for s (cid:8) f g (cid:9) Pr(s = 1) = p and Pr(s = 2) = 1 p, the joint distribution is (cid:0) C;L S s s 1 2 n (c ;l = 1) z z low 2 3 : (c ;l = 0) p z (1 p) z z low 2 1 3 (cid:0) (cid:0) (cid:0) (cid:0) (c ;l = 1) 0 z high 1 The problem of the consumer is then to maxE(u(c;l)) zi = u(c ;l = 1)z +u(c ;l = 1)(z +z )+u(c ;l = 0)(1 z z z ) high 1 low 2 3 low 1 2 3 (cid:0) (cid:0) (cid:0) s.t. p( c;l ;s) I(p( ; )) = p( c;l ;s)log f g s a (cid:1) (cid:1) f g ( p( c;l ;s))g(s) s c;l (cid:18) s 0 f g 0 (cid:19) XXf g P 40

The (cid:133)rst order condition are @z : 1 z 1 u(c ;l = 1) u(c ;l = 0) = (cid:21) ln high low (cid:0) 1 z z z (cid:18) (cid:18) (cid:0) 1 (cid:0) 2 (cid:0) 3(cid:19)(cid:19) @z : 2 z +z 1 z z z 2 3 1 2 3 u(c ;l = 1) u(c ;l = 0) = (cid:21) ln ln (cid:0) (cid:0) (cid:0) low low (cid:0) z (cid:0) p z (cid:18) (cid:18) 1 (cid:19) (cid:18) (cid:0) 2 (cid:19)(cid:19) @z : 3 z +z 1 z z z 2 3 1 2 3 u(c ;l = 1) u(c ;l = 0) = (cid:21) ln ln (cid:0) (cid:0) (cid:0) low low (cid:0) z (cid:0) (1 p) z z (cid:18) (cid:18) 3 (cid:19) (cid:18) (cid:0) (cid:0) 1 (cid:0) 3(cid:19)(cid:19) The set of (cid:133)rst order conditions yield a system of simultaneous trascendental equations. This system of transcendental equations involving logarithms can be solved using the LambertW function., which is an inverse mapping satisfying W (y)eW(y) = y and thus logW (y) + W (y) = logy. This function has multiple branches, Branches 0 and 1 are the only ones that can take on non-complex values. Let y = ex. To solve the (cid:0) three equations, combine the last two F.O.C.(cid:146)s and plug the solution in the (cid:133)rst using the constraints on the marginals. Let ! u(c ;l = 1) u(c ;l = 0) where i (c ;c ). i i low high low (cid:17) (cid:0) (cid:17) Then the solution for z is given by i (cid:21)=(! +p) i z = (cid:0) : (28) i W (((cid:21)=(! +p))e (cid:21)(1+p)=!) i (cid:0) 41

Figure 11 below illustrates the behavior of z as function of (cid:21), p and ! . 1 1 z as function of Prior and Utilities 1 1.4 1.2 1 0.8 l 0.6 0.4 0.2 0 1 0.8 3 2.8 0.6 2.6 2.4 2.2 0.4 2 1.8 0.2 1.6 1.4 1.2 0 1 p u(c ,l=1) u(c ,l=0) high low Figure 11 :An analiticalsolution 10 Appendix D Pseudocode Let (cid:18) be the shadow cost associated to (cid:20) = I(A;W). De(cid:133)ne a Model as a pair ((cid:13);(cid:17);(cid:18)). For a given speci(cid:133)cation : Step 1: Build the simplex. Construct a uniform grid to approximate each g(w)-simplex (cid:15) point. Step 2: For each simplex point, de(cid:133)ne p(w;c;l). and initialize with V g ( ) = 0: (cid:15) a0 j (cid:1) (cid:16) (cid:17) Step 3: For each simplex point, (cid:133)nd p (w;a;c) s.t. (cid:3) (cid:15) V (g(w)) = max c1 t(cid:0) (cid:13) (cid:11)l(cid:17)+1 (cid:18)[(cid:20)] : 0 jp (cid:3) (w;c;l) p(cid:3)(w;c;l) (cid:26)w W a A 1 (cid:0) (cid:13) (cid:0) (cid:17)+1 p (cid:3) (w;c;l) (cid:0) (cid:27) 2 2 (cid:16) (cid:17) P P 42

Step 4: For each simplex point, compute g ( ) = T ( ;w;c;a)p (w c;l). Use a (cid:15) a0 j (cid:1) w W (cid:1) (cid:3) j kernel regression to interpolate V (g(w)) into g ( ). 2 0 a0 j P(cid:1) Step 5: Optimize using csminwel and iterate on the value function up to convergence. (cid:15) Obs. Convergence and Computation Time vary with the speci(cid:133)cation ((cid:13);(cid:17);(cid:18)). 120-220 iterations each taking 8min-20min ! Step 6. For each model ((cid:13);(cid:17);(cid:18)), draw from the ergodic p (w;c;l) a sample (c ;l ;w ) (cid:3) t t t (cid:15) and simulate the time series of consumption, wealth, expected wealth and information (cid:135)ow by averaging over 1000 draws. Step 7. Generate histograms of consumption and impulse response function of consump- (cid:15) tion to temporary positive and negative shocks to income. 43

Cite this document
APA
Antonella Tutino (2009). The Rigidity of Labor: Processing Savings and Work Decisions Through Shannon's Channels (FEDS 2010-02). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2010-02
BibTeX
@techreport{wtfs_feds_2010_02,
  author = {Antonella Tutino},
  title = {The Rigidity of Labor: Processing Savings and Work Decisions Through Shannon's Channels},
  type = {Finance and Economics Discussion Series},
  number = {2010-02},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2009},
  url = {https://whenthefedspeaks.com/doc/feds_2010-02},
  abstract = {This paper argues that constraining people to choose consumption and labor under finite Shannon capacity produces results in line with U.S. business cycle data. My model has a simple partial equilibrium setting in which risk averse consumers keep high labor supply and low consumption profile at early stage of life to hedge against wealth fluctuations. They rationally choose to keep consumption and labor unchanged until they collect enough information. I find that at high frequency consumption appears to be more sluggish than labor supply. However, when people decide to change consumption they do so by a large amount. This combination leads to higher variance of consumption with respect to labor supply. My model also finds high persistence and strong comovement of consumption and employment and delayed response of consumption and labor with respect to wealth. Furthermore, my framework generates endogenously a wedge between marginal rate of substituition and marginal rate of transformation or wages. Such wedge is bigger and more volatile the lower information flow. These findings suggest that rational inattention offers a promising avenue to bridge the gap between theory and U.S. business cycle data.},
}