feds · April 20, 2017

The Cyclical Behavior of Unemployment and Wages under Information Frictions

Abstract

I propose a new mechanism for sluggish wages based on workers' noisy information about the state of the economy. Wages do not respond immediately to a positive aggregate shock because workers do not (yet) have enough information to demand higher wages. This increases firms' incentives to post more vacancies, which makes unemployment volatile and sensitive to aggregate shocks. The model is robust to two major criticisms of existing theories of sluggish wages and volatile unemployment: flexibility of wages for new hires and pro-cyclicality of the opportunity cost of employment. Calibrated to U.S. data, the model explains 70% of unemployment volatility. Accessible materials (.zip)

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. The Cyclical Behavior of Unemployment and Wages under Information Frictions Camilo Morales-Jimenez 2017-047 Please cite this paper as: Morales-Jimenez, Camilo (2017). “The Cyclical Behavior of Unemployment and Wages under Information Frictions,” Finance and Economics Discussion Series 2017-047. Washington: Board of Governors of the Federal Reserve System, https://doi.org/10.17016/FEDS.2017.047. NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

The Cyclical Behavior of Unemployment and Wages under Information Frictions∗ Camilo Morales-Jim´enez† March 1, 2017 Abstract I propose a new mechanism for sluggish wages based on workers’ noisy information about the state of the economy. Wages do not respond immediately to a positive aggregate shock because workers do not (yet) have enough information to demand higher wages. This increases firms’ incentives to post more vacancies, which makes unemployment volatile and sensitive to aggregate shocks. The model is robust to two major criticisms of existing theories of sluggish wages and volatile unemployment: flexibility of wages for new hires and pro-cyclicality of the opportunity cost of employment. Calibrated to U.S. data, the model explains 70% of unemployment volatility. ∗I am especially grateful to Boragan Aruoba, Luminita Stevens, John Haltiwanger, and John Shea for their valuable suggestions and support. I would also like to thank Katherine Abraham, Pablo Cuba-Borda,SebnemKalemli-Ozcan,EthanKaplan,FelipeSaffie,MarisolRodriguez-Chatruc,Loukas Karabarbounis, Ellen McGrattan, Ryan Michaels, Iourii Manovskii and seminar participants at the University of Maryland, University of Pennsylvania, University of Minnesota and Federal Reserve Bank of Philadelphia for valuable suggestions and helpful comments. The views expressed in this paper are solely the responsibility of the author and should not be interpreted as reflecting the views of the Board of Governors of the Federal Reserve System or of anyone else associated with the Federal Reserve System. †Contact: camilo.moralesjimenez@frb.gov 1

1 Introduction Search and matching models are an appealing way to study fluctuations in the labor market, as they define unemployment in a manner that is consistent with statistical agencies’ convention and describe in an attractive way the functioning of the labor market, how firms and workers are matched, and how wages are negotiated. However, Shimer (2005) points out that the volatility of unemployment predicted by the standard search and matching model is low. One approach to resolving this “Shimer Puzzle” has emphasized the amplifying effects of sluggish wages. This approach has been criticized in recent years on the basis that, empirically, wages for new hires (from unemployment and other jobs) exhibit little rigidity (Pissarides, 2009) while the opportunity cost of employment is pro-cyclical (Karabarbounis & Chodorow-Reich, 2016). In this paper, I propose a new mechanism that predicts sluggish wages based on workers’ noisy information about the state of the economy. This new mechanism is robust to the aforementioned critiques and generates business cycle dynamics for unemployment and wages that are consistent with the empirical evidence. In my model, wages for new hires are flexible, but wages do not adjust immediately to the true state of the economy because workers learn slowly about aggregate shocks. This delayed adjustment increases firms’ incentives to expand employment, making unemployment volatile and sensitive to aggregate shocks. My model is able to explain 70%ofoverallunemploymentvolatilityandgenerateswagesemi-elasticitieswithrespect totheunemploymentrateofaround-3%fornewhires, asreportedbyPissarides(2009). The model is in many respects similar to a standard RBC model with search and matching in the labor market. I introduce heterogeneous firms and assume that they differ in their permanent total factor productivity (TFP) levels, which are public information. In equilibrium, the most productive firms are larger and pay higher wages. To distinguish between new hires coming from unemployment and job changers, I assume that workers search on the job for better-paid jobs. The only source of aggregate uncertainty is aggregate TFP, which is not directly observed by workers. Instead, workers form expectations based on a public and noisy signal they receive each period. Thus, TFP shocks are only partially perceived by workers, who slowly learn about aggregate conditions as time goes by. This information friction affects the decisions of households and workers including those related to consumption and saving. Firms and workers 2

negotiate wages each period. Workers negotiate wages based on their beliefs about the aggregate state of the economy. Hence, after a positive productivity shock, wages remain relatively constant because workers do not immediately possess the proper information to demand higher wages, which generates sluggish wages within jobs. The persistence in wages within jobs increases firms’ incentives to hire workers in an expansion, as they get to keep a larger fraction of the match surplus. However, in equilibrium, the high-paying/most-productive firms hire proportionally more new workers than the low-paying/less-productive firms in response to a positive productivity shock, as documented by Kahn and McEntarfer (2014); Haltiwanger, Hyatt and McEntarfer (2015); and Moscarini and Postel-Vinay (2008, 2012). This differential employment response occurs because there is a significant increase in job-to-job flows, which reduces the average duration of a match for less productive firms and therefore the value of an additional worker. Hence, in an expansion, high-paying firms have to increase their wages the most in order to attract all workers they demand. As a result, even though wages within jobs adjust slowly to the true state of the economy, the average wage for new hires exhibits a large response to productivity shocks on impact because a new hire faces more and better-paying job opportunities in an expansion than in a recession. I calibrate my model using U.S. data for the period 1979 to 2015. To address the cyclicalityofwagesforjobstayersversusnewhires, IusetheCurrentPopulationSurvey (CPS)andIPUMS-CPS (Flood, King, Ruggles, &Warren, 2015)microdata tocompute the average wage for all workers, job changers, and new hires from unemployment controlling for individual characteristics (e.g. Solon, Barsky, & Parker, 1994; Haefke, Sonntag, & van Rens, 2013; Muller, 2012).1 These series show that job changers earn a lower wage than the average worker in the economy but a larger wage than new hires from unemployment, suggesting that unemployed workers are more likely to find a job atlowpayingjob andmoveupthejob ladder. However, I findlowwagesemi-elasticities with respect to the unemployment rate using these wage series: -0.3% for all workers, -0.6% for job changers, and -1.7% for new employees. Only the latter semi-elasticity is statistically significant. The model calibrated to the U.S. economy is able to generate a large volatility in labor market quantities (unemployment, vacancies, vacancy-unemployment ratio), and relatively low volatility in wages and consumption, as in the data. Also, the model 1Henceforth, I will refer to new hires from unemployment as “new employees.” 3

generates wage semi-elasticities with respect to the unemployment rate of around -3% for new hires, and -1% for all workers, which are similar to the estimate of Pissarides (2009) and larger than the estimates of Hagedorn and Manovskii (2013) and Gertler, Huckfeldt, and Trigari (2014). Even though most of the wage cyclicality in my model is driven by the differential growth rate between high and low-paying firms, I show that my model generates differential net job flows that are consistent with the empirical evidence presented by Haltiwanger et al. (2015). These moments (volatility, wage semi-elasticities, differential growth rates) are not a target in my calibration. I present a simple test to the main prediction of this paper: Wages should increase when workers are more optimistic about economic conditions. Using the University of Michigan Surveys of Consumers, I show that wage growth is positively correlated with workers’ expectations at monthly and quarterly frequencies. However, this relationship is only statistically significant at quarterly frequencies. I estimate that a 1 standard deviation increase in my measure of workers expectations is associated with a 0.4, 0.6 and 0.5 percent increase in the average wage for all workers, new employees, and job changers, respectively. This work builds on the literature that addresses the Shimer puzzle (Shimer, 2005; Constain & Reiter, 2008) by studying the amplifying effects of sluggish wages on job creation. This literature is large and includes, for example: Hall (2005); Hall and Milgrom (2008); Christiano, Eichenbaum and Trabandt (2016); Gertler and Trigari (2009); Kennan (2009); Menzio (2005); and Venkateswaran (2013). My paper differs in at least three aspects with respect to this literature. First, I propose a new mechanism for sticky wages based on workers who face information frictions regarding aggregate variables. This mechanism, does not rely on any assumption about the persistence of aggregateshocks(Menzio,2005)orthedistributionoffirms(Kennan,2009). Incontrast to Venkateswaran (2013), what drives sticky wages in my model is the fact that workers are willing to work for wages that do not adjust to the true state of the economy. That is, it is not enough to explain why firms offer wages that are very persistent —workers need to be willing to accept them. Second, my model is able to generate significant unemployment volatility in spite of the procyclicality of the Flow Opportunity Cost of Employment (FOCE) (Chodorow- Reich & Karabarbounis, 2014; Brugemann & Moscarini, 2010).2 Given that households 2FOCE is defined as the forgone value of unemployment benefits plus the forgone value of non- 4

make consumption and saving decisions based on the same information friction, investment (capital accumulation) absorbs most of the shock in the initial periods, which prevents consumption and the FOCE from increasing. Hence, even though the FOCE eventually rises, it takes time because workers (not firms) have information frictions regarding aggregate variables. Third, this paper looks at the distributional implications of productivity shocks. I show that high-wage firms expand employment the most during an expansion and how this mechanism generates different wage dynamics across firms and groups of workers. Even though the information friction is the same for all agents, wages at high-paying firms are more sensitive to the business cycle than wages at low-paying firms. As a consequence of this differential employment and wage growth, the average wage for new hires is more sensitive to the business cycle than the average wage for all workers. This paper is also related to the literature about information frictions. My model is close in spirit to Lucas (1972), in which agents’ inability to distinguish between aggregate and idiosyncratic shocks generates money non-neutrality. Following Angeletos and La’O (2012), the information friction presented in this paper has both a nominal and a real part, as it affects not only price (wage) decisions but also real allocations (saving, consumption). Paying limited attention to aggregate shocks is a standard result in the rational inattention literature that started with Sims (2003). For example, Mackowiak and Wiederhold (2009) present a model in which agents optimally decide to receive a noisy signal about aggregate conditions, as I assume in this paper, because acquiring information is costly. Similarly, Acharya (2014) and Reis (2006a, 2006b) show that agents optimally decide to update their information set sporadically when they face a cost of acquiring and processing information. On the empirical side, Coibion and Gorodnichenko (2012) find that the behavior of forecast errors is more consistent with a model in which agents receive noisy signals about aggregate conditions, as I assume in this paper. In addition, Carroll (2003) formulates and finds evidence in favor of a model in which consumers have a larger degree of information rigidity than other agents. Similarly, Roberts (1998) finds evidence of non-rational expectations in survey data, and Branch (2004) argues that surveys reject the rational expectation hypothesis not because agents use an ad hoc expectation rule, but rather because agents optimally decide not to use a more complicated expectation (predictor) function. working activities in term of consumption. 5

Finally, this paper is related to the literature that studies the cyclicality of wages over the business cycle. Pissarides (2009) argues that vacancy decisions depend only on the wage for new hires, which seem to exhibit little rigidity, and points out that the wage semi-elasticity with respect to the unemployment rate for new hires is around -3%, compared with a semi-elasticity of -1% for job stayers. The Pissarides critique has been recently challenged by Gertler et al. (2014) and Hagedorn and Manovskii (2013) based on the cyclicality of the match quality. However, whether wages for new hires are more procyclical than wages for existing workers is still an open question and is beyond the scope of this paper. Nevertheless, I use CPS and IPUMS-CPS microdata in order to construct the average wage for all workers and new hires (adjusted for individual characteristics) and assess the predictions of my model. It is worth noting that in my model, wages for new hires are flexible and I show that my model is able to reproduce a wage semi-elasticity with respect to the unemployment rate of around -3% for new hires, and -1% for all workers, which is not a target in my calibration. Hence, this paper points out that wage flexibility for new hires does not imply that wages adjust immediately to the true state of the economy. The rest of this paper is organized as follows. I present my model in section 2, and section 3 presents quantitative analysis. I discuss some alternative issues and test the main implication of my model using survey data in section 4. Finally, section 5 concludes. 2 Theoretical Framework The model presented in this section is, in many aspects, similar to a standard real business cycle model with search and matching in the labor market as in Andolfatto (1996) and Merz (1995). I introduce job changers in this model by assuming that there is a distribution of productivity across firms that induces a distribution of wages in the economy. Hence, employed workers search on the job for better paying jobs. The main difference of my model with respect to the relevant literature is that workers face information frictions about aggregate conditions. As in Lucas (1972), workers form expectations about current aggregate economic conditions based on noisy signals. 6

2.1 Household There is a representative household made up of a continuum of members with mass normalized to 1.3 The household is the owner of all firms in the economy, and it supplies capital and labor to firms. Capital is supplied in a perfectly competitive market at the rental rate r, while labor supply is subject to search frictions. I assume complete consumption insurance, which implies that workers seek to maximize income for the household. Consumption and savings decisions are made at the household level, but household members make their decisions based on the same information set I . h Throughout this paper, E [x] is the expected value of x conditional on the information I h set I , and E[x] is the expectation conditional on perfect information. h 2.1.1 Consumption and Saving Consumption and savings decision are made at the household level to maximize the life-time utility function c1−σ h ˜1+η U(ω,Ω) = −Ψ +βE[U(ω(cid:48),Ω(cid:48))] (1) 1−σ 1+η subject to the budget constraint (2), the aggregation of labor (3), and a perceived law of motion for the economy (4): (cid:90) 1 (cid:90) 1 c+k(cid:48) ≤(r+1−δ )k + w h dj + π dj +b·u−T (2) k j j j 0 0 (cid:18)(cid:90) 1 (cid:19) 1+ 1 ξ h ˜ = h1+ξdj (3) j 0 Ω(cid:48) =λh(Ω) (4) where (cid:48) denotes next period’s value. ω = {k,{h }1 ,I } is the vector of state variables j j=0 h for the representative household, and Ω is a vector that summarizes the aggregate state of the economy. c is consumption, k is capital, w is the wage paid by firm j, and j (cid:82)1 π stands for firm j’s profits. u = (1 − h )dj is the total number of unemployed j 0 j workers, and b is unemployment compensation, which is financed by lump sum taxes 3For expositional purposes, I derive in this section the value of employment and unemployment based on the model assumptions. For a detailed derivation of these value functions as in Merz (1995) and Andolfatto (1996), see appendix E. 7

(T = b · u). Parameter ξ in (3) governs the elasticity of substitution between h and x h for all x (cid:54)= y. The household and its members form expectations based on their y information set I and on a perceived law of motion for the economy (λh(·)). Hence, h the first order condition for consumption: (cid:104) (cid:105) c−σ = βE (1−δ +r(cid:48))c(cid:48)−σ (5) I h It is worth noting that the consumption decision is also affected by information frictions because the expectation in equation (5) is conditional on the information set I . To the extent that aggregate shocks are partially perceived, the household will h respond to productivity innovations by accumulating capital in an attempt to smooth consumption through time. As a result, the marginal disutility of labor (in terms of consumption) does not increase, which prevents wages from going up. This mechanism will be clear in section 2.5. 2.1.2 Workers A worker can be employed or unemployed at each point in time. Unemployed workers receive unemployment compensation b and are matched with a firm with probability q. Conditional on a match, a worker is matched with firm j with probability (cid:0)vj (cid:1) , where v v is the total number of vacancies in the economy and v stands for firm j’s vacancies. j Hence, the value of unemployment U(ω,Ω) is given by: (cid:26) (cid:18) (cid:90) 1 v (cid:19)(cid:27) U(ω,Ω) = b+E Q (1−q)U(ω(cid:48),Ω(cid:48))+q W (ω(cid:48),Ω(cid:48)) x dx (6) x v 0 whereQ = β (cid:0) c(cid:48)(cid:1)−σ isthestochasticdiscountfactorbetweenthisperiodandthenext c periodandW (ω,Ω)isthevalueofemploymentatfirmj. Meanwhile,employedworkers j are separated from their job with exogenous probability δ , in which case they have to h spend at least one period in unemployment before they can be matched with another firm. I assume that employed workers can search on the job and are matched with another firm with probability¯iq. However, I assume that employed workers only change jobs if they find a firm that offers an equal or better continuation value. Throughout this paper, I refer to jobs that deliver an equal or greater continuation value as better jobs. Hence, denoting W (ω,Ω) as the value of employment at firm j, the net value of j 8

employment is given by: (W (ω,Ω)−U(ω,Ω)) = w −z j j j +E{Q((1−δ )(1−¯iqF )(W (ω(cid:48),Ω(cid:48))−U(ω(cid:48),Ω(cid:48))) h j j +(1−δ )¯iqF (W ˜ (ω(cid:48),Ω(cid:48))−U(ω(cid:48),Ω(cid:48))) h j j −q (cid:0) W ¯ (ω(cid:48),Ω(cid:48))−U(ω(cid:48),Ω(cid:48)) (cid:1) )} (7) where: h ˜η−ξ z = b+Ψ hξ (8) j c−σ j (cid:90) 1 v x F = dx (9) j v j (cid:32) (cid:33) (cid:90) 1 v W ˜ (ω(cid:48),Ω(cid:48)) = W (ω(cid:48),Ω(cid:48)) x dx (10) j x (cid:82)1 v dy j j y (cid:90) 1 v W ¯ (ω(cid:48),Ω(cid:48)) = W (ω(cid:48),Ω(cid:48)) x dx (11) x v 0 The first line in equation (7) is the net flow income of a worker employed at firm j. The second term (z ) is the FOCE, which is defined as the forgone value of unemj ployment benefits plus the forgone value of non-working activities (derived from the household’s utility function (1)). The second line in equation (7) says that with probability (1 − δ )(1 −¯iqF ) a worker is not exogenously separated from firm j and is h j not matched with a better job, where F is the probability of finding a weakly better j job than j. The third line captures that with probability (1−δ )¯iqF a worker is not h j exogenously separated from firm j and is matched with a better job, W ˜ (ω(cid:48),Ω(cid:48)) is the j expected value of the new job for job changers leaving firm j. Finally, the fourth line in (7) is the worker’s outside option for next period. If unemployed, a worker finds a job with probability q and receives, on expectation, a continuation value equal to W ¯ (ω(cid:48),Ω(cid:48)). Notice that the net value of employment (W (ω,Ω)−U(ω,Ω)) is a decreasing funcj tion in z and therefore in consumption. An increase in consumption makes z go up j j and reduces the net value of employment. As a consequence, wages must increase when consumption increases in order to compensate workers for the decline in the value of 9

employment. It is worth noticing that equations (7) and (8) explain why high-paying firms are more sensitive to aggregate shocks than low-paying firms in this model. Since high-paying firms are larger in equilibrium, the FOCE (z ) is an increasing function in j j and, as a consequence, so is zj. Given that z is increasing in firm size, we should pj j also expect high-paying firms to have more volatile wages than low-paying firms.4 Finally, notice that the expectations in equations (6) and (7) are not conditional on the household’s information set I because equations these describe what a worker h will actually receive in expectation and not what workers expect to receive. However, workers will have to form expectations about W (ω,Ω) and U(ω,Ω) to negotiate wages, j as described in section 2.5. 2.2 Firms There is a continuum of firms indexed by j with a mass normalized to 1. All firms produce a homogeneous good that is sold in a competitive market. A priori the only difference among firms is their (permanent) TFP level, which is denoted by a . Without j loss of generality, I assume that a ≥ a for all x ≥ y. Firms produce with capital x y and labor, and their output can be used for consumption or for capital accumulation. At the beginning of each period, firms rent capital and open new vacancies, v. A vacancy is matched with a worker with probability q˜. As is standard in the literature, a filled vacancy becomes productive in the subsequent period. However, not all matches become productive. If a vacancy is matched with a worker who is currently employed at a better job, the match is dissolved. Hence, the job filling rate for firm j (q˜) is j (cid:0) (cid:1) given by q˜ = q˜u +q˜c, where q˜u = q˜· u is the probability of filling a vacancy with an j j s (cid:16) (cid:17) unemployed worker and q˜c = q˜· (cid:82)j (1−δ h )¯ihxdx is the probability of filling a vacancy j 0 s 4The underlying source of this differential growth is not important for the results of this paper. However,KahnandMcEntarfer(2014)findthathigh-payingfirmsdonothaveamorecyclicaldemand than low-paying firms and that high-paying firms have a larger drop in wage rigidity during recessions than low paying firms. 10

with a job changer. The problem for firm j is given by: (cid:2) (cid:3) Π (ω ,Ω) = max π +E QΠ (ω(cid:48),Ω(cid:48)) (12) j f j j f vj,kj s.t. κ π = eaj+akαh1−α −w h −rk − (q˜v )1+χ (13) j j j j j j 1+χ j j h(cid:48) = (1−δ )(1−¯iqF )h +q˜v (14) j h j j j j Ω(cid:48) = λf(Ω) (15) where a stands for aggregate TFP, which is common to all firms. ω = {h } is the f j vector of state variables for firm j, and equation (15) is the perceived law of motion for the economy. Denoting marginal labor productivity by p = (1−α)eaj+akαh−α, the j j j first order conditions with respect to v and k are given by: j j (cid:2) (cid:3) v : −κ(q˜v )χ +E Q·J(cid:48)(ω(cid:48),Ω(cid:48)) ≤ 0 (16) j j j j f (cid:18) (cid:19)(cid:18) (cid:19) h α j k : p −r = 0 (17) j j k 1−α j where J (ω ,Ω) = ∂Πj(ω f ,Ω) is the firm’s value of an additional worker, or the continuaj j ∂hj tion value of a filled vacancy: J (ω ,Ω) = p −w +E (cid:2) Q·(1−δ )(1−¯iqF )·J (ω(cid:48),Ω(cid:48)) (cid:3) (18) j f j j h j j f Notice that even though the exogenous separation rate δ is the same for all firms, h the total separation rate varies across firms. If we define δ = 1−(1−δ )(1−¯iq F ) hj h t j as firm j’s total separation rate, we can see that low-wage (less-productive) firms have higher separation rates. Therefore, labor market conditions affect the value of a new vacancy through the firm specific separation rate δ . hj 2.3 Information Sets I assume that workers (households) face information frictions in the sense that they do not perfectly know the current value of aggregate TFP (a), which is the only source of aggregate uncertainty. I assume that there is a public signal (aˆ), based on which 11

workers form expectations. I assume that this public signal is also observed by firms, so that workers’ beliefs are common knowledge. The public signal is the sum of the aggregate TFP an a noise signal denoted by n. The aggregate TFP (a) and the noise (n) are assumed to follow two independent AR(1) processes. aˆ = a+n (19) a(cid:48) = ρ a+e(cid:48); e ∼ N(0,ς ) (20) a a a a n(cid:48) = ρ n+e(cid:48) ; e ∼ N(0,ς ) (21) n n n n To formally define the equilibrium of this economy and find the solution of this model, I assume that workers can perfectly observe the state of the economy with a lag ofT periodswhereT isalargeinteger. Hence, theinformationsetfortherepresentative household is given by: I = {aˆT,Ω }, where aˆT represents the last T realizations of h −T n, and Ω is the value of the vector Ω T periods ago.5 −T 2.4 Matching The total number of matches in the economy m(v,s) is an increasing function in the (cid:82)1 total number of vacancies (v = v dj) and the total number of job searchers (s = 0 j u + (cid:82)1 (1 − δ )¯ih dj), where u = 1 − (cid:82)1 h dj is the number of unemployed workers. 0 h j 0 j Following the literature, m(v,s) is assumed to be homogeneous of degree 1. Hence, q = m(θ,1) and q˜= m(1,θ−1) where θ = v/s is labor market tightness. 2.5 Wage Negotiation I assume that wages are completely flexible and are negotiated at the start of every period according to a simple game, through which firms and workers bargain over the match surplus, S = J (ω ,Ω) + W (ω,Ω) − U(ω,Ω). For expositional purposes, j j f j →− I will abuse notation slightly in this section and define functions J (w,ω ,Ω) and j f −→ W (w,ω,Ω) as the value of a filled vacancy and employment for an arbitrary wage w. j Wages in this economy are negotiated according to the following game: (1) The firm offersawagextotheworker. (2)Theworkerobservesthefirm’soffer. Uponacceptance, −→ →− the game ends with payoffs of W (x,ω,Ω) − U(ω,Ω) to the worker and J (x,ω ,Ω) j j f 5Appendix B explains this in more details. 12

to the firm. (3) If the worker rejects the firm’s offer, the match is destroyed with exogenous probability 1−ϑ (with payoffs to both agents of 0); otherwise, the worker demands a wage y. (4) The firm observes this demand. Upon acceptance, the game −→ →− ends with payoffs of W (y,ω,Ω)−U(ω,Ω) for worker and J (y,ω ,Ω) for firm. If the j j f firm rejects the worker’s offer, the game ends with payoffs of zero for both agents. The extensive-from representation of this game is given in Figure 1. Figure 1: Wage Determination Game Firm offers a wage equal to x A R (cid:16)→− −→ (cid:17) J (x,ω ,Ω), W (x,ω,Ω)−U(ω,Ω) Nature j f j ϑ 1−ϑ Worker demands a wage equal to y (0,0) A R (cid:16)→− −→ (cid:17) J (y,ω ,Ω), W (y,ω,Ω)−U(ω,Ω) (0,0) j f j Note: Thisfigureshowstheextensive-formrepresentationofthewagedeterminationgame. Firmsand workers bargain over the match surplus (S ) by making wage offers/demands. Details are provided in j the text. 2.5.1 Equilibrium Wage and Discussion Even though this model assumes information frictions, an important benchmark is the case in which all agents have perfect information. In this spirit, the following lemma establishes the equilibrium of this game under perfect information, which will be used to compare the results under information frictions. Lemma 1. If all agents in the economy have complete and perfect information, the following strategy profiles constitute the unique sub-game perfect Nash equilibrium of this game: • Worker: To accept only wage offers greater than or equal to x∗ (first stage), and to demand a wage equal to y∗ (second stage). • Firm: To offer x∗ (first stage) and to accept only wage demands that are less than or equal to y∗ (second stage). 13

→− →− where x∗ and y∗ are such that J (x∗,ω ,Ω) = (1−ϑ)·S and J (y∗,ω ,Ω) = 0. j j j j j Proof. See Appendix A.1 Hence, under perfect information, the solution to this game coincides with the solution to the Nash-bargaining game when the worker’s bargaining power is equal to ϑ. Therefore, I will call ϑ the long-term bargaining power of workers. Now, before characterizing the solution to this game with information frictions, the following lemmas tell us that, in equilibrium, firms cannot credibly communicate the true state of the economy to the workers. Lemma 2. Suppose that agents are information-constrained as described in section 2.3. If there is an equilibrium in which firms’ strategy is to reveal the aggregate state of the economy, the best strategy for firms is the same strategy described in Lemma 1. Proof. See Appendix A.2 Lemma 3. If agents in the economy are information-constrained as described in section 2.3, then in equilibrium, firms do not follow a strategy in which they perfectly reveal the true state of the economy. Proof. See Appendix A.3 Lemmas 2 and 3 make clear that a solution in which firms reveal the true state of the economy is not possible. The intuition is simple: firms have incentives to lie. Firms will always be tempted to tell workers that aggregate productivity is lower than it actually is, so wages can be lower. As a consequence, workers do not rely on firms’ offer to form expectations about aggregate conditions. Before defining the solution for this game with information frictions, I make the following assumption: →− Assumption 1. For all realizations of a and n, J (x∗∗,ω ,Ω) ≥ 0 where x∗∗ is such j f (cid:104)−→ (cid:105) that E W (x∗∗,ω,Ω)−U(ω,Ω) = ϑ·E [S ] I j I j h h That is, if both parties agree upon a wage x∗∗ such that, according to the worker’s information set, a fraction ϑ of the match surplus goes to the worker, the firm still gets a positive payoff for all realizations of the true productivity and the signal.6 Next, the following lemma presents the solution to this game. 6I check that this assumption holds in my calibration by simulating the economy for 1 million periods and computing, for each period, mJ = min {J }. Figure 9 in Appendix D plots mJ and j j shows that mJ is always positive and never close to zero. 14

Lemma 4. If agents in the economy are information-constrained as described in section 2.3, the following strategy profiles constitute a Perfect Bayesian Nash equilibrium that satisfies the intuitive criterion: • Worker: To accept only wage offers greater than or equal to x∗∗ (first stage), and to demand a wage equal to y∗∗ (second stage). • Firm: to offer x∗∗ (first stage), and to accept only wage demands that are less than or equal to y˜∗∗. (cid:104)→− (cid:105) (cid:104)→− (cid:105) wherex∗∗ andy∗∗ aresuchthatE J (x∗∗,ω ,Ω) = (1−ϑ)E [S ]andE J (y∗∗,ω ,Ω) = I j j I j I j j h h h 0 Proof. See Appendix A.4 Notice that in equilibrium, wages are a function of what workers would have demanded if given the chance, even though they do not get to make such a wage demand in equilibrium. If firms anticipate that workers will ask for a fraction X of their perceived match surplus, they will offer a wage such that workers get ϑ·X of the match surplus. Notice that this result is common in the literature. In the classic paper of Rubinstein (1982), there are no counter-offers in equilibrium because the first player to move makes an offer that takes into account what the other player would get in the second stage of the game.7 Regarding the solution with information frictions, Lemma 4 is an important result for this paper. Given that firms have incentives to lie about true productivity (Lemma 3), workers will only use their own information set to assess wage offers. Hence, wage demands will be based on information frictions. To the extent that aggregate TFP shocks are partially perceived, wage demands will be less sensitive to aggregate conditions because workers’ expectations are smoother than aggregate shocks. 2.6 Equilibrium We can now characterize the vector that describes the aggregate state of the economy as Ω = {k,{h }1 ,aT,nT}. As before, aT and nT refer to the last T realizations of a j j=0 7Similarly, HallandMilgrom(2008)andChristianoetal. (2016)assumethatwagesarenegotiated according to an alternating wage offer game, in which there are no counter-offers in equilibrium for the same reason 15

and n, respectively.8 Definition 1. A recursive competitive equilibrium for this economy is a list of functions {U(ω,Ω), W (ω,Ω), U(ω,Ω), Π (ω ,Ω), J (ω ,Ω)} [Value Functions], {{w (Ω)}1 , j j f j f j j=0 r(Ω)}[Prices], {h (ω ,Ω), k (ω ,Ω), v (ω ,Ω), π (ω ,Ω), W ˜ (ω,Ω), z (Ω)}1 , W ¯ (ω,Ω), j f j f j f j f j j j=0 c(ω,Ω), k(ω,Ω), y(Ω), s(Ω), v(Ω), θ(Ω)} [Allocations], {{q˜(Ω), q˜c(Ω), F (Ω)}1 , j j j j=0 q(Ω), qu(Ω)}[Probabilities], and {λ, λf, λc}[Law of motion] such that given a law of motion for {aˆ, a, n}[Exogenous variables]: (1) The representative household optimize: c(ω,Ω) and k(cid:48)(ω,Ω) satisfy optimality condition (5) and the household’s budget constraint (2). (2) Firms optimize: v (ω ,Ω), k (ω ,Ω), and h (ω ,Ω) satisfy optimality conditions j f j f j f (16), (17) and the law of motion for h (69) for all j. j (3) Wages are a solution to wage bargaining game 2.5. (4) Value functions are consistent with equations (1), (7), (6), (12), and (18). (5) At each point in time, workers’ beliefs are determined by their information set I , h their perceived law of motion for the economy (4), and Bayes’ rule. (6) The decision rules of households and firms imply a law of motion for the economy such that: λf = λh = λ. (7) Probabilities: q˜ = q˜u + q˜c, q˜u = q˜(u/s), q˜c = q˜( (cid:82)j (1 − δ )¯i(h /s)dx), F = j j j 0 h x j (cid:82)1 (v /v)dx, and q = m(v,s)/s. j x ˜ ¯ (8) Allocations: π (ω ,Ω), z (Ω), W (ω,Ω), W(ω,Ω) and θ(Ω) are consistent with j f j j equations (13), (8), (10), (11), and θ(Ω) = (v(Ω)/s(Ω)). (9) Aggregation: y = (cid:82) eaj+akαh1−αdj, s = u+ (cid:82) ¯ih dj, u = 1− (cid:82) h dj, v = (cid:82) v dj, and j j j j j (cid:82) k = k dj. j Appendix F presents the equations that describe the equilibrium of this economy, and the computation of this model is explained in Appendix B. 3 Quantitative Analysis Iassessthemodel’spredictionsusingquarterlydatafortheUnitedStatesfortheperiod 1979 to 2015. I present business cycle statistics for the quarterly time series (seasonally adjusted) of unemployment, vacancies, output, consumption, investment, aggregate 8Section Appendix B describe the solution method in more detail. 16

TFP, and real wages (deflated by CPI and constructed using the CPS microdata) for new employees, job changers, and all workers. I take the quarterly average of series that are available monthly. Following Shimer (2005), all variables are HP-filtered in logs with a smoothing parameter of 105.9 Data source and details are presented in Appendixes I and J. Table 1: Business Cycle Statistics: U.S. Economy 1979:Q1 to 2015:Q4 u v v/u y c Inv wa wu wc a Standard deviation 0.19 0.19 0.38 0.02 0.02 0.10 0.02 0.03 0.02 0.02 Autocorrelation 0.97 0.96 0.97 0.95 0.96 0.94 0.9 0.69 0.65 0.91 u 1 -0.92 -0.98 -0.80 -0.63 -0.82 -0.22 -0.21 -0.13 -0.47 v 1 0.98 0.76 0.56 0.85 0.06 0.01 0.01 0.50 v/u 1 0.79 0.6 0.86 0.14 0.11 0.07 0.49 y 1 0.91 0.82 0.54 0.40 0.49 0.81 c 1 0.61 0.62 0.47 0.55 0.76 Correlation Matrix Inv 1 0.22 0.07 0.25 0.67 wa 1 0.79 0.78 0.51 wu 1 0.71 0.32 wc 1 0.45 a 1 Note: Statistics for the U.S. economy are based on: u: Unemployment level. v: Helpwanted index (Barnichon, 2010). v/u: Vancancy-unemployment ratio. y: Real GDP. c: Consumptionofnon-durablegoodsandservices. Inv: Realprivatedomesticinvestment. wa: Average wage in the economy. wu: Average wage for new employees. wc: Average wage for job changers. a: Solow residual. All series are seasonally adjusted, logged, and detrended via the HP filter with a smoothing parameter of 100,000. Calculations based on: U.S. Bureau of Labor Statistics, Unemployment level [UNEM- PLOY] and Consumer Price Index for All Urban Consumers [CPIAUCSL], U.S. Bureau of Economic Analysis, Real Gross Domestic Product [GDPC1], Personal Consumption Expenditures: Nondurable Goods [PCNDG96] and Services [PCESVC96], Real Gross PrivateDomesticInvestment[GPDIC96],allretrievedfromFRED,FederalReserveBank ofSt. Louis,https://fred.stlouisfed.org/. Barnichon(2010),CurrentPopulationSurvery (CPS), IPUMS-CPS (Flood et al., 2015), and Federal Reserve Bank of San Francisco. Table 1 presents unconditional business cycle statistics for the U.S. economy. As has been previously documented in the literature, unemployment is one of the most volatile series (e.g. Shimer, 2005; Costain & Reiter, 2008). Unemployment is 10 times more volatile than TFP, and 8 times more volatile than output, in contrast to consumption, 9In general my results are not very sensitive to this parameter. 17

whose standard deviation is almost as large as that of TFP. Vacancies and the vacancyunemployment ratio are also highly volatile as their standard deviations are 19.5% and 37.5%, respectively, in comparison with 2.3% and 1.6% for output and TFP, respectively. Investment also fluctuates a lot over the business cycle. Even though it is not as volatile as unemployment or vacancies, the standard deviation of investment is 4.5 times larger than that of output. In contrast, wages are not very volatile. The standard deviations of the average wage of all workers (wa), new hires from unemployment (wu), and job changers (wc) are 2.1%, 3.4% and 2.4% respectively. Also, it is noteworthy that wages are the less persistent and less correlated series with unemployment and output. 3.1 Parameterization I calibrate this model to a monthly frequency and compute quarterly averages of the model generated series in order to compare my model results with the U.S. data. I take the values for the intertemporal elasticity of substitution (σ), the inverse of the Frisch elasticity (η), and the output elasticity of labor (α) from previous literature and set these parameters equal to 1, 0.5, and 0.33, respectively. I set ϑ equal to 0.5, which implies equal bargaining power for workers and firms in steady state. The unemployment benefit b is set to 0.041 following the evidence presented by Chodorow- Reich and Karabarbounis (2014). I set δ and β so that the annual depreciation rate is equal to 10% and the annual interest rate is equal to 5% in steady state. Following den Han, Ramey and Watson (2000), I assume the following matching function: m(s,v) = sv , where l is such that the job-finding rate (q) is equal to 1 (sl+vl)l 0.27 in steady state, which implies an average duration of unemployment equal to 15 weeks. The exogenous separation rate δ is set such that the unemployment rate is h equal to 5.5% in steady state. I discretize the firm productivity distribution (a ) into 101 points, and I calibrate j it such that marginal labor productivity (p ) is distributed according to a truncated j (cid:2) (cid:1) normal between p,∞ with a mode equal to 1 and standard deviation equal to 0.6, as reported by Long, Dziczek, Luria, and Wiarda (2008). In order to guarantee a positive match surplus in equilibrium for all firms, I set p equal to 0.4, which is 10 times larger than b. I calibrate the disutility of labor parameter Ψ such that the average of the ratio 18

zj across firms is equal to 0.72, which is consistent with the value found by Hall and pj Milgrom (2008). The value for¯i is calibrated such that the number of job changers per month in steady state is equal to 2.5% of the total population (Fallick & Fleischman, 2004). Table 2: Parameter Values Externally Calibrated Parameter Value Description σ 1 Intertemporal elasticity of substitution ξ 0.5 Inverse of Frisch elasticity α 0.33 Labor share in production function ρ 0.951/3 Persistence of productivity shocks a ϑ 0.5 Workers’ bargaining power in steady state Internally Calibrated Parameter Value Description ς 2·ς Standard deviation of noise shocks. n a ρ 0.841 Persistence of noise shocks. n δ 0.015 Exogenous separation rate. h Ψ 1.158 Desutility of labor parameter. b 0.041 Unemployment benefits. l 1.3806 Matching function parameter. ¯i 0.485 Relative search intensity of employed workers. β 0.996 Discount factor. ξ 0.2922 Elasticity of substitution between jobs. χ 0.3225 Hiring cost function convexity. δ 0.0087 Capital depreciation rate. k Note: This table summarizes the parameterization of the model. Details are reported in section 3.1. I use parameters ξ and χ to target some moments of the wage and size distribution across firms for the United States. Kahn and McEntarfer (2014) present the average monthly earnings across firms for each wage quintile, which are reproduced in Table 9 in Appendix C. According to this table, a worker employed at a firm in the highest wage quintile earns, on average, 3.61 times more than a worker employed at a firm in the lowest wage quintile. I calibrate ξ to match this moment. On the other hand, based on the firm size distribution reported by the Business Dynamics Statistics from 1977 19

to 2014, I calibrate parameter χ to target the average size of firms with 100 to 249 employees as a fraction of the average size of the smallest firms.10 Finally, the persistence of aggregate TFP is set equal to 0.951/3 and the standard deviation calibrated to match the standard deviation of TFP presented in Table 1. Following Coibion and Gorodnichenko (2012), ςn is set to 2 and ρ is calibrated such ςa n that the perception error has a quarterly persistence of 0.8.11 Table 2 summarizes the aforementioned calibration parameters. Figure 2: Firm and Employment Distribution in Steady State (a) p j vs. a j (b) w j vs. F j (c) Firm Size (d) δ vs. q˜ (e) Distribution of New Hires (f) Distribution of Workers hj j Note: This figure plots the distributions of employment and productivity across firms in steady state along with the separation rate, job filling rate, and wage associated with each firm. 10Table 10 in Appendix C presents these statistics. Given that average size increases drastically at the high end of the distribution, I do not target the average size of the largest firms 11Perception error IS defined as aggregate TFP - perceived aggregate TFP. 20

3.2 Model Results 3.2.1 Steady State Figure 2 illustrates the properties of this model in steady state. Panel 2a plots the distribution of idiosyncratic TFP across firms (f(a )) and the marginal labor producj tivity associated with each a . Panel 2b shows the wage rate (w ) and the probability of j j finding a better job conditional on a match for employed workers (F ). Panel 2c shows j the average firm size as a function of the firm’s labor productivity p (solid black line), and the distributions of employment (dashed line). Panel 2d plots the separation rate (δ ) and the job filling rate (q˜) associated with each level of labor productivity. Panels hj j 2e and 2f, plot the distribution of new employees (solid black lines) and job changers (dashed black lines) along with the labor productivity distribution (red line in panel 2e) and the distribution of all workers (dotted black line in Panel 2f). Based on Figure 2, we can see that workers employed at the most productive firms earn higher wages and face a lower probability of finding a better job (F ). On the other j hand, high-productive firms are larger, retain a larger fraction of their workers and fill vacancies faster than low-productive firms (Panels 2c and 2d). As a consequence, the distribution of employment is shifted to the right in comparison with the distribution of the labor productivity (Panel 2c), and unemployed workers are more likely to find a job at a low paying firm and then move up the job ladder. Table 3: Average Wages in Steady State All workers Job Stayers New Hires Job Changers New Employees Model 1.00 1.01 0.85 0.94 0.70 Data 1.00 1.01 0.90 0.98 0.89 Notes: Allwagesareexpressedasafractionoftheaveragewageforallworkers in the economy. The first row of this table reports the average wage for different groups of workers in steady in the model. The second row presents dataaveragefortheperiod1979-2015. Theaveragewagefornewhiresandjob stayers are weighted averages of the average wage of all workers, job changers and new employees. Table 3 reports the average wage for different types of workers as a fraction of the average wage for all workers in the model and in the data. According to the model results, the average wage for job stayers is higher than the average wage for any other 21

group, while the average wage for new employees is the lowest. High-paying firms are larger and a great share of their hires come from other firms, while low-paying firms rely moreonthepoolofunemploymenttofillavacancy. Thisstoryhasbeendocumentedby Moscarini and Posterl-Vinay (2008) and Haltiwanger et al. (2005) and is also consistent with the data reported in the second row.12 Even though the model does a good job matching the wage and size distribution in the economy, the average wage for new employees is significantly lower in my model than in the data. Figure 3: Impulse Response Function to a 1% Increase in Aggregate Productivity Note: This figure plots model Impulse Response Functions (IRFs) to a 1% increase in aggregate TFP. Solid black lines are the IRFs for a model in which workers face information frictions, and dashedlines are the IRFs generated by a model in which all agents have perfect information. 3.2.2 Impulse Response Functions Next, Figures 3 and 4 plot the Impulse Response Functions (IRFs) of the aggregate variables of this model to a 1% increase in aggregate TFP (solid black lines). To decipher the role of information frictions, I simultaneously plot the IRFs generated by a calibrated model in which agents have perfect information (dashed lines). In 12Appendix I reports additional graphs. 22

addition, Figure 5 plots the IRFs for firms located at the 20th, 40th, 50th, 60th, and 80th percentiles of the wage distribution weighted by employment. Because TFP shocks are partially perceived by workers, wages are less sensitive to aggregate productivity innovations, and firms have more incentives to expand employment (Figure 4). In particular, the assumed information friction has two reinforcing effects on wages. First, workers’ expectations are highly sluggish. Hence, in a boom, workers do not demand a largeincrease in wages because theydo not have enough information to conclude that the economy has entered an expansionary path. Second, given workers’ beliefs, consumption does not change significantly on impact, which curbs the increase of the flow of opportunity cost of employment (z ) from increasing and makes j wages even less responsive. Figure 4: Impulse Response Function to a 1% Increase in Aggregate Productivity Note: This figure plots model Impulse Response Functions (IRFs) to a 1% increase in aggregate TFP. Solid black lines are the IRFs of a model in which workers face information frictions, and dashed lines are the IRFs generated by a model in which all agents have perfect information. q, q˜, and q˜u denote the job finding rate, the probability that a vacancy is matched with a worker, and the job filling rate (from unemployment), respectively. However, the most productive firms experience a larger expansion in employment as a consequence of a positive aggregate TFP innovation. Notice that the value of a new 23

hireisaffectedbytwocountervailingeffects. Ontheonehand,theproductivityincrease, combined with sluggish real wages, tends to increase the value of an additional worker for firms in an expansion. On the other hand, the expansion in overall employment increases the flow of job changers, which rises the separation rate and reduces the the value of an additional worker, as firms expect the match to not last as long. Given that the separation rate increases the less for high-paying firms, they expand employment the most. As a result, the differential employment growth rate between high and low paying firms is positive and procyclical, which is consistent with the empirical evidence (e.g. Kahn & McEntarfer, 2014; Haltiwanger et al., 2015). Similarly, since low-paying firms rely more on hiring from the pool of unemployment, they experience a larger decline in q˜ than high-paying firms because of the decline in unemployment. j Figure 5: Distributional Dynamics to a 1% Increase in Aggregate Productivity Note: This figure plots the Impulse Response Functions (IRFs) for a model with information frictions for different firms to a 1% increase in aggregate TFP. Solid gray lines are the IRFs for firms at the 20th percentile of wage distribution weighted by employment. The dashed-gray lines are the IRFs for firms at the 40th percentile. The solid red lines are the IRF for the median firm. The dashed black lines are the IRF for firms at the 60th percentile, and the solid black lines are the IRFs for firms at the 80th percentile. z denotes the flow opportunity cost of employment for firm j. j This differential growth rate in employment implies a larger increase in the FOCE 24

(z ), and as a consequence in wages, for high-paying firms. Because workers can perj fectly distinguish among firms and they know that highly-productive firms are more sensitive to the business cycle, employees at the most productive firms demand higher wage in expansions than employees at less-productive firms. Hence, the differential employment growth rate occurs despite the larger adjustment in wages for high-paying firms, as documented by Kahn and McEntarfer (2014). Noiseshocks, however, donotonlyreducewagedemandsthroughworkers’perceived productivity. WorkerswillpartiallyattributenoiseshockstoTFPinnovations,resulting in higher wage demands and unemployment as shown in Figure 6. Since workers know that the demanding higher wages in response to noise shocks is not optimal, wage demands will tend to be even lower in response to productivity shocks. In other words, wage demands weight the positive effects of a positive TFP shock and the negative effects of a positive noise shock. Figure 6: Unemployment and Wage Responses to a 1% Increase in Noise Shock (a) Unemployment (b) Average Wage Note: ThisfigureplotstheImpulseResponseFunctionstoa1%increaseinthesignalnoise(n). Lines represent % deviation with respect to its steady state value. 3.2.3 Simulated Business Cycles Statistics Table 4 reports the theoretical business cycle statistics predicted by a model in which workers face information frictions, and Table 5 does the same for the model with full information.13 In comparison with the U.S. data, my model with information frictions 13The model was used to generate artificial series of the same length and frequency as in my data. These series were used to compute, in the same way, the same business cycle statistics reported in 25

doesagoodjobpredictingahighvolatilityforlabormarketvariablesandalowvolatility for wages and consumption. This is in part because of the amplifying effect of the information friction. The volatility of u, v, and v/u are three times larger in the model with information frictions than in the model with full information, but the volatility of consumption and wages is just slightly larger in the former. The autocorrelation predicted by my model are close to the data, and my model is able to display a lower autocorrelation for the average wage of new employees and job changers. Even though the information friction tends to reduce the correlation between the average wage of the economy with other economy variables, those correlations continue to be significantly larger than in the data. Similarly, the correlations between the average wage for new employees and all variables of the economy have a different sign as in the data. Table 4: Simulated Business Cycle Workers Face Information Frictions u v v/u y c Inv wa wu wc a Standard deviation 0.14 0.18 0.31 0.03 0.01 0.09 0.02 0.02 0.01 0.02 Autocorrelation 0.93 0.87 0.91 0.93 0.97 0.92 0.94 0.76 0.74 0.89 u 1 -0.89 -0.97 -0.83 -0.24 -0.91 -0.56 0.47 0.08 -0.72 v 1 0.98 0.78 0.07 0.90 0.39 -0.21 0.11 0.75 v/u 1 0.83 0.15 0.93 0.48 -0.34 0.03 0.76 y 1 0.61 0.94 0.86 -0.01 0.40 0.96 c 1 0.32 0.83 0.21 0.45 0.50 Correlation Matrix Inv 1 0.69 -0.13 0.26 0.93 wa 1 0.08 0.44 0.80 wu 1 0.87 0.18 wc 1 0.54 a 1 Note: Statistics for the simulated economy under information frictions: u: Unemployment level. v: Vacancies v/u: Vancancy-unemployment ratio. y: Output. c: Consumption. Inv: Investment. wa: Averagewageintheeconomy. wu: Averagewage for new employees. wc: Average wage for job changers. a: Aggregate TFP. All series are seasonally adjusted, logged, and detrended with the HP filter with a smoothing parameter of 100,000. Table 1. This exercise was repeated 10,000 times. Then, each statistic reported in Tables 4 and 5 is the average of that moment across the 10,000 simulations. 26

Table 5: Simulated Business Cycle Model with Full Information u v v/u y c Inv wa wu wc a Standard deviation 0.04 0.06 0.10 0.02 0.01 0.06 0.02 0.01 0.01 0.02 Autocorrelation 0.95 0.92 0.95 0.92 0.96 0.91 0.92 0.77 0.82 0.89 u 1 -0.93 -0.98 -0.93 -0.76 -0.92 -0.9 -0.7 -0.74 -0.9 v 1 0.99 0.96 0.71 1.00 0.92 0.87 0.88 0.99 v/u 1 0.96 0.74 0.98 0.93 0.81 0.83 0.97 y 1 0.87 0.96 0.98 0.9 0.91 0.98 c 1 0.69 0.88 0.74 0.77 0.75 Correlation Matrix Inv 1 0.92 0.88 0.88 0.99 wa 1 0.87 0.89 0.94 wu 1 0.96 0.93 wc 1 0.92 a 1 Note: Statistics for the simulated economy under perfect information: u: Unemployment level. v: Vacancies v/u: Vancancy-unemployment ratio. y: Output. c: Consumption. Inv: Investment. wa: Average wage in the economy. wu: Average wage for new employees. wc: Average wage for job changers. a: Aggregate TFP. All series are seasonally adjusted, logged, and detrended with the HP filter with a smoothing parameter of 100,000. 3.2.4 Wages and Employment Composition The left panels of Figure 7 plot the IRF to a 1% increase in aggregate TFP for average wages for each type of worker in the model with information frictions (panel 7a) and in the model with full information (panel 7c). Notice that the average wage for new hires, job changers, and new employees has a larger response, soon after the TFP shock. However, these wage differences are driven primarily by heterogeneity across firms. To see the importance of this heterogeneity, the right panels of Figure 7 plot the average wage for all groups of workers when wages are adjusted for this composition effect following Horrace and Oaxaga (2001).14 By comparing the left and right panels of Figure 7, we can infer that the initial increase in the wages of new hires, job changers and new employees is due almost entirely to the large increase in employment at high- 14ThecompositionadjustedwageforgroupGisdefinedastheaveragewageforafixedcomposition of workers, which is given by the distribution of workers in steady state. 27

paying firms. 15 Figure 7: Wages Responses to a 1% Increase in Aggregate Productivity (a) Information Frictions Model (b) Information Frictions Model Simple Average Composition Adjusted (c) Full Information Model (d) Full Information Model Simple Average Composition Adjusted Note: This figure plots the evolution of the average wage for different groups of workers in response to a 1% increase in aggregate productivity. Panels 7a and 7c plot the evolution of average wages not adjusted for composition effects. Panels 7b and 7d plot the evolution of average wages adjusted for composition effects as proposed by Horrace and Oaxaga (2001). How does the wage flexibility in my model compare with the data? and How large is the reallocation of workers from low to high paying firms in my model? These questions are answered in Tables 6 and 7. On the one hand, Pissarides (2009) argues that the wage semi-elasticity with respect to the unemployment rate for new hires is around -3% compared with -1% for job stayers, indicating a larger wage flexibility for new hires than usually assumed and questioning the importance of wage stickiness to explain 15Actually, in the model with information frictions, the average wage response for new employees is lower than the average wage response for job changers, which is slightly lower than the average wage response for job stayers. This is because unemployed workers are more likely to find a job at a low payingfirmsandthentomoveupthejobladder(Figure2f),andlowpayingfirmsincreasetheirwages less in booms (Figure 5). 28

the unemployment volatility. I compute this wage semi-elasticity using the wage series constructedfrommicrodataandmodelsimulatedseriesbyfittingthefollowingequation consistent with Pissarides (2009): log(wG/wG ) = α +β ·(ur −ur )+e (22) t t−1 0 u t t−1 t where wG is the average wage for group G. α is a constant, ur is the unemployment t 0 t rate at time t, and e is an error term. Table 6 reports the estimated values for β using t u U.S.dataincolumn1. MyestimatesaresmallerthanreportedbyPissarides(2009),and only the semi-elasticity for new employees is statistically significant at 10%. Columns 2 and 3 in Table 6 present the theoretical values for β in the model with full information u and with information frictions respectively.16 The model with full information predicts a wage semi-elasticity of -1% for all workers but positive and large semi-elasticities for new employees and job changers. This is because, the largest wage responses are on impact, but the largest unemployment response is not. In contrast, the model with information frictions generates negative wage semi-elasticities for all groups of workers (-2.93% for job changers, -2.65 for new employees, and -1.12% for all workers). In the model with information frictions, the largest wage responses are not on impact, and we observe a significant change in the average wage for new hires because high-paying firms expand employment the most in booms (Figure 7), which induces a higher correlation between changes in unemployment and wages for new hires. On the other hand, I asses the employment flows generated by my model in Table 7. Column (1) of Table 7 reproduces the findings of Haltiwanger, et al (2015) regarding the reallocation of workers from low to high paying firms during the business cycle. As before, columns (2) and (3) report the analog theoretical moments. In particular, Table 7 reports the estimated values of β in the following specification: Y = γ +π +β ·CYC +(cid:15) (23) t t t t where Y is differential net job flows, net poaching flow, and net nonemployment flows t 16Usingmymodel,Igenerateartificialseriesofthesamesizeandfrequencyasmydata(148monthly observationsfortheaveragewageofallandnewemployeesand88monthlyobservationsfortheaverage wage of job changer). Using these artificial series, I estimate β by fitting equation (22). I repeat this u exercise 10,000 times and get the theoretical value of β by taking the average across the 10,000 u simulations. 29

Table 6: Wage Semi-Elasticities with Respect to the Unemployment Rate Model Simulated Data US Data Full Information Information Frictions All Workers -0.27 -1.01 -1.12 (0.27) New Employees -1.66∗ 3.98 -2.65 (0.99) Job Changers -0.57 3.42 -2.93 (1.21) Note: Each specification includes monthly dummies and a lag of the independent variable. Robust standard error in parenthesis. ∗, ∗∗, ∗∗∗ indicatestatisticallysignificanceat10%, 5%, and1%respectively. SampleperiodforallandnewemployeesisJanuary1979toDecember 2015 and for job changers is January 1994 to December 2015. between high and low wage firms.17 γ is a constant, π includes seasonal dummies and a time trend, and (cid:15) is an error term. CYC is a cyclical variable, which can be the HP-filtered unemployment rate or the first different in the unemployment rate. According to Table 7, both models generate differential employment flows that are consistent with the empirical estimates of Haltiwanger, et al (2015) when I use the HP-filtered unemployment rate as a cyclical indicator. But both models generate little differential employment growth when the first different in the unemployment rate is used as a cyclical indicator. Given that my model does not display a larger differential employment flow than observed in the data, the wage cyclicality reported in Table 6 does not seem to be driven by an excessive reallocation of workers from low to highpaying firms. 4 Robustness and Extensions 4.1 A Simple Test Using Survey Data Inthissubsection, Itestthemainimplicationofmymodel: Wagesshouldincreasewhen workers are more optimistic about economic conditions. To this end, I re-estimate equation (22) introducing a variable of workers’ expectations. Hence, my empirical 17High wage firm indicates that the firm is in the two top quintiles of the wage distribution across firms. Low wage firm indicates that the firm is in the bottom quintile of the wage distribution. 30

Table 7: Differential Net Flows, Coefficient on Cyclical Variable High Wage minus Low Wage Model Simulated Data Data Full Information Information Frictions Deviation from HP Trend Net Job Flows -0.269 -0.210 -0.257 Net Poaching Flows -0.253 -0.252 -0.292 Net Nonemployment -0.016 0.042 0.035 Flows First Difference Net Job Flows -0.557 0.007 0.018 Net Poaching Flows -1.460 0.004 0.013 Net Nonemployment -0.903 0.003 0.005 Flows Note: DataforthefirstcolumnarefromHaltiwanger,Hyatt,andMcEntarfer (2015) Table 1. Each model was used to generate artificial data over a time horizon of 55 quarters, which is consistent with the sample size of Haltiwanger, et al. (2015). Each model was simulated 10,000 times. Thecoefficientonthecyclicalvariablewascomputedforeachartificial series, and the theoretical coefficient was estimated by averaging across the 10,000 simulations. model takes the following form: (cid:0) (cid:1) log(wG)−log(wG ) = α +β ·(ur −ur )+β ·∆Ew +e (24) t t−1 0 u t t−1 E t t Where variable ∆Ew is a measure of change in workers expectations. As a proxy t of Ew, I use the Index of Consumer Sentiments (ICS) of the Surveys of Consumers t conducted by the University of Michigan. The ICS is reported monthly and is intended to be a indicator of how consumers view prospects (better or worst) for their own financial situation and for the economy in the near and long term. The ICS is used in levels because it asks for expected changes instead of expected levels. Also, for an easier interpretation in my estimation, I normalize the ICS such that it has a zero mean and a standard deviation equal to one. The first three columns in Table 8 present the results of estimating equation (24) by OLS. For an easier interpretation, all values in Table 31

8 are expressed as semi-elasticities. By including a variable of workers expectations, all wage semi-elasticities with respect to the unemployment rate become smaller and statistically insignificant. Also, these results show that wages are positively correlated with workers’ expectations, but this coefficient is only statistically significant for the average wage of all workers. A one standard deviation increase in the ICS is associated with a 0.14% increase in the average wage for all workers. Table 8: Wage Growth vs. Expectations Monthly Frequency Quarterly Frequency All New Job All New Job Workers Employees Changers Workers Employees Changers β -0.03 -1.51 -0.42 -0.31 -0.23 0.87 u (0.28) (1.02) (1.28) (0.28) (0.94) (0.98) β 0.14∗∗∗ 0.08 0.10 0.35∗∗∗ 0.63∗∗ 0.48∗∗∗ E (0.05) (0.18) (0.21) (0.11) (0.27) (0.24) β -0.07 -0.12 -0.71 ue (0.33) (0.92) (1.00) β -0.25 -0.49 -0.05 y (0.20) (0.42) (0.50) R2 0.21 0.27 0.26 0.17 0.33 0.15 N 442 426 253 146 139 83 Note: This table reports the estimated coefficient on the unemployment rate (β ), workers’ expectations (β ), expected change in the unu E employment rate over the next year (β ), and expected GDP growth ue forthefollowingyear(β )foreachgroupofworkers. Eachspecification y includes monthly (quarterly) dummies and a lag of the independent variable. Robust standard errors in parentheses. ∗, ∗∗, ∗∗∗ indicate statistically significance at 10%, 5%, and 1% respectively. Sample period for all and new employees is January 1979 to December 2015 and for job changers is January 1994 to December 2015. Toseeifwagesarecorrelatedwithotheragents’expectations, thelastthreecolumns of Table 8 report the results of estimating the following equation: (cid:0) (cid:1) log(wG)−log(wG ) = α +β (ur −ur )+β Ew +β ure +β yee (25) t t−1 0 u t t−1 E t ue t y t t where ure is the expected change in the unemployment rate over the next year, and t ye is the expected GDP growth for the following year. ure and ye are taken from the t t t 32

Survey of Professional Forecasters and are available at a quarterly frequency. These results show that workers’ expectations are more significantly correlated with wages at a quarterly frequency. A 1 standard deviation increase in the ICS is associated with a 0.35%, 0.63%and0.48%increaseintheaveragewageforallworkers, newemployeesand job changers respectively. Also, it is worth noticing that the ICS is the only statistically significant variable in Table 8, suggesting a significant relationship between wages and workers’ beliefs. 4.2 Unilateral Deviations in Equilibrium Nothing in my model prevents firms from paying a higher wage in equilibrium. Even though higher wages increase firms’ payroll, offering a higher wage can reduce the fraction of workers who are poached by other firms, and as a consequence, increase profits. When workers’ bargaining power is low (high), firms incentives to offer higher wages could be large (small). However, in my calibration with a bargaining power equal to 0.5, if we allowed firms to offer higher wages, no firm has incentives to offer a higher wage in equilibrium. In Appendix G, I present this results formally following the theoretical framework of Moscarini and Postel-Vinay (2013). 4.3 Firms Face Information Frictions Assuming that firms and workers face information frictions is not a straight forward exercise in my model. Depending on how this new friction is introduced in the benchmark model, hiring decisions and the unemployment response can become larger or smaller. However, as long as workers face information frictions regarding aggregate conditions, and assuming that the workers’ information set is always a subset of the firms’ information set, wages will have fewer pressures to increase in booms because the value of the outside option (the value of unemployment) is underestimated. Notice that the two equations governing the dynamics of wages and hiring decisions in this model are given by: (cid:104)−→ (cid:105) E W (w ,ω,Ω) = ϑ·E [S ]+E [U(ω,Ω)] (26) I j j I j I h h h (cid:2) (cid:3) κ(q˜v )χ = E Q·J(cid:48)(ω(cid:48),Ω(cid:48)) (27) j j Ij j f 33

where I is the information set of firm j. Hence, wages depend on workers’ expectaj tions, while hiring decisions depend on firms beliefs. In the benchmark model, workers underestimate changes in S and U (ω,Ω) reducing the volatility of wages. This rises j the volatility of Jj, which increases firms’ hiring decision. In Appendix H, I present three variations of my benchamark model in which firms face information frictions. I show that models with information frictions, in general, continue to display larger unemployment responses to TFP shocks than a model with full information. Also, I show that wage responses continue to display a clear hump-shaped response to TFP shocks in models with information frictions because workers take time to understand changes in the economy. 5 Conclusion I propose a new mechanism for sluggish wages based on workers’ noisy information about the state of the economy. In my model, workers receive noisy signals about the currentstateoftheeconomyandlearnslowlyaboutaggregateconditions. Hence, wages do not immediately respond to a positive aggregate shock because workers do not (yet) have enough information to demand higher wages. This delayed adjustment in wages increases firms’ incentives to post more vacancies, making unemployment more volatile and sensitive to aggregate shocks. My calibrated model is able to explain 70% of overall unemployment volatility. My model is robust to two major critiques of existing theories of sluggish wages and volatile unemployment: the flexibility of wages for new hires and the cyclicality of the opportunity cost of employment. On the one hand, my model predicts a very cyclical opportunity cost of employment, as the value of non-working activities in terms of consumption increases in expansions. On the other hand, my model assumes flexible wages for new hires and generates a wage semi-elasticity with respect to the unemployment rate for new hires and all workers of around -3% and -1% respectively, which is similar to the estimate of Pissarides (2009) and larger than the estimates of Hagedorn and Manovskii (2013), Gertler et al. (2014) and my own estimations using the CPS and IPUMS-CPS (Flood et al., 2015) microdata (-0.27% for all workers, -1.66% for new employees, and -0.57% for job changers). Consistent with recent empirical evidence (e.g. Kahn & McEntarfer, 2014; Halti- 34

wanger et al. 2015), my model predicts that high-wage highly productive firms expand employment more than low-wage firms and also exhibit larger wage adjustments in expansions. This differential growth rate implies that the distribution of new hires shifts to the most productive and highest paying firms in response to positive productivity shocks. This has important consequences for new hires, as they find more and better paying jobs in expansions. I use the University of Michigan Surveys of Consumers and my wage series constructed from the CPS and IPUMS-CPS microdata to test whether wages increase when workers are more optimistic about economic conditions. Even though workers’ expectations and wages are positively correlated, this relationship is only statistically significant at quarterly frequencies. I estimate that a 1 standard deviation increase in my measure of workers’ expectations is associated with a 0.35%, 0.63%, and 0.48% increase in the average wage for all workers, new employees and job changers respectively. Also, I find no statistically significant correlation between unemployment and output expectations from the Survey of Professional Forecasters and wages. References [1] Acharya, Sushant (2014) “Costly information, planning complementarities and the Phillips Curve.” Federal Reserve Bank of New York Staff Report # 698. [2] Andolfatto, David (1996) “Business Cycles and Labor-Market Search.” American Economic Review vol. 86(1), pages: 112-132. [3] Angeletos, George-Marios, and Jennifer La’O (2012) “Optimal Monetary Policy with Informational Frictions.” NBER Working Paper # 17525. [4] Barnichon, Regis (2010) “Building a Composite Help-Wanted Index.” Economics Letters vol. 109(3), pages: 175-178. [5] Branch, William A. (2004) “The Theory of Rationally Heterogeneous Expectations: Evidence from Survey Data on Inflation Expectations.” The Economic Journal vol. 114(497), pages: 592-621. 35

[6] Brugemann, Bjorn, and Giuseppe Moscarini (2010) “Rent Rigidity, Asymmetric Information, And Volatility Bounds in Labor Markets.” Review of Economic Dynamics vol. 13(3), pages: 575-596. [7] Carroll, Christopher D. (2003) “Macroeconomic Expectations of Househlds and Professional Forecasters.” Quarterly Journal of Economics vol. 118(1), pages: 269-298. [8] Chodorow-Reich, Gabriel, and Loukas Karabarbounis (2014) “The Cyclicality of The Opportunity Cost of Employment.” NBER Working Paper # 19678. [9] Christiano, Lawrence J., Martin S. Eichenbaum, and Mathias Trabandt (2016) “Unemployment and Business Cycles.” Econometrica vol. 84 (4), pages: 1523- 1569. [10] Coibion, Olivier, and Yuriy Gorodnichenko (2012) “What Can Survey Forecasts Tell Us About Information Rigidities?” Journal of Political Economy vol. 120(1), pages 116-159. [11] Costain, James S., and Michael Reiter (2008) “Business Cycles, Unemployment Insurance, And The Calibration of Matching Models.” Journal of Economic Dynamics and Control vol. 32(4), pages: 1120-1155. [12] Costain, James S., and Anton Nakov (2011) “Distributional Dynamics under Smoothly State-Dependent Pricing.” Journal of Monetary Economics vol. 58(6), pages: 646-665. [13] den Haan, Wouter J., Garey Ramey, and Joel Watson (2000) “Job Destruction and Propagation of Shocks.” American Economic Review vol. 90(3), pages 482- 498. [14] Fallick, Bruce, and Charles A. Fleischman (2004) “Employer-to-Employer Flows intheU.S.LaborMarket: TheCompletePictureofGrossWorkerFlows”Finance and Economics Discussion Series # 2004-34. [15] Flood, Sarah, Miriam King, Steven Ruggles, and J. Robert Warren (2015) “Integrated Public Use Microdata Series, Current Population 36

Survey: Version 4.0. [dataset].” Minneapolis: University of Minnesota. http://doi.org/10.18128/D030.V4.0. [16] Gertler, Mark, Chris Huckfeldt, and Antonella Trigari (2014) “Unemployment Fluctuations, Match Quality, and The Wage Cyclicality of New Hires.” MIMEO New York University. [17] Gertler, Mark, and Antonella Trigari (2009) “Unemployment Fluctuations With Staggered Nash Wage Bargaining.” Journal of Political Economy vol. 117(1), pages: 38-85. [18] Haefke, Christian, Marcus Sonntag, and Thijs van Rens (2013) “Wage Rigidity And Job Creation.” Journal of Monetary Economics vol. 60(8), pages: 887-899. [19] Hagedorn, Marcus, and Iourii Manovskii (2008) “The Cyclical Behavior of Equilibrium Unemployment And Vacancies Revisited.” American Economic Review vol. 98(4), pages: 1692-1706. [20] Hagedorn, Marcus, and Iourii Manovskii (2013) “Job Selection and Wages over the Business Cycle.” American Economic Review vol. 103(2), pages: 771-803. [21] Hall, Robert E. (2005) “Employment Fluctuations with Equilibrium Wage Stickiness.” American Economic Review vol. 95(1), pages: 50-65. [22] Hall, Robert E., and Paul R. Milgrom (2008) “The Limited Influence of Unemployment on The Wage Bargain.” American Economic Review vol. 98(4), pages: 1653-1674. [23] Haltiwanger, John C., Henry Hyatt, and Erika McEntarfer (2015) “Cyclical Reallocation of Workers Across Employers by Firm Size and Firm Wage.” MIMEO, University of Maryland. [24] Horrace, William C., and Ronald L. Oaxaga (2001) “Inter-Industry Wage Differentials And The Gender Wage GAP: an Identification Problem.” Industrial And Labor Relations Review vol. 54(3), pages: 611-618. [25] Kahn, Lisa and Lisa McEntarfer (2014) “Employment Cyclicality and Firm Quality.” NBER Working Paper # 20698. 37

[26] Kennan, John (2009) “Private Information, Wage Bargaining and Employment Fluctuations.” Review of Economic Studies vol. 77(2), pages: 633-664. [27] Klein, Paul(2000)“UsingtheGeneralizedSchurformtoSolveaMultivariateLinear Rational Expectations Model.” Journal of Economic Dynamics and Control vol. 24(10), pages: 1405-1423. [28] Long, Mark C., Kristin M. Dziczek, Daniel D. Luria, and Edith A. Wiarda (2008) “Wage and Productivity Stability in U.S. Manufacturing Plants.” Monthly Labor Review May 2008, pages: 24-36. [29] Lucas, Robert E. (1972) “Expectations and the Neutrality of Money.” Journal of Economic Theory vol. 4(2), pages: 103-124. [30] Mackowiak, Bartosz, and Mirko Wiederholt (2009) “Optimal Sticky Prices Under Rational Innattention.” American Economic Review vol. 99(3), pages: 769-803. [31] Menzio, Guido (2005) “High-Frequency Wage Rigidity.” MIMEO, University of Philadelphia. Journal of Political Economy vol. 119(3), pages: 468-510. [32] Merz, Monika (1995) “Search in The Labor Market And The Real Business Cycle.” Journal of Monetary Economics vol. 36(2), pages: 269-300. [33] Moscarini, Giuseppe, andFabienPostel-Vinay(2008)“TheTimingofLaborMarket Expansions: New Facts And a New Hypothesis.” in NBER Macroeconomics Annual 2008, Volumne 23 ed. by D. Acemoglu, K. Rogoff, and M. Woodford. University of Chicago Press. [34] Moscarini, Giuseppe, and Fabien Postel-Vinay (2012) “The Contribution of Large and Small Employers to Job Creation in Times of High and Low Unemployment.” American Economic Review vol. 102(6), pages: 2509-2539. [35] Moscarini, Giuseppe, and Fabien Postel-Vinay (2013) “Stochastic Search Equilibrium.” Review of Economic Studies vol. 80(4), pages: 1545-1581. [36] Muller, Andreas I. (2012) “Separations, Sorting And Cyclical Unemployment.” IZA Discussion Paper # 6849. 38

[37] Reiter, Michael (2009) “Solving heterogeneous-agent models by projection and perturbation.” Journal of Economics Dynamics and Control vol 33 (3), pages: 649-665. [38] Reis, Ricardo (2006) “Inattentive Producers.” Review of Economic Studies vol. 73(3), pages: 793-821. [39] Reis, Ricardo (2006) “Inattentive Consumers.” Journal of Monetary Economics vol. 53(8), pages: 1761-1800. [40] Roberts, John M. (1998) “Inflation Expectations and the Transmission of Monetary Policy.” Federal Reserve Board FEDS working paper # 1998-43. [41] Shimer, Robert(2005)“TheCyclicalBehaviorofEquilibriumUnemploymentand Vacancies.” American Economic Review vol. 95(1), pages: 25-49. [42] Rubinstein, Ariel (1982) “Perfect Equilibrium in a Bargaining Model.” Econometrica vol. 50(1), pages: 97-109. [43] Schmitt, John (2003) “Creating a Consistent Hourly Wage Series From The Current Population Survey’s Outgoing Rotation Group, 1979-2002.” Unpublished Manuscript. Center for Economic and Policy Research. [44] Sims, Christopher A. (2003) “Implications of Rational Inattention.” Journal of Monetary Economics vol 50(3), pages: 665-690. [45] Sims, Christopher A. (2002) “Solving Linear Rational Expectations Models.” Computational Economics vol. 20(1-2), pages: 1-20. [46] Solon, Gary, Robert Barsky, and Jonathan A. Parker (1994) “Measuring The CyclicalityofRealWages: HowImportantIsCompositionBias?”Quarterly Journal of Economics vol. 109(1), pages: 1-25. [47] Venkateswaran, Venky (2013) “Heterogeneous Information and Labor Market Fluctuations.” MIMEO, Penn State University. 39

A Proofs A.1 Proof of Lemma 1 If all agents in the economy have complete and perfect information, the following strategy profiles constitute the unique sub-game perfect Nash equilibrium of this game: • Worker: To accept only wage offers greater than or equal to x∗ (first stage), and to demand a wage equal to y∗ (second stage). • Firm: To offer x∗ (first stage) and to accept only wage demands that are less than or equal to y∗ (second stage). →− →− where x∗ and y∗ are such that J (x∗,ω ,Ω) = (1−ϑ)·S and J (y∗,ω ,Ω) = 0. j j j j j Proof. I begin at the third stage of the game (i.e. when the worker makes an offer). At →− this stage, the firm will accept any wage demand y as long as J (y,h ,Ω) ≥ 0. Hence, j j →− the worker will demand a wage y∗ such that J (y∗,h ,Ω) = 0 and she keeps all the j j match surplus. Thus, at the second stage (i.e. when the worker has to accept or reject the firm’s offer), the worker knows that if she rejects this offer, her expected payoff at the third stage will be ϑ·S . Therefore, she will only accept wage offers that are greater j −→ than or equal to x∗ where W (x∗,ω,Ω)−U = ϑ·S . Finally, at the first stage of the j j game (i.e. when the firm makes an initial offer), the firm anticipates a payoff of zero if →− it makes an offer less than x∗ and a payoff of J (x,h ,Ω) if x ≥ x∗. Hence, the firm j j offers exactly x∗ to the worker and she accepts it. A.2 Proof of Lemma 2 Suppose that agents are information-constrained as described in section 2.3. If there is an equilibrium in which firms’ strategy is to reveal the aggregate state of the economy, the best strategy for firms is the same strategy described in Lemma 1. Proof. As we are considering the equilibrium of the game, if firms are following a revealing strategy, workers know it and behave rationally. As a consequence, workers can perfectly infer the current state of the economy based on the firm’s wage offer. Hence, a worker knows that she will receive, in expectation, ϑ · S if she rejects a j firm’s offer. Therefore, the optimal strategy for workers is the following: 40

• To infer the current level of the aggregate productivity based on firm’s offer x: a = x−1(a) • To accept only wage offers greater than or equal to x∗ where: −→ W (x∗,ω,Ω)−U = ϑ·S j j →− J (x∗,ω,Ω) = 0 j • To demand a wage equal to y∗ if she has the chance such that: −→ W (y∗,ω,Ω)−U = S j j Now, given the workers’ strategy, the firm anticipates a payoff of zero if it makes an →− →− offer less than x∗ and a payoff of J (x,h ,Ω)−U if x ≥ x∗. Given that J (x,h ,Ω) j j j j is strictly decreasing in x, the optimal strategy for firms, assuming that they follow a revealing strategy is the following: • To offer x∗. • To accept only wage demands that are less than or equal to y∗. As a consequence, if there exists an equilibrium in which firms reveal the true state of the economy, in equilibrium firms offer exactly x∗ and workers will accept it. In other words, workers rationally believe that if a firm extends a wage offer x, it has to be the case that x = x∗. A.3 Proof of Lemma 3 If agents in the economy are information-constrained as described in section 2.3, then in equilibrium, firms do not follow a strategy in which they perfectly reveal the true state of the economy. Proof. Suppose not. By Lemma A.2, if there is an equilibrium in which firms reveal the true state of the economy, firms always offer x = x∗ and workers accept all wage offers (x) because they rationally believe that x is always equal to x∗. However, in order for these strategies to be an equilibrium, firms cannot have incentives to deviate. 41

Suppose that firms deviate to a strategy in which they offer x˜ = 0.5x∗. Workers will accept this offer because they believe x˜ = x∗, and firms will be better off because J (x˜) > J (x∗). Therefore, there is not an equilibrium in which firms reveal the true j j state of the economy. A.4 Proof of Lemma 4 If agents in the economy are information-constrained as described in section 2.3, the following strategy profiles constitute a Perfect Bayesian Nash equilibrium that satisfies the intuitive criterion: • Worker: To accept only wage offers greater than or equal to x∗∗ (first stage), and to demand a wage equal to y∗∗ (second stage). • Firm: to offer x∗∗ (first stage), and to accept only wage demands that are less than or equal to y˜∗∗. (cid:104)→− (cid:105) (cid:104)→− (cid:105) wherex∗∗ andy∗∗ aresuchthatE J (x∗∗,ω ,Ω) = (1−ϑ)E [S ]andE J (y∗∗,ω ,Ω) = I j j I j I j j h h h 0. Proof. I begin at the third stage of the game (i.e. when the worker gets to make an offer). At this stage, the firm will accept any wage demand y as long as its expected value is greater than or equal to zero. Given the firm’s strategy, the firm’s offer does not reveal its information. Therefore, the worker will demand a wage y∗∗ such that, given her information set, the firm’s value is zero. Thus, at the second stage (i.e. when the worker has to accept or reject the firm’s offer), the worker knows that if she rejects this offer, her expected payoff at the third stage will be ϑ·E [S ]. Therefore, she will I j h only accept wage offers that are greater than or equal to x∗∗. Finally, at the first stage of the game (i.e. when the firm makes an offer), the firm anticipates a payoff of zero if →− it makes an offer less than x∗∗ and a payoff of J (x,h ,Ω) ≥ 0 if x ≥ x∗∗. Hence, the j j firm offers exactly x∗∗ to the worker and she accepts it. To prove that this equilibrium satisfies the intuitive criterion, define Θ as the set of ˆ allpossiblerealizationsof{a,n}. Then, for agiveninformationsetI , defineΘ(w) ⊆ Θ h as the set of pairs {a,n} for which a wage offer w is not equilibrium dominated for firm 42

j. Hence: Θ ˆ (w > x∗∗) = ∅ (28) Θ ˆ (w < x∗∗) = Θ (29) Any wage offer above the equilibrium wage x∗∗ is always equilibrium dominated. Given that a worker is willing to work for a wage x∗∗, offering a higher wage will only reduce the firm’s profits. However, wage offers below the equilibrium wage are not equilibrium dominated for any realization of {a,n} because firm’s profits will increase if a worker is willing to work for a lower wage. As a consequence, if a worker receives a wage offer below the equilibrium wage, she cannot extract more information from that offer. Regardless of the realization of a and n, offering a lower wage could always be a profitable deviation for firms. Formally: Pr(a = x|I ,w < x∗∗) = Pr(a = x|I ) ∀ x ∈ Θ (30) h h E[a|I ,w < x∗∗] = E[a|I ] (31) h h Conditional on receiving a wage offer below the equilibrium wage, the worker’s expectations do not change. 43

B Computation Tocomputethesolutiontothismodelnumerically, itisimportanttofindanddetermine a law of motion for the economy, based on which the household forms expectations and makes decisions. This task may not be simple for a large vector Ω, given a distribution of firms and workers. Another challenge is that the vector of state variables should incorporate a set of variables that capture agents’ beliefs, which could include prior beliefs about Ω or lags of the vector of state variables. This could not only increase significantly the size of the vector of state variables but also may require the computation of higher order expectations (expectations of agents’ expectations). Hence, I propose a procedure that combines the solution method for heterogeneous agent models developed by Reiter (2009) and the Kalman filter. In particular, the law of motion for the aggregate economy will be linear (Reiter method), which will allow me to form expectations using the Kalman filter. I show that I can keep track of agents’ expectations by keeping track of the last T realizations of the aggregate shocks in the economy, where T is a “large” integer.18 Hence, to solve the model with noisy signals, you only need to include the last T realizations of the aggregate shocks as state variables.19 In more detail, the Reiter method solves heterogeneous agent models by taking a first-order approximation of the model around the deterministic steady state of the economy.20 Assume that the following system of equations describes the equilibrium of the economy: f(Ω,Ω(cid:48),Υ,Υ(cid:48),E) = 0 (32) where Υ is the vector of endogenous variables of the economy and E is the vector of exogenous shocks. The Reiter method then finds the solution in three steps: 1. A finite representation of the economy is provided by discretizing the distribution of agents. 18T is large enough if T˜ >> T generates almost the same results. In my calibration, I find that T=100 was more than enough. 19Notice that vector Ω in the equilibrium definition in section 2.6 included the last T realization of the aggregate shocks. 20For a detailed application of the Reiter method, see Costain and Nakov (2011). 44

2. The deterministic steady state of the economy is found by imposing E = 0 and finding the solution to: f∗ = f(Ω∗,Ω∗,Υ∗,Υ∗,0) = 0 (33) 3. The model is linearized numerically around the steady state, which yields the system of linear equations: f∗(Ω−Ω∗)+f∗(Ω(cid:48) −Ω∗)+f∗(Υ−Υ∗)+f∗(Υ(cid:48) −Υ∗)+f∗E = 0 (34) 1 2 3 4 5 where f∗ is the partial derivative of (33) with respect to its i-th argument. This i system is solved using a standard method such as Sims (2002) or Klein (2000). Hence, the Reiter method induces a law of motion for the economy of the form: Ω(cid:48) = FΩ+E (35) Υ = GΩ (36) where F and G are matrices of coefficients. Therefore, the law of motion for the economy is described by λ = {F,G}. The challenge for a model with information frictions comes from the fact that the law of motion λ is derived from a perceived law of motion λh, which in equilibrium has to be equal to the actual law of motion λ. I exploit the linearity of the Reiter method and proceed as follows:21 1. Define a tolerance level. 2. Guess a linear law of motion for the economy λh{1} = {Fh{1},Gh{1}}. A good initial guess may be the law of motion of the model under perfect information. 3. Let the household form expectations based on this guess and the Kalman filter, which is explained in 10. 4. Find the solution of the model using the Reiter method, which is given by a new law of motion λ{1} = {F{1},G{1}}. 21The linearity of the model makes the model tractable as I can compute expectations based on a linear filter. Otherwise, I would need to use non-linear filters (such as the particle filter), which would substantially increase the complexity of the problem for a large vector Ω. 45

5. If the maximum difference between λh{1} and λ{1} is less than the predetermined tolerance level, stop and conclude that λh{1} = λ. Otherwise, update the household’s perceived law of motion as follows: λh{n+1} = d·λh{n} +(1−d)·λ{n}; 0 < d < 1 (37) where d is a fraction that determines how smoothly the guess is updated. 6. Go back to step 3. B.1 Computing Expectations In this subsection, I show how to compute workers expectations given a vector of state variablesΩ = {k,{h }1 ,aT,nT}, andalinearlawofmotionfortheeconomyasin(35) j j=0 and (36). Reiter (2009) shows how to find the aggregate law of motion for the economy for heterogeneous agent models. In this subsection, I focus on how to compute workers’ expectations, which is the novel part of my paper. In particular, I show that we only need to keep track of the last T realization of the exogenous state variables in order to compute expectations. Therefore, we do not need to include as a state variable agents’ beliefs or the realization of vector Ω, T periods ago. This represents a significant gain in efficiency because the dimensionality of the problem does not significantly increase. Solving numerically this model, I found that setting T = 100 was more than enough, ˜ meaning that a value of T >> T did not represent any significant difference. However, the optimal value for T is a function of how informative the signal is. For example, when ρ and ρ are close, it becomes more difficult to distinguish between TFP and a n noise shocks. Agents’ expectations take longer to converge to the true values, and a larger value of T is needed. This note is based on the Kalman filter and follows the notation of Hamilton (1994). I start with some definitions. Then, I show how to compute expectations regarding the vector of state variables. I conclude by computing expectations about endogenous variables and forecasting economic conditions. 46

B.1.1 Preliminary Definitions Define e = [a n ](cid:48) as the vector of exogenous state variables, which evolves according t t t to: e = Fe +V (38) t+1 t t (cid:34) (cid:35) ρ 0 a F = (39) 0 ρ n E[V V(cid:48)] = Q (40) t t (cid:34) (cid:35) ς 0 a = (41) 0 ς n (cid:104) (cid:105)(cid:48) ˜ ˜ The vector of states variables Ω can be partitioned as Ω = Ω e where Ω is the t t t vector of endogenous predetermined state variables. Then, the dynamics for Ω and Υ (vector of non-predetermined variables) are given by: Ω = FΩ +E (42) t+1 t t Υ = JΩ (43) t t E = [∅ V ] (44) t t (cid:34) (cid:35) F F F = Ω˜ e (45) ∅ F E[E E(cid:48)] = R (46) t t (cid:34) (cid:35) ∅ ∅ = (47) ∅ Q Hence, assuming that vector Ω was perfectly observed at time t−T , we only need to from expectations about e to infer agents’ beliefs regarding other variables in the economy. B.1.2 Computing Expectations about e Forming beliefs about e is a classic signal extraction problem. In this section, I show that the expectation of e conditional on information available at time x, which is t 47

denoted by e can be expressed as a linear combination of the last T −1 shocks. t|x In this case, the signal is given by aˆ = a + n, and the evolution of conditional expectations (e is given by: t|x a = H(cid:48)e (48) t t (cid:0) (cid:1) e = Fe +B a −H(cid:48)Fe (49) t|t t|t−1 t t t|t−1 where matrices H and B : t (cid:104) (cid:105)(cid:48) H = 1 1 (50) B = P H (cid:0) H(cid:48)P H (cid:1)−1 (51) t t|t−1 t|t−1 P = FP F(cid:48) +Q (52) t|t−1 t|t P = P −P H (cid:0) H(cid:48)P H (cid:1)−1 P (53) t|t t|t−1 t|t−1 t|t−1 t|t−1 P = ∅ (54) t−T|t−T It can be verified that: T−1 (cid:88) e = e + C V (55) t|t t t−j t−j j=0 C = (B H(cid:48) −I) (56) t t C = −C F (B H(cid:48) −I) 0 < j <= T (57) t−j t−j+1 t−j ˜ B.1.3 Computing Expectations about Ω ˜ We can rewrite the law of motion for Ω as follows: t Ω ˜ = F Ω ˜ +F e (58) t Ω˜ t−1 e t−1 T−2 (cid:88) = FTΩ ˜ +FT−1F e + Fj F e (59) Ω˜ t−T Ω˜ e t−T Ω˜ e t−1−j j=0 ˜ ˜ Given that workers perfectly know Ω and e , the expected value of Ω given t−T t−T 48

information available at time t is given by: T−2 (cid:88) Ω ˜ = FTΩ ˜ +FT−1F e + Fj F e (60) t|t Ω˜ t−T Ω˜ e t−T Ω˜ e t−1−j|t−1−j j=0 Combining, (59), (60), and (55), we get: T−2 (cid:88) Ω ˜ = Ω ˜ + Fj F V ˜t−1−j (61) t|t t Ω˜ e j=0 T (cid:88) V ˜t−1−j = C V (62) t−1−j−i t−1−j−i i=0 According to (55) and (62), I only need to keep track of the last T realization of ˜ e instead of keeping track of the whole vector of expectations Ω and e to compute t|t t|t workers expectations.22 B.1.4 Expectations about Υ and Forecast It is now straight forward to define the expectations of Υ given information available at time t. Υ = GΩ (63) t|t t|t (cid:104) (cid:105)(cid:48) ˜ Ω = Ω e (64) t|t t|t t|t (cid:104) (cid:105) Hence, partitioning matrix G = G G , the forecast f periods ahead is given by: Ω˜ e    f   Ω ˜ F F ∅ Ω ˜ t+f|t Ω˜ e t|t       e  =  ∅ F ∅  e  (65) t+f|t t|t       Υ G G ∅ Υ t+f|t Ω˜ e t|t 22Notice that you can also keep track of V directly. 49

C Other Tables Table 9: Wage Distribution Wage Quintile Fraction Monthly Earnings (% of Lowest) of Firms Earnings Data Model Lowest 20% 1,842.16 1.00 1.00 2sd 20% 2,754.87 1.50 1.40 3rd 20% 3,458.19 1.88 1.86 4th 20% 4,354.70 2.36 2.42 Highest 20% 6,665.13 3.62 3.62 All 100% 3,815.01 2.07 2.06 Note: This table reports average monthly earnings by pay quintile. Average earnings are weighted by employment. Data source is Kahn and McEntarfer (2014). Table 10: Firm Size Distribution Firm Size Fraction Average Size (% of Smallest) of Firms Size Data Model 1 to 4 55.1% 2.23 1.00 1.00 5 to 9 21.1% 6.66 2.99 6.21 10 to 19 12.2% 13.70 6.15 14.73 20 to 49 7.4% 30.75 13.81 28.74 50 to 99 2.3% 69.85 31.37 48.24 100 to 249 1.2% 152.78 68.62 68.87 250+ 0.7% 1,648.26 740.34 117.00 All 100.0% 21.64 9.72 8.79 Note: This table reports firm size statistics for the United States. for the period 1977 to 2014. Firm sizeisdefinedasnumberofemployeesperfirm. Averagesizeiscomputedastotalnumberofemployees over total number of firms. Calculations based on U.S. Census Bureau, Business Dynamics Statistics, https://www.census.gov/ces/dataproducts/bds/. 50

D Other Figures Figure 8: Wage and Size Distribution in Steady State: Model vs. Data (a) Wage Distribution (b) Size Distribution 80.00 4.00 80.00 4.00 Data Model Data Model Data Model Data Model 70.00 3.50 70.00 3.50 60.00 3.00 60.00 3.00 50.00 2.50 50.00 2.50 40.00 2.00 40.00 2.00 30.00 1.50 30.00 1.50 20.00 1.00 20.00 1.00 10.00 0.50 10.00 0.50 - - - - 1 to 4 5 to 9 10 to 19 20 to 49 50 to 99100 to 249 All Lowest 2sd 3rd 4th Highest All 1 to 4 5 to 9 10 to 19 20 to 49 50 to 99100 to 249 All Lowest 2sd 3rd 4th Highest All Note: This figure plots the distributions of employment (panel 8b) and wages (panel 8a). Gray bars plot the model distribution in steady state, while the black bars represent the distribution for the United States. Firms with more than 250 employees are excluded in panel 8b to preserve the scale. Calculations based on Kahn and McEntarfer (2014) and U.S. Census Bureau, Business Dynamics Statistics, https://www.census.gov/ces/dataproducts/bds/. Figure 9: Simulated mJ =min {J } j j Note: ThisfigureplotstheevolutionofmJ =min {J }indicatingthatAssumption1isneverviolated. j j 51

Figure 10: Evolution of Workers’ Beliefs Dynamics in Response to a 1% Increase in Aggregate TFP Note: Thisfigureplotstheevolutionofworkers’beliefsandexpectationsinresponsetoa1%increasein aggregateproductivity. Inthetoppanels,solidblacklinesrepresentthetrueevolutionofeachvariable, anddashedblacklinesdepictworkers’expectationsaboutthosevariables. Thebottompanelsplotthe difference between the true realization and the workers’ expectations. Figure 11: Index of Consumer Sentiment Note: This Figure plots the Index of Consumer Sentiment (ICS) from the University of Michigan Surveys of Consumers. The ICS is re-scaled such that it has a 0 mean and a standard deviation equal to 1. Shadow areas represent NBER recession dates. The sample period is January 1979 to December 2015. Data source is University of Michigan, Surveys of Consumers, https://data.sca.isr.umich.edu. 52

E Detailed Household’s Problem This appendix presents the household’s problem in recursive form and the complete derivation of the employment and unemployment functions. The household’s utility function is given by: c1−σ h ˜1+η U(ω,Ω) = −Ψ +βE[U(ω(cid:48),Ω(cid:48))] (66) 1−σ 1+η Hence, the household’s problem is: max E {U(ω,Ω)} (67) I c,k(cid:48),{h(cid:48)}1 h j j=0 subject to the budget constraint, the law of motion of labor, and the perceived law of motion of the economy: (cid:90) 1 (cid:90) 1 c+k(cid:48) = (r+1−δ )k + w h dj + π dj +b·u−T (68) k j j j 0 0 (cid:16)v (cid:17) (cid:90) j (cid:16)v (cid:17) h(cid:48) = (1−δ )(1−q¯iF )h +q j u+ q¯i j (1−δ )h dx (69) j h j j v v h x 0 (cid:18)(cid:90) 1 (cid:19) 1+ 1 ξ h ˜ = h1+ξdj (70) j 0 (cid:90) 1 u = (1−h )dj (71) j 0 Ω(cid:48) = λh(Ω) (72) where E [·] is the expectation conditional on the household information set I . ω = I h h {k,{h },I } is the vector of state variables for household, and Ω is a vector that j h summarizes the aggregate state of the economy. Letting φ and φ denote the Lagrange c j 53

multipliers for equations (68) and (69), the first order conditions are given by: (cid:8) (cid:9) c : E c−σ −φ = 0 (73) I c h k(cid:48) : E {−φ +βφ(cid:48)(r(cid:48) +1−δ )} = 0 (74) I h c c k h(cid:48) : E {−φ −E{βΨh ˜(cid:48)η−ξh(cid:48)ξ +βφ(cid:48) (cid:0) w(cid:48) −b (cid:1) j I h j j c j (cid:90) 1 (cid:18) v(cid:48) (cid:19) +(1−δ )(1−q(cid:48)¯iF(cid:48))βφ(cid:48) −q(cid:48) βφ(cid:48) x dx h j j x v(cid:48) 0 (cid:90) 1 (cid:18) v(cid:48) (cid:19) +(1−δ )q(cid:48)¯i βφ(cid:48) x dx}} = 0 (75) h x v j Hence, combining (73) and (75) and lagging one period: E {(W (ω,Ω)−U(ω,Ω))} =E {w −z I j I j j h h +E{Q((1−δ )(1−q¯iF )(W (ω(cid:48),Ω(cid:48))−U(ω(cid:48),Ω(cid:48))) h j j +(1−δ )q¯iF (W ˜ (ω(cid:48),Ω(cid:48))−U(ω(cid:48),Ω(cid:48))) h j j −q (cid:0) W ¯ (ω(cid:48),Ω(cid:48))−U(ω(cid:48),Ω(cid:48)) (cid:1) )}} (76) where: φ (W (ω(cid:48),Ω(cid:48))−U(ω(cid:48),Ω(cid:48))) = j (77) j βφ(cid:48) c Also from the first order conditions, we can verify that the optimality conditions for c is given by: (cid:104) (cid:105) c−σ = βE (1−δ +r(cid:48))c(cid:48)−σ (78) I h 54

F Recursive Competitive Equilibrium (Equations) This appendix presents the equations that characterize the recursive competitive equilibrium. (cid:104) (cid:105) c−σ = βE (1−δ +r(cid:48))c(cid:48)−σ (79) I h (cid:2) (cid:3) κ (q˜v ) = E ·Q·J(cid:48)(ω(cid:48),Ω(cid:48)) (80) v j j j f (cid:18) (cid:19)(cid:18) (cid:19) h α j r = p (81) j k 1−α j h(cid:48) = (1−δ )(1−¯iqF )h +q˜v (82) j h j j j j (cid:90) 1 c+k(cid:48) = (r+1−δ )k + [w h +π ]dj (83) k j j j 0 z = b+Ψcσh ˜ηhξ (84) j j j π = y −w h −rk −κ(q˜v ) (85) j j j j j j j (cid:18) c(cid:48)(cid:19)−σ Q = β (86) c (cid:16)v(cid:17) θ = (87) s y = eaj+akαh1−α (88) j j j (cid:90) 1 v x F = dx (89) j v j q˜ = q˜u +q˜c (90) j j (cid:16)u(cid:17) q˜u = q˜· (91) s (cid:18)(cid:90) j (1−δ )¯ih (cid:19) q˜c = q˜· h x dx (92) j s 0 q = m(v,s)/s (93) (cid:90) 1 v = v (ω ,Ω)dj (94) j f 0 55

(cid:90) 1 y = y (ω ,Ω)dj (95) j f 0 (cid:90) 1 s = u+ ¯ih (ω ,Ω)dj (96) j f 0 (cid:90) 1 u = (1−h (ω ,Ω))dj (97) j f 0 (cid:90) 1 k = k (ω ,Ω)dj (98) j f 0 (cid:104)−→ (cid:105) ϑ·E [S ] = E W (w ,ω,Ω)−U(ω,Ω) (99) I j I j j h h c1−σ h ˜1+η U(ω,Ω) = −Ψ +βE[U(ω(cid:48),Ω(cid:48))] (100) 1−σ 1+η (cid:18)(cid:90) 1 (cid:19) 1+ 1 ξ h ˜ = h1+ξdj (101) j 0 (cid:26) (cid:18) (cid:90) 1 v (cid:19)(cid:27) U(ω,Ω) = b+E Q (1−q)·U(ω(cid:48),Ω(cid:48))+q · W (ω(cid:48),Ω(cid:48)) x dx x v 0 (102) (W (ω,Ω)−U(ω,Ω)) = w −z j j j +E{Q((1−δ )(1−q¯iF )(W (ω(cid:48),Ω(cid:48))−U(ω(cid:48),Ω(cid:48))) h j j +(1−δ )q¯iF (W ˜ (ω(cid:48),Ω(cid:48))−U(ω(cid:48),Ω(cid:48))) h j j −q (cid:0) W ¯ (ω(cid:48),Ω(cid:48))−U(ω(cid:48),Ω(cid:48)) (cid:1) )} (103) (cid:90) 1 v W ˜ (ω(cid:48),Ω(cid:48)) = W (ω(cid:48),Ω(cid:48)) x ·F−1dx (104) j x v j j t (cid:90) 1 v W ¯ (ω(cid:48),Ω(cid:48)) = W (ω(cid:48),Ω(cid:48)) x dx (105) x v 0 (cid:2) (cid:3) Π (ω ,Ω) = π +E QΠ (ω(cid:48),,Ω(cid:48)) (106) j f j j f J (ω ,Ω) = p −w +E (cid:2) Q·(1−δ )(1−¯iqF )·J (ω(cid:48),Ω(cid:48)) (cid:3) (107) j f j j h j j f S = J (ω ,Ω)+W (ω,Ω)−U(ω,Ω) (108) j j f j 56

aˆ = a+n (109) a(cid:48) = ρ ·a+e(cid:48) (110) a a n(cid:48) = ρ ·n+e(cid:48) (111) n n 57

G Unilateral Deviations in Equilibrium (Details) Nothing in my model prevents firms from paying a higher wage in equilibrium. Even though higher wages increase firms’ payroll, offering a higher wage can reduce the fraction of workers who are poached by other firms, and as a consequence, increase profits. When workers’ bargaining power is low, firms incentives to offer higher wages could be large. However, in my calibration with a bargaining power equal to 0.5, no firm has incentives to offer a higher wage. Suppose that firm j is considering offering a higher wage to its employees in order to reduce its separation rate and increase its profits. Hence, following Moscarini and Postel-Vinay (2013), the problem for firm j would be given by: κ Π (ω ,W ¯ ,Ω) = max eaj+akαh1−α −w h −rk − (q˜v )1+χ +E (cid:2) QΠ (ω(cid:48),W(cid:48),Ω(cid:48)) (cid:3) j f vj,kj,wj,W j (cid:48) j j j j j 1+χ j j j f j (112) s.t. h(cid:48) = (1−δ )(1−¯iqF(W(cid:48)))h +q˜(W(cid:48))v (113) j h j j j j E [W ] ≥ ϑ·E [S ] (114) I j I j h h (cid:2) (cid:3) (cid:2) (cid:3) E W(cid:48) ≥ ϑ·E S(cid:48) (115) I h j I h j ¯ E [W ] ≥ W (116) I j h Ω(cid:48) = λf(Ω) (117) v , k ≥ 0 (118) j j where restrictions (114) and (115) imply that the wage offer cannot be lower than what workers would get otherwise, and restriction (116) implies that the continuation value for workers cannot be lower than what was promised at the end of the last period. Given that the optimality conditions for k and v do not change, let me concentrate j j on W(cid:48). Given that E [W ] = max{ϑ·E [S ],W ¯ }, the optimality condition for W(cid:48) j I h j I h j j is: (cid:20) ∂F(W(cid:48))(cid:21) E −Qq˜(W(cid:48))v +QJ(cid:48)(ω ,W(cid:48),Ω)h (1−δ )¯iq j ≤ 0 (119) j j f j j h ∂W(cid:48) j (cid:2) (cid:3) equation (119) holds with equality if W(cid:48) > ϑ·E S(cid:48) . If firm j increase its wage j I h j 58

in one unit, its wage bill will increase by q˜(W(cid:48))v next period, but it will retain an j j additional fraction (1 − δ )¯iq ∂F(W j (cid:48)) of its current employees, each one of which will h ∂W(cid:48) j increase firm’s profits by J(cid:48)(ω ,W(cid:48),Ω).23 Hence, in equilibrium, equation (115) holds f j with equality if: (cid:20) v ∂F(W(cid:48))(cid:21) Π = E −Qq˜ j +QJ(cid:48)(ω ,Ω)(1−δ )¯iq j ≤ 0 (120) j j h f h ∂W(cid:48) j j Figure 12 shows that Π is negative for all firms in steady state. Hence, for this j particularcalibration,firmsdonothaveincentivestounilaterallydeviateinequilibrium. Figure 12: Π For All Firms in Steady State j Note: This figure plots the value of Π for each firm in steady state (y axis) as a function of marginal j labor productivity for each firm in steady state (x axis). 23MoscariniandPostel-Vinay(2013)showthatthewagebilleffect(thefirstterminequation(119)) does not depend on the firm’s initial size (h ). This is because firms are indifferent about the timing j of wages paid to deliver a utility label W =max{ϑ·S ,W¯}, since workers and firms share the same j j discount factor. 59

H Firms Face Information Frictions Assuming that firms and workers face information frictions is not a straight forward exercise in my model. Depending on how this new friction is introduced in the benchmark model, hiring decisions and the unemployment response can become larger or smaller. However, as long as workers face information frictions regarding aggregate conditions, and assuming that the workers’ information set is always a subset of the firms’ information set, wages will have fewer pressures to increase in booms because the value of the outside option (the value of unemployment) is underestimated. Notice that the two equations governing the dynamics of wages and hiring decisions in this model are given by: (cid:104)−→ (cid:105) E W (w ,ω,Ω) = ϑ·E [S ]+E [U(ω,Ω)] (121) I j j I j I h h h (cid:2) (cid:3) κ(q˜v )χ = E Q·J(cid:48)(ω(cid:48),Ω(cid:48)) (122) j j Ij j f where I is the information set of firm j. Hence, wages depend on workers’ expectaj tions, while hiring decisions depend on firms beliefs.In the benchmark model, workers underestimate changes in S and U (ω,Ω) reducing the volatility of wages. This rises j the volatility of Jj, which increases firms’ hiring decision. Now let me consider the following three variations of my benchmark model: Model 2: Firms and workers have the same information set. Aggregate productivity (a) is never observed, and all agents have to form expectations based on the public and aggregate signal aˆ. Model 3: Firms and workers have the same information set. Employers and workers at firm j observe their overall productivity (a +a) at all times but cannot decompose j unexpected changes into aggregate and idiosyncratic shocks. Model 4: Workers’ information set is as described in section 2.3. Firms observe their overall productivity (a + a) at all times but cannot decompose unexpected changes j into aggregate and idiosyncratic shocks. Firms also observe aggregate public signal aˆ. 60

Figure 13: Impulse Response Functions to a 1% Increase in Aggregate Productivity Comparing Different Information Structures (a) Unemployment (b) Average Wage All Workers (c) Average Wage New Employees (d) Average Wage Job Changers Note: This figure plots the Impulse Response Function of unemployment and wages to a 1% increase inaggregateTFP.SolidblacklinesaretheIRFsofamodelinwihchworkersfaceinformationfrictions and firms have perfect information. Dashed black lines are the IRFs generated by a model in which all agents have perfect information. Models 2 (solid gray lines), 3 (dashed gray lines), and 4 (dotted gray lines) assume that firms and workers face information frictions as described in section 4.3. In Models 2 and 4, wages will continue to be very sluggish because workers underestimate the responses of S and U (ω,Ω) to TFP shocks. However, in Model 3, workers’ j expectations about S will tend to be more volatile than in the benchmark model while j changes in U (ω,Ω) will continue to be undervalued. Hence, wages will tend to be more procyclical in Model 3 than in my bench mark model. When looking at hiring decision in each one of these models, we should observe a lower hiring response to TFP shocks in Model 2 than in my benchmark model, as firms’ expectations about J (ω ,Ω) do not significantly change. Nonetheless, hiring j j responses in Model 3 should be larger than in Model 2 as firms’ expectations regarding the value of an additional worker (equation (122)) become more volatile. In Model 3, firms anticipate a larger productivity in the future but do not expect a significant increase in their separation rate and wages in response to TFP shocks, which makes 61

Table 11: Wage Semi-Elasticities with Respect to the Unemployment Rate Model Simulated Data Benchmark Model 2 Model 3 Model 4 All Workers -1.12 0.32 -0.02 -0.72 New Employees -2.65 3.49 -4.77 -7.15 Job Changers -2.93 1.69 -6.05 -8.81 Note: Thistablereportsthetheoreticalwagesemi-elasticitieswithrespecttotheunemployment rate based on equation (24). In the benchmark model only workers face information frictions. In models 2, 3, and 4 firms and workers face information frictions as described in this Appendix. firms hire more workers in booms. Finally, hiring responses in Model 4 should be even larger than in Model 3. In Model 4, in response to a positive TFP shocks, firms expect a lower change in wages than in model 3 because workers’ expectations about S are j lower in Model 4 than in Model 3. EventhoughhiringdecisionsinmybenchmarkmodelshouldbelargerthaninModel 2butlowerthaninModel4,therelationshipbetweenModel3andmybenchmarkmodel is not clear. In Model 3, firms expect a lower increase in their separation rate but a larger increase in their wages than in my benchmark model. However, both models should display larger unemployment responses than a model with perfect information. Figure 13 plots the IRFs for unemployment and wages generated by these models. For these responses, I assumed that the persistence of idiosyncratic productivity shocks is equal to ρ . a As expected, Models 2 and 4 generate smaller and larger unemployment responses, respectively, thanthebenchmarkmodel. However, forthisparticularcalibration, Model 3 displays smaller unemployment responses to TFP shocks than my benchmark model. In Models 3 and 4, unemployment peaks earlier and is less persistent than in other models because firms overreact to aggregate shocks, and they compensate for these overreactions in later periods when they have amassed more information. For example, in response to a positive TFP shock, firms post a lot of vacancies on impact, but they reduce the number of vacancies (post less) as they learn about aggregate conditions and realize that the value of an additional worker is not as high as they had thought. In contrast, wage responses are larger in Models 3 and 4, but lower in Model 2, than in the benchmark model. As long as workers have more information (Model 3) or 62

Table 12: Differential Net Job Flows, Coefficient on Cyclical Variable. High Wage minus Low Wage. Model Simulated Data Benchmark Model 2 Model 3 Model 4 Deviation from HP Trend Net Job Flows -0.257 -0.696 -0.728 -0.538 Net Poaching Flows -0.292 -0.502 -0.652 -0.503 Net Nonemployment 0.035 -0.036 -0.075 -0.034 Flows First Difference Net Job Flows 0.018 -0.035 0.009 0.012 Net Poaching Flows 0.013 -0.030 0.007 0.009 Net Nonemployment 0.005 -0.005 0.002 0.003 Flows Note: Thistablereportsthetheoreticaldifferentialnetjobflows as discuss in the paper. In the benchmark model only workers faceinformationfrictions. Inmodels2,3and4firmsandworkers face information frictions as described in this Appendix. unemployment is more sensitive to the business cycle (Model 4), workers will demand higher wages in response to TFP shocks. It is worth pointing out that all models with information frictions display hump-shaped wage responses because workers demand higher wages as they learn that the economy is in an expansion and that the value of theoutsideoptionisgreaterthantheythought. However, thesemodelstendtogenerate larger wage semi-elasticities (Table 11) and differential growth rates (Table 12) than my benchmark model (and the data). 63

I Data Source I assess the model’s predictions using quarterly data for the United States for the period 1979 to 2015. I present business cycle statistics for the quarterly time series (seasonally adjusted) of unemployment, vacancies, output, consumption, investment, aggregate TFP, and real wages (deflated by CPI) for new employees, job changers, and all workers. I take the quarterly average of series that are available monthly. Following Shimer (2005), all variables are HP-filtered in logs with a smoothing parameter of 105.24 Unemployment is the total number of unemployed people from the CPS. Vacancies are the composite help-wanted index computed by Barnichon (2010). Output is real gross domestic product. Aggregate productivity is measured as the Solow residual, which is available and updated on the Federal Reserve Bank of San Francisco’s website. Consumption consists of non-durable goods and services. Finally, investment is real gross private domestic investment. I include investment as a variable of interest because theeffectoftheinformationfrictiononinvestmentplaysanimportantroleinmymodel. Given the debate about the cyclicality of wages, I use the CPS and IPUMS-CPS (Flood et al., 2015) microdata to construct the average hourly wage for three group of workers: all workers, new employees, and job changers. In order to compute these wages, IfollowMuller(2012)andHaefkeetal. (2013)whoalsousedtheCPSmicrodata to construct similar series. The series presented in this paper are the coefficients of time fixed effects in Mincer equations controlling for education, a fourth order polynominal in experience, gender, race, marital status, state, 10 occupation dummies and 14 industry dummies. Since 1994, the CPS has asked individuals whether they still work at the same job as in the previous month, making it possible to identify job changers. However, it is not possible to identify job-to-job transitions prior to that year. Hence, the sample period for the average wage for job changers is 1994-2015. Figure 14 plots these wage series along with the average hourly earnings of production and nonsupervisory employees (red line) for comparison. 24In general my results are not very sensitive to this parameter. 64

Figure 14: Average (Log) Real Hourly Wages Note: This figure plots the average log real hourly wages for all workers (solid black line), new employees (dashed-black lines), and job changers (solid gray line). These series are the coefficient of time fixed effect in a Mincer equation using CPS and IPUMS-CPS microdata. Details of these series are provided in Appendix J. The red line (left axis) is the log real average hourly earnings of production andnonsupervisoryemployees. AllwagesaredeflatedbyCPI.ShadowareasrepresentNBERrecession dates. The sample period is January 1979 to December 2015. Calculations based on: U.S. Bureau of Labor Statistics, Nonfarm Business Sector: Compensation Per Hour [COMPNFB], U.S. Bureau of Labor Statistics, Consumer Price Index for All Urban Consumers. All Items [CPIAUCSL], all retrieved from FRED, Federal Reserve Bank of St. Louis, https://fred.stlouisfed.org/. CPS and IPUMS-CPS (Flood et al., 2015). Based on the series reported in Figure 14, we can see that, on average, new hires fromunemploymentandjobchangersearnawagethatis12.6%and3.7%lowerthanthe average worker, respectively, suggesting that workers who were previously unemployed usually get employed at low paying jobs and move up the job ladder. Notice that these differences in wages cannot be explained by education, experience, gender, race, marital status, state, occupation, or industry, as those variables were included in the Mincer equation. 65

J Wages from CPS and IPUMS-CPS I use the Current Population Survey (CPS) and IPUMS-CPS (Flood, Kind, Ruggles & Warren, 2015) microdata to construct wage series adjusted for workers characteristics. The CPS is the main labor force survey for the United State, and it is the primary source of labor force statistics such as the national unemployment rate. The CPS consists of a rotating panel where households and their members are surveyed for four consecutive months, not surveyed for the following eight months, and interviewed again for another four consecutive months. The CPS includes individual information such as employment status, sex, education, race, and state. However, individual earnings and hours worked are collected only in the fourth and eighth interviews. In addition, since 1994, individuals have been asked if they still work in the same job reported in the previous month, making it possible to identify job changers. IPUMS-CPS is a project from the University of Minnesota that integrates, disseminates and harmonizes CPS microdata. IPUMS-CPS is free and includes, among many other variables, harmonized series for education, occupation and industry and a unique person and household ID, which makes it easier to link individuals across samples and facilitates data analysis. Following Muller (2012) and Haefke, Sonntag and van Rens (2013), my empirical model is based on the following MINCER equation for the wage of individual i at time t (w ): it (cid:32) (cid:33) T (cid:88) log(w ) = x(cid:48) β + αa ·D +αnhu ·D ·D +αnhc ·D ·D +e it it x j j j j it,nhu j j it,nhc it j=1 (123) x is a vector of individual characteristics, and β , {αa, αnhu, αnhc}T are coeffiit x j j j j=1 cients. D is a time dummy equal to 1 if j = t and 0 otherwise. D is a dummy j it,nhu variableequalto1ifworkerispenttimeinunemploymentduringthepastthreemonths and 0 otherwise. D is a dummy variable equal to 1 if worker i was previously emit,nhc ployedatanotherfirmduringthepastthreemonthsandhasnotbeenunemployedwhile switching jobs. Hence, the average (log) wage for all workers (wa), new employees (wu) 66

and job changers (wc) are given by: wa = α (124) t t wu = α +αnhu (125) t t t wc = α +αnhc (126) t t t The hourly wage rate is constructed by dividing weekly earnings by weekly hours. Following Schmitt (2003), top-coded weekly earnings are imputed assuming a lognormal cross-sectional distribution for earnings. Following Haefke et al. (2013) I drop hourlywageratesbelowthe0.25thandabovethe99.75thpercentileseachmonth. Inorder to uniquely identify workers in the CPS files, I use the IMPUMS-CPS ID variables: CPSID and CPSIDP.25 Vector x includes a fourth order polynomial in experience, gender, race, mariit tal status, state, 10 occupation dummies, and 14 industry dummies. For occupation, industry and education, I use harmonized variables OCC1950, IND1950, and EDUC provided by IPUMS-CPS. Experience is defined as age minus years of education minus 6. Following the literature, individual i’s weight is the product of the individual’s weight reported by the BLS and hours worked. Due to sample design, it is not possible to match individuals between July 1985 and December 1985 and between June 1995 and November 1995. Hence, with the exception oftheaveragewageforallworkers, wageserieshaveamissingvalueinthosemonths. To compute business cycle statistics for these wage series, I compute the quarterly average wage and impute the missing quarters using the average wage for all workers. 25I follow IPUMS-CPS recommendations, and I drop a few observations for which changes in sex or race are reported and for individuals whose age changes more than 2 years between samples 67

Cite this document
APA
Camilo Morales-Jimenez (2017). The Cyclical Behavior of Unemployment and Wages under Information Frictions (FEDS 2017-047). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2017-047
BibTeX
@techreport{wtfs_feds_2017_047,
  author = {Camilo Morales-Jimenez},
  title = {The Cyclical Behavior of Unemployment and Wages under Information Frictions},
  type = {Finance and Economics Discussion Series},
  number = {2017-047},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2017},
  url = {https://whenthefedspeaks.com/doc/feds_2017-047},
  abstract = {I propose a new mechanism for sluggish wages based on workers' noisy information about the state of the economy. Wages do not respond immediately to a positive aggregate shock because workers do not (yet) have enough information to demand higher wages. This increases firms' incentives to post more vacancies, which makes unemployment volatile and sensitive to aggregate shocks. The model is robust to two major criticisms of existing theories of sluggish wages and volatile unemployment: flexibility of wages for new hires and pro-cyclicality of the opportunity cost of employment. Calibrated to U.S. data, the model explains 70% of unemployment volatility. Accessible materials (.zip)},
}