feds · January 31, 2009

Modeling Earnings Dynamics

Abstract

In this paper we use indirect inference to estimate a joint model of earnings, employment, job changes, wage rates, and work hours over a career. Our model incorporates duration dependence in several variables, multiple sources of unobserved heterogeneity, job-specific error components in both wages and hours, and measurement error. We use the model to address a number of important questions in labor economics, including the source of the experience profile of wages, the response of job changes to outside wage offers, and the effects of seniority on job changes. We provide estimates of the dynamic response of wage rates, hours, and earnings to various shocks and measure the relative contributions of the shocks to the variance of earnings in a given year and over a lifetime. We find that human capital accounts for most of the growth of earnings over a career although job seniority and job mobility also play significant roles. Unemployment shocks have a large impact on earnings in the short run as well a substantial long long-term effect that operates through the wage rate. Shocks associated with job changes and unemployment make a large contribution to the variance of career earnings and operate mostly through the job-specific error components in wages and hours.

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. Modeling Earnings Dynamics Joseph Altonji, Anthony Smith, and Ivan Vidangos 2009-08 NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

Modeling Earnings Dynamics1 Joseph G. Altonji , Anthony A. Smith , and Ivan Vidangos (cid:3) (cid:3)(cid:3) (cid:3)(cid:3)(cid:3) February 10, 2009 1 Yale University and NBER. Yale University. Federal Reserve Board. We are grateful to Richard (cid:3) (cid:3)(cid:3) (cid:3)(cid:3)(cid:3) Blundell, Mary Daly, Rasmus Lentz, Costas Meghir, Paul Oyer, and Luigi Pistaferri for helpful discussions and suggestions. We also thank participants in seminars at the Bank of Spain, UC Berkeley, CEMFI, U. of Chicago, the Federal Reserve Bank of San Francisco, the Federal Reserve Board, Georgetown U., Pennsylvania State U., Princeton, U. of Rochester, Stanford, Vanderbilt, and Yale, and conference sessions attheSocietyofEconomicDynamics(June2005), theWorldCongressoftheEconometricSociety(August 2005), the Cowles Foundation Macro/Labor Economics Conference (May 2006), NBER (Nov. 2006), the Econometric Society Winter Meetings (January 2007) and the Society for Computational Economics (June 2008) for valuable comments. Our research has been supported by the Cowles Foundation and the EconomicGrowthCenter, YaleUniversity, andbyNSFgrantSES-0112533(Altonji). Theviewsexpressed in the paper are our own and not necessarily those of the Federal Reserve Board, Yale University, NBER, or other members of their sta⁄s. We are responsible for the remaining shortcomings of the paper.

Abstract In this paper we use indirect inference to estimate a joint model of earnings, employment, job changes, wage rates, and work hours over a career. Our model incorporates duration dependence in several variables, multiple sources of unobserved heterogeneity, job-speci(cid:133)c error components in both wages and hours, and measurement error. We use the model to address a number of important questions in labor economics, including the source of the experience pro(cid:133)le of wages, the response of job changes to outside wage o⁄ers, and the e⁄ects of seniority on job changes. We provide estimates of the dynamic response of wage rates, hours, and earnings to various shocks and measure therelativecontributionsoftheshockstothevarianceofearningsinagivenyearandoveralifetime. We (cid:133)nd that human capital accounts for most of the growth of earnings over a career although job seniority and job mobility also play signi(cid:133)cant roles. Unemployment shocks have a large impact on earnings in the short run as well a substantial long long-term e⁄ect that operates through the wage rate. Shocks associated with job changes and unemployment make a large contribution to the variance of career earnings and operate mostly through the job-speci(cid:133)c error components in wages and hours.

1 Introduction In this paper we build and estimate a simultaneous model of earnings. The model consists of equations for transitions into and out of employment, an equation for job to job mobility, a wage equation, an hours equation, and an earnings equation. The model features both observed and unobserved permanent heterogeneity, job speci(cid:133)c wage and hours components, a persistent component that a⁄ects the wage of a worker in all jobs, state dependence in employment and job mobility, tenure and experience e⁄ects, and measurement error. We have three main goals. The (cid:133)rst is to advance the literature in labor economics on how employment, hours, wages, and earnings are determined over a career. We examine the e⁄ects of education, race, experience, job tenure and unobserved heterogeneity, employment shocks, shocks to general skills, and draws of new job opportunities o⁄ering di⁄erent hours and wages. We trace out the response of wages, hours, and earnings to the various shocks and determine the channels through which they operate. Our analysis has implications for a number of long-standing questions in labor economics. For example, we provide estimates of the relative importance of general skill accumulation,jobshopping,andjobtenureforcareerwagegrowthandquantifythespeci(cid:133)cchannels through which an exogenous employment shock a⁄ects the path of wage rates, hours, and earnings. We study the e⁄ects of shocks on the future variance of earnings changes as well as on the average path. Our second goal is to provide a comprehensive account of what causes inequality in earnings at a point in time and over the lifetime. We measure the contribution of each of the various shocks, permanent unobserved heterogeneity, and education to the variance in earnings, wages, and hours over the course of a career. Our third goal is to provide a richer model of earnings for use in studies of consumption and saving as well as in dynamic stochastic general-equilibriummodels that are a cornerstone of modern macroeconomicsandpublic(cid:133)nance. Suchmodelshavebeenusedtostudythedistributionofwealth, the costs of business cycles, asset pricing, and other important questions.1 The quantitative implications of the calibrated theoretical models used in these lines of research depend on certain key features of the earnings process, such as the degree of earnings uncertainty and the persistence of earnings innovations.2 Almost all of the existing structural studies base their modeling and calibration choices for the 1Examples include Huggett (1996), Krusell & Smith (1998), Castaæeda, D(cid:237)az-GimØnez, & R(cid:237)os-Rull (2003), Storesletten, Telmer, & Yaron (2004a)) on consumptions and wealth, Imrohoroglu (1989), Krusell & Smith (1999), Storesletten, Telmer, & Yaron (2001a) on the costs of business cycles, and Telmer (1993), Heaton & Lucas (1996), Krusell & Smith (1997), Storesletten, Telmer, & Yaron (2007) on asset pricing. 2See,forexample,Deaton(1991),Aiyagari(1994),KrusellandSmith(1997),Guvenen(2007),andthediscussion in Blundell, Pistaferri and Preston (2008). 1

earnings process on the large empirical literature on univariate statistical models.3 Much has been learned about the statistical properties of career earnings from this work. However, with only one indicator, univariate models, no matter how richly speci(cid:133)ed, cannot identify the various sourcesof earnings(cid:135)uctuations, theirrelativeimportance, theirdynamicbehavior, ortheeconomics underlying how labor market outcomes are determined. Without such information, it is di¢ cult to think about the potential welfare consequences of speci(cid:133)c sources of variation or of policies such as unemployment insurance, employment regulations, wage subsidies, or earned income tax credits that insure against particular types of shocks to income. Furthermore, the innovations in the univariate representation of a multivariate time series process may be aggregates of current and past shocks in the multivariate representation. This will lead to mistakes in characterizing what the surprises to the agent are even under the assumption that the agent(cid:146)s information set is the same as the econometrician(cid:146)s. Only a few studies of earnings dynamics have considered multivariate models. These include Abowd and Card(cid:146)s (1987, 1989) analyses of hours and earnings, and Altonji, Martins, and Siow(cid:146)s (2002) second order vector moving average model of the (cid:133)rst di⁄erence in family income, earnings, hours, wages, and unemployment. Altonji, Martins and Siow(cid:146)s use of their model to study consumption and labor supply behavior and decompose the variance of innovations in the marginal utility of income into various sources is not entirely successful, but it does illustrate the potential that a multivariate model of the income process provides. The models that we consider, in contrast to those mentioned above, incorporate discrete events such as job changes, employment loss, interactions between job changes and wages, and e⁄ects of these discrete events on the variance of wage and hours shocks.4 Therearetwodistinctpathsthatonemighttakeinformulatingamultivariatemodelofearnings. The (cid:133)rst approach is the development of a statistical model of the process with little attention to an underlying theory of household decisions and constraints. This approach is in the spirit of the literature on univariate earnings processes, but the absence of theory limits what one can learn about how earnings are determined. The second approach is to develop a model that is based on lifetime utility maximization. Grounding the model of the income process in a utility maximization frameworkprovides afoundationforusingtheresults toanalyzepolicies whenearnings arepartially endogenous. The main disadvantage is the di¢ culty of specifying and estimating a model that incorporates labor supply choices, job search decisions, hours constraints, voluntary separations, 3Key early contributions include Lillard and Willis (1978), Lillard and Weiss (1979), Hause (1980), MaCurdy (1982). More recent contribution include Baker (1997), Geweke and Keane (2000), Haider (2001), Baker and Solon (2003), Guvenen (2007), and Meghir and Pistaferri (2004). The latter paper introduces ARCH shocks. 4A number of recent studies provide structural models of wage rates, job mobility, and employment dynamics, includingBarlevy(2008),Buchinskyetal(2008),andBaggeretal(2007),whoprovidereferencestoafewadditional studies. Wolpin (1992) is an early e⁄ort. We discuss the evidence below. 2

and involuntary job changes. Indeed, we do not know of any papers that have studied work hours and employment using a lifecycle utility maximization model that incorporates job speci(cid:133)c hours constraints, let alone job mobility decisions.5 Estimation of a structural model that is as rich as the one that we work with would require solving an intertemporal model of job search, labor supply (in the presence of hours constraints), and savings as part of the estimation strategy and is probably out of reach at the present time from a computational point of view. Low, Meghir, and Pistaferri (2008) take a major step in this direction by studying earnings risk and social insurance in the context of an intertemporal model of consumption, employment participation, wages, and mobility. They work with a simpler model of the earnings process than we do, but are able to measure welfare costs of the risk associated with innovations in the persistent wage component, an employer speci(cid:133)c wage component, and job loss and unemployment. We view our study as complementary to theirs. Although our model falls short of a fully speci(cid:133)ed behavior model, the equations can be viewed as approximations to the decision rules relating choices to state variables that would arise in a structural model based on lifetime utility maximization. The parameters of the rules depend on an underlying set of "deep" parameters that characterize consumption preferences, job search technology, etc. The class of models that we consider is rich enough to address a number of core behavior questions in labor economics, but tractable enough (at least the simpler versions) to be used in place of univariate income models that dominate the literature on savings, portfolio choice, etc. Furthermore, it provides a natural path along which to extend the analysis to include other important economic risks that individuals face, including changes in family structure through marriage, divorce, and the death of a spouse. We estimate the model using data on male household heads from the Panel Study of Income Dynamics. Given the presence of interactions among discrete and continuous variables, unobserved heterogeneity and state dependence in multiple equations, measurement error, and a highly unbalanced sample, conventional maximum likelihood and method of moments approaches are not feasible. For this reason, we use indirect inference (I-I), which is one of a family of simulation based approaches to estimation that involve comparing the distribution of arti(cid:133)cial data generated from the structural model at a given set of parameter values to features of the actual data.6 A complication arises in our case because our model includes discrete as well as continuous variables. With discrete variables, the simulatedvalues of moments of the arti(cid:133)cial dataare not continuous inmodel parameters, which makes gradient based numerical optimization methods problematic. Given our 5HamandReilly(2002)ispartofaliteraturethattestsforhoursrestrictionsinanintertemporallaborsupplyand consumption framework using Euler equations and within period marginal rate of substitution conditions. Blundell and MaCurdy (1999) survey the labor supply literature. 6 The method was introduced, under a di⁄erent name, in Smith (1990, 1993) and extended by Gourieroux, Monfort, and Renault (1993) and Gallant and Tauchen (1996). It is closely related to the simulated method of moments. 3

model size, derivative-free methods are also unattractive. Consequently, we use a smoothed version of the procedure suggested by Keane and Smith (2003). Estimation of our model is not straightforward, and a secondary contribution of our research is to explore the feasibility and performance of I-I in large models with a mix of discrete and continuous variables.7 There are too many results to concisely summarize, but a few deserve emphasis. First, education, race, and the two forms of unobserved permanent heterogeneity play an important role in employment transitions and job changes. Second, in keeping with a large literature on the labor supply of male household heads, wages have only a small (negative) e⁄ect on employment and on annual work hours. Third, even after accounting for unobserved individual heterogeneity and job speci(cid:133)c heterogeneity, we (cid:133)nd a strong negative tenure e⁄ect on job mobility. Fourth, job changes are induced by high outside o⁄ers and deterred by the job speci(cid:133)c wage component of the current job. Fifth, unemployment at the survey date is associated with a large decline of .62 log points in annual earnings. About 60 percent of the reduction is due to work hours, which recover almost completelyafteroneyear. Theother40percentisduetoadeclineof.25intheloghourlywagerate. Lost tenure and a drop in the job-speci(cid:133)c wage component contribute .064 and .027, respectively, to the wage reduction. The wage recovers by about .10 after 1 period and more slowly after that. Sixth,wagesdonotcontainarandomwalkcomponentbutarehighlypersistent. Thepersistence is the combined e⁄ect of permanent heterogeneity, the job speci(cid:133)c wage component, and strong persistence in a stochastic component representing the value of the worker(cid:146)s general skills. Seventh, shocks leading to unemployment or to job changes have large e⁄ects on the variance as well as the mean of earnings changes. Eighth, job shopping, the accumulation of tenure, and the growth in general skills account for log wage increases of .111, .122, and .580, respectively, over the (cid:133)rst thirty years in the labor market. Finally, job mobility and unemployment play a key role in the variance of career earnings. Job speci(cid:133)c hours and wage components, unemployment shocks, and job shocks together account for 36.7%, 48.2%, and 46.8% of the variance in lifetime earnings, wages, and hours, respectively. Job speci(cid:133)c wage shocks are more important than job speci(cid:133)c hours shocks for earnings. Job speci(cid:133)c wage shocks dominate for wages, with employment shocks also playing a substantial role. For hours, job speci(cid:133)c hours shocks dominate. Education accounts for about 1/3 of the variance in lifetime earnings and wages but makes little di⁄erence for hours. In our full sample, unobserved permanent heterogeneity accounts for about 11% of the variance in earnings and about 46% of the variance of hours but matters little for wages, although this breakdown is somewhat sensitive to the model and sample used. 7OtherrecentpapersthatapplyI-ItopaneldataincludeBaggeretal(2007),Nagypal(2007),andTartari(2006). 4

The paper continues in section 2, where we present the earnings model. In section 3 we discuss the data, which are drawn from the Panel Study of Income Dynamics (PSID) and in section 4 we discussestimation. WepresenttheresultsinSection5, beginningwithadiscussionoftheparameter estimates and then turning to an analysis of the (cid:133)t of the model, impulse response functions to various shocks, and variance decompositions. In Section 6, we brie(cid:135)y discuss results for alternative samples, including whites by education level. In the (cid:133)nal section we summarize our main (cid:133)ndings and provide a research agenda. 2 Models of Earnings Dynamics The main features of Model A, our main speci(cid:133)cation, are as follows. Labor market transitions, wages, and hours depend on three exogenous variables(cid:151)race, education, and potential experience(cid:151) as well as on two permanent unobserved heterogeneity components. The unobserved heterogeneity components can be labelled, loosely speaking, (cid:147)innate ability(cid:148)and (cid:147)propensity to move(cid:148). A typical worker enters the labor market after leaving school and receives initial draws of an employment status shock that determines whether the worker is employed or unemployed and an autoregressive wage component capturing part of (cid:147)general productivity(cid:148)that has the same value in all jobs. The worker also receives initial draws of a job-speci(cid:133)c wage component and a job-speci(cid:133)c hours component. There is state dependence in both employment and job-to-job transitions. In each period an unemployed worker receives an unemployment transition shock and an employed worker receives an employment transition shock. If the worker remains employed from one period to the next, thenwhethertheworkerchangesjobsdependsonthedrawofthejob-speci(cid:133)cwagecomponent forthenewjob, thecurrentjob-speci(cid:133)cwagecomponent, potentialexperience, jobseniority, thetwo permanent heterogeneity terms and an i.i.d. shock. A typical worker(cid:146)s wage depends on one of the heterogeneity terms (ability), the autoregressive general-productivity component, the job-speci(cid:133)c wage component, potential experience, and seniority. Unemployment spells have a negative e⁄ect on the autoregressive general-productivity component, and workers draw new job-speci(cid:133)c wage and hours components when they leave unemployment. Annual hours depend on employment status, the heterogeneity terms, the wage, and a job-speci(cid:133)c hours component that is identical across jobs. Finally, earnings are determined by wages and hours. We work with several variants of Model A as well as with a second model, which we refer to as Model B. Model B does not include job-speci(cid:133)c wage or hours components. However, it includes an autoregressive wage component which allows both the current wage to depend on the past wage and the variance of wage shocks to depend on whether the individual is continuing an existing job. In the next two subsections, we de(cid:133)ne notation and list the equations of Model A and then discuss the model. We then turn to Model B. 5

2.1 Model A. A word about notation (cid:133)rst. We control for economy wide e⁄ects using year dummies, but leave them implicit in most of the analysis. The subscript i; which we sometimes suppress, refers to the individual, t is potential years of labor market experience of i for a particular observation. We i sometimes refer to it as "time" and usually suppress the i subscript. The subscript j(t) refers to the job that i holds at t: The notation j(t) makes explicit the fact that individuals may change jobs. In particular, j(t) = j(t 1) if i experiences a job change without being unemployed at either 6 (cid:0) t or t 1 or if i is employed at t but was unemployed at t 1. The (cid:13) parameters refer to intercepts (cid:0) (cid:0) and to slope coe¢ cients. For each intercept and slope parameter the superscripts identify the dependent variable. The subscripts of slope parameters identify the explanatory variable. We use (cid:14) to denote coe¢ cients on the (cid:133)xed person speci(cid:133)c unobserved heterogeneity components (cid:22) i and (cid:17) ; the job match heterogeneity wage component (cid:29) ; and the job speci(cid:133)c hours component i ij(t) (cid:24) . The superscripts for the (cid:14) parameters denote the dependent variable and the subscripts ij(t) (cid:22) and (cid:17) identify the heterogeneity component. We use (cid:26) with appropriate subscripts to denote autoregression coe¢ cients. The "k are iid N(0;(cid:27)2) random variables where the superscripts k it k correspond to the dependent variables. The equations of Model A are as follows. Employment to Employment Transition (EE) (1) E = I[(cid:13)EE +(cid:13)EE(t 1)+(cid:13)EE(t 1)2 +(cid:13)EEwage0 +(cid:13)EE BLACK +(cid:13)EE EDUC it 0 t i (cid:0) t2 i (cid:0) w^ it BLACK i EDUC i +(cid:13)EE min(ED ;9)+(cid:14)EE(cid:22) +(cid:14)EE(cid:17) +"EE > 0] given E = 1; ED i;t 1 (cid:22) i (cid:17) i it i;t 1 (cid:0) (cid:0) where E is an employment dummy, I( ) is an indicator function, ED is lagged employment it i;t 1 (cid:1) (cid:0) duration and is determined recursively by ED it = E it (cid:1) (ED i;t (cid:0) 1 +1), and wage0 it is what the wage would be in t if the individual were to continue employment in the job held at t 1. (cid:0) Unemployment to Employment Transition (UE): (2) E = I[(cid:13)UE +(cid:13)UE(t 1)+(cid:13)UE(t 1)2 +(cid:13)UE BLACK +(cid:13)UE EDUC +(cid:13)UEUD it 0 t i (cid:0) t2 i (cid:0) BLACK i EDUC i UD i;t (cid:0) 1 +(cid:14)UE(cid:22) +(cid:14)UE(cid:17) +"UE > 0] given E = 0; (cid:22) i (cid:17) i it i;t 1 (cid:0) whereUD isthenumberofyearsunemployedatthesurveydateandUD = (1 E ) (UD + i;t 1 it it i;t 1 (cid:0) (cid:0) (cid:1) (cid:0) 1): Job Change While Employed (JC): 6

(3) JC = I[(cid:13)JC +(cid:13)JC(t 1)+(cid:13)JC(t 1)2 +(cid:13)JC TEN +(cid:13)JC BLACK +(cid:13)JC EDUC it 0 t i (cid:0) t2 i (cid:0) TEN i;t (cid:0) 1 BLACK i EDUC i +(cid:14) (cid:29) +(cid:14) (cid:29) +(cid:14)JC(cid:22) +(cid:14)JC(cid:17) +"JC > 0] E E (cid:29) 0 j 0 (t) 0i;j 0 (t) (cid:29)j(t (cid:0) 1) i;j(t (cid:0) 1) (cid:22) i (cid:17) i it (cid:1) it (cid:1) i;t (cid:0) 1 where (cid:29) is a job speci(cid:133)c error component, (cid:29) is a draw of the job speci(cid:133)c component for i;j(t 1) 0i;j (t) (cid:0) 0 an alternative job j (t) in t, and TEN is employer tenure at the previous survey date, which 0 i;t 1 (cid:0) evolves according to TEN = (1 JC ) E E (TEN +1): it it it i;t 1 i;t 1 (cid:0) (cid:1) (cid:1) (cid:0) (cid:1) (cid:0) Log Wages: (4) wagelat = (cid:13)w +(cid:13)wX +(cid:13)w P(TEN )+(cid:14)w(cid:22) +(cid:29) +! it 0 X it TEN it (cid:22) i ij(t) it (5) (cid:29) = (1 S )(cid:29) +S (cid:29) ij(t) (cid:0) it ij(t (cid:0) 1) it 0ij 0 (t) (6) (cid:29) = (cid:13)(cid:29) JC +(cid:26) (cid:29) +"(cid:29) 0ij (t) JC it (cid:29) i;j(t 1) ij(t) 0 (cid:0) (7) ! = (cid:26) ! +(cid:13)! (1 E )+(cid:13)! (1 E )+"! it ! i;t (cid:0) 1 1 (cid:0) Eit (cid:0) it 1 (cid:0) Ei;t (cid:0) 1 (cid:0) i;t (cid:0) 1 it (8) wage = E wagelat it it (cid:1) it where wagelat is the (cid:147)latent(cid:148)wage, which we de(cid:133)ne below, X is a vector of exogenous variables it it including t, BLACK and EDUC ; P(TEN ) is a fourth order polynomial in TEN , (cid:29) is i i it it ij(t) the job match speci(cid:133)c wage component, ! is an autoregressive component of the latent wage, it S = (JC +E (1 E )) is a job separation indicator that equals 1 if JC is 1 or if the individual it it it i;t 1 (cid:0) (cid:0) was unemployed in t 1 and employed in t: The variable wage is the actual wage rate, which we it (cid:0) de(cid:133)ne as 0 for persons who are unemployed. Log Annual Work Hours of the Head of Household (9) hours = (cid:13)h +(cid:13)h X +((cid:13)h +(cid:24) )E +(cid:13)hwagelat +(cid:14)h(cid:22) +(cid:14)h(cid:17) +"h it 0 X it E ij(t) it w it (cid:22) i (cid:17) i it where (cid:24) is a job match speci(cid:133)c hours component. ij(t) Log earnings (10) earn = (cid:13)e +(cid:13)e X +(cid:13)e(wagelat (cid:13)w (cid:13)wX )+(cid:13)e(hours (cid:13)h (cid:13)h X )+e it 0 X it w it (cid:0) 0 (cid:0) X it h it (cid:0) 0 (cid:0) X it it e = (cid:26) e +"e it e i;t 1 it (cid:0) 7

Error Components and Initial Conditions: The (cid:133)xed person speci(cid:133)c error components (cid:22) and (cid:17) are N(0;1), iid across i, independent of i i each other, and independent of all transitory shocks and measurement errors. We parameterize the errors of the various equations so that (cid:22) may be thought of as the (cid:133)xed unobserved heterogeneity i component of wages (or (cid:147)innate ability(cid:148)). We also allow (cid:22) to in(cid:135)uence EE; UE, JC, and hours. The factor (cid:17) is assumed to have no in(cid:135)uence on wages. One may think of it as a factor that is i related to labor supply and to job and employment mobility preferences (or (cid:147)propensity to move(cid:148)). We impose the sign normalizations (cid:14)w > 0 and (cid:14)JC > 0. (cid:22) (cid:17) The job match hours component (cid:24) and the innovation "(cid:29) in (cid:29) are N(0;(cid:27)2) and N(0;(cid:27)2), ij(t) it ij(t) (cid:24) (cid:29) respectively. The shocks "EE;"UE;"JC;"h;"!;"e are N(0;(cid:27)2), where k = EE;UE;JC;h;!; and e: it it it it it it k Theyareiidacrossiandtandindependentfromoneanotherandallmeasurementerrorcomponents de(cid:133)ned below. The initial conditions are Employment: E = I[b +(cid:14)EE(cid:22) +(cid:14)EE(cid:17) +"EE > 0] i1 0g (cid:22) i (cid:17) i i1 Wages: wagelat = (cid:13)w +(cid:13)wX +(cid:14)w(cid:22) +(cid:29) +! i1 0 X i1 (cid:22) i ij(1) i1 ! N(0;(cid:27)2 ) i1 (cid:24) !1;g Wage Job Match : (cid:29) N(0;(cid:27)2 ) ij(1) (cid:24) (cid:29)1 Earnings Error : e N(0;(cid:27)2) i1 (cid:24) e Other Initial Conditions: TEN = 0;ED = E ;UD = 1 E ;JC = 0: i1 i1 i1 i1 i1 i1 (cid:0) The intercept b of the initial employment condition and the variance of initial wages (cid:27)2 0g !1;g depend on the race-education group g, where the groups are de(cid:133)ned by (BLACK & EDUC 12), (cid:20) (BLACK & EDUC > 12), (not BLACK & EDUC 12), and (not BLACK & EDUC > 12). (cid:20) Measurement Error and Observed Wages, Hours, and Earnings: The observed (measured) variables are: (11) wage = E (wagelat +mw) (cid:3)it it (cid:1) it it (12) hours = hours +mh (cid:3)it it it (13) earn = earn +me (cid:3)it it it The measurement errors mw, mh, me are N(0;(cid:27)2 ), (cid:28) = w;h;e, iid across i and t; mutually it it it m(cid:28) independent, and independent from all other errors components in the model. 8

2.2 Discussion of Model A TheEEequationstatesthatthelatentvariablethatdeterminesE forpreviouslyemployedworkers it dependsonaquadraticint ,alinearfunctionofED withaceilingat9years,BLACK ,EDUC i i;t 1 i i (cid:0) and the error (cid:14)EE(cid:22) +(cid:14)EE(cid:17) +"EE. Early on we experimented with including TEN as well as (cid:22) i (cid:17) i it i;t 1 (cid:0) ED but in simulation experiments found that we had trouble distinguishing the e⁄ects of the i;t 1 (cid:0) two. Standard labor supply models imply that employment at t should depend on the current wage opportunity, which we proxy with wage . It also depends on the permanent wage heterogeneity 0it component (cid:22) as well as the hours preference and mobility component (cid:17) . i i The UE transition probability has the same form as EE, with unemployment duration UD i;t 1 (cid:0) replacing ED . Because there are relatively few multi-year unemployment spells, we exclude i;t 1 (cid:0) UD restricting(cid:13)UE to0inmostoftheanalysis. Weexperimentedwithspeci(cid:133)cationscontaining i;t 1; UD (cid:0) the lagged latent wage rate or the expected value of the period t wage but had di¢ culty identifying the e⁄ects of these variables, perhaps because we observe relatively few unemployment spells. We do include the wage heterogeneity component (cid:22) as well as (cid:17) : i i The JC equation refers to job to job changes for workers who are employed in both t and t 1. (cid:0) In our speci(cid:133)cationof the linkbetween mobility and wages, the main distinction we drawis between job changes from employment and job changes that involve unemployment. We believe that this is the most important distinction for the determination of wages and annual work hours, although it would be interesting in future work to distinguish between quits and layo⁄s on the basis of self reports Standard job search and job matching models predict a negative coe¢ cient on (cid:29) ; since ij(t 1) (cid:0) higher values of the job match component of the current job should reduce search activity and raise the reservation wage. In the model each worker is assigned a potential draw of (cid:29) based on 0ij (t) 0 (6), which we discuss momentarily. Search models predict a positive coe¢ cient on (cid:29) , but the 0ij (t) 0 magnitude should depend on the probability that the worker actually receives the o⁄er. That is, the relative magnitudes of the two coe¢ cients should depend on o⁄er arrival rates and need not be equal.8 We include TEN as well as (t 1) because models of (cid:133)rm (cid:133)nanced or jointly (cid:133)nanced i;t 1 (cid:0) (cid:0) speci(cid:133)c capital investment suggest that it will play a role, and the decline in separation rates with TEN in cross section data is very strong. Little is known about how much of the association i;t 1 (cid:0) between TEN and JC is causal because of the di¢ culty of distinguishing state dependence i;t 1 it (cid:0) from the individual heterogeneity ((cid:22) and (cid:17)) and job match heterogeneity ((cid:29)) in dynamic discrete choice models, particularly when data are missing on early employment histories for most sample 8One could introduce parameters corresponding to (cid:133)xed o⁄er arrival rates for employed workers and for unemployed workers into the model and add the value of (cid:29) into the unemployment equation. Low et al (2008) 0ij (t) 0 work with such a speci(cid:133)cation. In our job change equation, (cid:29) may reduce mobility both because it raises the ij(t 1) reservation wage and because it lowers search intensity. (cid:0) 9

members. Indeed, Buchinsky et al (2008) is the only other study that we know that accounts for both individual and job speci(cid:133)c heterogeneity and deals with initial conditions problems when estimating the e⁄ects of TEN and t on job changes.9 WheninterpretingresultsforEE andJC, onemustkeepinmindthatouremploymentindicator refers to the survey date. We undoubtedly miss short spells of unemployment that fall between surveys. Duetodatalimitations, wecannottellwhetherapersonhaschangedjobsbetweensurveys only once or multiple times. Furthermore, if a person is employed at t 1, unemployed for part (cid:0) of the year, and employed in a new job at t, we would count this as a job to job change even if, for example, the job change is due to a layo⁄into unemployment. A relatively simple alternative would be to make use of information on the number of weeks that the individual was unemployed during the year. However, one would want to distinguish between short spells of unemployment that are associated with temporary layo⁄s with the strong expectation of recall and unemployment spells due to a permanent layo⁄. This is possible only at the survey date. Fortunately, earnings depend on employment through annual work hours and the transitory error component in the hours equation should capture the e⁄ect on hours of unemployment spells of varying duration. The 25th, 50th, 75th, and 90th percentiles of hours of unemployment are 120, 680, 1200, and 1600 when EMP = 0 and 0, 0, 0, and 64 when EMP = 1.10 it it The wage model (4) is unusual in our use of the concept of a latent wage. For employed individuals wagelat and the actual wage wage are the same. For an unemployed individual wagelat it it it captures the process for wage o⁄ers that exceed is reservation wage. At a given point in time the 0 individual might not have such an o⁄er. Our formulation allows us to capture in a parsimonious way the idea that worker skills and worker speci(cid:133)c demand factors evolve during an unemployment spell. From a practical point of view, the formulation also allows us to deal with the fact that wages are only observed for jobs that are held at the survey date. The variable wagelat depends on (cid:133)ve components. The (cid:133)rst is the regression index (cid:13)wX , which it X it captures the e⁄ects of potential experience t ; education, race, andeconomywide variation(through i year dummies). Since we control for both tenure e⁄ects and gains from job shopping, the e⁄ect of potential experience is a general human capital e⁄ect. The second of the (cid:133)ve components is tenure. 9Buchinskyetal(2008)also(cid:133)ndnegativee⁄ectsinasimultaneousmodelofwages,employment,andjobchanges. Farber (1999) discusses models of the e⁄ect of tenure on mobility and surveys the empirical evidence. He presents evidence showing a negative e⁄ect of tenure when one uses prior mobility as a control for individual heterogeneity. 10 It is conceptually straightforward to specify the model on a quarterly or monthly basis. Simulated data that matches the periodicity, level time of aggregation, and dating within the calendar year of the various PSID variables could be constructed from the higher frequency data from the model. One could use both measures of weeks of unemployment over the previous calendar year and unemployment at the survey date. One would have to think carefully about the speci(cid:133)cation of shocks(cid:151)few employers reset wages on a monthly or quarterly basis. One might also wish to incorporate smoothness restrictions on distributed lags, along the lines of Altonji, Martins and Siow(cid:146)s (2002)useofAlmonlagsinaquarterlymodel. Webelievethatthereismeritinstartingwiththesimplerspeci(cid:133)cation that we employ. 10

The third is the heterogeneity component (cid:22) . The fourth is a stochastic component ! ; which i it depends on ! , unemployment, and the error component "!. The dependence of ! on the past i;t 1 it it (cid:0) may re(cid:135)ect persistence in the market value of the general skills of i and/or the fact that employers base wage o⁄ers on past wages. We will have more to say about the second mechanism when we turn to model B. The (cid:133)fth is the job match speci(cid:133)c term (cid:29) . When persons leave unemployment ij(t) or move from job to job without unemployment, they draw a new value of (cid:29) : The new value ij(t) depends on (cid:29) , a mean shift term (cid:13)(cid:29) in the case of a job change without unemployment, and ij(t 1) JC (cid:0) the shock "(cid:29) : We set (cid:13)(cid:29) = 0 when (cid:29) and (cid:29) are included in the JC model (models A.2 ij(t) JC ij(t 1) ij(t) t (cid:0) and A.3 below), because in that case any shift in the mean of (cid:29) is accounted for endogenously by ij(t) the e⁄ects of (cid:29) and (cid:29) on mobility. In standard search models with exogenous o⁄er arrivals, ij(t 1) ij(t) (cid:0) the job speci(cid:133)c component of the o⁄er, (cid:29) does not depend on (cid:29) although accepted o⁄ers 0ij (t) ij(t 1) 0 (cid:0) (cid:29) will. In such models the correlation between accepted o⁄ers (cid:29) and (cid:29) arises because ij(t) ij(t) ij(t 1) (cid:0) the reservation wage is a positive function of (cid:29) : Nevertheless, we allow o⁄ers (cid:29) to depend ij(t 1) 0ij (t) (cid:0) 0 on (cid:29) through the parameter (cid:26) for three main reasons. The (cid:133)rst is that employers may ij(t 1) (cid:29) (cid:0) base o⁄ers to prospective new hires in part on wages in the prior (cid:133)rm, including the (cid:133)rm speci(cid:133)c component. Bagger et al (2007), building on Postel-Vinay and Robin (2002) and Postel-Vinay and Turon (2005), is one of a few recent papers in which outside (cid:133)rms tailor o⁄ers to surplus in the current job. This surplus will be related to (cid:29) to the extent that (cid:29) is the worker(cid:146)s portion ij(t 1) ij(t 1) (cid:0) (cid:0) of a job speci(cid:133)c productivity component. However, in contrast to those papers, we do not allow the current employers to change (cid:29) in response to outside o⁄ers. (Wages do change through ij(t 1) (cid:0) ! :) The second reason (cid:29) will depend on (cid:29) is that (cid:29) is not likely to be entirely job it 0ij (t) ij(t 1) ij(t 1) 0 (cid:0) (cid:0) speci(cid:133)c in the presence of demand shocks a⁄ecting jobs in a narrowly de(cid:133)ned industry, occupation, and region. The third is that the network available to an individual may be related to the quality of the job that he is in. As it turns out, our estimates of (cid:26) are large(cid:151)about .60.11 We were not v successful in limited experimentation with estimating models in which the link between (cid:29) and ij(t) (cid:29) when JC = 1 di⁄ers from the link following unemployment, although standard job search ij(t 1) (cid:0) models with exogenous layo⁄s imply that it should. The equation for hours includes X . It also includes (cid:17) , (cid:22) , and the product of the job speci(cid:133)c it it i i hourscomponent(cid:24) andE :Weinclude(cid:24) becausethereisstrongevidencethatworkhoursare ij(t) it ij(t) heavily in(cid:135)uenced by a job speci(cid:133)c component. This component presumably re(cid:135)ects work schedules 11Industry speci(cid:133)c and/or occupation speci(cid:133)c human capital are not accounted for in the model and are likely to in(cid:135)uence estimates of (cid:26) more than (cid:26) given that industry and occupation changes tend to occur across employers. (cid:29) ! They would also a⁄ect the estimates of the return to seniority that we import from Altonji and Williams (2005). See Neal (1995), Parent (2000), and Kambourov and Manovskii (2009) for somewhat con(cid:135)icting evidence on the importance of occupation speci(cid:133)c, industry speci(cid:133)c, and (cid:133)rm speci(cid:133)c human capital. Extending the model to distinguish occupation and/or industry is conceptually straightforward but would require models of occupation and industry transitions and attention to measurement error. We leave this to future work. 11

imposed by employers.12 A new value of (cid:24) is drawn when individuals change jobs. The iid ij(t) error component "h picks up transitory variation in straight time hours worked, overtime, multiple it job holding, and unemployment conditional on employment status at the survey. It may re(cid:135)ect temporary shifts in worker preferences as well as hours constraints. Hours also depend on wagelat and E . For most observations, wagelat is the actual wage. it it it However, many individuals are unemployed at the survey date but work part of the year. We use wagelat as the measure of the wage the individual would typically receive. Because wage it shocks turn out to be highly persistent and because we strongly question the standard labor supply assumption that individuals are free to adjust hours on their main job in response to short term variation in wage rates, we think of the coe¢ cient on the latent wage as a response to a relatively permanentwagechangeratherthanaFrishelasticity. Westickwiththisinterpretationeventhough we control for (cid:22) in both the wage and hours equations. i Log earnings earn depends on (residual) wagelat and hours . The coe¢ cients (cid:13)e and (cid:13)e it it it w h might di⁄er from 1 for a number of reasons, including overtime, multiple job holding, bonuses and commissions, job mobility, and the fact that for some salaried workers the wage re(cid:135)ects a set work schedule but annual hours worked may vary. We also include a (cid:133)rst order autoregressive error component e to capture some of these factors. In previous drafts of the paper we freely estimated it (cid:13)e and (cid:13)e and obtained values close to 1 for most speci(cid:133)cations. However, for some versions of w h the model it is helpful to restrict the coe¢ cients to be 1, which we do below. We have not considered models with an ARCH error structure. However, the model implies that the variance of wage, hours, and earnings changes are state dependent and also depend on t: This is because the odds of a job change and an unemployment spell depend on TEN; ED, potential experience, and (cid:29) and because job changes and unemployment spells are associated ij(t) with innovations in (cid:29) and (cid:24) : The variances also depend on the permanent components of X ij(t) ij(t) it (education and race) and on the unobserved heterogeneity components (cid:22) and (cid:17) . i i Manystudiesoftheincomeprocesssimplyignorethepresenceofmeasurementerroreventhough surveys by Bound et al.(2001) and others indicate that it is substantial. Altonji et al.(2002) and someotherstudieshaveattemptedtodirectlyestimatethevariancesofmeasurementerrorinwages, hours, and earnings under a classical measurement error assumption. Here, we draw loosely upon studies of measurement error in the PSID and other panel data sets to come up with a range of estimates of the measurement error parameters. For most of our models our choices imply that mw it accounts for 35% of var((cid:1)wage ), 25% of var((cid:1)hours ), and 25% of var((cid:1)earn ). We abstract (cid:3)it (cid:3)it (cid:3)it from measurement error in employment, which we believe is relatively unimportant, as well as in 12See Altonji and Paxson (1986) and Senesky (2005), who show that hours changes are much larger across jobs than within jobs for both quits and layo⁄s, and that one cannot account for this as a labor supply response to di⁄erences in wages, nonpecuniary job characteristics, or changes in labor supply preferences. 12

the job change indicator, which we suspect is more serious. (See Altonji and Williams (1998)). Our reported standard errors do not account for uncertainty about the measurement error parameters.13 2.3 Model B The main di⁄erences between Model A and Model B are in the wage and JC equations. The wage equation for Model B is (14) wagelat = (cid:13)w +(cid:13)wX +(cid:14)!(cid:22) +! it 0 X it (cid:22) i it ! = (cid:26) [1+(cid:30) S ]! +(cid:13)! JC +(cid:13)! (1 E )+(cid:13)! (1 E ) it ! 1 it i;t (cid:0) 1 JC it 1 (cid:0) Et (cid:0) it 1 (cid:0) Et (cid:0) 1 (cid:0) i;t (cid:0) 1 +[1+(cid:30) S ]"! 2 it it ! N(0;(cid:27)2 ) (initial condition). i1 (cid:24) !1g The above wage model does not include the job speci(cid:133)c wage component (cid:29) but introduces the coe¢ cients (cid:30) and (cid:30) .14 These allow the degree of persistence and the variance in the wage 1 2 innovation to shift with a job change or end of an unemployment spell. As noted above, our speci(cid:133)cation of state dependence in wages captures the fact that many employers use past wage rates, along with other information, in determining wage o⁄ers for new hires, as well as the fact that previous wage rates are a reference point for incumbent workers when evaluating an o⁄er. It may also re(cid:135)ect dependence between the productivity of a worker today and productivity last year. One might expect the degree of dependence to be weaker across jobs than within jobs ((cid:30) < 0). 1 The JC equation is the same as (3) with the (cid:29) terms excluded. We have estimated versions of Model B with and without wage in the EE and JC equations but to save space report the i;t 1 (cid:0) version without the wage terms. The hours equation excludes the job speci(cid:133)c hours component i;t 1 (cid:0) (cid:24): We use Model B for three reasons. First, it is a smaller step from univariate models of the wage process than Model A. Second, it has proved to be easier to estimate. Finally, it is more tractable than Model A for use in dynamic programming models of consumer behavior.15 13Theassumptionofnormallydistributed,classicalmeasurementerrorrunscountertoevidencethatactualreports are a mixture of correct responses and responses with error. Furthermore, Bound et al (2001) summarize evidence that measurement error is mean reverting to some extent, with individuals smoothing shocks when they report on economic variables. In principle, our methods can accommodate almost any measurement error assumption. We stick with the simpler formulation for lack of hard quantitative evidence on richer measurement error speci(cid:133)cations that we can import into our model. 14Inearlyworkweexperimentedwithallowingthee⁄ectof(cid:22) togrowlinearlywithexperience,butdidnotobtain i sensible results. 15Vidangos (2008) has simulated a model of optimal lifetime consumption using a family income process that embeds a version of model B as the wage process for the household head. 13

3 Data We use the 1975-1997 waves of the PSID to assemble data that refers to the calendar years 1975- 1996. Because some observations are lost due to the use of lags, the current values of the variables in our model range from 1978 to 1996. We include members of both the SRC strati(cid:133)ed random sample and SEO sample. The latter consists primarily of households that were low income in 1968 and substantially overrepresents blacks. We also present results for the SRC sample only. We also include nonsample members who married PSID sample members. The sample is restricted to male household heads. We include both single and married individuals. Observations for a given person-year are used if the person is between age 18 and 62, was working, temporarily laid o⁄or unemployed in a given year, was not self employed, had valid data on education (EDUC) and had no more than 40 years of potential experience. We treat persons on temporary layo⁄at the survey date as employed. We eliminate a small number of observations in which the individual reports being retired, disabled, a housewife, a student, other, or "don(cid:146)t know". (See Appendix tables A1 and A2).16 Potential experience t is age max(EDUC ;10) 5: BLACK is one if the individual is black i it i i (cid:0) (cid:0) and 0 otherwise. ED is the number of years in a row that a person is employed at the survey date. it In 1975 and for persons who join the sample after 1975, we set ED to tenure with the current it employer.17 The variable UD is the number of consecutive years up to t 1 that the individual has not i;t 1 (cid:0) (cid:0) been employed at the survey date. We set UD to 0 if the (cid:133)rst time we observe i is in year t: i;t 1 (cid:0) Few unemployment spells exceed 1 year, so the error is probably small. The wage measure wage (cid:3)it is the reported hourly wage rate at the time of the survey. It is only available for persons who are employed or on temporary layo⁄.18 Finally, we censorreportedhours at 4000, add200toreportedhours before takinglogs toreduce theimpactofverylowvaluesofhoursonthevariationinthelogarithm,andcensorobservedearnings and observed wage rates (in levels, not logs) to increase by no more than 500% and decrease to no less than 20% of their lagged values. We also censor wages to be no less than $3.50 in year 2000 16We allow persons to come out of retirement and include future observations following a retirement spell if the individual is working, temporarily laid o⁄or unemployed. As reported in Appendix Table A1, 1.85% of the PSID sample reports an employment status as disabled in a given year. 17An alternative would be to apply exactly the same censoring that occurs in the PSID in the simulated data. In the simulated data ED would be set to tenure when t in the simulated case is equal to t for the (cid:133)rst value we see it in the corresponding PSID case. 18Thismeasureisthelogofthereportedhourlywageatthesurveydateforpersonspaidbythehourandisbased on the salary per week, per month, or per year reported by salary workers. It is unavailable prior to 1970 and is limited to hourly workers prior to 1976. We account for the fact that it is capped at $9.98 per hour prior to 1978 by replacing capped values for the years 1975-1977 with predicted values constructed by Altonji and Williams (2005). They are based on a regression of the log of the reported wage on a constant and the log of annual earnings divided by annual hours using the sample of individuals in 1978 for whom the reported wage exceeds $9.98. 14

dollars. After observations are lost due to construction of lagged values, or missing data, we use information on 4,632 individuals. Each individual contributes between 1 and 19 observations. The 5th, 25th, median, 75th, and 95th percentile values of the number of observations a given individual contributes are 1, 3, 6, 11, and 18 respectively (see Appendix Table A3). Of course, persons who are present for many years contributed disproportionately to the total of 33,933 person-year observations. The number of observations per year varies from 1,200 in 1979 to 2,007 in 1991. The sample is highly unbalanced. As we have already noted, an advantage of simulation based estimators such as I-I is that by incorporating the sample selection process into the simulation, one canhandleunbalanceddata. Weassumethatobservationsaremissingatrandom, althoughthereis reason to believe that the heterogeneity components and shocks in(cid:135)uence attrition from the sample. In principle, one could augment the model with an attrition equation. Alternatively, it would be straightforward to simply use sample weights to reweight the PSID when evaluating the likelihood function of the auxiliary model if suitable weights were available. However, PSID sample weights are designed to keep the data representative of successive cross sections of the US population that originate in the families present in the base year.19 They do not adjust for factors that alter the US population, such as di⁄erences in birth rates by race or education. Furthermore, there are no sample weights for persons who move into PSID households through marriage. Consequently, we do not use weights. In essence, we are assuming both that observations are missing at random and that the model parameters do not vary across demographic groups or over time. The results are fairly robust to restricting the analysis to the SRC sample, as we discuss in Section 6. We also report separate results for SRC whites by education level. In Table 1a we present the mean, standard deviation, minimum and maximum of the variables used in our structural model. The mean of E is .97, so we observe relatively few unemployment it spells. Note also that the mean of EE is .98. Given these magnitudes, even relatively large it movements in the latent variable index determining EE have only a small e⁄ect on whether EE it it is 1 or 0: In Table 1b we provide additional information about our sample, including the mean and standard deviation for education, race, potential experience, and the calendar year. 4 Estimation Methodology We begin with a brief overview of our estimation procedure. We then de(cid:133)ne the auxiliary model used in the estimation procedure as well as additional moment conditions that we use. Note (cid:133)rst that to reduce computational complexity, we estimate the coe¢ cients on vector X in equations it 19See Fitzgerald, Gottschalk and Mo¢ tt (1998) for an analysis of attrition in the PSID. They conclude that at least through 1989 the PSID is fairly representative of the US population once internal sample weights are used. 15

with continuous dependent variables by (cid:133)rst regressing hours , wage , and earn on X . The (cid:3)it (cid:3)it (cid:3)it it vector X consists of a constant, years of education, BLACK , t , t2, t3, and a set of year dummies. it i i i i We then work with the residuals of these variables when estimating the remaining parameters by I-I.20 4.1 Indirect inference Forclarity, we will refertoModel A(orB) above as the (cid:147)structural(cid:148)model, eventhoughthe models do not express the parameters of the decision rules for EE, UE, JC, etc., in terms of preference parameters and parameters governing job search, mobility, and exogenous layo⁄s. We denote the k (cid:147)structural(cid:148)parameters by (cid:12). Indirect inference involves the use of an (cid:147)auxiliary(cid:148)statistical model that captures properties of the data. This auxiliary model has p k parameters (cid:18). The (cid:21) method involves simulating data from the structural model (given a hypothesized value of (cid:12)) and ^ choosing the estimator (cid:12) to make the simulated data match the actual data as closely as possible according to some criterion that involves (cid:18). Let the observed data consist of a set of observations on N individuals in each of T time periods: y ;x , i = 1;:::;N, t = 1;:::;T, where y is endogenous to the model and x is exogenous. it it it it f g The auxiliary model parameters (cid:18) can be estimated using the observed data as the solution to: ^ (cid:18) = argmax (y;x;(cid:18)); (cid:18) L where (y;x;(cid:18)) is the likelihood function associated with the auxiliary model, y y and it L (cid:17) f g x x . it (cid:17) f g Given x and assumed values of the structural parameters (cid:12), we use the structural model to generate M statistically independent simulated data sets y~m((cid:12)) , m = 1;:::;M. Each of the f it g M simulated data sets has N individuals and is constructed using the same observations on the ~ exogenous variables, x. For each of the M simulated data sets, we compute (cid:18) ((cid:12)) as m ~ (cid:18) ((cid:12)) = argmax (y~ ((cid:12));x;(cid:18)); m m (cid:18) L where the likelihood function associated with the auxiliary model is evaluated using the mth simulated data set y~ ((cid:12)) y~m((cid:12)) rather than the real data. Denote the average of the estimated m (cid:17) f it g parameter vectors by ~ (cid:18)((cid:12)) M 1 M ~ (cid:18) ((cid:12)). (cid:17) (cid:0) m=1 m 20Note that we include the constants (cid:13)Pw; (cid:13)h; and (cid:13)e in the wage, hours and earnings models even though we also 0 0 0 includeaconstantwhenconstructhours, wage, andearningsresiduals. Ourreportedstandarderrorsdonotaccount for(cid:133)rststageestimationofthe(cid:13) parameters. Wedoubtthatadjustmentwouldmakemuchdi⁄erencebecausethe X estimated standard errors forthe elements of (cid:13) are small. Also, note that the coe¢ cients on the experience pro(cid:133)le X of wages capture not only the e⁄ects of general human capital and age but also average growth in (cid:29) and tenure j(t) with t. When we estimate Model A.3 by indirect inference, we include a quadratic in experience in the equation for the wage to account for this. We did not include these terms in models A.1, A.2 and B.1, which exclude tenure from the wage equation. 16

^ I-I generates an estimate (cid:12) of the structural parameters by choosing (cid:12) to minimize the distance between ^ (cid:18) and ~ (cid:18)((cid:12)) according to some metric.21 As described in Keane and Smith (2003) and ^ elsewhere, there are (at least) three possible ways to specify such a metric. Here we choose (cid:12) to minimize the di⁄erence between the constrained and unconstrained values of the likelihood function of the auxiliary model evaluated using the observed data. In particular, we calculate ^ ^ ~ (cid:12) = argmin[ (y;x;(cid:18)) (y;x;(cid:18)((cid:12)))]: (cid:12) L (cid:0)L ^ Gourieroux, Monfort, and Renault (1993) show that (cid:12) is a consistent and asymptotically normal estimate of the true parameter vector (cid:12) . The reason is that as N becomes large holding M 0 ~ ^ ^ and T (cid:133)xed, (cid:18)((cid:12)) and (cid:18) both converge to the same (cid:147)pseudo(cid:148)true value (cid:18) = h((cid:12) ) where h is a 0 0 nonstochastic function. Accommodating missing data in I-I is straightforward: after generating a complete set of simulateddata, onesimplyomitsobservationsinthesamewayinwhichtheyareomittedintheobserved data. As we have already discussed, we assume that the pattern of missing data is exogenous. In the simulated data, we simply omit observations according to the same pattern. In some cases, it is convenient to estimate auxiliary models in which missing observations are replaced with some arbitrary value (such as 0). In such circumstances, the same principle applies: use the same arbitrary values in both the simulated and observed data sets. In our structural model, the observed data y consists of both continuous and discrete random ^ variables. Discrete random variables complicate the calculation of (cid:12) because the objective function (i.e., the di⁄erence between the constrained and unconstrained values of the likelihood) is discontinuous in the structural parameters (cid:12). Discontinuities arise when applying I-I to discrete choice models because any simulated choice y~m((cid:12)) is discontinuous in (cid:12) (holding (cid:133)xed the set of random it draws used to generate simulated data from the structural model). Consequently, the estimated ~ set of auxiliary parameters (cid:18)((cid:12)) is discontinuous in (cid:12). The non-di⁄erentiability of the objective function in the presence of discrete variables prevents the use of gradient-based numerical optimization algorithms to maximize the objective function and requires instead the use of much slower algorithms such as simulated annealing or the simplex method. Tocircumventthesedi¢ culties, weuseKeaneandSmith(cid:146)s(2003)modi(cid:133)cationtoI-I,whichthey call generalized indirect inference. Suppose that the simulated value of a binary variable y~m equals it 1 if a simulated latent utility u~m((cid:12)) is positive and equals 0 otherwise. Rather than use y~m((cid:12)) when it it computing ~ (cid:18)((cid:12)), we use a continuous function g(u~m((cid:12));(cid:21)) of the latent utility. The function g is it chosen so that as the smoothing parameter (cid:21) goes to 0, g(u~m((cid:12));(cid:21)) converges to y~m((cid:12)). Letting it it 21Whengeneratingsimulateddatasets,theseedinthepseudorandomnumbergeneratoris(cid:133)xedsothatthedraws of the random innovations are the same for di⁄erent values of (cid:12). 17

~ (cid:21) go to 0 as the observed sample size goes to in(cid:133)nity ensures that (cid:18)((cid:12) ) converges to (cid:18) , thereby 0 0 delivering consistency of the I-I estimator of (cid:12) . Our choice of g is 0 exp(u~m((cid:12))=(cid:21)) g(u~m((cid:12));(cid:21)) = it : it 1+exp(u~m((cid:12))=(cid:21)) it Because the latent utility is a continuous and smooth function of the structural parameters (cid:12), g is a smooth function of (cid:12). Moreover, as (cid:21) goes to 0, g goes to 1 if the latent utility is positive and to 0 otherwise. When the structural model contains additional variables that depend on current and lagged values of indicator variables y~m, these additional variables will also be discontinuous in (cid:12). In our it structural model, for instance, variables such as employment duration and job tenure depend on the history of indicator variables such as employment status and job changes. Since employment durationandtenurearediscontinuousin(cid:12),theyalsocontributetocreatingadiscontinuousobjective functionintheestimationprocess. Oursmoothingstrategy,however,ensuresthatallthesevariables will also be continuous in (cid:12), provided that they depend continuously on y~m. In other words, it replacing the indicator functions by their continuous approximations g(u~m((cid:12));(cid:21)) ensures that all it other variables that depend on (cid:12) through g(u~m((cid:12));(cid:21)) are continuous. Care must be taken in it choosing (cid:21), because approximation error in indicator functions for a particular year accumulate in the approximate functions for employment duration and tenure. We searched for a combination of the smoothing parameter (cid:21) and the number of simulations M that generates su¢ cient smoothness in the objective function, while keeping bias small and computation time manageable. The larger these parameters are, the smoother the objective function will be, but large values of (cid:21) introduce bias and large values of M increase computation time. Based upon simulation experiments, we chose a small value of (cid:21), .05, which is large enough to smooth the objective surface su¢ ciently given our choice of 20 for M. Our simulation experiments as well as the parametric bootstrap results reported below indicate that the associated bias in the estimates is small for almost all of our parameters. We use a parametric bootstrap procedure to conduct inference. Given consistent estimates of the structural parameters, we repeatedly generate (cid:147)arti(cid:133)cial(cid:148)observed data sets fromthe structural model, estimate the parameters of the structural model for each such data set, and then calculate the standard deviations of the parameter estimates across the data sets. These standard deviations serve as our estimates of the standard errors of the structural parameter estimates associated with the actual observed data.22 Standard errors of functions of model parameters, such as the impulse response functions and variance decompositions are constructed as the standard deviation across parametric bootstrap replications. 22As a check, we also computed standard errors using a nonparametric bootstrap procedure based on resampling from the PSID for some speci(cid:133)cations. We used 100 replications and obtained similar results. 18

4.2 The Auxiliary Model Our auxiliary model consists of a system of seemingly unrelated regressions (SUR) with 7 equations and 25 covariates that are common to all 7 equations. We implement the model under the assumption that the errors follow a multivariate normal distribution with an unrestricted covariance matrix. One may write the system as (15) Y = Z (cid:5)+u ; u N(0;(cid:6)); u iid over i and t; it it it it it (cid:24) where Y = [E E ;E (1 E );JC E E ;wage ;hours ;earn ;ln(1+wage 2)]; it it (cid:1) i;t (cid:0) 1 it (cid:1) (cid:0) i;t (cid:0) 1 it (cid:1) it (cid:1) i;t (cid:0) 1 (cid:3)it (cid:3)it (cid:3)it (cid:3)it 0 and (16) Z = [Const;(t 1);(t 1)2;BLACK ;EDUC ;ED ;UD ;TEN ; it i i i i i;t 1 i;t 1 i;t 1 (cid:0) (cid:0) (cid:0) (cid:0) (cid:0) E E ;E E ;E (1 E );E (1 E ); i;t 1 i;t 2 i;t 2 i;t 3 i;t 1 i;t 2 i;t 2 i;t 3 (cid:0) (cid:1) (cid:0) (cid:0) (cid:1) (cid:0) (cid:0) (cid:1) (cid:0) (cid:0) (cid:0) (cid:1) (cid:0) (cid:0) JC E E ;JC E E ; i;t 1 i;t 1 i;t 2 i;t 2 i;t 2 i;t 3 (cid:0) (cid:1) (cid:0) (cid:1) (cid:0) (cid:0) (cid:1) (cid:0) (cid:1) (cid:0) wage ;wage ;hours ;hours ;earn ;earn ; (cid:3)i;t 1 (cid:3)i;t 2 (cid:3)i;t 1 (cid:3)i;t 2 (cid:3)i;t 1 (cid:3)i;t 2 (cid:0) (cid:0) (cid:0) (cid:0) (cid:0) (cid:0) wage (t 1);wage (t 1)2;wage JC ; wage JC ;wage E ] (cid:3)i;t (cid:0) 1 (cid:1) i (cid:0) (cid:3)i;t (cid:0) 1 (cid:1) i (cid:0) (cid:3)i;t (cid:0) 1 (cid:1) it (cid:3)i;t (cid:0) 2 (cid:1) i;t (cid:0) 1 (cid:3)i;t (cid:0) 2 (cid:1) i;t (cid:0) 1 0 Since (cid:5) has 25 7 elements and (cid:6) is a 7 7 covariance matrix with 28 unique elements, (cid:2) (cid:2) the auxiliary model has 203 parameters. In contrast, Model A.3 has only 46 parameters that we estimate by I-I (not counting the measurement error parameters, tenure coe¢ cients, and (cid:26) ): As ! we discuss momentarily, a few of the 46 parameters are estimated all or in part using additional moment conditions rather than exclusively by I-I. Consequently, the number of features of the data used to (cid:133)t the structural model greatly exceeds the number of parameters. In estimating the model we use the likelihood function that corresponds to (15). Note that the assumption u N(0;(cid:6)) with u iid over i and t is false for several reasons, including the fact it it (cid:24) thatY containsbinaryvariablesandthatbothwage andln(1+wage 2) appear. However, thefact (cid:3)it (cid:3)it that we use a misspeci(cid:133)ed likelihood a⁄ects the e¢ ciency of our procedure rather than consistency. Our choice of what to include in the auxiliary model is as much a matter of art as science, but is motivated by the following principles. First, we use a common set of right-hand-side variables in the seven equations of the auxiliary model to avoid having to iterate between (cid:5) and (cid:6) to maximize 19

the likelihood function. The disadvantage, however, is that we do not tailor the right-hand-side variables to the particular dependent variable. As a result, the auxiliary model probably contains moreparametersthanareneededtodescribethedata. Furthermore, wearerestrictedinourability to add additional right-hand-side variables to particular equations, such as additional interactions between (t 1) and other lagged variables, because the total number of variables would get out of i (cid:0) hand. Although it would be useful to explore di⁄erentiating the equations of the auxiliary model in future work, our simulations indicate that most of our parameters are quite well determined by the auxiliary model that we have chosen. Thesecondprincipleistoincludevariablesthatappearasexplanatoryvariablesinthestructural model. Thisaccountsforthepresenceof(t 1),(t 1)2,EDUC ;BLACK ;ED ;andTEN . i i i i i;t 1 i;t 1 (cid:0) (cid:0) (cid:0) (cid:0) We also include UD even though we constrain (cid:13)UE to equal 0: Since the model is dynamic and i;t 1 it (cid:0) includes state dependence terms in most equations, we include two lags of each dependent variable except ln(1 + wage 2). The lags help distinguish between state dependence and heterogeneity. (cid:3)it Finally, we include the interaction terms wage (t 1), wage (t 1)2, wage JC , (cid:3)i;t 1 (cid:1) i (cid:0) (cid:3)i;t 1 (cid:1) i (cid:0) (cid:3)i;t 1 (cid:1) it (cid:0) (cid:0) (cid:0) wage JC ; and wage E to help capture any nonstationarity in wages. (cid:3)i;t (cid:0) 2 (cid:1) i;t (cid:0) 1 (cid:3)i;t (cid:0) 2 (cid:1) i;t (cid:0) 1 One disadvantage of our choice for (16) is that the (cid:133)rst three observations for each individual are lost due to lags. This makes it di¢ cult to identify parameters of the models for the initial wage and employment status. It also makes it di¢ cult to identify changes with experience in the variance of shocks at the beginning of a career. In principle, one could add additional equations with 0, 1, or 2 lags to the auxiliary model to accommodate observations with missing data. The cost would be a more complex auxiliary model. (Alternatively, one can set values of missing lags to 0 in both the simulated and actual data.) In simulation experiments we did not (cid:133)nd that adding such equations helped a great deal with identi(cid:133)cation of model parameters. In the next section, we discuss use of the variance of wages when t 5 for each race-education group to help identify (cid:20) parameters of the initial condition for wages. We also discuss the initial condition for employment. 4.3 Use of Additional Moments and Other Information Sources to Identify Parameters ^ ^ In the equation for initial employment we estimate the intercepts b as b = b (cid:27)^ where 0g 0g (cid:3)0g (cid:1) E1 (cid:27)^ = ( ^ (cid:14) EE )2 +( ^ (cid:14) EE )2 +1 and ^ b is the coe¢ cient estimate from a Probit regression of E on a E1 (cid:22) (cid:17) (cid:3)0g it constanqt estimated on PSID data for t 5 for each of 4 groups g de(cid:133)ned by race and whether the (cid:20) person has more than a high school education. We use the (cid:133)rst (cid:133)ve years rather than simply the (cid:133)rst because we have relatively few observations for each group when t = 1.23 23As it turns out, ^b is 0.918 for blacks with a high school degree or less, 1.427 for blacks with more than high (cid:3)0g school, 1.391 for whites with high school or less, and 1.671 for whites with more than high school. We obtained similar results for other model parameters when we constrain ^b to be the same for all groups and use t 3 to (cid:3)0g (cid:20) 20

To identify(cid:27) , we use the fact that model Aimplies that the variance of the observed (residual) !1 wage, wres ; of an employed individual from race-education group g is (cid:3)i1 Var(wres ;g) Var(wage (cid:13)w (cid:13)wX ) = (cid:27)2 +((cid:14)w)2 +(cid:27)2 +(cid:27)2 : (cid:3)i1 (cid:17) (cid:3)i1 (cid:0) 0 (cid:0) X i1 (cid:29)1 (cid:22) !1g mw Because of sample size considerations we estimate Var(wres ;g) as the variance of (residual) wage (cid:3)i1 observations in the PSID corresponding to t 5. Var(wres ) equals .109 for blacks with a high (cid:20) (cid:3)i1 school degree or less, .124 for blacks with more than a high school degree, .109 for whites with high d school or less, and .141 for whites with more than high school. We then obtain (cid:27) setting it !1g to (cid:27)2 = Var(wres ;g) ((cid:14)w)2 (cid:27)2 (cid:27)^2 at each iteration of the I-I procedure, where (cid:27)^2 is !1g (cid:3)i1 (cid:0) (cid:22) (cid:0) (cid:29)1(cid:0) mw mw preset to :07952 on the basis of measurement error studies for the PSID. In Model B, (cid:27)2 does not (cid:29)1 d appear. Estimates of other parameters are not very sensitive to constraining (cid:27)2 to be the same !1g for all groups and estimating (cid:27)2 using t 3 rather than 5: !1g (cid:20) InthecaseofmodelA,wealsousealargenumberofmomentconditionsspanningamuchlonger time span than the 3 lags in our auxiliary model to identify (cid:26) and help distinguish persistence due ! to ! from persistence due to (cid:22) and (cid:29) . For workers who are continuously employed between it i ij(t) t g and t+j and who do not change jobs between t and t+j; (cid:0) (17) cov(wres wres ;wres ) = ((cid:26)j+g (cid:26)g)var(! t g): (cid:3)i;t+j (cid:0) (cid:3)it (cid:3)i;t (cid:0) g ! (cid:0) ! t (cid:0) g j (cid:0) We approximate var(! t g) with a constant plus a third order polynomial in t g. We comt g (cid:0) j (cid:0) (cid:0) pute cov^(wres wres ;wres ) for each j;g combination satisfying 1 j j and 1 < g (cid:3)i;t+j(cid:0) (cid:3)it (cid:3)i;t g (cid:20) (cid:20) max (cid:20) (cid:0) g . We estimate (cid:26) and the parameters of the polynomial approximation by weighted minimum max ! distance using the number of sample observations used to estimate cov^(wres wres ;wres ) (cid:3)i;t+j(cid:0) (cid:3)it (cid:3)i;t g (cid:0) astheweights,eliminatingmomentsestimatedusingfewerthan5observations. Thepointestimates and approximate standard errors for j ;g = 5, j ;g = 6, j ;g = 7, j ;g = 8, max max max max max max max max and j ;g = 9 are .882 (.027), .919 (.020), .927 (.014), .94 (.011), and .94 (.009) respectively. max max We chose :92 as our point estimate.24 For Model A.1, we obtain .913 when we ignore (17) and freely estimate (cid:26) by I-I. ! In model B we make use of an expression for the di⁄erence in the variance of wage growth conditional on JC +E (1 E ) = 1 and conditional on JC +E (1 E ) = 0 to express the it it i;t 1 it it i;t 1 (cid:0) (cid:0) (cid:0) (cid:0) wage innovation variance shift factor (cid:30) in terms of other parameters of the model. See Appendix 2 estimate it. 24The number of moments varies from 2,789 when j and g are 5 to 7,906 when j and g are 9: The max max max max standard errors account for heteroskedasticity but not for correlation among the moments, which use overlapping data. They are probably understated. In the case of Model A.3, equation (17) is an approximation because Model A.3 states that employment transition probabilities depend on ! . This means that the evolution of ! depends on t t the number of periods of continous employment: j+g: We doubt that this is an important problem. Setting (cid:26) to ! .90 instead of .92 has little e⁄ect on the variance decompositions. 21

1.25 We do not use moment conditions analogous to (17) to estimate (cid:26) , although the estimate ! we obtain by I-I, .95, is in the neighborhood of the above values. Finally, when we include tenure in the wage equation (model A.3 in Table 2), we impose Altonji and Williams(cid:146)(2005) estimates of tenure polynomial P(TEN ) based on PSID data for the years it 1975-2001 rather than attempting to estimate it by I-I, which would have required the addition of a number of additional variables to the auxiliary model.26 The parametric bootstrap standard errors reported below do not account for sampling error in the above sample moments of the PSID or in the tenure parameters. 4.4 Mechanics of Estimation Our chosen values of (cid:21) = 0:05 and M = 20 yield a smooth objective function that allows the use of fast gradient-based optimization algorithms with little evidence of bias.27 Not surprisingly given the size and complexity of our models, the objective function displays multiple local optima with respect to some of the parameters. We experimented extensively with di⁄erent starting values to make sure that we are (cid:133)nding the global optimum. We began the process by obtaining estimates of a series of (cid:147)reduced-form equations(cid:148)that correspond to the equations in the structural model. As part of the process, we have used grid evaluations for the set of parameters that appear most problematic, andwehaveusedthesmallermodelstohelpus (cid:133)ndgoodinitial guesses andthenbuild up to more complex ones. The problem is more serious in the case of Model A.1-Model A.3 than for Model B. This is one of the reasons that we make use of the moments (17) to help distinguish (cid:26) from (cid:26) : ! (cid:29) The fact that we have between 39 and 46 parameters estimated by indirect inference, the large size of the auxiliary model, and the number of simulations make computation very time-consuming even though we use a fast gradient based optimization algorithm. To reduce estimation time, we exploit the highly parallelizable structure of our estimation methodology.28 25Due to an error, the condition we actually imposed di⁄ers from the condition derived in Appendix 1. Reestimating the model using the correct condition made almost no di⁄erence for the parameter estimates. We will recompute the bootstrap standard errors and simulations for Model B in a future draft. 26Thepro(cid:133)lethatweusecorrespondstoTable6,PanelD,column2oftheirpaper.Itis.0272563 Ten :0023283 (cid:3) (cid:0) (cid:3) Ten2+:00815Ten3=100 :000914Ten4=1000. The implied return to 2, 5, 10 and 20 years of tenure are .046 (.0064), (cid:0) .008 (.0011), .112 (.016), and .119 (.029). It is obtained using Altonji and Shakotko(cid:146)s (1987) instrumental variables approach, which treats t as exogenous and uses the within job j variation in Ten ; Ten2 ; Ten3 , and Ten4 to ijt ijt ijt ijt identify the e⁄ects of tenure. Our (cid:133)nding of a modest link between t and (cid:29) implies that Altonji and Williams(cid:146) ij(t) estimates are biased downward by a small amount. 27We use a standard quasi-Newton algorithm with line search, which can additionally handle simple bounds on the choice variables. The algorithm approximates the(inverse)HessianbytheBFGSformula, anduses an active set strategy to account for the bounds. Gradients are computed by (cid:133)nite di⁄erences. 28Speci(cid:133)cally,foranygivenvalueofthestructuralparameters,theM simulationsrequiredtoevaluatetheobjective function are essentially independent and can be conducted simultaneously by k di⁄erent processors. Using our parallelized computer algorithm on k M +1 (balanced) processors reduces computation time by a factor of about (cid:20) M dk M(cid:0) 1e,where de istheceilingfunction. AllprogramsarewritteninFORTAN90. Theparallelizationisimplemented 22

4.5 Local Identi(cid:133)cation and Analysis of Estimation Bias We have chosen the auxiliary model with an eye toward distinguishing among state dependence, (cid:133)xed heterogeneity, and transitory shocks and an eye toward establishing links across equations in the heterogeneity components. However, one cannot easily verify that the parameters of our model are identi(cid:133)ed by matching up the parameters against sample moments. In particular, the fact that the number of moments that play a role in the likelihood function of the auxiliary model is much larger than the number of structural model parameters does not establish identi(cid:133)cation of any particular parameter. Consequently, we use Monte Carlo experiments extensively to establish local identi(cid:133)cation and analyze the adequacy of our auxiliary model given the sample size and demographic structure of the available data and to check for bias. For a hypothesized vector of parameter values, we simulate data and then verify that the parameter values that maximize the likelihood function of the auxiliary model are close to the hypothesized values. Using a number of model speci(cid:133)cations, including ones that di⁄er somewhat from the ones presented in the paper, we informally experimented with varying parameter values to get a sense of how robust identi(cid:133)cation is to the particular values. We also used these experiments to investigate whether the objective function has (cid:135)at regions near the solution, or multiple global optima. In general we have found that identi(cid:133)cation of most of the parameters is quite robust. However, our Monte Carlo studies also indicate that a few of the parameters are poorly determined given the sample size. We also found local optima involving alternative combinations of subsets of the parameters. Bringing in additional information through the moment conditions described above solved the most serious problems. However, some of the parameters remain sensitive to changes in the auxiliary model, and starting values must be chosen carefully. This is particularly true of the coe¢ cients of the experience pro(cid:133)les in the EE, UE and to a lesser extent, the JC equations, for reasons that we not fully understand. In table 2 below there is also evidence of bias for some of the parameters in these equations. Overall, however, the relatively small values of the bootstrap standard errors in the tables below indicates that for the sample size and demographic structure of the PSID sample, our auxiliary model is quite informative about most of the model parameters. Furthermore, in almost all cases the means of the bootstrap replications are close to the point estimates, indicating that the degree of bias in the procedure is small for most of our parameters. using the Message Passing Interface (MPI). We estimate the model using 21 CPUs supplied by 11 Intel Xeon 5150 dual core processors, which reduces estimation time by a factor of 20, to between 2 to 7 hours. 23

5 Empirical Results First, we discuss the parameter estimates for Model A, with heavy emphasis on Model A.3. Model A.3 is basically the model presented above with the duration dependence term in the UE equation restricted to 0 and (cid:14)(cid:29) set to 0: We also present simpler versions of Model A. Model A.2 is identical JC to Model A.3 except that we exclude tenure terms from the wage equation. Columns 1a-1c refer to Model A.1, which in addition excludes wage0 it from EE and (cid:29) ij(t 1) and (cid:29) 0ij (t) from JC, and (cid:0) 0 allows (cid:14)(cid:29) to be nonzero. We also brie(cid:135)y discuss results for Model B, with emphasis on the wage JC equation. Second, we evaluate the (cid:133)t of the model by comparing means and standard deviations of the PSID data to the corresponding values based on simulated data from the model and by comparing simple regression relationships in actual and simulated data. Third, we present impulse response functions. Finally, we decompose the variance of wages, hours, and earnings into the contributions of the main types of shocks in our model. In the case of Model A.3, inference is based on 300 bootstrap replications. Because the bootstrap procedure is very computer intensive, we only use 100 replications for the other models. 5.1 Parameter Estimates for Model A Columns 3a, 3b, and 3c of Table 2 report parameter estimates, the means of the parametric bootstrapestimates, andstandarderrorestimatesforModelA.3. Columns2a-2crefertoModelA.2 and Columns 1a-1c refer to Model A.1. The row headings indicate the variable or error component that the parameter estimates correspond to and also list the parameter names. The estimates are grouped by equation, beginning with EE. 5.1.1 Employment Transitions and Job Changes The EE coe¢ cients on t and t2 imply that the latent variable determining E conditional on it E = 1 declines slowly with t until t is about 12 and then rises slowly. However, the di⁄erence i;t 1 (cid:0) between t = 30 and t = 0 in the latent variable is only .21, and the e⁄ect on the odds of a transition is small because the EE probability is very high. Since the value of min(ED ;9) is rising rapidly t 1 (cid:0) over the (cid:133)rst few years in the labor market, the overall relationship between EE and t is weak. The point estimates should be taken with a grain of salt, because the relative values of the constant, the coe¢ cients on t and t2; and the coe¢ cient on min(ED ;9) are sensitive to the exact speci(cid:133)cation. t 1 (cid:0) Furthermore, the bootstrap replications provide evidence of bias. The coe¢ cient on min(ED ;9) is .044 (.018), indicating modest positive duration dependence t 1 (cid:0) in the odds of remaining employed. Below we show that a regression of E on ED conditional it i;t 1 (cid:0) on E = 1 gives similar results in data simulated from Model A.3 and in PSID data, which i;t 1 (cid:0) 24

indicates that the combined e⁄ect of duration dependence and unobserved heterogeneity in the model does a good job of matching the weak positive state dependence found in the data. The coe¢ cient on wage0 is -.058 (.076). At (cid:133)rst glance the negative sign would seem to be it opposite the expected sign because there is only a substitution e⁄ect at 0 work hours. However, it is probably better to think of employment at the survey date as a movement along the extensive margin when one views labor supply from the perspective of a year. In any event, the coe¢ cient is not statistically signi(cid:133)cant and the implied e⁄ect on the EE probability is small. The UE equation is the least satisfactory equation in the model. The estimated t pro(cid:133)le implies that the exit probability declines with experience and then increases, but the standard errors are very large. As we document below, the model does a poor job of tracking the relationship between UE and t in the data. We experimented with models that included UD but had di¢ culty i;t 1 (cid:0) estimating the duration coe¢ cient, perhaps because the overall number of unemployment spells is small and relatively few individuals were unemployed for two or more surveys in a row. (Most work on duration dependence in unemployment spells uses weekly, monthly or quarterly data). Simulations reported below show that the equation without state dependence is consistent with the negative link between UE and UD found in the PSID, presumably because of the important i;t 1 (cid:0) role played by permanent heterogeneity. However it understates that relationship to some extent. The latent variable for JC declines slightly with t over the (cid:133)rst 20 years, but is strongly decreasing in job tenure. The coe¢ cient on TEN is :0673 (:0156), indicating that 10 years of i;t 1 (cid:0) (cid:0) seniority shift the index determining JC by .673 standard deviations of the job change shock "JC. it It is noteworthy that we obtain a large negative e⁄ect of tenure on JC even after accounting for unobserved person speci(cid:133)c heterogeneity ((cid:22) and (cid:17)) and for job match heterogeneity. The job match components (cid:29) and (cid:29) play an important role in job mobility without ij(t 1) 0ij (t) (cid:0) 0 unemployment, andtheyhave signs andrelative magnitudes that are consistent withthe theoretical discussion above. The coe¢ cient on (cid:29) is -.923 (.127). To get a sense of the magnitude, note ij(t 1) (cid:0) thatthestandarddeviationof(cid:29) is.273forapersonwith10yearsofexperience. Consequently, ij(t 1) (cid:0) a one standard deviation increase in (cid:29) lowers the JC index by -.252. Since the coe¢ cient on ij(t 1) (cid:0) TEN is -.067, this is roughly equivalent to the e⁄ect of 3.8 years of seniority. An increase from 0 to .273 in the value of (cid:29) lowers the probability of a job change for a white individual with ij(t 1) (cid:0) 12 years of education, 2 years of seniority, 10 years of experience, (cid:22) = 0 and (cid:17) = 0 from .130 to .084, keeping (cid:29) constant. The current value (cid:29) has a coe¢ cient of .594.29 A one standard 0ij (t) 0ij (t) 0 0 deviation shock to (cid:29) raises the job change probability by 0.066. 0ij (t) 0 Thus far, we have not discussed the role of race and education or the unobserved individual 29Thetotale⁄ectof(cid:29) ontheJC probabilityis.029. Itissmallerthanthepartiale⁄ectbecauseaunitshift ij(t 1) in (cid:29) shifts the distrib(cid:0)ution of (cid:29) by .625. ij(t 1) 0ij (t) (cid:0) 0 25

heterogeneity terms. BLACK has a substantial negative e⁄ect on the latent variable for EE and a substantial negative e⁄ect on UE, while EDUC has a substantial positive e⁄ect on both. In the JC equation BLACK is positive, with an e⁄ect that is equivalent to having about 2.5 fewer years of tenure. EDUC does not alter JC. The coe¢ cient on the (cid:147)innate ability(cid:148)factor (cid:22) is .443 (.094) in the EE equation and .637 (.128) in the UE equation. Since the standard deviations of (cid:22), (cid:17), "EE, and "UE are all 1, the factor loadings imply that (cid:22) accounts for 15.6% of the error variance in it it the EE equation and 27.9% in the UE equation. The productivity factor (cid:22) has a coe¢ cient of -.280 (.136) in the JC equation. All three results are sensible in light of the fact that (cid:22) has a positive sign in both the wage and hours equations. In thinking about the magnitudes, keep in mind that the factor loadings represent the partial e⁄ect of the heterogeneity components in a given period holding spell duration constant. The mobility/hours preference component (cid:17) is normalized to have a positive sign in the JC equation. It enters the EE, UE and JC indices with coe¢ cients of -.237 (.107), .222 (.187), and .531 (.100), respectively, and accounts for 4.5%, 3.4%, and 19.7% of the error variances of these equations. The magnitude implies only a small e⁄ect on the EE transition probability and the sign intheEE equationispositiveintheothermodels. Theresultssuggestthat(cid:17) raisestheprobabilities of transiting out of unemployment and of moving from job to job without unemployment. It has a coe¢ cient that is essentially zero in the hours equation in the case of Model A.3 but is larger in the other speci(cid:133)cations. Across Models A.1, A.2, A.3 and B.1, the relative importance of (cid:22) and (cid:17) varies somewhat. 5.1.2 The Wage Model We begin with the parameters of the autoregressive component, ! = (cid:26) ! +(cid:13)! (1 E )+(cid:13)! (1 E )+"!: it ! i;t (cid:0) 1 1 (cid:0) Eit (cid:0) it 1 (cid:0) Ei;t (cid:0) 1 (cid:0) i;t (cid:0) 1 it The estimated standard deviation of the initial condition ! varies with race and education but i1 is between .248 and .306 in Model A.3. The autoregressive coe¢ cient (cid:26)^ is .92, which implies ! considerable persistence but is well below unity. The shocks "! have a standard deviation of it .095 (.003). This value strikes us as large given that we separately account for the e⁄ects of job speci(cid:133)c error components. The only other study we know that allows for a persistent general wage component, a job speci(cid:133)c error term, and endogenous mobility is Low et al (2008). They obtain a value of .104 but set (cid:26) = 1:30 In the data, the standard deviation of wage changes for stayers is ! .136 after adjusting for measurement error. The coe¢ cients of -.1895 (.010) on (1 E ) and .1041 (.013) on (1 E ) imply that being it i;t 1 (cid:0) (cid:0) (cid:0) unemployed at the survey date has a large e⁄ect on the mean of the wage that persists for some 30They do not include the (1 E ) terms in their speci(cid:133)cation. it (cid:0) 26

time, even when the value of lost tenure is held constant. As will become clear from the impulse response functions, unemployment also leads to a loss of tenure as well as to a reduction on average in the value of the job match component, which implies further reductions in wages.31 The coe¢ cient (cid:14)w on (cid:22) is only .049 (.028), so the direct contribution of unobserved permanent (cid:22) heterogeneity to the variance of wages is small once we account separately for job match heterogeneity. Note, however, that (cid:22) also has an additional e⁄ect on wages through its connection to employment transitions and job changes. One should also keep in mind that (cid:22) is net of the e⁄ect of X , which contains the important permanent variables EDUC and BLACK; and that the stanit dard deviation of the initial condition ! is large. Almost 30 percent of the e⁄ect of ! is still i1 i1 present at t = 15: Note that the component (cid:22) is much more important in Model A.1 and in Model B.1, perhaps because there is no selective quit behavior in A.1 and B.1 does not incorporate a job speci(cid:133)c wage component at all. It is also about twice as important in the SRC sample. The parameters of the job match component (cid:29) are quite interesting. The initial condition ij(t) (cid:29) has a standard deviation of .197 (.021) in A.3. The autoregression parameter (cid:26) is .625 (.022) ij(1) (cid:29) and the value of (cid:27)^ is large: .269 (.007). As we have already noted, the substantial persistence of (cid:29) (cid:29) across jobs suggests that wage o⁄ers are based in part on salary history, that demand shocks ij(t) mayre(cid:135)ectnarrowoccupation, industryandregionandthusmaynotbeentirelyjobspeci(cid:133)c, and/or that the search network available to workers depends on job quality. As we shall see below, the contribution of the job speci(cid:133)c component to the variance of wages and earnings is substantial. As we discuss in Appendix 2, one can use Model A.3 to decompose E(wage t); the experience it j pro(cid:133)le of wages, into the contributions of general human capital P(t), job mobility E((cid:29) ) t), and ij(t) j accumulated job seniority (cid:13)w E(P(T ) t). Figure 1 graphs the components and thus addresses TEN it j the fundamental question of what accounts for wage growth over a career. Most of the return to potential experience is due to general skill accumulation. Job shopping and the accumulation of tenure account for 14.1 percent and 13.1 percent, respectively, of the overall growth of wages over the (cid:133)rst ten years. They account for 13.7 percent and 15.0 percent of growth over the (cid:133)rst 30 years.32 Using social security records for quarterly earnings (rather than hourly wage rates) Topel and Ward (1992) (cid:133)nd larger gains from job mobility early in careers than we (cid:133)nd. We suspect their 31In Models A.1 and A.2, which exclude tenure from wages, (cid:13)^! is about -.22 and (cid:13)^! is about .10. 32Using the PSID Buchinsky et al (2008) estimate a simultan 1 e(cid:0)o E u i s t model of employme 1 n(cid:0)t, E j i; o t (cid:0)b 1 mobility, and wage rates that incorporates tenure e⁄ects, general experience, and job speci(cid:133)c error components. They (cid:133)nd a large e⁄ect of human capital accumulation and returns to seniority that are more than double the values from Altonji and Williams (2005) that we impose but do not present estimates of the gains from job mobility. Bagger et al (2007) do not allow for a direct e⁄ect of seniority on wages such as would arise from shared investment in (cid:133)rm speci(cid:133)c capital butobtainanindirecte⁄ectthatarisesthroughtheresponseof(cid:133)rmstooutsideo⁄ers. Theyattributeaveragegrowth of wages within the (cid:133)rm and growth of wages across (cid:133)rms to job search. Using Danish matched employer/employee data they (cid:133)nd that human capital accounts for about half of all wage growth for workers with more than 12 years of education that occurs after the (cid:133)rst (cid:133)ve years in the labor market. Human capital accumulation is neglibible for less educated groups. 27

estimates are overstated by the school to work transition and growth across jobs in hours worked, while ours are understated because we miss some job changes and we do not use the (cid:133)rst three years of wages.33 5.1.3 Hours and Earnings In the hours equation (cid:13)^h, the e⁄ect of E ; is .413 (.008). This indicates that unemployment at it E it the survey date is associated with relatively long completed spells of nonemployment. Short spells will tend to be missed given our point in time measure at annual frequencies. They will show up in the hours component "h. The wage elasticity is small and negative, which is consistent with a it large literature on the behavior of male household heads. The coe¢ cients on (cid:22) and (cid:17) are .125 and .0145 respectively, suggesting only a modest role for individual heterogeneity (net of EDUC and BLACK) in annual hours in a given year. However, permanent heterogeneity turns out to be quite important over the lifetime. The importance of (cid:22) relative to (cid:17) varies across the speci(cid:133)cations. The standard deviation of "h is .169, indicating substantial year to year variation in hours even when it the job speci(cid:133)c component (cid:24) does not change. The standard deviation of (cid:24) is large(cid:151).157 (.014). Turning to earnings, recall that the coe¢ cients (cid:13)e and (cid:13)e are constrained to equal 1. The w h earnings component e has an autoregression coe¢ cient of .553 (.007) and the standard deviation it of the shock "e is .211 (.002). it 5.2 Estimates of Model B Columns 4a-4c of Table 2 report estimates for Model B.1. As we noted earlier, Model B does not containjobspeci(cid:133)cwageorhourscomponents. However, itallowstheautoregressioncoe¢ cientand the standard deviation of wage shocks to shift when individuals change jobs or leave unemployment. The speci(cid:133)cation is closest to that of Model A.1. We had relatively little di¢ culty estimating this model and estimate the autoregressive parameter (cid:26) by indirect inference rather than by using the ! moment conditions (17), which do not apply to Model B.1 without modi(cid:133)cation. The results for Model B.1, in keeping with those for Model A, provide clear evidence that job changes, whether with or without unemployment, involve substantial wage risk. The coe¢ cient on lagged wages is .958 for job stayers but only .723 if the job changes. At the same time, wage innovations have a standard deviation of .0934 (.002) for stayers but are about 3 times larger for persons who have changed jobs ((1+2.10)*0.0934). The estimate of (cid:14)w is .151 (.012), so permanent (cid:22) heterogeneity plays a more important role in Model B.1 than in A.3.34 33Model A.1 does not allow for selective quit behavior but includes the term (cid:13)(cid:29)JC . The coe¢ cient (cid:13)^(cid:29) is :042, 0 it 0 which indicates that on average JC increases (cid:29) by .042. Models A.1 and A.2 exclude tenure e⁄ects on wages. ij 34We experimented with adding the health status equation (18) H =I[(cid:13)H +(cid:13)H(t 1)+(cid:26) H +(cid:14)H& +"H >0] it 0 t (cid:0) H i;t (cid:0) 1 & i it 28

5.3 Evaluating the (cid:133)t of the Model: We simulate careers for 138,960 individuals using the parameter estimates for Model A.1, A.2, A.3 and Model B.1(cid:151)30 for each individual in the PSID sample. From each simulated career we select data so that the temporal pattern, education level, and race matches that of a corresponding PSID case. Note that in all cases the simulated variables incorporate measurement error. We examine the (cid:133)t of the model in two ways. First, we compare the means and standard deviations of the key variables implied by the model with corresponding values from the PSID. We then turn to a comparison of the regression relationships among key variables that are implied by the model with those of the corresponding PSID estimates. 5.3.1 Predicted and Actual Mean and Standard Deviations of Key Variables, by Potential Experience Figure 2 compares the actual standard deviations of wage , earn and hours in the PSID to the (cid:3)it (cid:3)it (cid:3)it 95 percent con(cid:133)dence interval estimates of the standard deviations based on data simulated from ^ Model A.3. The standard error bands in the (cid:133)gure re(cid:135)ect both sampling error in (cid:12) and sampling error due to randomness in the careers of the individuals in a particular sample.35 The bands are tight, which means that we sometimes conclude that the PSID values are statistically di⁄erent from the model predictions even when the values are close in economic terms. Across experience levels, the model overpredicts SD(wage ) by about 6% (not shown) and (cid:3)it implies less of an increase with experience than we (cid:133)nd in the actual data. We suspect that most of the discrepancy is due to the fact that we have removed the e⁄ects of education and race from the PSID wage measure, but include them in the equations for EE, UE; and JC in the model. Consequently, the e⁄ects of race and education on wages, hours, and earnings that to models that are very similar to A.1 and to model B, where H is an indicator for poor health, and is 1 for those it who answer yes to the question, (cid:147)Do you have any physical or nervous condition that limits the type of work or the amount of work you can do?(cid:148), & captures (cid:133)xed heterogeneity in health, and "H is an iid health shock. We added i it the indicator H to the EE, UE, wage, and hours equations. We also added an equation for H to the auxiliary it it model and added two lags of H to all equations of the auxiliary model. The heterogeneity term & accounts for it 67% of the variance of the composite error term for the health latent variable. We also (cid:133)nd strong state dependence in health and (not surprisingly) that health status worsens with age. The e⁄ects of health are small for EE, UE, and for wages. Work hours are about 6% lower for people in poor health, everything else equal. The relatively small e⁄ectsofhealth,atleastaswehavemeasuredit,onemployment,wages,andhoursimplythathealthshocksaccount for little of the variance in career earnings. We focus on models without the health equation for this reason. See Vidangos (2008) for models with health and permanent disability. 35We obtain the distribution of SD(wage ) implied by the model as follows. First,we obtain the point estimate (cid:3)it SD(wage ) by using the point estimates of (cid:12) for Model A.3 to simulate 30 careers for each member of the PSID. (cid:3)it (Thatis,wepreservetherace,educationandexperiencemixoftheavailabledata.) Toobtainthestandarddeviation odf SD(wage ) given the PSID sample size and demographic properties, we repeat the simulation for each of the 300 (cid:3)it parametricboostrapestimatesof(cid:12) usingonly1careerforeachmemberofthePSIDandthencomputethestandard devdiation of the 300 values of SD(wage (cid:3)it ): The bands we report are the point estimate SD(wage (cid:3)it ) plus or minus 1.984 SD(SD(wage )). Other variables are handled in similar fashion. For each value of t in the table the results (cid:3)it are the average over t 1, t; and t+1 with the exception of t=40; which is the averagedfor t=39 and t=40. (cid:0) 29

operateindirectlythroughEE, UE, andJC ratherthandirectlythroughwagesandhoursin(cid:135)uence the model predictions for SD(wage ) but not the PSID value.36 The values for Model A.2 (not (cid:3)it reported) are reasonably similar to A.3, while the values for Model B.1 in Appendix Figure A1 shows a (cid:135)atter pro(cid:133)le for SD(wage ). (cid:3)it The sample value for SD(hours ) is .285, which lies a bit below the model value .293. The (cid:3)t model implies that SD(hours ) varies little with t and misses the increase whent is 40, which might (cid:3)t re(cid:135)ect the e⁄ects of partial retirement not captured by the experience pro(cid:133)les in the model. The results for Model A2 and B.1 are similar. The actual and simulated SD for earnings are 0.567 and 0.585, respectively. However, there is an erratic pattern in the data that is not matched by the model predictions, which display a smooth hump-shape pattern with a peak around t = 20. The error at t = 40 for earnings mirrors the error for hours and probably re(cid:135)ects partial retirement. The left panels of Figure 3 compare the PSID values and the model predictions for the mean of E and for the mean of JC conditional on E = 1 and E = 1. The PSID values lie close to t t t t 1 (cid:0) the model estimates. The overall mean for E is .967 in the data and .960 based on the model. t Overall, the model overstates JC by about .014 but tracks the experience pro(cid:133)le fairly closely. t The upper right panel of Figure 3 reports the sample means and simulated means of EE transitions, which match reasonably well. However, the lower right panel shows that A.3 does poorly in explaining the experience pro(cid:133)le of exits from unemployment (UE). The actual and simulated means of UE are .74 and .69, a substantial discrepancy. Furthermore, the model does not track the experience pro(cid:133)le well.37 Model B.1 does somewhat better than model A.3 despite the fact that the EE and UE equations of A.3 and B.1 are the same (Appendix Figure A2). Figure 4 examines the behavior of the mean of TEN, ED, and UD. The (cid:133)ts for TEN and UD are reasonably close, although the two models overpredict UD by an average of about 0.4 years. 36Wecouldnotthinkofaneasywaytocheckthis,butinearlierworkexcludingeducationandracefromtheseequations, the model closely matched the standard deviations for SD(wage ). There were still some minor discrepancies (cid:3)it in the experience pro(cid:133)les. 37Aswasmentionedearlier,weexperienceddi¢ cultyinestimatingthee⁄ectoftintheUEequation,byI-I,whether or not we allow for state dependence. We also had di¢ culty with the EE equation. We are puzzled as to why. It may be that longer lags than we use would be helpful in pinning down the parameters. For the UE speci(cid:133)cation reported in the paper we also estimated the UE equation by maximum likelihood, treating ((cid:14)UE)2 +((cid:14)UE)2 as a (cid:17) (cid:22) single parameter. Even though UE does not depend on duration, this is only an approximation. The problem stems from the fact that (cid:17) and (cid:22) appear in both the EE and UE equations. Consequently, the distributions of (cid:17) and (cid:22) conditional on E = 0 depend on t 1, BLACK; and EDUC; and the stochastic components that t 1 in(cid:135)uence the wage. They w(cid:0)ill not have a norma (cid:0) l distribution. In any event, the MLE estimates (standard errors) E = -.838 .121EDUC -.439BLACK +.33t=10 -.048t2=100 ; [((cid:14)UE)2+((cid:14)EE)2]:5 =:630 are i(cid:3)t (cid:17) (cid:17) (.439) (.029) (.109) (.273) (.066) (:119) (N=1065, number of individuals contributing spells=748.) The I-I estimates of the cod¢ cients on BLACK and EDUC are somewhat larger. The experience pro(cid:133)les are also quite di⁄erent. The I-I estimate of [((cid:14)UE)2+((cid:14)EE)2]:5 (cid:17) (cid:17) is .675, which is close to the MLE estimate. We cannot take the same approach with the EE equation due to state dependence in that equation. d 30

The models overpredict ED by a substantial amount. This is probably attributable to our use of TEN as the initial value for ED when an individual (cid:133)rst enters the sample (see the Data Section). The results for the models A.2 and B.1 are similar. 5.3.2 Comparison of Regression Relationships Among Key Variables Tables3a-3dreportaseriesof regressions. The(cid:147)a(cid:148)columnsarebasedonthePSIDsample, andthe (cid:147)b(cid:148)and (cid:147)c(cid:148)columns are based on data simulated using the estimates of Model A.3 and Model B.1. We also report robust panel standard errors for the PSID estimates.38 Columns 1a and 1b of Table 3a report regressions of E on (t 1)=10; (t 1)2=100, and ED conditional on E = 1. This is t i i t 1 t 1 (cid:0) (cid:0) (cid:0) (cid:0) a stripped down version of the EE equation in the structural model. There are some di⁄erences in the experience pro(cid:133)les. The coe¢ cient on ED is .0025 in the PSID and .0031 in the simulation, t 1 (cid:0) a close correspondence. Columns 2a and 2b report results for a version of the UE equation. The di⁄erences in the coe¢ cientsontheexperiencepro(cid:133)learesubstantial,afactthatisre(cid:135)ectedinthefailureofthemodel to(cid:133)ttheexperiencepro(cid:133)le. Themodelunderstatesthedegreeofpersistenceinunemploymentspells to some extent. The equations for JC in columns 3a and 3b match fairly closely, although the weights on the experience terms are somewhat di⁄erent. Table 3b examines the dynamics of wages. When only one lag is included, the coe¢ cient on wage is .885 in the PSID and .905 in the simulation. When two lags are included, the sums of (cid:3)t 1 (cid:0) the coe¢ cients are very close but there is a substantial di⁄erence in the coe¢ cient pattern. The coe¢ cient on JC is small and negative in the PSID and in the simulated data for A.3. It is small t and positive for B.1. Table 3c examines hours. The results based on the simulated data and actual data match reasonably closely, although the sum of the coe¢ cients on the lags of hours is about .6 in the simulated data and about .55 in the actual data (columns 1a and 1b). The wage coe¢ cient is essentially 0 in the actual data and -.0191 in the simulated data(cid:151)a close correspondence. Finally, in Table 3d we report earnings regressions. Note that all of the dynamics in earnings stem from dynamics in the wage, hours, and the autoregressive earnings component e : The sum it of the coe¢ cients on earn and earn is .873 in the PSID data and .800 in simulated data, (cid:3)i;t 1 (cid:3)i;t 2 (cid:0) (cid:0) so that the model understates the persistence of earnings by a small amount. There is also some 38For the simulated data the point estimates are based upon a sample 30 times as large as the PSID with the samedemographicstructure. Thecoe¢ cientstandarderrorsreportedforthesimulateddataarebasedonthislarge sample and are intended to provide a sense of numerical accuracy rather than sampling error. We could provide the latterbyestimatingtheregressionsoneachofaseriesofsimulatedsamplesthatmatchPSIDdemographicstructure with only one career per person. One sample in the series would be created using one of the 300 values of the model parametersobtainedthroughourparametricbootstrapprocedure. ThePSIDstandarderrorsshouldprovidearough guide to whether the coe¢ cients based on the simulated data are di⁄erent from the PSID regression coe¢ cients. 31

di⁄erence between the data and the model in the coe¢ cients on wage and hours (Column 2a and (cid:3)t (cid:3)t 2b). Overall, the match between the model and the data is good, although there is room for improvement, particularly in the case of UE: 5.4 Mean and Variance Impulse Response Functions Figures 5a-c report impulse responses to shocks that occur when t = 10. The point estimates are constructed as follows. First, using the parameter estimates for Model A.3, we simulate a large number of cases through t = 9. Then we impose the shock indicated in the (cid:133)gures in period 10. After that, we continue the simulation in accordance with the model parameters. The (cid:133)gures show the mean paths of wages, hours and earnings relative to the base case. The base case represents the mean of the simulated paths in the absence of the speci(cid:133)ed intervention in period 10.39 Since wages and hours are re(cid:135)ected in earnings with coe¢ cients of 1, we focus on earnings to save space. The diamond line in Figure 5a reports the response of the mean of earn to a one it standard deviation shock to "!, the error term in the autoregressive component of wages. Earnings it rise by about .088, and the e⁄ect slowly decays, governed by the value .92 for (cid:26)!. The pattern w for earnings closely mirrors the response to wages because the coe¢ cient on the wage is 1 and the response of hours to the wage is small. The line with circles shows the e⁄ect of becoming unemployed when t = 10: The pattern is very interesting. The log of earnings drops by about -.62, recovers by more than two thirds after one year, and then slowly returns to the base case. The initial drop is the combination of a drop of about -.38 in log hours and a drop of about -.25 in the wage. Hours recover almost completely after one period. The wage increases by about .08 in the (cid:133)rst year and recovers slowly after that. The drop in wages is due to three main factors. First, the distributed lag coe¢ cients on unemployment in the wage equation and (cid:26)^ indicate that unemployment reduces ! by -.190 (0.010) ! it followed by an increase a year later of .104 (0.013) plus .190*(1-.92) if the person leaves unemployment. After that, the response of ! to unemployment is governed by (cid:26)^ . Second, the loss of it ! tenure lowers the wage by an average of .064 relative to the baseline average for persons at t = 10: Third, since there is no selectivity in the job change induced by the unemployment spell, on average workers su⁄er a decline in (cid:29) equal to (1 (cid:26) )E((cid:29) t = 10); or .027. On average, endogenous ij(t) (cid:29) ij(t) (cid:0) j mobility following the unemployment spell leads (cid:29) to back up toward the base case mean for a ij(t) given value of t: The pattern of a long-lasting impact of unemployment on earnings is broadly consistent with a 391.984 standard error bands were obtained by computing impulse responses using each of the 300 values of the model parameters obtained by parametric bootstrap (100 values in the case of B.1). The bands are quite narrow, and we omit them to avoid cluttering the (cid:133)gures. They are available upon request. 32

number of previous studies, including Jacobson, Lalonde, and Sullivan (1993), who use establishment earnings records. Using the PSID and a (cid:133)xed e⁄ects strategy, Stevens (1997) (cid:133)nds a 30% drop in earnings and a 14% drop in wages in the year of a layo⁄. Earnings recover substantially in the (cid:133)rst year, but wages recover very slowly. Her estimate of the initial earnings loss is smaller than ours, perhaps because those who are laid o⁄do not necessarily become unemployed, and those who are unemployed at the survey date tend to be in a long spell. Our model and the PSID data permit us to examine e⁄ects that operate through wages and hours separately, as well as to identify the speci(cid:133)c channels of in(cid:135)uence.40 Finally, the (cid:133)gures report the response of wages, hours, and earnings to an exogenous job change. In this case, JC is set equal to one in period 10 for individuals with E = E = 1 it t t 1 (cid:0) which one should think of as resulting from a large positive realization of the iid component "JC it that negatively a⁄ects the relative attractiveness of the current job rather than from a large draw of (cid:29) . The line marked with "x" shows the average response. The negative e⁄ect on earnings 0ij (t) 0 re(cid:135)ects the value of lost tenure (.063). Since the job change is not selective on (cid:29) , (cid:29) declines ij ij(t) by (1 (cid:26) )E((cid:29) t = 10) or .027. The line with triangles is the e⁄ect of an exogenous job change (cid:29) ij(t) (cid:0) j that is accompanied by a value of "(cid:29) that is one standard deviation above its mean, or .269. The ij(t) net positive e⁄ect is large and highly persistent. These results are mirrored in wages (Figure 5b). In addition, we show the e⁄ect of an exogenous job change that is accompanied by a 1 standard deviation increase of .157 in the job speci(cid:133)c hours component (cid:24) . This is associated with a ij(t) positive increase in hours worked and in earnings that decays in half in the (cid:133)rst few years but slowly thereafter. Since (cid:24) is independent across jobs, the persistence stems from the fact that ij(t) when t is greater than 10, job changes with or without unemployment are infrequent.41 We also use the model to estimate the e⁄ects of an exogenous job loss and an exogenous job change on earnings risk using the methodology described above. The circle line in Figure 6a graphs the ratio of var(earn earn ) following an exogenous unemployment shock when t = 10 to it i;t 1 (cid:0) (cid:0) the baseline variance for the model. The variance ratio is slightly below 1 when t = 10, it is 1.46 when t = 11; declines to 1.17 when t = 12; and then slowly declines to 1 over the next ten years. The corresponding ratio for var(earn ) is about .83 when t = 10; presumably because di⁄erences it 40Kletzer (1998) surveys the literature on job loss and wages. A number of studies examine how employer and industry tenure a⁄ects the size of the loss. When the problem of unobserved worker heterogeneity (but not job heterogeneity) is addressed there appear to be modest tenure e⁄ects of the loss that are consistent with Altonji and William(cid:146)s (2005) estimates used here. Neal (1995), Carrington (1993) and Parent (2000) argue that industry tenure is more important than (cid:133)rm tenure. Kambourov and Manovskii (2009) argue that occupational tenure is more important than (cid:133)rm or industry tenure. As we noted earlier, one could extend the model we consider to include industry and occupation transition equations, but leave this to future research. 41We also computed, but do not report, the e⁄ects of shocks that occur when t = 3. The immediate e⁄ect of unemployment on earnings and wages is somewhat smaller than when t = 10 because the decline in tenure and in (cid:29) is smaller. The e⁄ects are also less persistent. Job changes accompanied by shocks to (cid:29) and to (cid:24) also have less persistent e⁄ects. 33

in wages matter less when everyone is unemployed, 1.08 when t = 11, and then slowly declines to about 1.03. The exogenous job change induces a big spike in the ratio var(earn earn ) it i;t 1 (cid:0) (cid:0) when t = 10. The corresponding ratio for var(earn ) rises slowly following the shock, presumably it because in some cases the exogenous job change induces additional ones. We have produced corresponding (cid:133)gures for shocks at t = 3 (not shown). The impact on the variance is somewhat smaller and less persistent. 5.5 Variance Decompositions We have used our model to measure the relative importance of the initial condition and shocks to the autoregressive wage component, the iid hours shocks, the iid earnings shocks, job changes and employment spells and the associated shocks, the permanent heterogeneity components (cid:22) and (cid:17), and the e⁄ects of education and race. However, because the sample overrepresents blacks, we report variance decompositions using the white sample. To do this, we (cid:133)rst compute the variance in the sum of the annual values of earn , wage , and hours over a 40 year career. We then repeat the it it it simulation after setting the variance of the particular random component in the model to 0. We use the drop in the variance relative to the base case as the estimated contribution of the particular type of shock. Since the model is nonlinear, the contributions do not sum to 100% and may be negative. We have normalized them to sum to 100. We report results for the levels of variables, accounting for the experience pro(cid:133)le in all variables. The decompositions of the sums of the annual values of logs of earnings, hours, and wages are similar (not reported). We use the parametric ^ bootstrap distribution of the (cid:12) to estimate the standard deviation of variance contributions, which are reported in parentheses. We continue to focus on Model A.3, but also brie(cid:135)y discuss results for Model A.2 and B.1. The results for Model A.3 are in Table 4a. The (cid:133)rst row refers to the sum of lifetime earnings. The earnings shocks "e account for 6.6% of the variance in lifetime earnings even though they it account for about 17% of var(earn ) in a given year (Table 4b). The reason for the relatively it small contribution is that the shocks are not very persistent. Similarly, the value in column II indicates that iid hours shocks "h contribute only 2.4% of the variance in lifetime earnings but it account for between 7.7% and 9.3% in annual earnings (Table 4b). One can easily self-insure against these shock categories. In contrast, in column III, the initial condition "! and the iid i1 shocks to ! are together responsible for 12.4% of the variance in lifetime earnings. The earnings it results re(cid:135)ect the fact that these shocks account for 20.6% of the variance in lifetime wages. They contribute little to the variance in hours because the response of hours to wages is small. The most striking result is in Column IV, which shows the collective impact of job speci(cid:133)c hours and wage components, unemployment spells, and job changes. Altogether, mobility and unem- 34

ployment related shocks account for 36.7%, 48.2%, and 46.8% of the variance in lifetime earnings, wages, and hours, respectively. Given the interactions among the job change and employment related factors, we break down their relative contributions by (cid:133)rst turning o⁄the job speci(cid:133)c hours shocks, then turning o⁄both hours and job speci(cid:133)c wage shocks, then turning o⁄hours, wage, and unemployment shocks, and (cid:133)nally turning o⁄hours, wage, and unemployment and the idiosyncratic job change shocks ("JC). The estimates are reported in columns VIII, IX, X, and XI. For earnings, it job speci(cid:133)c wage shocks are more important than hours shocks. Job speci(cid:133)c wage shocks dominate for wages, while job speci(cid:133)c hours shocks dominate for hours.42 Finally, we turn to the three permanent heterogeneity components for whites: (cid:17), (cid:22), and EDUC. Surprisingly, the estimates in column V indicate that the mobility preference component (cid:17) does not play much of a role. The point estimate is actually negative. However, (cid:22) accounts for 11.4% of the variance in lifetime earnings and 46.2% of the variance in work hours but explains none of the variance in wages.43 The positive direct e⁄ect that (cid:22) has on the wage variance is o⁄set by its role in reducing transitions into unemployment and job changes. Education is very important, contributing 31.4% of the variance in lifetime earnings and 34.6% of the variance in lifetime wages but only 4.9% of the variance in lifetime hours. The results for Model A2 (not reported) are basically similar to those for Model A3, except that (cid:22) plays a somewhat larger role in the variance of earnings and wages. The results for Model B.1 are in Tables 5a and 5b. Model B.1 does not include job speci(cid:133)c wage or hours components. Without these features, the interpretation of the results in terms of underlying economic factors is less transparent than those for A3. However, job changes with and without unemployment are associated with reduced persistence and large innovations in ! . This is re(cid:135)ected in the fact that it the initial condition for ! and the "! shocks account for 24.6% of the variance in lifetime earnings it it and 45.59% of the variance in lifetime wages, respectively. The two unobserved heterogeneity components (cid:22) and (cid:17) together account for about 30% of the earnings variance, 12% of the wage variance, and 85% of the hours variance. Education is also very important for both wages and earnings. Education, (cid:22), and (cid:17) are much less important for variance in a given year. Note that one can use the model to examine the contributions of the shocks between, for instance, t and t+5 to the variance in earnings over the same period or subsequent periods, but we exclude such computations. 42A few of the estimated variances contributions are negative. We have veri(cid:133)ed that this re(cid:135)ects nonlinearity in the model. Variance in one shock can reduce the in(cid:135)uence of other shocks. 43Note that because the lifetime variance of log hours is lower than the lifetime variance of log wages, the 46.2% impact of (cid:22) on work hours translates into only an 11.4% impact on the variance in lifetime earnings. 35

6 Results for Other Samples In this section we brie(cid:135)y summarize results using the full SRC sample, the SRC sample of whites with some college or more, and the SRC sample of whites with a high school degree or less and no post secondary vocational education. We estimate wage, hours, and earnings residuals separately for each sample, removing the race dummies from the models for whites. We use the tenure pro(cid:133)le from Altonji and Williams (2005) for all subsamples. In the case of Model A.3 we continue to use (cid:26) = :92 for the full SRC sample and the SRC sample of whites. For the SRC samples of whites ! by educational attainment we use (cid:26) = :90 because the evidence based on (17) pointed to a slightly ! lower value. Columns 1b and 2b of Table 6 report estimates of Model A.3 and Model B.1 for the full SRC sample. Because of the computational burden, we have only computed standard errors for the full SRC sample. For ease of comparison, we report estimates for the combined SRC-SEO sample in Columns 1a and 2a, which are the same as Columns 3a and 3c of Table 2 . Overall, the point estimates for the SRC sample are very similar to those for the SRC-SEO sample. The coe¢ cient on BLACK is smaller in the SRC sample, which may re(cid:135)ect the fact that the SEO sample was drawn from households in low income areas. Individual heterogeneity plays a somewhat more important role in the wage equation. Figure 7a reports the average response of earnings to shocks at t=10 and may be compared to Figure 5a for the SRC-SEOsample. The results are very similar to those for the full sample. Panel A of Table 7 reports variance decompositions of lifetime earnings, wages, and hours. The results are also quite similar to those for the SRC-SEO sample in Table 4a. Columns 1d and 1e in Table 6 report model estimates for SRC subsamples of whites with a high school degree or less and whites with some college or more. (Individuals with a high school degree and some postsecondary vocational education are excluded from both samples.) For comparison, we also report estimates for the full SRC sample of whites in column 1c. The point estimates are quite similar overall. However, a few di⁄erences are worth noting. First, mobility is less sensitive to seniority for the high education sample than for the low education sample. Second, JC is more responsive to outside o⁄ers in the case of the high education sample. Third, unemployment is less common for the high education sample. These facts are re(cid:135)ected in the decompositions of the experience pro(cid:133)le of wages in Appendix Figures B1 and B2, which show little growth in (cid:29) with t for the less educated sample. We also (cid:133)nd that (cid:27) is considerably larger for the high education ! sample(cid:151).100 versus .075. The standard deviation of the iid component of hours is much larger for the less educated sample, which probably re(cid:135)ects greater variation in overtime hours and in unemployment spells between surveys. The variance decompositions in Panel B and Panel C of Table 7 indicate that the persistent productivitycomponent! ismoreimportantforthehigheducationsamplethantheloweducation it 36

sample for wages and earnings. Employment shocks, iid hours shocks, within group heterogeneity in education, and (cid:22) are more important for the low education sample. The job speci(cid:133)c hours component (cid:24) is more important for the high education sample. 7 Conclusion In this paper, we study earnings across individuals and over careers. To this end, we construct a model of earnings dynamics from equations governing employment transitions, job changes without unemployment, wages, and work hours. Since both state dependence and heterogeneity are important and one cannot determine the role of one without accounting for the other, our models incorporate state dependence in employment, job changes, and wages, while also including multiple sources of unobserved heterogeneity as well as job-speci(cid:133)c error components in both wages and hours. These turn out to play an important role in the variance of lifetime earnings. The equations of our model provide a rich statistical description of the earnings process but can also be viewed as (cid:133)rst approximations to the decision rules suggested by structural models of employment transitions, job search, and labor supply. Our simulation based estimation strategy permits us to handle a highly unbalanced sample in the context of a model that mixes discrete and continuous variables and allows for both state dependence and multifactor heterogeneity and for measurement error. Vidangos (2008) shows the potential for using models of the type we develop by studying the implications of a related multi-equation model of family income for precautionary behavior and welfare within the context of a lifecycle consumption model.44 Our results address many important questions concerning wages, hours and earnings over a career. In keeping with many other studies, we (cid:133)nd that education, race, and unobserved permanent heterogeneity all play an important role in employment transitions and job changes and that labor supply of male household heads is inelastic. By accounting for both unobserved individual heterogeneity and job speci(cid:133)c heterogeneity, we are able to show that a substantial portion of the strong negative relationship between job seniority and job mobility found in many previous studies is causal. Job changes are induced by high outside o⁄ers and deterred by the job speci(cid:133)c wage component of the current job. Job o⁄ers are strongly positively related to the job speci(cid:133)c component on the current job, in contrast to the usual assumption in the search literature that o⁄ers are drawn at random. We discuss a number of possible explanations in the text. Overall, wages are highly persistent but do not contain a random walk component. The persis- 44He allows for additional sources of variation in family income such as health and disability shocks. The consumption model is used to quantify the welfare e⁄ects of uncertainty generated by each source of variation and to measurethecontributionofeachsourcetotheaccumulationofprecautionarysavings. Perhapssurprisingly,he(cid:133)nds thatforplausiblevaluesofthecoe¢ cientofrelativeriskaversion,consumerswouldbewillingtogiveuponlyasmall percentage of consumption to eliminate risks. Low et al(cid:146)s (2008) estimates of the value of insurance are larger. 37

tenceresultsfrompermanentheterogeneity, thejobspeci(cid:133)cwagecomponent, andstrongpersistence in the stochastic component that re(cid:135)ects the value of the worker(cid:146)s general skills. We also contribute to the displaced workers literature by providing a full decomposition of earnings losses from unemployment. We (cid:133)nd that short-term earnings losses from unemployment aredominatedbyhoursandthelong-termcostsaredominatedbywages,withlosttenure,movement to a lower paying job, and a drop in the autoregressive skill component all playing a role. We (cid:133)nd general human capital accumulation is the dominant source of wage growth over a career, although job tenure and job mobility both play signi(cid:133)cant roles. Finally, job mobility and unemployment play a key role in the variance of career earnings. They operate primarily by leading to large changes in job speci(cid:133)c components of wages and hours rather than through their direct e⁄ects on wages and hours. For whites in our full sample, job speci(cid:133)c hours and wage components, unemployment shocks, and job shocks together account for 36.7%, 48.2%, and 46.8% of the variance in lifetime earnings, wages, and hours, respectively. Job speci(cid:133)c wage shocks are more important than job speci(cid:133)c hours shocks for earnings. Job speci(cid:133)c wage shocks dominate for wages, with employment shocks also playing a substantial role. For hours, job speci(cid:133)c hours shocks dominate. Education accounts for about 1/3 of the variance in lifetime earnings and wages but makes little di⁄erence for hours. In our full sample, unobserved permanent heterogeneity accounts for about 11% of the variance in earnings and about 46% of the variance of hours but matters little for wages, although this breakdown is somewhat sensitive to the model and sample used. Therearenumberofextensionstothemodelthatwouldbeworthexploring. Thusfar,wesimply removeyeare⁄ectsfromwages, hours, andearnings, butitwouldbenaturaltoaddaggregateshocks to the model. It would also be natural to extend the model to explore changes in the stability of earnings, building on work by Gottschalk and Mo¢ tt (1994, 2008), Haider (2001), Shin and Solon (2008) and others. This would require a very di⁄erent auxiliary model. With matched employer-employee data such as those used by Abowd et al (1999) and Bagger et al (2007), one could distinguish (cid:133)rm speci(cid:133)c risk associated with observed as well as unobserved variables from job match speci(cid:133)c risk. A much more ambitious extension would be to construct a model of the household income of an individual that incorporates marriage, divorce, and death of a spouse. This will be pursued in separate work. Given the large number of issues that the paper already addresses, we do not attempt the formidable task of seeking to identify how much of the stochastic variation in earnings that we analyze is anticipated by agents, how far in advance they anticipate it, or how much is insured. Adding a family income model (with private and public transfers) as in Vidangos (2008) gets partially at the question of insurance. Dealing with expectations is more di¢ cult. One needs 38

either data on expectations or an expanded model that incorporates decisions that depend on and/or reveal the information set of the agent, such as consumption choices. Work by Blundell and Preston (1998), Blundell, Pistaferri, and Preston (2008), Cunha, Heckman, and Navarro (2005), Cunha and Heckman (2006), and Guvenen and Smith (2008) illustrate the latter approach. 39

8 References Abowd, J.M. and Card, D.E. (1987).(cid:147)Intertemporal labor supply and long-term employment contracts(cid:148), American Economic Review, 77(1), 50-68. Abowd, J.M. and Card, D.E. (1989). (cid:147)On the covariance structure of hours and earnings changes(cid:148), Econometrica, 57(2), 411-445. Abowd, J.M., F. Kramarz, D.N. Margolis (1999). (cid:147)High Wage Workers and High Wage Firms(cid:148), Econometrica 67 (2) , 251(cid:150)333. Aiyagari, S.R. (1994). (cid:147)Uninsured idiosyncratic risk and aggregate saving(cid:148), Quarterly Journal of Economics 109, 659-684. Altonji, J. G., A.P. Martins and A. Siow (2002). (cid:147)Dynamic Factor Models of Wages, Hours, and Earnings(cid:148), Research in Economics 56(1), 3-59. Altonji, J. G., and C. H. Paxson (1986). (cid:147)Job Characteristics and Hours of Work(cid:148), in Research in Labor Economics, Vol. 8, Part A, ed. by R. G. Ehrenberg, Greenwich: Westview Press, 1-55. Altonji, J. G. and C. R. Pierret (2001). (cid:147)Employer Learning and Statistical Discrimination(cid:148), Quarterly Journal of Economics, 116, 313-350. Altonji, J. G. and R. A. Shakotko (1987): (cid:147)Do Wages Rise with Job Seniority?(cid:148)Review of Economic Studies, 54, 437-59. Altonji, J. G. and N. Williams (1998). (cid:147)The E⁄ects of Labor Market Experience, Job Seniority, and Mobility on Wage Growth(cid:148), Research in Labor Economics, 17, 233-276. Altonji, J. G. and N. Williams (2005). (cid:147)Do Wages Rise With Job Seniority? A Reassessment(cid:148), Industrial and Labor Relations Review, 58(3), 370-397. Bagger, J., F. Fontaine, F. Postel-Vinay, and J.M. Robin (2007). (cid:147)A Tractable Equilibrium Search Model of Individual Wage Dynamics with Experience Accumulation", unpublished paper. Baker, M. (1997). (cid:147)Growth-rate heterogeneity and the covariance structure of life cycle earnings(cid:148), Journal of Labour Economics, 15(2), 338-375. Baker, Michael and Gary Solon (2003)"Earnings Dynamics and Inequality Among Canadian Men, 1976-1992: Evidence from Longitudinal Income Tax Records", Journal of Labor Economics 21 (2003), 289(cid:150)321. Barlevy, Gadi, (2008) "Identi(cid:133)cation of Search Models Using Record Statistics". Review of Economic Studies, 75(1):29-64. Blundell,R.andI.Preston(1998),(cid:147)Consumptioninequalityandincomeuncertainty(cid:148),Quarterly Journal of Economics 113, 603-640. Blundell, R., L. Pistaferri, and I. Preston (2008). (cid:147)Consumption Inequality and Partial Insurance(cid:148), American Economic Review 98(5), 1887-1921. 40

Blundell, R. and T. MaCurdy (1999). (cid:147)Labor Supply: A Review of Alternative Approaches(cid:148), in Handbook of Labor Economics, Vol. 3A. Blundell, R. and I. Preston (1998). (cid:147)Consumption Inequality and Income Uncertainty(cid:148). Quarterly Journal of Economics 113(2), 603-640. Bound, J., Brown, C., and Mathiowetz, N. (2001). (cid:147)Measurement Error in Survey Data.(cid:148)in Handbook of Econometrics, V. 5, eds. E. E. Leamer and J. J. Heckman, pp 3705-3843. Buchinsky,M. , FougŁre, D., Kramarz, F. and Tchernis, R. (2008). "Inter(cid:133)rm Mobility, Wages, and the Returns to Seniority and Experience in the U.S." (March). IZADiscussion Paper No. 1521. Carrington, W. J. (1993). (cid:147)Wage Losses for Displaced Workers.(cid:148)Journal of Human Resources, 28 (3) (Summer), pp. 435(cid:150)62. Castaæeda, A., D(cid:237)az-GimØnez, J., and R(cid:237)os-Rull V. (2003). (cid:147)Accounting for the U.S. Earnings and Wealth Inequality(cid:148), Journal of Political Economy, 111(4), 818-857. Cunha, F., J. J. Heckman, and S. Navarro (2005). (cid:147)Separating Uncertainty from Heterogeneity in Life Cycle Earnings, The 2004 Hicks Lecture(cid:148). Oxford Economic Papers 57(2), 191(cid:151)261. Cunha, F., J. J. Heckman (2006). (cid:147)Identifying and Estimating the Distributions of Ex Post and Ex Ante Returns to Schooling: A Survey of Recent Developments(cid:148), unpublished paper, University of Chicago. Deaton, A. (1991). (cid:147)Saving and liquidity constraints(cid:148), Econometrica, 59(5), 1221-1248. Farber, H. (1999), (cid:147)Mobility and stability: The dynamics of job change in labor markets", in O. Ashenfelter and D. Card Editors, Handbook of Labor Economics Volume 3, Part 2, 2439-2483. Fitzgerald, Gottschalk, and Mo¢ tt (1998). (cid:147)An Analysis of Sample Attrition in Panel Data: The Michigan Panel Study of Income Dynamics(cid:148), Journal of Human Resources 33(2):251-299. Geweke, J. and Keane, M. (2000). (cid:147)An empirical analysis of earnings dynamics among men in the PSID: 1968-1989(cid:148), Journal of Econometrics, 96, 293-356. Gibbons, R., and L. Katz (1991). (cid:147)Layo⁄s and Lemons(cid:148), Journal of Labor Economics, IX, 351-80. Gottschalk, P. and R. Mo¢ tt (1994), (cid:147)The Growth of Earnings Instability in the U.S. Labor Market.(cid:148), Brookings Papers on Economic Activity, Issue 2, p217-272. Gottschalk, P., and R. Mo¢ tt (2008). (cid:147)Trends in the Transitory Variance of Male Earnings in the U.S.: 1970-2004(cid:148), draft. Gourieroux, C., Monfort, A., and Renault, E. (1993). (cid:147)Indirect Inference(cid:148), Journal of Applied Econometrics 8, S85-S118. Gourinchas, P.O., and Parker, J. (2002). (cid:147)Consumption over the Life Cycle(cid:148), Econometrica 70(1) 47-89. Guvenen, F. (2007). (cid:147)Learning Your Earning: Are Labor Income Shocks Really Very Persis- 41

tent?(cid:148), American Economic Review, 97(3), 687-712. Guvenen, F and A. Smith. (2008). (cid:147)Inferring Labor Income Risk From Economic Choices: An Indirect Inference Approach(cid:148), preliminary draft. Haider, S.J.(2001). (cid:147)EarningsInstabilityandEarningsInequalityofMalesintheUnitedStates: 1967-1991(cid:148), Journal of Labor Economics, 19(4), 799-836. Ham, J. and Reilly, K. (2002). (cid:147)Testing Intertemporal Substitution, Implicit Contract, and Hours Restriction Models of the Labor Market Using Micro Data(cid:148), American Economic Review 92(4), 905-927. Hause, J.C. (1980). (cid:147)The (cid:133)ne structure of earnings and the on-the-job training hypothesis(cid:148), Econometrica, 48(4), 1013-1029. Heaton, J. and Lucas, D.J. (1996). (cid:147)Evaluating the e⁄ects of incomplete markets on risk sharing and asset pricing(cid:148), Journal of Political Economy, 104(3), 443-487. Hubbard, G., Skinner, J., and Zeldes, S. (1994). (cid:147)Expanding the Life-Cycle Model: Precautionary Saving and Public Policy(cid:148), American Economic Review (Papers and Proceedings), 84(2), 174-179. Huggett, M. (1996). (cid:147)Wealth Distribution in Life-Cycle Economies(cid:148), Journal of Monetary Economics, 38(3), 469-494. Imrohoroglu, A. (1989). (cid:147)Cost of Business Cycles with Indivisibilities and Liquidity Constraints(cid:148), Journal of Political Economy, 97(6), 1364-1383. Jacobson, L., LaLonde, R., and Sullivan, D. (1993). (cid:147)Earnings Losses of Displaced Workers(cid:148), American Economic Review, 83(4), 685-709. Kambourov, G. and Manovskii, I. (2009). (cid:147)Occupation Speci(cid:133)city of Human Capital(cid:148), International Economic Review, 50(1), 63-115. Keane, M. and Smith Jr., A . A. (2003). (cid:147)Generalized Indirect Inference for Discrete Choice Models(cid:148), unpublished manuscript, Yale University. Kletzer, L. G. (1998), (cid:147)Job Displacement(cid:148), The Journal of Economic Perspectives, Vol. 12, No. 1 (Winter), pp. 115-136. Krusell, P. and Smith Jr., A. A. (1997). (cid:147)Income and Wealth Heterogeneity, Portfolio Selection, and Equilibrium Asset Returns(cid:148), Macroeconomic Dynamics, 1, 387-422. Krusell, P. and Smith Jr., A . A. (1998). (cid:147)Income and Wealth Heterogeneity in the Macroeconomy(cid:148), Journal of Political Economy, 106(5), 867-896. Krusell, P. and Smith Jr., A . A. (1999). (cid:147)On the welfare e⁄ects of eliminating business cycles(cid:148), Review of Economic Dynamics 2, 254-272. Lillard, L. and Weiss, Y. (1979). (cid:147)Components of variation in panel earnings data: American scientists 1960-1970(cid:148), Econometrica 47(2), 437-454. 42

Lillard, L. and Willis, R. (1978). (cid:147)Dynamic aspects of earning mobility(cid:148), Econometrica 46(5), 985-1012. Low, H., Meghir, C., and Pistaferri, L. (2008). (cid:147)Wage Risk and Employment Risk over the Life Cycle(cid:148), IZA Discussion Paper No. 3700. MaCurdy, T.E.(1982). (cid:147)Theuseoftimeseriesprocessestomodeltheerrorstructureofearnings in a longitudinal data analysis(cid:148), Journal of Econometrics, 18, 83-114. Meghir, C. and Pistaferri, L. (2004). (cid:147)Income variance dynamics and heterogeneity(cid:148), Econometrica, 72(1), 1-32. Nagypal, E. (2007).(cid:147)Learning-by-Doing versus Learning About Match Quality: Can We Tell Them Apart?(cid:148), Review of Economic Studies, 74 (2), 537-566. Neal, D.(1995). (cid:147)Industry-Speci(cid:133)cHumanCapital: EvidencefromDisplacedWorkers,(cid:148)Journal of Labor Economics, 13(4), 653(cid:150)677. Neal, D. (1999). (cid:147)The Complexity of Job Mobility Among Young Men,(cid:148)Journal of Labor Economics, 17(2), 237(cid:150)261. Parent, D. (2000): (cid:147)Industry-Speci(cid:133)c Capital and the Wage Pro(cid:133)le: Evidence from the National Longitudinal Survey of Youth and the Panel Study of Income Dynamics,(cid:148)Journal of Labor Economics, 18(2), 306(cid:150)323. Postel-Vinay, F. and Robin, J.-M. (2002). (cid:147)Equilibrium Wage Dispersion with Worker and Employer Heterogeneity(cid:148), Econometrica 70(6), 2295-2350. Postel-Vinay, F. and Turon, H. (2005). (cid:147)On-the-job Search, Productivity Shocks, and the Individual Earnings Process(cid:148), unpublished manuscript, University of Bristol. Schoenberg, U. (2005). (cid:147)Testing for Asymmetric Employer Learning(cid:148), unpublished paper, University of Rochester. Senesky, S. (2005). (cid:147)Testing the Intertemporal Labor Supply Model: Are Jobs Important?(cid:148), Labour Economics, 12, 749-772. Shin, D. and G. Solon (2008). (cid:147)Trends in Men(cid:146)s Earnings Volatility: What Does the Panel Study of Income Dynamics Show?(cid:148), NBER Working Paper W14075. Smith, A.A., Jr. (1990). (cid:147)Three Essays on the Solution and Estimation of Dynamic Macroeconomic Models(cid:148), Ph.D. thesis (Duke University). Smith, A.A., Jr. (1993). (cid:147)Estimating Nonlinear Time-Series Models using Simulated Vector Autoregressions(cid:148), Journal of Applied Econometrics 8, S63-S84. Stevens, A.H. (1997) "Persistent E⁄ects of Job Displacement: The Importance of Multiple Job Losses" Journal of Labor Economics, 15(1) Part 1, 165-188. Storesletten, K., Telmer, C., and Yaron, A. (2001a). (cid:147)The Welfare Costs of Business Cycles Revisited: Finite Lives and Cyclical Variation in Idiosyncratic Risk(cid:148), European Economic Review, 43

45, 1311-1339. Storesletten, K., Telmer, C., and Yaron, A. (2001b). (cid:147)How Important are Idiosyncratic Shocks? Evidence from Labor Supply(cid:148), American Economic Review (Papers and Proceedings), 91, 413-417. Storesletten, K., Telmer, C., and Yaron, A. (2004a). (cid:147)Consumption and Risk Sharing Over the Life Cycle(cid:148), Journal of Monetary Economics, 51(3), 609-633. Storesletten, K., Telmer, C., and Yaron, A. (2004b). (cid:147)Cyclical Dynamics in Idiosyncratic Labor Market Risk(cid:148), Journal of Political Economy, 112(3), 695-717. Storesletten, K., Telmer, C., and Yaron, A. (2007). (cid:147)Asset Pricing with Idiosyncratic Risk and Overlapping Generations(cid:148), Review of Economic Dynamics, forthcoming. Tartari, M. (2006). (cid:147)Divorce and the Cognitive Achievement of Children(cid:148), unpublished paper, Department of Economics, University of Pennsylvania. Telmer, C. (1993). (cid:147)Asset-Pricing Puzzles and Incomplete Markets(cid:148), Journal of Finance, 48(5), 1803-1832. Topel, R. (1991). (cid:147)Speci(cid:133)c Capital, Mobility, and Wages: Wages Rise with Job Seniority(cid:148), Journal of Political Economy, 99(1), 145-176. Topel, R. and Ward, M. (1992). (cid:147)Job Mobility and the Careers of Young Men(cid:148), Quarterly Journal of Economics, 107(2), 439-479. Vidangos, I. (2008). (cid:147)Fluctuations in Individual Labor Income: A Panel VAR Analysis(cid:148), unpublished manuscript, Federal Reserve Board. Vidangos, I. (2008). (cid:147)Household Welfare, Precautionary Saving, and Social Insurance under Multiple Sources of Risk(cid:148), unpublished manuscript, Federal Reserve Board. Wolpin, K. (1992). (cid:147)The Determinants of Black-White Di⁄erences in Early Employment Careers: Search,Layo⁄s,Quits,andEndogenousWageGrowth(cid:148),Journal of Political Economy,100(3), 535-560. 44

9 Appendix 1: A Moment Condition for (cid:30) and (cid:30) in Model 1 2 B Recall that the autoregressive wage component in equation (14) is: ! = (cid:26) [1+(cid:30) S ]! +(cid:13)! JC +(cid:13)! (1 E )+(cid:13)! (1 E ) it ! 1 it i;t (cid:0) 1 JC it 1 (cid:0) Et (cid:0) it 1 (cid:0) Et (cid:0) 1 (cid:0) it (cid:0) 1 +[1+(cid:30) S ]"! 2 it it Using the equation for the observed wage wage , de(cid:133)ne (cid:3)it w~age [wage (cid:13)w (cid:13)wX [(cid:13)! JC +(cid:13)! (1 E )+(cid:13)! (1 E ))] (cid:3) it (cid:17) (cid:3)it (cid:0) 0 (cid:0) X it (cid:0) JC it 1 (cid:0) Et (cid:0) it 1 (cid:0) Et (cid:0) 1 (cid:0) it (cid:0) 1 = wres [(cid:13)! JC +(cid:13)! (1 E )+(cid:13)! (1 E ))] (cid:3)it (cid:0) JC it 1 (cid:0) Et (cid:0) it 1 (cid:0) Et (cid:0) 1 (cid:0) it (cid:0) 1 = ! ((cid:13)! JC +(cid:13)! (1 E )+(cid:13)! (1 E ))+(cid:14)!(cid:22) +mw: it (cid:0) JC it 1 (cid:0) Et (cid:0) it 1 (cid:0) Et (cid:0) 1 (cid:0) it (cid:0) 1 (cid:22) i it Using the above equations, : w~age (cid:26) (1+(cid:30) S )w~age (cid:3) it (cid:0) ! 1 it (cid:3)it 1 (cid:0) (19) = (cid:14)!(1 (cid:26) (1+(cid:30) S ))(cid:22) +(1+(cid:30) S )"! +mw (cid:26) (1+(cid:30) S )mw : (cid:22) (cid:0) ! 1 it i 2 it it it (cid:0) ! 1 it i;t 1 (cid:0) Let Z =[wres ;JC ;(1 E );(1 E ))] and (cid:9) = [1; (cid:13)! ; (cid:13)! ; (cid:13)! ]. One may it (cid:3)it it (cid:0) it (cid:0) it (cid:0) 1 (cid:0) JC (cid:0) 1 (cid:0) Et (cid:0) 1 (cid:0) Et (cid:0) 1 rewrite the above equation as (20) [(cid:9); (cid:26) (cid:9); (cid:26) (cid:30) (cid:9)][Z ;Z ;S Z ] 0 (cid:0) ! 0 (cid:0) ! 1 0 i0t i0t 1 it i0t 1 0 (cid:0) (cid:0) = (cid:14)!(1 (cid:26) (1+(cid:30) S ))(cid:22) +(1+(cid:30) S )"! +mw (cid:26) (1+(cid:30) S )mw (cid:22) (cid:0) ! 1 it i 2 it it it (cid:0) ! 1 it i;t 1 (cid:0) Denote the left-hand side of (20) by L((cid:26) ;(cid:30) ;(cid:9);Data), and the right-hand side by ! 1 R((cid:14)!;(cid:26) ;(cid:30) ;(cid:30) ;(cid:27) ;(cid:27) ). (cid:22) ! 1 2 ! mw Now, consider the following variances of R((cid:14)!;(cid:26) ;(cid:30) ;(cid:30) ;(cid:27) ;(cid:27) ) conditional on S : (cid:22) ! 1 2 ! mw it VR Var[R(:) S = 1] = [(cid:14)!(1 (cid:26) (1+(cid:30) ))]2var((cid:22) S = 1)+(1+(cid:30) )2(cid:27)2 +(cid:27)2 +(cid:26)2(1+(cid:30) )2(cid:27)2 1 (cid:17) j it (cid:22) (cid:0) ! 1 i j it 2 ! mw ! 1 mw and VR Var[R(:) S = 0] = ((cid:14)!(1 (cid:26) ))2var((cid:22) S = 0)+(cid:27)2 +(cid:27)2 +(cid:26)2(cid:27)2 . 0 (cid:17) j it (cid:22) (cid:0) ! i j it ! mw ! mw Assuming that [var((cid:22) S = 1) var((cid:22) S = 0)] is small, the di⁄erence is: i it i it j (cid:0) j 45

(21) DR((cid:26) ;(cid:30) ;(cid:30) ;(cid:27) ;(cid:27) ) VR VR [2(cid:30) (cid:26) ((cid:26) 1)+(cid:26)2(cid:30)2)]((cid:14)!)2(cid:27)2 ! 1 2 ! mw (cid:17) 1 (cid:0) 0 ’ 1 ! ! (cid:0) ! 1 (cid:22) (cid:22) +[(1+(cid:30) )2 1](cid:27)2 +(cid:26)2[(1+(cid:30) )2 1](cid:27)2 : 2 (cid:0) ! ! 1 (cid:0) mw The corresponding conditional variances of L((cid:26) ;(cid:30) ;(cid:9);Data) are ! 1 VL [(cid:9); (cid:26) (cid:9); (cid:26) (cid:30) (cid:9)]Var([[Z ;Z ;S Z ] S = 1][(cid:9); (cid:26) (cid:9); (cid:26) (cid:30) (cid:9)] 1 (cid:17) 0 (cid:0) ! 0 (cid:0) ! 1 0 i0t i0t 1 it i0t 1 0 j it 0 (cid:0) ! 0 (cid:0) ! 1 0 0 (cid:0) (cid:0) and VL [(cid:9); (cid:26) (cid:9); (cid:26) (cid:30) (cid:9)]Var([[Z ;Z ;S Z ] S = 0][(cid:9); (cid:26) (cid:9); (cid:26) (cid:30) (cid:9)] 0 (cid:17) 0 (cid:0) ! 0 (cid:0) ! 1 0 i0t i0t 1 it i0t 1 0 j it 0 (cid:0) ! 0 (cid:0) ! 1 0 0 (cid:0) (cid:0) Notethatwres isnotobservedwhenE = 0andwres isnotobservedwhenE = 0: Con- (cid:3)it it (cid:3)it 1 it 1 (cid:0) (cid:0) sequently, theelementsofVar([[Z ;Z ;S Z ] S = 1])andVar([[Z ;Z ;S Z ] S = 0]) i0t i0t 1 it i0t 1 0 j it i0t i0t 1 it i0t 1 0 j it (cid:0) (cid:0) (cid:0) (cid:0) that involve E and E are 0. We are assuming that selection on E and E does not a⁄ect it it 1 it it 1 (cid:0) (cid:0) the variance of (cid:22) very much. i Let DL((cid:26) ;(cid:30) ;(cid:9);DATA) VL VL: At each stage of iteration, given estimates (cid:26) ;(cid:30) ;(cid:9) ^ , and ! 1 (cid:17) 1 (cid:0) 0 ! 1 the moments from the PSID data, we compute DL = DL((cid:26) ;(cid:30) ;(cid:9) ^ ;DATA). We set DL equal to ! 1 b b the expression for DR((cid:26) ;(cid:30) ;(cid:30) ;(cid:27) ;(cid:27) ;(cid:27) ;(cid:14)!) in (21), evaluated at (cid:26) ;(cid:30) ;(cid:27) ;(cid:27) , and solve for ! 1 2 ! mw (cid:22) (cid:22) ! 1 ! mw b b b b ^ (cid:30) . 2 b b b b This yields: DL (cid:26)2(cid:27)2 (cid:30) ((cid:30) +2) (cid:26) (cid:30) [(cid:26) (cid:30) 2(1 (cid:26) )]( ^ (cid:14) ! )2(cid:27)^2 (22) (cid:30) ((cid:26) ;(cid:30) ;(cid:9) ^ ;(cid:27) ;(cid:27) ;(cid:27)^ ; ^ (cid:14) ! ) = (cid:0) ! mw 1 1 (cid:0) ! 1 ! 1 (cid:0) (cid:0) ! (cid:22) (cid:22) 2 ! 1 ! mw (cid:22) (cid:22) s (cid:27)2 ! b b b b b b b b b b b b b b b b One can see that (cid:30) is increasing in DL: 2 b b 10 Appendix 2: Decomposing Career Wage Growth into the E⁄ects of General Human Capital, Tenure, and Job Shopping The experience pro(cid:133)le of wages E(wage t) is the sum of the e⁄ect of general human capital accuit j mulation, the accumulation of job tenure and the gains from job shopping. That is, E(wage t) = hc(t)+(cid:13)w E(P(TEN ) t)+E(v t) it j TEN it j ij(t) j where , hc(t); is the value of general human capital, (cid:13)w E(P(TEN ) t) is the expected value of TEN it j thetermsofthetenurepolynomial, andE(v t)istheexpectedvalueofthejobmatchcomponent. ij(t) j 46

We approximate E(wage t) using a cubic polynomial in t and obtain estimates from the regression it j of wage on a cubic in t, education, race, and a set of year dummies. The coe¢ cients of the (cid:3)it experience polynomial are reported in Table 2. Note that in the estimation of Model A.3 but not the other models, we account for the fact that (cid:13)w E(P(TEN ) t) + E(v t) is removed TEN it j ij(t) j from the PSID wage residuals that are used with our I-I estimator by adding a quadratic in t to the wage equation. This polynomial is a quadratic approximation to [(cid:13)w E(P(TEN ) t) + (cid:0) TEN it j E(v t)]:Failure to include t and t2 when estimating the other model parameters is likely to bias ij(t) j the parameters involving the job match coe¢ cient. We simulate data fromthe model to compute the values of E((cid:29) t) and (cid:13)^w E(P(TEN ) t), ij(t) j TEN it j where (cid:13)^w is taken from Altonji and Williams (2005). In (cid:133)gure 1 we graph TEN E(wage t);hc(t);(cid:13)w E(P(TEN ) t) and E((cid:29) t): it j TEN it j ij(t) j As one can see, most of the return to potential experience is due to general skill accumulation or the e⁄ect of age. Job shopping and the accumulation of tenure account for 14.6 percent and 13.5 percent, respectively, of the overall growth of wages over the (cid:133)rst ten years. They account for 12.1 percent and 15.8 percent of growth over the (cid:133)rst 35 years. In thinking about this, one should keep in mind that job losses counter the e⁄ects of selective mobility on growth in E((cid:29) t). The fact ij(t) j that we exclude the (cid:133)rst three years of labor market experience in the I-I estimator and miss job changes probably leads to an understatement of the return to job shopping. 47

Table 1a Descriptive Statistics - PSID Sample Variable Obs. Mean StDev Min Max E 33,933 0.97 0.18 0 1 t E | E = 1 32,868 0.98 0.15 0 1 t t-1 E | E = 0 1,065 0.71 0.45 0 1 t t-1 JC 33,933 0.08 0.28 0 1 t ED 33,933 11.58 7.45 0 42.25 t UD 33,933 0.05 0.31 0 8 t TEN 33,933 9.34 7.81 0 42.25 t wage* 32,889 2.73 0.49 1.25 4.98 t hours* 33,933 7.73 0.29 5.30 8.34 t earn* 33,933 3.53 0.67 -5.19 6.49 t w(a) 32,828 0.03 0.39 -2.00 2.22 t h(a) 33,933 0.04 0.28 -2.51 0.87 t e(a) 33,933 0.06 0.57 -8.91 2.44 t The table presents descriptive statistics for variables used in the structural and auxiliary models. All variables are constructed from the PSID. Lead values are excluded for sample statistics. (a) Variable is the residual from a 1-st stage least-squares regression against race, years of education, a cubic in potential experience, and year indicators. Table 1b Additional Descriptive Statistics - PSID sample Variable Obs. Mean StDev Min Max Potential Experience 33,933 19.34 8.80 4 40 Education (years) 33,933 12.94 2.38 6 17 Black 33,933 0.29 0.45 0 1 Calendar Year 33,933 1987.5 5.25 1978 1996 The table presents descriptive statistics for additional variables describing the PSID sample. Lead values are excluded.

Table 2 Point Estimates - Various Specifications Model A.1 Model A.2 Model A.3 Model B.1 Column 1a 1b 1c 2a 2b 2c 3a 3b 3c 4a 4b 4c Equation / Variable Parameter Point Est. MC Mean S.E. Point Est. MC Mean S.E. Point Est. MC Mean S.E. Point Est. MC Mean S.E. E-E Equation (cons) γEE 0.9360 0.6855 (0.1878) 0.9366 0.6624 (0.2186) 1.0141 0.7833 (0.1853) 1.0309 0.8374 (0.1359) 0 (t i -1)/10 γEE t -0.6208 -0.3675 (0.1453) -0.8330 -0.5520 (0.1639) -0.3707 -0.1664 (0.0976) -0.5654 -0.3432 (0.1225) (t i -1)2/100 γEE t2 0.2242 0.1590 (0.0408) 0.2714 0.1963 (0.0414) 0.1465 0.0927 (0.0231) 0.1908 0.1289 (0.0342) (ED t-1 ) γEE ED 0.0211 0.0186 (0.0217) 0.0298 0.0310 (0.0246) 0.0440 0.0439 (0.0176) 0.0711 0.0740 (0.0086) BLACK γEE -0.2910 -0.2670 (0.0529) -0.3047 -0.2671 (0.0646) -0.3608 -0.3261 (0.0571) -0.3117 -0.2724 (0.0429) BLACK EDUC γEE 0.1096 0.1063 (0.0190) 0.1264 0.1186 (0.0246) 0.0801 0.0758 (0.0159) 0.0694 0.0621 (0.0102) EDUC (wage' t ) γEE w' -0.2169 -0.1877 (0.0941) -0.0582 -0.0217 (0.0763) (μ) δEE 0.4574 0.4275 (0.0678) 0.5833 0.5117 (0.1049) 0.4426 0.3921 (0.0936) 0.3427 0.3076 (0.0333) μ (η) δEE 0.1949 0.1809 (0.0930) 0.1102 0.1140 (0.1062) -0.2370 -0.1868 (0.1070) 0.1005 0.0750 (0.0366) η U-E Equation (cons) γUE -0.0264 -1.0104 (0.8326) -0.6039 -1.4430 (0.7368) 0.0771 -0.9389 (0.8416) -0.1514 -0.9070 (0.7311) 0 (t i -1)/10 γUE t -0.8677 -0.5467 (0.4987) -0.0218 0.0824 (0.4277) -1.0505 -0.5641 (0.4892) -0.5021 -0.2730 (0.3896) (t i -1)2/100 γUE t2 0.2608 0.2115 (0.1350) 0.0699 0.0702 (0.1195) 0.3330 0.2339 (0.1358) 0.1696 0.1322 (0.1040) BLACK γUE -0.5375 -0.4685 (0.1212) -0.6158 -0.5488 (0.1493) -0.4860 -0.4123 (0.1325) -0.4810 -0.4424 (0.1253) BLACK EDUC γUE 0.1798 0.2132 (0.0503) 0.1600 0.1985 (0.0619) 0.1742 0.2009 (0.0537) 0.1510 0.1746 (0.0490) EDUC (μ) δUE 0.2570 0.2548 (0.1696) 0.2118 0.1879 (0.1546) 0.6372 0.5434 (0.1281) 0.2276 0.2053 (0.1591) μ (η) δUE 0.5789 0.4650 (0.1233) 0.6204 0.4967 (0.1140) 0.2218 0.1257 (0.1873) 0.5889 0.4830 (0.0943) η JC Equation (cons) γJC -0.3114 -0.2486 (0.1523) -0.3781 -0.3414 (0.1429) -0.6264 -0.5177 (0.1628) -0.5048 -0.4706 (0.1725) 0 (t i -1)/10 γJC t -0.2132 -0.2615 (0.1116) 0.0018 -0.0775 (0.1261) -0.0983 -0.2112 (0.1062) -0.2125 -0.3026 (0.1579) (t i -1)2/100 γJC t2 -0.0134 0.0049 (0.0271) -0.0637 -0.0368 (0.0316) -0.0455 -0.0113 (0.0247) -0.0137 0.0144 (0.0388) (TEN t-1 ) γJC TEN -0.0786 -0.0705 (0.0149) -0.1138 -0.1065 (0.0159) -0.0673 -0.0544 (0.0156) -0.0767 -0.0612 (0.0166) BLACK γJC 0.0885 0.0924 (0.0390) 0.0538 0.0629 (0.0333) 0.1658 0.1839 (0.0554) 0.1033 0.1165 (0.0407) BLACK EDUC γJC -0.0325 -0.0328 (0.0104) -0.0222 -0.0206 (0.0087) -0.0184 -0.0205 (0.0108) -0.0189 -0.0174 (0.0102) EDUC (υ ) δJC -0.5082 -0.4801 (0.0697) -0.9230 -0.9187 (0.1274) t-1 υ-1 (υ) δJC 0.2101 0.2333 (0.0659) 0.5936 0.6155 (0.1410) t υ (μ) δJC -0.5491 -0.5357 (0.0651) -0.3522 -0.3503 (0.0681) -0.2796 -0.2935 (0.1362) -0.5449 -0.5587 (0.0707) μ (η) δJC 0.0650 0.0827 (0.0890) 0.1446 0.1414 (0.0845) 0.5308 0.5209 (0.0995) 0.1270 0.1409 (0.0692) η The table presents estimates and standard errors for models A.1, A.2, A.3, and B.1. Estimates were obtained by Indirect Inference, unless indicated otherwise. Parametric bootstrap standard errors are in parentheses. Bootstraps are based on 100 replications, except for model A.3 which uses 300 replications. (i) Estimate obtained in first-stage least-squares regression. (ii) Estimate obtained using additional moment conditions. See discussion in Section 4. (iii) Imposed.

Table 2 (cont.) Point Estimates - Various Specifications Model A.1 Model A.2 Model A.3 Model B.1 Column 1a 1b 1c 2a 2b 2c 3a 3b 3c 4a 4b 4c Equation / Variable Parameter Point Est. MC Mean S.E. Point Est. MC Mean S.E. Point Est. MC Mean S.E. Point Est. MC Mean S.E. Wage Equation BLACK γw BLACK (i) -0.2048 (0.0038) -0.2048 (0.0038) -0.2048 (0.0038) -0.2048 (0.0038) EDUC γw EDUC (i) 0.1049 (0.0008) 0.1049 (0.0008) 0.1049 (0.0008) 0.1049 (0.0008) Tenure polynomial no no yes no (ti-1)/10 (*) γw t (i) 0.7514 (0.0211) 0.7514 (0.0211) 0.7514 (0.0211) 0.7514 (0.0211) (ti-1)2/100 γw t2 (i) -0.2430 (0.0118) -0.2430 (0.0118) -0.2430 (0.0118) -0.2430 (0.0118) (ti-1)3/1000 γw t3 (i) 0.0278 (0.0019) 0.0278 (0.0019) 0.0278 (0.0019) 0.0278 (0.0019) (μ) δw μ 0.1264 0.1297 (0.0124) 0.0633 0.0653 (0.0218) 0.0490 0.0477 (0.0278) 0.1505 0.1512 (0.0118) (JCt) γυ 0 0.0420 0.0442 (0.0074) 0.0355 0.0359 (0.0068) (υt-1) ρυ 0.5651 0.5867 (0.0215) 0.5962 0.6142 (0.0227) 0.6252 0.6399 (0.0224) (ευ) συ 0.2656 0.2765 (0.0064) 0.2710 0.2820 (0.0063) 0.2686 0.2775 (0.0070) (ευ 1) συ1 0.1443 0.1528 (0.0315) 0.1833 0.1943 (0.0162) 0.1967 0.2103 (0.0205) (ωt-1) ρω 0.9200 (ii) 0.9200 (ii) 0.9200 (ii) 0.9577 0.9603 (0.0027) (ωt-1) φ1 -0.2379 -0.2339 (0.0115) (1-Et) γω 1-Et -0.2016 -0.2016 (0.0100) -0.2317 -0.2298 (0.0104) -0.1895 -0.1877 (0.0102) -0.1858 -0.1866 (0.0099) (1-Et-1) γω 1-Et-1 0.0978 0.0997 (0.0121) 0.1010 0.1017 (0.0121) 0.1041 0.1052 (0.0132) 0.0626 0.0630 (0.0110) (εω) σω 0.0954 0.0916 (0.0026) 0.0929 0.0891 (0.0025) 0.0950 0.0922 (0.0029) 0.0934 0.0904 (0.0017) (εω) φ2 2.1000 2.1969 (0.0553) (εω 1) (Black, Low Educ) σω1 (ii) 0.2572 0.2477 (0.0177) 0.2557 0.2450 (0.0142) 0.2488 0.2343 (0.0195) 0.2834 0.2827 (0.0063) (εω 1) (Black, High Educ) σω1 (ii) 0.2836 0.2751 (0.0159) 0.2822 0.2727 (0.0127) 0.2760 0.2632 (0.0171) 0.3076 0.3070 (0.0058) (εω 1) (White, Low Educ) σω1 (ii) 0.2563 0.2467 (0.0178) 0.2547 0.2440 (0.0142) 0.2478 0.2333 (0.0196) 0.2826 0.2819 (0.0063) (εω 1) (White, High Educ) σω1 (ii) 0.3133 0.3057 (0.0142) 0.3120 0.3034 (0.0114) 0.3064 0.2950 (0.0152) 0.3351 0.3345 (0.0053) Hours Equation (cons) γh 0 -0.3630 -0.3737 (0.0084) -0.3609 -0.3710 (0.0081) -0.3632 -0.3747 (0.0076) -0.3633 -0.3745 (0.0081) BLACK γh BLACK (i) -0.1055 (0.0043) -0.1055 (0.0043) -0.1055 (0.0043) -0.1055 (0.0043) EDUC γh EDUC (i) 0.0178 (0.0007) 0.0178 (0.0007) 0.0178 (0.0007) 0.0178 (0.0007) (Et) γh E 0.4104 0.4114 (0.0082) 0.4110 0.4122 (0.0070) 0.4129 0.4142 (0.0075) 0.4157 0.4168 (0.0074) σξ 0.1631 0.1802 (0.0143) 0.1611 0.1762 (0.0131) 0.1574 0.1726 (0.0136) (wt) γh w -0.0680 -0.0681 (0.0128) -0.0698 -0.0682 (0.0139) -0.0692 -0.0670 (0.0148) -0.0929 -0.0921 (0.0148) (μ) δh μ 0.0707 0.0714 (0.0170) 0.0894 0.0846 (0.0188) 0.1248 0.1204 (0.0135) 0.0929 0.0935 (0.0188) (η) δh η 0.0991 0.0953 (0.0198) 0.0848 0.0888 (0.0224) 0.0145 0.0200 (0.0304) 0.1545 0.1539 (0.0102) (εh) σh 0.1676 0.1654 (0.0026) 0.1679 0.1659 (0.0024) 0.1686 0.1667 (0.0023) 0.1800 0.1802 (0.0009) Earnings Equation (cons) γe 0 -0.0043 -0.0059 (0.0026) -0.0047 -0.0061 (0.0026) -0.0061 -0.0071 (0.0024) -0.0044 -0.0060 (0.0022) (wt) γe w (iii) 1.0000 1.0000 1.0000 1.0000 (ht) γe h (iii) 1.0000 1.0000 1.0000 1.0000 ρe 0.5510 0.5508 (0.0084) 0.5498 0.5506 (0.0073) 0.5527 0.5528 (0.0068) 0.5481 0.5476 (0.0072) (εe) σe 0.2110 0.2108 (0.0015) 0.2105 0.2103 (0.0015) 0.2109 0.2107 (0.0016) 0.2109 0.2106 (0.0014) The table presents estimates and standard errors for models A.1, A.2, A.3, and B.1. Estimates were obtained by Indirect Inference, unless indicated otherwise. Parametric bootstrap standard errors are in parentheses. Bootstraps are based on 100 replications, except for model A.3 which uses 300 replications. (*) The potential-experience profile estimated in the first-stage regression reflects the effects of general human capital accumulation, job tenure accumulation, and job shopping. Since the effects of job shopping are endogenously accounted for in model A.3, by the inclusion of a job-specific wage component that affects job mobility, model A.3 includes a "correction term" in the wage model, estimated by indirect inference. The correction term is a quadratic in potential experience. The estimated correction term is: -0.0343 - 0.0753*(ti-1)/10 + 0.0072*(ti-1)2/100, with corresponding standard errors (0.0340), (0.0344), and (0.0075). (i) Estimate obtained in first-stage least-squares regression. (ii) Estimate obtained using additional moment conditions. See discussion in Section 4. (iii) Imposed.

Table 3a Regressions Comparing PSID Sample and Data Simulated from Models A.3 and B.1 Employment and Job Change Regressions PSID Model A.3 Model B.1 1a (1) 2a (2) 3a (3) 1b (1) 2b (2) 3b (3) 1c (1) 2c (2) 3c (3) Variable E t E t JC t E t E t JC t E t E t JC t PE t-1 /10 -0.0069 0.1446 -0.0702 -0.0348 -0.2304 -0.0420 -0.0415 -0.1189 -0.0541 (0.0050) (0.0901) (0.0090) (0.0010) (0.0154) (0.0018) (0.0010) (0.0156) (0.0018) PE2 t-1 /100 -0.0003 -0.0319 0.0169 0.0052 0.0632 0.0107 0.0070 0.0296 0.0135 (0.0011) (0.0217) (0.0020) (0.0002) (0.0038) (0.0004) (0.0002) (0.0039) (0.0004) ED t-1 0.0025 0.0031 0.0030 (0.0002) (0.0000) (0.0000) UD t-1 -0.1071 -0.0559 -0.0498 (0.0181) (0.0011) (0.0011) TEN t-1 /10 -0.0803 -0.0912 -0.0922 (0.0026) (0.0004) (0.0004) Constant 0.9638 0.7453 0.2173 0.9691 0.9647 0.2159 0.9759 0.8808 0.2266 (0.0047) (0.0866) (0.0086) (0.0010) (0.0142) (0.0017) (0.0010) (0.0145) (0.0017) Observations 27651 708 27055 816079 34691 793445 816281 34489 793549 R-squared 0.01 0.05 0.05 0.02 0.08 0.07 0.02 0.06 0.07 RMSE 0.14 0.44 0.26 0.16 0.45 0.28 0.16 0.45 0.28 The table presents least-squares regression results comparing PSID data and data simulated from estimated models A.3 and B.1. Regressions on simulated data are based on a simulated sample which is 30 times as large as the PSID sample, but has the same demographic structure (by potential experience) as the PSID sample. Standard errors are in parentheses. (1) Sample restricted to observations where E =1. t-1 (2) Sample restricted to observations where E =0. t-1 (3) Sample restricted to observations where E=1 and E =1. t t-1 Table 3b Regressions Comparing PSID Sample and Data Simulated from Models A.3 and B.1 - Wage Regressions PSID Model A.3 Model B.1 1a 2a 3a 4a 1b 2b 3b 4b 1c 2c 3c 4c Variable w t w t w t w t w t w t w t w t w t w t w t w t w t-1 0.8850 0.6173 0.6142 0.9051 0.7261 0.7256 0.9010 0.7240 0.7244 (0.0029) (0.0063) (0.0063) (0.0005) (0.0012) (0.0012) (0.0005) (0.0012) (0.0012) w t-2 0.3164 0.3172 0.1984 0.1979 0.1964 0.1973 (0.0063) (0.0063) (0.0012) (0.0012) (0.0012) (0.0012) JC t -0.0389 -0.0129 0.0185 (0.0044) (0.0007) (0.0007) PE t-1 /10 (0.2209) (0.1395) (0.1002) (0.0378) (0.0074) (0.0072) PE2 t-1 /100 0.0615 0.0456 0.0179 (0.0188) (0.0037) (0.0036) PE3 t-1 /1000 (0.0074) (0.0061) (0.0028) (0.0028) (0.0006) (0.0005) TEN t-1 /10 0.3944 0.4107 0.2519 (0.0175) (0.0029) (0.0028) TEN2 t-1 /100 (0.1600) (0.2207) (0.0882) (0.0138) (0.0023) (0.0022) TEN3 t-1 /100 0.0249 0.0342 0.0136 (0.0029) (0.0005) (0.0005) Constant 0.0132 0.0147 0.0175 0.0591 0.0058 0.0059 0.0070 -0.0010 0.0043 0.0041 0.0025 0.0200 (0.0011) (0.0011) (0.0012) (0.0223) (0.0002) (0.0002) (0.0002) (0.0044) (0.0002) (0.0002) (0.0002) (0.0043) Observations 27055 22587 22587 32828 793445 652618 652618 976539 793549 652765 652765 976949 R-squared 0.77 0.8 0.8 0.07 0.83 0.84 0.84 0.04 0.82 0.83 0.83 0.05 RMSE 0.18 0.17 0.17 0.37 0.17 0.16 0.16 0.4 0.17 0.16 0.16 0.39 The table presents least-squares regression results comparing PSID data and data simulated from estimated models A.3 and B.1. Regressions on simulated data are based on a simulated sample which is 30 times as large as the PSID sample, but has the same demographic structure (by potential experience) as the PSID sample. Standard errors are in parentheses.

Table 3c Regressions Comparing PSID Sample and Data Simulated from Models A.3 and B.1 - Hours Regressions PSID Model A.3 Model B.1 1a 1b 1c Variable h h h t t t PE /10 -0.0081 0.0084 0.0069 t-1 (0.0086) (0.0018) (0.0018) PE2 /100 0.0006 -0.0021 -0.0019 t-1 (0.0018) (0.0004) (0.0004) h 0.3697 0.3236 0.2758 t-1 (0.0067) (0.0011) (0.0011) h 0.1826 0.2741 0.2748 t-2 (0.0065) (0.0011) (0.0011) w -0.0005 -0.0191 -0.0113 t (0.0036) (0.0007) (0.0007) Constant 0.0368 0.0219 0.0276 (0.0091) (0.0019) (0.0019) Observations 23322 689672 689749 R-squared 0.23 0.28 0.24 RMSE 0.21 0.24 0.23 The table presents least-squares regression results comparing PSID data and data simulated from estimated models A.3 and B.1. Regressions on simulated data are based on a simulated sample which is 30 times as large as the PSID sample, but has the same demographic structure (by potential experience) as the PSID sample. Standard errors are in parentheses. Table 3d Regressions Comparing PSID Sample and Data Simulated from Models A.3 and B.1 - Earnings Regressions PSID Model A.3 Model B.1 1a 2a 1b 2b 1c 2c Variable e e e e e e t t t t t t PE /10 0.0304 -0.0105 -0.0113 t-1 (0.0130) (0.0029) (0.0029) PE2 /100 -0.0078 0.0023 0.0025 t-1 (0.0028) (0.0006) (0.0006) e 0.6873 0.5488 0.5330 t-1 (0.0069) (0.0011) (0.0011) e 0.1859 0.2513 0.2684 t-2 (0.0070) (0.0011) (0.0011) w 0.9232 0.9601 0.9624 t (0.0043) (0.0008) (0.0008) h 0.7701 0.8757 0.8703 t (0.0068) (0.0011) (0.0012) Constant -0.0214 0.0214 0.0137 0.0007 0.0169 0.0039 (0.0137) (0.0017) (0.0030) (0.0003) (0.0030) (0.0003) Observations 23915 32828 717450 976539 717450 976949 R-squared 0.65 0.65 0.57 0.69 0.57 0.69 RMSE 0.32 0.3 0.39 0.31 0.38 0.3 The table presents least-squares regression results comparing PSID data and data simulated from estimated models A.3 and B.1. Regressions on simulated data are based on a simulated sample which is 30 times as large as the PSID sample, but has the same demographic structure (by potential experience) as the PSID sample. Standard errors are in parentheses.

Table 4a Decomposition of Cross-Sectional Variance in Lifetime Earnings, Wage, and Hours - Model A.3. Shocks turned off one at a time (for all t). Column I II III IV V VI VII VIII IX X XI Shock Breakdown of Composite 'Shock' Variable εe εh εw Composite η μ Educ ξ υ E JC Lifetime Earnings 6.56 2.35 12.41 36.65 -0.78 11.42 31.38 7.80 27.58 1.71 -0.44 (SE) (0.18) (0.07) (0.93) (2.42) (2.18) (4.04) (0.91) (1.36) (2.29) (0.28) (0.17) Lifetime Wage 0 0 20.61 48.18 -3.06 -0.29 34.56 0 47.14 1.69 -0.64 (SE) (0.00) (0.00) (1.29) (2.28) (1.32) (2.77) (1.01) (0.00) (2.43) (0.37) (0.28) Lifetime Hours 0 4.49 0.45 46.81 -2.78 46.17 4.86 42.65 0.61 3.73 -0.18 (SE) (0.00) (0.13) (0.17) (7.48) (4.22) (7.73) (0.42) (7.27) (0.29) (0.46) (0.06) Entries in columns I to VII display the contribution of a given type of shock to the variance of lifetime earnings, wage, and hours, and are expressed as a percentage of the lifetime variance in the basecase. In the basecase we simulate of the full estimated model. To compute the contribution of a particular shock, we simulate the model again, setting the variance of a given shock to zero for all t. We then compute the variance of the appropriate variables. The difference relative to the basecase is the contribution of the given shock. Since the model is nonlinear, the contributions don't sum up to 100%. We normalize columns I to VII to sum to 100. Column IV is the combined contribution of the job match wage and hours components, employment and unemployment shocks, and job change shocks. In columns VIII through XI we decompose Column IV. Column VIII shows the marginal contribution of ξ, IX the marginal contribution of υ with var(ξ) set to 0, X the marginal contribution of unemployment spells with Var(ξ) and Var(υ) set to 0, and column XI displays the marginal contribution of job changes with Var(ξ) and Var(υ) set to 0, and no unemployment. Bootstrap standard errors are in parentheses.

Table 4b Decomposition of Cross-Sectional Variance in Earnings, Wage, and Hours at different t - Model A.3. Shocks turned off one at a time (for all t). Column I II III IV V VI VII VIII IX X XI Shock Breakdown of Composite 'Shock' Variable/Horizon εe εh εw Composite η μ Educ ξ υ E JC Earnings t = 1 14.0 9.3 23.4 19.0 0.1 10.0 24.2 8.4 10.5 0.2 0 (0.28) (0.28) (2.44) (2.59) (0.57) (2.64) (0.37) (1.50) (2.32) (0.07) (0.00) t = 5 17.7 7.7 17.9 26.1 0.3 8.5 21.9 7.8 18.8 0.9 -1.5 (0.38) (0.27) (1.54) (2.07) (1.05) (2.59) (0.37) (1.35) (1.95) (0.22) (0.19) t = 10 17.9 8.5 15.8 28.6 0.4 8.0 20.8 7.6 21.9 0.9 -1.8 (0.37) (0.27) (1.15) (1.98) (1.33) (2.89) (0.50) (1.28) (1.89) (0.22) (0.20) t = 20 17.0 8.3 14.7 32.2 -0.1 7.2 20.7 6.5 25.6 1.1 -0.9 (0.45) (0.26) (0.98) (2.03) (1.39) (3.03) (0.56) (1.26) (1.96) (0.19) (0.15) t = 30 16.8 8.7 13.1 32.6 2.5 6.6 19.7 6.9 25.4 0.7 -0.4 (0.42) (0.34) (0.91) (1.99) (1.74) (3.19) (0.59) (1.22) (1.91) (0.14) (0.12) t = 40 17.5 7.9 14.5 34.0 1.5 5.6 19.0 7.0 26.9 0.3 -0.2 (0.46) (0.32) (0.89) (1.84) (2.02) (2.70) (0.54) (1.19) (1.88) (0.08) (0.09) Wage t = 1 0 0 45.1 22.7 0 1.7 30.4 0 22.7 0 0 (0.00) (0.00) (4.50) (4.60) (0.00) (1.77) (0.11) (0.00) (4.60) (0.00) (0.00) t = 5 0 0 35.7 36.5 -0.1 1.0 27.0 0 37.4 0.7 -1.5 (0.00) (0.00) (2.52) (2.77) (0.50) (1.55) (0.47) (0.00) (2.72) (0.24) (0.26) t = 10 0 0 31.4 42.5 -0.8 0.7 26.2 0 43.2 1.0 -1.7 (0.00) (0.00) (1.84) (2.29) (0.80) (1.77) (0.61) (0.00) (2.31) (0.31) (0.31) t = 20 0 0 27.8 48.1 -1.6 0.0 25.8 0 47.3 1.6 -0.8 (0.00) (0.00) (1.55) (2.14) (1.05) (2.09) (0.73) (0.00) (2.22) (0.32) (0.24) t = 30 0 0 26.8 49.7 -0.7 -0.9 25.1 0 48.8 1.2 -0.3 (0.00) (0.00) (1.56) (2.02) (1.08) (1.90) (0.74) (0.00) (2.08) (0.26) (0.19) t = 40 0 0 27.8 49.8 0.2 -1.5 23.8 0 49.4 0.6 -0.1 (0.00) (0.00) (1.58) (1.94) (1.04) (1.84) (0.87) (0.00) (1.96) (0.16) (0.12) Hours t = 1 0 39.5 0.5 35.8 0.3 21.8 2.0 33.9 0.3 1.7 0 (0.00) (1.31) (0.19) (5.27) (1.97) (4.33) (0.07) (5.54) (0.12) (0.64) (0.00) t = 5 0 36.5 0.5 37.4 0.3 23.1 2.3 32.3 0.1 4.9 0.1 (0.00) (1.31) (0.19) (4.93) (1.97) (3.94) (0.38) (5.07) (0.14) (0.83) (0.03) t = 10 0 37.1 0.4 36.9 0.5 22.5 2.6 32.3 0.0 4.5 0.1 (0.00) (1.42) (0.16) (4.88) (2.03) (3.86) (0.35) (4.84) (0.17) (0.52) (0.04) t = 20 0 37.1 0.4 36.7 0.6 22.5 2.7 31.9 0.0 4.8 0.0 (0.00) (1.31) (0.16) (5.10) (1.98) (4.00) (0.38) (5.11) (0.21) (0.49) (0.03) t = 30 0 39.5 0.5 36.0 -0.5 22.4 2.1 34.1 -0.1 2.1 0.0 (0.00) (1.44) (0.18) (5.46) (2.23) (4.24) (0.30) (5.47) (0.20) (0.32) (0.02) t = 40 0 40.3 0.3 35.1 -0.6 23.0 1.8 35.0 -0.1 0.2 0.0 (0.00) (1.50) (0.17) (5.59) (2.37) (4.40) (0.37) (5.55) (0.19) (0.22) (0.02) Entries in columns I to VII display the contribution of a given type of shock to the variance in earnings, wage, and hours for a cross section of simulated individuals with potential experience t. The contribution is expressed as a percentage of the variance in the basecase. In the basecase we simulate the full estimated model. To compute the contribution of a particular shock, we simulate the model again, setting the variance of the given shock to zero for all t. We then compute the variance of the appropriate variables at the specified value of t. The difference relative to the basecase is the contribution of the given shock. Since the model is nonlinear, the contributions don't sum up to 100%. We have normalized columns I to VII to sum to 100. Column IV is the combined contribution of the job match wage and hours components, unemployment shocks, and job change shocks. In columns VIII through XI we decompose Column IV. Column VIII is the marginal contribution of ξ, IX is the marginal contribution of υ with var(ξ) set to 0, X is the marginal contribution of eliminating unemployment spells with Var(ξ) and Var(υ) set to 0, and column XI is the marginal contribution of job changes with Var(ξ) and Var(υ) set to 0, and no unemployment. Bootstrap standard errors are in parentheses.

Table 5a Decomposition of Cross-Sectional Variance in Lifetime Earnings, Wage, and Hours - Model B.1 Shocks turned off one at a time (for all t). Column I II III IV V VI VII VIII Shock Variable εe εh εw J E η μ Educ Lifetime Earnings 6.85 2.66 24.62 2.35 1.88 11.53 18.76 31.35 (SE) (0.13) (0.03) (1.61) (0.56) (0.29) (1.36) (3.13) (0.76) Lifetime Wage 0 0 45.59 3.46 1.83 0.75 11.46 36.90 (SE) (0.00) (0.00) (2.09) (0.88) (0.50) (0.25) (1.89) (0.83) Lifetime Hours 0 4.98 1.87 0.23 3.03 64.61 20.39 4.91 (SE) (0.00) (0.09) (0.52) (0.09) (0.40) (8.28) (7.96) (0.29) Entries in columns I to VIII display the contribution of a given type of shock to the variance of lifetime earnings, wage, and hours, and are expressed as a percentage of the lifetime variance in the basecase. In the basecase we simulate of the full estimated model. To compute the contribution of a particular shock, we simulate the model again, setting the variance of a given shock to zero for all t. We then compute the variance of the appropriate variables. The difference relative to the basecase is the contribution of the given shock. Since the model is nonlinear, the contributions don't sum up to 100%. We normalize columns I to VIII to sum to 100.

Table 5b Decomposition of Cross-Sectional Variance in Earnings, Wage, and Hours at different t - Model B.1 Shocks turned off one at a time (for all t). Column I II III IV V VI VII VIII Shock Variable/Horizon εe εh εw J E η μ Educ Earnings t = 1 14.8 10.9 25.4 0.0 0.2 8.3 17.3 23.1 (0.23) (0.15) (1.40) 0.00 (0.06) (1.07) (2.52) (0.30) t = 5 17.6 9.9 26.3 6.2 1.0 8.1 11.8 19.1 (0.31) (0.20) (1.13) (0.69) (0.22) (0.71) (2.09) (0.30) t = 10 17.1 8.7 26.7 7.6 1.4 7.1 12.0 19.3 (0.27) (0.16) (1.12) (0.81) (0.20) (0.69) (2.01) (0.30) t = 20 17.0 9.8 25.9 6.1 1.2 7.6 11.7 20.6 (0.31) (0.15) (1.19) (0.68) (0.24) (0.72) (2.04) (0.35) t = 30 17.3 9.4 25.3 4.4 1.1 8.2 13.9 20.4 (0.33) (0.15) (1.31) (0.53) (0.21) (0.77) (2.04) (0.36) t = 40 18.1 10.3 25.2 2.3 0.5 7.8 14.0 21.9 (0.36) (0.17) (1.35) (0.45) (0.15) (0.76) (2.01) (0.35) Wage t = 1 0 0 56.0 0 0 0 13.5 30.6 (0.00) (0.00) (1.99) (0.00) (0.00) (0.00) (2.11) (0.12) t = 5 0 0 54.5 11.6 0.8 0.5 8.1 24.5 (0.00) (0.00) (1.48) (1.01) (0.29) (0.27) (1.35) (0.42) t = 10 0 0 53.2 13.6 1.5 0.6 7.5 23.5 (0.00) (0.00) (1.45) (1.04) (0.38) (0.25) (1.14) (0.41) t = 20 0 0 52.5 11.3 2.2 0.6 8.2 25.2 (0.00) (0.00) (1.46) (0.84) (0.41) (0.27) (1.02) (0.48) t = 30 0 0 51.3 7.6 2.6 1.7 11.0 25.8 (0.00) (0.00) (1.51) (0.70) (0.42) (0.35) (1.17) (0.54) t = 40 0 0 53.9 4.0 1.6 0.6 11.8 28.2 (0.00) (0.00) (1.67) (0.77) (0.35) (0.23) (1.40) (0.65) Hours t = 1 0 47.8 1.5 0 2.0 35.6 11.1 2.1 (0.00) (0.92) (0.44) (0.00) (0.69) (4.68) (4.42) (0.08) t = 5 0 44.7 1.6 0.0 5.1 34.5 11.3 2.6 (0.00) (1.05) (0.48) (0.08) (0.90) (4.39) (4.24) (0.35) t = 10 0 44.3 1.8 0.0 5.1 34.4 11.6 2.8 (0.00) (0.81) (0.53) (0.11) (0.45) (4.51) (4.26) (0.31) t = 20 0 44.1 1.7 0.0 5.1 34.1 12.0 3.0 (0.00) (0.86) (0.53) (0.12) (0.53) (4.63) (4.39) (0.28) t = 30 0 47.2 1.7 0.0 2.1 35.9 10.7 2.5 (0.00) (0.91) (0.50) (0.09) (0.40) (4.57) (4.27) (0.14) t = 40 0 49.0 1.8 0.1 0.0 36.0 10.7 2.4 (0.00) (0.89) (0.52) (0.09) (0.29) (4.65) (4.36) (0.11) Entries in columns I to VIII display the contribution of a given type of shock to the variance in earnings, wage, and hours for a cross section of simulated individuals with potential experience t. The contribution is expressed as a percentage of the variance in the basecase. In the basecase we simulate the full estimated model. To compute the contribution of a particular shock, we simulate the model again, setting the variance of the given shock to zero for all t. We then compute the variance of the appropriate variables at the specified value of t. The difference relative to the basecase is the contribution of the given shock. Since the model is nonlinear, the contributions don't sum up to 100%. We have normalized columns I to VIII to sum to 100.

Table 6 Point Estimates - Models A.3 and B.1 on SRC sample and subsamples Model A.3 Model B.1 Column 1a 1b 1c 1d 1e 2a 2b Equation / Variable Parameter SRC+SEO All SRC White Low Educ High Educ SRC+SEO All SRC E-E Equation (cons) γEE 0 1.0141 (0.1853) 0.8305 (0.1683) 0.6175 0.5553 0.7246 1.0309 0.8617 (ti-1)/10 γEE t -0.3707 (0.0976) -0.5002 (0.1393) -0.5523 -0.4733 -0.8043 -0.5654 -0.5832 (ti-1)2/100 γEE t2 0.1465 (0.0231) 0.1953 (0.0385) 0.2080 0.1633 0.2691 0.1908 0.1893 (EDt-1) γEE ED 0.0440 (0.0176) 0.0746 (0.0186) 0.0893 0.0758 0.0966 0.0711 0.0795 BLACK γEE BLACK -0.3608 (0.0571) -0.1691 (0.0914) -0.3117 -0.3176 EDUC γEE EDUC 0.0801 (0.0159) 0.0742 (0.0155) 0.0856 0.1065 0.0809 0.0694 0.0777 (wage't) γEE w' -0.0582 (0.0763) -0.0796 (0.1070) -0.1007 -0.2954 -0.2639 (μ) δEE μ 0.4426 (0.0936) 0.3816 (0.0820) 0.3582 0.4245 0.3805 0.3427 0.3537 (η) δEE η -0.2370 (0.1070) -0.2110 (0.0719) -0.1944 -0.2066 -0.1539 0.1005 0.0693 U-E Equation (cons) γUE 0 0.0771 (0.8416) 0.9653 (0.5116) 1.8163 2.8077 0.5927 -0.1514 0.7095 (ti-1)/10 γUE t -1.0505 (0.4892) -1.3641 (0.5706) -0.7142 -1.7540 -0.5310 -0.5021 -0.9452 (ti-1)2/100 γUE t2 0.3330 (0.1358) 0.3695 (0.1685) 0.2603 0.6474 0.1059 0.1696 0.2968 BLACK γUE BLACK -0.4860 (0.1325) -0.1473 (0.2112) -0.4810 -0.1374 EDUC γUE EDUC 0.1742 (0.0537) 0.0946 (0.0336) -0.0239 -0.0809 0.1000 0.1510 0.0805 (μ) δUE μ 0.6372 (0.1281) 0.2948 (0.1530) 0.3484 0.3444 0.4068 0.2276 0.1685 (η) δUE η 0.2218 (0.1873) 0.1227 (0.1537) 0.2701 0.0448 -0.0836 0.5889 0.3212 JC Equation (cons) γJC 0 -0.6264 (0.1628) -0.3423 (0.1876) -0.6065 -0.2481 -1.8496 -0.5048 -0.3078 (ti-1)/10 γJC t -0.0983 (0.1062) -0.1509 (0.1432) 0.2697 -0.1849 0.7347 -0.2125 -0.1783 (ti-1)2/100 γJC t2 -0.0455 (0.0247) -0.0445 (0.0332) -0.1415 -0.0028 -0.2748 -0.0137 -0.0178 (TENt-1) γJC TEN -0.0673 (0.0156) -0.0528 (0.0194) -0.0863 -0.1237 -0.0570 -0.0767 -0.0605 BLACK γJC BLACK 0.1658 (0.0554) -0.0665 (0.1307) 0.1033 -0.0796 EDUC γJC EDUC -0.0184 (0.0108) -0.0368 (0.0133) -0.0262 -0.0201 0.0204 -0.0189 -0.0383 (υt-1) δJC υ-1 -0.9230 (0.1274) -0.8088 (0.1402) -0.5495 -0.3932 -0.4633 (υt) δJC υ 0.5936 (0.1410) 0.7846 (0.1545) 0.5035 0.1717 0.8330 (μ) δJC μ -0.2796 (0.1362) -0.3175 (0.1012) -0.1943 -0.2007 -0.2534 -0.5449 -0.5133 (η) δJC η 0.5308 (0.0995) 0.5071 (0.1012) 0.3593 0.3599 0.3607 0.1270 0.1834 The table presents estimates for models A.3 and B.1 restricting the PSID to the SRC sample and subsamples. Estimates were obtained by Indirect Inference, unless indicated otherwise. Parametric bootstrap standard errors are presented in parentheses for the SRC+SEO and the SRC samples. Bootstraps are based on 300 replications for the SRC+SEO sample and on 100 replications for the SRC sample (we limit the latter to 100 because of the computational cost of the calculations). (i) Estimate obtained in first-stage least-squares regression. (ii) Estimate obtained using additional moment conditions. See discussion in Section 4. (iii) Imposed.

Table 6 (cont.) Point Estimates - Models A.3 and B.1 on SRC sample and subsamples Model A.3 Model B.1 Column 1a 1b 1c 1d 1e 2a 2b Equation / Variable Parameter SRC+SEO All SRC White Low Educ High Educ SRC+SEO All SRC Wage Equation BLACK γw BLACK (i) -0.2048 (0.0038) -0.2350 (0.0085) -0.2048 -0.2350 EDUC γw EDUC (i) 0.1049 (0.0008) 0.1083 (0.0011) 0.1069 0.0948 0.1271 0.1049 0.1083 Tenure polynomial yes yes yes yes yes yes yes no no (ti-1)/10 γw t (i) 0.7514 (0.0211) 0.8028 (0.0270) 0.8182 0.8038 0.8027 0.7514 0.8028 (ti-1)2/100 γw t2 (i) -0.2430 (0.0118) -0.2644 (0.0151) -0.2714 -0.2768 -0.2478 -0.2430 -0.2644 (ti-1)3/1000 γw t3 (i) 0.0278 (0.0019) 0.0305 (0.0025) 0.0312 0.0334 0.0248 0.0278 0.0305 cons. (*) a0 -0.0343 (0.0340) -0.0542 (0.0447) -0.0514 -0.1077 0.0486 (ti-1)/10 (*) a1 -0.0753 (0.0344) -0.0816 (0.0462) -0.0505 0.0671 -0.1910 (ti-1)2/100 (*) a2 0.0072 (0.0075) 0.0092 (0.0103) 0.0052 -0.0194 0.0334 (μ) δw μ 0.0490 (0.0278) 0.1015 (0.0269) 0.0796 0.1006 0.1827 0.1505 0.1420 (JCt) γυ 0 0.0355 0.0327 (υt-1) ρυ 0.6252 (0.0224) 0.6041 (0.0366) 0.6136 0.5902 0.6513 (ευ) συ 0.2686 (0.0070) 0.2739 (0.0089) 0.2776 0.2690 0.2942 (ευ 1) συ1 0.1967 (0.0205) 0.1048 (0.0206) 0.1318 0.1696 0.0947 (ωt-1) ρω 0.9200 (ii) 0.9200 (ii) 0.9200 (ii) 0.9000 (ii) 0.9000 (ii) 0.9577 0.9567 (ωt-1) φ1 -0.2379 -0.2015 (1-Et) γω 1-Et -0.1895 (0.0102) -0.1485 (0.0122) -0.1370 -0.1866 -0.1740 -0.1858 -0.1561 (1-Et-1) γω 1-Et-1 0.1041 (0.0132) 0.0744 (0.0153) 0.0372 0.0737 0.0548 0.0626 0.0246 (εω) σω 0.0950 (0.0029) 0.0937 (0.0032) 0.0937 0.0753 0.1004 0.0934 0.0929 (εω) φ2 2.1000 2.1303 (εω 1) (Black, Low Educ) σω1 (ii) 0.2488 (0.0195) 0.2858 (0.0126) 0.2816 0.2532 0.2462 0.2834 0.2878 (εω 1) (Black, High Educ) σω1 (ii) 0.2760 (0.0171) 0.3098 (0.0116) 0.3059 0.2800 0.2737 0.3076 0.3116 (εω 1) (White, Low Educ) σω1 (ii) 0.2478 (0.0196) 0.2850 (0.0127) 0.2807 0.2522 0.2452 0.2826 0.2869 (εω 1) (White, High Educ) σω1 (ii) 0.3064 (0.0152) 0.3371 (0.0106) 0.3335 0.3100 0.3043 0.3351 0.3388 Hours Equation (cons) γh 0 -0.3632 (0.0076) -0.4116 (0.0095) -0.4112 -0.3902 -0.4627 -0.3633 -0.4022 BLACK γh BLACK (i) -0.1055 (0.0043) -0.0636 (0.0096) -0.1055 -0.0636 EDUC γh EDUC (i) 0.0178 (0.0007) 0.0136 (0.0009) 0.0139 0.0226 0.0197 0.0178 0.0136 (Et) γh E 0.4129 (0.0075) 0.4384 (0.0096) 0.4417 0.4305 0.4698 0.4157 0.4362 σξ 0.1574 (0.0136) 0.1632 (0.0140) 0.1628 0.1426 0.1873 (wt) γh w -0.0692 (0.0148) -0.1024 (0.0151) -0.0943 -0.1672 -0.1224 -0.0929 -0.1032 (μ) δh μ 0.1248 (0.0135) 0.0947 (0.0152) 0.0892 0.1195 0.0873 0.0929 0.0812 (η) δh η 0.0145 (0.0304) 0.0290 (0.0220) 0.0306 0.0251 0.0238 0.1545 0.1403 (εh) σh 0.1686 (0.0023) 0.1402 (0.0027) 0.1360 0.1664 0.0926 0.1800 0.1545 Earnings Equation (cons) γe 0 -0.0061 (0.0024) -0.0005 (0.0032) -0.0053 0.0038 -0.0083 -0.0044 -0.0005 (wt) γe w (iii) 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 (ht) γe h (iii) 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 ρe 0.5527 (0.0068) 0.6178 (0.0095) 0.6251 0.5793 0.6849 0.5481 0.6132 (εe) σe 0.2109 (0.0016) 0.1715 (0.0018) 0.1662 0.1785 0.1419 0.2109 0.1720 Number of individuals 4,632 2,651 2,455 1,143 1,027 4,632 2,651 Number of observations 33,933 20,502 19,131 8,446 8,305 33,933 20,502 The table presents estimates for models A.3 and B.1 restricting the PSID to the SRC sample and subsamples. Estimates were obtained by Indirect Inference, unless indicated otherwise. Parametric bootstrap standard errors are presented in parentheses for the SRC+SEO and the SRC samples. Bootstraps are based on 300 replications for the SRC+SEO sample and on 100 replications for the SRC sample (we limit the latter to 100 because of the computational cost of the calculations). (*) The potential-experience profile estimated in the first-stage regression reflects the effects of general human capital accumulation, job tenure accumulation, and job shopping. Since the effects of job shopping are endogenously accounted for in model A.3, by the inclusion of a job-specific wage component that affects job mobility, model A.3 includes a quadratic in potential experience as an adjustment in the wage model, which is estimated by indirect inference. (i) Estimate obtained in first-stage least-squares regression. (ii) Estimate obtained using additional moment conditions. See discussion in Section 4. (iii) Imposed.

Table 7 Decomposition of Cross-Sectional Variance in Lifetime Earnings, Wage, and Hours - Model A.3 SRC Sample and SRC Whites Sample by Education Column I II III IV V VI VII VIII IX X XI Shock Breakdown of Composite 'Shock' Variable εe εh εw Composite η μ Educ ξ υ E JC Panel A: SRC Lifetime Earnings 5.3 1.6 11.7 35.4 -0.3 12.8 33.4 7.4 26.9 1.5 -0.4 (SE) (0.2) (0.1) (1.0) (2.7) (1.9) (4.1) (1.1) (1.4) (3.0) (0.3) (0.2) Lifetime Wage 0 0 20.3 45.0 -2.8 3.5 34.0 0 43.5 1.6 -0.2 (SE) (0.0) (0.0) (1.5) (3.6) (1.6) (4.4) (1.4) (0.0) (3.7) (0.3) (0.4) Lifetime Hours 0 3.5 1.2 57.8 0.0 31.1 6.4 53.2 1.2 3.5 -0.1 (SE) (0.0) (0.2) (0.3) (9.5) (3.9) (8.4) (0.6) (9.3) (0.5) (0.5) (0.1) Panel B: SRC Whites, Low Education Lifetime Earnings 5.9 2.5 5.1 41.3 -1.6 33.1 13.7 9.7 26.9 4.8 0.0 Lifetime Wage 0 0 12.2 63.7 -4.6 13.7 14.9 0 57.5 5.7 0.5 Lifetime Hours 0 5.0 1.4 47.5 2.2 41.1 2.8 40.5 4.3 2.9 -0.2 Panel C: SRC Whites, High Education Lifetime Earnings 5.7 0.8 10.1 44.4 2.0 29.3 7.6 13.4 29.3 1.6 0.1 Lifetime Wage 0 0 18.3 55.8 -0.1 18.3 7.7 0 53.4 1.9 0.4 Lifetime Hours 0 1.7 0.8 77.2 -0.7 20.2 0.8 73.5 2.4 1.3 -0.1 Entries in columns I to VII display the contribution of a given type of shock to the variance of lifetime earnings, wage, and hours, and are expressed as a percentage of the lifetime variance in the basecase. In the basecase we simulate of the full estimated model. To compute the contribution of a particular shock, we simulate the model again, setting the variance of a given shock to zero for all t. We then compute the variance of the appropriate variables. The difference relative to the basecase is the contribution of the given shock. Since the model is nonlinear, the contributions don't sum up to 100%. We normalize columns I to VII to sum to 100. Column IV is the combined contribution of the job match wage and hours components, employment and unemployment shocks, and job change shocks. In columns VIII through XI we decompose Column IV. Column VIII shows the marginal contribution of ξ, IX the marginal contribution of υ with var(ξ) set to 0, X the marginal contribution of unemployment spells with Var(ξ) and Var(υ) set to 0, and column XI displays the marginal contribution of job changes with Var(ξ) and Var(υ) set to 0, and no unemployment. The table presents bootstrap standard errors for the full SRC sample (in parentheses).

Table A1 Composition of PSID Sample before Sample Selection Based on Employment Status. Emp. Status Percentage Working 87.98 Temp. Laidoff 1.48 Unemployed 5.9 Retired 0.87 Disabled 1.85 Housewife 0.19 Student 1.17 Other 0.56 The table presents the composition of the PSID sample, in terms of employment status, before we impose any sample restrictions based on employment status. The sample here meets all selection criteria which are not based on employment status. Table A2 Percentage of Observations Excluded Based on Employment Status. PE Percentage PE Percentage PE Percentage PE Percentage 1 16.6 (a) 11 2.5 21 3.2 31 6.6 2 9.5 12 2.8 22 3.4 32 7.2 3 6.3 13 2.9 23 4.5 33 8.2 4 4.8 14 3.5 24 5.1 34 9.0 5 4.4 15 2.5 25 5.4 35 9.4 6 3.4 16 2.7 26 5.4 36 11.7 7 2.8 17 3.2 27 5.6 37 13.4 8 2.8 18 2.9 28 5.1 38 14.5 9 2.5 19 3.1 29 5.6 39 18.4 10 2.3 20 3.6 30 6.3 40 21.97 (b) The table presents the percentage of observations excluded, based on employment status at the survey date, for each value of potential experience (PE). (a) All students at PE=1. (b) Of these, 13.5 are retired, 7.5 disabled. Table A3 Distribution of Number of Observations Contributed Per Individual in PSID Sample. Percentile Min 5% 25% 50% 75% 95% Max Number of observations 1 1 3 6 11 18 19 per individual The table presents the cross-sectional distribution, across individuals, of the number of observations contributed to the sample by individual. Lead values are excluded.

Cite this document
APA
Joseph Altonji, Anthony Smith, & and Ivan Vidangos (2009). Modeling Earnings Dynamics (FEDS 2009-08). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2009-08
BibTeX
@techreport{wtfs_feds_2009_08,
  author = {Joseph Altonji and Anthony Smith and and Ivan Vidangos},
  title = {Modeling Earnings Dynamics},
  type = {Finance and Economics Discussion Series},
  number = {2009-08},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2009},
  url = {https://whenthefedspeaks.com/doc/feds_2009-08},
  abstract = {In this paper we use indirect inference to estimate a joint model of earnings, employment, job changes, wage rates, and work hours over a career. Our model incorporates duration dependence in several variables, multiple sources of unobserved heterogeneity, job-specific error components in both wages and hours, and measurement error. We use the model to address a number of important questions in labor economics, including the source of the experience profile of wages, the response of job changes to outside wage offers, and the effects of seniority on job changes. We provide estimates of the dynamic response of wage rates, hours, and earnings to various shocks and measure the relative contributions of the shocks to the variance of earnings in a given year and over a lifetime. We find that human capital accounts for most of the growth of earnings over a career although job seniority and job mobility also play significant roles. Unemployment shocks have a large impact on earnings in the short run as well a substantial long long-term effect that operates through the wage rate. Shocks associated with job changes and unemployment make a large contribution to the variance of career earnings and operate mostly through the job-specific error components in wages and hours.},
}