feds · May 13, 2019

Measuring Labor-Force Participation and the Incidence and Duration of Unemployment

Abstract

The underlying data from which the U.S. unemployment rate, labor-force participation rate, and duration of unemployment are calculated contain numerous internal contradictions. This paper catalogs these inconsistencies and proposes a reconciliation. We find that the usual statistics understate the unemployment rate and the labor-force participation rate by about two percentage points on average and that the bias in the latter has increased since the Great Recession. The BLS estimate of the average duration of unemployment overstates by 50 percent the true duration of uninterrupted spells of unemployment and misrepresents what happened to average durations during the Great Recession and its recovery.

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. Measuring Labor-Force Participation and the Incidence and Duration of Unemployment Hie Joo Ahn and James D. Hamilton 2019-035 Please cite this paper as: Ahn, Hie Joo, and James D. Hamilton (2019). “Measuring Labor-Force Participation and the Incidence and Duration of Unemployment,” Finance and Economics Discussion Series 2019-035. Washington: Board of Governors of the Federal Reserve System, https://doi.org/10.17016/FEDS.2019.035. NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

Hie Joo Ahn∗† (Federal Reserve Board)
James D. Hamilton‡ (University of California, San Diego)
March 16, 2019. Revised: May 6, 2019

Keywords: unemployment rate, labor-force participation rate, unemployment duration, measurement error

∗ The views in this paper are solely the responsibility of the authors and should not be interpreted as reflecting the views of the Board of Governors of the Federal Reserve System or of any other person associated with the Federal Reserve System. We thank Alessandro Barbarino, Travis Berge, Michael Elsby, Andrew Figura, Glenn Follette, Norm Morin, John Stevens and Robert Valletta for comments on earlier drafts of this paper and Jesse Wedewer for excellent research assistance. Data and software to reproduce results in this paper are available at http://econweb.ucsd.edu/~jhamilton/AH2_code.zip.
† E-mail: hiejoo.ahn@frb.gov
‡ E-mail: jhamilton@ucsd.edu

1 Introduction.

The Current Population Survey (CPS) is the primary source of information about the labor-force participation rate, unemployment rate, and duration of unemployment for the United States. There are multiple internal inconsistencies in the data from which the fundamental statistics are calculated: if one reported number is correct, another must be wrong. In this paper we catalog these inconsistencies and propose a unified reconciliation of all the problems.

One source of inconsistency is rotation-group bias. In any given month, some households are being visited for the first time (rotation 1), others are being interviewed for the second time (rotation 2), with 8 different rotations contributing to the statistics reported for that month. One would think that in a random sample, the numbers calculated from different rotations for a given month should all be the same. But as documented by Bailar (1975), Solon (1986), Halpern-Manners and Warren (2012) and Krueger, Mas, and Niu (2017), the reported unemployment rate differs significantly across rotations. In our sample (July 2001 to April 2018), the average unemployment rate among those being interviewed for the first time is 6.8%, whereas the average unemployment rate for the eighth rotation is 5.9%. Even more dramatic is the rotation-group bias in the labor-force participation rate. This averages 66.0% for rotation 1 and 64.3% for rotation 8 in our sample. Rotation-group bias affects any inference one draws from the CPS data. Rotation-group bias means that if one follows a fixed group of individuals over time, on average outflows from unemployment seem to exceed inflows. We reconcile this by modeling statistically the way in which people’s answers change the more times they have been interviewed. We interpret households in different rotations as being surveyed using a different interview technology.
We calculate the answer to the following counterfactual question: if a group of households in rotation j in month t were being interviewed for the first time instead of the jth time, how would their answers have been different? We find that the tendency of individuals who would have been counted as U in rotation group 1 to be counted instead as N in later rotation groups has increased over our sample.

A second source of inconsistency documented by Abowd and Zellner (1986) is that missing observations are not random. Meyer, Mok and Sullivan (2015) noted that households in the CPS have become increasingly less likely to answer surveys or to provide all answers. The standard approach is to calculate statistics for a given month based only on individuals for whom there is an observation that month. But if missing observations are not randomly drawn from the overall population, this may be an increasing source of bias in CPS estimates.

Our solution is to add a fourth category of labor-force status. We regard an individual in any month as either employed (E), unemployed (U), not in the labor force (N), or missing (M). On this basis we construct a data set in which all identities relating stocks and flows are respected; for example, the sum of EE, NE, ME, and UE transitions between t−1 and t exactly equals the total number of E at t. Combining this with our description of rotation-group bias allows us to produce the first fully reconciled description of stocks and flows in the CPS data. Moreover, by looking at how ME, MN, and MU transitions differ from the rest of the population, we are able to adjust the treatment of missing observations based on what we know about those individuals when data are collected from them. We find that missing individuals are more likely than the general population to be unemployed. In addition, the biases introduced by missing observations have increased over time and are bigger when the labor market is slack. Our paper is the first to document the cyclical features in the bias coming from nonrandom missing observations.

Our adjustment for missing observations is similar in spirit to that in Abowd and Zellner (1986), though there are a number of important differences. For any month t they make one adjustment looking backward in time based on MX transitions and a second adjustment looking forward in time based on XM transitions, giving them potentially two different estimates for each month t. By contrast, we take a unified approach to the full data set. Abowd and Zellner’s adjustments do not deal with the problem of rotation-group bias or the other measurement issues for which we also develop solutions.
And they only calculate average unemployment rates over what is now a historically old sample. By contrast, we adjust estimates month-by-month up to the present.

A third problem in the CPS is inconsistency between the unemployment duration that an individual reports at t and the labor-force status recorded for that individual at t−1. For example, consider those individuals who were counted as N when in rotation 1 in month t−1 and U when surveyed in rotation 2 in month t. In the second survey, the individual would be asked how long he or she has been looking for work. Two-thirds of these individuals say that they have been looking for longer than 4 weeks, and 16% say it has been one or two years. Our estimates below conclude that people overestimate their duration of unemployment. But no plausible degree of overestimation could turn a 4-week unemployment spell into 2 years. We conclude that some of those classified as N should instead be counted as U.

A related anomaly is the inconsistency between unemployment hazard rates and the reported duration of unemployment. In 2011, when the mean duration of unemployment reached a record-high 40 weeks, the average monthly unemployment-exit probability measured by matching individuals observed for two consecutive months (also known as labor-force flows data) was about 0.3 and that of those unemployed longer than 6 months was about 0.25. Such outflow probabilities would be consistent with a mean duration of unemployment of about 4 months (1/0.25) on average in 2011, which is less than half of what is claimed in the official estimates.

Our resolution of these inconsistencies is to adopt a broader concept of U than that used by BLS. We propose to reclassify those who transition from N at t−1 to U at t with reported job search at t of longer than 4 weeks as having been U at t−1. In addition to helping reconcile the inconsistencies noted above, this is also supported by the observation that re-employment probabilities for those who make NU transitions with reported search durations exceeding 4 weeks are similar to those with UU continuations. We also find that someone with a UUU history is similar to someone with a UNU history whose reported search durations when U are consistent with those for the UUU individual. This also leads us to interpret some UN transitions as UU continuations. This adjustment goes a long way to reconciling the inconsistencies between reported unemployment durations and UU continuation probabilities. Our adjusted UU continuation probabilities also lead to an alternative estimate of average unemployment durations.

A final source of inconsistency arises from people’s preference for reporting certain numbers over others.
On average there are more people who say they have been looking for work for 6 months than say they have been looking for 23 weeks, though the fraction of those unemployed for 23 weeks should be greater than that of those unemployed for 6 months. In addition, people are more likely to report an even number of weeks than an odd number for shorter spells. Our resolution of this problem is to postulate a flexible latent distribution of perceived durations that is then reported by individuals with a certain structure of number-reporting preference; for related approaches see Baker (1992), Torelli and Trivellato (1993), and Ryu and Slottje (2000). Our approach is completely new compared to these studies in that our parameterization allows direct linkage of data on stocks, flows, and durations and in that both digit and interval preference are jointly considered. Our framework describes the reported values extremely accurately, and the adjustment on net revises down the mean duration of unemployment.

The importance of these concerns is illustrated in Panel A of Figure 1. This asks a very fundamental question: if someone is unemployed at t−1, what is the probability that person will still be unemployed at t? Researchers have used the CPS data to answer this question in two different ways. A measure based on reported unemployment durations calculates the ratio of individuals who are unemployed at t with a reported duration greater than 4 weeks to the total number of individuals unemployed at t−1. Variants of this calculation have been used by van den Berg and van der Klaauw (2001), Elsby, Michaels and Solon (2009) and Shimer (2012). This measure is plotted as the solid black line in Panel A. An alternative measure based on labor-force flows looks at the subset of individuals who are U at t−1 and either E, N, or U at t and calculates the number of UU continuations as a fraction of the sum. Variants of this approach were used by Fujita and Ramey (2009) and Elsby, Hobijn and Şahin (2010). The flow-based measure is plotted as the dashed green line. If all magnitudes were measured accurately the two estimates should give a similar answer. But in practice they are wildly different. The duration-based measure averages 70.7% over our sample, while the flow-based measure averages 53.7%.

These differences are caused by the multiple inconsistencies mentioned above. The flow-based measure underestimates the true continuation probability because (1) some UN transitions are a result of rotation-group bias and (2) some UN and UM transitions should be interpreted as UU continuations.
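
As a rough illustration of the two calculations just described, the sketch below computes both continuation measures from a handful of weighted counts. The function names and the counts are hypothetical, chosen only so that the results land near the sample averages quoted above; they are not CPS values.

```python
def duration_based_continuation(u_prev_total, u_now_over_4_weeks):
    """Unemployed at t reporting more than 4 weeks, relative to all unemployed at t-1."""
    return u_now_over_4_weeks / u_prev_total


def flow_based_continuation(uu, ue, un):
    """UU continuations as a share of matched individuals who were U at t-1."""
    return uu / (uu + ue + un)


# Hypothetical weighted counts (illustrative only, not CPS data).
duration_measure = duration_based_continuation(u_prev_total=1000.0, u_now_over_4_weeks=707.0)
flow_measure = flow_based_continuation(uu=537.0, ue=250.0, un=213.0)

print(f"duration-based: {duration_measure:.3f}")
print(f"flow-based:     {flow_measure:.3f}")
```

Note that the flow-based denominator leaves out individuals who are missing at t, which is one reason the two measures need not agree even on the same sample.
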
The duration-based measure overestimates the probability, because a substantial number of people interpret the duration of job search as including on-the-job search or the time since the last salient job; see Elsby et al. (2011), Farber and Valletta (2015), and Kudlyak and Lange (2018). Our reconciled estimate is shown in the dotted blue line in Panel A, and falls in between the other two estimates. The flow-based estimate was a little more accurate at the beginning of the sample, whereas the duration-based measure is closer to our adjusted series today.

Another fundamental question is, how many people become unemployed each month? One estimate (e.g., Shimer, 2012) is simply the number of unemployed individuals reporting durations of less than 5 weeks. The solid line in Panel B of Figure 1 shows this value as a percent of the civilian population. As noted by Elsby et al. (2011), it underestimates new inflows into unemployment since half of EU and NU transitions report unemployment durations of 5 weeks or longer. Alternatively, one can look at EU and NU flows directly, and BLS makes some adjustments to those numbers (shown in dashed green) to try to address some of the problems studied in our paper. However, our analysis suggests that BLS does not completely correct for either rotation-group bias or for nonrandom missing observations. Our reconciled series (dotted blue) is usually significantly higher than the BLS adjusted estimate.

Panel C of Figure 1 compares our adjusted estimate of the unemployment rate with the BLS estimate. Our measure is 1.9 percentage points higher on average, and the gap increased during the Great Recession. The gap recovered gradually after the recession and has only recently returned to its pre-recession level. The gap between our measure and the BLS measure of the labor-force participation rate (Panel D) is 2.2 percentage points on average. It also increased in the Great Recession and remained elevated as of 2018. We conclude that labor-force participation declined slightly less over this period than suggested by the BLS series.

Whereas BLS estimates of unemployment duration are based on individuals’ reported durations of job search, our estimates are based on reconciled spells of unemployment. In going from the dashed green to dotted blue lines in Panel A, we adjusted unemployment continuations up considerably from the standard estimates, but we did not adjust these all the way up to those implied by reported durations in black. As a result, our reconciled estimates of average unemployment durations (shown as dotted blue in Panel E) are considerably below those from BLS (solid black), as also concluded by Kudlyak and Lange (2018). Our estimates of average duration did not rise as much during the Great Recession as suggested by the BLS series based on reported durations. Also, our reconciled estimates subsequently recovered to pre-recession levels, whereas the BLS reported durations did not.
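
To see why continuation probabilities and average durations are two sides of the same calculation, consider the simplifying assumption (made here for illustration only) that every unemployed person faces the same constant monthly exit probability h. Spell lengths are then geometric with mean 1/h, which is the back-of-the-envelope calculation behind the 2011 comparison in the introduction. The function names and the 52/12 weeks-per-month conversion are assumptions of this sketch, not part of the reconciliation procedure.

```python
WEEKS_PER_MONTH = 52.0 / 12.0  # conversion assumed for this sketch


def mean_duration_months(exit_prob):
    """Mean of a geometric spell length with constant monthly exit probability."""
    return 1.0 / exit_prob


def mean_duration_by_summation(exit_prob, horizon=2000):
    """Same mean, computed directly as the sum of d * P(spell lasts exactly d months)."""
    stay = 1.0 - exit_prob
    return sum(d * stay ** (d - 1) * exit_prob for d in range(1, horizon + 1))


h = 0.25  # exit probability quoted for those unemployed longer than 6 months in 2011
implied_weeks = mean_duration_months(h) * WEEKS_PER_MONTH

print(mean_duration_months(h))   # 4.0
print(round(implied_weeks, 1))   # 17.3
```

Under these assumptions the implied mean of roughly 17 weeks is less than half of the 40-week mean duration reported for 2011.
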
Estimates of the employment-population ratio are also influenced by rotation-group bias, but are unaffected by missing observations and misclassification of the long-term unemployed. For some purposes this ratio might therefore serve as a more robust measure of economic slack than the unemployment rate.

A number of important studies have approached the problem of measurement error in the CPS data in a very different way from ours. A common assumption is that the reported data differ from latent true values, with identification coming from assumptions about the joint dynamics of the true values and measurement error. Prominent examples include Biemer and Bushery (2000), Feng and Hu (2013), and Shibata (2016). These studies did not deal with rotation-group bias, nonrandom missing observations, inconsistency between reported duration and the previous labor-force status, or reporting errors of unemployment duration in their correction. Biemer and Bushery (2000) and Shibata (2016) assumed that true labor-force transitions were first-order Markov. Feng and Hu (2013) relaxed this assumption, though Shibata (2016) noted that their approach generates implausible transition probabilities. By contrast, our approach does not impose any Markov assumptions and produces plausible transition probabilities and unemployment durations that are consistent with these probabilities. Our approach also explains well the non-Markov predictability of labor-force status documented by Kudlyak and Lange (2018). Although our methods and assumptions are very different from these studies, we nevertheless reach the same broad conclusions that the BLS significantly underestimates the average unemployment rate and overestimates the average duration of unemployment.

The plan of the paper is as follows. Section 2 describes the structure of the CPS survey and our data set. Section 3 uses averages over the complete sample to document the various inconsistencies in the CPS data and develops the statistical framework that will form the basis for our reconciliation. Section 4 describes the steps we use to reconcile these inconsistencies in the full-sample averages. Section 5 describes how we use this approach to come up with better estimates for each individual month in the sample, detailing the calculations behind the adjusted series plotted in Figure 1. Section 6 briefly concludes.

2 Data construction.

Since 2001, each month around 60,000 housing units are included in the Current Population Survey.
An effort is made to contact each address and determine the number of individuals aged 16 or over who are not in the armed forces or in an institution such as prison or a nursing home. An individual is counted as employed (E) if during the reference week of the survey month the individual did any work at all for pay, for their own business, or was temporarily absent from work due to factors like vacations, illness, or weather. People are counted as unemployed (U) if they were not E but were available for work and made specific efforts to find employment some time during the previous 4 weeks. Individuals who are neither E nor U are counted as not in the labor force (N). One person in the household can provide separate answers for each of the individuals living at that address.

The next month and each of the following two months, the interviewer attempts to contact the same address to ask the same questions. In any given month there are around 7,500 (60,000/8) qualifying households being interviewed for the first time (denoted rotation 1), and another 7,500 each being interviewed for the second, third or fourth time (rotations 2, 3, or 4). After the fourth month the household is not interviewed for the next 8 months, but is reinterviewed 1 year after the first interview (rotation 5) and again for each of the following 3 months (rotations 6, 7, and 8). For data since 1994, if an individual was unemployed in two consecutive months, the interviewer does not ask again the duration of unemployment the second month, but simply adds time elapsed since the previous interview to the previous answer. Thus new unemployment duration data are only collected in rotations 1 and 5, or in the other rotations for someone who was E, N or missing from the sample the month before.

The survey is imperfect for purposes of tracking the experience of an individual across months due to various measurement problems; for discussion of these see Madrian and Lefgren (2000) and Nekarda (2009). Each address has a unique identifier, and an effort is made to associate an individual person within that household with a particular 2-digit number. Our study is unique in treating missing (M) as a separate observed category for someone whose information is not available in a particular rotation or is inconsistent with the information reported for that individual in other rotations.
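
The interview schedule and the rule for when a duration is freshly reported, both described above, can be written down compactly. The following is a minimal sketch; the function and constant names are hypothetical and the duration rule is stated only for the post-1994 design.

```python
HOUSEHOLDS_PER_MONTH = 60_000
ROTATIONS = range(1, 9)


def months_since_first_interview(rotation):
    """Calendar offset of each interview under the 4-on, 8-off, 4-on schedule."""
    if rotation not in ROTATIONS:
        raise ValueError("rotation must be between 1 and 8")
    return rotation - 1 if rotation <= 4 else rotation + 7


def duration_is_newly_reported(rotation, previous_status):
    """True when an unemployed respondent is asked for a duration rather than
    having the previous month's answer carried forward."""
    return rotation in (1, 5) or previous_status in ("E", "N", "M")


schedule = [months_since_first_interview(j) for j in ROTATIONS]
print(schedule)                                  # [0, 1, 2, 3, 12, 13, 14, 15]
print(HOUSEHOLDS_PER_MONTH // len(ROTATIONS))    # 7500
```

For example, `duration_is_newly_reported(3, "U")` is false because a UU continuation in rotation 3 inherits the earlier answer plus elapsed time, whereas `duration_is_newly_reported(3, "N")` is true.
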
As in Abowd and Zellner (1986), we will use information about that individual in months where it is available to correct for the fact that individuals who are sometimes missing (which could come in part from households that are more prone to reporting errors or to having people moving in or out) may differ in systematic ways from individuals for whom 8 separate months of data are available. We check if an individual with the same household and personal identifier is reported to have the same gender and an age that does not differ by more than 2 years across rotations. If so, we consider that individual successfully matched. If not, we designate that individual as M in the months for which no status is available or for which the age and gender records are inconsistent with those reported across the majority of the 8 rotations.¹

¹ Nekarda (2009) used race in addition to age and gender and Madrian and Lefgren (2000) also used education. Both these variables are susceptible to ambiguity and could be reported differently for a fixed individual, particularly when a different individual answers the questions for the household. We topcode age at 65 years or older, so an individual in this age group with the same address, same gender, and same identifying number is considered matched.

The raw data for our study thus consist of $y^{[j]}_{X,t}$, the sum of the number of individuals (multiplied by a weight associated with that individual) who are in rotation $j \in \{1,\dots,8\}$ in month $t$ with reported status $X \in \{E,N,M,U\}$, and $y^{[j]}_{X_1,X_2,t}$, the weighted sum of individuals reporting $X_1$ in rotation $j-1$ in month $t-1$ and $X_2$ in rotation $j$ in month $t$ for $j \in J = \{2,3,4\} \cup \{6,7,8\}$. See Table A-1 in the online appendix for a summary of notation used in this study. A key advantage of our approach is that, unlike the values used by most researchers, our data on stocks and flows are internally consistent by construction, always satisfying the accounting identities

$$y^{[j]}_{X_2,t} = y^{[j]}_{E,X_2,t} + y^{[j]}_{N,X_2,t} + y^{[j]}_{M,X_2,t} + y^{[j]}_{U,X_2,t} \tag{1}$$

$$y^{[j-1]}_{X_1,t-1} = y^{[j]}_{X_1,E,t} + y^{[j]}_{X_1,N,t} + y^{[j]}_{X_1,M,t} + y^{[j]}_{X_1,U,t} \tag{2}$$

for all $t$, $X_1$, $X_2$ and $j \in J$.

One drawback of this procedure is that we need 16 months of observations to determine whether to categorize someone as M in a given month. For example, our sample starts in 2001:7. Someone whose history beginning in 2001:7 was EEMM−MMMM will be counted as M in rotation 3 in 2001:9 by our method, whereas someone who would have had the same history if initially surveyed in 2001:5 would never appear in the sample.² This causes the number of individuals who are classified as M to be artificially depressed in the first year of the sample. A similar effect arises at the end of the sample, with individuals whose record would have been MMEE−EEEE not being apparent if their rotations 1 or 2 would have come at the end of the sample. We therefore adjusted the counts of M and MM at the beginning and end of the sample upward based on the average counts of M for each rotation over the nearest year of complete observations; for details see Appendix A. Since changes in M occur relatively slowly in our sample, this adjustment has little effect on any of the key measures we develop.

² See Appendix A for detailed examples.

We made additional adjustments when new households were added and other households dropped in the 2004 and 2014 sample redesigns.³

³ With the expansion of the survey from 50,000 to 60,000 households, beginning in July 2001, some individuals were added and others dropped across a number of rotations, with waves of new individuals added to subsequent rotations 5. Tracking individuals before and after this break is considerably harder than handling the sample redesign in 2004 and 2014. For this reason we simply begin our analysis with the modern design adopted in July 2001.
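
The accounting identities (1) and (2) hold automatically once M is treated as a status, because stocks are then just row and column sums of a 4×4 table of flows. The sketch below checks this on made-up numbers; the counts, the `flows` dictionary layout, and the function names are all hypothetical.

```python
STATUSES = ("E", "N", "M", "U")

# Hypothetical weighted flows for one rotation pair: flows[(x1, x2)] counts people
# with status x1 in rotation j-1 at t-1 and status x2 in rotation j at t.
counts = [
    [580.0, 15.0, 20.0, 8.0],   # from E
    [14.0, 300.0, 12.0, 9.0],   # from N
    [18.0, 11.0, 60.0, 3.0],    # from M
    [10.0, 9.0, 4.0, 22.0],     # from U
]
flows = {
    (x1, x2): counts[r][c]
    for r, x1 in enumerate(STATUSES)
    for c, x2 in enumerate(STATUSES)
}


def stock_at_t(flows, x2):
    """Identity (1): the stock of x2 at t is the sum of all flows into x2."""
    return sum(flows[(x1, x2)] for x1 in STATUSES)


def stock_at_t_minus_1(flows, x1):
    """Identity (2): the stock of x1 at t-1 is the sum of all flows out of x1."""
    return sum(flows[(x1, x2)] for x2 in STATUSES)


total_before = sum(stock_at_t_minus_1(flows, x) for x in STATUSES)
total_after = sum(stock_at_t(flows, x) for x in STATUSES)
print(stock_at_t(flows, "E"), total_before, total_after)
```

Because nobody leaves the table, the totals at t−1 and t coincide. Dropping the M row and column, as the standard approach implicitly does, would generally break that equality.
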

BLS also assigns a weight to each individual. People with characteristics that are underrepresented in a particular month are given a larger weight. These weights are a partial response of BLS to the issue that missing individuals are not a random sample of the population. We want to include this correction to demonstrate the need for additional corrections for missing individuals. We cannot use the exact BLS weights to do this because the BLS may assign a given individual different weights in two different months, which is another reason in addition to missing observations why (1) and (2) do not hold in the BLS data. Our approach was to assign a fixed weight for an individual across all 8 possible observations based on the BLS weight for that individual in the first month for which data are recorded for that person, as described in Appendix A.

3 Statistical description of labor-force status data.

In this section we develop statistical descriptions of a number of features of the CPS data.

3.1 Unemployment durations reported in rotations 1 and 5.

First we consider the durations of unemployment that are reported on average over our sample by people who are being interviewed for the first time (rotation 1). The blue bars in the top panel of Figure 2 plot the fraction of unemployed reporting the indicated duration of job search in weeks. Clearly there are some significant reporting errors arising from number preference. Respondents are more likely to report spells as an integer number of months, and for longer spells as either 6 months, 1 year, 18 months, or longer than 99 weeks. For shorter spells, people are more likely to report an even number of weeks instead of an odd number; for example, on average there are more people reporting 2 weeks than 1 and 6 weeks than 5. Respondents are extremely unlikely to report a duration of zero weeks, and for this reason we group the 0-week and 1-week observations together into a category of reported duration less than or equal to one week.

To interpret these numbers in an internally consistent way, we impose the restriction that the only way an individual could have been unemployed for $\tau$ weeks would be if the individual had been unemployed for $\tau - 1$ weeks the week before. Thus if $\pi^{\dagger}_U(\tau)$ denotes an internally consistent summary of the fraction of the population who have been searching for $\tau$ weeks, the function $\pi^{\dagger}_U(\tau)$ must be monotonically decreasing in $\tau$. For our baseline specification we propose to represent this function as a mixture of two exponentials with decay rates $p_1$ and $p_2$, respectively. We form a (99×1) vector $\pi^{\dagger}_U$ whose $\tau$th element for $\tau = 1,2,\dots,98$ is an internally consistent representation of the fraction of the working-age population who perceive having been unemployed for a duration of $\tau$ weeks at a fixed point in time, while the 99th element is the fraction with perceived duration greater than 98 weeks:

$$\pi^{\dagger}_U = \pi^{\dagger}_{1U} + \pi^{\dagger}_{2U} \tag{3}$$

$$\underset{(99\times 1)}{\pi^{\dagger}_{iU}} = \pi_U w_i (1-p_i) \begin{bmatrix} 1 & p_i & p_i^2 & \cdots & p_i^{97} & p_i^{98}/(1-p_i) \end{bmatrix}' \quad \text{for } i = 1,2. \tag{4}$$

Here $\pi_U$ denotes the fraction of the population who are unemployed and $w_i$ the fraction of those individuals who are type $i$. Such a distribution would be the outcome of a steady state in which there was a fraction $\pi_U w_1 (1-p_1)$ of the population who lose their jobs each week and for each of whom the probability of continuing unemployed in any subsequent week is $p_1$, and an additional inflow of $\pi_U w_2 (1-p_2)$ individuals with continuation probability $p_2$.⁴

We allow for the various forms of number preference noted above by introducing a (99×99) matrix $A(\theta_A)$ whose elements are determined by a (13×1) vector $\theta_A$. The first element $\theta_{A,1}$ allows a preference for reporting short durations as an even rather than an odd number of weeks, assuming that someone whose true duration is $\tau = 1, 3, 5,$ or $7$ in fact reports duration 2, 4, 6, or 8 with probability $\theta_{A,1}$. The value of $\theta_{A,2}$ represents the probability that someone will round their duration up or down by a week to reach an integer number of months for durations within one week of 1, 2, 3 or 4 months, while someone two weeks away from either of two months is presumed to round down with probability $\theta_{A,3}/2$ and up with probability $\theta_{A,3}/2$. As we move to longer durations we allow for the possibility that the rounding tendencies become stronger, introducing new pairs of parameters for durations between 5-7 months, 8-11 months, or 12 or more months.
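
To make the construction concrete, the sketch below builds the latent vector in (3) and (4) and then applies only the first kind of number preference, the even-week preference governed by the first element of θ_A. It is a deliberately reduced illustration: the full matrix A has 13 parameters, the function names are hypothetical, and the parameter values are invented rather than taken from Table 1.

```python
N_CATEGORIES = 99  # weeks 1..98 plus a final "longer than 98 weeks" category


def latent_durations(pi_u, weights, decay_rates):
    """Mixture-of-exponentials vector following equations (3) and (4)."""
    latent = [0.0] * N_CATEGORIES
    for w_i, p_i in zip(weights, decay_rates):
        scale = pi_u * w_i * (1.0 - p_i)
        for tau in range(1, N_CATEGORIES):
            latent[tau - 1] += scale * p_i ** (tau - 1)
        # Tail category: scale * p_i**98 / (1 - p_i), simplified.
        latent[-1] += pi_u * w_i * p_i ** (N_CATEGORIES - 1)
    return latent


def apply_even_week_preference(latent, theta_1):
    """Move a share theta_1 of true durations 1, 3, 5, 7 to reports of 2, 4, 6, 8."""
    reported = list(latent)
    for tau in (1, 3, 5, 7):
        moved = theta_1 * latent[tau - 1]
        reported[tau - 1] -= moved
        reported[tau] += moved
    return reported


# Hypothetical parameter values (not the estimates in Table 1).
latent = latent_durations(pi_u=0.04, weights=(0.6, 0.4), decay_rates=(0.80, 0.97))
reported = apply_even_week_preference(latent, theta_1=0.3)

print(latent[0] > latent[1])      # latent fractions decline with duration
print(reported[1] > reported[0])  # reported 2 weeks can exceed reported 1 week
```

The latent vector sums to the unemployed share and declines over weeks 1 to 98, while the reported vector reproduces the pattern of more people reporting 2 weeks than 1. Setting `theta_1` to zero returns the latent vector unchanged, which mirrors the no-reporting-error special case.
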
The last elements of $\theta_A$ allow for preferences for integer multiples of 6 months for longer durations. For each $\tau$ the $\tau$th column of $A$ sums to unity and characterizes the probability that someone whose true duration category is $\tau$ will report each of the possible categories $i$ between 1 and 99, where $i$ or $\tau = 99$ is interpreted as true or reported durations longer than 98 weeks. Appendix B provides more details on the structure we use to represent the matrix $A$. Note that our framework does not impose the assumption of the existence or magnitude of any particular reporting error, as it includes as a special case no reporting error of any kind when $\theta_A = 0$.

⁴ We will later examine some testable implications of such an interpretation by looking at the actual unemployment-continuation probabilities for different individuals and also look at alternative functional forms. But for now we propose (3) and (4) as a simple but flexible parametric functional form with which to impose monotonicity on $\pi^{\dagger}_U(\tau)$.

Let $y^{[1]}_{X,t}$ be the number of individuals in rotation group 1 sampled at date $t$ who report status $X$ for $X$ one of E (employed), N (not in labor force), M (labor-force status for that individual is missing), or U (unemployed). We summarize further detail in the last category in terms of $y^{[1]}_{U,t}(\tau)$, which is the number of unemployed who report having been looking for work for $\tau$ weeks for $\tau = 1,\dots,99$.⁵ We compare the observed values $y^{[1]}_{U,t}(\tau)$ with the predicted values represented by the (99×1) vector

$$\dot{\pi}_U = A \pi^{\dagger}_U. \tag{5}$$

We also let $\pi_X$ denote the overall fraction of the population reporting status $X \in \{E,N,M,U\}$. If we treated observations as independent across months $t$ the log likelihood of the rotation 1 observations alone would then be

$$\ell^{[1]}_X(\lambda_X) = \sum_{t=1}^{T} \left[ y^{[1]}_{E,t} \ln \pi_E + y^{[1]}_{N,t} \ln \pi_N + y^{[1]}_{M,t} \ln \pi_M \right] + \sum_{t=1}^{T} \sum_{\tau=1}^{99} y^{[1]}_{U,t}(\tau) \ln \dot{\pi}_U(\tau). \tag{6}$$

We can maximize this with respect to $\theta_A, p_1, p_2, w_1, w_2, \pi_E, \pi_N, \pi_M, \pi_U$ subject to the constraint that all probabilities are between 0 and 1 and sum to unity.⁶

⁵ The duration is top-coded at 99 weeks in our data.
⁶ Maximum likelihood estimates of some parameters are known analytically. Let $y_X = \sum_{t=1}^{T} y_{X,t}$ denote the total number of observations in category $X$ and $n = (y_E + y_N + y_M + y_U)$ the total number of observations. Then $\hat{\pi}_X = y_X/n$ for $X \in \{E,N,M,U\}$. These values can be substituted into expression (6) and the resulting concentrated likelihood then maximized with respect to $\theta_A, p_1, p_2, w_1$ with $w_2 = 1 - w_1$.

Estimates are reported in column 1 of Table 1, along with quasi-maximum-likelihood standard errors in column 2 which allow for the possibility that $y^{[1]}_{X,t}$ is correlated across time (calculated as described in Appendix C). The predicted reported values $\dot{\pi}_U(\tau)$ are compared with the average reported values in the top panel of Figure 2.⁷ This framework is able to describe the reported values extremely accurately. The estimated latent function $\pi^{\dagger}_U(\tau)$ along with its two contributing components are plotted as a function of $\tau$ in the bottom panel of Figure 2. We also considered an alternative functional form based on a Weibull distribution, as discussed in Appendix D. The mixture of exponentials fits the data much better than the Weibull specification, and we will use it in our baseline analysis.

⁷ As noted in the previous footnote, by the nature of the maximization problem, the estimated values $\hat{\pi}_X$ for $X = E, N, M$ exactly match the historical fractions $y_X/(y_E + y_N + y_M + y_U)$.

For rotations 2-4 and 6-8, BLS imputes a duration to those reporting UU continuations, making durations for these individuals a hybrid of perceived and imputed quantities. This can create a downward bias in the number of individuals unemployed for less than 5 weeks as discussed by Abraham and Shimer (2001) and Shimer (2012) and blurs the inconsistency between perceived and imputed durations. Since our goal is to characterize perceived durations separately from objective durations, we do not use the imputed duration in the second month in unemployment. However, there are no imputations for unemployment durations for those people in rotation 5. We therefore repeated the analysis with $y^{[1]}_{X,t}$ in (6) replaced by $y^{[5]}_{X,t}$. Parameter estimates and standard errors are reported in columns 3 and 4 of Table 1. These are very similar to those inferred from the rotation 1 observations alone.

3.2 Characteristics of NU, EU, and MU transitions.

Next consider the status of individuals in rotation 2 who had been counted as not in the labor force when surveyed in rotation 1. Figure 3 focuses on the subset who in the second month (when they were in rotation 2) reported being unemployed, giving the percentage reporting each duration of job search. Two-thirds of these people say they have been looking for a job for longer than 4 weeks, despite the fact that the previous month they did not report actively looking for a job and so were counted as out of the labor force. Eight percent of NU individuals say that they have been looking for a job for a full year and another 8% report having been looking for work for two years or longer.
Although the question is intended to measure the spell of continuous active job search, just how actively an individual was looking for work the previous month is potentially subjective. But it is clear that many of those counted as N the previous month perceived themselves to have been looking for a job at the time despite the N classification.

Of those people who report right after an NU transition that they have been looking for work for more than 4 weeks, what distribution characterizes their perceived duration of job search? We represent the probability of transitions from N to E, N, M, or U with parameters $\pi_{NE}, \pi_{NN}, \pi_{NM}, \pi_{NU}$, respectively, where these four numbers sum to unity. Of those who make an NU transition and report an unemployment duration greater than 4 weeks, suppose that their perceived duration can again be represented by a mixture of two exponentials with decay parameters $p_{1,NU}$ or $p_{2,NU}$. We assume that some fractions $q_{1,NU}, q_{2,NU}, q_{3,NU}$, and $q_{4,NU}$ of those making the NU transition will perceive their unemployment duration to be 1, 2, 3, or 4 weeks respectively, treating these values of $q_{j,NU}$ as completely unrestrained. A fraction $q_{5,NU}$ perceive a duration greater than 4 weeks drawn from an exponential distribution with parameter $p_{1,NU}$ and a fraction $q_{6,NU}$ are characterized by $p_{2,NU}$, with $\sum_{j=1}^{6} q_{j,NU} = 1$. We thus calculate

$$\pi_{NU}^{\dagger}(\tau) = \begin{cases} q_{\tau,NU} & \text{for } \tau = 1, 2, 3, 4 \\ q_{5,NU}(1 - p_{1,NU})\,p_{1,NU}^{\tau-5} + q_{6,NU}(1 - p_{2,NU})\,p_{2,NU}^{\tau-5} & \text{for } \tau = 5, 6, \ldots, 98 \\ q_{5,NU}\,p_{1,NU}^{94} + q_{6,NU}\,p_{2,NU}^{94} & \text{for } \tau = 99. \end{cases} \tag{7}$$

The predicted probability of each reported duration is then given by $\dot{\pi}_{NU} = \pi_{NU} A \pi_{NU}^{\dagger}$.

Let $y_{NX,t}^{[2]}$ denote the number of individuals who counted as not in the labor force in rotation 1 in month $t-1$ and reported status $X$ at date $t$, where $X \in \{E, N, M, U\}$. Let $y_{NU,t}^{[2]}(\tau)$ denote the number of NU who report unemployment duration $\tau \in \{1, \ldots, 98, \geq 99\}$ in rotation 2. Then the contribution to the likelihood for months $t = 1, \ldots, T$ from rotation 2 NX transitions is

$$\ell_{NX}^{[2]}(\lambda_{NX}) = \sum_{t=1}^{T}\left[y_{NE,t}^{[2]}\ln\pi_{NE} + y_{NN,t}^{[2]}\ln\pi_{NN} + y_{NM,t}^{[2]}\ln\pi_{NM}\right] + \sum_{t=1}^{T}\sum_{\tau=1}^{99} y_{NU,t}^{[2]}(\tau)\ln\dot{\pi}_{NU}(\tau). \tag{8}$$

This expression can then be maximized with respect to $\lambda_{NX} = (\theta_{A,NU}', p_{1,NU}, p_{2,NU}, \pi_{NE}, \pi_{NN}, \pi_{NM}, \pi_{NU}, q_{1,NU}, q_{2,NU}, \ldots, q_{6,NU})'$ subject to the constraints that all parameters fall between 0 and 1, $\pi_{NE} + \pi_{NN} + \pi_{NM} + \pi_{NU} = 1$, and $\sum_{j=1}^{6} q_{j,NU} = 1$.

Quasi-maximum-likelihood estimates $\hat{\lambda}_{NX}$ are reported in column 5 of Table 1 and predicted values $\dot{\pi}_{NU}$ compared with historical average values for $y_{NU}$ in Figure 3.
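The piecewise form in (7) can be checked numerically. The following is a minimal sketch in Python, not the authors' code: the function name `perceived_duration_pmf` and all numerical inputs are hypothetical placeholders chosen for illustration (they are not the Table 1 estimates), and the reporting-error matrix $A$ and the scale factor $\pi_{NU}$ are deliberately left out. It shows that the three branches of (7) together form a proper probability distribution over reported durations 1 through 99.

```python
def perceived_duration_pmf(q, p1, p2, max_weeks=99):
    """Latent perceived-duration probabilities in the spirit of equation (7).

    q  : six fractions (weeks 1-4 free, then the two exponential types)
    p1 : weekly decay parameter of the first exponential type
    p2 : weekly decay parameter of the second exponential type
    """
    if len(q) != 6 or abs(sum(q) - 1.0) > 1e-9:
        raise ValueError("q must hold six fractions that sum to one")
    pmf = []
    for tau in range(1, max_weeks + 1):
        if tau <= 4:
            # unrestricted fractions for durations of 1 to 4 weeks
            prob = q[tau - 1]
        elif tau < max_weeks:
            # mixture of two geometric tails starting at week 5
            prob = (q[4] * (1 - p1) * p1 ** (tau - 5)
                    + q[5] * (1 - p2) * p2 ** (tau - 5))
        else:
            # top-coded cell: all remaining mass at or beyond max_weeks
            prob = q[4] * p1 ** (max_weeks - 5) + q[5] * p2 ** (max_weeks - 5)
        pmf.append(prob)
    return pmf


# Hypothetical inputs, for illustration only (not estimates from the paper).
example_q = [0.10, 0.10, 0.05, 0.05, 0.30, 0.40]
example_pmf = perceived_duration_pmf(example_q, p1=0.80, p2=0.97)
print(len(example_pmf), sum(example_pmf))
```

The top-coded cell uses the exponent $99 - 5 = 94$, which is what makes the geometric terms for $\tau = 5, \ldots, 98$ and the final cell add up to $q_{5,NU} + q_{6,NU}$.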
Note that $\theta_A$ was estimated in column 1 solely from individuals who were recorded as being unemployed in rotation 1, in column 3 solely from individuals who were unemployed in rotation 5, and in column 5 solely from individuals who were recorded as being out of the labor force in rotation 1 and unemployed in rotation 2. Although the vector $\theta_A$ was estimated from very different data, the estimated values are quite similar. Likewise $\hat{p}_{1,NU}$ and $\hat{p}_{2,NU}$ turn out to be very close to the values $\hat{p}_1$ and $\hat{p}_2$ estimated from rotations 1 and 5.

Next consider the status in month $t$ of individuals who were recorded as employed when sampled in rotation 1 in month $t-1$. Twenty-nine percent of those who make EU transitions report durations longer than 4 weeks. Unlike the NU transitions, we do not interpret these as necessarily implying an inaccuracy in either the E or U designation. Kudlyak and Lange (2018) noted these could represent records of individuals who were employed in $t-1$ but were engaged in on-the-job search for a new job.⁸ It is nevertheless interesting to characterize transitions from employment using the same framework as above, replacing NX in (8) with EX. Parameter estimates and standard errors are reported in columns 7 and 8 of Table 1. Far fewer of those making EU transitions perceive themselves as long-time job seekers ($q_{6,EU} = 0.17$ versus $q_{6,NU} = 0.51$). Interestingly, the estimates of $p_1$, $p_2$ and $\theta_A$ are similar to those inferred in columns 1, 3, and 5; unemployed individuals in each of these categories can be broadly characterized in terms of the same two types.

Finally, we consider the status in rotation 2 of individuals who were missing in rotation 1, replacing EX with MX. Quasi-maximum-likelihood estimates are reported in column 9 of Table 1. Notably, again we find very similar estimates for $p_1$, $p_2$ and $\theta_A$ as in the other data sets, with $q_{6,MU}$ much closer to $q_{6,NU}$ than to $q_{6,EU}$.

3.3 Characteristics of UX transitions.

We next examine UX transitions. We have modeled the fraction of the population that reports being unemployed with duration $\tau$ as given by the $\tau$th element of the vector $\xi_1 + \xi_2$, where $\xi_i = A\pi_{iU}^{\dagger}$ for $\pi_{iU}^{\dagger}$ given in (4).
If we observe someone reports a duration of $\tau$, the specification allows us to calculate the probability that the individual is type $i$ using the formula

$$\eta_i(\tau) = \xi_i(\tau)/[\xi_1(\tau) + \xi_2(\tau)] \tag{9}$$

for $i = 1$ or 2. The function $\eta_2(\tau)$ is plotted in Figure 4.⁹ Someone who reports a duration of $\tau = 1$ week is quite unlikely to have come from the second distribution, whereas someone who reports a duration greater than 40 weeks is almost certain to have come from the second distribution. The function dips down at duration $\tau = 26$ weeks because, given the tendency of answers to clump at this value, this observation includes many individuals whose true duration is less than 26 weeks and accordingly contains a higher mix of type 1 relative to those reporting 25 weeks.

Though the function $\eta_i(\tau)$ can be motivated on the basis of a particular parametric model of the distribution of reported unemployment durations, the measure can alternatively be viewed as a simple device to allow for the possibility that different outcomes are expected for individuals who report longer spells of unemployment relative to those who report shorter spells.¹⁰

⁸ Ahn and Shao (2017) further documented that on-the-job search constitutes a non-negligible fraction of aggregate job search. Hall and Kudlyak (2019) found that many job losers make frequent transitions between short-term employment, unemployment, and out of the labor force before finding a long-term job.

⁹ For purposes of this graph, this function was calculated using the values of $w_1, p_1, p_2, \theta_A$ from Table 3, which pool all observations from all rotations to estimate these parameters.

¹⁰ One could alternatively try to get at this idea by setting $\eta_2(\tau) = 0$ for $\tau \leq K$ and unity for $\tau > K$. That kind of simple dichotomization into short-term and long-term unemployment would have the drawbacks that it requires picking an arbitrary cut-off $K$ and implies an abrupt discontinuity in outcomes expected for individuals slightly below $K$ relative to those slightly above $K$. By contrast, the approach we follow here uses a smooth function $\eta_2(\tau)$ relating reported duration $\tau$ to expected outcomes.

Let the scalar $\gamma_{i,UX}$ be the probability that an individual of type $i$ makes a transition from unemployment in rotation group 1 to status $X = E, N, M$, or $U$ in rotation 2, so $\gamma_{i,UE} + \gamma_{i,UN} + \gamma_{i,UM} + \gamma_{i,UU} = 1$ for both $i = 1$ and $i = 2$. Let $\dot{\pi}_{UX}$ denote a $(99 \times 1)$ vector whose $\tau$th element is the probability that someone who reports duration $\tau$ in month $t$ has status $X$ in month $t+1$. Under the above assumptions this would be predicted to be

$$\dot{\pi}_{UX} = \eta_1\gamma_{1,UX} + \eta_2\gamma_{2,UX}. \tag{10}$$

For $y_{UX,t}^{[2]}(\tau)$ the observed number of individuals who report U with duration $\tau$ in rotation 1 and status $X$ in rotation 2, we then have the likelihood function

$$\ell_{UX}^{[2]}(\lambda_{UX}) = \sum_{t=1}^{T}\sum_{\tau=1}^{99}\left[y_{UE,t}^{[2]}(\tau)\ln\dot{\pi}_{UE}(\tau) + y_{UN,t}^{[2]}(\tau)\ln\dot{\pi}_{UN}(\tau) + y_{UM,t}^{[2]}(\tau)\ln\dot{\pi}_{UM}(\tau) + y_{UU,t}^{[2]}(\tau)\ln\dot{\pi}_{UU}(\tau)\right]. \tag{11}$$

We fixed $\eta_2$ to be the function plotted in Figure 4 and maximized (11) with respect to $\{\gamma_{i,UE}, \gamma_{i,UN}, \gamma_{i,UM}, \gamma_{i,UU}\}_{i=1,2}$ subject to the constraint that $\gamma_{i,UE} + \gamma_{i,UN} + \gamma_{i,UM} + \gamma_{i,UU} = 1$ for $i = 1, 2$.

Quasi-maximum-likelihood estimates and standard errors are reported in rows 1 and 2 of Table 2. Type 1 individuals have a 32% probability of being employed next month, whereas the probability for type 2 individuals is only 12%. Type 1 individuals have a 37% probability of being unemployed next month, whereas for type 2 the probability is 58%. We also repeated the analysis using only data for individuals who were unemployed in rotation 5, with very similar results.

Consider what we would have expected these coefficients to have been if perceived unemployment durations matched with actual unemployment-continuation probabilities. If type 1 individuals truly had a weekly unemployment-continuation probability of $p_1 = 0.8094$, we would expect to observe a monthly continuation probability of $0.8094^{4.33} = 0.40$. If we condition on missing observations having the same distribution as observed E, N and U, this value would be broadly consistent with the value we'd predict from Table 2 of $\gamma_{1,UU}/(1 - \gamma_{1,UM}) = 0.41$. By contrast, the perceived long-term unemployed are another story. Their perceived weekly unemployment-continuation probability of $p_2 = 0.9734$ would imply a monthly continuation probability of $0.9734^{4.33} = 0.89$, far larger than the estimate $\gamma_{2,UU}/(1 - \gamma_{2,UM}) = 0.63$. Even more dramatically, a monthly continuation probability of 0.63 would mean a probability of remaining unemployed for 6 months of $0.63^6 = 0.06$. But in the BLS data, the fraction of those unemployed who report durations over 26 weeks averages 27%. Far fewer people than are reported in the data should be unemployed longer than 6 months if people left the pool of long-term unemployed at anything like the rate implied by $\gamma_{2,UU}$.

The observed unemployment continuation probabilities are not consistent with the distribution of reported unemployment durations. The result is robust whether one uses our parametric model or any other. For example, Appendix D derives the analogous result using a Weibull characterization of durations.
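The weekly-to-monthly arithmetic behind this comparison is easy to reproduce. The sketch below is only an illustration in Python: the helper names are hypothetical, while the inputs (0.8094, 0.9734, 0.63, and the 4.33 weeks-per-month conversion) are the figures quoted in the text.

```python
WEEKS_PER_MONTH = 4.33  # conversion factor used in the text


def monthly_continuation(weekly_prob, weeks=WEEKS_PER_MONTH):
    """Monthly continuation probability implied by a constant weekly one."""
    return weekly_prob ** weeks


def survival(monthly_prob, months):
    """Probability of remaining unemployed for `months` consecutive months."""
    return monthly_prob ** months


type1_monthly = monthly_continuation(0.8094)   # perceived type 1
type2_monthly = monthly_continuation(0.9734)   # perceived type 2
six_month_survival = survival(0.63, 6)         # flow-based type 2 estimate

print(round(type1_monthly, 2), round(type2_monthly, 2),
      round(six_month_survival, 2))
```

Rounded to two decimals, these reproduce the 0.40, 0.89, and 0.06 reported above, so the gap between 0.89 (implied by reported durations) and 0.63 (implied by observed transitions) is not a rounding artifact.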
Any model that accurately describes the cross-section of durations (and ours does so quite well) is going to predict an unemployment-continuation probability similar to the stock-based measure plotted as the solid line in Figure 1, which we noted is inconsistent with the flow-based measure. The main advantage of our parametric approach is that it highlights that this inconsistency between the stock-based and flow-based measures comes entirely from those whom we have characterized as the perceived long-term unemployed.

For a broad summary of the features of the data, we pool together all observations for all rotations, allowing $\gamma_{i,UU}$ to be completely independent of the value of $p_i$ while treating the values of $\theta_A$, $p_1$, and $p_2$ as the same across all rotation groups. This summary of the full data set was obtained by maximizing the full-sample likelihood

$$\ell = \ell_X^{[1]} + \ell_X^{[5]} + \sum_{j \in J}\left\{\ell_{EX}^{[j]} + \ell_{NX}^{[j]} + \ell_{MX}^{[j]}\right\} + \ell_{UX}^{[2]} + \ell_{UX}^{[6]}. \tag{12}$$

These full-sample estimates are reported in Table 3.

3.4 Rotation-group bias.

Another source of error in the CPS data is the difference across different rotations in the reported labor-force status. Table 4 reports the monthly average number of sampled individuals with measured labor force status E, N, M, or U for each of the 8 rotation groups.¹¹ Column 6 shows that the average unemployment rate declines sharply as a function of rotation group, starting out at 6.8% for rotation 1 but falling all the way to 5.9% for rotation 8. Column 7 reveals another interesting fact that appears not to have been noticed by other researchers: the measured labor-force participation rate falls even more sharply. Column 3 documents a third tendency: individuals are much more likely to be missed in rotations 1 and 5 compared to other groups.

¹¹ For example, the entry in the first row and column is $T^{-1}\sum_{t=1}^{T} y_{E,t}^{[1]}$.

We can summarize these tendencies with some simple regressions. Let $x_t^{[j]} = 100\,y_{X,t}^{[j]} / \left\{y_{E,t}^{[j]} + y_{N,t}^{[j]} + y_{M,t}^{[j]} + y_{U,t}^{[j]}\right\}$ denote the percentage of individuals in rotation group $j$ sampled in month $t$ with measured status $X = E, N, M$, or $U$; thus $e_t^{[j]} + n_t^{[j]} + m_t^{[j]} + u_t^{[j]}$ exactly equals 100 for every $j$ and every $t$. Consider an 8-variable panel regression with time fixed effects where the dependent variable is $n_t^{[j]}$, $j = 1, \ldots, 8$, $t = 1, \ldots, T$:

$$n_t^{[j]} = \alpha_{nt} + \delta_n j + \alpha_{n1} d_{1t} + \alpha_{n5} d_{5t} + \varepsilon_{nt}^{[j]}. \tag{13}$$

Here $\alpha_{nt}$ is the time fixed effect for month $t$, $\delta_n$ captures a linear trend across rotations (with increased fraction of N in later rotations captured by $\delta_n > 0$), $d_{1t} = 1$ if $j = 1$ and 0 otherwise allows for something special about the first rotation group, while $d_{5t} = 1$ if $j = 5$ serves a similar function for rotation 5. The estimates of these parameters along with standard errors are reported in column 2 of Table 5, and their implications are plotted as the thin red curve in Figure 5. These coefficients capture the tendency for the percentage of individuals classified as N to increase sharply across rotation groups.

Coefficients for panel regressions in which $e_t^{[1]}, \ldots, e_t^{[8]}$ are the 8 dependent variables are reported in column 1 of Table 5 and plotted as the thick black curve in Figure 5. Coefficients when unemployment is the dependent variable are in column 3 and plotted as the dashed blue line. The rising trend across rotations in N ($\delta_N = 0.0011$) is accounted for by falling trends in E and U ($\delta_E + \delta_U = -0.0011$). The bulges in M in rotation 1 ($\alpha_{M1} = 0.0159$) and rotation 5 ($\alpha_{M5} = 0.0149$) are accounted for by drops in E and N in those rotations.¹²

4 Reconciling the inconsistencies.

In this section, we propose methods to reconcile the inconsistencies identified in Section 3.

4.1 Rotation-group bias.

We have seen that a given household can give different answers depending on the number of times the household has previously been interviewed. We interpret this as differences in interview technology: the process by which data are obtained differs across rotations, and the numbers should be interpreted as meaning different things. As a first step we summarize these differences in the form of a counterfactual question: if an individual in rotation $j$ had instead been interviewed using the technology $i$, how would their answers have differed? We initially show how to answer this question for $i = 1$ and then find the answer for any $i$. We then ask, which interview technology $i$ should be used to summarize the data? We identify several reasons why we prefer to use the answers that people give the first time they are interviewed ($i = 1$).

Modeling the differences in interview technology. For each rotation $j = 1, 2, \ldots, 8$, let $\pi^{[j]} = (\pi_E^{[j]}, \pi_N^{[j]}, \pi_M^{[j]}, \pi_U^{[j]})'$ denote the fraction over the full sample of individuals who reported status $X$ when interviewed in rotation $j$.
For each $j \in J = \{2, 3, 4\} \cup \{6, 7, 8\}$, of the individuals who reported status $X_1$ in rotation $j-1$, some fraction $\pi_{X_1,X_2}^{[j]}$ are observed to report status $X_2$ in rotation $j$ for $X_i \in \{E, N, U, M\}$; thus $\pi_{XE}^{[j]} + \pi_{XN}^{[j]} + \pi_{XU}^{[j]} + \pi_{XM}^{[j]} = 1$ for all $X$ and $j \in J$. Collect these observed probabilities in a matrix

$$\Pi^{[j]} = \begin{bmatrix} \pi_{EE}^{[j]} & \pi_{NE}^{[j]} & \pi_{ME}^{[j]} & \pi_{UE}^{[j]} \\ \pi_{EN}^{[j]} & \pi_{NN}^{[j]} & \pi_{MN}^{[j]} & \pi_{UN}^{[j]} \\ \pi_{EM}^{[j]} & \pi_{NM}^{[j]} & \pi_{MM}^{[j]} & \pi_{UM}^{[j]} \\ \pi_{EU}^{[j]} & \pi_{NU}^{[j]} & \pi_{MU}^{[j]} & \pi_{UU}^{[j]} \end{bmatrix} \qquad j \in J.$$

Notice that each column of $\Pi^{[j]}$ sums to unity. For example, for the first column, if someone reported status E when interviewed in rotation $j-1$, they must have had one of the statuses E, N, M, or U in rotation $j$.

¹² These findings are consistent with Krueger, Mas, and Niu's (2017) finding that rotation-group bias is associated with nonresponses and with Bailar's (1975) conclusion that the rotation-group bias of the unemployment rate can be explained by the participation margin.

For an individual who reported status $X^{[j]}$ in rotation $j$, consider the counterfactual answer that individual would have given if interviewed using the interview technology that was used for rotation 1:

$$r_{X^{[j]},X^{[1]}}^{[j]} = \text{Prob}(\text{would have answered } X^{[1]} \text{ using technology 1 given answered } X^{[j]} \text{ using technology } j).$$

Collect these counterfactual probabilities in a matrix

$$R^{[j]} = \begin{bmatrix} r_{EE}^{[j]} & r_{NE}^{[j]} & r_{ME}^{[j]} & r_{UE}^{[j]} \\ r_{EN}^{[j]} & r_{NN}^{[j]} & r_{MN}^{[j]} & r_{UN}^{[j]} \\ r_{EM}^{[j]} & r_{NM}^{[j]} & r_{MM}^{[j]} & r_{UM}^{[j]} \\ r_{EU}^{[j]} & r_{NU}^{[j]} & r_{MU}^{[j]} & r_{UU}^{[j]} \end{bmatrix} \qquad j \in J.$$

Notice that each column of $R^{[j]}$ sums to unity. For example, for the first column, given that an individual reported status E when interviewed in rotation $j$, they would have to have given one of the answers E, N, M, U if interviewed using the technology of rotation 1. From the analysis above, we expect $r_{NU}^{[j]} > 0$; some of the individuals who report labor status N in rotation $j$ would have reported status U if they had been interviewed for the first time. We also expect $r_{EM}^{[j]} > 0$ and $r_{NM}^{[j]} > 0$; some of the individuals who were reported as status E or N in rotation $j$ would have been missing using the interview technology of rotation 1.

Notice that

$$R^{[j]}\pi^{[j]} = \pi^{[1]} \quad \text{for } j \in J. \tag{14}$$

For example, the first row states

$$r_{EE}^{[j]}\pi_E^{[j]} + r_{NE}^{[j]}\pi_N^{[j]} + r_{ME}^{[j]}\pi_M^{[j]} + r_{UE}^{[j]}\pi_U^{[j]} = \pi_E^{[1]}.$$

This equation states that the fraction who reported E in rotation 1 can be viewed as the fraction who reported $X^{[j]}$ in rotation $j$ times the probability someone reporting $X^{[j]}$ would have reported E using technology 1, added across the four possible $X^{[j]}$.

In Section 3.4 we found that the decline in U across rotations is accounted for by a corresponding trend up in N and that differences in M in rotations 1 and 5 correspond to matching drops in E and N. We propose to capture the key differences in interview technology using three parameters $\theta^{[j]} = (\theta_{EM}^{[j]}, \theta_{NM}^{[j]}, \theta_{NU}^{[j]})'$:¹³

$$R^{[j]} = \begin{bmatrix} 1 - \theta_{EM}^{[j]} & 0 & 0 & 0 \\ 0 & 1 - \theta_{NM}^{[j]} - \theta_{NU}^{[j]} & 0 & 0 \\ \theta_{EM}^{[j]} & \theta_{NM}^{[j]} & 1 & 0 \\ 0 & \theta_{NU}^{[j]} & 0 & 1 \end{bmatrix}. \tag{15}$$

Note we can estimate $\theta^{[j]}$ for $j = 2, 3, \ldots, 8$ immediately from rows 1, 4, and 2 of (14):

$$1 - \theta_{EM}^{[j]} = \pi_E^{[1]}/\pi_E^{[j]} \tag{16}$$

$$\theta_{NU}^{[j]} = (\pi_U^{[1]} - \pi_U^{[j]})/\pi_N^{[j]} \tag{17}$$

$$1 - \theta_{NM}^{[j]} - \theta_{NU}^{[j]} = \pi_N^{[1]}/\pi_N^{[j]}. \tag{18}$$

¹³ We take the (3,3) and (4,4) elements of $R^{[j]}$ to be unity because a higher fraction of the population is M or U in rotation 1 than in other rotations. For example, the third equation in (14) states that the fraction missing in rotation 1 is the fraction missing in rotation $j$ plus some portions $\theta_{EM}^{[j]}$ and $\theta_{NM}^{[j]}$ of the fractions that are E and N in rotation $j$: $\pi_M^{[1]} = \pi_M^{[j]} + \theta_{EM}^{[j]}\pi_E^{[j]} + \theta_{NM}^{[j]}\pi_N^{[j]}$. Note that the normalization of the third and fourth columns of $R^{[j]}$ still allows equation (14) to fit exactly the observed average values of every element of $\pi^{[j]}$ for every $j$.

These values for $\theta^{[j]}$ are plotted in Figure 6. The left panel shows that 1-2% of the individuals who get counted as employed in rotations 2-4 or 6-8 would have been missing from the survey if the rotation 1 interview technology had been used. On the other hand, rotation 5 (which follows an 8-month break) reports similar numbers of E as rotation 1 ($\theta_{EM}^{[5]}$ near 0).¹⁴ The middle panel captures a rising tendency for those who would have been counted as N in later rotations to have been counted as U in the first interview. The right panel indicates that a large and rising fraction of those counted N in later rotations would have been M in rotation 1.

¹⁴ The estimate of $\theta_{EM}^{[5]}$ from equation (16) is actually very slightly negative ($-0.0049$). The values plotted in Figure 6 and used in the calculations below set $\theta_{EM}^{[5]} = 0$. This makes essentially no difference for any results.

For purposes of what we are doing so far (summarizing average values over the full sample), expression (14) along with (16)-(18) is just an accounting identity that holds exactly in the observed data. By choosing the three parameters $\theta_{EM}^{[j]}, \theta_{NU}^{[j]}, \theta_{NM}^{[j]}$ we can match the three free values in $\pi^{[j]}$ exactly (the fourth being pinned down by the fact that elements of $\pi^{[j]}$ sum to unity).¹⁵ Our representation will have substantive implications when we now consider the matrix of transition probabilities and in Section 5 when we develop a time-varying generalization of this approach.

¹⁵ One can show that equations (16)-(18) imply that row 3 of (14) also holds. Add rows 1, 2, and 4 of (14) together to deduce $\pi_E^{[j]} + \pi_U^{[j]} + \pi_N^{[j]} - \theta_{EM}^{[j]}\pi_E^{[j]} - \theta_{NM}^{[j]}\pi_N^{[j]} = \pi_E^{[1]} + \pi_U^{[1]} + \pi_N^{[1]}$. Subtracting both sides from 1 gives $\pi_M^{[j]} + \theta_{EM}^{[j]}\pi_E^{[j]} + \theta_{NM}^{[j]}\pi_N^{[j]} = \pi_M^{[1]}$ as required by the third row of (14). In general, since each column of $R^{[j]}$ sums to unity, if elements of $\pi$ sum to unity, then the elements of $R^{[j]}\pi$ also sum to unity: $1'R^{[j]}\pi = 1'\pi = 1$ for $1$ a vector of four ones.

Notice that the averages over the full sample exactly satisfy the following accounting identity:

$$\Pi^{[j]}\pi^{[j-1]} = \pi^{[j]} \qquad j \in J. \tag{19}$$

Premultiply (19) by $R^{[2]}$ for $j = 2$ and use result (14):

$$R^{[2]}\Pi^{[2]}\pi^{[1]} = R^{[2]}\pi^{[2]} = \pi^{[1]}.$$

In other words, $R^{[2]}\Pi^{[2]}$ could be used as the counterfactual transition matrix $\Pi^*$ if people were interviewed with the same technology in rotation 2 as in rotation 1, satisfying the requirement for consistency between transition probabilities and marginal probabilities that $\Pi^*\pi^{[1]} = \pi^{[1]}$ for $\Pi^* = R^{[2]}\Pi^{[2]}$. More generally, premultiplying (19) by $R^{[j]}$ we see

$$R^{[j]}\Pi^{[j]}(R^{[j-1]})^{-1}R^{[j-1]}\pi^{[j-1]} = R^{[j]}\pi^{[j]}$$

$$R^{[j]}\Pi^{[j]}(R^{[j-1]})^{-1}\pi^{[1]} = \pi^{[1]} \quad \text{for } j \in J \tag{20}$$

where $R^{[1]}$ is defined to be the identity matrix. Thus $\Pi^{*[j]} = R^{[j]}\Pi^{[j]}(R^{[j-1]})^{-1}$ satisfies the internal consistency requirement $\Pi^{*[j]}\pi^{[1]} = \pi^{[1]}$ for each $j$.

Let $\pi^*$ denote the counterfactual fraction if everyone was interviewed with rotation technology 1 and $\Pi^*$ the counterfactual transition probabilities. We have seen that the matrices $\hat{\Pi}^{*[j]} = R^{[j]}\Pi^{[j]}(R^{[j-1]})^{-1}$ give us different estimates of $\Pi^*$ for different rotations. We propose to estimate $\Pi^*$ as the value that minimizes the errors in predicting $\Pi^{[j]}$ on the basis of $\Pi^*$. Specifically, taking $R^{[j]}$ as given we choose $\Pi^*$ so as to minimize the sum of squared elements of

$$\Pi^{[j]} - (R^{[j]})^{-1}\Pi^*R^{[j-1]} \quad \text{for } j \in J. \tag{21}$$

Note we do not have a matrix of transition probabilities into rotation $j = 1$, and we do not use the transition probabilities from rotation 4 to rotation 5 because this 8-month transition is a fundamentally different object from the other 1-month transition probabilities. Instead for rotations 1 and 5 we compare the observed fractions $\pi^{[1]}$ and $\pi^{[5]}$ with the unconditional probabilities implied by $\Pi^*$,

$$\pi^{[1]} - \pi^* \tag{22}$$

$$\pi^{[5]} - (R^{[5]})^{-1}\pi^* \tag{23}$$

where $\pi^*$ is calculated as in Hamilton (1994, eq. [22.2.26]):

$$B = \begin{bmatrix} I_4 - \Pi^* \\ 1' \end{bmatrix} \tag{24}$$

$$\pi^* = (B'B)^{-1}B'e_5 \tag{25}$$

where $1'$ denotes a $(1 \times 4)$ vector of ones and $e_5$ denotes column 5 of $I_5$.

We have an estimate of $R^{[j]}$ from (15)-(18), while $\Pi^{[j]}$ and $\pi^{[j]}$ are observed data. Our approach is then to estimate the elements of $\Pi^*$ (subject to the constraints that every element is between 0 and 1 and that columns sum to unity) so as to minimize the sum of squares of the $96 = 16 \times 6$ elements in (21) plus the sum of squares of the 8 elements in (22) and (23). The resulting estimates of $\pi^*$ and $\Pi^*$ are reported in Tables 6 and 7.

The above framework predicts that the fraction of individuals reporting status E, N, M, or U when interviewed using technology $j$ would be given by

$$\hat{\pi}^{[j]} = (\hat{R}^{[j]})^{-1}\pi^*. \tag{26}$$

These predicted shares are compared with the actual shares reported for each rotation in Figure 7. Our representation fits the values in each $\pi^{[j]}$ essentially perfectly.

Our approach also implies a predicted value for the observed fraction of individuals with measured transitions from $X^{[j-1]}$ to $X^{[j]}$:

$$\hat{\Pi}^{[j]} = (\hat{R}^{[j]})^{-1}\Pi^*\hat{R}^{[j-1]}. \tag{27}$$

Figure 8 plots these predicted values along with the actual reported fractions for $j \in J$.¹⁶ These show a reasonable fit, though not perfect. One could try to model in more detail features such as the tendency for those missing in rotation 1 to be reported as employed in rotation 2 and for those not in the labor force in rotation 1 to be missing in rotation 2. Notwithstanding, our simple parsimonious framework does a reasonable job of capturing transitions.

We defined the value of $\pi^*$ in terms of the rotation 1 technology. But now that we've found $\pi^*$, we can also calculate the answer using any other technology. For example, $(\hat{R}^{[5]})^{-1}\pi^*$ gives the answer in terms of the rotation 5 technology. The BLS approach, which simply averages the rotations together, is implicitly reporting results in terms of an "average" technology, which in our formulation would be described as $(1/8)\sum_{j=1}^{8}(\hat{R}^{[j]})^{-1}\pi^*$.

Where does rotation-group bias come from? The framework above allows us to reconcile stocks and flows in the CPS data and summarize that reconciliation using any interview technology or combination of interview technologies. In practice we need to choose a particular technology for purposes of that summary. Which one we choose depends on what we think is the source of the bias.
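The accounting identity in (14) together with the parameterization in (15)-(18) can be illustrated with a small numerical sketch. The Python below is an assumption-laden illustration rather than the authors' code: the function names are hypothetical, the shares in `pi_1` and `pi_j` are made-up placeholders (not values from Table 4), and vectors are ordered (E, N, M, U) as in the definition of $\pi^{[j]}$.

```python
def theta_from_shares(pi_1, pi_j):
    """Solve (16)-(18) for (theta_EM, theta_NM, theta_NU)."""
    e1, n1, _m1, u1 = pi_1
    ej, nj, _mj, uj = pi_j
    theta_em = 1.0 - e1 / ej               # equation (16)
    theta_nu = (u1 - uj) / nj              # equation (17)
    theta_nm = 1.0 - theta_nu - n1 / nj    # equation (18)
    return theta_em, theta_nm, theta_nu


def build_R(theta_em, theta_nm, theta_nu):
    """Matrix in (15); rows and columns ordered (E, N, M, U)."""
    return [
        [1.0 - theta_em, 0.0, 0.0, 0.0],
        [0.0, 1.0 - theta_nm - theta_nu, 0.0, 0.0],
        [theta_em, theta_nm, 1.0, 0.0],
        [0.0, theta_nu, 0.0, 1.0],
    ]


def matvec(matrix, vector):
    """Plain matrix-vector product."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


# Placeholder shares for illustration only; each vector sums to one.
pi_1 = [0.560, 0.300, 0.100, 0.040]   # rotation 1 (hypothetical)
pi_j = [0.575, 0.335, 0.055, 0.035]   # a later rotation j (hypothetical)

theta = theta_from_shares(pi_1, pi_j)
R_j = build_R(*theta)
pi_1_implied = matvec(R_j, pi_j)      # should reproduce pi_1, as in (14)
print(theta, pi_1_implied)
```

Only rows 1, 2, and 4 are used to pin down the three parameters; the third row of the product matches automatically because every column of the matrix sums to one, which is the point made in footnote 15.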
Halpern-Manners and Warren (2012) suggested that admitting to have failed in finding a job for consecutive months may produce feelings of stigma or shame, which could lead people to say "no" in follow-up interviews when asked if they are still actively looking for a job. Those who report U are asked "what are the things you have done to find work during the last 4 weeks?" and "how long had you been looking for a job?" Halpern-Manners and Warren also raised the possibility that, having learned the questions that follow up if they report U, respondents may believe they would be asked fewer or less onerous follow-up questions if they instead report N or E. Halpern-Manners and Warren concluded that these two factors lead to a downward bias in estimates of the unemployment rate based on later rotations.

¹⁶ Note we do not offer a predicted value for transitions from $X^{[4]}$ to $X^{[5]}$ since there are 8 intervening months between rotations 4 and 5.

We saw in Figure 5 that most of the decrease in U across rotations is accounted for by an increase in N. And the long durations of unemployment search that people report suggest that significant numbers of those classified as N in fact view themselves as looking for work. For these reasons, we think that the answer people give the first time they are asked whether they are unemployed is the best one to use for purposes of reconciling the full set of observed data, and we will use $\pi^*$ as our preferred reconciliation of rotation-group bias. However, we emphasize that our framework could be used to calculate an alternative reconciled estimate $\pi^{*[j]} = (\hat{R}^{[j]})^{-1}\pi^*$ from the perspective of any specified technology $j$.¹⁷

4.2 Nonrandom missing observations.

The conventional approach simply throws out missing observations, which amounts to assuming that those missing from the survey are just like those included. However, our reconciled probabilities in Table 7 show that someone who is employed has a 6.2% probability of being missing in the next month, whereas someone who is unemployed has an 8.7% probability. Of those making ME, MN, or MU transitions, 6.2% are unemployed, although the unemployed only comprise 4.5% of the observed E, N, or U on average.
In addition, of those making MU transitions, 65% claim that they have been searching for work longer than 4 weeks. In sum, missing individuals are more likely to be unemployed than a typical person in the observed data.

¹⁷ Krueger, Mas and Niu (2017) found that survey nonresponse is an important source of rotation-group bias. In our approach, we separately model the role of rotation-group specific nonresponses (as represented by the third column and third row of $R^{[j]}$), the effect of the nonrandom nature of nonresponse rates (as represented by the parameters $m_X$ in Section 4.2), and the contribution of changes over time in each of these factors (Section 5).

To correct for the bias coming from nonrandom missing observations, we impute a labor-force status in month $t-1$ to individuals observed to make ME, MN, or MU transitions into period $t$. Suppose that some fraction $m_E$ of those missing in month $t-1$ are just like those who were counted as employed that month in terms of their transition probabilities, while fractions $m_N$ or $m_U$ share the same transition probabilities as those counted as N or U. We regard the remaining $m_M = 1 - m_E - m_N - m_U$ as "dormant observations" in the sense of having zero probability of being recorded as E, N, or U in month $t$.¹⁸ The probabilities of observing ME, MN, and MU transitions would then be given by

$$\begin{bmatrix} \pi_{ME}^* \\ \pi_{MN}^* \\ \pi_{MU}^* \end{bmatrix} = \begin{bmatrix} \pi_{EE}^* & \pi_{NE}^* & \pi_{UE}^* \\ \pi_{EN}^* & \pi_{NN}^* & \pi_{UN}^* \\ \pi_{EU}^* & \pi_{NU}^* & \pi_{UU}^* \end{bmatrix}\begin{bmatrix} m_E \\ m_N \\ m_U \end{bmatrix}.$$

This system of equations can be solved to find $(m_E, m_N, m_U) = (0.0951, 0.0465, 0.0121)$. Our suggested correction for nonrandom missing observations is then

$$\begin{bmatrix} \pi_E^* + \pi_M^* m_E \\ \pi_N^* + \pi_M^* m_N \\ \pi_U^* + \pi_M^* m_U \end{bmatrix}.$$

¹⁸ This would include people who are in the military, incarcerated, moved away from the address, or yet to move in, for example.

4.3 Reinterpreting flows from N into longer-term unemployment.

As shown in Section 3.2, a majority of the people making NU transitions interpret their status at $t-1$ as part of an extended period of job search. Our proposal is to characterize individuals who do so as having been U rather than N at $t-1$. The average fraction of the population that is in this category is given by

$$m_N^{\sharp} = T^{-1}\sum_{t=1}^{T}\left[\frac{\sum_{\tau=5}^{99}\sum_{j \in J} y_{NU,t}^{[j]}(\tau)}{\sum_{j \in J}\left\{y_{E,t-1}^{[j-1]} + y_{N,t-1}^{[j-1]} + y_{M,t-1}^{[j-1]} + y_{U,t-1}^{[j-1]}\right\}}\right] = 0.0038. \tag{28}$$

Our adjustment subtracts $m_N^{\sharp}$ from $\pi_N^*$ and adds it to $\pi_U^*$.

In addition to the individual's own perception, this adjustment is supported by a number of other facts observed in the data. Someone who was counted as N in month $t-2$ and unemployed with duration 5 weeks or greater in month $t-1$ has a 13% probability of being employed in month $t$. This turns out to be quite close to the 15% probability that someone with a UU history in $t-2$ and $t-1$ will be employed in month $t$, consistent with our conclusion that both groups of individuals should have both been counted as U in $t-2$.¹⁹

Similar conclusions emerge from looking at longer and more detailed histories. The first column of Table 8 examines UUU continuations in months $t-3$, $t-2$, and $t-1$ for which the reported durations would be consistent with a true UUU continuation.²⁰ As we go down the rows, the history is consistent with a longer initial duration in month $t-3$. Our framework would predict that the employment probability in month $t$ would decrease as we move down the rows. This is because type 2 individuals, who have a lower probability than type 1 of becoming employed at $t$, make up a larger fraction of the pool at $t-1$ as we move down the rows.²¹ This is exactly what we observe in the data. The third column looks at individuals with an intervening N status in month $t-2$ but with the same U in $t-3$ and $t-1$ as in column 1. These probabilities also tend to decrease as we move down rows. This provides further support for our conclusion that the status of individuals in $t-2$ is similar across the two columns.²²

¹⁹ Our analysis of the importance of labor force status history in the reemployment prospect of jobless workers was inspired by Kudlyak and Lange (2018).

²⁰ For example, $U_{t-3}^{1.4}, U_{t-2}^{5.14}, U_{t-1}^{5.14}$ refers to someone who reported being newly unemployed in $t-3$ and being unemployed between 5 and 14 weeks in $t-2$ and $t-1$.

²¹ See Ahn and Hamilton (2019).

²² Our adjustments are related to those of Rothstein (2011), Elsby et al. (2011), Elsby, Hobijn, and Şahin (2015), and Farber and Valletta (2015) who reclassified all UNU as UUU. By contrast, we only classify UNU as UUU if the final U reports a duration of job search greater than 4 weeks. We classify NUN as UUX if the intervening U has duration greater than 4 weeks and where X is allocated statistically based on $m_N^{\flat}$ in (29). Kudlyak and Lange (2018) noted that UNU are similar to UUU in terms of their probability of finding a job but dissimilar in terms of the wage they subsequently earn. We acknowledge Kudlyak and Lange's conclusion that UNU have some different characteristics from UUU, and indeed our specification predicts differences even within the group of individuals who are UNU. Our predictions are confirmed by the differences observed in period $t$ employment prospects for $U_{t-3}N_{t-2}U_{t-1}$ as we move down the rows of Table 8. The question is, what is the appropriate designation of the labor-force status of these individuals during the intervening $N_{t-2}$? We base our designation on how people describe their status in $t-2$ when asked in $t-1$. We find that the individuals' own descriptions are confirmed by their objective probability of obtaining a job at $t$. We regard that objective probability as the most important confirmation of how actively they were really looking for a job. Overall, we interpret the evidence uncovered by both Elsby, Hobijn, and Şahin (2015) and Kudlyak and Lange (2018) as broadly supportive of our approach.

Additional corroboration comes from the subset of N that are designated by BLS as "marginally attached workers." These are people who say they want and are available for work or have looked for a job sometime within the last 12 months, even though they did not claim to have actively searched within the survey month. We find from the American Time Use Survey that among individuals who report positive job search time in the survey, unemployed individuals spend 143 minutes per day for job search and marginally attached workers spend 154 minutes per day on average. Marginally attached workers account for only 2.2% of those typically designated as N but represent 40% of the observed $NU^{5.+}$ transitions. Thus for a little less than half of those we retrospectively reclassify as U at date $t-1$ we have independent corroboration before date $t$ that those individuals could end up behaving like unemployed job seekers. Moreover, over 2003-2015, the marginally attached category only includes 10% of those classified as N who spend time looking for a job according to the American Time Use Survey, suggesting that the N category includes many job seekers who behave like the unemployed. Ahn and Shao (2017) and Mukoyama, Patterson, and Şahin (2018) documented that among those who spend more than zero minutes searching on a survey day, the time spent searching for a job by those categorized as N is similar to that for the unemployed. Our conclusion that significant numbers of N should instead be viewed as U thus receives support from a variety of other sources of evidence.

4.4 Recovering unemployment-continuation probabilities.

Here we describe our adjustments to estimates of unemployment-continuation probabilities.

Correcting for rotation-group bias. Recall that $\gamma_{i,UX}^{[1]}$ represents the probability that a type $i$ individual who was reported to be unemployed in rotation 1 would have reported status $X \in \{E, N, M, U\}$ in rotation 2. Our first task is to translate the answers from the rotation 2 interview technology into terms of the rotation 1 technology. This can be done by premultiplying the vector $\gamma_{i,U}^{[1]}$ by $R^{[2]}$. We would likewise premultiply the rotation 5 estimate $\gamma_{i,U}^{[5]}$ by $R^{[6]}$. For greater accuracy, in Table 3 we pooled rotation 1 with rotation 5 transitions for purposes of jointly estimating parameters.
Adjusting these pooled estimates for rotation-group bias is achieved by

$$
\gamma^*_{1,U} = \tfrac{1}{2}\left(R^{[2]} + R^{[6]}\right)\gamma_{1,U} =
\begin{pmatrix} 0.3239 \\ 0.2058 \\ 0.1038 \\ 0.3665 \end{pmatrix},
\qquad
\gamma^*_{2,U} = \tfrac{1}{2}\left(R^{[2]} + R^{[6]}\right)\gamma_{2,U} =
\begin{pmatrix} 0.1168 \\ 0.2164 \\ 0.0831 \\ 0.5837 \end{pmatrix}.
$$

The resulting adjustments for rotation-group bias are relatively minor.

Adjusting for misclassified N. We concluded that many people who are currently counted as N are better classified as U. This observation means that some UN observations could really be UU continuations. In Section 3.3 we found that the discrepancy between reported unemployment durations and observed unemployment-continuation probabilities mainly comes from $\gamma_{2,UU}$, the unemployment-continuation probability for type 2 individuals. Here we explore whether a fraction $\xi_{UN}$ of the $\gamma_{2,UN}$ transitions should be regarded as UU continuations. Since type 2 individuals account for 95% of those unemployed for 15 weeks and over (hereafter, $U^{15.+}$), we look for evidence in the observed outcomes in month $t$ of individuals who were $U^{15.+}$ in $t-2$ and N in $t-1$.

Someone with a history $U^{15.+}_{t-2} N_{t-1}$ has a 22.5% probability of being $U^{15.+}$ in $t$. Since we interpret the last two months ($N_{t-1} U^{15.+}_{t}$) of the $U^{15.+}_{t-2} N_{t-1} U^{15.+}_{t}$ sequence as a UU continuation, we are forced to interpret the first two months $U^{15.+}_{t-2} N_{t-1}$ also as a UU continuation, requiring $\xi_{UN} > 0.225$.

We also observe that someone with a $U^{15.+}_{t-2} N_{t-1}$ history has a 7.55% probability of being employed at $t$, far higher than usually observed for individuals classified as $N_{t-1}$ ($P(E_t \mid N_{t-1}) = 4.63\%$). Suppose we view $U^{15.+}_{t-2} N_{t-1}$ individuals as a mixture of two populations, with a fraction $\xi_{UN}$ having the same employment probability in month $t$ as someone who is observed to be $U^{15.+}_{t-2} U^{15.+}_{t-1}$, and the remainder with the same employment probability as someone who is truly out of the labor force in $t-1$ as represented by a history of $N_{t-2} N_{t-1}$:

$$
P(E_t \mid U^{15.+}_{t-2}, N_{t-1}) = \xi_{UN}\, P(E_t \mid U^{15.+}_{t-2}, U^{15.+}_{t-1}) + (1 - \xi_{UN})\, P(E_t \mid N_{t-2}, N_{t-1})
$$

$$
0.0755 = 0.1071\, \xi_{UN} + 0.0209\, (1 - \xi_{UN}).
$$

This equation gives an estimate of $\xi_{UN} = 0.633$.

Another way to corroborate this is as follows. On average each month, a fraction $m_N^{\sharp} = 0.0038$ of the population report $NU^{5.+}$ transitions, and we have interpreted $m_N^{\sharp}\, q_{6,NU}/(q_{5,NU} + q_{6,NU}) = 0.0028$ of these as long-term UU continuations.
In steady state, one would think that the fraction of the population with recorded NU transitions that are really long-term UU continuations should be roughly balanced by the fraction of the population with recorded UN transitions that are really long-term UU continuations.^23 The fraction of the population that transitions from long-term unemployed to N is $\pi^*_U w_2 \gamma^*_{2,UN}$, and if $\xi_{UN}$ of these are really UU continuations, the fraction of the population with observed UN that is really long-term UU is

$$
m_N^{\flat} = \pi^*_U\, w_2\, \gamma^*_{2,UN}\, \xi_{UN} = (0.0311)(1 - 0.3920)(0.2164)(0.633) = 0.0026. \tag{29}
$$

This is indeed quite close to 0.0028. This supports the inference that the true unemployment-continuation probability for type 2 individuals is $\gamma^*_{2,UU} + \xi_{UN}\gamma^*_{2,UN} = 0.7207$ with $\xi_{UN} = 0.633$.

^23 The balance is only approximate because the flows from N to U could alternatively be balanced in steady state by UMN flows, for example.

Adjusting for missing observations. Finally, recall that both the original estimate $\gamma^*_{2,UU}$ and our preferred estimate $\gamma^*_{2,UU} + \xi_{UN}\gamma^*_{2,UN}$ are continuation probabilities with UM transitions regarded as a separate status. In reality, UM transitions must be one of UE, UN, or UU. Allocating UM transitions proportionally to the observed UE, UN, and UU results in our final adjusted estimates of true unemployment-continuation probabilities:

$$
\tilde{\gamma}_{1,UU} = \frac{\gamma^*_{1,UU}}{1 - \gamma^*_{1,UM}} = 0.4089 \tag{30}
$$

$$
\tilde{\gamma}_{2,UU} = \frac{\gamma^*_{2,UU} + \xi_{UN}\gamma^*_{2,UN}}{1 - \gamma^*_{2,UM}} = 0.7860. \tag{31}
$$

The estimate $\tilde{\gamma}_{2,UU}$ is below $p_2^{4.33} = 0.89$, the value we would have expected based on reported unemployment durations. Nevertheless, the adjustment goes a long way toward reconciling perceived durations with objective continuation probabilities. One source of the remaining discrepancy between our estimate of the objective continuation probability $\tilde{\gamma}_{2,UU}$ and the perceived duration of job search $p_2$ is on-the-job search. Recall from Section 3.2 that $EU^{5.+}$ transitions account for 29% of EU observations, with many EU individuals reporting duration longer than 6 months. As noted by Kudlyak and Lange (2018), we could interpret these individuals as correctly reporting how long they have been looking for a job or looking for a better job, while still defending the estimate $\tilde{\gamma}_{2,UU}$ as a correct summary of the true probability of remaining unemployed without an intervening spell of employment.
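The arithmetic behind the mixture estimate of $\xi_{UN}$ and equations (29)-(31) can be retraced from the figures quoted in this subsection. The Python sketch below is only a consistency check on those published numbers, not the authors' code. The variable names are ours, and the ordering (E, N, M, U) of the entries of $\gamma^*_{i,U}$ is an assumption, adopted because it reproduces the reported values.

```python
# Consistency check of the Section 4.4 arithmetic, using only numbers quoted in the text.
# Assumption: entries of gamma*_{i,U} are ordered (E, N, M, U).
gamma1_star = {"E": 0.3239, "N": 0.2058, "M": 0.1038, "U": 0.3665}
gamma2_star = {"E": 0.1168, "N": 0.2164, "M": 0.0831, "U": 0.5837}

# Mixture equation 0.0755 = 0.1071*xi + 0.0209*(1 - xi), solved for xi.
p_mixture, p_after_uu, p_after_nn = 0.0755, 0.1071, 0.0209
xi_exact = (p_mixture - p_after_nn) / (p_after_uu - p_after_nn)
xi_un = round(xi_exact, 3)  # the text carries the rounded value forward

# Equation (29): share of the population with observed UN that is really long-term UU.
pi_u_star, w1 = 0.0311, 0.3920
m_flat = pi_u_star * (1 - w1) * gamma2_star["N"] * xi_un

# Type 2 continuation probability after reclassifying a share xi of UN as UU.
gamma2_uu_reclassified = gamma2_star["U"] + xi_un * gamma2_star["N"]

# Equations (30) and (31): allocate UM transitions proportionally.
gamma1_tilde = gamma1_star["U"] / (1 - gamma1_star["M"])
gamma2_tilde = gamma2_uu_reclassified / (1 - gamma2_star["M"])

print(f"xi_UN = {xi_un:.3f}, m_flat = {m_flat:.4f}")
print(f"gamma2_UU + xi*gamma2_UN = {gamma2_uu_reclassified:.4f}")
print(f"gamma1_tilde = {gamma1_tilde:.4f}, gamma2_tilde = {gamma2_tilde:.4f}")
```

Rounding $\xi_{UN}$ to three decimals before using it matters at the fourth decimal place of the downstream quantities, which is why the sketch rounds where the text does.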
A second possible source of discrepancy between $\tilde{\gamma}_{2,UU}$ and $p_2$ is that individuals are reporting not the length of a continuous spell of unemployment but instead how long it has been since their last good job (Elsby et al. (2011); Farber and Valletta (2015)). We conclude that our procedure of adjusting unemployment-continuation probabilities up, but not all the way to those implied by reported job-search durations, is the correct way to reconcile the data.

4.5 Unemployment and labor-force participation rates.

We have found that on average in month $t$ a fraction of the population $m_N^{\sharp}$ in equation (28) is reported as N but should instead be regarded as U based on their observed $N_t U^{5.+}_{t+1}$ transition. Our calculations also suggest that a fraction of the population $m_N^{\flat}$ in equation (29) is reported as N but should be regarded as U based on their previous observed $U_{t-1} N_t$ transition. There is some double-counting from individuals who would be counted in both groups as a result of $U_{t-1} N_t U^{5.+}_{t+1}$ transitions. We could get an estimate of how large this overlap is by calculating the number of people with labor-force history $U_{t-1} N_t U^{5.+}_{t+1}$ as a fraction of the total population and multiplying by $w_2 \xi_{UN}$, which product we denote $m_N^{\natural} = 0.0006$. Our estimate of the fraction of the population that is truly unemployed is then

$$
\tilde{\pi}_U = \pi^*_U + \pi^*_M m_U + m_N^{\sharp} + m_N^{\flat} - m_N^{\natural} = 0.0406. \tag{32}
$$

The corresponding adjustments for the fraction who are N or E are

$$
\tilde{\pi}_N = \pi^*_N + \pi^*_M m_N - \left(m_N^{\sharp} + m_N^{\flat} - m_N^{\natural}\right) = 0.2444 \tag{33}
$$

$$
\tilde{\pi}_E = \pi^*_E + \pi^*_M m_E = 0.4537. \tag{34}
$$

From these we calculate an adjusted unemployment rate and labor-force participation rate:

$$
\frac{\tilde{\pi}_U}{\tilde{\pi}_E + \tilde{\pi}_U} = 8.21\%, \qquad
\frac{\tilde{\pi}_E + \tilde{\pi}_U}{\tilde{\pi}_E + \tilde{\pi}_U + \tilde{\pi}_N} = 66.91\%. \tag{35}
$$

Table 9 summarizes the effects of the various adjustments. The first row reports the average unemployment rate and labor-force participation rate over our sample as reported by the BLS. The second row reports the value if we correct only for rotation-group bias, that is, the calculation from equations (32)-(34) if $m_E = m_N = m_U = m_N^{\sharp} = m_N^{\flat} = m_N^{\natural} = 0$. This adjustment alone would add half a percentage point to the unemployment rate and 1.2 percentage points to the labor-force participation rate.
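The adjusted rates in equation (35) follow directly from the three adjusted population fractions in (32)-(34). As a quick check, the following Python sketch recomputes both rates from the rounded values printed above; the variable names are ours.

```python
# Recompute the adjusted rates in equation (35) from the fractions in (32)-(34).
pi_e_tilde = 0.4537  # adjusted fraction employed, equation (34)
pi_n_tilde = 0.2444  # adjusted fraction not in the labor force, equation (33)
pi_u_tilde = 0.0406  # adjusted fraction unemployed, equation (32)

labor_force = pi_e_tilde + pi_u_tilde
unemployment_rate = pi_u_tilde / labor_force
participation_rate = labor_force / (labor_force + pi_n_tilde)

print(f"adjusted unemployment rate:   {100 * unemployment_rate:.2f}%")
print(f"adjusted participation rate:  {100 * participation_rate:.2f}%")
```

Note that the three fractions sum to less than one because the remaining share of the population stays in the missing category M, which does not enter either rate.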

The third row shows the contribution of also taking account of the nonrandom nature of missing observations (that is, allows for nonzero $m_E, m_N, m_U$), while the final row shows the effect of all three adjustments. Altogether, our adjustments add 1.9 percentage points to the unemployment rate and 2.2 percentage points to the labor-force participation rate. The last column of Table 9 shows that while rotation-group bias matters for the employment-population ratio, the ratio is unchanged after correcting for missing observations or misclassified N. Thus the employment-population ratio could be a robust measure of labor-market slack in the presence of increasing nonresponses and errors in responses in the CPS.

4.6 Unemployment duration.

Here we calculate average unemployment durations that would be consistent with our estimates of true unemployment-continuation probabilities. To do this we need an estimate of the fraction $\tilde{w}_i$ of total unemployed individuals $\tilde{\pi}_U$ that are of type $i$. For the first term in equation (32), $\pi^*_U$, we know the fraction of type $i$ from the estimate of $w_i$ from Table 3. We assume the same fraction $w_i$ could be used to impute types to the missing unemployed for the second term. The third term in (32), $m_N^{\sharp}$, is derived from observed $NU^{5.+}$ transitions, for which we estimated directly that the fraction of type 1 is given by $q_{5,NU}/(q_{5,NU} + q_{6,NU})$. The last two terms by construction come solely from type 2 individuals. We thus estimate

$$
\tilde{w}_1 = \frac{w_1\left(\pi^*_U + \pi^*_M m_U\right) + m_N^{\sharp}\, q_{5,NU}/(q_{5,NU} + q_{6,NU})}{\tilde{\pi}_U} = 0.3622
$$

and $\tilde{w}_2 = 1 - \tilde{w}_1$.

Next we adapt the weekly formulation (4) to a monthly frequency to calculate a true average duration of unemployment. If the true monthly unemployment-continuation probability for type $i$ individuals is $\tilde{\gamma}_{i,UU}$, then in steady state the average fraction of the unemployed with true duration of $n$ months would be given by

$$
\tilde{w}_1 (1 - \tilde{\gamma}_{1,UU})\, \tilde{\gamma}_{1,UU}^{\,n-1} + \tilde{w}_2 (1 - \tilde{\gamma}_{2,UU})\, \tilde{\gamma}_{2,UU}^{\,n-1}.
$$

Table 10 uses this expression to calculate the fraction of the truly unemployed $\tilde{\pi}_U$ for whom the true duration is less than 5 weeks (1 month), 5-14 weeks (2-3 months), 15-26 weeks (4-6 months) and longer than 26 weeks (7 months and over), along with the average duration.^24 Our estimate of the average duration of unemployment is only 16 weeks, about 9 weeks lower than the BLS reports.

^24 These calculations used $\tilde{w}_1 = 0.3622$, $\tilde{\gamma}_{1,UU} = 0.4089$, and $\tilde{\gamma}_{2,UU} = 0.7860$. The fraction between 5 and 14 weeks was found from $\tilde{w}_1 (1 - \tilde{\gamma}_{1,UU})(\tilde{\gamma}_{1,UU} + \tilde{\gamma}_{1,UU}^2) + \tilde{w}_2 (1 - \tilde{\gamma}_{2,UU})(\tilde{\gamma}_{2,UU} + \tilde{\gamma}_{2,UU}^2)$. The average duration in weeks is $4.33 \left[ \tilde{w}_1/(1 - \tilde{\gamma}_{1,UU}) + \tilde{w}_2/(1 - \tilde{\gamma}_{2,UU}) \right]$.

Kudlyak and Lange (2018) constructed estimates of the number of newly unemployed as a fraction of total unemployed by (1) counting all $E_{t-1} U_t$ as newly unemployed despite the duration of search reported at $t$, and (2) also counting all $N_{t-1} U_t$ as newly unemployed. Our estimate of the fraction of individuals unemployed for less than 5 weeks, 35.1%, is in between their two estimates (29.1% and 46.1%, respectively) because we designate some, but not all, of the $N_{t-1} U_t$ as unemployed at $t-1$. Their two methods produced estimates of 37.5% and 24.1%, respectively, for the fraction of unemployed with duration greater than 14 weeks, with our estimate of 33.4% again in between those two. Although their approach did not allow them to uncover the average duration of unemployment, their calculations confirm our conclusion that the BLS estimates substantially overstate the number of long-term unemployed.

5 Adjusting monthly estimates.

Up to this point in the paper we have used the entire sample to describe our interpretation of full-sample averages. In this section we show how to generalize this approach to allow for changes over time in the various sources of bias. Our principle in doing so is to start with initial estimates based on month $t$ observations alone, just as the BLS does. We then make adjustments to these estimates using some time-varying parameters $\theta_t$. One approach would be to assume that all the sources of bias that we have identified are constant over time, and simply use the full-sample estimates of parameters $\theta$ described earlier to adjust each month's observation. This approach would miss changes over time in these biases which could be important. An alternative approach would be to estimate adjustment parameters $\theta_t$ for each month separately, treating the observations for month $t$ as a completely separate sample from the others. This would add so much estimation error that any adjustments would be useless for practical purposes. Our approach uses a hybrid of the two methods. We use exponential smoothing to calculate a weighted average of recent observations through date $t$ to infer how the adjustment parameters $\theta_t$ are changing over time. If $\hat{\theta}_t$ denotes an estimate using observations from month $t$ alone, we calculate

$$
\bar{\theta}_t = (1 - \lambda)\hat{\theta}_t + \lambda \bar{\theta}_{t-1}.
$$

This constructs $\bar{\theta}_t$ as a weighted average of values of $\hat{\theta}_t$ through date $t$ with most recent values given the biggest weight. When $\lambda = 0$ this would correspond to using only date $t$ data to calculate $\bar{\theta}_t$, whereas when $\lambda = 1$ it yields the full-sample estimate $\theta$ when started at $\bar{\theta}_1 = \theta$. We choose the weighting parameter $\lambda$ close to unity so that plots of $\bar{\theta}_t$ evolve smoothly over time, eliminating high-frequency measurement error but still capturing longer-run trends. Note that this smoothing is used only to calculate the adjustment parameters and is not applied to the primary data that forms the initial estimate for any month $t$.

Rotation-group bias parameters. We first summarize how the rotation-bias parameters $\theta^{[j]}_{EM}$, $\theta^{[j]}_{NU}$, and $\theta^{[j]}_{NM}$ have changed over time. Our first step is to construct weighted moving averages of the counts of individuals in each labor-force status in each rotation,

$$
\bar{y}^{[j]}_{X,t} = (1 - \lambda)\, y^{[j]}_{X,t} + \lambda\, \bar{y}^{[j]}_{X,t-1},
$$

where $y^{[j]}_{X,t}$ denotes the observed weighted number of individuals reporting labor status $X \in \{E, N, M, U\}$ in rotation $j$ in month $t$. We set $\lambda = 0.98$, which means that observations 3 years prior to $t$ receive half the weight of observation $t$ in determining the smoothed count $\bar{y}^{[j]}_{X,t}$.^25 We then calculated the corresponding smoothed fractions as

$$
\bar{\pi}^{[j]}_{X,t} = \bar{y}^{[j]}_{X,t} \Big/ \left\{ \bar{y}^{[j]}_{E,t} + \bar{y}^{[j]}_{N,t} + \bar{y}^{[j]}_{M,t} + \bar{y}^{[j]}_{U,t} \right\}.
$$

^25 That is, $0.98^{36} = 0.48$. We started the recursion by setting $\bar{y}^{[j]}_{X,1} = (1/36) \sum_{t=1}^{36} y^{[j]}_{X,t}$, the average of the first three years of observations.
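The smoothing recursion and its start-up rule are simple enough to sketch in a few lines. The Python function below is a minimal illustration under stated assumptions, not the authors' code: the function name and arguments are ours, the input is any list of monthly values, and the first smoothed value is set to the average of the first 36 observations as described in footnote 25.

```python
def exponential_smooth(raw, lam=0.98, init_window=36):
    """Smooth a monthly series with s[t] = (1 - lam) * raw[t] + lam * s[t-1].

    The recursion is started at the mean of the first `init_window` raw values
    (fewer if the series is shorter), mirroring the start-up rule in the text.
    """
    if not raw:
        raise ValueError("raw series must be non-empty")
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    n_init = min(init_window, len(raw))
    smoothed = [sum(raw[:n_init]) / n_init]
    for value in raw[1:]:
        smoothed.append((1.0 - lam) * value + lam * smoothed[-1])
    return smoothed


# Weight on an observation 36 months back, relative to the current month.
relative_weight_36 = 0.98 ** 36

# Hypothetical series: a level shift from 1.0 to 2.0 after 60 months.
example = exponential_smooth([1.0] * 60 + [2.0] * 60)
print(f"relative weight after 36 months: {relative_weight_36:.2f}")
print(f"smoothed value 36 months after the shift: {example[95]:.3f}")
```

With $\lambda = 0.98$ the smoothed series closes only about half of a permanent level shift after three years, which is the sense in which the adjustment parameters are assumed to change slowly.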

From these we calculated time-varying rotation-bias parameters as

$$
\hat{\theta}^{[j]}_{EM,t} = \max\left\{ 1 - \bar{\pi}^{[1]}_{E,t} \big/ \bar{\pi}^{[j]}_{E,t},\; 0 \right\}
$$

$$
\hat{\theta}^{[j]}_{NU,t} = \max\left\{ \left( \bar{\pi}^{[1]}_{U,t} - \bar{\pi}^{[j]}_{U,t} \right) \big/ \bar{\pi}^{[j]}_{N,t},\; 0 \right\}
$$

$$
\hat{\theta}^{[j]}_{NM,t} = \max\left\{ 1 - \hat{\theta}^{[j]}_{NU,t} - \bar{\pi}^{[1]}_{N,t} \big/ \bar{\pi}^{[j]}_{N,t},\; 0 \right\},
$$

and exponentially smoothed these as well. For example, $\bar{\theta}^{[j]}_{EM,t} = (1 - \lambda)\hat{\theta}^{[j]}_{EM,t} + \lambda \bar{\theta}^{[j]}_{EM,t-1}$.

The resulting series for $\bar{\theta}^{[j]}_{EM,t}$, $\bar{\theta}^{[j]}_{NU,t}$, and $\bar{\theta}^{[j]}_{NM,t}$ are plotted in Figure 9. The value of $\bar{\theta}^{[j]}_{EM,t}$, which characterizes the tendency to record people as E in rotation $j$ who would have been M in rotation 1, has fallen somewhat over time. By contrast, $\bar{\theta}^{[j]}_{NU,t}$, which governs the tendency of people who would have been counted as U in earlier rotations to be designated as N in later rotations, has increased over time. The third parameter, $\bar{\theta}^{[j]}_{NM,t}$, which characterizes the tendency of someone who would have been counted as M in rotation 1 to be counted as N in later rotations, has not changed much over time.

Correcting for rotation-group bias. With these values we can calculate that a vector of observed probabilities for those in rotation $j$ in month $t$, $\pi^{[j]}_t$, would have been reported as $R^{[j]}_t \pi^{[j]}_t$ if the people had been interviewed using the rotation 1 interview technology instead of in rotation $j$, where

$$
R^{[j]}_t =
\begin{pmatrix}
1 - \bar{\theta}^{[j]}_{EM,t} & 0 & 0 & 0 \\
0 & 1 - \bar{\theta}^{[j]}_{NM,t} - \bar{\theta}^{[j]}_{NU,t} & 0 & 0 \\
\bar{\theta}^{[j]}_{EM,t} & \bar{\theta}^{[j]}_{NM,t} & 1 & 0 \\
0 & \bar{\theta}^{[j]}_{NU,t} & 0 & 1
\end{pmatrix}.
$$

Let $\Pi^*_t$ denote the counterfactual $(4 \times 4)$ matrix of transition probabilities if all individuals in months $t-1$ and $t$ had been interviewed using the interview technology of rotation 1 in both months. This counterfactual $\Pi^*_t$ implies a predicted value for the observed $\Pi^{[j]}_t$ given by $(R^{[j]}_t)^{-1} \Pi^*_t R^{[j-1]}_{t-1}$. We therefore chose $\Pi^*_t$ so as to make the elements in the following equations as small as possible:

$$
\Pi^{[j]}_t - (R^{[j]}_t)^{-1}\, \Pi^*_t\, R^{[j-1]}_{t-1} \quad \text{for } j \in J = \{2,3,4\} \cup \{6,7,8\}. \tag{36}
$$
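To make the role of $R^{[j]}_t$ concrete, the Python sketch below builds the three bias parameters and the matrix from a pair of status distributions and confirms that premultiplying the rotation-$j$ vector by the matrix reproduces the rotation-1 vector. The two distributions are hypothetical numbers chosen for illustration, the ordering (E, N, M, U) follows the layout of the matrix above, and the function names are ours. The sketch uses unsmoothed inputs, so it illustrates only the algebra, not the smoothing step.

```python
def rotation_bias_params(pi_rot1, pi_rotj):
    """Return (theta_EM, theta_NU, theta_NM) from status shares ordered (E, N, M, U)."""
    e1, n1, _, u1 = pi_rot1
    ej, nj, _, uj = pi_rotj
    theta_em = max(1.0 - e1 / ej, 0.0)
    theta_nu = max((u1 - uj) / nj, 0.0)
    theta_nm = max(1.0 - theta_nu - n1 / nj, 0.0)
    return theta_em, theta_nu, theta_nm


def rotation_matrix(theta_em, theta_nu, theta_nm):
    """Matrix that re-expresses rotation-j shares in rotation-1 terms."""
    return [
        [1.0 - theta_em, 0.0, 0.0, 0.0],
        [0.0, 1.0 - theta_nm - theta_nu, 0.0, 0.0],
        [theta_em, theta_nm, 1.0, 0.0],
        [0.0, theta_nu, 0.0, 1.0],
    ]


def mat_vec(matrix, vector):
    """Plain matrix-vector product for lists of floats."""
    return [sum(a * x for a, x in zip(row, vector)) for row in matrix]


# Hypothetical shares (E, N, M, U); later rotations show more E and N, fewer M and U.
pi_rot1 = [0.45, 0.25, 0.26, 0.04]
pi_rotj = [0.46, 0.27, 0.235, 0.035]

theta = rotation_bias_params(pi_rot1, pi_rotj)
R = rotation_matrix(*theta)
translated = mat_vec(R, pi_rotj)
print("theta_EM, theta_NU, theta_NM:", [round(x, 4) for x in theta])
print("R @ pi_rotj:", [round(x, 4) for x in translated])
```

Every column of the matrix sums to one, so the translation only reallocates probability mass across statuses and never creates or destroys it.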

We likewise let $\pi^*_t$ denote the $(4 \times 1)$ vector of unconditional probabilities if all individuals were interviewed in month $t$ using technology 1. Note these satisfy the accounting identity $\Pi^*_t \pi^*_{t-1} = \pi^*_t$. Thus if we had an estimate of $\pi^*_{t-1}$ for the previous period, we would predict a value for $\pi^{[1]}_t$ of $\pi^*_t = \Pi^*_t \pi^*_{t-1}$ and predict a value for $\pi^{[5]}_t$ of $(R^{[5]}_t)^{-1} \Pi^*_t \pi^*_{t-1}$. This means that $\Pi^*_t$ should also have the property that it makes all the terms in the following equations small as well:

$$
\pi^{[1]}_t - \Pi^*_t \pi^*_{t-1} \tag{37}
$$

$$
\pi^{[5]}_t - (R^{[5]}_t)^{-1}\, \Pi^*_t\, \pi^*_{t-1}. \tag{38}
$$

Our procedure is to proceed iteratively through the data. We set the initial value of $\pi^*_t$ for observation $t = 1$ as $\pi^*_1 = \pi^{[1]}_1$. For each $t = 2, 3, \ldots$ we choose the 16 elements of $\Pi^*_t$ so as to minimize the sum of squares of the 104 terms in (36)-(38) (16 terms for each of the six rotations in (36), plus 4 terms each in (37) and (38)) subject to the constraints that each element of $\Pi^*_t$ is between 0 and 1 and each column of $\Pi^*_t$ sums to 1. Given $\Pi^*_t$ we then calculate

$$
\pi^*_t = \Pi^*_t \pi^*_{t-1}
$$

and proceed to the next observation $t+1$.

Note there is no smoothing involved in our estimate of $\Pi^*_t$, which is based solely on the observed values of $\pi^{[1]}_t$, $\pi^{[5]}_t$, and $\Pi^{[j]}_t$ for $j \in J = \{2,3,4\} \cup \{6,7,8\}$. The smoothing was only used for purposes of calculating $R^{[j]}_t$ under the assumption that changes in the interview technology occur slowly over time.

Correcting for missing observations. From the reconciled transition probabilities $\Pi^*_t$ we next calculate estimates of the fraction of the M observations for date $t-1$ that, based on their observed status as E, N, or U at date $t$, should be imputed to E, N, or U at date $t-1$. We do this by finding the values of $\hat{m}_{E,t-1}$, $\hat{m}_{N,t-1}$, and $\hat{m}_{U,t-1}$ that solve

$$
\begin{pmatrix} \pi^*_{ME,t} \\ \pi^*_{MN,t} \\ \pi^*_{MU,t} \end{pmatrix}
=
\begin{pmatrix}
\pi^*_{EE,t} & \pi^*_{NE,t} & \pi^*_{UE,t} \\
\pi^*_{EN,t} & \pi^*_{NN,t} & \pi^*_{UN,t} \\
\pi^*_{EU,t} & \pi^*_{NU,t} & \pi^*_{UU,t}
\end{pmatrix}
\begin{pmatrix} \hat{m}_{E,t-1} \\ \hat{m}_{N,t-1} \\ \hat{m}_{U,t-1} \end{pmatrix}.
$$
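The system above is three linear equations in three unknowns, so it can be solved directly once the transition probabilities are in hand. The following Python sketch uses a small Gauss-Jordan routine to do so. All transition probabilities in it are hypothetical values invented for illustration (they are not estimates from the paper), and the function name is ours.

```python
def solve_linear_system(coeffs, rhs):
    """Solve coeffs @ x = rhs by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    aug = [[float(v) for v in row] + [float(b)] for row, b in zip(coeffs, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("coefficient matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(n):
            if row != col:
                factor = aug[row][col] / aug[col][col]
                aug[row] = [a - factor * b for a, b in zip(aug[row], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


# Hypothetical transition probabilities into E, N, U (rows) from E, N, U (columns).
transitions = [
    [0.90, 0.03, 0.20],  # into E
    [0.02, 0.86, 0.20],  # into N
    [0.01, 0.02, 0.50],  # into U
]
# Hypothetical probabilities of ME, MN, MU transitions.
from_missing = [0.0935, 0.0470, 0.0070]

m_e, m_n, m_u = solve_linear_system(transitions, from_missing)
m_dormant = 1.0 - m_e - m_n - m_u
print(f"m_E = {m_e:.4f}, m_N = {m_n:.4f}, m_U = {m_u:.4f}, dormant = {m_dormant:.4f}")
```

The residual share `m_dormant` corresponds to the "dormant observations" that have no chance of being recorded as E, N, or U in the following month.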

We also smooth these as

$$
\bar{m}_{X,t} = (1 - \lambda)\hat{m}_{X,t} + \lambda \bar{m}_{X,t-1}.
$$

The $\hat{m}_{X,t}$ parameters have more high-frequency movement than terms like $\hat{\theta}_{EM,t}$. We accordingly use a shorter effective window by setting $\lambda = 0.97$, which gives observations from 2 years ago half the weight of current observations for purposes of calculating $\bar{m}_{X,t}$. The resulting values of $\bar{m}_{X,t}$ are plotted in the first three panels of Figure 10. Both $\bar{m}_{N,t}$ and $\bar{m}_{E,t}$ rise over time, while $\bar{m}_{U,t}$ is highly countercyclical without exhibiting a particular trend. The secular rise in $\bar{m}_{N,t}$ and $\bar{m}_{E,t}$ suggests that the upward trend in missing individuals likely comes from N and E. The countercyclical behavior of $\bar{m}_{U,t}$ tells us that unemployed individuals are more likely to be missed during a weak labor market.

Correcting for transitions from N to long-term unemployment. We proposed that individuals who reported status N in month $t-1$ and reported in month $t$ that they were unemployed and had been looking for work for longer than 4 weeks should be counted as U rather than N in month $t-1$. The fraction of the population for whom this is observed to be the case in month $t-1$ is given by

$$
m^{\sharp}_{N,t-1} =
\frac{\sum_{\tau=5}^{99} \sum_{j \in J} y^{[j]}_{N,U,t}(\tau)}
{\sum_{j \in J} \left\{ y^{[j-1]}_{E,t-1} + y^{[j-1]}_{N,t-1} + y^{[j-1]}_{M,t-1} + y^{[j-1]}_{U,t-1} \right\}}.
$$

Estimates of monthly transition probabilities. We do not model changes over time in the number-preference parameters $\theta_A$ but hold these fixed at the full-sample estimates reported in Table 3 for all months $t$.^26 However, we estimated all the other parameters $\theta_t$ in Table 3 by maximizing (12) separately for each month $t$ and smoothed these using $\lambda = 0.9$. Figure 10 plots some of the key magnitudes of interest. The fractions of NU and MU transitions that individuals perceive as continuations of long-term unemployment ($\bar{q}_{6,NU,t}$ and $\bar{q}_{6,MU,t}$) rose sharply during the Great Recession and have been slow to return to their historical averages.
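Three different smoothing weights appear in this section ($\lambda = 0.98$, $0.97$, and $0.9$). One way to compare them is by the lag at which an observation's weight falls to half that of the current month, which solves $\lambda^k = 0.5$. The short Python sketch below computes that lag; the function name is ours, and the calculation is an interpretive aid rather than part of the estimation.

```python
import math


def half_life_months(lam):
    """Lag k (in months) at which lam**k equals one half."""
    if not 0.0 < lam < 1.0:
        raise ValueError("lam must lie strictly between 0 and 1")
    return math.log(0.5) / math.log(lam)


for lam in (0.98, 0.97, 0.90):
    print(f"lambda = {lam:.2f}: half-life of about {half_life_months(lam):.1f} months")
```

The first two values line up with the "3 years" and "2 years" descriptions in the text, while $\lambda = 0.9$ implies a much shorter memory of roughly half a year.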
Both the perceived weekly UU continuation probability for type 1 individuals $\bar{p}_{1t}$ and the objective monthly probability $\bar{\gamma}_{1,UU,t}$ react to seasonal hiring, consistent with the high seasonality in unadjusted short-term unemployment, and both fell during the Great Recession.^27 For type 2 individuals, there is a time trend in perceived $\bar{p}_{2t}$ that is not fully matched by that for the objective probability $\bar{\gamma}_{2,UU,t}$, though both increased significantly in the Great Recession and were slow to come down afterward. The fraction $\bar{w}_{2t}$ of type 2 workers among the reported unemployed rose through 2011 and has been slowly declining since.

^26 We also estimated $\theta_{A,t}$ separately for each month, and found no significant trend or cyclical component in $\theta_{A,t}$. Keeping $\theta_{A,t}$ fixed at $\theta_A$ reduces noise and measurement error and does not affect substantive conclusions.

^27 The feature of the data that gives rise to this conclusion is the observation that individuals with unemployment durations of 5-14 weeks were much more likely to remain unemployed during the Great Recession, meaning that more type 1 individuals must have exited unemployment before entering this group. One possible interpretation is that individuals would only voluntarily quit their job in this episode if they knew they could get another job quickly. A drop in $p_{1t}$ during the Great Recession was also found by Ahn and Hamilton (2019, Figure 4 and Table 1). They found that this feature was unique to the Great Recession and was not seen in other recessions.

Estimates of unemployment rate and labor-force participation rate. We construct monthly estimates of (29), the fraction of the population with reported UN who are better interpreted as long-term UU, from

$$
m^{\flat}_{Nt} = \pi^*_{Ut}\, \bar{w}_{2t}\, \bar{\gamma}_{2,UN,t}\, \xi_{UN}
$$

where we fix $\xi_{UN} = 0.633$ at the full-sample average.^28 We calculate $k^{\natural}$, the fraction of $m^{\sharp}_{Nt} + m^{\flat}_{Nt}$ that comes from double-counting the same individuals, from our full-sample estimate of that fraction:

$$
k^{\natural} = \frac{m^{\natural}_N}{m^{\sharp}_N + m^{\flat}_N} = \frac{0.0006}{0.0038 + 0.0026} = 0.094
$$

giving rise to the monthly estimate $m^{\natural}_{Nt} = k^{\natural}(m^{\sharp}_{Nt} + m^{\flat}_{Nt})$. Our final estimates that correct for rotation-group bias, non-randomly missing observations, and misclassified N are then

$$
\begin{pmatrix}
\tilde{\pi}_{E,t-1} \\ \tilde{\pi}_{N,t-1} \\ \tilde{\pi}_{M,t-1} \\ \tilde{\pi}_{U,t-1}
\end{pmatrix}
=
\begin{pmatrix}
\pi^*_{E,t-1} + \pi^*_{M,t-1}\, \bar{m}_{E,t-1} \\
\pi^*_{N,t-1} + \pi^*_{M,t-1}\, \bar{m}_{N,t-1} - m^{\sharp}_{N,t-1} - m^{\flat}_{N,t-1} + m^{\natural}_{N,t-1} \\
\pi^*_{M,t-1} \left( 1 - \bar{m}_{E,t-1} - \bar{m}_{N,t-1} - \bar{m}_{U,t-1} \right) \\
\pi^*_{U,t-1} + \pi^*_{M,t-1}\, \bar{m}_{U,t-1} + m^{\sharp}_{N,t-1} + m^{\flat}_{N,t-1} - m^{\natural}_{N,t-1}
\end{pmatrix}.
$$

^28 We obtained similar results allowing $\xi_{UN,t}$ to change over time.

Our adjusted estimates of the unemployment rate and labor-force participation rate are

$$
\tilde{u}_t = \tilde{\pi}_{U,t} / (\tilde{\pi}_{E,t} + \tilde{\pi}_{U,t})
$$

$$
\tilde{\ell}_t = (\tilde{\pi}_{E,t} + \tilde{\pi}_{U,t}) / (\tilde{\pi}_{E,t} + \tilde{\pi}_{N,t} + \tilde{\pi}_{U,t}).
$$

Note that these are all seasonally unadjusted magnitudes in order to preserve all the accounting identities associated with observed transitions. To relate these to the usually reported magnitudes, we plotted seasonally adjusted values^29 for these rates in Panels C and D of Figure 1. In the top panel of Figure 11, we compare our adjusted estimate $\tilde{u}_t$ (in dotted blue) with three different unemployment rates reported by the BLS: the usual U3 unemployment rate (solid black), U5 unemployment (dashed red), which includes discouraged workers and all other marginally attached workers, and U6 unemployment (dashed green), which adds people who are employed part time for economic reasons. Our adjustment includes more individuals than U5, but far fewer than U6. The contributions of each of our adjustments to the monthly series for unemployment and labor-force participation are summarized in Figure A-2 in the online appendix.

^29 These were calculated using the X11 instruction in RATS.

Estimates of monthly continuation probabilities. We calculate reconciled monthly unemployment-continuation probabilities from $\tilde{\gamma}_{1,UU,t} = \bar{\gamma}_{1,UU,t} / (1 - \bar{\gamma}_{1,UM,t})$ and

$$
\tilde{\gamma}_{2,UU,t} = \frac{\bar{\gamma}_{2,UU,t} + \xi_{UN}\, \bar{\gamma}_{2,UN,t}}{1 - \bar{\gamma}_{2,UM,t}}.
$$

Our monthly estimate of the fraction $\tilde{w}_{1,t}$ of type 1 workers among all unemployed is

$$
\tilde{w}_{1,t} = \frac{\bar{w}_{1,t} \left( \pi^*_{U,t} + \pi^*_{M,t}\, \bar{m}_{U,t} \right) + m^{\sharp}_{N,t}\, \bar{q}_{5,NU,t} / (\bar{q}_{5,NU,t} + \bar{q}_{6,NU,t})}{\tilde{\pi}_{U,t}}.
$$

The adjusted estimates $\tilde{\gamma}_{2,UU,t}$ and $\tilde{w}_{2t}$ are plotted as dashed red lines in Figure 10. Our estimate of the true monthly continuation probability averaged across all individuals who are truly unemployed is

$$
\tilde{w}_{1,t}\, \tilde{\gamma}_{1,UU,t} + \tilde{w}_{2,t}\, \tilde{\gamma}_{2,UU,t},
$$

which is the series plotted as the dotted blue line in Panel A of Figure 1.

Estimates of new flows into unemployment. We estimate that a fraction $\tilde{w}_{it} \tilde{\pi}_{Ut}$ of the population are truly unemployed of type $i \in \{1, 2\}$ in month $t$. Of these, a fraction $\tilde{\gamma}_{i,UU,t+1}$ are still unemployed the next month, giving rise to

$$
\tilde{V}_{i,t+1} = \tilde{w}_{i,t+1}\, \tilde{\pi}_{U,t+1} - \tilde{\gamma}_{i,UU,t+1}\, \tilde{w}_{it}\, \tilde{\pi}_{Ut} \tag{39}
$$

as an estimate of the number of individuals of type $i$ who are newly unemployed in month $t+1$ and $\tilde{V}_{t+1} = \tilde{V}_{1,t+1} + \tilde{V}_{2,t+1}$ as the total number of newly unemployed. This series is plotted as the dotted blue line in the bottom panel of Figure 11 along with several alternative estimates. Shimer (2012) and other researchers have estimated unemployment inflows from the number of unemployed with reported durations of less than 5 weeks, shown in dashed red as a percent of the civilian population. Others like Fujita and Ramey (2012) base their calculation on the number of EU and NU transitions among those with two consecutive months of nonmissing observations,

$$
\hat{V}_t = \frac{\sum_{j \in J} \left\{ y^{[j]}_{E,U,t} + y^{[j]}_{N,U,t} \right\}}
{\sum_{j \in J} \left\{ y^{[j]}_{E,E,t} + y^{[j]}_{E,N,t} + y^{[j]}_{E,U,t} + y^{[j]}_{N,E,t} + y^{[j]}_{N,N,t} + y^{[j]}_{N,U,t} + y^{[j]}_{U,E,t} + y^{[j]}_{U,N,t} + y^{[j]}_{U,U,t} \right\}}, \tag{40}
$$

shown as the solid turquoise line. The Shimer estimate is significantly below the Fujita-Ramey estimate because the latter includes $EU^{5.+}$ and $NU^{5.+}$ transitions. Our estimate is above $\hat{V}_t$. The biggest single reason for this is rotation-group bias, which causes flows into unemployment as calculated from the numerator of (40) to be smaller than flows out of unemployment even in months when the measured unemployment rate is constant or even rising. One can see the effect of rotation-group bias by replacing $\sum_{j \in J} y^{[j]}_{X_1,X_2,t}$ in (40) by the estimate $\pi^*_{X_1,t-1}\, \pi^*_{X_1,X_2,t}$. This corrects the calculation for rotation-group bias but makes no other adjustments. The resulting series $\hat{V}^*_t$ is shown as the dashed green line in Figure 11, which is much higher than the estimate $\hat{V}_t$ from (40). Our fully adjusted series $\tilde{V}_t$ makes a number of other adjustments that can either increase or decrease the estimate relative to $\hat{V}^*_t$. We exclude $NU^{5.+}$ transitions because we see them as continuing spells of unemployment, which lowers the estimate of $V$.
But we also adjust the estimate up as a result of our treatment of missing observations. On average $\tilde{V}_t$ is above $\hat{V}^*_t$, but rotation-group bias is the biggest single problem with $\hat{V}_t$. Finally, we note that the BLS also publishes estimates of the number of EU and NU flows that are consistent with observed stocks of E, N and U. Their series (shown in black) adjusts the data in the direction of our estimates (that is, it is above $\hat{V}_t$) but is lower than an adjustment that only corrects for rotation-group bias (the BLS estimate is below $\hat{V}^*_t$). The relation between our adjustments and those of BLS is discussed further in Appendix E.

Estimates of average duration of unemployment. Let $\tilde{V}_{i,t-d+1}$ denote the number of newly unemployed of type $i$ at $t-d+1$ as calculated in (39). If a fraction $\tilde{\gamma}_{i,UU,t-d+2}$ are still unemployed at $t-d+2$, then the number unemployed for exactly $d$ months as of month $t$ would be given by^30

$$
\tilde{U}^d_{it} = \tilde{V}_{i,t-d+1}\, \tilde{\gamma}_{i,UU,t-d+2} \cdots \tilde{\gamma}_{i,UU,t-2}\, \tilde{\gamma}_{i,UU,t-1}\, \tilde{\gamma}_{i,UU,t}. \tag{41}
$$

This implies an average unemployment duration of those who are unemployed in month $t$ of

$$
\tilde{d}_t = \frac{\sum_{d=1}^{D} d \left( \tilde{U}^d_{1t} + \tilde{U}^d_{2t} \right)}{\sum_{d=1}^{D} \left( \tilde{U}^d_{1t} + \tilde{U}^d_{2t} \right)}.
$$

Because $d$ is measured in months, multiplying by 4.33 gives the unemployment duration in weeks plotted as the blue dotted lines in Panel E of Figure 1. Our series is much lower on average and less cyclically variable than the BLS measure in black.

^30 Appendix E compares our estimates of the number of long-term unemployed with the number of people who received extended unemployment benefits after regular benefits were exhausted.

6 Conclusion.

The data underlying the CPS contain multiple internal inconsistencies. These include the facts that people's answers change the more times they are asked the same question, stock estimates are inconsistent with flow estimates, missing observations are not random, reported unemployment durations are inconsistent with reported labor-force histories, and people prefer to report some numbers over others. Ours is the first paper to attempt a unified reconciliation of these issues. We conclude that the U.S. unemployment rate and labor-force participation rate are higher than conventionally reported while the average duration of unemployment is considerably lower.

References

Abowd, John M., and Arnold Zellner (1985). "Estimating Gross Labor-Force Flows." Journal of Business and Economic Statistics 3, no. 3: 254-283.

Abraham, Katharine G., and Robert Shimer (2001). "Changes in Unemployment Duration and Labor Force Attachment," NBER Working Paper 8513.

Ahn, Hie Joo, and James D. Hamilton (2019). "Heterogeneity and Unemployment Dynamics." Journal of Business and Economic Statistics, forthcoming.

Ahn, Hie Joo, and Ling Shao (2017). "Precautionary On-the-Job Search over the Business Cycle," Federal Reserve Finance and Economics Discussion Series 2017-025.
Bailar, Barbara A. (1975). "The Effects of Rotation Group Bias on Estimates from Panel Surveys." Journal of the American Statistical Association 70: 23-30.
Baker, Michael (1992). "Digit preference in CPS unemployment data," Economics Letters, 39(1):117-121.
Biemer, Paul P., and John M. Bushery (2000). "On the Validity of Markov Latent Class Analysis for Estimating Classification Error in Labor Force Data." Survey Methodology 26: 139-152.
Elsby, Michael W. L., Bart Hobijn, and Ayşegül Şahin (2010). "The Labor Market in the Great Recession," Brookings Papers on Economic Activity, Spring 2010: 1-56.
Elsby, Michael W. L., Bart Hobijn, and Ayşegül Şahin (2015). "On the Importance of the Participation Margin for Labor Market Fluctuations," Journal of Monetary Economics 72: 64-82.
Elsby, Michael W. L., Bart Hobijn, Ayşegül Şahin, and Robert G. Valletta (2011). "The Labor Market in the Great Recession - An Update to September 2011," Brookings Papers on Economic Activity, Fall 2011: 353-371.
Elsby, Michael W. L., Ryan Michaels, and Gary Solon (2009). "The Ins and Outs of Cyclical Unemployment," American Economic Journal: Macroeconomics, 1(1): 84-110.
Farber, Henry S., and Robert G. Valletta (2015). "Do Extended Unemployment Benefits Lengthen Unemployment Spells? Evidence from Recent Cycles in the US Labor Market," Journal of Human Resources 50: 873-909.
Feng, Shuaizhang, and Yingyao Hu (2013). "Misclassification Errors and the Underestimation of the US Unemployment Rate." American Economic Review 103: 1054-70.
Fujita, Shigeru, and Garey Ramey (2009). "The Cyclicality of Separation and Job Finding Rates," International Economic Review, 50(2):415-430.
Hall, Robert E., and Marianna Kudlyak (2019). "Job-Finding and Job-Losing: A Comprehensive Model of Heterogeneous Individual Labor-Market Dynamics," NBER Working Paper 25625.
Halpern-Manners, Andrew, and John Robert Warren (2012). "Panel Conditioning in Longitudinal Studies: Evidence from Labor Force Items in the Current Population Survey." Demography 49, no. 4: 1499-1519.
Hamilton, James D. (1994). Time Series Analysis. Princeton: Princeton University Press.
Krueger, Alan B., Alexandre Mas, and Xiaotong Niu (2017). "The Evolution of Rotation Group Bias: Will the Real Unemployment Rate Please Stand Up?" Review of Economics and Statistics 99: 258-264.
Kudlyak, Marianna, and Fabian Lange (2018). "Measuring Heterogeneity in Job Finding Rates Among the Nonemployed Using Labor Force Status Histories." Working paper, Federal Reserve Bank of San Francisco.
Madrian, Brigitte C., and Lars John Lefgren (2000). "An approach to longitudinally matching Current Population Survey (CPS) respondents," Journal of Economic and Social Measurement 26, no. 1: 31-62.
Meyer, Bruce D., Wallace K. C. Mok, and James X. Sullivan (2015). "Household Surveys in Crisis." Journal of Economic Perspectives, 29(4):199-226.
Nekarda, Christopher J. (2009). "A longitudinal analysis of the current population survey: Assessing the cyclical bias of geographic mobility." Federal Reserve Board of Governors.
Rothstein, Jesse (2011). "Unemployment Insurance and Job Search in the Great Recession," Brookings Papers on Economic Activity, Fall 2011: 143-196.
Ryu, Hang K., and Daniel J. Slottje (2000). "Estimating the density of unemployment duration based on contaminated samples or small samples," Journal of Econometrics, 95(1):131-156.
Shibata, Ippei (2016). "Labor Market Dynamics: A Hidden Markov Approach." Working paper, IMF.
Shimer, Robert (2012). "Reassessing the Ins and Outs of Unemployment," Review of Economic Dynamics, 15(2):127-148.
Solon, Gary (1986). "Effects of Rotation Group Bias on Estimation of Unemployment." Journal of Business & Economic Statistics 4: 105-109.
Torelli, Nicola, and Ugo Trivellato (1993). "Modelling Inaccuracies in Job-search Duration Data." Journal of Econometrics 59, no. 1-2: 187-211.
Van den Berg, Gerald J., and Bas van der Klaauw (2001). "Combining Micro and Macro Unemployment Duration Data," Journal of Econometrics 102: 271-309.

Table 1. Parameters estimated separately for rotation 1, rotation 5, and NX, EX and MX transitions from rotation 1 to rotation 2.

                  [1]      [2]      [3]      [4]      [5]      [6]      [7]      [8]      [9]      [10]
                  rotation std      rotation std      NX       std      EX       std      MX       std
param             1 only   error    5 only   error    only     error    only     error    only     error
p_1               0.8271   0.0037   0.8272   0.0024   0.7556   0.0096   0.7541   0.0148   0.8338   0.0062
p_2               0.9738   0.0026   0.9735   0.0026   0.9746   0.0022   0.9687   0.0035   0.9744   0.0025
(cid:4)_1         0.4243   0.0455   0.4009   0.0484
π_E               0.4256   …        0.4215   …
π_N               0.2358   0.0054   0.2475   0.0055
π_M               0.3075   0.0037   0.3030   0.0031
π_U               0.0311   0.0027   0.0280   0.0024
π_XE                                                  0.0386   0.0016   0.8902   0.0030   0.1266   0.0029
π_XN                                                  0.8765   0.0006   0.0317   0.0004   0.0649   0.0030
π_XM                                                  0.0594   …        0.0647   …        0.7979   …
π_XU                                                  0.0254   0.0016   0.0134   0.0006   0.0105   0.0007
q_1                                                   0.0920   …        0.2145   …        0.0882   …
q_2                                                   0.0779   0.0057   0.1911   0.0139   0.1011   0.0095
q_3                                                   0.0805   0.0052   0.1768   0.0083   0.0784   0.0054
q_4                                                   0.0530   0.0031   0.1236   0.0095   0.0826   0.0058
q_5                                                   0.1883   0.0210   0.1204   0.0032   0.2199   0.0226
q_6                                                   0.5082   0.0433   0.1736   0.0148   0.4298   0.0503
q_5 + q_6                                             0.6965   …        0.2940   …        0.6497   …
θ_{(cid:18),1}    0.1227   0.0019   0.1305   0.0074   0.2063   0.0288   0.0930   0.0491   0.0424   0.0336
θ_{(cid:18),2}    0.7735   0.0027   0.7385   0.0030   0.7545   0.0060   0.7194   0.0183   0.7400   0.0079
θ_{(cid:18),3}    0.4835   0.0097   0.4571   0.0088   0.4894   0.0166   0.3767   0.0352   0.5150   0.0261
θ_{(cid:18),4}    0.9268   0.0035   0.8775   0.0071   0.8562   0.0113   0.8260   0.0261   0.8582   0.0107
θ_{(cid:18),5}    0.7219   0.0158   0.6790   0.0120   0.7080   0.0166   0.6891   0.0413   0.7718   0.0367
θ_{(cid:18),6}    0.9254   0.0084   0.9028   0.0038   0.8740   0.0147   0.8254   0.0185   0.8836   0.0187
θ_{(cid:18),7}    0.9605   0.0080   0.9554   0.0022   0.9541   0.0159   0.9729   0.0104   0.9315   0.0220
θ_{(cid:18),8}    0.9000   0.0063   0.8521   0.0149   0.7297   0.0344   0.7640   0.0251   0.7941   0.0276
θ_{(cid:18),9}    0.9417   0.0083   0.9445   0.0040   0.9488   0.0136   0.9467   0.0398   0.9339   0.0116
θ_{(cid:18),10}   0.1637   0.0078   0.1497   0.0059   0.1994   0.0106   0.1359   0.0094   0.1428   0.0075
θ_{(cid:18),11}   0.4920   0.0086   0.4985   0.0040   0.5882   0.0102   0.4845   0.0174   0.4939   0.0160
θ_{(cid:18),12}   0.8951   0.0155   0.8880   0.0133   0.9214   0.0092   0.9100   0.0195   0.9036   0.0073
θ_{(cid:18),13}   0.1595   0.0267   0.0991   0.0317   0.1196   0.0222   0.1519   0.0159   0.0666   0.0330

Table 2. Parameters estimated separately for UX transitions from rotations 1 to 2 and 5 to 6.

                    [1]          [2]        [3]          [4]
                    Rotation 1   Standard   Rotation 5   Standard
param               estimate     error      estimate     error
(cid:24)_{1,UE}     0.3183       0.0053     0.3379       0.0068
(cid:24)_{1,UN}     0.2179       0.0032     0.2178       0.0019
(cid:24)_{1,UM}     0.0909       0.0025     0.0890       0.0014
(cid:24)_{1,UU}     0.3729       …          0.3554       …
(cid:24)_{2,UE}     0.1153       0.0092     0.1210       0.0080
(cid:24)_{2,UN}     0.2353       0.0087     0.2224       0.0065
(cid:24)_{2,UM}     0.0735       0.0028     0.0686       0.0036
(cid:24)_{2,UU}     0.5759       …          0.5880       …

Table 3. Parameters estimated jointly across all rotations.

param                         estimate
p_1                           0.8094
p_2                           0.9734
(cid:4)_1                     0.3920

θ_{(cid:18),1}                0.1441
θ_{(cid:18),2}                0.7355
θ_{(cid:18),3}                0.4688
θ_{(cid:18),4}                0.8766
θ_{(cid:18),5}                0.7103
θ_{(cid:18),6}                0.8881
θ_{(cid:18),7}                0.9542
θ_{(cid:18),8}                0.8045
θ_{(cid:18),9}                0.9394
θ_{(cid:18),10}               0.1700
θ_{(cid:18),11}               0.5203
θ_{(cid:18),12}               0.9014
θ_{(cid:18),13}               0.1146

(cid:25)_{1,(cid:26)(cid:27)}   0.2235
(cid:25)_{2,(cid:26)(cid:27)}   0.1878
(cid:25)_{3,(cid:26)(cid:27)}   0.1967
(cid:25)_{4,(cid:26)(cid:27)}   0.1148
(cid:25)_{5,(cid:26)(cid:27)}   0.1365
(cid:25)_{6,(cid:26)(cid:27)}   0.1408

(cid:25)_{1,(cid:28)(cid:27)}   0.0870
(cid:25)_{2,(cid:28)(cid:27)}   0.0811
(cid:25)_{3,(cid:28)(cid:27)}   0.0756
(cid:25)_{4,(cid:28)(cid:27)}   0.0650
(cid:25)_{5,(cid:28)(cid:27)}   0.1847
(cid:25)_{6,(cid:28)(cid:27)}   0.5067

(cid:25)_{1,(cid:29)(cid:27)}   0.1014
(cid:25)_{2,(cid:29)(cid:27)}   0.0955
(cid:25)_{3,(cid:29)(cid:27)}   0.0969
(cid:25)_{4,(cid:29)(cid:27)}   0.0693
(cid:25)_{5,(cid:29)(cid:27)}   0.2113
(cid:25)_{6,(cid:29)(cid:27)}   0.4255

(cid:24)_{1,UE}               0.3274
(cid:24)_{1,UN}               0.2179
(cid:24)_{1,UM}               0.0901
(cid:24)_{1,UU}               0.3646
(cid:24)_{2,UE}               0.1181
(cid:24)_{2,UN}               0.2291
(cid:24)_{2,UM}               0.0711
(cid:24)_{2,UU}               0.5817

Notes to Table 3. Also estimated (but not reported) are separate coefficients for π_XE, π_XN, π_XM, π_XU, X ∈ {E, N, M}.

Table 4. Average numbers of individuals with indicated status across different rotation groups.

           [1]     [2]     [3]     [4]   [5]      [6]       [7]
rotation   E       N       M       U     total    U/(U+E)   (U+E)/(U+E+N)
1          7,905   4,378   5,708   580   18,570   6.8       66.0
2          8,047   4,590   5,373   566   18,575   6.6       65.2
3          8,049   4,634   5,349   547   18,579   6.4       65.0
4          8,032   4,650   5,367   533   18,581   6.2       64.8
5          7,831   4,598   5,628   522   18,578   6.2       64.5
6          7,939   4,685   5,444   514   18,581   6.1       64.3
7          7,970   4,702   5,409   504   18,585   5.9       64.3
8          8,016   4,724   5,342   507   18,588   5.9       64.3

Table 5. Effects of rotation on fractions reporting indicated labor status (coefficients and standard errors for regression (13)).

                                 [1]        [2]        [3]          [4]
                                 X = E      X = N      X = U        X = M
coefficient with subscript X,1   -0.0064    -0.0102    0.0007       0.0159
  s.e.                           (0.0021)   (0.0011)   (0.0002)     (0.0032)
coefficient with subscript X,5   -0.0104    -0.0042    -0.0003      0.0149
  s.e.                           (0.0017)   (0.0009)   (0.0002)     (0.0026)
coefficient with subscript X     -0.0006    0.0011     -0.0005      -8.95E-07
  s.e.                           (0.0003)   (0.0002)   (2.65E-05)   (0.0005)
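The last two columns of Table 4 follow directly from the counts in the first four, with missing observations (M) left out of both ratios. The short Python sketch below recomputes them from the counts as printed in Table 4; the names `TABLE4`, `REPORTED`, and `rates` are chosen here for illustration and are not taken from the authors' replication code.

```python
# Average counts by rotation group, copied from Table 4: (E, N, M, U).
TABLE4 = {
    1: (7905, 4378, 5708, 580),
    2: (8047, 4590, 5373, 566),
    3: (8049, 4634, 5349, 547),
    4: (8032, 4650, 5367, 533),
    5: (7831, 4598, 5628, 522),
    6: (7939, 4685, 5444, 514),
    7: (7970, 4702, 5409, 504),
    8: (8016, 4724, 5342, 507),
}

# Columns [6] and [7] as printed in Table 4, in percent.
REPORTED = {
    1: (6.8, 66.0), 2: (6.6, 65.2), 3: (6.4, 65.0), 4: (6.2, 64.8),
    5: (6.2, 64.5), 6: (6.1, 64.3), 7: (5.9, 64.3), 8: (5.9, 64.3),
}


def rates(e, n, m, u):
    """Return (U/(U+E), (U+E)/(U+E+N)) in percent.

    The missing count m is accepted only so a full Table 4 row can be
    passed in; it enters neither ratio.
    """
    labor_force = u + e
    return 100.0 * u / labor_force, 100.0 * labor_force / (labor_force + n)


computed = {rot: rates(*counts) for rot, counts in TABLE4.items()}

for rot, (u_rate, lfp) in computed.items():
    print(f"rotation {rot}: U/(U+E) = {u_rate:.1f}%, (U+E)/(U+E+N) = {lfp:.1f}%")
```

Both ratios decline between rotation 1 and rotation 8 even though every rotation group samples the same population, which is the rotation-group pattern that Table 4 documents.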

Table 6. Estimated average fraction of individuals who would have reported labor status E, N, M, or U if the individual were being interviewed for the first time.

status   param   estimate
E        π*_E    0.4244
N        π*_N    0.2359
M        π*_M    0.3086
U        π*_U    0.0311

Table 7. Estimated labor force status transition probabilities measured by the rotation group 1 technology.

         from E   from N   from M   from U
to E     0.8997   0.0366   0.0897   0.2007
to N     0.0255   0.8688   0.0452   0.1992
to M     0.0621   0.0647   0.8564   0.0870
to U     0.0126   0.0299   0.0088   0.5130

Table 8. Month t employment probabilities for UUU and UNU histories. Each history lists status at t − 3, t − 2, and t − 1, with the reported duration of unemployment in weeks.

UUU history                         Probability   UNU history                    Probability
U (1-4), U (5-14), U (5-14)         0.19          U (1-4), N, U (5-14)           0.14
U (5-14), U (5-14), U (15-26)       0.16          U (5-14), N, U (15-26)         0.14
U (15-26), U (15-26), U (15-26)     0.14          U (15-26), N, U (15-26)        0.15
U (15-26), U (27+), U (27+)         0.11          U (15-26), N, U (27+)          0.10
U (27+), U (27+), U (27+)           0.08          U (27+), N, U (27+)            0.07

Table 9. Effects of adjustments on unemployment rate and labor-force participation rate.

                                                              Unemployment   Labor-force          Employment-
                                                              rate           participation rate   population ratio
Unadjusted BLS                                                6.3%           64.7%                60.6%
Corrected for rotation-group bias only                        6.8%           65.9%                61.4%
Corrected for rotation-group bias and missing observations    7.1%           66.1%                61.4%
Corrected for rotation-group bias, missing observations,
  and long-term unemployed                                    8.2%           66.9%                61.4%

Table 10. Adjusted and unadjusted estimates of duration of unemployment.

                   BLS        Adjusted
< 5 weeks          29.4       35.1
5-14 weeks         27.8       31.5
15-26 weeks        15.6       18.2
> 26 weeks         27.2       15.2
Average duration   25 weeks   16 weeks
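Table 10 reports average duration in weeks, whereas the stocks in equation (41) are indexed by months d. The Python sketch below illustrates how a duration-weighted average of such stocks could be computed for two types and converted to weeks at 4.33 weeks per month. The inflows and continuation probabilities are hypothetical placeholders, not the paper's estimates, and the function names are chosen here for illustration.

```python
WEEKS_PER_MONTH = 4.33


def stock_by_duration(inflows, continuation, d):
    """Number unemployed for exactly d months as of month t, as in equation (41).

    inflows[k]      -- new inflows of this type k months before t
    continuation[k] -- probability of remaining unemployed into the month
                       that lies k months before t
    """
    count = inflows[d - 1]        # inflow at t - d + 1
    for k in range(d - 1):        # continuation factors for t - d + 2, ..., t
        count *= continuation[k]
    return count


def average_duration_months(types, max_d):
    """Duration-weighted average over d = 1..max_d, summing stocks across types."""
    weighted = 0.0
    total = 0.0
    for d in range(1, max_d + 1):
        stock = sum(stock_by_duration(v, g, d) for v, g in types)
        weighted += d * stock
        total += stock
    return weighted / total


# Hypothetical inputs (not the paper's estimates): constant inflows and
# constant continuation probabilities for two types over D = 48 months.
D = 48
type_1 = ([1.0] * D, [0.4] * D)
type_2 = ([0.5] * D, [0.8] * D)

months = average_duration_months([type_1, type_2], D)
print(f"average duration: {months:.2f} months = {months * WEEKS_PER_MONTH:.1f} weeks")
```

With a single type, a constant continuation probability γ, and a long horizon D, the average approaches 1/(1 − γ) months, which gives a quick sanity check on any implementation of the formula.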

Figure 1. Alternative measures of unemployment-continuation probability, new inflows to unemployment, unemployment rate, labor-force participation rate, and average duration of unemployment.

Notes to Figure 1. Panel A: probability that an unemployed individual will still be unemployed next month, Aug 2001 to April 2018, as calculated by: (1) ratio of unemployed with duration 5 weeks or greater in month t to total unemployed in t − 1 (solid black); (2) fraction of those unemployed in t − 1 who are still unemployed in t (dashed green); (3) reconciled estimate (dotted blue). Panel B: number of newly unemployed as a percent of the population, Aug 2001 to April 2018, as calculated by: (1) number of unemployed with duration less than 5 weeks (solid black); (2) EU and NU flows as adjusted by BLS (dashed green); (3) reconciled estimate (dotted blue). Panel C: unemployment rate, July 2001 to March 2018, as calculated by BLS (solid black), adjusted estimate (dotted blue), and difference (bars). Panel D: labor-force participation rate, July 2001 to March 2018, as calculated by BLS (solid black), adjusted estimate (dotted blue), and difference (bars, scale on right). Panel E: average duration of unemployment, July 2004 to March 2018, as calculated by BLS (solid black), adjusted estimate (dotted blue), and difference (bars). All series seasonally adjusted. Source: the series labeled as BLS are from the Bureau of Labor Statistics, and the other series are based on the authors' calculations.

Figure 2. Reported durations of unemployment for individuals in rotation 1.

Notes to Figure 2. Top panel: reported fraction (blue) and fraction predicted by equation (5) (yellow) of unemployed who have been searching for the indicated number of weeks. Bottom panel: total fraction of unemployed (black) who have been looking for work for τ weeks and fraction for each type.

Figure 3. Reported and predicted unemployment durations in rotation 2 for individuals who were not in the labor force in rotation 1 and unemployed in rotation 2.

Notes to Figure 3. Horizontal axis: duration of unemployment spell in weeks. Vertical axis: of the individuals who were not in the labor force in rotation 1 and unemployed in rotation 2, the percent who reported having been searching for work at the time of rotation 2 for the indicated duration.

Figure 4. Probability that someone who reports being unemployed with duration τ has perceived unemployment duration characterized by decay rate p_2.

Notes to Figure 4. Horizontal axis denotes the duration τ of the reported unemployment spell in weeks and the vertical axis is η_2(τ).

Figure 5. Effect of rotation group on percentage of sampled individuals with indicated reported status. [Plot: one line for each status E, N, M, and U; horizontal axis: rotation group 1 to 8; vertical axis: −1.5 to 2.]

Notes to Figure 5. Graph shows predicted values implied by regression (13).

Figure 6. Parameters capturing rotation-group bias.

Notes to Figure 6. Probability that someone who reported status E in rotation j would have been counted as missing if interviewed using rotation 1 technology (left), that someone who reported status N in rotation j would have been counted as unemployed if interviewed using rotation 1 technology (middle), and that someone who was counted as not in the labor force in rotation j would have been missing if interviewed using rotation 1 technology (right).

Figure 7. Fraction of individuals reporting labor status E, N, M, or U in each rotation group (solid blue) and fraction predicted to report that status for that rotation according to equation (26) (dashed red).

Figure 8. Actual reported transition probabilities for each rotation (solid blue) and fraction predicted by equation (27) (dashed red).

Figure 9. Changes in rotation-group bias parameters over time.

Figure 10. Time variation in selected parameters.

Notes to Figure 10. Black lines denote smoothed data summaries $\bar{\theta}_t$ and red dashed lines denote estimates $\tilde{\theta}_t$ that adjust for rotation-group bias, missing observations, and long-term unemployed.

Figure 11. Alternative measures of unemployment rate and new inflows into unemployment.

Notes to Figure 11. Top panel: adjusted unemployment rate ($\tilde{u}_t$, in dotted blue), BLS unemployment rate (solid black), U5 (dashed red) and U6 (dashed green). Bottom panel: number of newly unemployed as a percent of the noninstitutional civilian population 16 years and over. Dotted blue: estimate $\tilde{V}_t$ incorporating all adjustments; dashed red: number of unemployed with duration less than 5 weeks; solid turquoise: number of EU and NU transitions as a fraction of individuals with two consecutive non-missing observations; dashed green: latter adjusted for rotation-group bias alone; solid black: BLS adjusted EU and NU flows. Source: the series labeled as BLS, BLS adjusted, U5 and U6 unemployment rates are from the Bureau of Labor Statistics, and the rest are based on the authors' calculations.
