Lack of Signal Error (LoSE) and Implications for OLS Regression: Measurement Error for Macro Data
Abstract
This paper proposes a simple generalization of the classical measurement error model, introducing new measurement errors that subtract signal from the true variable of interest, in addition to the usual classical measurement errors (CME) that add noise. The effect on OLS regression of these lack of signal errors (LoSE) is opposite the conventional wisdom about CME: while CME in the explanatory variables causes attenuation bias, LoSE in the dependent variable, not the explanatory variables, causes a similar bias under some conditions. The paper provides evidence that LoSE is an important source of error in US macroeconomic quantity data such as GDP growth, illustrates downward bias in regressions of GDP growth on asset prices, and provides recommendations for econometric practice.
Finance and Economics Discussion Series
Divisions of Research & Statistics and Monetary Affairs
Federal Reserve Board, Washington, D.C.

Lack of Signal Error (LoSE) and Implications for OLS Regression: Measurement Error for Macro Data
Jeremy J. Nalewaik
2008-15

NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.
Jeremy J. Nalewaik ∗
First Draft: October 2007
October 28, 2008

∗ Board of Governors of the Federal Reserve System, 20th Street and Constitution Avenue NW, Washington, DC 20551. Telephone: 1-202-452-3792. Fax: 1-202-872-4927. E-mail: jeremy.j.nalewaik@frb.gov. Thanks to Katherine Abraham, Miriam Feffer, Charles Fleischman, Michael Kiley, David Lebow, Richard Lyons, Claudia Sahm, Jonathan Millar, Rob Vigfusson, an anonymous referee, and seminar participants at the Board of Governors of the Federal Reserve System for comments. The views expressed in this paper are solely those of the author.
1 Introduction

This paper proposes a simple generalization of the classical measurement error model and studies its implications for ordinary least squares (OLS) regression. The usual model starts with the true variable of interest and adds noise, which we call classical measurement error (CME); see Klepper and Leamer (1984), Griliches (1986), Fuller (1987), Leamer (1987), Angrist and Krueger (1999), Bound, Brown and Mathiowetz (2001) or virtually any econometrics textbook. The generalization discussed here incorporates a different kind of measurement error that subtracts signal from the true variable; this new error term is called the Lack of Signal Error, or LoSE for short. This additional term adds some much-needed flexibility to the classical measurement error model: it allows the mismeasured variable to have either more or less variance than the true variable of interest, in contrast to the classical model, which imposes that the mismeasured variable has higher variance. This restriction does not hold in some important applications in macroeconomics and elsewhere.

The implications of LoSE for OLS regression are opposite the usual intuition about measurement error, which is applicable to CME only. The CME intuition says that measurement error in the dependent variable $Y$ of a regression poses no real problems for standard estimation and inference. Parameter estimates are unbiased and consistent, while hypotheses are more difficult to reject because CME increases the variance of regression residuals and thus standard errors. CME in the explanatory variables $X$ causes the real problems for OLS regression, namely attenuation bias and inconsistency.

With LoSE, however, these results are reversed. For the baseline case considered here, LoSE in the explanatory variables $X$ produces no bias or inconsistency while increasing standard errors, similar to CME in $Y$. It is LoSE in the dependent variable $Y$ that introduces an attenuation-type bias and inconsistency into the regression under some circumstances (in particular, when the explanatory variables contain some signal missing from the dependent variable). This point is obvious when we consider the extreme case of maximum LoSE in $Y$, so that $Y$ is just a constant equal to its unconditional mean. Then a standard OLS regression of $Y$ on any explanatory variable $X$ with positive variance recovers $\hat{\beta} = \frac{\operatorname{cov}(X,Y)}{\operatorname{var}(X)} = 0$, regardless of the true $\beta$. In addition, LoSE in $Y$ shrinks the variance of regression residuals, thus shrinking parameter standard errors compared to what they would be without this type of mismeasurement. The standard errors are zero in our extreme case, and this raises concerns about the robustness of hypothesis tests.

Much of the econometrics literature on non-classical measurement error has focused on binary or categorical response data, for which the classical measurement error assumptions cannot hold; see Card (1996), Bollinger (1996), and Kane, Rouse and Staiger (1999). In a more general linear regression context, Berkson (1950) was an early paper tackling some of the issues addressed here; see the discussion in Durbin (1954), Griliches (1986, section 4), and Fuller (1987, section 1.6.4). Berkson had in mind a regression using “controlled” measurements as the explanatory variable $X$, readings from a scientific experiment where the unobserved true values of interest $X^*$ fluctuate around the observed controlled measurements in a random way. Berkson showed that if the unobserved fluctuations $X^* - X$ are uncorrelated with the measurements $X$, then regression parameter estimates are unbiased. The literature following Berkson has generally focused on extending his results to regressions employing non-linear functions of $X$; see Geary (1953), Federov (1974), Carroll and Stefanski (1990), Huwang and Huang (2000), and Wang (2003, 2004). This literature has focused less on the implications of “controlled” measurements of the dependent variable $Y$.
Other papers discussing different LoSE-related estimation issues include Sargent (1989), Bound, Brown and Mathiowetz (2001), and Kimball, Sahm and Shapiro (2008). Perhaps the closest paper to this one is Hyslop and Imbens (2001), which shows some of the major implications of LoSE, while simultaneously considering the implications of a different problem, correlation between the measurement error in $X$ and the regression error. In defining the LoSE in a variable as the difference between its true value and a conditional expectation of that true value, we consider arbitrary conditioning information sets $Z$; Hyslop and Imbens (2001) consider information sets consisting of only mismeasured $X$ and mismeasured $Y$ in a univariate regression context. Considering general information sets allows us to derive more general results, clarifying under what conditions LoSE produces attenuation bias and what instruments are valid in addressing that bias.

A large body of empirical work has now accumulated on mismeasurement of microeconomic survey data, which generally rejects the CME assumptions and points to negative correlation between the measurement errors and the true variables of interest; see Bound and Krueger (1991), Bound, Brown, Duncan and Rodgers (1994), Pischke (1995), Bollinger (1998), Bound, Brown and Mathiowetz (2001) and the references therein, and Escobal and Laszlo (2008). Such negative correlation is an implication of LoSE, although other measurement error models may generate such a result as well. The empirical work in this paper focuses on a different type of data, namely US macroeconomic quantity data such as gross domestic product (GDP) and gross domestic income (GDI). These data pass through numerous revisions, and the more poorly-measured initial estimates have less variance than the revised estimates, providing a concrete example of measurement error that cannot be CME; see Mankiw and Shapiro (1986).
After providing a brief introductory motivation for the generalized measurement error model in Section 2 and showing the implications for OLS regression in Section 3, Section 4.1 of the paper discusses the nature of the source data used to compute US macroeconomic quantity data, and points out some reasons why LoSE may be present in the estimates even after they have passed through all their revisions. GDP growth and GDI growth measure the same underlying concept, but use different source data; see Fixler and Nalewaik (2007) and Nalewaik (2007a). Since the two measures are far from equal in the fully-revised quarterly or annual frequency data, some mismeasurement must remain in either GDP or GDI growth. Section 4.2 reviews the evidence in Fixler and Nalewaik (2007) and Nalewaik (2007a,b) supporting the notion that this mismeasurement is largely LoSE in GDP growth. Some simple calculations comparing GDP and GDI growth show that this LoSE in GDP growth is likely substantial: after 1984, at least 30% of the variance of the true growth rate of the economy appears to be missing.

In a wide variety of econometric specifications employed in macroeconomics and finance, variables like GDP growth, investment growth and consumption growth are regressed on asset prices (interest rates, stock price changes, exchange rate changes, etc.). These regressions are of particular interest because asset prices potentially capture some signal missing from the mismeasured quantities, implying attenuation-type biases in the coefficients. Section 4.3 tests for these biases, regressing output growth measures contaminated with different degrees of LoSE on a fixed set of stock or bond prices. In cases where we suspect the dependent variable is contaminated with more LoSE, the regression coefficients are smaller, and the differences across regressions are often statistically significant. For example, the coefficients increase when we switch the dependent
variable from the early GDP growth estimates based on limited source data to later GDP growth estimates based on more-comprehensive data. Tellingly, the coefficients increase again when we switch the dependent variable from GDP growth to GDI growth. The hypothesis that measurement error in the dependent variable does not bias OLS regression coefficients, a core piece of conventional wisdom in the profession, is rejected by the data, just as the paper predicts if the measurement error is LoSE.

Section 5 concludes the paper with a review of some of the major implications of substantial LoSE in GDP growth. Implications for econometric practice are discussed, using examples of popular regressions in macroeconomics.

2 A Generalization of the Classical Measurement Error Model

Let $Y_t^*$ be the true value of the variable of interest, $Y_t$ be a mismeasured estimate of that variable, and $Z_t$ be a $(1 \times l)$ vector of possibly stochastic variables used to construct $Y_t$. In many cases a government statistical agency or some other organization computes $Y_t$ based on information from surveys, administrative records, and other data sources (source data for short); then $Z_t$ will be variables drawn from the source data, possibly including non-linear functions of the original source data. Under the classical measurement error model,

$$Y_t = Y_t^* + \varepsilon_t.$$

The term $\varepsilon_t$ is “noise” or the classical measurement error (CME) in the estimate. In
the current context this is taken to imply independence of $\varepsilon_t$ and $Y_t^*$, although the weaker assumption $\operatorname{cov}(Y_t^*, \varepsilon_t) = 0$ suffices for many purposes. The CME may arise from estimation errors or other sources; since many estimates $Y_t$ are based on surveys, survey sampling errors are often thought to be a source of CME.

Under the generalized model of mismeasurement considered here, the mismeasured estimate $Y_t$ is as in Fixler and Nalewaik (2007):

$$Y_t = E(Y_t^* \mid Z_t) + \varepsilon_t. \tag{1}$$

The CME term $\varepsilon_t$ is assumed independent of $Z_t$ and $Y_t^*$. It can be seen immediately that the classical measurement error model is a special case of this more general model, where $Z_t$ spans $Y_t^*$ so $E(Y_t^* \mid Z_t) = Y_t^*$.

Define the deviation of the variable of interest from its conditional expectation as:

$$\zeta_t = Y_t^* - E(Y_t^* \mid Z_t). \tag{2}$$

This deviation represents the information about $Y_t^*$ not contained in $Z_t$, and is uncorrelated with all functions of $Z_t$. With $\operatorname{cov}(E(Y_t^* \mid Z_t), \zeta_t) = 0$, the variance of the true variable of interest may be decomposed into the variance of the conditional expectation plus the variance of $\zeta_t$, and: $\operatorname{var}(\zeta_t) = \operatorname{var}(Y_t^*) - \operatorname{var}(E(Y_t^* \mid Z_t))$. The variance of $\zeta_t$ represents the variance of the information about $Y_t^*$ missing from the conditional expectation. Substituting into (1):

$$Y_t = Y_t^* - \zeta_t + \varepsilon_t. \tag{3}$$
Thinking of $\varepsilon_t$ as mismeasurement from noise, $\zeta_t$ represents an opposite kind of mismeasurement, mismeasurement from lack of signal about $Y_t^*$ in the information used to construct $Y_t$. As such, $\zeta_t$ may be labelled the Lack of Signal Error, or LoSE for short. Taking variances of (3), the LoSE is clearly correlated with $Y_t^*$, with $\operatorname{cov}(Y_t^*, \zeta_t) = \operatorname{var}(\zeta_t)$ in fact, so:

$$\begin{aligned}
\operatorname{var}(Y_t) &= \operatorname{var}(Y_t^*) + \operatorname{var}(\zeta_t) - 2\operatorname{cov}(Y_t^*, \zeta_t) + \operatorname{var}(\varepsilon_t) \\
&= \operatorname{var}(Y_t^*) - \operatorname{var}(\zeta_t) + \operatorname{var}(\varepsilon_t).
\end{aligned} \tag{4}$$

Depending on whether the variance of the LoSE is greater than or less than the variance of the CME, the variance of the estimate $Y_t$ may be less than or greater than the variance of the true variable of interest $Y_t^*$. With CME alone, the variance of the estimate $Y_t$ must exceed the variance of the true variable. The key limitation of the CME model is the assumption that $\operatorname{cov}(Y_t - Y_t^*, Y_t^*) = 0$. It is easy to think of theoretical counterexamples, for example, when $Y_t^*$ has positive variance but the estimate $Y_t$ is just a constant for all $t$; actual counterexamples are provided in the introduction and section 4. The generalization with LoSE allows this covariance to range from 0 to a lower bound of negative $\operatorname{var}(Y_t - Y_t^*)$, in which case all the mismeasurement arises from LoSE.

While the generalized model here is less restrictive than the CME model, some restrictions do remain. Writing:

$$Y_t + \zeta_t = Y_t^* + \varepsilon_t, \tag{5}$$

the zero covariance between $\zeta_t$ and $Y_t$ is a restriction, implied by the first term on the right of (1) being a conditional expectation. Systematic biases in the estimate,
on top of those caused by noise,¹ violate this assumption. For concreteness, assume $E(Y_t^* \mid Z_t) = Z_t\gamma$. Consider an estimate of $Y_t^*$ based on $Z_t$ that misuses the information, so $Y_t = Z_t\tilde{\gamma} + \varepsilon_t$ with $\tilde{\gamma} \neq \gamma$. The estimate “misses” in a systematic way. For estimation and inference about $Y_t^*$ and its relation to other variables (for example in attempting to estimate $\beta$ by OLS in the relation $Y_t^* = X_t^*\beta + U_t^*$ using the mismeasured data), these “misses” clearly lead to biased and inconsistent estimates. However, unless additional information is available about the nature of $Z_t\tilde{\gamma} - Z_t\gamma$, the direction and magnitude of these biases are unclear. In highly stylized examples, the biases may be derived; one such example is $Y_t = \alpha_0 + \alpha_1 Y_t^* + \varepsilon_t$, with $\alpha_0 \neq 0$ and $\alpha_1 \neq 1$; this model is employed by de Leeuw and McKelvey (1983), Bound, Brown, Duncan and Rodgers (1994), Pischke (1995), and Bound, Brown and Mathiowetz (2001). But once one allows for these systematic biases in the estimates, there is generally no reason to prefer one highly stylized example to another, and we are in a wilderness of possibilities.

In the case of GDP and GDI growth, the model $Y_t = \alpha_0 + \alpha_1 Y_t^* + \varepsilon_t$ does not fit the facts, as the discussion of tables 3A and 3B in section 4.3 makes clear. In general, an important goal of all creators of data (government statistical agencies as well as other groups) is to avoid systematic mismeasurement like that described in the previous paragraph. Indeed, their ultimate goal is probably to produce estimates $Y_t$ that are as close as possible to $E(Y_t^* \mid Z_t)$, with as broad an information set $Z_t$ as possible given resource constraints. As such, the generalized model (1) is a useful benchmark, and should approximate well the underlying mismeasurement in many situations. It also has the advantage of being mathematically tractable, and the symmetry between adding noise and subtracting signal is intuitive and appealing.

¹ With the noise term, $E(Y_t^* \mid Y_t) \neq Y_t$; the estimate is biased in this sense.
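The decomposition in (1) through (4) can be checked numerically. The sketch below is only an illustration under assumed Gaussian components: the function name `simulate_lose`, the variable names, and the standard deviations are hypothetical choices, not values from the paper. It draws the conditional expectation, the LoSE and the CME independently, builds $Y_t^*$ and $Y_t$, and compares sample moments.

```python
import random

def mean(v):
    return sum(v) / len(v)

def cov(a, b):
    """Sample covariance (divides by n); cov(a, a) is the sample variance."""
    ma, mb = mean(a), mean(b)
    return sum((x - ma) * (y - mb) for x, y in zip(a, b)) / len(a)

def simulate_lose(n=200_000, sd_signal=1.0, sd_lose=0.8, sd_cme=0.5, seed=0):
    """Hypothetical Gaussian design: Y* = E(Y*|Z) + zeta, Y = E(Y*|Z) + eps."""
    rng = random.Random(seed)
    cond_exp = [rng.gauss(0.0, sd_signal) for _ in range(n)]  # E(Y*|Z)
    zeta = [rng.gauss(0.0, sd_lose) for _ in range(n)]        # LoSE, independent of Z
    eps = [rng.gauss(0.0, sd_cme) for _ in range(n)]          # CME
    y_true = [m + z for m, z in zip(cond_exp, zeta)]          # true variable Y*
    y_meas = [m + e for m, e in zip(cond_exp, eps)]           # estimate Y, as in (1)
    return y_true, y_meas, zeta, eps

y_true, y_meas, zeta, eps = simulate_lose()
var_true, var_meas = cov(y_true, y_true), cov(y_meas, y_meas)
var_zeta, var_eps = cov(zeta, zeta), cov(eps, eps)
err = [ym - yt for ym, yt in zip(y_meas, y_true)]  # Y - Y* = eps - zeta, as in (3)

print("cov(Y*, zeta) vs var(zeta):", cov(y_true, zeta), var_zeta)
print("var(Y) vs var(Y*) - var(zeta) + var(eps):",
      var_meas, var_true - var_zeta + var_eps)
print("cov(Y - Y*, Y*) vs -var(zeta):", cov(err, y_true), -var_zeta)
```

With the LoSE variance set above the CME variance, as in these assumed settings, the measured series has less variance than the true series, which the classical model cannot produce.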
Before concluding this section, it is worth emphasizing that $Z_t$ need not be an exhaustive information set, i.e. it need not contain all available relevant pieces of information about unobserved $Y_t^*$. Resource and other constraints certainly preclude this from being the case, and the sections below considering the implications of LoSE allow for this possibility.

3 Implications for OLS Estimation

Consider ordinary least squares estimation of the relation between a mismeasured variable $Y_t$ and a $(1 \times k)$ set of mismeasured explanatory variables $X_t$, using a sample of length $T$. When stacking together the observations, time subscripts are dropped for convenience:

$$Y = \begin{pmatrix} Y_1 \\ Y_2 \\ \vdots \\ Y_T \end{pmatrix}; \qquad X = \begin{pmatrix} X_1 \\ X_2 \\ \vdots \\ X_T \end{pmatrix}.$$

Our full set of assumptions follows:

Assumption 1 $Y_t^* = X_t^*\beta + U_t^*$. $U_t^*$ is i.i.d., mean zero, with $\operatorname{var}(U_t^*) = \sigma^2_{U^*}$ and $U_s^*$ independent of $X_t^*$, $\forall t, s$.

Measured $Y_t = E(Y_t^* \mid Z_t^y) + \varepsilon_t$, with:

• The CME $\varepsilon_t$ is i.i.d., mean zero, and independent of all conditioning information sets, with $\operatorname{var}(\varepsilon_t) = \sigma^2_{\varepsilon}$.
• $Z^y$ may be partitioned into two sets of variables, $Z^y_x$ and $Z^y_u$, with variables in $Z^y_x$ independent of $U^*$ and $Z^y_u$, and variables in $Z^y_u$ independent of $X^*$ and $Z^y_x$.
• The LoSE $\zeta_t = \left(X_t^* - E(X_t^* \mid Z^y_{x,t})\right)\beta + \left(U_t^* - E(U_t^* \mid Z^y_{u,t})\right) = \zeta^{xy}_t\beta + \zeta^u_t$. $\zeta^u_t$ is i.i.d. and mean zero with $\operatorname{var}(\zeta^u_t) = \sigma^2_{\zeta,u}$, and $\zeta^{xy}_t$ is i.i.d. and mean zero with $\operatorname{var}(\zeta^{xy}_t) = \sigma^2_{\zeta,xy}$, a $k \times k$ matrix.

Measured $X_t = E(X_t^* \mid Z^x_t) + \varepsilon^x_t$, with:

• The CME $\varepsilon^x_t$ is i.i.d., mean zero, independent of $\varepsilon_t$ and all conditioning information sets, with $\operatorname{var}(\varepsilon^x_t) = \sigma^2_{\varepsilon,x}$, a $k \times k$ matrix.
• The variables in $Z^x$ are independent of $U^*$ and $Z^y_u$.
• The LoSE $\zeta^x_t = X_t^* - E(X_t^* \mid Z^x_t)$ is i.i.d. and mean zero with $\operatorname{var}(\zeta^x_t) = \sigma^2_{\zeta,x}$, a $k \times k$ matrix.
• As $T \to \infty$:
  – $\frac{1}{T}(X^*)'X^* \xrightarrow{p} Q_{xx}$
  – $\frac{1}{T}\left(E(X^* \mid Z^y_x)\right)'E(X^* \mid Z^y_x) \xrightarrow{p} Q^{zy}_{xx} = Q_{xx} - \sigma^2_{\zeta,xy}$
  – $\frac{1}{T}\left(E(X^* \mid Z^x)\right)'E(X^* \mid Z^x) \xrightarrow{p} Q^{zx}_{xx} = Q_{xx} - \sigma^2_{\zeta,x}$
  – $\frac{1}{T}\left(E(X^* \mid Z^y_x)\right)'E(X^* \mid Z^x) \xrightarrow{p} Q^{zb}_{xx}$
  – $\frac{1}{T}X'X \xrightarrow{p} Q^{zx}_{xx} + \sigma^2_{\varepsilon,x}$.

All relevant fourth moments exist.

For most purposes, especially for time series analysis, the i.i.d. and homoskedasticity assumptions here are overly restrictive, but relaxing them is straightforward; we keep these assumptions so we may discuss bias as well as consistency.
The assumptions imposed on the information sets $Z^y$ and $Z^x$ regarding partitioning and independence allow us to factor the joint distribution of the relevant variables as follows:

$$f(U^*, X^*, Z^y, Z^x) = f_{UZ}(U^*, Z^y_u)\, f_{XZ}(X^*, Z^y_x, Z^x).$$

Without these assumptions, the conditioning may introduce correlation between the measurement error in $X$ and the regression residual (which includes the measurement error in $Y$). An example where the conditioning has this effect is in Hyslop and Imbens (2001). As another example, assume the information sets $Z^y_x$, $Z^y_u$, and $Z^x$ are univariate, and let $Z^x = Z^y_u + Z^y_x$; then $E(X_t^* \mid Z^x_t)$ and $\zeta^x_t$ are correlated with $U_t^*$ (as long as $Z^y_u$ captures some variation in $U^*$), and the above factorization is not valid. Correlation between the measurement error in the explanatory variables and the regression error can introduce serious biases in some regressions, but I view these biases as distinctly different from those introduced by Lack of Signal. To understand clearly the implications of LoSE, what biases it may introduce and under what conditions, isolating its effects from other biases is useful. Our assumptions allow us to do that.

Given assumption 1, $Y_t$ can be written as:

$$\begin{aligned}
Y_t &= E(X_t^* \mid Z^y_{x,t})\beta + E(U_t^* \mid Z^y_{u,t}) + \varepsilon_t \\
&= X_t\beta + \left(E(X_t^* \mid Z^y_{x,t}) - X_t\right)\beta + E(U_t^* \mid Z^y_{u,t}) + \varepsilon_t \\
&= X_t\beta + \left(E(X_t^* \mid Z^y_{x,t}) - E(X_t^* \mid Z^x_t) - \varepsilon^x_t\right)\beta + U_t^* - \zeta^u_t + \varepsilon_t.
\end{aligned} \tag{6}$$
The OLS regression estimator is:

$$\begin{aligned}
\hat{\beta} &= (X'X)^{-1}X'Y \\
&= \beta + (X'X)^{-1}X'\left(\left(E(X^* \mid Z^y_x) - E(X^* \mid Z^x) - \varepsilon^x\right)\beta + U^* - \zeta^u + \varepsilon\right).
\end{aligned} \tag{7}$$

Consider the sources of bias and inconsistency in this estimate. It is well known that the CME in $Y$ introduces no bias and inconsistency, since $\varepsilon$ is independent of $X$. Interestingly, the LoSE in $U^*$ introduces no bias or inconsistency either: given the assumptions about the information sets, $E(U^* \mid Z^y_u) = U^* - \zeta^u$ is uncorrelated with $E(X^* \mid Z^x)$ and hence $X = E(X^* \mid Z^x) + \varepsilon^x$. The other components in the error of (6) do cause bias and inconsistency; taking expectations and probability limits of (7) yields:

$$E(\hat{\beta}) = \beta + E\left((X'X)^{-1}X'\left(E(X^* \mid Z^y_x) - E(X^* \mid Z^x) - \varepsilon^x\right)\right)\beta, \tag{8}$$

and:

$$\hat{\beta} \xrightarrow{p} \beta + \left(Q^{zx}_{xx} + \sigma^2_{\varepsilon,x}\right)^{-1}\left(Q^{zb}_{xx} - Q^{zx}_{xx} - \sigma^2_{\varepsilon,x}\right)\beta. \tag{9}$$

The usual attenuation bias and inconsistency from CME in $X$ is evident. The additional inconsistency from LoSE depends on the difference between $Q^{zb}_{xx}$ and $Q^{zx}_{xx}$. Illuminating special cases are discussed in the subsections below, especially subsection 3.4, but clearly the correlation between the variables in $Z^y_x$ and $Z^x$ is critical here.

The inconsistency of $\hat{\beta}$ can be corrected by instrumenting with a $(1 \times m)$ set of instruments $W_t$, with $m \geq k$, if the instruments meet the following set of assumptions:

Assumption 2 With $P_W = W(W'W)^{-1}W'$, $\frac{1}{T}X'P_W X \xrightarrow{p} Q^{w}_{xx}$, a positive semi-definite matrix, and $\frac{1}{T}X'P_W\left(\left(E(X^* \mid Z^y_x) - E(X^* \mid Z^x) - \varepsilon^x\right)\beta + U^* - \zeta^u + \varepsilon\right) \xrightarrow{p} 0$. All relevant fourth moments exist.
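The probability limit in (9) can be illustrated with a small simulation. This is a sketch under assumptions that are not in the paper: a univariate regressor, a hypothetical design in which $X^* = a + b + c$ with independent standard normal components, $Z^y_x = \{a, b\}$, and $Z^x$ equal to either $\{a\}$ or $\{a, c\}$. The function names and parameter values are illustrative only. In this design $Q^{zb}_{xx} = 1$ in both cases, while $Q^{zx}_{xx}$ is 1 or 2.

```python
import random

def ols_slope(x, y):
    """Slope from a univariate OLS regression of y on x (with intercept)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    return sxy / sxx

def plim_eq9(q_zx, q_zb, var_cme_x, beta):
    """Scalar version of the probability limit in (9)."""
    return beta + (q_zb - q_zx - var_cme_x) / (q_zx + var_cme_x) * beta

def simulate(x_uses_c, sd_cme_x, beta=2.0, n=100_000, seed=1):
    """Assumed design: X* = a + b + c, Z^y_x = {a, b}, Z^x = {a} or {a, c}."""
    rng = random.Random(seed)
    x, y = [], []
    for _ in range(n):
        a, b, c = rng.gauss(0, 1), rng.gauss(0, 1), rng.gauss(0, 1)
        u_seen = rng.gauss(0, 1)         # E(U*|Z^y_u); the unseen part never enters Y
        eps_y = rng.gauss(0, 0.5)        # CME in Y
        eps_x = rng.gauss(0, sd_cme_x)   # CME in X (identically zero when sd is 0)
        e_x_given_zx = a + c if x_uses_c else a   # E(X*|Z^x)
        x.append(e_x_given_zx + eps_x)
        y.append((a + b) * beta + u_seen + eps_y)  # E(X*|Z^y_x) = a + b
    return ols_slope(x, y)

# (label, Z^x includes c?, sd of CME in X, Q^zx, Q^zb)
cases = [
    ("Z^x = {a},    no CME in X", False, 0.0, 1.0, 1.0),
    ("Z^x = {a},    CME in X   ", False, 1.0, 1.0, 1.0),
    ("Z^x = {a, c}, no CME in X", True, 0.0, 2.0, 1.0),
    ("Z^x = {a, c}, CME in X   ", True, 1.0, 2.0, 1.0),
]
results = []
for label, x_uses_c, sd_cme_x, q_zx, q_zb in cases:
    simulated = simulate(x_uses_c, sd_cme_x)
    implied = plim_eq9(q_zx, q_zb, sd_cme_x ** 2, beta=2.0)
    results.append((simulated, implied))
    print(f"{label}: simulated {simulated:.3f}, implied by (9) {implied:.3f}")
```

In the first case $Z^x$ is contained in $Z^y_x$, so $Q^{zb}_{xx} = Q^{zx}_{xx}$ and only the CME term in (9) matters. In the third case $X$ carries information ($c$) that is missing from $Y$, and the slope is attenuated even without CME in $X$.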
To correct the biases in OLS, valid instruments must be uncorrelated with the CME in $X$, a standard condition. However, an additional condition must be met: the instruments must be uncorrelated with $E(X^* \mid Z^y_x) - E(X^* \mid Z^x)$. This condition is met by instruments $W$ that are common to both information sets (if such information exists), so $W \subset Z^x$ and $W \subset Z^y_x$, since $W'E(X^* \mid Z^y_x)$ and $W'E(X^* \mid Z^x)$ then have the same probability limit. With valid instruments, we have:

$$\begin{aligned}
\hat{\beta}_{IV} &= (X'P_W X)^{-1}X'P_W Y \\
&= \beta + (X'P_W X)^{-1}X'P_W\left(\left(E(X^* \mid Z^y_x) - E(X^* \mid Z^x) - \varepsilon^x\right)\beta + U^* - \zeta^u + \varepsilon\right),
\end{aligned} \tag{10}$$

and $\hat{\beta}_{IV} \xrightarrow{p} \beta$. The asymptotic distribution of the estimator is:

$$\sqrt{T}\left(\hat{\beta}_{IV} - \beta\right) \xrightarrow{d} N\left(0, \left(Q^{w}_{xx}\right)^{-1}\left(\sigma^2_{U^*} - \sigma^2_{\zeta,u} + \sigma^2_{\varepsilon} + \beta'\left(Q^{zy}_{xx} - 2Q^{zb}_{xx} + Q^{zx}_{xx} + \sigma^2_{\varepsilon,x}\right)\beta\right)\right),$$

where $\xrightarrow{d}$ denotes convergence in distribution as $T \to \infty$, and $N(a,b)$ is a Gaussian distribution with mean $a$ and variance $b$. The usual estimator of the variance of the error term, $s^2 = \frac{1}{T}\left(Y - X\hat{\beta}_{IV}\right)'\left(Y - X\hat{\beta}_{IV}\right)$, converges to the error variance in this asymptotic distribution:

$$\begin{aligned}
s^2 &= \frac{1}{T}\left(E(X^* \mid Z^y_x)\beta + E(U^* \mid Z^y_u) + \varepsilon - \left(E(X^* \mid Z^x) + \varepsilon^x\right)\hat{\beta}_{IV}\right)' \\
&\qquad \times \left(E(X^* \mid Z^y_x)\beta + E(U^* \mid Z^y_u) + \varepsilon - \left(E(X^* \mid Z^x) + \varepsilon^x\right)\hat{\beta}_{IV}\right) \\
&= \frac{1}{T}E(U^* \mid Z^y_u)'E(U^* \mid Z^y_u) + \frac{1}{T}\varepsilon'\varepsilon + \frac{1}{T}\beta'E(X^* \mid Z^y_x)'E(X^* \mid Z^y_x)\beta \\
&\quad - \frac{1}{T}\beta'E(X^* \mid Z^y_x)'E(X^* \mid Z^x)\hat{\beta}_{IV} - \frac{1}{T}\hat{\beta}_{IV}'E(X^* \mid Z^x)'E(X^* \mid Z^y_x)\beta \\
&\quad + \frac{1}{T}\hat{\beta}_{IV}'E(X^* \mid Z^x)'E(X^* \mid Z^x)\hat{\beta}_{IV} + \frac{1}{T}\hat{\beta}_{IV}'\varepsilon^{x\prime}\varepsilon^x\hat{\beta}_{IV} + \frac{1}{T}\,\text{cross terms}.
\end{aligned}$$
The first two terms converge in probability to $\sigma^2_{U^*} - \sigma^2_{\zeta,u} + \sigma^2_{\varepsilon}$; the terms involving $\beta$ and $\hat{\beta}_{IV}$ simplify in the limit since $\hat{\beta}_{IV} \xrightarrow{p} \beta$; and the cross terms converge in probability to zero. Then: $s^2 \xrightarrow{p} \sigma^2_{U^*} - \sigma^2_{\zeta,u} + \sigma^2_{\varepsilon} + \beta'\left(Q^{zy}_{xx} - 2Q^{zb}_{xx} + Q^{zx}_{xx} + \sigma^2_{\varepsilon,x}\right)\beta$. The next four subsections discuss the most important implications of LoSE in $X$ and $Y$ for the parameter estimates and standard errors, examining some more specialized examples of this general model that highlight the implications of interest.

3.1 X Mismeasured, Y Not Mismeasured: No LoSE Problems

Given the traditional focus on the effect of mismeasurement in $X$ on regression estimation, we begin with this case, making the following assumption in addition to assumption 1:

Assumption 3 $Y$ is not mismeasured: $Y_t = Y_t^*$.

Then (6) simplifies to:

$$\begin{aligned}
Y_t^* &= X_t^*\beta + U_t^* \\
&= X_t\beta + (X_t^* - X_t)\beta + U_t^* \\
&= X_t\beta - \varepsilon^x_t\beta + \zeta^x_t\beta + U_t^*.
\end{aligned}$$

Not all of the true variation in $X_t^*$ appears in $X_t$ due to LoSE, but all of that variation does appear in $Y_t^*$ through $X_t^*\beta$. The variation in $Y_t^*$ missing from $X_t$ is relegated to the error term of this equation. The OLS regression estimator in this case is:

$$\hat{\beta} = (X'X)^{-1}X'Y = \beta + (X'X)^{-1}X'\left(-\varepsilon^x\beta + \zeta^x\beta + U^*\right).$$
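A minimal simulation sketch of this setup may help fix ideas. It assumes a univariate Gaussian design with no CME in $X$ (so $\varepsilon^x = 0$); the function names, the split of $X^*$ into a known part and a missing part, and all parameter values are hypothetical. It regresses correctly measured $Y^*$ first on $X^*$ and then on the LoSE-contaminated $X$, and reports the slope and residual variance from each regression.

```python
import random

def ols(x, y):
    """Return (slope, residual variance) from a univariate OLS fit with intercept."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    slope = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
    resid_var = sum((yi - my - slope * (xi - mx)) ** 2
                    for xi, yi in zip(x, y)) / (n - 2)
    return slope, resid_var

def simulate_lose_in_x(beta=2.0, sd_lose_x=1.0, sd_u=1.0, n=100_000, seed=2):
    """Assumed design: X* = known + zeta_x, X = known, Y = Y* = X* beta + U*."""
    rng = random.Random(seed)
    x_true, x_meas, y = [], [], []
    for _ in range(n):
        known = rng.gauss(0, 1)            # E(X*|Z^x): the part of X* the estimate captures
        zeta_x = rng.gauss(0, sd_lose_x)   # LoSE in X: signal missing from the estimate
        u = rng.gauss(0, sd_u)             # regression error U*
        x_true.append(known + zeta_x)
        x_meas.append(known)               # no CME in X in this sketch
        y.append((known + zeta_x) * beta + u)
    return ols(x_true, y), ols(x_meas, y)

(slope_true, s2_true), (slope_meas, s2_meas) = simulate_lose_in_x()
print(f"regressor X*: slope {slope_true:.3f}, residual variance {s2_true:.3f}")
print(f"regressor X : slope {slope_meas:.3f}, residual variance {s2_meas:.3f}")
```

Under these assumed settings the missing variation $\zeta^x\beta$ has variance $\beta^2 \cdot 1 = 4$, so the residual variance with the mismeasured regressor should sit near $1 + 4 = 5$ while the slope stays near $\beta = 2$.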
Since $\zeta^x$ is uncorrelated with $E(X^* \mid Z^x) + \varepsilon^x = X$, the LoSE in $X$ introduces no bias into $\hat{\beta}$ in this case. Given assumption 1, $\frac{1}{T}X'\zeta^x \xrightarrow{p} 0$, and the LoSE introduces no inconsistency either. These results rely on the assumption that the LoSE is the difference between truth and a conditional expectation, and measurement error of a different form, such as the systematic biases discussed at the end of section 2, would lead to biased and inconsistent parameter estimates. For multivariate regressions, the consistency result also relies on all $k$ explanatory variables being conditioned on the same information set $Z^x$. Bound, Brown, and Mathiowetz (2001), and Kimball, Sahm, and Shapiro (2008) discuss the case where different elements of $X$ are conditioned on different information sets, causing bias and inconsistency.

Of course, the CME in $X$ produces the usual attenuation bias. By way of review, and for comparison with later results:

$$E(\hat{\beta}) = \beta - E\left((X'X)^{-1}X'\varepsilon^x\right)\beta, \tag{11}$$

and:

$$\hat{\beta} \xrightarrow{p} \beta - \left(Q^{zx}_{xx} + \sigma^2_{\varepsilon,x}\right)^{-1}\sigma^2_{\varepsilon,x}\beta. \tag{12}$$

Instruments uncorrelated with the CME in $X$ yield consistent estimates. To focus more tightly on the implications of LoSE, the remainder of this subsection considers the case of no CME in $X$:

Assumption 4 $\operatorname{var}(\varepsilon^x_t) = 0$.

Then $E(\hat{\beta}) = \beta$, and $\hat{\beta} \xrightarrow{p} \beta$. The variation in $X^*$ that appears in $Y^*$ but is missing from $X$ shows up in the regression error, increasing the variance of the parameter estimates. We have $\operatorname{var}(\hat{\beta}) = E\left(\operatorname{var}(\hat{\beta} \mid X)\right) + \operatorname{var}\left(E(\hat{\beta} \mid X)\right)$, but $E(\hat{\beta} \mid X) = \beta$ and $\operatorname{var}(\beta) = 0$, so the second term vanishes. Then since $U^*$ and $\zeta^x$ are uncorrelated, and
both are uncorrelated with $X$, standard manipulations show:

$$\begin{aligned}
\operatorname{var}(\hat{\beta}) &= E\left(\operatorname{var}(\hat{\beta} \mid X)\right) = E\left(E\left((\hat{\beta} - \beta)(\hat{\beta} - \beta)' \mid X\right)\right) \\
&= E\left(E\left((X'X)^{-1}X'(U^* + \zeta^x\beta)(U^* + \zeta^x\beta)'X(X'X)^{-1} \mid X\right)\right) \\
&= E\left((X'X)^{-1}X'E\left(\left(U^*U^{*\prime} + \zeta^x\beta\beta'\zeta^{x\prime}\right) \mid X\right)X(X'X)^{-1}\right) \\
&= E\left((X'X)^{-1}\right)\left(\sigma^2_{U^*} + \beta'\sigma^2_{\zeta,x}\beta\right).
\end{aligned}$$

Asymptotically, the analogous distributional results hold, as:

$$\sqrt{T}\left(\hat{\beta} - \beta\right) \xrightarrow{d} N\left(0, \left(Q^{zx}_{xx}\right)^{-1}\left(\sigma^2_{U^*} + \beta'\sigma^2_{\zeta,x}\beta\right)\right),$$

and $s^2$ converges to this error variance $\sigma^2_{U^*} + \beta'\sigma^2_{\zeta,x}\beta$. So the LoSE in $X$ increases the variance of the regression error, reducing the power of hypothesis tests.

3.2 Y Mismeasured, X Not Mismeasured, $X_t \in Z^y_{x,t}$: Shrunken Standard Errors

In addition to assumption 1, this subsection makes the following assumptions:

Assumption 5 $X$ is not mismeasured: $X_t = X_t^*$, and $X_t \in Z^y_{x,t}$.

Then $Y_t^* = X_t\beta + U_t^*$. The relation between $X_t$ and the information set $Z^y_{x,t}$ has an important effect on the properties of the OLS regression estimates; this subsection considers $X_t \in Z^y_{x,t}$, and the next $X_t \notin Z^y_{x,t}$.

Since $E(X_t \mid Z^y_{x,t}) = X_t$, we have: $Y_t = X_t\beta + E(U_t^* \mid Z^y_{u,t}) + \varepsilon_t$ in this case. The LoSE impacts only $U_t^*$, so $\zeta_t = U_t^* - E(U_t^* \mid Z^y_{u,t})$, and $\operatorname{var}\left(E(U_t^* \mid Z^y_{u,t})\right) = \sigma^2_{U^*} - \sigma^2_{\zeta}$. The OLS
regression estimates $\hat{\beta}$ as:

$$\begin{aligned}
\hat{\beta} &= (X'X)^{-1}X'Y \\
&= \beta + (X'X)^{-1}X'\left(E(U^* \mid Z^y_u) + \varepsilon\right) \\
&= \beta + (X'X)^{-1}X'\left(U^* - \zeta + \varepsilon\right).
\end{aligned}$$

LoSE in $U^*$ introduces no bias or inconsistency since $Z^y_u$ is uncorrelated with $X$, so the overall measurement error in $Y$ introduces no bias or inconsistency in this case. The assumption that $Y$ is a conditional expectation of $Y^*$ plus noise again plays a critical role for consistency and unbiasedness.

The standard errors around the point estimates are more interesting. For the variance of the point estimates, $\operatorname{var}(\hat{\beta}) = E\left(\operatorname{var}(\hat{\beta} \mid X)\right)$ since $\operatorname{var}\left(E(\hat{\beta} \mid X)\right) = 0$, and:

$$\begin{aligned}
E\left(\operatorname{var}(\hat{\beta} \mid X)\right) &= E\left(E\left((X'X)^{-1}X'\left(E(U^* \mid Z^y_u) + \varepsilon\right)\left(E(U^* \mid Z^y_u) + \varepsilon\right)'X(X'X)^{-1} \mid X\right)\right) \\
&= E\left((X'X)^{-1}\right)\left(\sigma^2_{U^*} - \sigma^2_{\zeta} + \sigma^2_{\varepsilon}\right),
\end{aligned}$$

since $E(U^* \mid Z^y_u)$ and $\varepsilon$ are uncorrelated; the analogous asymptotic results hold. The CME in $Y$ increases the variance of the regression residuals and parameter estimates, and reduces the power of hypothesis tests, similar to LoSE in $X$. The LoSE in $Y$ has an opposite effect, decreasing the variance of the regression residuals and parameter estimates. Measurement error of this type actually increases the power of hypothesis tests.

Power is typically considered an unambiguous good thing, so is the LoSE in $Y$ the type of measurement error we want? To understand some of the issues here, consider a fable. An econometrician regresses $Y^*$ on $X$, estimating $\hat{\beta}$, but cannot reject the hypothesis of interest, $\beta = \beta_0$, because the standard errors are too large. Instead of stopping there, the econometrician decides to employ some other variables at his disposal, a list of variables $Z^y$ that are orthogonal to $X$ but related to $Y^*$. The econometrician then employs a two-stage procedure: (1) regressing $Y^*$ on $X$ and some subset of $Z^y$, computing predicted values which he calls $Y$, and then (2) regressing $Y$ on $X$, testing hypotheses about the relation between $Y^*$ and $X$ using standard errors from this second regression. The test rejects $\beta = \beta_0$ using the shrunken standard errors. The econometrician submits the paper to a top econometrics journal, and it is accepted to great acclaim, as it shows how to reject all false hypotheses. End of fable.

In reality, such a two-step procedure would be unacceptable to any reasonable econometrician. Unfortunately, many macroeconomics papers employing LoSE-contaminated data like GDP growth may have unwittingly engaged in the second stage of this two-stage procedure, with a government statistical agency generating the LoSE in $Y$ in the first stage. Either way, lack of robustness is a major concern. If we consider a case where the regression estimates are biased and inconsistent, with $U^* - \zeta + \varepsilon$ correlated with $X$, then arbitrarily shrinking the regression standard errors leads to a higher rate of rejection of hypotheses that are true. The system of hypothesis testing is designed to minimize such type I errors, and LoSE in $Y$ increases the risk of such errors in cases where the model is misspecified.

In applications where variances themselves are the object of interest, the problems imposed by LoSE in $Y$ are more straightforward. For example, in a regression forecasting context, the variance of the out-of-sample forecasting errors is often a key measure.
The actual variance of the out-of-sample forecast error for the true variable of interest, $Y^*_{t+k} - X_{t+k}\hat{\beta}_t$, with $\hat{\beta}_t$ estimated using mismeasured $Y$, is

$$\sigma^2_{U^*} + \left(\sigma^2_{U^*} - \sigma^2_{\zeta} + \sigma^2_{\varepsilon}\right) X_{t+k}\, E\left((X'X)^{-1}\right) X'_{t+k}.$$

LoSE reduces this variance by $\sigma^2_{\zeta}$, and if that is the predominant source of measurement error, the variance of the forecast errors computed using mismeasured $Y_{t+k}$ gives a misleading sense of precision: the deviations of $Y^*_{t+k}$ from the forecasts are larger, on average, than those mismeasured forecast errors indicate.

3.3 Y Mismeasured, X Not Mismeasured, $X_t \notin Z^y_{x,t}$: Biased Point Estimates

In addition to assumption 1, this subsection makes the following assumptions:

Assumption 6 $X$ is not mismeasured: $X_t = X^*_t$, and $X_t \notin Z^y_{x,t}$.

This case is applicable when the explanatory variables add information about the dependent variable above and beyond that employed in the conditional expectation of the dependent variable. The mismeasured variable of interest in this case is then $Y_t = E\left(X_t|Z^y_{x,t}\right)\beta + E\left(U^*_t|Z^y_{u,t}\right) + \varepsilon_t$. The OLS regression estimator is:

$$\hat{\beta} = (X'X)^{-1}X'Y = \beta + (X'X)^{-1}X'\left(\left(E(X|Z^y_x) - X\right)\beta + U^* - \zeta^{u} + \varepsilon\right) = \beta + (X'X)^{-1}X'\left(-\zeta^{xy}\beta + U^* - \zeta^{u} + \varepsilon\right).$$
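As a rough numerical check on the estimators of subsections 3.2 and 3.3, the following simulation sketch may be helpful. It is not part of the paper's analysis: the function names, the univariate no-intercept design, the unit-variance normal components, and the absence of CME ($\varepsilon = 0$) are all assumptions made purely for illustration. The regressor is split into a part the statistical agency observes and a part it misses, and likewise for $U^*$, so that the conditional expectations reduce to dropping the missed components.

```python
import numpy as np


def ols_slope_and_se(x, y):
    """Slope and conventional standard error from a no-intercept OLS fit."""
    slope = (x @ y) / (x @ x)
    resid = y - slope * x
    s2 = (resid @ resid) / (len(y) - 1)
    return slope, np.sqrt(s2 / (x @ x))


def simulate(n=200_000, beta=1.0, seed=0):
    """Compare OLS on the true Y* with OLS on two LoSE-contaminated versions.

    Hypothetical design (an assumption of this sketch, not the paper's data):
    all components are independent standard normals and there is no CME.
    """
    rng = np.random.default_rng(seed)
    x_seen = rng.normal(size=n)    # part of X in the agency's information set
    zeta_xy = rng.normal(size=n)   # part of X missing from measured Y
    x = x_seen + zeta_xy
    u_seen = rng.normal(size=n)    # part of U* in the information set
    zeta_u = rng.normal(size=n)    # part of U* missing from measured Y
    y_true = beta * x + u_seen + zeta_u
    y_32 = beta * x + u_seen       # subsection 3.2: only zeta_u is missing
    y_33 = beta * x_seen + u_seen  # subsection 3.3: zeta_xy * beta is missing too
    cases = {"true": y_true, "lose_32": y_32, "lose_33": y_33}
    return {name: ols_slope_and_se(x, y) for name, y in cases.items()}


if __name__ == "__main__":
    for name, (slope, se) in simulate().items():
        print(f"{name}: slope={slope:.3f}, se={se:.5f}")
```

Under these assumptions the slope in the 3.2 case should stay close to $\beta$ while its standard error shrinks, and the slope in the 3.3 case should be close to $\beta$ times the share of the variance of $X$ captured by the information set (one half in this design).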
Bias and inconsistency are evidently issues here. $X = E(X|Z^y_x) + \zeta^{xy}$ is clearly not independent of $-\zeta^{xy}\beta$, and:

$$E(\hat{\beta}) = \beta - E\left((X'X)^{-1}X'\zeta^{xy}\right)\beta = E\left((X'X)^{-1}X'E(X|Z^y_x)\right)\beta. \tag{13}$$

$$\hat{\beta} \xrightarrow{p} \beta - (Q_{xx})^{-1}\sigma^2_{\zeta,xy}\beta = (Q_{xx})^{-1}Q^{zy}_{xx}\beta. \tag{14}$$

The inconsistency of $\hat{\beta}$ tends towards zero, since $Q_{xx}$ equals $Q^{zy}_{xx}$ plus another positive semidefinite matrix $\sigma^2_{\zeta,xy}$. Some variation in $X$ that appears in $Y^*$ is missing from mismeasured $Y$, essentially driving down the covariance between $X$ and $Y$, and driving down the parameter estimates as well since the variance of $X$ is not biased down. If $X$ is univariate, the inconsistency of $\hat{\beta}$ is unambiguously towards zero, similar to standard attenuation bias from CME in the explanatory variable of a regression. Indeed, comparing these bias and inconsistency results with (11) and (12), it is clear that CME in $X$ and LoSE in $Y$ of the type in this subsection lead to biases that are essentially equivalent algebraically.

Instruments $W$ that meet the conditions of assumption 2 in this case are those for which $X'P_W\zeta^{xy}$ converges in probability to zero, for example if $W_t \in Z^y_{x,t}$, so that $W_t$ is independent of the information about $X^*_t$ missing from $Y_t$. Instruments typically thought of as valid based on other considerations may not meet this condition. The asymptotic distribution of the IV regression estimates $\hat{\beta}$ is:

$$\sqrt{T}\left(\hat{\beta} - \beta\right) \xrightarrow{d} N\left(0, (Q^{w}_{xx})^{-1}\left(\sigma^2_{U^*} - \sigma^2_{\zeta,u} + \sigma^2_{\varepsilon} + \beta'\sigma^2_{\zeta,xy}\beta\right)\right),$$
with $s^2$ converging to this asymptotic variance.

3.4 Both X and Y Mismeasured: Illuminating Special Cases

Again for simplicity, and to focus on the effects of LoSE, this section considers the case of no CME in $X$, so assumption 4 holds, as well as assumption 1. Three special cases are illuminating. The first is where the information sets used to construct $Y$ and $X$ coincide in the universe of variables correlated with $X$, so $Z^y_x = Z^x$. Then $E(X^*|Z^y_x) = E(X^*|Z^x)$, so their difference in (8) and (9) disappears, leaving unbiased and consistent regression parameter estimates. The variance and asymptotic distribution of $\hat{\beta}$, and the probability limit of $s^2$, are as in subsection 3.2. The main concern under these circumstances is the shrinking effect of LoSE on standard errors.

The second illuminating case is where $Z^y_x \subset Z^x$, so $Z^x$ contains all the information about $X^*$ in $Z^y_x$, plus additional information. The difference $E(X^*|Z^x) - E(X^*|Z^y_x)$ is uncorrelated with $Z^y_x$; substituting this difference for $\zeta^{xy}$ in subsection 3.3 then leaves the results of that section unchanged. The estimate $\hat{\beta}$ is biased and inconsistent, with the bias towards zero; some variation in measured $X$ that appears in $Y^*$ is missed by measured $Y$, biasing down the covariance between $X$ and $Y$. Valid instruments must be in the information set used to compute the more-poorly measured $Y$.

The last illuminating case is where $Z^y_x$ contains all the information about $X^*$ in $Z^x$ plus additional information, so $Z^y_x \supset Z^x$. Then $E(X^*|Z^y_x) - E(X^*|Z^x)$ is uncorrelated with $Z^x$ and $X$, and if this difference replaces $\zeta^{x}$ in subsection 3.1, the results in that subsection carry over to this case, except LoSE in $U^*$ shrinks the error and parameter variances. The estimates are unbiased and consistent.

These cases should help provide some intuition about the potential effects of LoSE in
particular regression applications where the econometrician has some knowledge of the relative degree of LoSE mismeasurement in the explanatory and dependent variables. For each application, whether $Z^y_x \supset Z^x$, $Z^y_x = Z^x$, or $Z^y_x \subset Z^x$ provides the best description of reality determines which results are most relevant, those from subsection 3.1 (augmented with LoSE in $U^*$), 3.2, or 3.3. For example, the extent of any bias in the parameter estimates depends on the degree to which the mismeasured explanatory variables contain signal missing from the dependent variable.

4 Data

4.1 Discussion of U.S. Macro Quantity Data

Each estimated growth rate of a macro quantity such as gross domestic product (GDP) is an attempt at measuring the growth in the value of all relevant economic transactions, in the entire economy, from one fixed time period to the next. For an entity as large as the U.S. economy, this is a daunting, almost mind-boggling task, as the number of transactors and transactions is typically enormous, with little or no information recorded about many of them at high frequencies. Attempts to measure changes in these macro quantities are much more ambitious than attempts to measure similar changes for a single person, household, or even company. Simply due to their broad, universal nature, estimates of macro quantities are likely to miss more information (that is, to be contaminated with more LoSE) than are estimates of micro quantities.2

2 Some micro data sources may be contaminated with LoSE as well; see the references on microeconomic survey data in the introduction. As another micro example, consider company earnings: it has long been suspected that the management of many publicly traded corporations “smooth” quarterly earnings to meet their guidance (prior estimates of what their earnings would be). Such a spurious reduction in the variability of measured earnings growth should effectively add LoSE to those measures.
Of course, the nature of the available source data determines the information content of the macro variable of interest, and frequency is important in this regard in the case of data from the U.S. National Income and Product Accounts (NIPA).3 The most comprehensive data on GDP and other major NIPA aggregates are only available at the quinquennial frequency (every five years), at the time of the major economic censuses. Even then, resource constraints make true census counts impossible. Many transactions in the underground economy remain unobserved and must be estimated, and some “above-ground” transactions are simply missed by any census.4

At the annual frequency, the GDP source data are typically samples drawn from the census universe. These samples can be quite large, capturing a sizeable fraction of the relevant value of transactions, but they are typically skewed towards measuring the transactions of larger businesses. As such, they may miss variation arising from the transactions of small companies and from businesses starting, shutting down, and operating in the underground economy. The underrepresentation of these segments of the economy may add or subtract variance to the official estimates, depending on the relation of the poorly-measured segments to the better-measured segments, but this type

3 The growth rates of real quantities are of interest in most economics applications. In the NIPAs, real quantities are typically estimated by gathering the appropriate nominal source data and the appropriate price indexes, and then deflating the former with the latter. The discussion of LoSE in source data here focuses on the nominal source data, but there may exist significant LoSE stemming from the price indexes as well. Measured price indexes may miss fluctuations in the quality of goods, from either the introduction of new goods or modifications of existing goods; see Bils and Klenow (2001) and Bils (2004).
The length of their time series is short, but Broda and Weinstein (2007) do provide some evidence that product creation (and hence quality improvement embedded in new products) is pro-cyclical, implying counter-cyclical variation in prices. If standard price indexes miss this counter-cyclical variation, real quantities deflated by these indexes may not be variable enough.

4 In this regard, it should be noted that the Bureau of Labor Statistics and the Census Bureau each maintain a list which attempts to track the entire universe of business establishments in the US, from which each agency draws samples. A 1994 comparison of the two lists found a non-trivial number of non-matches, that is, establishments on one list but not the other.
of mismeasurement has the potential to add some LoSE to the data. More worrisome, usable data on the value of transactions at the annual frequency is unavailable for a substantial share of the NIPA aggregates; many of the services categories of personal consumption expenditures (PCE) lack usable source data, for example. It is difficult to imagine how this lack of hard information would not introduce some LoSE into the estimates. At the annual frequency, and also at the quarterly frequency to some extent, government tax and administrative records are used as an additional source of information about the value of transactions, especially on the income side of the accounts in the components of GDI. These data can be informative, but underreporting makes them less than fully comprehensive.

At the quarterly and monthly frequency,5 reliance on samples is more pronounced, and the samples are less comprehensive. Smaller samples introduce larger sampling errors, which have traditionally been thought of as introducing CME into the estimates. The samples are typically random, so part of the difference between the population and sample moments is likely random variation uncorrelated with the variation in the population moments. However, smaller samples may introduce some LoSE as well; if the samples are not fully representative, they may miss variation arising from some segments of the population.6 And a greater fraction of the NIPA aggregates lacks hard data on the value of transactions at frequencies higher than annual, with the services

5 Treatment of seasonality immediately becomes a major issue when moving to frequencies higher than annual, and identification of the seasonal patterns of interest, the “true” seasonal factors, can be tenuous; see Watson (1987). Seasonal adjustment programs are all essentially smoothing algorithms, and as such they risk introducing LoSE into the data.

6 Samples for which topcodes are binding by definition miss variation arising from the top-coded units.
The samples used in the construction of the U.S. NIPA data are not top-coded for the most part, but analysts at the Bureau of Economic Analysis (BEA) do look at very detailed categories of data and trim outliers, which may have an effect similar to topcoding.
categories of personal consumption expenditures (PCE) again particularly vulnerable to this criticism.7 Quarterly and monthly growth rates are typically interpolated using related indicators, or estimated as “trend extrapolations.” The lack of hard information again seems highly likely to introduce some LoSE into the estimates.

4.2 GDP growth, GDI growth and LoSE

Our first example of measurement error in macroeconomic quantity data that appears to be LoSE comes from examining the numerous revisions to US GDP and GDI growth. These revisions incorporate more comprehensive and higher-quality source data, and so plausibly reduce measurement error in the estimates. For example, suitable source data is unavailable for many components of the “advance” current quarterly GDP estimate released about a month after the quarter ends. Source data for some of those components is incorporated into the revised “final” current quarterly estimate released about two months later, and higher-quality data are incorporated at subsequent annual and benchmark revisions, likely bringing the estimate closer to its true value.8 Then an early estimate of GDP growth or GDI growth can be modelled as a later revised estimate plus a measurement error term that disappears with revision. Table 1 shows that the initial estimates have less variance than the revised estimates, violating the

7 This situation has begun to change, with the introduction of the Quarterly Services Survey (QSS) in 2002, but so far the BEA uses the QSS for a relatively small share of PCE services.

8 For more on revisions to GDP, see Grimm and Weadock (2006). An estimate of GDI growth is not released at the time of the “advance” GDP estimate because of data limitations, but GDI is always released at the time of the “final” current quarterly estimate. For GDI, subsequent revisions incorporate information from administrative and tax records that is much more comprehensive than the samples used to compute the “final” current quarterly estimates.
variance restrictions of the CME model.9 The generalized model here implies that the bulk of the measurement error is LoSE, as noted by Mankiw and Shapiro (1986).

Our second example of LoSE in macroeconomic quantity data comes from examining the fully-revised estimates of GDP and GDI growth. Some users of quarterly or annual US GDP growth and its major subcomponents think the measurement error in the data is negligible after it has passed through all the revisions. But GDI growth measures the same underlying concept as GDP growth, so if the two diverge, at least one of them must be mismeasured. Table 2 shows variances and covariances of the estimates, before and after 1984, when the variance of the estimates appears to have fallen dramatically (see McConnell and Perez-Quiros (2000)). Prior to 1984, the variance of each estimate is close to the covariance between the two; the estimates diverge little, providing minimal evidence of mismeasurement. However, after 1984, the covariance falls more than the variances, on average; this is especially true for the quarterly growth rates, where the correlation between the estimates falls from 0.93 to 0.68.10

Interestingly, the variance of GDI growth also increases relative to the variance of GDP growth. Under the generalized CME model of section 2, this relatively large GDI variance may stem from some combination of two possible sources: (1) a relatively large amount of CME in GDI growth, boosting its variance, and (2) a relatively large amount of LoSE in GDP growth,

9 These are annualized quarterly growth rates. Each quarterly observation in the “advance” or “final” time series is the “advance” or “final” estimate for that quarter, i.e. the estimate released one or three months after that quarter closes.
We end the sample in 2004 so that all observations in our latest available time series have passed through three annual revisions, ensuring each observation is much more heavily revised than the corresponding “advance” or “final” current quarterly observation.

10 At the annual frequency, the correlation falls from 0.98 to 0.94; the decline is smaller at this frequency primarily because the variance of GDP growth falls below its covariance with GDI growth. This cannot happen in either the pure CME model or the generalization favored here, i.e. the variance of each estimate must be larger than their covariance; see Fixler and Nalewaik (2007). Given that only 20 observations are employed to compute these moments, this may be a small-sample estimation issue.
damping its variance. The evidence favors the latter as the more important source of mismeasurement.

First, consider the results in Nalewaik (2007a), who estimates a two-state bivariate Markov switching model where the means of quarterly GDP and GDI growth switch with the state; the low-growth states identified by the model encompass NBER-defined recessions. The conditional variance of GDI in that model, conditional on the estimated state of the world, is actually slightly lower than the conditional variance of GDP, even though its unconditional variance is higher. The higher unconditional variance stems from GDI growing faster than GDP in high-growth periods, on average, and slower than GDP in slow-growth periods in and around recessions. In other words, GDI growth appears to contain more signal about the state of the world than GDP growth: the larger spread between its high- and low-growth means implies greater informativeness about the state. Greater signal in GDI growth implies relatively more LoSE in GDP growth.

Second, table 1 shows that the variance of GDI growth becomes relatively large only after the data pass through annual and benchmark revisions. In the earlier estimates, the variance of GDP growth actually slightly exceeds the variance of GDI growth. Since the revisions plausibly bring the estimates closer to their true values, they must either reduce LoSE, adding variance, or reduce CME, subtracting variance. Then the relatively large increase in variance of GDI must stem from a relatively large drop in LoSE. The revisions appear to add more signal to GDI growth than GDP growth, which in turn suggests that GDI growth has greater signal overall, since the pre-revision estimates started with roughly equal variance.11 Fixler and Nalewaik (2007) discuss the revisions evidence in

11 The results in Nalewaik (2007b) support this interpretation of the revisions. Using the Markov
more detail, testing the hypothesis that the idiosyncratic variation in GDI growth is purely CME and rejecting at conventional significance levels. This again implies some LoSE in GDP growth.

To get a sense of the magnitude of the potential variance missing from GDP growth due to LoSE, assume that the CME variance in each estimate is negligible, so the differences between GDP and GDI growth stem entirely from differential LoSE:

$$\Delta Y^{GDP}_t = E\left(\Delta Y^*_t|Z^{GDP}_t\right) = \Delta Y^*_t - \zeta^{GDP}_t, \quad \text{and:}$$

$$\Delta Y^{GDI}_t = E\left(\Delta Y^*_t|Z^{GDI}_t\right) = \Delta Y^*_t - \zeta^{GDI}_t.$$

Taking variances as in (4) yields:

$$\mathrm{var}\left(\Delta Y^{GDP}_t\right) = \mathrm{var}(\Delta Y^*_t) - \mathrm{var}\left(\zeta^{GDP}_t\right),$$

$$\mathrm{var}\left(\Delta Y^{GDI}_t\right) = \mathrm{var}(\Delta Y^*_t) - \mathrm{var}\left(\zeta^{GDI}_t\right), \quad \text{and:}$$

$$\mathrm{cov}\left(\Delta Y^{GDP}_t, \Delta Y^{GDI}_t\right) = \mathrm{var}(\Delta Y^*_t) - \mathrm{var}\left(\zeta^{GDP}_t\right) - \mathrm{var}\left(\zeta^{GDI}_t\right) + \mathrm{cov}\left(\zeta^{GDP}_t, \zeta^{GDI}_t\right).$$

The idiosyncratic variance of one estimate (its variance minus its covariance with the other estimate) is then proportional to the LoSE in the other estimate:

$$\mathrm{var}\left(\Delta Y^{GDP}_t\right) - \mathrm{cov}\left(\Delta Y^{GDP}_t, \Delta Y^{GDI}_t\right) = \mathrm{var}\left(\zeta^{GDI}_t\right) - \mathrm{cov}\left(\zeta^{GDP}_t, \zeta^{GDI}_t\right), \quad \text{and:}$$

$$\mathrm{var}\left(\Delta Y^{GDI}_t\right) - \mathrm{cov}\left(\Delta Y^{GDP}_t, \Delta Y^{GDI}_t\right) = \mathrm{var}\left(\zeta^{GDP}_t\right) - \mathrm{cov}\left(\zeta^{GDP}_t, \zeta^{GDI}_t\right).$$

The information missed by both estimates is $\mathrm{cov}\left(\zeta^{GDP}_t, \zeta^{GDI}_t\right)$; the idiosyncratic variance of GDI growth is then the variance of the information about $\Delta Y^*_t$ missing from measured GDP growth minus the part of that information also absent from GDI growth.

switching model in Nalewaik (2007a), Nalewaik (2007b) shows that the revisions increase mean GDI growth in high-growth states and reduce mean GDI growth in low-growth states, effectively increasing its informativeness about the state of the economy.
The revisions increase the gap between the high- and low-growth means for GDP growth as well, but the increase is not as large as the increase for GDI growth.
Rearranging the covariance provides a lower bound on the variance of $\Delta Y^*_t$:

$$\mathrm{var}(\Delta Y^*_t) = \mathrm{cov}\left(\Delta Y^{GDP}_t, \Delta Y^{GDI}_t\right) + \left(\mathrm{var}\left(\zeta^{GDI}_t\right) - \mathrm{cov}\left(\zeta^{GDP}_t, \zeta^{GDI}_t\right)\right) + \left(\mathrm{var}\left(\zeta^{GDP}_t\right) - \mathrm{cov}\left(\zeta^{GDP}_t, \zeta^{GDI}_t\right)\right) + \mathrm{cov}\left(\zeta^{GDP}_t, \zeta^{GDI}_t\right),$$

so:

$$\mathrm{var}(\Delta Y^*_t) > \mathrm{cov}\left(\Delta Y^{GDP}_t, \Delta Y^{GDI}_t\right) + \left(\mathrm{var}\left(\zeta^{GDI}_t\right) - \mathrm{cov}\left(\zeta^{GDP}_t, \zeta^{GDI}_t\right)\right) + \left(\mathrm{var}\left(\zeta^{GDP}_t\right) - \mathrm{cov}\left(\zeta^{GDP}_t, \zeta^{GDI}_t\right)\right). \tag{15}$$

The last column of table 2 uses this equation to set an upper bound on the fraction of variance of $\Delta Y^*_t$ captured by measured GDP growth: $\frac{\mathrm{var}\left(\Delta Y^{GDP}_t\right)}{\mathrm{var}(\Delta Y^*_t)}$. Measured GDP growth captures at most 70% of the variation in $\Delta Y^*_t$ after 1984, under the assumption of negligible CME. Of course the assumption of no noise is an extreme one, particularly for the quarterly estimates. Indeed, the evidence in the next subsection from regressions involving GDP growth, GDI growth, and stock prices suggests that about a quarter of the variance of quarterly GDP growth might be noise, but these results actually tighten the upper bound, decreasing it from 70% to 64%. And this is in fact an upper bound, since it does not account for $\mathrm{cov}\left(\zeta^{GDP}_t, \zeta^{GDI}_t\right)$, the variation in $\Delta Y^*_t$ missed by both measured GDP and GDI growth. The variation missed by both estimates could be substantial.

Going forward, if these post-1984 variances and covariances are the norm, the implications of a potentially non-trivial amount of LoSE in macro data such as GDP growth should be taken seriously. For estimation and inference, the post-1984 portion of many samples will become increasingly large and important.
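The arithmetic behind the upper bound can be sketched in a few lines. The function below is only an illustration of inequality (15) under the negligible-CME assumption: the function name is hypothetical, and the moments in the usage note are made-up round numbers, not the entries of table 2.

```python
def gdp_variance_share_upper_bound(var_gdp, var_gdi, cov_gdp_gdi):
    """Upper bound on var(GDP growth) / var(true growth) implied by (15).

    Assumes negligible CME, so each idiosyncratic variance equals the LoSE
    variance in the *other* estimate net of the commonly missed variation.
    """
    idio_gdp = var_gdp - cov_gdp_gdi  # var(zeta_GDI) - cov(zeta_GDP, zeta_GDI)
    idio_gdi = var_gdi - cov_gdp_gdi  # var(zeta_GDP) - cov(zeta_GDP, zeta_GDI)
    if idio_gdp < 0 or idio_gdi < 0:
        # Under the model each variance must be at least the covariance.
        raise ValueError("moments violate the no-CME LoSE model")
    # Lower bound on var(true growth): right-hand side of (15).
    true_var_lower_bound = cov_gdp_gdi + idio_gdp + idio_gdi
    return var_gdp / true_var_lower_bound


if __name__ == "__main__":
    # Hypothetical moments chosen only to illustrate the calculation.
    print(gdp_variance_share_upper_bound(var_gdp=4.0, var_gdi=5.0, cov_gdp_gdi=3.0))
```

With the hypothetical moments above, the lower bound on the true variance is 3 + 1 + 2 = 6, so measured GDP growth would capture at most two-thirds of it. Because the commonly missed variation is left out of the denominator, the true share can only be smaller.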
4.3 Regression Evidence of LoSE in GDP growth

Beyond careful study of the second moments in tables 1 and 2, section 3.3 provides a strategy for constructing regression tests for LoSE in GDP growth, if we can identify
variables that plausibly capture some of its missing variation. In this subsection we consider stock and bond prices. Of course, some variation in these asset prices likely arises from misinformation, rational or irrational bubbles, and other factors unrelated to fundamentals, but this does not imply that more-informative variation is not present as well. Dynan and Elmendorf (2001) and Fixler and Grimm (2006) show that asset prices predict revisions to GDP growth, evidence that asset prices contain information missed by the initial government estimates.

Asset prices may contain information missed by the fully-revised estimates as well, and that information may appear in asset prices in at least two ways. First, information about the state of the economy that is not fully incorporated into GDP growth, but is publicly available and thus observable by the vast majority of asset market participants, is likely incorporated into asset prices. The source data used to compute GDI appears to be part of this information, which does move financial markets; see Faust et al. (2003). Second, asset prices aggregate the private information of vast numbers of market participants, private information that is likely correlated with current or future economic activity. For example, the stock price of a company may reflect numerous pieces of private information about that company’s cash flow prospects. Aggregating across all firms, the idiosyncratic variation in firms’ stock prices averages out, so an aggregate stock market index contains the signal about aggregate economic activity dispersed in all the pieces of private information.12 Stock and bond prices are fundamentally tied to

12 Even if the aggregate stock price contains useful information about aggregate activity, that does not necessarily imply that any individual holds particularly useful private information; the aggregation of dispersed private information by the market is key. See Hayek (1945).
Nalewaik (2006) makes a similar argument about consumption growth.
economic activity, with market participants placing bets with real money about current and future economic prospects.13

To test for LoSE in GDP growth, consider a regression of several of our quarterly output growth measures on current and lagged growth rates of the Wilshire 5000 stock price index.14 Fama (1990) studied a similar specification, which can be motivated theoretically in a number of ways.15 For our purposes here, it suffices that a relation between true output growth $Y^*$ and stock prices $X$ does exist, governed by a true parameter vector $\beta$.

Employing the post-1984 sample, the first column of results in table 3 uses the time series of “advance” GDP growth as the dependent variable of the regression, while the second column uses the latest available estimates. Note that each standard error in the second regression exceeds its counterpart in the first (these are Newey-West (1987) corrected for heteroskedasticity and second-order autocorrelation). It seems the LoSE in “advance” GDP growth shrinks the standard errors, as discussed in section 3.2. More importantly, most of the regression coefficients in the first regression appear biased down relative to the second. We must be a little careful here, since intuition about univariate

13 In related research, Evans and Lyons (2005, 2007) provide a description of how private information about the economy becomes embedded in exchange rates through the marketmakers’ filtering of order flow information.

14 The stock price changes are quarterly growth rates, while the output growth measures are annualized quarterly growth rates as in tables 1 and 2. The stock price index is nominal. The results change little if the stock price index is deflated with the GDP deflator; deflating introduces some measurement error issues into the explanatory variables and for our purposes here it seems best to avoid that.

15 At least two non-mutually-exclusive theories can motivate this relation.
First, stock prices may respond to news about current and future economic growth and its effect on expected cash flow to firms. For an analogous specification modelling the relation between income growth and current and lagged consumption growth, see Hansen, Roberds and Sargent (1987) and Nalewaik (2006). Second, stock price variation may have a causal effect on current and future economic growth, through wealth effects on consumer spending, for example.
attenuation bias does not necessarily hold for all coefficients in a multivariate setting, but $(X'X)^{-1}$ is close to diagonal since $\Delta p_t$ is approximately serially uncorrelated, so that intuition is roughly correct. The last row of the table reports the sum of the coefficients and its standard error for ease of interpretation. The difference between the sums in the first two regressions, 0.072, is statistically significant, with a standard error of 0.036 correcting for cross-correlation and second-order cross-autocorrelation between the two sets of residuals. We reject the hypothesis that the measurement error in “advance” GDP growth does not bias down the regression coefficients. This downward bias of about two-thirds is roughly in line with the ratio of the variances in table 1, about three-fourths.

The third column of results in table 3 uses latest GDI growth as the dependent variable. The large increase in some of the regression coefficients is striking, with the sum of the coefficients increasing by 0.116 compared to the regression using latest GDP growth, with a standard error of 0.040. It is tempting to conclude that GDI growth is more informative about true output growth than GDP growth, leading to a greater LoSE-induced attenuation bias in regressions using GDP growth. However, a couple of alternate interpretations must be addressed, related to the fact that direct measures of corporate profits are included in GDI.

One alternate interpretation is that estimates of corporate profits are noisy, and stock prices react to some of that noise. Then $\varepsilon$ is positively correlated with $X$ in regressions using GDI growth as the dependent variable, biasing the coefficients up. If this were the case, the problem should be more severe for the early estimates of GDI growth, since the market reacts to these initial estimates in real time. Yet the fourth column of results in table 3 shows that the sum of the coefficients using the early “final” estimates of GDI
growth is less than half the sum of the coefficients using the revised estimates released several years later. This noise interpretation does not fit the facts.

A second alternate interpretation is that GDI growth contains superior information about corporate profits but not output growth, and stock prices are responding to that profits information. This hypothesis can be examined most directly by regressing the growth rate of GDI minus corporate profits (deflated by the GDP deflator) on the stock price changes. If this interpretation is correct, stripping out profits should reduce the sum of the coefficients, but the last column of the table shows that the sum of the coefficients actually increases. We are left with the conclusion that, compared to GDP growth, GDI growth contains superior information about true output growth.

Digging a little deeper, table 3A examines a univariate regression where the explanatory variable is the average stock price change over the current and six previous quarters; table 3B then reverses the regression, using the average stock price change as the dependent variable. Interestingly, the coefficient using GDP growth as the explanatory variable is about three-fourths the size of the coefficient using GDI growth, suggesting some noise in GDP growth (although the 0.146 difference between the slopes has a relatively large standard error of 0.088). Assuming that one-fourth of the variance of GDP growth is in fact noise, then GDP growth captures at most 64 percent of the variance of true output growth, recomputing the upper bound on $\frac{\mathrm{var}(\Delta Y^*_t) - \mathrm{var}\left(\zeta^{GDP}_t\right)}{\mathrm{var}(\Delta Y^*_t)}$ using (15). In this case, the variance of the signal in GDP growth is about equal to its covariance with GDI growth, and the ratio of this signal variance to the variance of GDI growth gives the upper bound.
If all information about output growth is contained in GDI growth, so $\Delta Y^* = \Delta Y^{GDI}$, this bound holds and the coefficients in the third columns of tables 3, 3A and 3B are the true parameter vector $\beta$. However, if some information about
true output growth is missing from GDI growth, these coefficients are themselves biased down, and unfortunately we do not know the size of this downward bias.

Evidence from the forward and reverse regressions does not support the measurement error model discussed at the end of section 2, $Y_t = \alpha_0 + \alpha_1 Y^*_t + \varepsilon_t$. Supposing that model were true for a moment, the results in tables 3 and 3A require a larger $\alpha_1$ for GDI growth than for GDP growth. But then in the reverse regression in table 3B, the larger $\alpha_1$ implies the coefficient on GDI growth should be smaller than the coefficient on GDP growth. Noise in GDP growth may lower the latter coefficient, of course, but half the variance of GDP growth must be noise to make this particular model consistent with the point estimates in tables 3A and 3B, with none of that noise appearing in GDI growth.16 Given that the covariance between GDP growth and GDI growth accounts for more than half the variance of GDP growth, this seems unlikely. Indeed, a test of the hypothesis that the covariance between GDP and GDI growth (about 3.1) is half the variance of GDP growth (about 4.2) rejects with a p-value of about 0.01.

These results using stock prices are largely confirmed by regressions of the different output growth measures on bond prices, as shown in table 4. The explanatory variables are TERM, the difference in yield between 10-year and 2-year US treasury notes, and DEF, the difference in yield between corporate bonds and 10-year treasury notes.17 Numerous papers have used similar variables to forecast output growth; see Chen (1991) and Estrella and Hardouvelis (1991), for example. The table examines regressions at

16 The ratio of the coefficients $\beta^{GDI}/\beta^{GDP}$ in table 3A implies that $\alpha^{GDI}_1/\alpha^{GDP}_1 \approx 1.5$; so with no noise in either measure, $\beta^{GDI}/\beta^{GDP}$ in table 3B should be about 0.66. Since that ratio is about 1.32, the attenuation bias from noise in GDP growth must be substantial: assuming no noise in GDI growth, half the variance of GDP growth must be noise if this model holds.

17 The corporate bond yield measure is the Merrill Lynch High Yield Master II Index. This series extends back only as far as 1986; hence the shorter sample for these regressions.
forecasting horizons ranging from one- to eight-quarters ahead; DEF has substantial explanatory power at shorter horizons, while TERM shows some explanatory power at longer horizons. All of the coefficients except one and all of the standard errors increase whenweswitchfrom“advance”GDPasthedependent variabletolatestGDP.Switching from latest GDP to latest GDI, the coefficients again all increase, except for TERM at the one- and two-quarter ahead horizons when its statistical significance and marginal explanatory power are weakest. The last column reports p-values from an F-test of equal coefficients from the GDP and GDI regressions; equality is rejected at the threeand four- quarter ahead horizons. The evidence here again suggests that LoSE in GDP growth biases down these regression coefficients. Similar results obtained from univariate regressions using either TERM or DEF, although the standard errors around the TERM coefficients were large, making definitive statements from those regressions difficult. Using DEF as the dependent variable in reverse regressions, coefficients were smaller using GDP growth as the explanatory variable than using GDI growth, supporting evidence of some noise in GDP growth. The coefficients using GDP growth were between 12 and 42 percent smaller, depending on horizon. The magnitudes of the coefficients from these forward and reverse regressions at several horizons are again inconsistent with the model Y = α +α Y(cid:2) +ε . t 0 1 t t 5 Conclusions and Implications The canonical classical measurement error (CME) model is too restrictive to handle important cases of mismeasurement, including mismeasurement in some widely-used macroeconomic time series. The paper discusses a simple generalization of the CME 36
model that is mathematically tractable, embeds the CME model as a special case, and adds useful flexibility. Instead of just allowing mismeasurement that adds noise to the true variable of interest, the generalization permits mismeasurement that subtracts signal; I label this reduction of signal from mismeasurement the Lack of Signal Error, or LoSE for short.

In some ways, this generalization of the CME model provides the second half of the story about errors in variables and their effect on ordinary least squares (OLS) regression, as the results here exhibit a symmetry that is intuitively pleasing. CME in the dependent variable of a regression Y does not bias parameter estimates and increases standard errors; in the baseline case, LoSE in the explanatory variables X has the same effect. CME in the explanatory variables X does bias regression parameter estimates, of course, towards zero in the univariate case; LoSE in the dependent variable Y introduces a similar attenuation-type bias under some circumstances, namely, when some of the signal missing from the dependent variable Y is captured by the explanatory variables X. LoSE in Y also shrinks the variance of the regression residuals, raising concerns about the robustness of hypothesis tests.

The paper reviews evidence in Fixler and Nalewaik (2007) and Nalewaik (2007a,b) that US GDP growth is mismeasured with LoSE. The initial estimates of GDP growth are contaminated with a particularly large amount of LoSE, but the estimates that have passed through the BEA’s long sequence of revisions remain contaminated with LoSE. A separate estimate of output growth produced by the BEA, GDI growth, appears to be contaminated with less LoSE than GDP growth, and a comparison of the two shows that, since the mid-1980s, GDP growth at the annual or quarterly frequency has captured at most 70% of the variance of the true output growth. US GDP growth
and its subcomponents like consumption growth have served as the dependent variables in many regression studies in macroeconomics and finance; the potential for biases in these regressions stemming from mismeasurement of the dependent variable has not been contemplated in a serious way prior to this paper. Asset prices are a set of variables that may capture some of the signal missing from GDP growth and its subcomponents, implying attenuation-type biases in regressions of the mismeasured quantities on those prices. The empirical results here confirm that. In regressions of different measures of output growth (initial GDP growth, revised GDP growth, and GDI growth) on either stock prices or bond prices, the measures of output growth that appear contaminated with more LoSE have smaller coefficients, and the changes in the coefficients across regressions are often statistically significant. The set of explanatory variables is fixed from regression to regression; the only thing changing is the degree of measurement error in the dependent variable. We reject the CME intuition that measurement error in the dependent variable does not bias regression coefficients.

Some implications of significant LoSE in GDP growth and its major subcomponents follow immediately. First, those variables are simply less informative than many macroeconomists currently believe, given the common but incorrect presumption that the fully-revised estimates are measured with little error. Second, in a macro forecasting context, true forecast errors are larger, on average, than forecast errors computed using data mismeasured with LoSE. Estimated forecast error variances overstate the accuracy of the forecasts for the true variable, usually the object of interest in forecasting.

For estimating the parameters of structural economic models on macroeconomic data, LoSE clearly poses some serious problems as well.
For example, in estimating parameters underlying the permanent income hypothesis (PIH) with regressions of consumption
on income, LoSE in the macroeconomic consumption data is likely to bias those parameters down and shrink their standard errors, risking rejection of hypotheses that are true. A large fraction of consumption lacks source data at the quarterly and even the annual frequency, so this component of GDP is likely to be particularly contaminated with LoSE.

As another example, consider Euler equation estimates of the relation between macro consumption growth $\Delta c_t$ and interest rates $r_t$; see Campbell and Mankiw (1989). True consumption growth may have substantial covariance with interest rates, but mismeasured consumption growth likely misses some of this variation, biasing the OLS regression coefficient towards zero. Lagged interest rates are almost universally assumed to be valid instruments in estimating the Euler equation, and they may be valid for dealing with expectational errors and some other forms of endogeneity. However, if interest rates contain information about actual contemporaneous consumption growth missed by measured consumption growth, lagged interest rates likely contain just as much if not more of this missing information, since interest rates are basically forward-looking. Lagged interest rates are not valid instruments, and the instrumental variables parameter estimates remain biased towards zero.

On a positive note, the results derived here provide some clear prescriptions for handling different types of mismeasurement, in terms of choice of instruments, and also choice of which variable is dependent Y, and which is explanatory X.
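The prescriptions about which variable to treat as dependent rest on the symmetry between CME and LoSE summarized earlier in this section. The following is a minimal simulation sketch of that symmetry, not the paper's own code. The data-generating process, the unit variances, and the names `x1`, `x2`, `u`, `simulate`, and `ols_slope` are all assumptions made for illustration: the true regressor is split into a measured part `x1` and a part `x2` that a LoSE-contaminated measure misses.

```python
import numpy as np


def ols_slope(y, x):
    """Slope from a univariate OLS regression of y on x (with intercept)."""
    c = np.cov(x, y)
    return c[0, 1] / c[0, 0]


def simulate(n=200_000, beta=1.0, seed=0):
    """Compare OLS slopes under four kinds of mismeasurement.

    Hypothetical setup: all shocks are independent N(0, 1) draws.
    """
    rng = np.random.default_rng(seed)
    x1 = rng.standard_normal(n)   # signal that is always measured
    x2 = rng.standard_normal(n)   # signal that LoSE-type measures miss
    u = rng.standard_normal(n)    # regression error, independent of x1, x2
    x_true = x1 + x2
    y_true = beta * x_true + u

    noise_y = rng.standard_normal(n)                  # classical noise in Y
    noise_x = np.sqrt(2.0) * rng.standard_normal(n)   # noise variance = var(x_true)

    return {
        "no error": ols_slope(y_true, x_true),
        # CME: measured variable = truth + independent noise.
        "CME in Y": ols_slope(y_true + noise_y, x_true),
        "CME in X": ols_slope(y_true, x_true + noise_x),
        # LoSE in X: measured X omits x2, which is orthogonal to measured X.
        "LoSE in X": ols_slope(y_true, x1),
        # LoSE in Y: measured Y = E[y_true | x1, u] = beta*x1 + u, so the
        # missing signal beta*x2 is captured by the regressor x_true.
        "LoSE in Y": ols_slope(beta * x1 + u, x_true),
    }


if __name__ == "__main__":
    for case, slope in simulate().items():
        print(f"{case:10s} slope = {slope:.3f}")
```

Under these assumptions the slope stays near the true value of 1 with CME in Y and with LoSE in X, and falls to about one half with CME in X and with LoSE in Y. The LoSE-in-Y attenuation appears only because the regressor contains the missing signal `x2`; if the measured Y omitted part of `u` instead, the slope would be unaffected.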
In the Euler equation case, since consumption growth is mismeasured with LoSE, and the interest rate is largely free from mismeasurement, the results here recommend using the interest rate as the dependent variable, opposite the current conventional wisdom in the profession.¹⁸

¹⁸ One regression specification that does regress asset prices on macroeconomic quantities is the human capital CAPM, essentially a regression of stock prices on labor income growth.
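The earlier argument that lagged interest rates do not remove the LoSE bias can also be illustrated numerically. The sketch below is a stylized example under assumed conditions, not an estimate of any Euler equation: the lagged rate `z = z1 + z2` serves as the instrument, the current rate is `r = z + e`, the term `rho * e` makes `r` endogenous, and measured consumption growth is assumed to capture only `z1` and `v`. All names and parameter values are hypothetical.

```python
import numpy as np


def ols_slope(y, x):
    """Univariate OLS slope of y on x."""
    c = np.cov(x, y)
    return c[0, 1] / c[0, 0]


def iv_slope(y, x, z):
    """Simple IV (Wald) estimator: cov(z, y) / cov(z, x)."""
    return np.cov(z, y)[0, 1] / np.cov(z, x)[0, 1]


def euler_sketch(n=400_000, beta=1.0, rho=0.5, seed=1):
    rng = np.random.default_rng(seed)
    z1, z2, e, v = rng.standard_normal((4, n))  # independent N(0, 1) shocks
    z = z1 + z2                       # lagged interest rate (instrument)
    r = z + e                         # current interest rate
    c_true = beta * r + rho * e + v   # true consumption growth; r is endogenous
    # Assumed LoSE: measured growth is E[c_true | z1, v] = beta*z1 + v, so the
    # missing signal beta*z2 + (beta + rho)*e is orthogonal to the measure
    # but correlated with the instrument through z2.
    c_meas = beta * z1 + v
    return {
        "OLS, true c": ols_slope(c_true, r),
        "IV,  true c": iv_slope(c_true, r, z),
        "OLS, measured c": ols_slope(c_meas, r),
        "IV,  measured c": iv_slope(c_meas, r, z),
    }


if __name__ == "__main__":
    for label, est in euler_sketch().items():
        print(f"{label:16s} {est:.3f}")
```

In this setup the instrument repairs the endogeneity when consumption growth is measured without error (the IV estimate is close to the true value of 1), but with LoSE in the dependent variable the IV estimate is about one half: larger than the OLS estimate of about one third, yet still biased towards zero, because the instrument itself carries part of the missing signal.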
The generalized measurement error model with LoSE is likely applicable in a wide variety of econometric specifications beyond the few considered here, and our results should provide helpful insights for making appropriate modifications to econometric practice.

References

[1] Angrist, J., and Krueger, A. (1999), “Empirical Strategies in Labor Economics,” in Handbook of Labor Economics (Vol. 3A), eds. O. Ashenfelter and D. Card, Amsterdam: Elsevier.
[2] Berkson, J. (1950), “Are There Two Regressions?” Journal of the American Statistical Association, 45, 164-180.
[3] Bollinger, C. (1996), “Bounding Mean Regression When a Binary Regressor is Mismeasured,” Journal of Econometrics, 73, 387-399.
[4] Bollinger, C. (1998), “Measurement Error in the Current Population Survey: A Nonparametric Look,” Journal of Labor Economics, 16, 576-594.
[5] Bound, J., Brown, C., and Mathiowetz, N. (2001), “Measurement Error in Survey Data,” in Handbook of Econometrics (Vol. 5), eds. J.J. Heckman and E. Leamer, Amsterdam: Elsevier.
[6] Bound, J., Brown, C., Duncan, G., and Rogers, W. (1994), “Evidence on the Validity of Cross-sectional and Longitudinal Labor Market Data,” Journal of Labor Economics, 12, 345-368.
[7] Bound, J., and Krueger, A. (1991), “The Extent of Measurement Error in Longitudinal Earnings Data: Do Two Wrongs Make a Right?” Journal of Labor Economics, 9, 1-24.
[8] Broda, C., and Weinstein, D. (2007), “Product Creation and Destruction: Evidence and Price Implications,” University of Chicago working paper.
[9] Campbell, J., and Mankiw, G. (1989), “Consumption, Income, and Interest Rates: Reinterpreting the Time Series Evidence,” in NBER Macroeconomics Annual, eds. O. Blanchard and S. Fischer, Cambridge, NBER.
[10] Card, D. (1996), “The Effect of Unions on the Structure of Wages: A Longitudinal Analysis,” Econometrica, 64, 957-979.
[11] Carroll, R., and Stefanski, L. (1990), “Approximate Quasi-likelihood Estimation in Models with Surrogate Predictors,” Journal of the American Statistical Association, 85, 652-663.
[12] Chen, N. (1991), “Financial Investment Opportunities and the Macroeconomy,” Journal of Finance, 46, 529-554.
[13] de Leeuw, Frank, and McKelvey, Michael J. (1983), “A ‘True’ Time Series and Its Indicators,” Journal of the American Statistical Association, 78, 37-46.
[14] Durbin, J. (1954), “Errors in Variables,” Review of the International Statistical Institute, 22, 23-32.
[15] Dynan, K., and Elmendorf, D. (2001), “Do Provisional Estimates of Output Miss Economic Turning Points?” Board of Governors of the Federal Reserve System, FEDS working paper 2001-52.
[16] Escobal, J., and Laszlo, S. (2008), “Measurement Error in Access to Markets,” Oxford Bulletin of Economics and Statistics, 70, 209-243.
[17] Estrella, A., and Hardouvelis, G. (1991), “The Term Structure as a Predictor of Real Economic Activity,” Journal of Finance, 46, 555-576.
[18] Evans, M., and Lyons, R. (2005), “Meese-Rogoff Redux: Macro-Based Exchange Rate Forecasting,” working paper 11042, NBER, Cambridge, MA.
[19] Evans, M., and Lyons, R. (2007), “Exchange Rate Fundamentals and Order Flow,” working paper 13151, NBER, Cambridge, MA.
[20] Fama, E. (1990), “Stock Returns, Expected Returns, and Real Activity,” Journal of Finance, 45, 1089-1108.
[21] Faust, J., Rogers, J., Wang, S., and Wright, J. (2003), “The High-Frequency Response of Exchange Rates and Interest Rates to Macroeconomic Announcements,” Board of Governors of the Federal Reserve System, International Finance Discussion Paper 784.
[22] Fixler, D., and Grimm, B. (2006), “GDP Estimates: Rationality tests and turning point performance,” Journal of Productivity Analysis, 25, 213-229.
[23] Fixler, D., and Nalewaik, J. (2007), “News, Noise, and Estimates of the True Unobserved State of the Economy,” Board of Governors of the Federal Reserve System, FEDS working paper 2007-34.
[24] Federov, V. V. (1974), “Regression Problems with Controllable Variables Subject to Error,” Biometrika, 61, 49-56.
[25] Fuller, W. (1987), Measurement Error Models, New York: John Wiley and Sons.
[26] Geary, R. C. (1953), “Non-Linear Functional Relationship Between Two Variables When One Variable is Controlled,” Journal of the American Statistical Association, 48, 94-103.
[27] Griliches, Z. (1986), “Economic Data Issues,” in Handbook of Econometrics (Vol. 3), eds. Z. Griliches and M.D. Intriligator, Amsterdam: Elsevier.
[28] Grimm, B., and Weadock, T. (2006), “Gross Domestic Product: Revisions and Source Data,” Survey of Current Business, 86, 11-15.
[29] Hayek, F. (1945), “The Use of Knowledge in Society,” American Economic Review, 35, 519-530.
[30] Huwang, L., and Huang, Y. H. (2000), “On Errors-In-Variables in Polynomial Regression-Berkson Case,” Statistica Sinica, 10, 923-936.
[31] Hyslop, R., and Imbens, Guido R. (2001), “Bias from Classical and Other Forms of Measurement Error,” Journal of Business and Economic Statistics, 19, 475-481.
[32] Kane, T., Rouse, C., and Staiger, D. (1999), “Estimating Returns to Schooling when Schooling is Misreported,” working paper 7235, NBER, Cambridge, MA.
[33] Kimball, Miles; Sahm, Claudia; and Shapiro, Matthew (2008), “Imputing Risk Tolerance from Survey Responses,” Journal of the American Statistical Association, 103, 1028-1038.
[34] Klepper, S., and Leamer, E. (1984), “Consistent Sets of Estimates for Regressions with Errors in All Variables,” Econometrica, 52, 163-183.
[35] Leamer, E. (1987), “Errors in Variables in Linear Systems,” Econometrica, 55, 893-909.
[36] Mankiw, N., and Shapiro, M. (1986), “News or Noise: An Analysis of GNP Revisions,” Survey of Current Business, 66, 20-25.
[37] McConnell, M., and Perez-Quiros, G. (2000), “Output Fluctuations in the United States: What Has Changed Since the Early 1980s?” American Economic Review, 90, 1464-1476.
[38] Nalewaik, J. (2006), “Current Consumption and Future Income Growth,” Journal of Monetary Economics, 53, 2239-66.
[39] Nalewaik, J. (2007a), “Estimating Probabilities of Recession in Real Time Using GDP and GDI,” Board of Governors of the Federal Reserve System, FEDS working paper 2007-07.
[40] Nalewaik, J. (2007b), “Incorporating Vintage Differences and Forecasts into Markov Switching Models,” Board of Governors of the Federal Reserve System, FEDS working paper 2007-23.
[41] Newey, W.K., and West, K.D. (1987), “A Simple, Positive Semi-Definite Heteroskedasticity and Autocorrelation Consistent Covariance Matrix,” Econometrica, 55, 703-708.
[42] Pischke, J.-S. (1995), “Measurement Error and Earnings Dynamics: Some Estimates from the PSID Validation Study,” Journal of Business and Economic Statistics, 13, 305-314.
[43] Sargent, T. (1989), “Two Models of Measurement and the Investment Accelerator,” Journal of Political Economy, 97, 251-287.
[44] Wang, L. (2003), “Estimation of Nonlinear Berkson-Type Measurement Error Models,” Statistica Sinica, 13, 1201-1210.
[45] Wang, L. (2004), “Estimation of Nonlinear Models with Berkson Measurement Errors,” Annals of Statistics, 32, 2559-2579.
Table 1: Summary Statistics on Vintages of GDP and GDI Growth, Quarterly Data, 1984Q3-2004

Vintage                          var(ΔY^{GDP}_t)   var(ΔY^{GDI}_t)
Current Quarterly, “Advance”     3.1               .
Current Quarterly, “Final”       4.1               4.0
Latest Vintage Available         4.2               4.8

Table 2: Summary Statistics for GDP and GDI Growth

The last column reports the upper bound on var(ΔY^{GDP}_t) / var(ΔY^{⋆}_t).

Sample                     var(ΔY^{GDP}_t)   var(ΔY^{GDI}_t)   cov(ΔY^{GDP}_t, ΔY^{GDI}_t)   Upper bound
Quarterly, 1947-1984Q2     24.1              24.1              22.4                          0.93
Annual, 1947-1984          8.2               8.5               8.2                           0.97
Quarterly, 1984Q3-2004     4.2               4.8               3.1                           0.70
Annual, 1985-2004          1.6               2.6               1.9                           0.69
Table 3: Regressions of Different Measures of Quarterly Output Growth on Current and Lagged Stock Price Growth, 1984Q3 to 2004Q4:

$\Delta Y^{i}_t = \alpha + \beta_0 \Delta p_t + \beta_1 \Delta p_{t-1} + \dots + \beta_6 \Delta p_{t-6} + e_t$

Standard errors are in parentheses below each coefficient.

Measure:            ΔY^{GDP}    ΔY^{GDP}   ΔY^{GDI}   ΔY^{GDI}   ΔY^{GDI−CP}
Vintage:            “Advance”   Latest     Latest     “Final”    Latest
β_0:                0.012       0.014      0.032      0.031      0.003
                    (0.019)     (0.027)    (0.026)    (0.023)    (0.025)
β_1:                0.048       0.054      0.084      0.038      0.083
                    (0.018)     (0.024)    (0.023)    (0.023)    (0.030)
β_2:                0.021       0.065      0.056      0.036      0.059
                    (0.024)     (0.027)    (0.027)    (0.027)    (0.028)
β_3:                0.058       0.057      0.067      0.044      0.093
                    (0.017)     (0.022)    (0.023)    (0.020)    (0.031)
β_4:                0.011       0.015      0.051      0.024      0.064
                    (0.023)     (0.028)    (0.029)    (0.024)    (0.028)
β_5:                -0.003      -0.007     0.038      -0.019     0.070
                    (0.025)     (0.026)    (0.022)    (0.022)    (0.028)
β_6:                0.002       0.023      0.008      0.002      0.032
                    (0.016)     (0.017)    (0.027)    (0.018)    (0.030)
Σ_{k=0}^{6} β_k:    0.149       0.221      0.337      0.156      0.403
                    (0.055)     (0.060)    (0.069)    (0.069)    (0.079)
Table 3A: Regressions of Different Measures of Quarterly Output Growth on Current and Lagged Stock Price Growth, 1984Q3 to 2004Q4:

$\Delta Y^{i}_t = \alpha + \beta \, (\Delta p_t + \Delta p_{t-1} + \dots + \Delta p_{t-6})/7 + e_t$

Measure:    ΔY^{GDP}    ΔY^{GDP}   ΔY^{GDI}
Vintage:    “Advance”   Latest     Latest
β:          0.142       0.214      0.325
            (0.060)     (0.068)    (0.073)

Table 3B: Reverse Regressions of Current and Lagged Stock Price Growth on Different Measures of Quarterly Output Growth, 1984Q3 to 2004Q4:

$(\Delta p_t + \Delta p_{t-1} + \dots + \Delta p_{t-6})/7 = \alpha + \beta^{r} \Delta Y^{i}_t + e_t$

Measure:    ΔY^{GDP}    ΔY^{GDP}   ΔY^{GDI}
Vintage:    “Advance”   Latest     Latest
β^r:        0.411       0.454      0.600
            (0.194)     (0.182)    (0.169)
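The arithmetic in footnote 16 can be reproduced directly from the latest-vintage point estimates in tables 3A and 3B. The sketch below is illustrative only; the function and variable names are hypothetical. It relies on one algebraic step under the model $Y_t = \alpha_0 + \alpha_1 Y^{\star}_t + \varepsilon_t$: the forward slope is proportional to $\alpha_1$, while the reverse slope is proportional to $1/\alpha_1$ times the share of the measure's variance that is signal rather than noise.

```python
# Latest-vintage point estimates copied from tables 3A and 3B.
BETA_FORWARD = {"GDP": 0.214, "GDI": 0.325}   # table 3A
BETA_REVERSE = {"GDP": 0.454, "GDI": 0.600}   # table 3B


def implied_noise_share(beta_fwd, beta_rev):
    """Noise share of var(GDP growth) needed to reconcile tables 3A and 3B.

    Assumes the model Y = a0 + a1 * Y_star + eps for each measure, with no
    noise (eps) in GDI growth, as in footnote 16.
    """
    # Forward slopes are proportional to a1, so their ratio gives a1_GDI / a1_GDP.
    alpha_ratio = beta_fwd["GDI"] / beta_fwd["GDP"]
    # With no noise in either measure, reverse slopes are proportional to 1 / a1.
    reverse_ratio_no_noise = 1.0 / alpha_ratio
    # Observed ratio of reverse slopes (GDI over GDP).
    reverse_ratio_observed = beta_rev["GDI"] / beta_rev["GDP"]
    # Noise in GDP growth scales its reverse slope by the signal share, so
    # observed ratio = no-noise ratio / signal share.
    signal_share = reverse_ratio_no_noise / reverse_ratio_observed
    return alpha_ratio, reverse_ratio_no_noise, reverse_ratio_observed, 1.0 - signal_share


if __name__ == "__main__":
    a, r0, r1, noise = implied_noise_share(BETA_FORWARD, BETA_REVERSE)
    print(f"alpha ratio            {a:.2f}")
    print(f"reverse ratio, no noise {r0:.2f}")
    print(f"reverse ratio, observed {r1:.2f}")
    print(f"implied noise share     {noise:.2f}")
```

The computed values line up with the figures quoted in the footnote (roughly 1.5, 0.66, 1.32, and one half). They use point estimates only and ignore the sampling uncertainty reported in the tables.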
Table 4: Regressions of Different Measures of Quarterly Output Growth on Lagged Interest Rate Spreads (TERM and DEF), 1988Q3 to 2004Q4:

$\Delta Y^{i}_t = \alpha + \beta_{TERM} \left( r^{10yr}_{t-k} - r^{2yr}_{t-k} \right) + \beta_{DEF} \left( r^{corp}_{t-k} - r^{10yr}_{t-k} \right) + e_t$

Standard errors are in parentheses below each coefficient.

Measure:   ΔY^{GDP}, “Advance”    ΔY^{GDP}, Latest       ΔY^{GDI}, Latest       p-val.,
           β_TERM     β_DEF       β_TERM     β_DEF       β_TERM     β_DEF       equal βs
k=1        0.20       -0.50       0.31       -0.61       0.23       -0.79       0.10
           (0.26)     (0.13)      (0.26)     (0.13)      (0.29)     (0.10)
k=2        0.42       -0.44       0.48       -0.53       0.43       -0.69       0.13
           (0.26)     (0.12)      (0.31)     (0.12)      (0.33)     (0.13)
k=3        0.58       -0.38       0.60       -0.40       0.68       -0.65       0.00
           (0.30)     (0.12)      (0.36)     (0.15)      (0.37)     (0.15)
k=4        0.62       -0.23       0.57       -0.28       0.70       -0.50       0.01
           (0.32)     (0.15)      (0.39)     (0.17)      (0.40)     (0.17)
k=5        0.59       -0.19       0.67       -0.29       0.75       -0.41       0.40
           (0.35)     (0.14)      (0.38)     (0.14)      (0.44)     (0.19)
k=6        0.72       -0.27       0.76       -0.32       0.92       -0.39       0.54
           (0.35)     (0.10)      (0.38)     (0.13)      (0.41)     (0.16)
k=7        0.73       -0.19       0.81       -0.20       0.96       -0.39       0.14
           (0.35)     (0.10)      (0.36)     (0.13)      (0.38)     (0.15)
k=8        0.66       -0.10       0.72       -0.15       0.94       -0.27       0.27
           (0.34)     (0.13)      (0.36)     (0.14)      (0.37)     (0.15)