ifdp · December 17, 2023

Lessons from Nowcasting GDP across the World

Abstract

In economics, we need to forecast the present because reliable and comprehensive measures of the state of the economy are released with a substantial delay and considerable measurement error. Nowcasting exploits timely data to obtain early estimates of the state of the economy and updates these estimates continuously as new macroeconomic data are released. In this chapter, we describe how the framework used to nowcast GDP has evolved and is applied worldwide.

Board of Governors of the Federal Reserve System International Finance Discussion Papers ISSN 1073-2500 (Print) ISSN 2767-4509 (Online) Number 1385 December 2023 Lessons from Nowcasting GDP across the World Danilo Cascaldi-Garcia, Matteo Luciani, and Michele Modugno Please cite this paper as: Cascaldi-Garcia, Danilo, Matteo Luciani, and Michele Modugno (2023). “Lessons from Nowcasting GDP across the World,” International Finance Discussion Papers 1385. Washington: Board of Governors of the Federal Reserve System, https://doi.org/10.17016/IFDP.2023.1385. NOTE: International Finance Discussion Papers (IFDPs) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the International Finance Discussion Papers Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers. Recent IFDPs are available on the Web at www.federalreserve.gov/pubs/ifdp/. This paper can be downloaded without charge from the Social Science Research Network electronic library at www.ssrn.com.

Lessons from Nowcasting GDP across the World∗

Danilo Cascaldi-Garcia† (Federal Reserve Board), Matteo Luciani‡ (Amazon.com), Michele Modugno§ (Federal Reserve Board)

December 2023

Keywords: Dynamic factor models, forecasting, nowcasting

JEL codes: C33, C53, E37

∗ We are grateful to the editors Michael Clements and Ana Beatriz Galvão and to one anonymous referee, and for stimulating discussions with Domenico Giannone. Any errors are our responsibility. This is a draft chapter. The final version will be available in the Handbook on Macroeconomic Forecasting edited by Michael Clements and Ana Beatriz Galvão, Edward Elgar Publishing Ltd. The material cannot be used for any other purpose without further permission of the publisher, and is for private use only.

† Federal Reserve Board, International Finance Division, Washington DC 20551, USA; E-mail address: danilo.cascaldi-garcia@frb.gov.

‡ Amazon.com; E-mail address: luciani.matteo@gmail.com.

§ Federal Reserve Board, Financial Stability Division, Washington DC 20551, USA; E-mail address: michele.modugno@frb.gov.

Disclaimer no. 1: The views expressed in this paper are those of the authors and do not necessarily reflect the views and policies of the Board of Governors or the Federal Reserve System.

Disclaimer no. 2: Luciani’s contribution was part of a continued collaboration based on work done prior to joining Amazon. This publication and its contents are not related to Amazon and do not reflect the position of the company and its subsidiaries.

1 Introduction

The term “nowcasting” is a contraction of the words “now” and “forecasting,” and it refers to the prediction of the very recent past, the present, and the very near future. This word has been used in meteorology for a long time, and it was introduced into economics by Giannone et al. (2008).

Obtaining a reliable measure of the state of the economy is pivotal to making policy and business decisions. Every day, policy institutions, market analysts, and financial and nonfinancial corporations parse troves of economic data released by statistical agencies, private and public surveys, and other sources to assess the health of the economy. Based on these data, they nowcast the current state of the economy; that is, they create a narrative about where the economy is and where it is headed. The difficulty comes in separating meaningful economic signals from the noise.

In their seminal paper, Giannone et al. (2008) designed a nowcasting model to formalize key features of how market participants and policymakers read data in real time. A few years later, Bańbura and Modugno (2014) enriched the state-of-the-art nowcasting model with the appropriate tool to monitor how multiple and asynchronous data releases change the assessment of the state of the economy. In other words, with this tool, the nowcasting model can mimic the behavior of market participants who revise their assessments whenever a new data release differs from their expectations.

Before Giannone et al. (2008) introduced their nowcasting model, real-time monitoring of macroeconomic conditions was more of an art than a science. The common practice was to use a set of heuristic models and a good dose of judgment to make predictions about the state of the economy. However, judgmental and simplified heuristic procedures are exposed to internal inconsistencies, with the constant risk of putting too much weight on outdated signals or on timely but unreliable releases.
In addition, this procedure cannot help interpret the information content of each data release in a systematic way. Moreover, assessing the state of the economy in real time involves analyzing a large amount of complex information that is continuously released, often with multiple data releases in a single day. Lastly, processes that are not scientific and do not use formal methods cannot be evaluated ex post. In conclusion, updating the assessment of the economy in real time using a procedure that is not entirely automated is costly, risky, and not scalable.

The challenge embraced by the literature was to design an entirely automated platform capable of tracking the state of the economy without relying on any judgment or subjective prior information. Hence, the nowcasting literature developed a formal and internally coherent methodology replicating the experts’ judgmental process. To perform this task, an arsenal of tools and methods in econometrics, statistics, and data analysis has been deployed, building upon the nascent developments and insights in big data analytics and taking advantage of improvements in scientific computing, data handling, and visualization. Compared to judgmental predictions, the advantage of having such a platform is that it delivers transparent monitoring of the economy through a robust methodology and provides a coherent analysis of the links between macroeconomic and cyclical developments.

The first nowcasting model was a dynamic factor model (DFM) equipped with efficient filtering techniques. This model exploits two essential and robust features of business cycle fluctuations. First, macroeconomic data strongly co-move, so a few common factors summarize their dynamics well; in this context, the common factors are typically associated with the unobserved state of the economy. Second, historically, economic booms and busts persist for a considerable period of time, so the past dynamics of such factors should be informative to understand where we are and where we are heading in the near future. Hence, a DFM provides a parsimonious yet suitable representation for the large set of macroeconomic time series.

Formally, DFMs can be written in state-space form, that is, as a system of two types of equations: measurement equations linking observed series to the unobserved factors, and transition equations describing the dynamics of the unobserved factors. The state-space representation allows the use of Kalman filtering techniques to obtain projections for the observed variables and the unobserved factors. Most importantly, given an estimate of the parameters, the Kalman filter can easily cope with challenging features of the nowcasting information set, such as data observed at different frequencies and missing data. Missing data can appear either at the end of the sample, due to asynchronous data releases (ragged edges), or at the beginning of the sample, because some data sources have only recently started to be collected.

The use of DFMs, coupled with the Kalman filter, has a long tradition in econometrics.
However, for a long time, this approach was considered infeasible for high-dimensional data, as it requires estimating a large number of parameters. Doz et al. (2012) challenged this view and, by studying the asymptotic properties of the maximum likelihood estimator when the complexity of the model and the sample size increase, showed that these models are viable for analyzing big datasets. They also refined the estimation procedure to make the computation scalable to high-dimensional problems. However, their procedure was not directly suitable for nowcasting with an information set characterized by mixed frequencies and missing data. Bańbura and Modugno (2014) tackled this problem by modifying Doz et al.’s maximum likelihood procedure to efficiently use all the information embedded in incomplete datasets. Subsequently, D’Agostino et al. (2016) provided an alternative solution based on Bayesian inference.

Accumulated experience has shown that the model provides predictions whose accuracy equals or exceeds that of predictions based on expert judgment. This performance is why, today, almost every central bank in the world has a nowcasting model. For example, various Federal Reserve branches (Atlanta, Cleveland, and New York) periodically publish their nowcasting models’ results, and Bloomberg makes nowcasts available through its platform. All these estimates are widely followed and discussed by the press and analysts at hedge funds, investment banks, and large corporations.

Nowcasting has also become an active area of academic research. In a survey article, Stock and Watson (2017) included nowcasting among the 10 most important innovations in time-series econometrics over the previous 20 years, and many papers applying the nowcasting framework to economies all around the world have been published.[1]

In this chapter, we focus on how DFMs are constructed and estimated for nowcasting gross domestic product (GDP).[2] We will briefly discuss the limits of how real-time economic monitoring was conducted before nowcasting and how this framework has overcome those limitations, particularly with respect to interpretability. After a brief description of alternative models that have been proposed for real-time monitoring of the economy, we will also discuss how estimation algorithms for DFMs have evolved to efficiently use all the information content embedded in a dataset characterized by a large cross-section of data, with mixed frequencies and mismatched time-span coverage. We will then describe the data selection process and conclude with an empirical section that will highlight the performance of some of the nowcasting applications during the Great Financial Crisis and the onset of the COVID-19 pandemic across multiple countries.

2 Models and their interpretability

Key data that describe the current state of the economy are available with a significant delay, particularly those collected quarterly, with GDP being a prominent example.
For instance, limiting our attention only to G7 countries, the delay between the end of the reference quarter and the publication of the first official estimate of GDP is approximately four weeks in the United States and the United Kingdom, six weeks in Japan, and eight weeks in Canada. France, Germany, and Italy were characterized by a six-week delay until October 2015 and a four-week delay after that.

[1] The economies include Belgium (de Antonio Liedo, 2015), Brazil (Bragoli et al., 2015), BRICs plus Mexico (Dahlhaus et al., 2017), Canada (Bragoli and Modugno, 2017), China (Yiu and Chow, 2010; Giannone et al., 2013), the Czech Republic (Arnostova et al., 2011; Rusnák, 2016), Ecuador (González-Astudillo and Baquero, 2019), the euro area (Angelini et al., 2010; Camacho and Perez-Quiros, 2010; Angelini et al., 2011; Bańbura and Rünstler, 2011; Bańbura et al., 2011; Bańbura and Modugno, 2014; Carriero et al., 2019; Cascaldi-Garcia et al., 2023), European countries (Rünstler et al., 2009; Jansen et al., 2016), France (Barhoumi et al., 2010; Bessec and Doz, 2014), Germany (Marcellino and Schumacher, 2010; Andreini et al., 2023), India (Bragoli and Fosten, 2018), Indonesia (Luciani et al., 2018), Ireland (D’Agostino et al., 2012), Japan (Bragoli, 2017; Hayashi and Tachi, 2023), Mexico (Caruso, 2018), New Zealand (Matheson, 2010), Norway (Aastveit and Trovik, 2012; Luciani and Ricci, 2014), Switzerland (Siliverstovs, 2012), Turkey (Modugno et al., 2016), the United Kingdom (Anesti et al., 2018), and the United States (Giannone et al., 2008; Lahiri and Monokroussos, 2013; Bańbura et al., 2013; Bok et al., 2018; Antolin-Diaz et al., 2020). Surveys of the literature on nowcasting are provided by Bańbura et al. (2011, 2013), Luciani (2017), and Bok et al. (2018).

[2] Nowcasting models have also been applied to variables other than GDP, such as, among others, inflation (Modugno, 2013) and trade (D’Agostino et al., 2017).

However, plenty of information (cor-)related to GDP is published at higher frequencies and earlier than the variable of interest: information about the labor market, industrial production, trade, sales, housing, or surveys about the state of the economy. These are frequently available at monthly frequencies and are released before the current-quarter figure of GDP. The first foundational principle of nowcasting was to create a framework that could exploit a large and timely information set (cor-)related to the target variable to generate its early estimates. However, using timely information from various sources has a number of implications regarding the features of the information set: 1) it may be composed of data with different frequencies; 2) data are released in a nonsynchronous manner and with different degrees of delay, creating the so-called “ragged” or “jagged” edge dataset; 3) data may have different time-span coverage.

Nowcasting is not the first framework deployed that produces estimates of the state of the economy by handling information sets characterized by mixed-frequency, unbalanced, and “ragged edge” data. Central banks and financial market practitioners have long relied on frameworks mainly based on bridge equations and model averaging. However, those frameworks do not provide tools to interpret how new releases of the input variables affect the change in the early estimate of the target variable. This results from their “partial” model nature, as they are not set up to isolate the unpredictable component in the newest data release, conditional on the existing information set. Another drawback of these partial solutions is the need to specify a different model (and, consequently, estimate different sets of parameters) for data vintages with different “ragged edge” structures.
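To make the three features of the information set listed above more concrete, the following is a minimal sketch under stated assumptions: the function name `availability_mask`, the publication lags, and the start dates are all hypothetical and are not taken from any dataset discussed in this chapter. It builds the pattern of available observations in a single data vintage and displays the ragged edge at the end of the sample.

```python
import numpy as np

def availability_mask(n_periods, pub_lags, starts):
    """Return a boolean (n_periods x n_series) array that is True where an
    observation is available in the vintage dated at the last period.

    pub_lags[j]: periods between the reference period and its release (hypothetical).
    starts[j]:   first period in which series j was collected (hypothetical).
    """
    t = np.arange(n_periods)[:, None]          # reference periods as a column
    pub_lags = np.asarray(pub_lags)[None, :]
    starts = np.asarray(starts)[None, :]
    released = t + pub_lags <= n_periods - 1   # already published in this vintage
    collected = t >= starts                    # series existed in that period
    return released & collected

# Three illustrative monthly series: a timely survey, a series published with a
# one-month lag, and a recently introduced series published with a two-month lag.
mask = availability_mask(n_periods=10, pub_lags=[0, 1, 2], starts=[0, 0, 5])
print(mask[-3:].astype(int))   # the last rows show the ragged edge
```

Each vintage generally produces a different mask, which is why a partial model tied to one specific pattern of missing observations has to be re-specified as releases arrive.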
As such, it is not possible to interpret the impact of new data releases, which is at odds with the second foundational principle of nowcasting: a framework that links and interprets how new releases of the input variables revise the model estimate of the target variable. This objective can only be reached by using multivariate econometric frameworks that allow isolating from each new data release the innovation that is orthogonal to the entire available information set and linking how this innovation changes the estimate of the target variable, i.e., a unique multivariate framework that can produce a forecast for each variable in the system.

The natural candidates that display this characteristic among the econometric frameworks typically used in macro-econometric analysis are DFMs and vector autoregressive models (VARs). However, when the nowcasting literature started, the estimation of VARs on a large number of variables was thought infeasible, which restricted the set of available options to DFMs. In recent years, advances in estimation algorithms for VARs have overcome the curse of dimensionality and made it feasible to estimate these models on large datasets, thus making them a viable solution for nowcasting. We refer to Cimadomo et al. (2022) for an exhaustive exposition of how VARs can be used for nowcasting.

In the remainder of this section, we start by describing bridge equations. We then take a deep dive into DFMs, exploring different specifications and adaptations to the kind of data explored. Next, we explain how news can be extracted from “joint models.” The last subsection is dedicated to a quick overview of alternative models that have been proposed for short-term forecasting but still cannot interpret the evolution of the assessment of current conditions. Therefore, we group them in the “partial” model family with bridge equations.

Before starting the description of the models, let us set the basic notation we will use throughout the paper: $y^f_{t,n}$ is variable $n$, released at frequency $f$, which describes the value of that variable for period $t$, where, by convention, $t$ indicates the last day of the reference period.[3]

2.1 Bridge equations

In this type of model, the nowcast and forecasts of $y^q_{t,gdp}$ are obtained via the following regression:

$$y^q_{t,gdp} = \alpha + \beta\, y^q_{t,n} + e_t, \tag{1}$$

where $y^q_{t,n}$ is the aggregation of the predictor $y^f_{t,n}$, which can be available at a frequency $f$ higher than that of the target variable and therefore needs to be aggregated to match the frequency and units of the target variable.[4][5] Hence, the mixed-frequency problem is solved by temporal aggregation of the predictors to the lower frequency. To handle the ragged edge, bridge models resort to auxiliary models, such as autoregressive moving average (ARMA) or VAR models, to forecast $y_{t,n}$ and close the target period of interest. This was the “traditional” tool popularly employed at central banks to obtain early estimates of GDP or its components, and the predictors were usually monthly (see, e.g., Kitchen and Monaco, 2003; Parigi and Golinelli, 2007; Parigi and Schlitzer, 1995; and Baffigi et al., 2004).
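The mechanics of a bridge equation can be sketched as follows. This is a minimal illustration on simulated data, not the specification of any of the cited papers: the helper names `monthly_to_quarterly` and `ols`, the AR(1) auxiliary model, and every parameter value are assumptions made for the example. The monthly predictor is a growth rate, it is aggregated with the triangular weights derived in subsection 2.3, and equation (1) is fitted by ordinary least squares.

```python
import numpy as np

rng = np.random.default_rng(0)

# Triangular weights (1, 2, 3, 2, 1) from the aggregation in subsection 2.3.
TRI_WEIGHTS = np.array([1.0, 2.0, 3.0, 2.0, 1.0])

def monthly_to_quarterly(x_m):
    """Aggregate monthly growth rates to quarterly growth rates.

    Returns one value per quarter-end month for which four monthly lags
    exist, so the first quarter of the sample is dropped.
    """
    x_m = np.asarray(x_m, dtype=float)
    ends = np.arange(5, len(x_m), 3)   # quarter-end months with 4 lags available
    return np.array([TRI_WEIGHTS @ x_m[t - 4:t + 1][::-1] for t in ends])

def ols(y, x):
    """OLS of y on a constant and x; returns (intercept, slope)."""
    X = np.column_stack([np.ones_like(x), x])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef[0], coef[1]

# Simulated monthly predictor and quarterly target (all values hypothetical).
n_months = 120
x_m = np.zeros(n_months)
for t in range(1, n_months):
    x_m[t] = 0.5 * x_m[t - 1] + rng.normal()
x_q = monthly_to_quarterly(x_m)
y_q = 0.3 + 0.5 * x_q + 0.1 * rng.normal(size=x_q.size)

# Estimate equation (1) on the quarters for which the target is published.
alpha_hat, beta_hat = ols(y_q[:-1], x_q[:-1])

# Ragged edge: the last month of the current quarter is not yet released,
# so an auxiliary AR(1) model forecasts it and closes the quarter.
x_known = x_m[:-1]
c_hat, phi_hat = ols(x_known[1:], x_known[:-1])
x_filled = np.append(x_known, c_hat + phi_hat * x_known[-1])

nowcast = alpha_hat + beta_hat * monthly_to_quarterly(x_filled)[-1]
print(f"beta_hat = {beta_hat:.3f}, nowcast = {nowcast:.3f}")
```

Note that the auxiliary forecast and the bridge regression are estimated separately, which is one way to see why this approach offers no model-consistent expectation for each new release.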
Equation (1) is typically estimated by ordinary least squares (OLS) and can be further extended to include more predictors or the lags of the dependent variable. If the information set is large, forecast combination is often an alternative (Kitchen and Monaco, 2003; Diron, 2008; Angelini et al., 2011; Rünstler et al., 2009). Bridge equations can also be combined in a so-called bottom-up approach where one obtains early estimates of GDP by aggregating the early estimates of its components, exploiting national accounts’ identities (see Hahn and Skudelny, 2008; Drechsel and Scheufele, 2012; Baffigi et al., 2004).[6]

[3] E.g., the monthly industrial production of January 2022 will be represented as $y^m_{01/31/2022,ip}$, and the quarterly GDP of 2022:Q1 will be represented as $y^q_{03/31/2022,gdp}$.

[4] From now on, we will use the convention that the time notation will indicate the final highest-frequency finite fraction of the reference period. For example, if the variable of interest is quarterly and the highest frequency is monthly, $\tau$ will indicate the last month in the quarter of interest.

[5] The aggregation used in bridge equations follows the same logic as in subsection 2.3 for factor models.

[6] Note that the model of Giannone et al. (2008) can also be interpreted as “bridging with factors,” as the factors extracted with the Kalman filter were plugged into an equation similar to (1) to obtain the nowcasts. The Kalman filter allowed using the ragged-edge part of the information set (not data at different frequencies) to update the factors’ estimate but not the model’s parameters. Once Bańbura and Modugno (2014) and D’Agostino et al. (2016) showed how to efficiently use the entire information set for estimating both factors and parameters, “bridging with factors” was outmoded.

2.2 Dynamic factor models

The typical DFM used for nowcasting decomposes every economic indicator into at least two parts: (i) factors common to each indicator in the information set, and (ii) an indicator-specific idiosyncratic component. The main identification assumption behind these models is that the common factors are the only components that explain the co-movement among the economic indicators, while the idiosyncratic components capture indicator-specific variation. This identification assumption is formalized by imposing that the idiosyncratic components are orthogonal to each other and to the common factors at each lead and lag.[7] More precisely, we specify the model as

$$y_t = \Lambda F_t + e_t, \tag{2}$$

$$F_t = A F_{t-1} + u_t, \qquad u_t \sim \text{i.i.d. } N(0, Q), \tag{3}$$

$$e_t = D e_{t-1} + v_t, \qquad v_t \sim \text{i.i.d. } N(0, R), \tag{4}$$

where $y_t$ is an $(n \times 1)$ vector of standardized economic indicators; $F_t$ is an $(r \times 1)$ vector of common factors (with $r < n$); $e_t$ is the $(n \times 1)$ vector of idiosyncratic components; $\Lambda$ is the $(n \times r)$ matrix of loadings of the economic indicators on the factors; $A$ is the $(r \times r)$ autoregressive matrix of the factors (in companion form); $D$ is the $(n \times n)$ diagonal autoregressive matrix of the idiosyncratic components; $Q$ is the variance-covariance matrix of the factor innovations $u_t$; and $R$ is the diagonal variance-covariance matrix of the idiosyncratic innovations $v_t$. $D$ and $R$ are assumed to be diagonal to preserve the cross-orthogonality condition among the idiosyncratic components.

[7] The assumption that the idiosyncratic components are not cross-sectionally correlated is a simplifying assumption used for exposition purposes. Indeed, in a large macroeconomic dataset, it is most likely the case that these idiosyncratic components are cross-correlated. If those correlations are small, the model can be estimated without additional problems (Doz et al., 2012; Barigozzi and Luciani, 2022).

In order to make this model suitable for the Kalman filter, which is a central ingredient for both the estimation of the model’s parameters (see section 3) and the production of forecasts, we need its state-space representation. Equations (2) to (4) can therefore be

written as

$$y_t = \begin{bmatrix} \Lambda & I_{n \times n} \end{bmatrix} \begin{bmatrix} F_t \\ e_t \end{bmatrix} \tag{5}$$

$$\begin{bmatrix} F_t \\ e_t \end{bmatrix} = \begin{bmatrix} A & 0_{r \times n} \\ 0_{n \times r} & D \end{bmatrix} \begin{bmatrix} F_{t-1} \\ e_{t-1} \end{bmatrix} + \begin{bmatrix} u_t \\ v_t \end{bmatrix} \tag{6}$$

where

$$\begin{bmatrix} u_t \\ v_t \end{bmatrix} \sim \text{i.i.d. } N\left( \begin{bmatrix} 0_{r \times 1} \\ 0_{n \times 1} \end{bmatrix}, \begin{bmatrix} Q & 0_{r \times n} \\ 0_{n \times r} & R \end{bmatrix} \right). \tag{7}$$

Most of the nowcasting applications have been successful, in terms of nowcasting accuracy, with the simplest specification of this model, i.e., assuming the existence of one common factor ($r = 1$).[8] However, there have been applications where more complicated specifications of the DFM have been deployed due to the nature of the data, the problem under scrutiny, or the need to understand links between specific groups of variables and GDP.

One example is Cascaldi-Garcia et al. (2023). In this paper, the authors formalize how to monitor the euro-area economy following a multi-country approach inspired by the example of market participants, who track both euro-area aggregate data and data specific to the largest countries, and policymakers, who build euro-area forecasts from projections for individual countries. To do so, they assume a block structure in which each economic indicator loads only on its economy-specific factor, as in

$$\begin{bmatrix} y^{ea}_t \\ y^{fr}_t \\ y^{ge}_t \\ y^{it}_t \end{bmatrix} = \begin{bmatrix} \Lambda^{ea} & 0 & 0 & 0 & I & 0 & 0 & 0 \\ 0 & \Lambda^{fr} & 0 & 0 & 0 & I & 0 & 0 \\ 0 & 0 & \Lambda^{ge} & 0 & 0 & 0 & I & 0 \\ 0 & 0 & 0 & \Lambda^{it} & 0 & 0 & 0 & I \end{bmatrix} \begin{bmatrix} f^{ea}_t \\ f^{fr}_t \\ f^{ge}_t \\ f^{it}_t \\ e^{ea}_t \\ e^{fr}_t \\ e^{ge}_t \\ e^{it}_t \end{bmatrix}, \tag{8}$$

where the data $y_t$ is partitioned into indicators from the euro-area aggregate ($y^{ea}_t$), Germany ($y^{ge}_t$), France ($y^{fr}_t$), and Italy ($y^{it}_t$) that load on one factor per economy, respectively: euro-area aggregate ($f^{ea}_t$), Germany ($f^{ge}_t$), France ($f^{fr}_t$), and Italy ($f^{it}_t$). Moreover, each variable also loads on its own idiosyncratic component, included in vectors $e^{ea}_t$, $e^{ge}_t$, $e^{fr}_t$, and $e^{it}_t$. In this specification, each loading vector $\Lambda^i$ is of dimension $n_i \times 1$, each identity block $I$ is of dimension $n_i \times n_i$, and the zero blocks are conformable, with $n_i$ that differs according to the geographical area $i = ea, ge, fr, it$.

[8] Usually, some restrictions on the loading matrix are imposed in order to coherently model the relation among variables published at different frequencies, as explained in detail in section 2.3.
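As an illustration of how the state-space form in equations (5) to (7) is used, the sketch below builds the system matrices for a small one-factor model and runs a textbook Kalman filter that skips missing observations. It is a generic implementation under the assumption that the parameters are known; the function name `kalman_filter`, the dimensions, and all parameter values are hypothetical, and the code is not the estimation routine of any of the cited papers.

```python
import numpy as np

def kalman_filter(y, Z, T, Qs, a0, P0):
    """Kalman filter for y_t = Z a_t, a_t = T a_{t-1} + eta_t, eta_t ~ N(0, Qs).

    Missing observations (NaN) are handled by dropping the corresponding
    rows of the measurement equation in that period.
    """
    a, P = a0.copy(), P0.copy()
    filtered = np.zeros((y.shape[0], a0.size))
    for t in range(y.shape[0]):
        a = T @ a                          # prediction step
        P = T @ P @ T.T + Qs
        obs = ~np.isnan(y[t])
        if obs.any():
            Zt = Z[obs]
            v = y[t, obs] - Zt @ a         # innovation of the observed series
            S = Zt @ P @ Zt.T              # innovation variance
            K = P @ Zt.T @ np.linalg.inv(S)
            a = a + K @ v                  # update step
            P = P - K @ Zt @ P
        filtered[t] = a
    return filtered

# Hypothetical parameters for n = 4 indicators and r = 1 factor.
n, r, n_obs = 4, 1, 200
Lam = np.array([[0.9], [0.8], [0.7], [0.6]])
A, D = np.array([[0.8]]), 0.3 * np.eye(n)
Q, R = np.array([[1.0]]), 0.5 * np.eye(n)

Z = np.hstack([Lam, np.eye(n)])                                     # equation (5)
T = np.block([[A, np.zeros((r, n))], [np.zeros((n, r)), D]])        # equation (6)
Qs = np.block([[Q, np.zeros((r, n))], [np.zeros((n, r)), R]])       # shock covariance

# Simulate the state [F_t, e_t] and the observed indicators.
rng = np.random.default_rng(1)
chol = np.linalg.cholesky(Qs)
states = np.zeros((n_obs, r + n))
state = np.zeros(r + n)
for t in range(n_obs):
    state = T @ state + chol @ rng.normal(size=r + n)
    states[t] = state
y = states @ Z.T

# Impose a ragged edge: some series lack their most recent observations.
y_ragged = y.copy()
y_ragged[-2:, 2:] = np.nan
y_ragged[-1:, 1] = np.nan

filtered = kalman_filter(y_ragged, Z, T, Qs, np.zeros(r + n), np.eye(r + n))
factor_hat = filtered[:, 0]
print(np.corrcoef(factor_hat, states[:, 0])[0, 1])
```

The same recursion applies unchanged to every vintage, because the pattern of missing data only changes which rows of the measurement equation enter the update step.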

Another example is Bok et al. (2018), where the authors specify a model with a global factor, which affects all the variables, and a few local factors, which affect only blocks of variables. The authors’ aim is to control for idiosyncrasies in particular subgroups of series. Specifically, to model the local correlations in survey data, they include a soft block, loading only variables representing economic agents’ perceptions and sentiments. In a similar vein, they add two more factors, one for the block of real variables and one for labor variables.[9]

[9] It is also possible to determine the number of factors using statistical tests, which depend on the estimation technique adopted; see Coroneo et al. (2016) for maximum likelihood and Bai and Ng (2002) for principal components.

2.3 Mixed-frequency and time aggregations

Most of the nowcasting models for GDP have been developed for datasets that contain time series published at monthly and quarterly frequencies. Therefore, in equation (2), $y_t = [y^q_t; y^m_t]$. If those data are seasonally adjusted but are not stationary in mean, monthly and quarterly data are transformed into month-on-month and quarter-on-quarter growth rates. If they are stationary, they are transformed into month-on-month or quarter-on-quarter differences.[10]

[10] Surveys are usually included in levels, given that they already express changes. For example, questionnaires behind most of the surveys used in nowcasting models ask how the current conditions (about confidence, consumption, business, etc.) compare to the previous month/quarter.

To construct a model that explicitly considers the different units of measure within a mixed-frequency dataset, the general strategy has been to assume that the low-frequency variables have a partially observed counterpart with the highest frequency among those included in the dataset.

To be more concrete, let’s consider a dataset with several monthly variables and the quarterly GDP. The latter is treated as a partially observed monthly variable in which the quarterly release is assigned to the third month of the respective quarter. This partially observed variable is assumed to be an aggregation of an unobserved monthly growth rate of GDP ($y^m_{t,u}$, whose log-level is $Y^m_{t,u}$) that admits the same factor model representation as the other monthly variables:

$$y^m_{t,u} = \lambda_{gdp} f_t + e_{t,gdp}, \tag{9}$$

where

$$e_{t,gdp} = \rho_{gdp}\, e_{t-1,gdp} + v_{t,gdp}. \tag{10}$$

To link $y^m_{t,u}$ with the observed GDP growth rate $y^q_{t,gdp}$, let us start considering this growth rate as the difference of the log-levels of the quarterly GDP $Y^q_t$ and then use the

following triangular aggregation:

$$\begin{aligned} y^q_{t,gdp} &= Y^q_{t,gdp} - Y^q_{t-3,gdp} = (1 - L^3)\, Y^q_{t,gdp} \\ &\approx (1 - L^3)(1 + L + L^2)\, Y^m_{t,u} = (1 + L + L^2 - L^3 - L^4 - L^5)\, Y^m_{t,u} \\ &= (1 - L + 2L - 2L^2 + 3L^2 - 3L^3 + 2L^3 - 2L^4 + L^4 - L^5)\, Y^m_{t,u} \\ &= y^m_{t,u} + 2 y^m_{t-1,u} + 3 y^m_{t-2,u} + 2 y^m_{t-3,u} + y^m_{t-4,u}. \end{aligned}$$

Therefore, the quarterly GDP growth rate $y^q_{t,gdp}$ can be written as

$$y^q_{t,gdp} = \lambda_{gdp} (f_t + 2 f_{t-1} + 3 f_{t-2} + 2 f_{t-3} + f_{t-4}) + e_{t,gdp} + 2 e_{t-1,gdp} + 3 e_{t-2,gdp} + 2 e_{t-3,gdp} + e_{t-4,gdp} \tag{11}$$

and easily cast in the state-space form

$$\begin{bmatrix} y^q_{t,gdp} \\ y^m_t \end{bmatrix} = \begin{bmatrix} \lambda_{gdp} & 2\lambda_{gdp} & 3\lambda_{gdp} & 2\lambda_{gdp} & \lambda_{gdp} & 1 & 2 & 3 & 2 & 1 & 0' \\ \Lambda_m & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & I \end{bmatrix} \begin{bmatrix} f_t \\ f_{t-1} \\ f_{t-2} \\ f_{t-3} \\ f_{t-4} \\ e_{t,gdp} \\ e_{t-1,gdp} \\ e_{t-2,gdp} \\ e_{t-3,gdp} \\ e_{t-4,gdp} \\ e_{t,m} \end{bmatrix} \tag{12}$$

However, for some countries, due to the lack of reliable seasonal adjustment techniques, both monthly and quarterly data are published only as year-on-year growth rates. This is the case of China, for which Giannone et al. (2013) propose a time aggregation that takes into account the nature of this data, also adopted by Modugno et al. (2016), Dahlhaus et al. (2017), Bragoli and Fosten (2018), and Barcelona et al. (2022).

Let us again assume that GDP level data for a given quarter is the sum of monthly unobserved contributions and let $y^{my}_{t,u}$ denote the unobserved monthly year-on-year GDP growth rate. We assume that $y^{my}_{t,u}$ admits the same factor structure as the other year-on-year monthly variables in the dataset, similarly to equations (9) and (10).

The monthly unobserved year-on-year growth rate can then be linked to a partially observed (at every third month of the quarter) quarterly year-on-year growth rate ($y^{qy}_{t,gdp}$)

using the following formula:

$$\begin{aligned} y^{qy}_{t,gdp} &= Y^q_{t,gdp} - Y^q_{t-12,gdp} = (1 - L^{12})\, Y^q_{t,gdp} \\ &\approx (1 - L^{12})(1 + L + L^2)\, Y^m_{t,u} = (1 + L + L^2)\, y^{my}_{t,u} \\ &= y^{my}_{t,u} + y^{my}_{t-1,u} + y^{my}_{t-2,u}, \end{aligned} \tag{13}$$

which implies that the quarterly variables are required to load equally on the current and two lagged values of the unobserved monthly factor.

Using a similar logic but with some more cumbersome algebra, weekly and daily data can also be included in the dataset for a nowcasting model. Modugno (2013) and Bańbura et al. (2013) show how to modify the state space by including aggregator variables that link flow and stock variables coherently at frequencies from daily to quarterly.

2.4 News

A critical ingredient of a nowcasting model is to interpret how new data releases of indicators included in the information set revise the estimate of the variable of interest. In order to do so, we first need to extract the unpredictable component of each new data release given the available information set, i.e., the innovation, and then the contribution of this innovation to the forecast revision, the so-called “news.”

To extract the innovation, we need a framework that jointly models all the variables in the information set. Such a model allows us to compute expectations for each new data release and isolate the innovation, which is the difference between the actual release and its expectation. The innovation can then be linked to the variable of interest through weights that depend on the model parameters, and the product of each innovation and its corresponding weight is the news. Having model-based news for all variables allows us to obtain the revision of the GDP nowcast as the weighted sum of the innovations, that is, as the sum of the news. Computing the news is key for understanding the changes in the model assessment of current economic activity over time and helping evaluate the significance of each data publication. Bańbura and Modugno (2014) explain these ideas more formally.
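The verbal description above can be illustrated numerically. The sketch below is a deliberately simplified static example, assuming a single factor and two releases that refer to the same period as the target; every number and variable name is hypothetical. It computes the innovations, the weights obtained from a linear projection of the target on the innovations, and the resulting news, and then cross-checks the revised nowcast against a direct posterior-mean calculation.

```python
import numpy as np

# Hypothetical one-factor setting: the target and two releases load on f.
lam_gdp = 0.8                       # loading of the target on the factor
lam = np.array([0.9, 0.6])          # loadings of the two released series
R = np.diag([0.5, 1.0])             # idiosyncratic variances of the releases
mu, P = 0.2, 1.5                    # mean and variance of f given the old vintage

x = np.array([1.0, -0.3])           # newly released values (made up)

old_nowcast = lam_gdp * mu
innovation = x - lam * mu                     # release minus its expectation
cov_y_I = lam_gdp * P * lam                   # covariance of target and innovations
cov_I_I = P * np.outer(lam, lam) + R          # covariance matrix of the innovations
weights = cov_y_I @ np.linalg.inv(cov_I_I)    # one weight per release
news = weights * innovation                   # contribution of each release
new_nowcast = old_nowcast + news.sum()

# Cross-check: posterior mean of f in precision form, then map to the target.
post_prec = 1.0 / P + np.sum(lam**2 / np.diag(R))
post_mean = (mu / P + np.sum(lam * x / np.diag(R))) / post_prec

print("weights:", weights)
print("news:", news)
print("revision:", new_nowcast - old_nowcast)
```

If both releases came in exactly as expected, the innovations would be zero and the nowcast would not move, regardless of the size of the weights.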
For the sake of simplicity, in what follows, we abstract from data revision and parameter re-estimation, so the new information we consider is only due to new data releases.[11]

[11] Hayashi and Tachi (2021) extend the revision analysis by providing a method for breaking down the decomposition of nowcast changes into the new-observations, data-revisions, and parameter-revisions contributions from each individual indicator variable.

Let us define $y^q_{\tau,gdp}$ as our target variable, e.g., real GDP quarterly growth for a given quarter, which we attribute to the last day of the quarter $\tau$. For each data vintage $\Omega_v$, available in day $v$, we can produce an estimate of our variable of interest $E[y^q_{\tau,gdp} \mid \Omega_v]$. The difference between $\Omega_v$ and $\Omega_{v+1}$ is the new data released between $v$ and $v+1$. For simplicity, let

ym and ym be the only available data, relative to variable n and m and attributed to t,n s,m day t and s, respectively, which have been released between v and v +1. Formally we have Ω ⊂ Ω and Ω \Ω = {ym ,ym }. (14) v v+1 v+1 v t,n s,m Hence the information set is “expanding.” With the new releases, and therefore the new information set Ω , a new estimate v+1 (cid:104) (cid:105) E y |Ω can be generated. Using equation (14) and the properties of conditional τ,gdp v+1 expectations as an orthogonal projection operator, the following decomposition holds: (cid:104) (cid:105) (cid:104) (cid:105) (cid:104) (cid:105) E yq |Ω = E yq |Ω +E yq |I , τ,gdp v+1 τ,gdp v τ,gdp v+1 (cid:124) (cid:123)(cid:122) (cid:125) (cid:124) (cid:123)(cid:122) (cid:125) (cid:124) (cid:123)(cid:122) (cid:125) newforecast oldforecast revision where (cid:34) ym −E(cid:2) ym |Ω (cid:3) (cid:35) I = t,n t,n v . (15) v+1 ym −E(cid:2) ym |Ω (cid:3) s,m s,m v I isthepartofthereleasesym andym thatwasunpredictablewiththeinformation v+1 t,n s,m contained in Ω (given a specific model), or, more formally, I ⊥ Ω . This is the reason v v+1 v why I is labeled news, as it is the new information content available in Ω with v+1 v+1 respect to Ω . Note that it is the news and not the release itself that leads to nowcast v revisions. In particular, if the new values in Ω are exactly as predicted, given the v+1 information in Ω (in other words, “there is no news”), the nowcast will not be revised. v We can further develop the expression for the revision, or the difference between the new and the old nowcast, as E(cid:2) yq |I (cid:3) = E(cid:2) yq I′ (cid:3)E(cid:2) I I′ (cid:3)−1 I . 
(16) τ,gdp v+1 τ,gdp v+1 v+1 v+1 v+1 Given the model described in equations (2) to (4), assuming that there is only one factor and abstracting from parameter uncertainty, equation (16) can be decomposed as  (cid:104) (cid:105)  Λ′ E (f −E[f |Ω ])(f −E[f |Ω ]) λ E(cid:2) yq I′ (cid:3) =  gdp (cid:104) τ τ v t t v (cid:105) n , and τ,gdp v+1 Λ′ E (f −E[f |Ω ])(f −E[f |Ω ]) λ gdp τ τ v s s v m E(cid:2) I I′ (cid:3) = v+1 v+1 (cid:34) (cid:35) λ E(f −E[f |Ω ])E(f −E[f |Ω ])λ +R λ E(f −E[f |Ω ])E(f −E[f |Ω ])λ n t t v t t v n n,n n t t v s s v m , λ E(f −E[f |Ω ])E(f −E[f |Ω ])λ λ E(f −E[f |Ω ])E(f −E[f |Ω ])λ +R m s s v t t v n m s s v s s v m m,m where Λ′ = [λ 2λ 3λ 2λ λ ], f = [f f f f f ]′, and the approprigdp gdp gdp gdp gdp gdp τ τ τ−1 τ−2 τ−3 τ−4 12

ate expectations can be extracted by the Kalman filter and smoother. Naming δτ,gdp = E(cid:2) yq I′ (cid:3)E(cid:2) I I′ (cid:3)−1 , we can write t,n τ,gdp v+1 v+1 v+1 (cid:16) (cid:17) (cid:16) (cid:17) E(cid:2) yq |Ω (cid:3) −E(cid:2) yq |Ω (cid:3) = δτ,gdp ym −E(cid:2) ym |Ω (cid:3) +δτ,gdp ym −E(cid:2) ym |Ω (cid:3) . τ,gdp v+1 τ,gdp v t,n t,n t,n v s,m s,m s,m v (cid:124) (cid:123)(cid:122) (cid:125) (cid:124) (cid:123)(cid:122) (cid:125) (cid:124) (cid:123)(cid:122) (cid:125) revision news news (17) In other words, the revision can be decomposed as a weighted average of the news in the latest release. What matters for the revision is both the size of the news as well as its relevance for the variable of interest, as represented by the associated weight δτ,gdp. This t,n weightcapturestheimportantanceoftheupdateofthefactorf giventhenewinformation about series n relative to time t for the update of factor f at time τ. Equation (17) can be considered as a generalization of the usual Kalman filter update equation to the case in which new data arrive in a non synchronous manner, or τ ̸= t. As stressed before, this crucial relationship described in equation (17) can be obtained only through “joint models” like DFMs or VARs and in the case of a simultaneous release of several (groups of) variables, bringing the possibility of tracking how single releases have contributed to the forecast revision. While in this review we focus on factor models, most tasks performed with a DFM can also be performed with a VAR. The key is to write the VAR in a state-space form to use the Kalman filter (see, for example, Ban´bura et al., 2015). Differently from DFMs, however, the time aggregation in VARs is not exact but regression based (Cimadomo et al., 2022). 2.5 Other “partial” models Subsection 2.4 has highlighted the importance of relying on joint models for nowcasting. 
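As a brief numerical illustration of the news decomposition in equation (17) of Subsection 2.4, the following sketch computes the weights and the contribution of each release to the nowcast revision. It is only a toy example: the function name `news_decomposition` and all numerical inputs are hypothetical, and the two moment matrices are taken as given rather than extracted from a Kalman smoother, as they would be in an actual DFM.

```python
import numpy as np


def news_decomposition(cov_target_news, cov_news, news):
    """Split a nowcast revision into per-release contributions (cf. equation (17)).

    cov_target_news : E[y I'], covariance between the target and each news item, shape (k,)
    cov_news        : E[I I'], covariance matrix of the news, shape (k, k)
    news            : I_{v+1}, release minus model prediction for each series, shape (k,)
    """
    cov_target_news = np.asarray(cov_target_news, dtype=float)
    cov_news = np.asarray(cov_news, dtype=float)
    news = np.asarray(news, dtype=float)

    # Weights delta = E[y I'] E[I I']^{-1}. Because E[I I'] is symmetric,
    # solving the linear system avoids forming the inverse explicitly.
    weights = np.linalg.solve(cov_news, cov_target_news)

    # Each release contributes weight * news; the revision is their sum.
    contributions = weights * news
    return weights, contributions, contributions.sum()


# Hypothetical inputs: two releases (series n and series m).
weights, contributions, revision = news_decomposition(
    cov_target_news=[0.4, 0.1],
    cov_news=[[1.0, 0.2], [0.2, 0.5]],
    news=[0.5, -0.3],
)
print("weights:", weights)
print("contributions:", contributions)
print("revision:", revision)
```

In this toy setting, a release that comes in exactly as predicted has zero news and therefore contributes nothing to the revision, whatever its weight.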
In this subsection, we present advances in “partial” models that are also commonly used for tracking the current state of the economy.

2.5.1 MIDAS-type equations

MIDAS represents an evolution of the “partial” model approach. Here, the predictors are included in the regression at their original observation frequency:

$$y^q_{\tau,gdp} = \alpha + \beta\,\Gamma(L,\theta)\, y^f_{t-h_n,n} + e_\tau, \tag{18}$$

where $f$ can be any frequency, $t-h_n$ may also coincide with $\tau$, and $\Gamma(L,\theta)$ is a lag polynomial. Since, for large $h_n$, many lags of the explanatory variable might be required, the key in this approach is to parsimoniously parameterize $\Gamma(L,\theta)$. Various versions have been proposed (see, for example, Ghysels et al., 2007), including exponential Almon polynomials, for which $\Gamma(L,\theta) = \sum_{m=1}^{M} \gamma(m,\theta) L^m$ with $\theta = (\theta_1,\theta_2)$ and

$$\gamma(m,\theta) = \frac{\exp(\theta_1 m + \theta_2 m^2)}{\sum_{m=1}^{M} \exp(\theta_1 m + \theta_2 m^2)}.$$

In contrast to the bridge equations, MIDAS-type regression implies that the temporal aggregation weights are data-driven.

Regarding the problem of ragged edge, the solution in this type of approach can be thought of as re-aligning each time series. The time series with missing observations at the end of the sample are shifted forward to obtain a balanced dataset with the most recent information.12 The parameters in equation (18) depend on $h_n$, which is determined by the difference between the forecast target period and the period of the last observation of the predictor. Consequently, separate models must be estimated for different data vintages as the corresponding $h_n$ varies. The case of $t-h_n > \tau$ (i.e., when some data referring to the target quarter are available) is sometimes labeled as MIDAS with leads (Andreou et al., 2008).

Applications of this type of model to short-term forecasting include Clements and Galvão (2008, 2009) and Kuzin et al. (2011), who use monthly indicators to forecast GDP, and Andreou et al. (2008), who also include daily financial variables.13 Given that the MIDAS equations suffer from the curse of dimensionality, a popular strategy for dealing with large information sets is forecast combination (see, e.g., Andreou et al., 2008) or substituting the right-hand side observables $y_{t-h_n,n}$ with factors extracted from a set of monthly predictors as in Marcellino and Schumacher (2010).

As we have already noted, these models are not suited to interpret the impact of new releases on the assessment of the state of the economy. Attempts to circumvent the problem have been based on heuristic procedures, as in Ghysels and Wright (2009), who construct news using market expectations and link them to the change in the forecast by estimating additional auxiliary regressions.
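To make the exponential Almon parameterization concrete, the following sketch computes the weights $\gamma(m,\theta)$ and the fitted value of a MIDAS regression such as equation (18) for given parameters. The function names and all numerical values are hypothetical, chosen only for illustration; in practice $\alpha$, $\beta$, and $\theta$ would be estimated, typically by nonlinear least squares.

```python
import numpy as np


def exp_almon_weights(theta1, theta2, n_lags):
    """Exponential Almon weights gamma(m, theta) for m = 1, ..., n_lags."""
    m = np.arange(1, n_lags + 1)
    z = theta1 * m + theta2 * m**2
    z = z - z.max()  # subtracting a constant leaves the ratio unchanged, avoids overflow
    w = np.exp(z)
    return w / w.sum()  # weights are positive and sum to one


def midas_fitted_value(alpha, beta, theta, high_freq_lags):
    """Fitted low-frequency value alpha + beta * sum_m gamma(m, theta) * x_m.

    high_freq_lags[0] is the most recent high-frequency observation (m = 1),
    high_freq_lags[1] the one before it (m = 2), and so on.
    """
    x = np.asarray(high_freq_lags, dtype=float)
    w = exp_almon_weights(theta[0], theta[1], len(x))
    return alpha + beta * float(w @ x)


# Hypothetical example: six monthly lags, weights declining with the lag.
print(exp_almon_weights(0.1, -0.05, 6))
print(midas_fitted_value(0.5, 2.0, (0.1, -0.05), [0.3, 0.2, 0.1, 0.0, -0.1, 0.2]))
```

With $\theta_1 = \theta_2 = 0$ the weights are all equal to $1/M$, so the regression reduces to a simple average of the high-frequency lags. Only two parameters govern the shape of the weights, however many lags are included.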
12 Re-aligning has been a popular strategy to deal with ragged-edge data. See, for example, Altissimo et al. (2001, 2010); de Antonio Liedo and Muñoz (2010).

13 Clements and Galvão (2008) also show how to add a lag of the low-frequency variable to avoid a seasonal response of the dependent variable to the predictors and use the Broyden-Fletcher-Goldfarb-Shanno method to obtain the estimates of the parameters.

2.5.2 Machine learning

Several papers released in recent years have started to explore the performance of machine learning techniques for short-term forecasting.14 Among others, Soybilgen and Yazgan (2021) use bagged decision trees, random forests, and stochastic gradient tree boosting models to produce early estimates of U.S. GDP; Richardson et al. (2021) explore ridge, Lasso, elastic net, and support vector machine regression methods, as well as gradient boosting and neural networks, to estimate New Zealand GDP in real time; and Zhang et al. (2023) compare the performance of various machine learning algorithms with DFMs, static factor models, and MIDAS for short-term forecasting of the Chinese annualized real GDP growth rate.

14 See Goulet Coulombe et al. (2022) for a detailed description of how machine learning techniques can be applied to forecast macroeconomic variables.

Although some of these applications have displayed encouraging results in terms of accuracy, most machine learning approaches are classified as “partial,” as they are set up in a way that does not allow extracting the unpredictable component of new releases. Therefore, machine learning models cannot interpret how new data releases change the early estimates of the state of the economy.

Moreover, two other characteristics make these models unappealing for nowcasting. First, the “ragged edge” problem is solved by filling the missing values in the quarter of interest with model-based forecasts. Second, the mixed-frequency problem is solved “outside the models”; that is, data are averaged over the quarter to have the predictors at the same frequency as the target variable. These approaches have two side effects. First, the machine learning models are fed with redundant information, i.e., the information set of these models contains forecasts that are linear combinations of information already contained in the set. Second, the aggregation “outside the models” may delay the detection of the early signal that high-frequency variables can deliver, downplaying its effect.

3 Estimation algorithms

We will now describe the algorithms used to estimate DFMs’ parameters and to infer their unobserved components in the context of nowcasting applications. The main issue that this part of the literature has tried to overcome is how to exploit all the information when the available data have different frequencies and cover dissimilar time spans, due to either mismatched historical availability or the asynchronous timing of the releases, thus creating the so-called “ragged” edge.

3.1 Principal components and Kalman filter

One of the most common methods to estimate factor models in the economic literature is principal component analysis (PCA).
PCA estimates the factors and the loadings by finding the pair of $F_t$ and $\Lambda$ that minimizes the variance of the idiosyncratic component, subject to the constraint that the covariance matrix of the loadings is an identity matrix. Formally,

$$\min_{\{F_t\}_{t=1}^{T},\,\Lambda}\ \frac{1}{nT}\sum_{t=1}^{T} (y_t - \Lambda F_t)'(y_t - \Lambda F_t), \quad \text{s.t. } \frac{1}{n}\Lambda'\Lambda = I. \tag{19}$$

The solution to this problem yields

$$\widehat{\Lambda} = V_r D_r^{1/2}, \tag{20}$$

$$\widehat{F}_t = \frac{1}{n}\widehat{\Lambda}' y_t, \tag{21}$$

where $D_r$ is an $r \times r$ diagonal matrix containing the $r$ largest eigenvalues, and $V_r$ is the $n \times r$ matrix containing the associated eigenvectors of the covariance matrix of $y_t$. This is equivalent to estimating the factors as a weighted average of the data, $\frac{1}{n} w' y_t$, where the weights are $w = V_r D_r^{1/2}$. The main intuition of why principal components work is that as the number of variables increases to infinity, the common component survives aggregation, whereas the idiosyncratic component vanishes.

Estimation of approximate factor models with (static) principal components is studied in Stock and Watson (2002), Bai and Ng (2002), Bai (2003), and Forni et al. (2009). We refer the interested reader to these references for more details.

However, using PCA to estimate loadings and factors in equation (2) does not overcome the complications arising from the mixed frequency and the dissimilar time spans that characterize datasets used for real-time monitoring of the economy. Indeed, PCA can be applied only on the balanced part of the dataset, therefore disregarding historical information, the most recent information, and all the data series that have a frequency different from that of the majority of the series in the dataset. Moreover, as a static representation of the data and factors, the PCA specification prevents producing forecasts beyond the sample period.

Doz et al. (2011) made a first step towards overcoming part of these limitations with a two-step procedure. Their idea is to write the factor model in state space, adding equation (3). In the first step, the parameters and the factors of equation (2) are estimated via principal components on the “balanced” part of the information set. The parameters of equation (3) are then estimated via OLS, regressing the estimated factors on their lags.
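The first step just described can be sketched as follows. This is a minimal illustration rather than the authors' implementation: the function names are hypothetical, the panel is assumed to be balanced and already standardized (zero mean, unit variance), and the factor dynamics in equation (3) are assumed to have a single lag for brevity.

```python
import numpy as np


def pca_factors(y, r):
    """Estimate loadings and factors by principal components, as in (20)-(21).

    y : (T, n) balanced panel, assumed demeaned and standardized column by column
    r : number of factors
    """
    T, n = y.shape
    cov = (y.T @ y) / T                          # sample covariance of the data
    eigval, eigvec = np.linalg.eigh(cov)         # eigenvalues in ascending order
    order = np.argsort(eigval)[::-1][:r]         # indices of the r largest
    D_r = np.clip(eigval[order], 0.0, None)      # guard against tiny negative values
    V_r = eigvec[:, order]
    loadings = V_r * np.sqrt(D_r)                # equation (20): V_r D_r^{1/2}
    factors = (y @ loadings) / n                 # equation (21): (1/n) Lambda' y_t, row by row
    return loadings, factors


def var1_ols(factors):
    """OLS estimate of A in F_t = A F_{t-1} + u_t (one lag assumed here)."""
    X, Y = factors[:-1], factors[1:]
    B, *_ = np.linalg.lstsq(X, Y, rcond=None)    # solves Y ~ X B, so A = B'
    return B.T


# Hypothetical usage on simulated data with one persistent factor.
rng = np.random.default_rng(0)
T, n = 300, 15
f = np.zeros(T)
for t in range(1, T):
    f[t] = 0.8 * f[t - 1] + rng.standard_normal()
y = np.outer(f, np.ones(n)) + rng.standard_normal((T, n))
y = (y - y.mean(axis=0)) / y.std(axis=0)
loadings, factors = pca_factors(y, r=1)
A_hat = var1_ols(factors)
print(loadings.shape, factors.shape, A_hat)
```

The estimated parameters would then be passed to the Kalman smoother in the second step; that part is not shown here because it requires the full state-space representation.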
In the second step, factors are re-estimated by applying the Kalman smoother to the part of the information set, including its “unbalanced” part, that contains data with the same frequency. For example, in Giannone et al. (2008), the second step is applied only to the monthly data. Therefore, given the parameters, the factors are also estimated using the unbalanced part of the panel.

The drawbacks of this methodology are twofold. First, data with frequencies different from those prevalent in the information set are either not used to estimate the factors (lower-frequency data) or need to be aggregated outside the model (higher-frequency data). In the example above, quarterly data are disregarded from the estimation of the factors. This can be a material limitation when the information set contains a large number of series with a different frequency from the prevalent one. Second, the parameters are estimated only using the “balanced” part of the dataset, while low-frequency data are included in the analysis only if they have to be forecasted, usually via a bridge equation that uses the estimated factors.

The literature has proposed overcoming these limitations by estimating the model with a modified version of the expectation-maximization (EM) algorithm or with a Bayesian algorithm. The following two sections will discuss these two algorithms.

3.2 Expectation-maximization algorithm

The EM algorithm is an iterative method to find maximum likelihood estimates of parameters in models with unobserved latent variables. In the case of the model described by equations (2) through (4), at any iteration $\kappa > 0$, in the E-step, given an estimate of the parameters $\widehat{\Lambda}^{(\kappa-1)}$, $\widehat{A}^{(\kappa-1)}$, $\widehat{R}^{(\kappa-1)}$, and $\widehat{Q}^{(\kappa-1)}$, the factors are extracted using the Kalman filter and the Kalman smoother. Then, given $E_{(\kappa-1)}[F_t|\Omega_v]$, in the M-step the parameters are re-estimated. Specifically,

$$\widehat{\Lambda}^{(\kappa)} = \left(\sum_{t=1}^{T_v} E_{(\kappa-1)}\left[y_t F_t'|\Omega_v\right]\right)\left(\sum_{t=1}^{T_v} E_{(\kappa-1)}\left[F_t F_t'|\Omega_v\right]\right)^{-1}, \tag{22}$$

$$\widehat{A}^{(\kappa)} = \left(\sum_{t=1}^{T_v} E_{(\kappa-1)}\left[F_t F_{t-1}'|\Omega_v\right]\right)\left(\sum_{t=1}^{T_v} E_{(\kappa-1)}\left[F_{t-1} F_{t-1}'|\Omega_v\right]\right)^{-1}, \tag{23}$$

from which we can estimate $R$ and $Q$ as follows:

$$\widehat{R}^{(\kappa)} = \operatorname{diag}\left(\frac{1}{T_v}\left(\sum_{t=1}^{T_v} E_{(\kappa-1)}\left[y_t y_t'|\Omega_v\right] - \widehat{\Lambda}^{(\kappa)} \sum_{t=1}^{T_v} E_{(\kappa-1)}\left[F_t y_t'|\Omega_v\right]\right)\right), \tag{24}$$

$$\widehat{Q}^{(\kappa)} = \frac{1}{T_v}\left(\sum_{t=1}^{T_v} E_{(\kappa-1)}\left[F_t F_t'|\Omega_v\right] - \widehat{A}^{(\kappa)} \sum_{t=1}^{T_v} E_{(\kappa-1)}\left[F_{t-1} F_t'|\Omega_v\right]\right). \tag{25}$$

The algorithm runs until the increase in the likelihood between two consecutive iterations is below a certain threshold. Lastly, the algorithm is initialized by estimating $F_t$ and $\Lambda$ by principal components and $A$ by OLS. For a rigorous treatment of the EM algorithm in DFMs, we refer the reader to Doz et al. (2012) and Barigozzi and Luciani (2022).

However, this algorithm per se is also not suited to deal with datasets characterized by different frequencies and covering dissimilar time spans. Indeed, if $y_t$ did not contain missing observations, we would have that

$$E_{\kappa}\left[y_t y_t'|\Omega_v\right] = y_t y_t' \quad \text{and} \quad E_{\kappa}\left[y_t F_t'|\Omega_v\right] = y_t E_{\kappa}\left[F_t'|\Omega_v\right], \tag{26}$$

which can be plugged into the equations above, and $E_{\kappa}\left[F_t F_t'|\Omega_v\right]$, $E_{\kappa}\left[F_t F_{t-1}'|\Omega_v\right]$, and $E_{\kappa}\left[F_t|\Omega_v\right]$ can be obtained via the Kalman filter and smoother. Given that in nowcasting applications $y_t$ contains missing observations due to the mixed-frequency nature of the data and the dissimilar time spans of availability, the EM algorithm needs to be modified.

Bańbura and Modugno (2014) make the EM algorithm suitable for such cases by redefining the vector of data $y_t$ as

$$y_t = W_t y_t + (I - W_t) y_t,$$

where $W_t$ is a diagonal matrix with ones corresponding to the non-missing entries in $y_t$ and zeros otherwise. With this change, equations (22) and (24) become

$$\operatorname{vec}\left(\widehat{\Lambda}^{(\kappa)}\right) = \left(\sum_{t=1}^{T_v} E_{(\kappa-1)}\left[F_t F_t'|\Omega_v\right] \otimes W_t\right)^{-1} \operatorname{vec}\left(\sum_{t=1}^{T_v} W_t y_t E_{(\kappa-1)}\left[F_t'|\Omega_v\right]\right) \tag{27}$$

and

$$\begin{aligned} \widehat{R}^{(\kappa)} = \operatorname{diag}\Bigg(\frac{1}{T_v}\sum_{t=1}^{T_v}\Big( & W_t y_t y_t' W_t' - W_t y_t E_{(\kappa-1)}\left[F_t'|\Omega_v\right]\widehat{\Lambda}^{(\kappa)\prime} W_t \\ & - W_t \widehat{\Lambda}^{(\kappa)} E_{(\kappa-1)}\left[F_t|\Omega_v\right] y_t' W_t + W_t \widehat{\Lambda}^{(\kappa)} E_{(\kappa-1)}\left[F_t F_t'|\Omega_v\right]\widehat{\Lambda}^{(\kappa)\prime} W_t \\ & + (I - W_t)\widehat{R}^{(\kappa-1)}(I - W_t)\Big)\Bigg). \end{aligned} \tag{28}$$

Therefore, $W_t$ works as a selection matrix that allows us to obtain the expectations in equation (26) and the corresponding remaining parts of the estimates when the data are available.

3.3 Bayesian inference

An alternative way of estimating DFMs on incomplete datasets is to use Bayesian inference. D’Agostino et al. (2016) propose the following model:

$$x_{it} = \sum_{s=0}^{p} \lambda_{is} f_{t-s} + \sum_{s=1}^{p} \rho_{is} x_{it-s} + e_{it}, \tag{29}$$

$$f_t = \sum_{s=1}^{p} A_s f_{t-s} + u_t, \tag{30}$$

where $u_t \sim N(0, I_r)$ and $e_{it} \sim N(0, \psi_{it})$.

Compared to models (2) through (4), the main difference is that the factors are allowed to be loaded dynamically by the variables through the polynomial $\lambda_i(L) = \sum_{s=0}^{p} \lambda_{is} L^s$. To estimate the large number of parameters in the model, D’Agostino et al. (2016) propose an algorithm based on the following priors:

$$R_{i,i} \sim IG(1,3),$$

$$\Lambda_{i,r,h} \sim N\left(0, \tau\frac{1}{(h+1)^2}\right),$$

$$A_{r,h} \sim N\left(0, \tau\frac{1}{h^2}\right),$$

$$Q_{r,r} \sim N(0,1),$$

where $r$ indicates the factor and $h$ indicates the lag of the factor to which the coefficient is associated. The prior covariance among coefficients associated with different variables and lags is set to zero. Notice that the variance of the prior is lower for the coefficients associated with more distant lags. The hyperparameter $\tau$ controls the scale of all the variances and effectively governs the overall level of shrinkage, and the authors fix it to the conventional value of 0.2. These priors, including the choice of the degree of overall shrinkage, are similar to the Minnesota prior proposed by Litterman (1986) in the context of Bayesian VARs.

The inference is conducted using Gibbs sampling techniques. If all data and the common factor were observed, drawing from the posterior of the parameters would be easy since the prior is conjugate. Conditionally on the parameters and the observable data, the common factors and the missing data can be drawn using simulation smoothers (Carter and Kohn, 1994; De Jong and Shephard, 1995; Durbin and Koopman, 2002). In other words, the Gibbs sampler consists of alternating the following two steps: (i) given a draw of the parameters, draw the missing data and the latent factor conditional on the observations using the simulation smoother; and (ii) given a draw of the full data and the latent factors, draw the parameters from their posterior. The algorithm is initialized using the parameters associated with principal components computed after filling in missing data with a spline function.

This algorithm has been successfully applied to nowcast U.S. GDP by D’Agostino et al. (2016) and Drechsel et al. (2023) and Norwegian GDP by Luciani and Ricci (2014).

4 Data selection

Which variables should we include in the dataset to nowcast GDP? And how many of them?
In theory, these should be trivial questions, as when estimating large DFMs, we can add as many variables as we like, even more so because these models are consistently estimated for $n$ growing to infinity. However, it turns out that adding as many variables as we can is not the right recipe. The right recipe consists of adding the right variables; that is, we want to add variables that contain signals and avoid those that contain just noise.15

How do we determine which are the right variables? The answer to this question has stimulated an interesting debate among academics and practitioners at central banks and other financial institutions. The literature has broadly conceived two alternative methods to answer this question: expertise-based and statistical-based selection methods.

4.1 Expertise-based selection

The first solution relies on the expertise of the people who monitor the state of the economy daily. Their monitoring activities inform monetary policy decisions at central banks, fiscal policy decisions at governmental agencies, or investment decisions at financial and nonfinancial businesses. Including the data that the experts consider important to assess the state of the economy is a natural choice, as one of the goals of nowcasting is to interpret how new data releases change the model-based assessment of current macroeconomic conditions.

In academic papers, scholars rely on the so-called “market-moving indicators” to infer the experts’ preferences about data. For example, news platforms and data providers such as Bloomberg, Forex Factory, and Trading Economics report quantitative or qualitative indexes that indicate the importance of a given data release for their users, from which we can identify the market-moving indicators.16

As an alternative to identifying “market-moving indicators,” some papers rely on expert judgment. For example, in their seminal paper, Giannone et al. (2008) constructed the dataset with the help of economists at the Board of Governors of the Federal Reserve System, therefore tailoring the dataset to the internal expertise and interest in specific variables, among them several sectoral disaggregated variables. Another example is Barigozzi and Luciani (2021), who started from a large dataset and then eliminated variables by looking for those with very high idiosyncratic cross-correlation and using judgment.
15 Adding noise means adding a variable that is idiosyncratic and contains no information about the common factors or adding a variable that is very correlated with another variable in the dataset and, hence, contains no additional information. When we add a variable very similar to another variable in the dataset, we increase the cross-correlation among idiosyncratic components, which is very problematic, as Boivin and Ng (2006) show that excessive cross-sectional correlation among idiosyncratic components worsens the model’s forecasting performance, a result confirmed by the simulations in Luciani (2014).

16 For example, for each data release, Bloomberg reports a relevance index based on the percentage of Bloomberg users who have set up an automatic alert about that specific release. See Cascaldi-Garcia et al. (2023) for a nowcast application that selects market-moving indicators based on the Bloomberg relevance index.

Through time, the literature has concluded that for nowcasting GDP, the information set should include two categories of data. The first category is so-called hard data, i.e., data collected by statistical agencies based on measurable quantities, like variables about labor markets (e.g., the unemployment rate), the industrial sector (e.g., the index of industrial production, and industrial turnover), the construction sector (e.g., the index of production in construction), private consumption (e.g., retail sales and car registrations), and the external sector (e.g., exports and imports of goods). The second category is so-called soft data, i.e., survey indexes that portray feelings and perceptions of economic agents about current and future economic prospects.17

Moreover, the nowcasting literature has concluded that focusing mainly on the headlines of each macroeconomic report while disregarding sectoral disaggregation is a simple and very effective solution. For example, Bańbura and Modugno (2014) and Bańbura et al. (2011) show that the marginal impact on the nowcast precision of disaggregated data is minimal, which is in line with market participants primarily focusing on the headlines of each report. Moreover, the same authors show that the model’s nowcasting performance does not deteriorate if the right disaggregated data are included (regardless of whether the selection relies on market participants or expert judgment) and that the factor model is robust; see the empirical analysis in Bańbura et al. (2010) for forecasting at longer horizons, and the simulation studies of Doz et al. (2011, 2012) and Barigozzi and Luciani (2022) for estimation performance.

Another result of the nowcasting literature is that daily and weekly indicators, such as financial variables, do not improve the performance of a nowcasting model either during normal times or during recessions because the high-frequency component of these indicators is detached from the real economy (see Bańbura et al., 2013). This result does not imply that higher-frequency indicators are unrelated to real economic activity but that the link is through their low-frequency component. As such, the usefulness of high-frequency indicators for nowcasting is limited.
17 Measures of prices and monetary aggregates are usually found not informative for nowcasting GDP. This result goes back to Sargent and Sims (1977).

4.2 Statistical-based selection

The second solution consists of selecting the variables using statistical criteria as suggested by Boivin and Ng (2006) and Bai and Ng (2008); for example, Bai and Ng (2008) suggest using only the variables informative for forecasting the target variable. These criteria are also at the foundation of the machine learning approach, as discussed in Section 2.5.2. However, statistical-based selection does not work well when the data are very correlated, which is the case for macroeconomic data. In particular, De Mol et al. (2008) find no major differences in the forecasting performance between models using statistical-based selection and those with no selection.18 Most importantly, variable selection is unstable because of collinearity among predictors. In other words, the set of predictors selected from month to month is very sensitive to minor perturbations of the dataset, such as adding new variables or extending the sample length, which makes (revisions to) the forecasts obtained with this method difficult to interpret. Similar instabilities have also been found in the context of model selection and model averaging (Ouysse, 2011; Stock and Watson, 2012). Finally, Giannone et al. (2021) find that economic data do not appear informative enough to uniquely identify the relevant predictors when a large pool of variables is available to the researcher.

18 This result also emerges from a careful reading of the empirical results of Boivin and Ng (2006).

4.3 Alternative data

Given the growing availability of information and the increased ability to process and store data, information that goes beyond macro data has been tested for nowcasting GDP in recent years. One of the most promising types of data is corporate accounting data. Indeed, Abdalla et al. (2021) find that factors based on the real-time flow of accounting data from corporate financial reports are incrementally relevant for nowcasting and forecasting major components of economic output in the BEA’s National Income and Product Accounts.

One of the most-studied sources of big data in macroeconomics is Google Trends, but the literature expresses mixed views about its usefulness for nowcasting. While Choi and Varian (2012) argue that Google Trends data help in forecasting near-term values of several economic indicators and Ferrara and Simoni (2022) find improvements in nowcasting GDP accuracy for several countries, Larson and Sinclair (2022) show that such data do not improve the accuracy of nowcasts of unemployment insurance claims, either in normal times or during the COVID-19 pandemic. Moreover, as pointed out by Lazer et al. (2014), Google Trends, like other publicly available indices, are the product of numerous algorithms and decisions made by engineers that are invisible to the user. The problem for forecasters is that these algorithms are not static but are tweaked and adapted as time passes. Therefore, historical values available to us now are not the same as those that were available in the past, and values available in the future may be different again.19

19 Simon van Norden explained these issues in a 2017 post on Econbrowser, available at econbrowser.com/archives/2017/05/guest-contribution-big-data-and-fake-forecasts.

5 Empirical application

In this section, we put into practice the nowcasting lessons detailed in this chapter. As highlighted by Cascaldi-Garcia et al. (2023), timely soft data such as surveys and confidence indicators, even if only qualitative, contain important information about the state of the economy, which is paramount for nowcasting economic activity. We test this result further by presenting the nowcasting performance of a mixed-frequency DFM combining soft and hard data for seven advanced economies: the euro area, Germany, France, Italy, Canada, Japan, and the U.K.

We illustrate the performance of such models by looking at two global events of large deterioration of the economic conditions, namely the 2008 Global Financial Crisis (GFC) and the onset of the COVID-19 pandemic. For the GFC, we use pseudo-real-time data for the 2008–09 period. We provide weekly nowcasts of quarter-on-quarter (QoQ) GDP growth (at an annual rate, a.r.), following the evolution of the model predictions until the eve of the official GDP release. For the onset of the pandemic, we provide the weekly nowcasts of GDP growth (QoQ a.r.) for the second quarter of 2020. By using real-time data, we follow the evolution of the model predictions starting in the first week of January 2020 through the first release of each country’s GDP.20

20 The first release of 2020:Q2 GDP was published in August for all the countries analyzed.

For the euro area and its main economies, we follow the model set-up and data selection proposed by Cascaldi-Garcia et al. (2023). For Canada, we follow the model set-up and a similar data selection as proposed by Bragoli and Modugno (2017), who combine soft and hard data not only from Canada but also from the U.S.; the model also highly benefits from the official release of monthly GDP. For Japan, we follow a data selection similar to Carriero et al. (2019) and Hayashi and Tachi (2023). Lastly, for the U.K., we follow a data selection similar to Carriero et al. (2019) and Anesti et al. (2018); as for the model for Canada, the U.K. model highly benefits from the official release of monthly GDP.

5.1 The Global Financial Crisis

The GFC started in 2008 with financial stresses in the U.S. It quickly spread across the world through the banking system, causing substantial real economic contraction that lasted several quarters. The fast-moving contagion made timely and accurate hard data on the state of the economy more important than ever for policymakers to direct counter-cyclical measures. However, the long delay in the release of such data made policy decisions based on such indicators impracticable. The nowcasting literature has extensively shown that mixed-frequency DFMs enriched with soft data would have been able to track economic developments during the GFC in a timely and precise manner. In this section, we bring additional evidence in support of this finding.

Figure 1 presents the evolution of the nowcast of GDP growth (QoQ a.r.) for seven advanced economies. Each line corresponds to the evolution of the nowcast as new data become available, while the dots correspond to the official GDP release. As shown in the figure, the nowcast evolution tracked quite closely the economic disarray observed through 2008, culminating in a global recession. Moreover, the models were quite accurate even at the height of the crisis, with double-digit contractions.

Figure 1: Nowcast evolution during the Global Financial Crisis. Note: Nowcast evolution from dynamic factor models for seven advanced economies. Red, green, blue, orange, purple, cyan, magenta, yellow, and pink lines follow the nowcasts for the GDP growth (QoQ a.r.) for 2008:Q1 through 2010:Q1, respectively. Dots correspond to the final official GDP release.

Let us zoom in on the weekly evolution of the euro-area GDP nowcast for 2009:Q3. Figure 2 shows the great advantage of using “joint” models over “partial” models: the possibility of clearly disentangling how releases of all the data included in the information set change the assessment of the current state of the economy. The upper panel shows the weekly evolution of the 2009:Q3 euro-area GDP growth nowcast from the beginning of April 2009 to the end of October 2009. For the euro area, we can decompose the contribution of new releases either by country (middle panel) or by nature of the data, soft or hard (lower panel). Focusing on the lower panel, we can see the pivotal role played by soft data in capturing the current state of the economy, anecdotally confirming that soft data are very important to extract timely signals. However, soft data may be noisy: although the overall signal points to improved macroeconomic conditions in the euro area for this specific quarter, a few soft data releases deliver confounding signals. In contrast, hard data consistently point to improved macroeconomic conditions, even though their contribution becomes relevant only in the middle of the quarter of interest.21

5.2 The COVID-19 experience

The onset of the COVID-19 pandemic caused a sharp and intense deterioration of the economy that was unparalleled in recent history. The fast-moving developments observed from March 2020 onward posed challenges to the usual tools for nowcasting, making this event a natural laboratory for these methods.

The lockdowns widely imposed across the globe at the onset of the pandemic closed down factories and drastically diminished manufacturing production, services not deemed essential were suspended, and entire sectors such as tourism and air transportation came to a halt. Official quantitative data became difficult to collect and were unreliable, and the long release delay made them close to useless.
As such, researchers and policymakers were using the few, unstructured releases of timely sentiment indicators, such as confidence indexes and purchasing managers’ indexes, to grasp the magnitude of the damage the pandemic inflicted on the economy. The results in this section show that a DFM would have pointed to the eventual double-digit contraction across the advanced economies as soon as these timely indicators started to be released.

Figure 3 illustrates the weekly evolution of the nowcasts from DFMs for selected advanced economies of 2020:Q2 GDP growth (QoQ a.r.), starting from January 2020. The lockdowns started to be broadly implemented between March and April, so their real quantitative effects would only become visible once indicators such as industrial production and retail sales were released, which can lag by up to 60 days. However, as early as the end of March, confidence indicators already gave hints of an upcoming severe economic contraction. By mid-April, the nowcasts of all selected advanced economies showed predictions close to a double-digit contraction in the second quarter of 2020, while almost no hard data had been released yet. Once hard data started to become available,

²¹ See Cascaldi-Garcia et al. (2023) for the full description of the model.

Figure 2 News decomposition
Note: The upper chart shows the weekly evolution of the 2009:Q3 euro area GDP growth (QoQ a.r.) nowcast from April 2, 2009 to December 3, 2009. The middle chart shows the contribution to the nowcast revision of variables grouped by country. The lower chart shows the contribution of variables grouped by hard and soft data.
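The grouping shown in the lower chart of Figure 2 can be sketched in a few lines. This is a minimal illustration, assuming the standard news representation in which a nowcast revision equals the sum, over newly released series, of a weight times the surprise (released value minus the model's prior prediction). The weights are taken as given here rather than derived from a fitted model, and the class name `Release`, the function names, the series labels, and all numbers are hypothetical and not taken from the paper.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Release:
    name: str        # hypothetical series label
    group: str       # grouping key, e.g. "soft" or "hard"
    actual: float    # value that was released
    expected: float  # model's prediction before the release
    weight: float    # impact of this series' news on the nowcast (assumed given)


def news_contributions(releases):
    """Sum weight * (actual - expected) within each group."""
    totals = defaultdict(float)
    for r in releases:
        totals[r.group] += r.weight * (r.actual - r.expected)
    return dict(totals)


def revise(old_nowcast, releases):
    """Return the revised nowcast and the per-group contributions."""
    contributions = news_contributions(releases)
    return old_nowcast + sum(contributions.values()), contributions


# Illustrative numbers only: two soft releases with opposite surprises
# and one hard release with a positive surprise.
releases = [
    Release("survey_a", "soft", actual=52.0, expected=50.0, weight=0.25),
    Release("survey_b", "soft", actual=48.0, expected=49.0, weight=0.5),
    Release("industrial_output", "hard", actual=0.5, expected=0.0, weight=1.0),
]
new_nowcast, by_group = revise(-1.0, releases)
```

In this toy case the two soft releases offset each other, so the whole revision is attributed to the hard release. Grouping by country instead would only require a different `group` label on each release.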

the model assessed that economic conditions were deteriorating even further, with nowcasts hitting unprecedented marks ranging from -10% to -60% by mid-May, persisting until the first GDP release in August 2020.

Figure 3 Nowcast evolution during the onset of the COVID-19 pandemic
Note: Nowcast evolution from dynamic factor models for seven advanced economies. Solid lines follow the nowcasts of quarter-on-quarter GDP growth (QoQ a.r.) for 2020:Q2. Dots correspond to the first official GDP release.

In conclusion, two results emerge from our analysis of the COVID-19 pandemic. First, a comprehensive model that summarizes unstructured, unbalanced, and mixed-frequency data can be quite informative about the state of the economy, especially during rapid regime changes. Second, timely soft data proved useful for a first assessment of the magnitude of the lockdowns' impact on the economy.

5.3 Dealing with the pandemic period in nowcasting models

The DFMs that we used in this section to nowcast the onset of the COVID-19 pandemic are linear Gaussian models. As such, they are not well equipped to fit an extreme event unlike anything else in history. Going forward, methodological changes may be needed to improve the nowcasting performance of mixed-frequency models when dealing with extreme events.

Since March 2020, the literature has proposed many interesting econometric enhancements for dealing with an extreme episode such as the COVID-19 pandemic, in both frequentist and Bayesian settings. These enhancements are either already directly implemented in mixed-frequency set-ups, or future research could engineer them

to work with the current set of tools available for nowcasting.²² A non-exhaustive list of proposed methods includes the following:
– Excluding the pandemic observations altogether (Schorfheide and Song, 2020)
– Estimating a common shift and persistence of the volatility of the shocks during the extreme periods of the pandemic (Lenza and Primiceri, 2021)
– Downplaying the importance of extreme observations (Cascaldi-Garcia, 2022)
– Modeling extreme observations either as random shifts in the stochastic volatility (Carriero et al., 2022; Álvarez and Odendahl, 2022), or through non-parametric methods (Huber et al., 2023)
– Estimating the model with t-distributed errors (Bobeica and Hartwig, 2023)
– Augmenting the information set with an exogenous variable capturing the pandemic period (Ng, 2021)
– Modeling outliers directly in the DFM (Antolin-Diaz et al., 2020)

²² Ho (2023) summarizes the macroeconomic forecasting performance after the pandemic considering several different methods.

6 Conclusion

Nowadays, nowcasting models are the most popular tools used to assess the current state of the economy at central banks, governmental agencies, and financial and nonfinancial corporations all around the world. They have also been the topic of a hefty body of academic literature. In this chapter, we have summarized how the nowcasting framework has evolved and is currently applied to 1) efficiently use large information sets characterized by data with mixed frequency and mismatched time spans due to dissimilar historical availability and asynchronous release time, and 2) provide the final users with a tool that helps to interpret why a given data release has changed the model’s assessment of the state of the economy.

References

Aastveit, K. A. and Trovik, T. (2012). Nowcasting Norwegian GDP: The role of asset prices in a small open economy. Empirical Economics, 42(1):95–119.

Abdalla, A. M., Carabias, J. M., and Patatoukas, P. N. (2021). The real-time macro content of corporate financial reports: A dynamic factor model approach. Journal of Monetary Economics, 118:260–280.

Altissimo, F., Bassanetti, A., Cristadoro, R., Forni, M., Hallin, M., Lippi, M., Reichlin, L., and Veronese, G. (2001). Eurocoin: A real time coincident indicator of the euro area business cycle. CEPR Discussion Papers 3108, C.E.P.R. Discussion Papers.

Altissimo, F., Cristadoro, R., Forni, M., Lippi, M., and Veronese, G. (2010). New eurocoin: Tracking economic growth in real time. The Review of Economics and Statistics, 92(4):1024–1034.

Álvarez, L. J. and Odendahl, F. (2022). Data outliers and Bayesian VARs in the Euro Area. Technical report.

Andreini, P., Hasenzagl, T., Reichlin, L., Senftleben-König, C., and Strohsal, T. (2023). Nowcasting German GDP: Foreign factors, financial markets, and model averaging. International Journal of Forecasting, 39(1):298–313.

Andreou, E., Ghysels, E., and Kourtellos, A. (2008). Should macroeconomic forecasters look at daily financial data? Manuscript, University of Cyprus.

Anesti, N., Galvão, A. B., and Miranda-Agrippino, S. (2018). Uncertain Kingdom: Nowcasting GDP and its Revisions. Discussion Papers 1824, Centre for Macroeconomics (CFM).

Angelini, E., Bańbura, M., and Rünstler, G. (2010). Estimating and forecasting the euro area monthly national accounts from a dynamic factor model. OECD Journal: Journal of Business Cycle Measurement and Analysis, 2010/1.

Angelini, E., Camba-Méndez, G., Giannone, D., Reichlin, L., and Rünstler, G. (2011). Short-term forecasts of euro area GDP growth. Econometrics Journal, 14(1):C25–C44.

Antolin-Diaz, J., Drechsel, T., and Petrella, I. (2020). Advances in nowcasting economic activity: Secular trends, large shocks and new data. Technical report, Working paper.

Arnostova, K., Havrlant, D., Růžička, L., and Tóth, P. (2011). Short-Term Forecasting of Czech Quarterly GDP Using Monthly Indicators. Czech Journal of Economics and Finance (Finance a uver), 61(6):566–583.

Baffigi, A., Golinelli, R., and Parigi, G. (2004). Bridge models to forecast the euro area GDP. International Journal of Forecasting, 20(3):447–460.

Bai, J. (2003). Inferential theory for factor models of large dimensions. Econometrica, 71:135–171.

Bai, J. and Ng, S. (2002). Determining the number of factors in approximate factor models. Econometrica, 70:191–221.

Bai, J. and Ng, S. (2008). Forecasting economic time series using targeted predictors. Journal of Econometrics, 146(2):304–317.

Bańbura, M., Giannone, D., and Lenza, M. (2015). Conditional forecasts and scenario analysis with vector autoregressions for large cross-sections. International Journal of Forecasting, 31(3):739–756.

Bańbura, M., Giannone, D., Modugno, M., and Reichlin, L. (2013). Now-Casting and the Real-Time Data Flow. In Elliott, G., Granger, C., and Timmermann, A., editors, Handbook of Economic Forecasting, volume 2 of Handbook of Economic Forecasting, chapter 0, pages 195–237. Elsevier.

Bańbura, M., Giannone, D., and Reichlin, L. (2010). Large Bayesian vector auto regressions. Journal of Applied Econometrics, 25(1):71–92.

Bańbura, M., Giannone, D., and Reichlin, L. (2011). Nowcasting. In Handbook on Economic Forecasting, volume 2, pages 63–90. Oxford.

Bańbura, M. and Modugno, M. (2014). Maximum likelihood estimation of factor models on datasets with arbitrary pattern of missing data. Journal of Applied Econometrics, 29(1):133–160.

Bańbura, M. and Rünstler, G. (2011). A look into the factor model black box: Publication lags and the role of hard and soft data in forecasting GDP. International Journal of Forecasting, 27(2):333–346.

Barcelona, W., Cascaldi-Garcia, D., Hoek, J., and Van Leemput, E. (2022). What Happens in China Does Not Stay in China. International Finance Discussion Paper, (1360).

Barhoumi, K., Darné, O., and Ferrara, L. (2010). Are disaggregate data useful for factor analysis in forecasting French GDP? Journal of Forecasting, 29(1-2):132–144.

Barigozzi, M. and Luciani, M. (2021). Measuring US aggregate output and output gap using large datasets. Review of Economics and Statistics. forthcoming.

Barigozzi, M. and Luciani, M. (2022). Quasi maximum likelihood estimation and inference of large approximate dynamic factor models via the EM algorithm. arXiv:1910.03821.v4.

Bessec, M. and Doz, C. (2014). Short-term forecasting of French GDP growth using dynamic factor models. OECD Journal: Journal of Business Cycle Measurement and Analysis, 2013(2):11–50.

Bobeica, E. and Hartwig, B. (2023). The COVID-19 shock and challenges for inflation modelling. International Journal of Forecasting, 39(1):519–539.

Boivin, J. and Ng, S. (2006). Are more data always better for factor analysis? Journal of Econometrics, 127:169–194.

Bok, B., Caratelli, D., Giannone, D., Sbordone, A. M., and Tambalotti, A. (2018). Macroeconomic nowcasting and forecasting with big data. Annual Review of Economics, 10:615–643.

Bragoli, D. (2017). Now-casting the Japanese economy. International Journal of Forecasting, 33(2):390–402.

Bragoli, D. and Fosten, J. (2018). Nowcasting Indian GDP. Oxford Bulletin of Economics and Statistics, 80(2):259–282.

Bragoli, D., Metelli, L., and Modugno, M. (2015). The importance of updating: Evidence from a Brazilian nowcasting model. OECD Journal: Journal of Business Cycle Measurement and Analysis, 2015(1):5–22.

Bragoli, D. and Modugno, M. (2017). A now-casting model for Canada: Do U.S. variables matter? International Journal of Forecasting, 33(4):786–800.

Camacho, M. and Perez-Quiros, G. (2010). Introducing the euro-sting: Short-term indicator of euro area growth. Journal of Applied Econometrics, 25(4):663–694.

Carriero, A., Clark, T. E., Marcellino, M., and Mertens, E. (2022). Addressing COVID-19 Outliers in BVARs with Stochastic Volatility. The Review of Economics and Statistics, pages 1–38.

Carriero, A., Galvão, A. B., and Kapetanios, G. (2019). A comprehensive evaluation of macroeconomic forecasting methods. International Journal of Forecasting, 35(4):1226–1239.

Carter, C. K. and Kohn, R. (1994). On Gibbs sampling for state space models. Biometrika, 81(3):541–553.

Caruso, A. (2018). Nowcasting with the help of foreign indicators: The case of Mexico. Economic Modelling, 69:160–168.

Cascaldi-Garcia, D. (2022). Pandemic priors. International Finance Discussion Paper, (1352).

Cascaldi-Garcia, D., Ferreira, T. R., Giannone, D., and Modugno, M. (2023). Back to the present: Learning about the euro area through a now-casting model. International Journal of Forecasting.

Choi, H. and Varian, H. (2012). Predicting the Present with Google Trends. The Economic Record, 88(s1):2–9.

Cimadomo, J., Giannone, D., Lenza, M., Monti, F., and Sokol, A. (2022). Nowcasting with large Bayesian vector autoregressions. Journal of Econometrics, 231(2):500–519. Special Issue: The Econometrics of Macroeconomic and Financial Data.

Clements, M. P. and Galvão, A. B. (2008). Macroeconomic forecasting with mixed-frequency data. Journal of Business & Economic Statistics, 26:546–554.

Clements, M. P. and Galvão, A. B. (2009). Forecasting US output growth using leading indicators: an appraisal using midas models. Journal of Applied Econometrics, 24(7):1187–1206.

Coroneo, L., Giannone, D., and Modugno, M. (2016). Unspanned Macroeconomic Factors in the Yield Curve. Journal of Business & Economic Statistics, 34(3):472–485.

D’Agostino, A., Giannone, D., Lenza, M., and Modugno, M. (2016). Nowcasting Business Cycles: A Bayesian Approach to Dynamic Heterogeneous Factor Models. In Hillebrand, E. and Koopman, S. J., editors, Dynamic Factor Models, volume 35 of Advances in Econometrics, pages 569–594. Emerald Publishing Ltd.

D’Agostino, A., McQuinn, K., and O’Brien, D. (2012). Nowcasting Irish GDP. OECD Journal: Journal of Business Cycle Measurement and Analysis, 2012(2):21–31.

D’Agostino, A., Modugno, M., and Osbat, C. (2017). A Global Trade Model for the Euro Area. International Journal of Central Banking, 13(4):1–34.

Dahlhaus, T., Guénette, J.-D., and Vasishtha, G. (2017). Nowcasting BRIC+M in real time. International Journal of Forecasting, 33(4):915–935.

de Antonio Liedo, D. (2015). Nowcasting Belgium. Eurostat Review on National Accounts and Macroeconomic Indicators, 2:7–48.

de Antonio Liedo, D. and Muñoz, E. F. (2010). Nowcasting Spanish GDP growth in real time: “one and a half months earlier”. Banco de España Working Papers 1037, Banco de España.

De Jong, P. and Shephard, N. (1995). The simulation smoother for time series models. Biometrika, 82(2):339–350.

De Mol, C., Giannone, D., and Reichlin, L. (2008). Forecasting using a large number of predictors: Is Bayesian shrinkage a valid alternative to principal components? Journal of Econometrics, 146(2):318–328.

Diron, M. (2008). Short-term forecasts of euro area real GDP growth: an assessment of real-time performance based on vintage data. Journal of Forecasting, 27(5):371–390.

Doz, C., Giannone, D., and Reichlin, L. (2011). A two-step estimator for large approximate dynamic factor models based on Kalman filtering. Journal of Econometrics, 164:188–205.

Doz, C., Giannone, D., and Reichlin, L. (2012). A Quasi–Maximum Likelihood Approach for Large, Approximate Dynamic Factor Models. The Review of Economics and Statistics, 94(4):1014–1024.

Drechsel, K. and Scheufele, R. (2012). A comparison of bottom-up approaches and direct forecasts of German GDP in a data-rich environment. Technical report, Halle Institute for Economic Research (IWH). Manuscript.

Drechsel, T., Antolin-Diaz, J., and Petrella, I. (2023). Advances in Nowcasting Economic Activity: The Role of Heterogeneous Dynamics and Fat Tails. CEPR Discussion Papers 17800, C.E.P.R. Discussion Papers.

Durbin, J. and Koopman, S. J. (2002). A simple and efficient simulation smoother for state space time series analysis. Biometrika, 89(3):603–616.

Ferrara, L. and Simoni, A. (2022). When are Google data useful to nowcast GDP? An approach via preselection and shrinkage. Journal of Business & Economic Statistics, pages 1–15.

Forni, M., Giannone, D., Lippi, M., and Reichlin, L. (2009). Opening the Black Box: Structural Factor Models versus Structural VARs. Econometric Theory, 25(5):1319–1347.

Ghysels, E., Sinko, A., and Valkanov, R. (2007). Midas regressions: Further results and new directions. Econometric Reviews, 26(1):53–90.

Ghysels, E. and Wright, J. H. (2009). Forecasting professional forecasters. Journal of Business and Economic Statistics, 27(4):504–516.

Giannone, D., Agrippino, S. M., and Modugno, M. (2013). Nowcasting China real GDP. Technical report, Mimeo.

Giannone, D., Lenza, M., and Primiceri, G. E. (2021). Economic predictions with big data: The illusion of sparsity. Econometrica, 89(5):2409–2437.

Giannone, D., Reichlin, L., and Small, D. (2008). Nowcasting: The real-time informational content of macroeconomic data. Journal of Monetary Economics, 55(4):665–676.

González-Astudillo, M. and Baquero, D. (2019). A nowcasting model for Ecuador: Implementing a time-varying mean output growth. Economic Modelling, 82:250–263.

Goulet Coulombe, P., Leroux, M., Stevanovic, D., and Surprenant, S. (2022). How is machine learning useful for macroeconomic forecasting? Journal of Applied Econometrics, 37(5):920–964.

Hahn, E. and Skudelny, F. (2008). Early estimates of euro area real GDP growth - a bottom-up approach from the production side. Working Paper Series 975, European Central Bank.

Hayashi, F. and Tachi, Y. (2021). The nowcast revision analysis extended. Economics Letters, 209:110112.

Hayashi, F. and Tachi, Y. (2023). Nowcasting Japan’s GDP. Empirical Economics, 64(4):1699–1735.

Ho, P. (2023). Forecasting in the absence of precedent. Journal of Economic Surveys, 37(3):1033–1058.

Huber, F., Koop, G., Onorante, L., Pfarrhofer, M., and Schreiner, J. (2023). Nowcasting in a pandemic using non-parametric mixed frequency VARs. Journal of Econometrics, 232(1):52–69.

Jansen, W. J., Jin, X., and de Winter, J. M. (2016). Forecasting and nowcasting real GDP: Comparing statistical models and subjective forecasts. International Journal of Forecasting, 32(2):411–436.

Kitchen, J. and Monaco, R. M. (2003). Real-time forecasting in practice: The U.S. Treasury staff’s real-time GDP forecast system. Business Economics, pages 10–19.

Kuzin, V., Marcellino, M., and Schumacher, C. (2011). Midas vs. mixed-frequency VAR: Nowcasting GDP in the euro area. International Journal of Forecasting, 27(2):529–542.

Lahiri, K. and Monokroussos, G. (2013). Nowcasting US GDP: The role of ISM business surveys. International Journal of Forecasting, 29(4):644–658.

Larson, W. D. and Sinclair, T. M. (2022). Nowcasting unemployment insurance claims in the time of COVID-19. International Journal of Forecasting, 38(2):635–647.

Lazer, D., Kennedy, R., King, G., and Vespignani, A. (2014). Big data. The parable of Google Flu: traps in big data analysis. Science, 343.

Lenza, M. and Primiceri, G. E. (2021). How to Estimate a VAR after March 2020. Journal of Applied Econometrics (forthcoming).

Litterman, R. B. (1986). Forecasting with Bayesian vector autoregressions—five years of experience. Journal of Business & Economic Statistics, 4(1):25–38.

Luciani, M. (2014). Forecasting with approximate dynamic factor models: The role of non-pervasive shocks. International Journal of Forecasting, 30:20–29.

Luciani, M. (2017). Large-dimensional dynamic factor models in real-time: A survey. In Mazzi, G. L. and Ozyildirim, A., editors, Handbook on Cyclical Composite Indicators, pages 429–451. Eurostat.

Luciani, M., Pundit, M., Ramayandi, A., and Veronese, G. (2018). Nowcasting Indonesia. Empirical Economics, 55(2):597–619.

Luciani, M. and Ricci, L. (2014). Nowcasting Norway. International Journal of Central Banking, 10(4):215–248.

Marcellino, M. and Schumacher, C. (2010). Factor MIDAS for nowcasting and forecasting with ragged-edge data: A model comparison for German GDP. Oxford Bulletin of Economics and Statistics, 72(4):518–550.

Matheson, T. D. (2010). An analysis of the informational content of New Zealand data releases: The importance of business opinion surveys. Economic Modelling, 27(1):304–314.

Modugno, M. (2013). Now-casting inflation using high frequency data. International Journal of Forecasting, 29(4):664–675.

Modugno, M., Soybilgen, B., and Yazgan, E. (2016). Nowcasting Turkish GDP and news decomposition. International Journal of Forecasting, 32(4):1369–1384.

Ng, S. (2021). Modeling macroeconomic variations after COVID-19. Technical report, National Bureau of Economic Research.

Ouysse, R. (2011). Comparison of Bayesian moving average and principal component forecast for large dimensional factor models. Discussion Papers 2012-03, School of Economics, The University of New South Wales.

Parigi, G. and Golinelli, R. (2007). The use of monthly indicators to forecast quarterly GDP in the short run: an application to the G7 countries. Journal of Forecasting, 26(2):77–94.

Parigi, G. and Schlitzer, G. (1995). Quarterly forecasts of the Italian business cycle by means of monthly indicators. Journal of Forecasting, 14(2):117–141.

Richardson, A., van Florenstein Mulder, T., and Vehbi, T. (2021). Nowcasting GDP using machine-learning algorithms: A real-time assessment. International Journal of Forecasting, 37(2):941–948.

Rünstler, G., Barhoumi, K., Benk, S., Cristadoro, R., Den Reijer, A., Jakaitiene, A., Jelonek, P., Rua, A., Ruth, K., and Van Nieuwenhuyze, C. (2009). Short-term forecasting of GDP using large datasets: a pseudo real-time forecast evaluation exercise. Journal of Forecasting, 28(7):595–611.

Rusnák, M. (2016). Nowcasting Czech GDP in real time. Economic Modelling, 54:26–39.

Sargent, T. J. and Sims, C. A. (1977). Business cycle modeling without pretending to have too much a-priori economic theory. In et al., C. S., editor, New Methods in Business Cycle Research. Federal Reserve Bank of Minneapolis.

Schorfheide, F. and Song, D. (2020). Real-time forecasting with a (standard) mixed-frequency VAR during a pandemic. International Journal of Central Banking (forthcoming).

Siliverstovs, B. (2012). Keeping a finger on the pulse of the economy: Nowcasting Swiss GDP in real-time squared. Kof working paper, KOF Swiss Economic Institute, ETH Zurich.

Soybilgen, B. and Yazgan, E. (2021). Nowcasting US GDP using tree-based ensemble models and dynamic factors. Computational Economics, 57:387–417.

Stock, J. H. and Watson, M. W. (2002). Forecasting using principal components from a large number of predictors. Journal of the American Statistical Association, 97:1167–1179.

Stock, J. H. and Watson, M. W. (2012). Generalized shrinkage methods for forecasting using many predictors. Journal of Business & Economic Statistics, 30(4):481–493.

Stock, J. H. and Watson, M. W. (2017). Twenty Years of Time Series Econometrics in Ten Pictures. Journal of Economic Perspectives, 31(2):59–86.

Yiu, M. S. and Chow, K. K. (2010). Nowcasting Chinese GDP: information content of economic and financial data. China Economic Journal, 3(3):223–240.

Zhang, Q., Ni, H., and Xu, H. (2023). Nowcasting Chinese GDP in a data-rich environment: Lessons from machine learning algorithms. Economic Modelling, 122:106204.
