ifdp · January 11, 2026

Productivity and Quality of Multi-product Firms

Abstract

This paper introduces a method for estimating productivity and quality at the firm-product level using a transformation function framework. We use firm optimization conditions to establish a one-to-one mapping between observed data and unobserved productivity and quality. We do not need to impute firm-product input shares and can avoid imposing productivity evolution processes. The method is scalable to numerous products and can address the bias caused by unobserved heterogeneous intermediate input prices. We apply the method to a set of Mexican manufacturing industries and examine the roles of across-firm and within-firm technological spillovers, accounting for the trade-off between productivity and quality. Our quantitative analysis shows that an exogenous, product-specific technological improvement generates substantial gains in welfare, amplified by both within-firm and across-firm spillovers by approximately 17 percent and 5 percent, respectively. Moreover, within-firm resource reallocation toward the most productive products accounts for 60 percent of the resulting firm-level productivity gains.

Board of Governors of the Federal Reserve System International Finance Discussion Papers ISSN 1073-2500 (Print) ISSN 2767-4509 (Online) Number 1430 January 2026 Productivity and Quality of Multi-product Firms Mauro Caselli, Arpita Chatterjee, and Shengyu Li Please cite this paper as: Caselli, Mauro, Arpita Chatterjee, and Shengyu Li (2026). “Productivity and Quality of Multi-product Firms,” International Finance Discussion Papers 1430. Washington: Board of Governors of the Federal Reserve System, https://doi.org/10.17016/IFDP.2026.1430. NOTE: International Finance Discussion Papers (IFDPs) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the International Finance Discussion Papers Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers. Recent IFDPs are available on the Web at www.federalreserve.gov/pubs/ifdp/. This paper can be downloaded without charge from the Social Science Research Network electronic library at www.ssrn.com.

∗ Productivity and Quality of Multi-product Firms † Mauro Caselli, University of Trento ‡ Arpita Chatterjee, Federal Reserve Board § Shengyu Li, University of New South Wales December 22, 2025 Abstract This paper introduces a method for estimating productivity and quality at the firm-product level using a transformation function framework. We use firm optimization conditions to establish a one-to-one mapping between observed data and unobserved productivity and quality. We do not need to impute firm-product input shares and can avoid imposing productivity evolution processes. The method is scalable to numerous products and can address the bias caused by unobserved heterogeneous intermediate input prices. We apply the method to a set of Mexican manufacturing industries and examine the roles of across-firm and within-firm technological spillovers, accounting for the trade-off between productivity and quality. Our quantitative analysis shows that an exogenous, product-specific technological improvement generates substantial gains in welfare, amplified by both within-firm and across-firm spillovers by approximately 17 percent and 5 percent, respectively. Moreover, within-firm resource reallocation toward the most productive products accounts for 60 percent of the resulting firm-level productivity gains. Keywords: multi-product firms, productivity, quality, spillover, within-firm reallocation JEL classification: D24, L11, L15, O33. ∗The authors thank Eleni Aristodemou, Zhiyuan Chen, Jan De Loecker, Erwin Diewert, Kevin Fox, Andrea Fracasso, Maurice Kugler, Moyu Liao, Logan Lewis, Matthias Mertens, Scott Orr, Devesh Raval, Ariell Reshef, Mark Roberts, Stefano Schiavo, Petr Sedlacek, Nikos Theodoropoulos, Andreas Tryphonides, Nelli Valmari, Eric Verhoogen, Frederic Warzynski, Daniel Xu, Haiqing Xu, Hongsong Zhang, and many seminar and conference participants for very helpful comments. All errors are the authors’ responsibility. All views/opinions are authors’ own and do not reflect views of the Federal Reserve Board or the Federal Reserve System. †School of International Studies & Department of Economics and Management, University of Trento. Email: mauro.caselli@unitn.it. ‡Federal Reserve Board. Email: chatterjee.econ@gmail.com. §Corresponding author: School of Economics & Centre for Applied Economic Research, Business School, the University of New South Wales, Australia. Email: shengyu.li@unsw.edu.au. 1

1 Introduction The production landscape of many manufacturing industries is dominated by multi-product firms, which operate across a diverse range of product lines. However, existing empirical studies that explore the determinants of firm performance have primarily focused on analyzing variations across different firms, such as heterogeneity in productivity levels and demand characteristics (e.g., Foster et al., 2008; Pozzi and Schivardi, 2016; Kumar and Zhang, 2019). Consequently, there remains a considerable gap in the understanding of the factors that drive within-firm heterogeneity and resource reallocation, as well as their subsequent impact on firm growth. This knowledge gap is due to methodological limitations and data constraints, which hinder the accurate estimation of heterogeneity at the firm-product level. This paper introduces a method to estimate productivity and quality (product appeal) at the firm-product level, along with the transformation function and demand parameters. This method constructs a unique one-to-one mapping from observed data to unobservable variables by using firm optimization conditions. This offers several advantages over recent methods (e.g., Dhyne et al., 2022; Orr, 2022; Valmari, 2023). First, it eliminates the need for imputing within-firm input allocations. Second, it does not need to impose restrictions on productivity evolution, allowing for flexibility in exploring complex productivity dynamics after estimation. Third, it is scalable to handle a large number of products. Fourth, it addresses the estimation bias caused by heterogeneous firm-level intermediate input prices, which are usually unobservable in available data sets.1 To demonstrate the advantages, we apply our method to three major industries in the Mexican manufacturing sector, where multi-product production is a central feature of firms. We examine the role of both across-firm and within-firm technological spillovers in the dynamic evolution of technical efficiency, as well as the role of within-firm resource reallocation in shaping firm performance. Inmodelingtheproductionside,ourmethodisdesignedtoaddressthechallengescommonly faced in estimating multi-product production functions. Most production function estimation methodologies implicitly assume that each firm produces a single product (e.g., Olley and Pakes, 1996; Levinsohn and Petrin, 2003; Ackerberg et al., 2015; Gandhi et al., 2020). In this context, the input allocation is observable to researchers and each firm only has a single dimension of unobservable productivity, which can be controlled for by an observable proxy. Multi-product firms, on the contrary, may have different levels of productivity for each product. Extending the proxy-based methods to the context of multi-product firms requires at least the same number of proxies as the number of products (cf., Dhyne et al., 2022). 1In the cases where intermediate input prices are observed, our method can be modified to allow for non-Hicks’ neutral efficiency (i.e., labor-augmenting efficiency), as shown in recent literature (e.g., Doraszelski and Jaumandreu, 2016; Zhang, 2019; Raval, 2019; Rubens et al., 2024). 2

Moreover, researchers do not observe the within-firm division of inputs used to produce different products because firms usually only report total inputs at the firm level.2 Finally, intermediate input prices, which vary significantly across firms and over time due to various reasons such as bargaining power in the input market and transport costs, as documented by Atalay (2014), should be controlled for to avoid “input price bias” (Ornaghi, 2006; De Loecker et al., 2016; Grieco et al., 2016). However, these firm-level input prices are rarely observable. To address these issues, we model the production technology using a transformation function, which is a mapping from a vector of inputs at the firm level to an aggregator of product-specific outputs. This saves us from modeling how the inputs are divided for the production of each individual product. Each product is associated with a potentially different level of physical productivity (i.e., quantity-based productivity, or TFPQ, as in Foster et al., 2008).3 The productivity levels, together with a parameter in the transformation function that characterizes the technological substitutability of the products, govern the marginal rate of transformation between any two products. The firm observes these productivity levels before making input and output decisions to maximize profits. In the spirit of Grieco et al. (2016), we show that the optimization conditions implied from our model can be inverted to form an explicit one-to-one mapping from observed input and output decisions to unobserved productivity at the firm-product level (regardless of the number of products), whilecontrollingforunobservedintermediateinputprices. Intuitively, thevariationinproduct prices within a firm identifies the productivity difference across products within the firm, after controlling for differences in markups and production scale. We exploit the inverted relationship to replace unobserved productivity in the transformation function, enabling estimation of the transformation function parameters. Once the parameters are estimated, we compute productivity (TFPQ) at the firm-product level from the one-to-one mapping. Although the primary innovation of our method lies on the production side, it is flexible enough to accommodate a variety of demand systems. Conditional on the availability of valid instrumental variables, the approach can be applied to widely used demand models such as Constant Elasticity of Substitution (CES) demand, discrete-choice demand (e.g., Berry, 1994), and random-coefficients logit demand (e.g., Berry et al., 1995). In our empirical application, we adopt a CES demand specification, which is appropriate given the level of product aggregation in our data. To address the classic endogeneity issue in estimating the 2This is an empirical challenge because of the potential input sharing (e.g., machinery and workers) across product lines within a firm (e.g., Cairncross et al., 2025; Koh and Raval, 2025). For example, a printing firm mayusethesamedesignsoftwaretocreatemultipleproducts,suchasproductlabels; workerswithspecialized skills, such as pattern makers, may be used across different product lines within the same footwear firm; in pharmaceutical industries, a firm may use the same reactors to produce different products by adjusting the process parameters. 3Werefertophysicalproductivityassimply“productivity”inthispaperunlessexplicitlystatedotherwise. 3

price elasticity of demand, we depart from the traditional literature by exploiting a key feature of multi-product firms: within-firm profit maximization implies a structural relationship between the revenues of products produced by the same firm. Following estimation, we recover product quality as the residual component of the demand function after controlling for price. After demonstrating the performance of our method through Monte Carlo simulations, we apply it to establishment-level panel data from three major Mexican manufacturing industries—footwear, printing, and pharmaceuticals—that include firm-product-level prices and quantities, along with detailed firm-level input data. Multi-product firms represent approximately 56% of all firms and account for 86% of total revenues in these industries. Given the product classification used, the number of total products ranges from 4 in the footwear industry to 16 in the pharmaceutical industry, with multi-product firms producing an average of 6.9 products per year. Within each industry, the markets for different product categories(e.g., women’sshoesvs. men’sshoesinthefootwearindustry)arelargelysegmented. However, within each product category, firms’ outputs are likely vertically differentiated, as reflected in the substantial dispersion in prices. These empirical features support our use of a CES demand model, which abstracts from competition across horizontally differentiated product markets while capturing vertical differentiation through quality differences. After estimation, the recovered TFPQ and product quality at the firm-product level allow us to examine heterogeneity and performance both within and across firms. Following the literature (e.g., Melitz, 2000), we construct a revenue-based productivity measure (TFPR) that incorporates heterogeneity in both TFPQ and quality at the firm-product level. We find substantial variation in TFPR, with heterogeneity across firms dominating that within firms. Interestingly, although our estimation does not impose any relationship between TFPQ and quality, we find a significant negative correlation (i.e., trade-off) between them, with a coefficient of -0.34. This implies that producing higher quality comes at the cost of lower TFPQ when inputs are held fixed.4 We refer to the component of TFPQ that is adjusted for the cost of quality as technical efficiency. Unlike raw TFPQ, this measure is comparable across firms and products because it accounts for variation in quality. A further advantage of our method is that it does not necessarily require any ex-ante assumptions about the dynamic evolution of technical efficiency. This feature allows us to investigate complex interdependencies in productivity dynamics within multi-product firms after estimating the model parameters, a task that would be considerably more difficult if 4Thisresultisbroadlyconsistentwiththeemergingliteratureemphasizingthenegativecorrelationbetween physical productivity and quality across firms (e.g., Grieco and McDevitt, 2017; Roberts et al., 2018; Orr, 2022; Eslava et al., 2024; Forlani et al., 2023; Li et al., 2025). 4

the productivity process had to be estimated jointly with other parameters. To demonstrate this advantage, we study technological spillovers—both across firms and within firms—while allowing for the trade-off between TFPQ and quality. Compared to the existing literature, which typically focuses on across-firm spillovers (e.g., Malikov and Zhao, 2023), our results suggest that within-firm spillovers are also economically meaningful, although across-firm spillovers are indeed more prominent. To quantify the importance of these spillover channels, we conduct a counterfactual exercise where the technical efficiency of one product is improved exogenously. Compared with the benchmark without spillover, the across-firm spillover contributes an additional 16.6% to the total welfare gain, while the within-firm spillover contributes an extra 5.4%. More importantly, over half of the improvement in firm-level TFPR resulting from the product-specific shock is attributable to within-firm resource reallocation toward more productive products—regardless of spillover types. Our methodology builds on recent advances in the estimation of heterogeneous productivity of multi-product firms. In addressing the common data challenge of input data being observable only at the firm level, while outputs and revenues are reported separately by product, the literature has evolved into two main approaches. The first approach, pioneered by De Loecker et al. (2016), characterizes multi-product production as a collection of singleproduct production functions, coupled with a rule for allocating firm inputs to each of these functions. Subsequent studies have extended this approach. In particular, Orr (2022) models product lines sharing the same technology (i.e., production parameters) but with individual productivity, and shows how demand data can be used to assist estimation under profit maximization conditions. Valmari (2023) develops a similar framework, incorporating flexible production parameters across product-specific production functions. Chen and Liao (2022) generalize the previous papers by allowing single-product firms and multi-product firms to have different production functions and by estimating both non-parametric and parametric production functions for multi-product firms. In contrast, the second approach, led by Dhyne et al. (2022), departs from the assumption that multi-product production is a collection of single-product firms. They adopt a transformation function and show how it can be used to recover the production frontier and estimate firm-product-specific marginal costs. We integrate the strengths of both approaches to overcome their respective limitations. First, we model multi-product production using a transformation function, similar to Dhyne et al. (2022). This avoids the need to allocate firm-level inputs, as in Orr (2022) and Valmari (2023), and allows for potential within-firm input sharing across product lines. Second, in addressing unobserved firm-product productivity, we adopt the profit maximization assumption, similar to Orr (2022) and Valmari (2023). However, instead of imputing input allocation shares, we use the profit-maximizing conditions to establish a one-to-one mapping 5

from observed firm decisions to unobserved productivity, extending the insights of Grieco et al. (2016, 2022), Harrigan et al. (2021) and Li and Zhang (2022) to the context of multi-product firms. Importantly, the number of profit-maximizing conditions, which naturally increase with the number of products, ensures the scalability of our method. This differs from Dhyne et al. (2022), whose method requires a separate proxy for each additional firm-product-level productivity. Rather, it is more similar to recent approaches to identify markdowns (Morlacco, 2020; Caselli et al., 2021; Kirov and Traina, 2023) or factor-augmenting productivity (Demirer, 2022; Raval, 2023) using necessary conditions for optimality with respect to multiple flexible inputs. Third, our method addresses the bias due to unobserved firm-level heterogeneity in input prices without requiring the availability of input price data. This is in contrast to the existing methods (e.g., Orr, 2022; Valmari, 2023), which typically require access to such data. Finally, our method does not rely on modeling the evolution of productivity, which offers a distinct advantage in exploring the evolution of productivity after estimation. Such an advantage is particularly beneficial in studying complex (e.g., interdependent) productivity dynamics, factors that endogenously shape the productivity trajectory (e.g., Chen et al., 2021; Malikov and Zhao, 2023), and frequent product turnovers, such as for exported products. In terms of empirical application, our paper integrates the analysis of the productivity–quality trade-off, technological spillovers, and resource reallocation in the context of multi-product firms. Focusing on firm-level analysis, Grieco and McDevitt (2017) and Li et al. (2025) have documented a significant trade-off between productivity and quality—interpreted as the cost of quality—in the U.S. healthcare sector and the Chinese steel industry, respectively. A natural implication of their findings is that the cost of quality should be explicitly considered when modeling the evolution of productivity. Our paper identifies a similar trade-off at the firm-product level and incorporates this feature into the productivity evolution process to investigate technological spillovers. On the spillover front, our study complements the firm-level literature on productivity spillovers (e.g., Malikov and Zhao, 2023). While most existing research focuses on spillovers across firms, we demonstrate that within-firm spillovers can also be economically significant. These within-firm productivity spillovers reflect economies of scope arising from the internal sharing of knowledge (e.g., Bilir and Morales, 2020; Merlevede and Theodorakopoulos, 2023; Ding, 2025). We show that such spillovers substantially enhance both firm performance and aggregate welfare, primarily through within-firm resource reallocation, a mechanism increasingly recognized in the recent literature on multi-product firms (e.g., Mayer et al., 2021). Our evidence highlights within-firm reallocation as a novel and important channel through which firm-level productivity responds to product-specific shocks. In doing so, our study complements a large body of work emphasizing the role of across-firm resource reallocation in driving aggregate 6

productivity growth (e.g., Aw et al., 2001; Foster et al., 2008; Syverson, 2011; Collard-Wexler and De Loecker, 2015). The remainder of the paper is organized as follows. Section 2 introduces the general theoretical framework of demand and production in the context of multi-product firms. Section 3 develops the estimation methodology for the general framework, while Section 4 describes the data used in the empirical analysis. Section 5 presents the empirical model and demonstrates the performance of the method using Monte Carlo simulations. Section 6 reports the estimation results. Section 7 presents our empirical application and quantitative exercise studying the dynamic evolution of technical efficiency. Section 8 concludes. 2 Theoretical Framework This section develops a general framework of demand and production for multi-product firms, aimed at estimating firm-product-level measures of productivity and quality, along with the associated model parameters. While the general framework highlights the broad applicability of our estimation methodology, researchers may adopt specific functional forms suited to their empirical contexts. In Section 5, we demonstrate such an implementation in the context of our empirical application. Consider an industry with J firms indexed by j = 1,2,...,J. There is a total of N products, indexed by n = 1,2,...,N, that firms can choose to produce. The timeline of the decisions is as follows. At the beginning of period t, the set of products that firm j has decided (at the end of the previous period) to produce in this period is Λ . Each product jt n ∈ Λ is associated with a level of technical efficiency ω and a level of quality ξ , both jt jnt jnt of which have been determined and observed by the firm at the end of the previous period. The firm’s capital stock is also determined in the previous period via an investment decision. Given these state variables, the firm’s static decisions consist of choosing material input and labor input at the firm level and the quantities of individual products to maximize total period profit, conditional on the observed material price, wage rate, and capital stock. The optimization conditions associated with these static decisions form the basis of our estimation strategy. At the end of period t, the firm also makes dynamic decisions regarding its capital stock and product portfolio for the following period, including the selection of products to produce and the associated levels of product quality and technical efficiency. Although our estimation strategy does not explicitly model these dynamic choices, Online Appendix C outlines the structure of these decisions, providing conceptual insight into how they are endogenously determined. 7

2.1 Demand The demand for product n of firm j in period t is modeled as an inverse demand function: P = P (Q ,Q ;ξ ), (1) jnt jnt jt −jt t where P is the product price. Importantly, Q = {Q }, n ∈ Λ is a vector of quantities jnt jt jnt jt of the products produced by firm j in period t; Q = {Q }, k ̸= j is a vector of quantities −jt kt of the products produced by the competitors of firm j in period t; ξ = {ξ }, for all j and jt jnt n, is a vector of quality levels of all products produced by firm j and its competitors. This function may also include a set of product characteristics if they are observable in the data. Empirically, the realized (observed) price of a product is subject to an unexpected shock: P ˜ = P eujnt, (2) jnt jnt where u is assumed to be independent, identically distributed, across firm, product, and jnt time, and E(eujnt) = 1. Crucially, the firm does not observe u (an ex-post shock) when jnt making production decisions of inputs and outputs. In contrast, the firm observes ξ (an jnt ex-ante shock) at the time of production decisions. The explicit modeling of the ex-ante and ex-post shocks is also adopted by Barrows et al. (2024). It is also worth noting that u is jnt the sole source of discrepancy between the model-predicted revenue R and the realized jnt revenue observed by researchers, R ˜ = P ˜ Q = P Q eujnt = R eujnt, while there is jnt jnt jnt jnt jnt jnt no ex-post, unexpected shock to product quantity.5 Depending on the empirical context, the demand system can be specified in various ways, including the widely-used CES demand, discrete-choice demand (e.g., Berry, 1994), and random-coefficients logit demand (e.g., Berry et al., 1995). These demand systems may allow for the possibility that a product’s demand may be affected by cannibalization and competition, arising not only from the products of rival firms but also from other products offered by the same firm. 2.2 Production We use a transformation function to model the production technology. Given the set of products to be produced (Λ ) and associated product quality (ξ , n ∈ Λ ), the firm uses jt jnt jt labor (L ), material (M ), and capital (K ) to produce output quantity (Q , n ∈ Λ ) via jt jt jt jnt jt 5An example of ex-post shock to prices arise if the firm commits to its product quantity before demand is realized. As a result, if the realized market demand exceeds expectations, the firm increases its price by a factor of eujnt, and reduces it when the realized demand is weaker. 8

a transformation function: G(e−ω˜jtQ ) = F(L ,M ,K ). (3) jt jt jt jt The transformation function (3) maps the firm-level input vector (L ,M ,K ) to a vector of jt jt jt outputs Q ≡ {Q }, n ∈ Λ , given the vector of quantity-based productivity (i.e., physical jt jnt jt productivity, or TFPQ) ω˜ ≡ {ω˜ }, n ∈ Λ of firm j in period t. Intuitively, a higher level jt jnt jt of ω˜ means that the firm is able to produce a higher quantity of output Q , conditional jnt jnt on the inputs and the quantity and productivity of other outputs. In this paper, we use TFPQ and productivity interchangeably. The transformation function (3) represents the frontier of production possibility characterized by two aggregating functions F(·) and G(·).6 Function F(·) is a general input aggregator. In empirical settings, it can take functional forms such as CES and translog. We adopt a CES function in our application in Section 5 and describe the implementation with a translog function in Online Appendix A.7 While we assume that the firm uses a single material input M in production, our approach can readily accommodate cases in which firms employ jt multiple types of material inputs—whether horizontally or vertically differentiated—when only the total firm-level expenditure on materials is observed. Details of this extension are provided in Online Appendix B. The function G(·) is an output aggregator. We adopt a functional form that allows for potentially nonlinear technological substitution across products:  1 θ G(e−ω˜jtQ ) ≡  (cid:88) (cid:2) Q e−ω˜jnt (cid:3)θ  , (4) jt jnt   n∈Λjt where θ is a parameter that governs the elasticity of technological substitution across products and thereby influences the marginal cost differences among the products produced by the same firm. When θ = 1, the marginal cost difference between two products depends solely on their relative productivities. In contrast, when θ ̸= 1, marginal cost differences are shaped by both productivity differences and relative output levels. Consequently, the value of θ characterizes the marginal rate of transformation (MRT) between any two products, defined 6The transformation function approach characterizes a firm’s production possibility frontier following an approach pioneered by Powell and Gruen (1968) and used in recent literature (e.g. Cairncross et al., 2025; Koike-Mori and Martner, 2024). 7We exclude the Cobb–Douglas function for the purpose of controlling for unobservable firm heterogeneity of material prices. As will become clear in Section 3, our estimation methodology leverages the first-order conditions of profit maximization to uncover firm heterogeneity in material prices by examining the firm-level variation in the ratio of material expenditures to labor expenditures. However, the Cobb–Douglas functional form of F(·) implies a constant ratio of material to labor expenditures, which prevents us to do so. 9

as the ratio of their marginal costs. For any two products n and m, the MRT represents the amount of product m that must be forgone to produce an additional unit of n, holding inputs and all other outputs constant. Graphically, it corresponds to the slope of the production possibility frontier in the (Q ,Q ) space, conditional on everything else. In our setting, jnt jmt (cid:16) (cid:17)θ−1 MRT = −eθ(ω˜jnt−ω˜jmt) Qjmt . Thus, θ influences how relative marginal costs are related nm Qjnt to relative output levels. Such a relationship is analogous to the marginal cost implications derived from the transformation function model in Dhyne et al. (2022). In Section 3.2, we will use such dependence to identify θ. The CES functional form in (4) is also derived by Cairncross et al. (2025) from product-level production functions under a set of assumptions in the context of multi-product firms. A few features of the transformation function are worth noticing. First, for multi-product firms, the transformation function can be interpreted as the frontier of joint production of all products, Q , n ∈ Λ . This interpretation has three implications: (i) different products are jnt jt manufactured with the same set of inputs; (ii) the inputs can be costlessly transferred across different products within the firm; (iii) producing more of one product means producing less of another product, holding inputs fixed. These implications are consistent with the modeling assumptions used by Dhyne et al. (2022), Orr (2022), and Valmari (2023). Second, our framework does not explicitly model input allocation within a firm. Instead, it accommodates the possibility of jointly utilized inputs across products, similar to the approach in Dhyne et al. (2022). This contrasts with existing methods that impute product-specific (exclusive) input allocations, thus abstracting away from the public-good nature of inputs within firms. Finally, an input-output separability assumption is embodied in our transformation function (3). Specifically, there are no interaction terms between outputs and inputs, although interaction among outputs and among inputs is allowed within the respective aggregators G(·) and F(·). This assumption implies that marginal cost differences across products produced by the same firm do not depend on the input mix. Such separability is also assumed in the literature (e.g., Dhyne et al., 2022; Cairncross et al., 2025). 2.3 Productivity A key element of our model is the quantity-based productivity ω˜ in (3), which varies jnt by firm, product, and period. We model the potential components and evolution of ω˜ jnt to highlight the key differences compared with the assumptions in the existing literature. Specifically, we unpack productivity into two components: ω˜ = ω −h(ξ ), (5) jnt jnt jnt 10

where ω is technical efficiency and h(ξ ) is a function of product quality ξ . We jnt jnt jnt model h(ξ ) as a part of quantity-based productivity because varieties of the same product jnt category produced by different firms can be vertically differentiated by quality and such quality differences have potential implications for productivity. Producing one additional unit of the high-quality product may require more production procedures (e.g., longer refinements in the steel industry in Li et al., 2025), better (or more specialized, exclusive) machinery, higher-quality(ormore)intermediatematerials, higherstandardsofqualitycontrol(e.g., lower septic infections rate in the healthcare industry in Grieco and McDevitt, 2017), and extra dedicated workers (e.g., promoting quality or demand rather than production as discussed by Bond et al., 2021). In turn, this leads to a lower quantity of output, holding the inputs fixed, and thus it implies an increase in the marginal cost of production (or equivalently a lower productivity). Thus, we refer to h(ξ ) as the cost of quality.8 jnt As a result, differences in quantity-based productivity can be due to not only technical efficiency but also the cost of quality. Theoretically, explicitly modeling the cost of quality h(ξ ) as a component of productivity allows for a trade-off between product quantity and jnt quality, conditional on inputs. Empirically, this also implies that comparisons of quantitybased productivity across firms and over time require controlling for quality differences. Thus, instead of modeling the evolution of quantity-based productivity, we model the evolution of technical efficiency, ω , as a Markov process: jnt ω = g (ω ,x )+ϵ , ∀n = 1,2,...,N, (6) jnt n t−1 jt−1 jnt where ϵ is an innovation term. jnt The function g (·) flexibly captures the relevant determinants of the evolution of technical n efficiency, depending on the focus of the application. For instance, in the context of technological spillovers (e.g., Malikov and Zhao, 2023), vector ω may include the technical t−1 efficiency of other products within the same firm as well as the same product produced by other firms. Alternatively, in settings focused on the endogenous evolution of productivity, vector x can include firm-level decisions made in period t−1—such as investment in jt−1 research and development, as emphasized by Doraszelski and Jaumandreu (2013)—which affect the future trajectory of technical efficiency. A key methodological advantage of our approach is that the estimation of production and demand functions does not necessarily rely on the evolution equation (6) or the productivity- 8Note that the term cost of quality in this paper refers only to the impact of quality on the marginal cost of production, rather than the overall cost of quality (including research cost for new products with higher quality, which is more dynamic in nature, or the installation cost of new equipment to produce higher quality products, which are usually one-time fixed costs). 11

quality trade-off (5). This allows researchers to estimate the technical efficiency process after obtaining efficiency measures, enabling a flexible modeling of dynamics. We demonstrate this advantage by exploring within- and across-firm technological spillovers in Section 7. 2.4 Inputs and Outputs Decisions At the beginning of period t, the firm observes a vector of pre-determined variables, which includes the product scope Λ , capital stock K , intermediate input price P , wage rate jt jt Mjt P , technical efficiency ω , and product quality ξ of all the products. Note that observing Ljt jt jt technical efficiency and product quality implies that the firm also knows productivity, ω˜ , jt because the firm knows the trade-off (5). The intermediate input price and wage rate can differ across firms and fluctuate over time, driven by factors such as localized input markets and transportation costs. In empirical work, while the wage rate is typically observable, the intermediate input price is rarely recorded. This creates a challenge due to input price bias, as emphasized by De Loecker et al. (2016). Our empirical approach, detailed in Section 3, is able to address this issue. A key assumption is that firms’ static input and output decisions do not influence input prices contemporaneously. While input prices may be endogenously determined—through negotiations or supply-chain investment decisions—and evolve over time, we treat them as predetermined with respect to static production choices. The firm’s objective is to maximize its total profit from all products in period t after observing its state, by optimally choosing the quantity of material (M ), the quantity of jt labor (L ), and the quantities of all the products to be produced (Q = {Q },n ∈ Λ ): jt jt jnt jt max (cid:80) E(P ˜ Q )−P M −P L Qjt,Mjt,Ljt n∈Λjt jnt jnt Mjt jt Ljt jt subject to: (1) and (3), (7) where the expectation is taken over the unexpected shock u embodied in the realized price jnt ˜ P . However, this does not affect the firm’s decisions on inputs and outputs because the jnt firm does not observe the ex-post shock at the time of decisions and E(eujnt) = 1. 3 Estimation Methodology The estimation method leverages a set of implications from the model that can be used to estimate productivity and quality at the firm-product-period level. The method is built upon the insights of Grieco et al. (2016, 2022), Harrigan et al. (2021) and Li and Zhang (2022), who utilize the first-order conditions of static profit maximization to control for unobservable variables in the production function estimation, but it is extended to the multi-product setting where within-firm allocation of inputs is unobserved. Specifically, while researchers 12

do not observe key variables such as productivity and quality, the firm observes them before making optimal production decisions. Thus, the idea is to invert the implications from the profit maximization problem to establish a unique one-to-one mapping from observable production decisions to variables that are unobservable to researchers and control for them in the estimation of the transformation function. Crucially, under mild conditions, our model admits such a mapping regardless of the number of products. Table 1: Comparison to existing estimation methods Production Firm-product Proxy Evolution Material price Demand system productivity free free* unobservable system DGKP Product • Orr Product • • Valmari Product • • CL Product • • DPSW Transformation • This paper Transformation • • • • • Notes: DGKP refers to De Loecker et al. (2016), Orr refers to Orr (2022), Valmari refers to Valmari (2023), CL refers to Chen and Liao (2022), and DPSW refers to Dhyne et al. (2022). [*] This applies when the input aggregator has a CES form. Compared with the existing methods in the literature, our method has several important innovations, as summarized by Table 1. First, our method models the production technology flexibly as a transformation function and not as a collection of single-product production functions (De Loecker et al., 2016; Orr, 2022; Valmari, 2023; Chen and Liao, 2022). This saves us from potentially restrictive assumptions regarding how firms allocate inputs to produce different products. This is especially important in the presence of shared inputs that serve as public goods within firms. In this regard, Dhyne et al. (2022)’s model is the most similar to ours. Second, our model offers the advantage of scalability as it does not require proxies for product-level productivity and rather relies on static optimization conditions that naturally increase with the number of products. This advantage allows for the analysis of industries with a large number of products without relying on assumptions to aggregate products. Third, our method is designed to deal with the bias caused by unobserved material prices, like De Loecker et al. (2016). We employ the variation of labor and material expenditure ratio (conditional on the wage rate) to identify material prices. This is particularly useful when material prices are heterogeneous across firms and over time but are unobservable to researchers. Fourth, our method has the potential to explore the productivity evolution after the estimation, contrary to the existing methods which rely on productivity evolution for the 13

estimation.9 This section is organized as follows. Section 3.1 establishes a one-to-one mapping between the observed data and unobservable heterogeneity using firm’s static profit maximization conditions. Section 3.2 derives the estimating equations using the established mapping and develops the estimation strategy. 3.1 From Observables to Unobservables: a One-to-one Mapping We begin the description of the estimation strategy by distinguishing the observable and unobservable variables to researchers in the estimation procedure. The researchers observe capital stock K , labor input L , labor expenditure E , material expenditure E , and jt jt Ljt Mjt the quantity Q and price P for each product n ∈ Λ . The researchers do not observe jnt jnt jt the material price P (or equivalently, the material input M ), as well as productivity ω˜ Mjt jt jnt and quality ξ for n ∈ Λ . Our objective is to estimate these unobserved variables alongside jnt jt the parameters of the transformation and demand functions. We establish the relationship between the observed data and the unobservables, leveraging the firm’s profit-maximization behavior. The idea is as follows. Although researchers cannot observe ω˜ , ξ , or P , as described in Section 2.4 these variables are observed by the jnt jnt Mjt firm and thus influence the firm’s optimal input and output decisions. By using the firm’s optimization conditions, we establish a unique one-to-one mapping (up to a set of unknown production and demand parameters) from observable data K , L , E , E , Q , and jt jt Ljt Mjt jnt P to unobservable variables ω˜ , ξ , and P . We develop the strategy as follows. jnt jnt jnt Mjt Mapping to quality. We write quality, ξ , as a function of observed output price and jnt quantity according to the inverse demand function (1). That is, ξ = P−1(P ,Q ), (8) jnt jnt t t where P and Q are the vectors of prices and qualities of all products and firms in period t.10 t t As an identification condition, the demand system must admit a unique solution for the quality levels given observable output prices and quality outcomes. This requirement is satisfied by a broad class of demand functions, including the widely adopted CES demand, discrete-choice demand, and random-coefficients logit demand models. 9This depends on the functional form of the input aggregator, F(·). If F(·) has a CES form or a restricted version of translog, then the methodology can be implemented without relying on the productivity evolution; if F(·) has an unrestricted translog form, then additional conditions are required to estimate all translog parameters. Online Appendix A describes how to use the productivity evolution for such conditions. 10Empirically, the recovered ξ contains the unexpected shock u , which usually appears as a (log-) jnt jnt additive term in popular demand functions. We clarify this further in the setup of CES demand in Section 5. 14

Standard methods for estimating these demand systems, typically relying on the use of appropriate instrumental variables, are well-established in the literature. Consequently, the identification and estimation of the demand system within our framework can be conducted as a standalone process. Once the demand system is estimated, the quality level ξ can be jnt recovered using (8). Moreover, the estimated demand system allows us to compute the price elasticity of demand for any product n of firm j with respect to product m of firm j′ as: ∂Q P jnt j′mt ≡ −η . (9) jtnm ∂P Q j′mt jnt Note that we have slightly abused the notation because product m can be either a product of the same firm j′ = j (i.e., cannibalization) or a product of another firm j′ ̸= j (i.e., competition). That is, the elasticity can be flexible and vary by firm, product, and time. Mapping to material price. We derive the mapping using the firm’s static profit maximization problem. The Lagrange function implied by the problem (7) is: (cid:40) (cid:41) (cid:88) L = P (Q ,Q ;ξ )Q −P L −P M −λ G(e−ω˜jtQ )−F(L ,M ,K ) , jt jnt jt −jt t jnt Ljt jt Mjt jt jt jt jt jt jt n∈Λjt (10) where λ is the Lagrangian multiplier. The random shock u is not included in this equation jt jnt because it is ex post and E(eujnt) = 1. The first-order conditions with respect to labor and material inputs are, respectively: ∂L ∂F(L ,M ,K ) jt jt jt jt = −P +λ = 0, (11) Ljt jt ∂L ∂L jt jt ∂L ∂F(L ,M ,K ) jt jt jt jt = −P +λ = 0. (12) Mjt jt ∂M ∂M jt jt Multiply them by L and M , respectively, and take the ratio of the two to obtain: jt jt ∂F Ljt E ∂Ljt F = Ljt . (13) ∂F Mjt E Mjt ∂Mjt F This equation only involves a single unobservable variable, M , and, for functional forms jt 15

of F(·) such as CES and translog, admits a unique solution:11 M = M(L ,E ,E ,K ), (14) jt jt Ljt Mjt jt and consequently, E Mjt P = . (15) Mjt M(L ,E ,E ,K ) jt Ljt Mjt jt The identification strategy for P is based on the relationship implied by the first-order Mjt conditions for labor and material inputs, and is conceptually aligned with Grieco et al. (2016). Conditional on the wage rate, changes in P induce a non-Hicks-neutral effect by Mjt altering the optimal ratio of labor to material expenditures. Consequently, variations in the labor-to-material expenditure ratio observed in the data (conditional on the wage rate) provide a basis for identifying P . This strategy presents an alternative to the method Mjt proposed by De Loecker et al. (2016), who use output prices as proxies for input prices to address the input price bias arising from unobserved heterogeneity in input prices. Substituting (14) into the first order condition for labor, we obtain a unique solution for the Lagrangian multiplier: P Ljt λ = . (16) jt ∂F(Ljt,M(Ljt,ELjt,EMjt,Kjt),Kjt) ∂Ljt Thatis, theLagrangianmultiplierisderivedfromequalizingthemarginalbenefitandmarginal cost of labor input. This equation can also be cast in terms of material input, which is equivalent to (16) due to how (unobserved) material input and its price are recovered. Mapping to productivity. The first-order condition with respect to each product quantity Q , n ∈ Λ , is: jnt jt   ∂L  (cid:88) ∂P (Q ,Q ;ξ )  ∂G(e−ω˜jtQ ) jt jmt jt −jt t jt = Q +P −λ jmt jnt jt ∂Q ∂Q ∂Q jnt  jnt  jnt m∈Λjt P = jnt −λ e−θω˜jntQθ−1[G(e−ω˜jtQ )]1−θ = 0, (17) µ jt jnt jt jnt (cid:124) (cid:123)(cid:122) (cid:125) (cid:124)(cid:123)(cid:122)(cid:125) marginalcost marginalrevenue 11When the functional form of F satisfies the condition proposed by Proposition 1 of the Online Appendix ofGriecoetal.(2016),thereexistsauniquesolutionforM . Inourempiricalapplication,theCESfunctional jt form of F(·) satisfies this condition. We provide a full procedure for estimating a translog functional form of F(·) in Appendix A under mild conditions for the evolution of technical efficiency, and we establish the condition to ensure the unique solution for M . For the Cobb-Douglas form of F(·), such a solution does not jt exist because the elasticity ratio on the left-hand side of (13) is always a constant. Intuitively, the material pricevariationinaCobb-Douglasproductionfunctiondoesnotchangetheoptimalratiooflaborandmaterial expenditures. Thus, we refrain from using the Cobb-Douglas form of F(·). 16

where 1 µ ≡ (18) jnt 1− (cid:80) 1 Rjmt m∈Λjt ηjtnm Rjnt is the markup of product n and η is the price elasticity of demand defined by (9). jtnm Notably, λ e−θω˜jntQθ−1[G(e−ω˜jtQ )]1−θ is the marginal cost of producing Q . Across jt jnt jt jnt firms, the marginal cost of a product varies due to λ ; within a firm, the marginal cost also jt differs due to productivity ω˜ and scale of production Q . As a result, conditional on a jnt jnt firm, the variation in product prices identifies the productivity difference across products within the firm, after accounting for the markup µ and production scale Q . jnt jnt This idea is formally developed to derive the productivity mapping. Using (17) and substituting M in it by (14), we obtain: jt µ eθω˜jnt = jnt λ Qθ−1[F(L ,M(L ,E ,E ,K ),K )]1−θ, (19) P jt jnt jt jt Ljt Mjt jt jt jnt where we have substituted G(·) by using (3) and λ is given by (16).12 jt In summary, we have established a one-to-one mapping—comprising (8), (15), and (19) — from observable data to the unobservable variables ξ , P , and ω˜ , conditional on the jnt Mjt jnt demand and production parameters to be estimated. This mapping is unique for widely used demand functions, such as CES demand, discrete-choice demand, and random-coefficients demand models, as well as for common production function specifications, including CES and translog functional forms. Conceptually, our approach parallels the proxy-based methodology pioneered by Olley and Pakes (1996), and extended by a large body of methodological work, which uses observable proxies (such as investment and material inputs) to control for unobserved productivity when estimating production functions. However, in the context of multi-product firms with product-level heterogeneity, the proxy-based approach faces a scalability challenge: the number of required proxies grows with the number of products, as recognized by Dhyne et al. (2022). Our methodology leverages firms’ first-order conditions to construct the mapping, offering a key advantage in scalability. As the number of products increases, so does the number of first-order conditions. This scalability is shared with recent approaches such as Orr (2022) and Valmari (2023), while we also adopt a transformation function approach as in Dhyne et al. (2022) to avoid assigning inputs to the production of individual outputs. 12Empirically, the recovered productivity contains the unexpected shock u (as a log-additive term). jnt 17

3.2 Estimating Equations and Estimation Strategy In the previous subsection, we have constructed a one-to-one mapping from observable variables to the unobserved ξ , ω˜ , and P (or M equivalently) up to a set of parameters jnt jnt Mjt jt to be estimated. This mapping is the key to developing the equations to estimate these parameters, which we derive in this subsection. Estimating a general demand system (1) is challenging due to unobservable demand factors (e.g., quality) and the endogeneity of prices. Depending on specific context and functional form of (1), strategies are well-developed (e.g. Berry, 1994; Berry et al., 1995) to address these challenges, mainly using a set of instrumental variables. Since our focus is on the production transformation function, we assume the existence of a valid set of instrumental variables, allowing researchers to estimate the demand system (1). Consequently, the firm-producttime-specific quality ξ and markup µ can be recovered via (8) and (18), respectively. jnt jnt To derive our main estimating equation, we start by multiplying both sides of the equation implied by the first-order condition (17) by Q . Rearranging this equation gives: jnt R jnt = λ e−θω˜jntQθ [G(e−ω˜jtQ )]1−θ, (20) µ jt jnt jt jnt where R = P Q . jnt jnt jnt Sum the above equation over n ∈ Λ to obtain: jt (cid:88) R jnt = λ (cid:2) G(e−ω˜jtQ ) (cid:3)1−θ (cid:88) (cid:0) e−θω˜jntQθ (cid:1) = λ F(L ,M ,K ), (21) µ jt jt jnt jt jt jt jt jnt n∈Λjt n∈Λjt where we have used the transformation function to replace G(·) to obtain the last equality. From the first-order conditions of labor input and material input, (11) and (12), we obtain E +E Ljt Mjt λ F = , (22) jt jt ∂Fjt Ljt + ∂Fjt Mjt ∂Ljt Fjt ∂Mjt Fjt where F is a short-hand notation of F(L ,M(L ,E ,E ,K ),K ). jt jt jt Ljt Mjt jt jt Substitute this equation into (21) to obtain:13 (cid:88) R E +E jnt Ljt Mjt = . (23) µ jnt ∂Fjt Ljt + ∂Fjt Mjt n∈Λjt ∂Ljt Fjt ∂Mjt Fjt 13An alternative expression is (cid:80) Rjnt = ELjt or (cid:80) Rjnt = EMjt . Nonetheless, because n∈Λjt µjnt ∂Fjt Ljt n∈Λjt µjnt ∂Fjt Mjt ∂Ljt Fjt ∂Mjt Fjt we substitute the recovered material quantity (14), which is derived from (13), into these equations, both of these equations are equivalent to (23). 18

Therefore, (23) describes the relationship between revenues (adjusted by the reciprocal of markups) and inputs for a general system of demand and production transformation functions in the context of multi-product firms. This equation is an analog of the popular ratio estimator of markup in De Loecker et al. (2016) in the context of single-product firms.14 De Loecker et al. (2016) focus on uncovering markups after estimating the production function parameters (and the output elasticities), without relying on the estimation of any demand system. In the context of multi-product firms, this equation is also an analog of the markupinput share relationship examined by Cairncross et al. (2025). Both De Loecker et al. (2016) and Cairncross et al. (2025) estimate markups using production data only without making assumptions on the demand structure. Our methodology uses the same relationship but takes a different approach: we utilize the demand system to first uncover the markups and then proceed to estimate the production parameters using (23) as the estimating equation. This strategy offers several advantages. It is scalable for handling a large number of products, addresses bias caused by unobservable material price heterogeneity, and makes it possible to estimate production parameters without relying on productivity evolution process, because all unobserved firm heterogeneity (i.e., multi-dimensional productivity and quality as well material quantity) are substituted by observable variables in the data using the mapping developed in Section 3.1. Itisimportanttonoticethat(23)holdsforthetheoreticallypredictedrevenueR ,because jnt it is derived from the firm’s profit maximization problem. Empirically, researchers do not observe the theoretically predicted revenue R ; instead, the observed revenue contains the jnt unexpected shock as defined in (2): R ˜ = P ˜ Q = P Q eujnt = R eujnt. Substitute jnt jnt jnt jnt jnt jnt this relationship into (23) to replace the theoretically predicted revenue, and rearrange to obtain an empirical estimating equation that involves the observed revenue directly:   (cid:34) (cid:35) ˜ (cid:88) R E +E jnt Ljt Mjt ln µ jnt  = ln ∂Fjt Ljt + ∂Fjt Mjt −u jt , (24) n∈Λjt ∂Ljt Fjt ∂Mjt Fjt where   (cid:34) (cid:35)  (cid:88) R ˜ /µ  u = ln jnt jnt e−ujnt (25) jt (cid:80) ˜  n∈Λjt n∈Λjt R jnt /µ jnt  is a firm-level composite error term. Intuitively, it is a geometric mean (in logarithm) of 14To see this, notice that, for single-product firms, (23) degenerates to Rjt = ELjt+EMjt , and µjt ∂Fjt Ljt+ ∂Fjt Mjt ∂Ljt Fjt ∂Mjt Fjt ∂Fjt Ljt+ ∂Fjt Mjt consequently, the markup can be written as µ = ∂Ljt Fjt ∂Mjt Fjt . jt (ELjt+EMjt)/Rjt 19

firm-productlevelunexpectedshocku , usingwithin-firmshareofR ˜ /µ astheweights.15 jnt jnt jnt We estimate the associated production parameters, denoted as β, using generalized method of moments (GMM): (cid:34) (cid:35)′ (cid:34) (cid:35) 1 (cid:88) 1 (cid:88) ˆ β = argmin u Z W u Z , (26) β N jt jt N jt jt j,t j,t   (cid:34) (cid:35) ˜ E +E (cid:88) R Ljt Mjt jnt where u jt = ln ∂Fjt Ljt + ∂Fjt Mjt −ln µ jnt . ∂Ljt Fjt ∂Mjt Fjt n∈Λjt W is a weight matrix, N is the number of firm-time observations, and Z is a set of jt instrumental variables. Because u is a composite of ex-post error terms of prices, natural jt candidates of instrumental variables include firm-level inputs such as L ,E ,E and K . jt Ljt Mjt jt In addition, in settings where firms compete in product markets, product characteristics of rival firms, if observable, may be also included in the instrumental variable set. Although this estimation strategy offers a straightforward approach to estimating the primary production parameters, the parameter θ which characterizes the technological substitution of outputs within the firm, does not appear in the estimating equation (23). To identify and estimate θ, we leverage the influence of θ on the marginal rate of transformation (via within-firm marginal cost differences) across products within a firm. Take the ratio of the equation implied by the first-order condition (17) of product n to that of product m. The logarithm of the ratio is: (cid:20) (cid:21) (cid:18) (cid:19) P /µ Q jnt jnt jnt ln = (θ−1)ln +v , (27) jnt P /µ Q jmt jmt jmt (cid:124) (cid:123)(cid:122) (cid:125) (cid:124) (cid:123)(cid:122) (cid:125) marginalcostratio,inlog marginalrateoftransformation,inlog where v = θ(ω˜ −ω˜ ) is the relative difference between the productivity of the two jnt jmt jnt products, adjusted by parameter θ. Intuitively, this equation aligns with the definition of the marginal rate of transformation discussed in Section 2.2. The left-hand side corresponds to the ratio of marginal costs, while the right-hand side represents the marginal rate of transformation—both expressed in logarithmic terms. Unless θ = 1, the marginal cost ratio depends not only on productivity differences but also on the relative scale of production between the two products. Therefore, we identify θ by examining how marginal costs 15Inthecontextofthefirm-levelshock(i.e.,u =u ,∀n)orinthecontextofsingle-productfirms,thereis jnt jt (cid:34) (cid:35) only one unexpected shock per firm. Consequently, (24) simplifies to ln (cid:104) R˜ jt (cid:105) =ln ELjt+EMjt +u . µjt ∂Fjt Ljt+ ∂Fjt Mjt jt ∂Ljt Fjt ∂Mjt Fjt This degenerated form aligns with the estimating equation proposed by Grieco et al. (2016). 20

differ across products based on their relative output levels within the same firm. A similar identification strategy is also adopted by Khmelnitskaya et al. (2025). To implement this idea, we treat v as an error term. Since v is correlated with the jnt jnt production scale ratio, there is an endogeneity problem. Empirically, researchers can estimate (27)usingaTwo-StageLeastSquares(2SLS)estimatorwithasetofIVs. Ideally, firm-productlevel instrumental variables are preferred. For example, differences in product characteristics may shift the quantity ratio due to varying demand driven by these characteristics, while being traditionally assumed to be uncorrelated with cost-side (productivity) differences. When firm-product-level instruments are unavailable, firm-level variables, such as the wage rate, can be used as instruments. This approach benefits from examining relative differences between two products within the same firm. For instance, conditional on other factors, a lower wage rate decreases the firm’s overall marginal cost, leading to higher production quantities for both products. However, the product with less elastic demand (e.g., product m) expands more, resulting in a lower quantity ratio (Q /Q ). Thus, the wage rate and jnt jmt production scale ratio are correlated. Of course, the validity of firm-level instruments depends on the assumption that the levels of these variables are uncorrelated with the differences in productivity between two products, which is discussed in Online Appendix D. We examine the performance of our estimation method in a Monte Carlo setting in Section 5.3. As a summary of the full estimating approach, the first step is to estimate the demand system (1) to obtain the estimates of demand parameters and product markups. Second, we estimate θ from the within-firm marginal rate of transformation relationship (27) using 2SLS. The third step is to estimate the production parameters using (23) via GMM. With these estimates, researchers can compute quality and productivity via (8) and (19), respectively. Although the model and estimation strategy are presented in a general framework, researchers should tailor their choice of functional forms to their specific context, balancing flexibility with the feasibility of empirical implementation. In our application, we select a CES demand system and a transformation function with a CES input aggregator, due to the consideration of the characteristics of the industries, the market structure, and the availability of instrumental variables. The next section focuses on the industry characteristics and market structure, providing the context for these choices. 4 Data We estimate our model using firm-level Mexican manufacturing data, collected by the Instituto Nacional de Estad´ıstica y Geograf´ıa (National Institute of Statistics and Geography, INEGI henceforth) and covering the period 1994-2007. We use two datasets: the Encuesta Industrial Anual (Annual Industrial Survey, EIA henceforth), the main annual survey covering the 21

manufacturing sector, and the Encuesta Industrial Mensual (Monthly Industrial Survey, EIM henceforth), a monthly survey that monitors short-term trends related to employment and output.16 These datasets are particularly useful for our analysis because they provide quantity and sales information at the firm-product level. However, similar to most production data, information regarding inputs, viz. physical capital, intermediate input, number of workers and wage bills, are only available at the firm level.17 Firms are classified by INEGI into one of the classes of activity based on their principal product. A class of activity is the most disaggregated level of industrial classification and is defined at six digits according to the 1994 Clasificacio´n Mexicana de Actividades y Productos (Mexican System of Classification for Activities and Products, CMAP henceforth). Firms report quantity and sales information product by product based on their industries. We focus on three specific classes of activities: manufacturing of footwear, mainly of leather (class 324001, footwear in short); printing and binding (class 342003, printing in short); and manufacturing of pharmaceutical products (class 352100, pharmaceuticals in short).18 These three industries were chosen because each industry is made up of more than 500 firm-year observations, a number of observations large enough for implementing our estimation strategy. More importantly, multi-product firms are particularly prevalent in these industries: 56% of firms in these industries are multi-product producers and such firms account for 86% of total revenues and produce on average 6.9 products per year. They also represent a diverse set of manufacturing industries with clear concepts of product quality: for example, advanced design and assembly that provide superior comfort and durability in the footwear industry; acid-free paper and durable binding in the printing industry; potent active ingredients and 16This paper uses plant -level Mexican manufacturing data, collected from the Instituto Nacional de Estadistica y Geografia (National Institute of Statistics and Geography), INEGI. Mauro Caselli obtained accesstothedata. ArpitaChatterjee,theauthoraffiliatedwiththeFederalReserveBoard,hasnounauthorized access to the data. The unit of observation in both surveys is a plant rather than a firm and the sample includes all plants with more than 100 employees as well as a sample of smaller plants. For simplicity and in line with the literature, we will use the term “firm” to refer to a plant. More information on the EIA and EIM can be found in Caselli et al. (2017). 17All nominal variables are deflated using the consumer price index. To facilitate comparison, we normalize average industry output prices to 1. Initial capital stock and investment are deflated using industry-level price indices. 18For the purpose of our analysis, all products with fewer than 100 observations are aggregated together in a residual product category. The residual product category is defined as “Others” (product code 99) in Table A1 in the Online Appendix. The prices and quantities of the aggregated residual product category are estimatedfollowingDiewertetal.(2009). Whilethisaggregationisrequiredtoestimatethedemandelasticity of substitution for each product based on a large enough number of observations, it only implies that the demand elasticity of substitution is by assumption equal across all products included in the residual product category within an industry. In addition, this aggregation involves a relatively small share of products: the main (i.e., not aggregated) products account for between 81% and 93% of observations and between 82% and 90% of revenue across the three industries. Accordingly, the descriptive statistics and patterns demonstrated in this section are reported based on the aggregated categories, which is the data used in our estimation. 22

degrading-preventing packaging in the pharmaceutical industry. There are a few patterns worth noting. First, multi-product production is an essential feature of the firms in our sample. We demonstrate this point by using an index that is analogous to the traditional Herfindahl–Hirschman Index (HHI) as the sum of the squared shares of sales within a firm. A higher HHI index means a higher level of concentration of sales within a firm.19 The index is naturally equal to one for single-product producers. For firms with a larger product scope, HHI decreases sharply becoming close to 0.3 for firm-year pairs producing 5 products and close to 0.2 for firm-year pairs producing 10 or more products.20 These values imply that producers are genuine multi-product firms – they do not concentrate production entirely on their top products, and all products, albeit to different degrees, are important for firms’ total revenues.21 Thus, multi-product firms need to be treated and modeled as such and they cannot be simplified as single-product producers. This characteristic of the industries is also an important feature that enables us to exploit the within-firm relationship to identify model parameters, as discussed in Online Appendix D. Table 2: Descriptive statistics: prevailing multi-product firms Variable Footwear Printing Pharmaceutical Product scope, MPFs only 2.403 6.195 8.000 (0.624) (4.107) (3.018) Share of number of MPFs 0.201 0.538 0.845 Revenue share of MPFs 0.386 0.597 0.939 Total number of products 4 15 16 Total number of firms 72 79 80 Number of firm-year pairs 617 744 867 Notes: The table reports the means and standard deviations (in parenthesis) for each variable by industry. Product scope is the number of products manufactured by firm. MPFs refers to multi-product firms only. The importance of multiple-product production is also present in all the industries of our analysis, albeit with some degrees of variation, as shown in Table 2.22 The percentage 19In Figure A1 in the Online Appendix, we aggregate the firm-level index with weights equal to the firms’ total revenues, by firm-year pairs’ product scope. 20These values indeed show some degree of concentration of sales within firms. For example, if a firm produces 5 products with equal sales, the index would be 0.2. The fact that the index is close to 0.3 implies that there exists an uneven distribution of sales. We explore this heterogeneity using quality and productivity within firms in Section 7. 21To confirm that firms rely heavily on all products for their total sales, Online Appendix Table A2 shows theaveragewithin-firmproductsharesbyproductscope. Forinstance,forfirmsproducing5ormoreproducts, the share of products other than the top product is 0.556 and the share of products with rank 5 and beyond is 0.147, on average. 22Additional descriptive statistics are available in Table A3 in the Online Appendix. 23

of multi-product firms ranges from 20% in the footwear industry to 54% in printing and 85% in pharmaceuticals and they account for an even larger share of revenues (from 39% in the footwear industry to 94% in pharmaceuticals). The average product scope is larger in printing and pharmaceuticals (respectively, 6.2 and 8.0 for multi-product firms) than in the footwear industry (2.4). These differences in average product scope are in line with the number of product categories available in each industry, which ranges from 4 in footwear to 16 in pharmaceuticals. Second, the status of being a multi-product firm is quite persistent, and so is the product scope. In particular, using a simple autoregressive process of the number of products produced by each firm, we measure the persistence coefficients to be 0.87, 0.96, and 0.98 in the three industries, respectively.23 Thus, multi-product firms unequivocally dominate manufacturing production in our data and their within-firm adjustment across products is more salient than the extensive margin adjustment in changing the number of products. These patterns imply that both within-firm and across-firm heterogeneity is important. On the one hand, there exist persistent characteristics at the firm level that determine the performance across firms. On the other hand, within-firm heterogeneity and product scope play a significant role in shaping these characteristics within firms. These implications are in line with the specification for productivity (32), which contains a common component at the firm level to capture the differences across firms as well as an individual component varying at the firm-product level to explain the variation of performance within a firm. Finally, the sample reflects patterns consistent with the choice of the empirical demand model in Section 5. On average, about 16 to 37 firms compete in the market for any given product in any given year. The majority of the firms do not command a dominant share of the market – the median (traditionally defined) HHI across firms at the product-year level ranges between 0.15 in the pharmaceutical industry and 0.31 in the printing industry. More importantly, given the level of product disaggregation, the markets for different products (e.g., women’s shoes vs. men’s shoes in the footwear industry, and more examples in Online Appendix Table A1) are reasonably assumed as segmented. For each product, firms’ outputs are vertically differentiated as evidenced by the large dispersion in prices.24 Overall, these patterns support abstracting from demand cannibalization across products made by the same firm and assuming that firms face monopolistic competition within each product category. 23The entry of new products and the exit of old products only account for 6.8 and 7.3 percent of the observations, respectively. 24For example, the interquartile range of prices in logarithm is about 1.4 (i.e., a 400% difference) within a product category, on average, across the three industries. 24

5 Empirical Model and Estimation This section presents an empirical model and applies the estimation strategy developed in Section 3 to the Mexican manufacturing industries. We choose specific functional forms for the demand and production model considering the characteristics of the industries, product classification, market structure, and availability of instrumental variables. 5.1 Empirical Specification On the demand side, we adopt a CES demand function, assuming that while the demand for each of the N products is segmented, there is monopolistic competition among firms producing vertically differentiated products within the same product line. Formally, the ˜ realized price, P , from the inverse demand function (1), after accounting for the unexpected jnt shock in (2), for product n of firm j in period t, is specified as: 1 1 ˜ lnP = − lnQ + (ϕ +ψ +v )+u , (28) jnt jnt nt jn jt jnt η η n n (cid:124) (cid:123)(cid:122) (cid:125) ξjnt where η denotes the constant elasticity of demand for product n. The term ξ represents n jnt product quality and comprises three components: ϕ (product-time fixed effects), ψ nt jn (firm-product fixed effects), and v (firm-time fixed effects). The term u represents an jt jnt idiosyncratic firm-product-time specific ex-post price shock. We follow the tradition in the literature (e.g., Melitz, 2000; Khandelwal, 2010; Hottman et al., 2016; Pozzi and Schivardi, 2016; Eslava et al., 2024) to refer to the residual recovered from the CES demand system as ˜ ˜ the perceived product quality: ξ ≡ lnQ +η lnP = ξ +η u . jnt jnt n jnt jnt n jnt Theempiricaldemandmodel(28)excludescomplementarityorsubstitutionacrossdifferent product lines. This choice is driven by empirical considerations in our context. First, given the level of product classification in our data, complementarity or substitution on the demand side is unlikely. For example, the demand for women’s shoes is unlikely to be influenced by competition from men’s shoes. Similar functional forms have been employed by De Loecker (2011) and Valmari (2023) in modeling demand functions for multi-product contexts. Second, while estimating a richer model is conceptually appealing, a major difficulty arises from the lack of suitable instrumental variables to address the endogeneity issue associated with unobserved product quality. Traditional instruments, such as cost shifters, may fail in vertically differentiated markets where higher-quality inputs (thus higher input prices) are chosen to enhance product quality. In general, estimating flexible demand systems requires carefully constructed instruments that are uncorrelated with product quality. For example, 25

Berry et al. (1995) utilize the characteristics of other automobiles produced by the firm itself and similar automobiles produced by its rivals. In our case, the dataset does not include such rich and strong instruments. Therefore, we adopt the CES functional form for the demand function, which enables us to leverage the within-firm variation in sales across products to estimate the constant demand elasticity parameter η , as described in Section 5.2. n On the production side, we use a CES input aggregator in the transformation function (3): F(L ,M ,K ) ≡ (cid:2) α Lγ +α Mγ +α Kγ(cid:3) γ ρ , (29) jt jt jt L jt M jt K jt where γ ≡ σ−1 governs the elasticity of substitution across inputs. ρ is a parameter for the σ returns to scale in the transformation of inputs into output. α , α , and α are share L M K parameters associated with labor, material, and capital, respectively. We normalize their sum to 1. We maintain the output aggregator as defined in (4). 5.2 Estimation Applying the methodology described in Section 3.2 to the empirical model, we obtain an explicit, unique mapping from observable data to the unobservable variables: ˜ ˜ ξ = lnQ +η lnP , (30) jnt jnt n jnt (cid:20) α (cid:21) γ 1 (cid:20) E (cid:21)1− γ 1 M Mjt P = P , (31) Mjt Ljt α E L Ljt eθω˜jnt = η n E Ljt (cid:20) α Lγ (cid:18) 1+ E Mjt (cid:19) +α Kγ (cid:21)1−ρ γ θ Qθ−1. (32) (η −1)P ˜ ρα Lγ L jt E K jt jnt n jnt L jt Ljt Equations (30), (31), and (32) empirically represent the mappings of quality (8), material ˜ price (15), and productivity (19) in the general model. Specifically, ξ is expressed as a jnt function of observed price and quantity, capturing how product quality can be inferred from observable market outcomes. Similarly, P is determined as a function of the labor-to- Mjt material expenditure ratio, conditional on the wage rate, in the same spirit as in Grieco et al. (2016). Finally, the identification of ω˜ integrates variations at both the firm level (i.e., L , jnt jt K , E , and E ) and the firm-product level (i.e., P and Q ). jt Ljt Mjt jnt jnt After substituting the mapping to the transformation function to replace the unobserved 26

productivity and material input, we obtain an explicit expression of (24) in logarithm as:   (cid:88) (η −1)ρ (cid:20) (cid:18) α (cid:18) K (cid:19)γ(cid:19)(cid:21) n ˜ K jt ln η R jnt = ln E Mjt +E Ljt 1+ α L −u jt , (33) n L jt n∈Λjt where   u = ln  (cid:88) (cid:34) (ηn η − n 1)R ˜ jnt e−ujnt (cid:35)  . (34) jt  n∈Λjt (cid:80) n∈Λjt (ηn η − n 1)R ˜ jnt  This equation is the multi-product version of the estimating equation proposed by Grieco et al. (2016) (see their equation 8), who assume that each firm produces a single product.25 In the context of multi-product firms, the individual product revenues are adjusted by the reciprocal of their corresponding markups.26 As explained in Section 3.2, the parameter θ is not present in the estimating equation (33) and thus is not identified by (33) alone. Thus, we estimate (27) via 2SLS to identify θ. In our implementation, the IV set consists of a constant and the logarithm of the wage rate (P ), the capital stock (K ), and the ratio of material expenditure to labor (E /L , as a Ljt jt Mjt jt proxy for material prices, conditional on the wage rate).27 This provides an estimate θ ˆ . Nonetheless, we still face two additional challenges to estimate all the parameters.28 First, ρ is not separately identified from demand elasticities in (33). In fact, only a combination of η and ρ (i.e., (ηn−1)ρ) is identified. Second, estimating (33) via GMM requires (at least) the n ηn same number of instrumental variables as the number of products to identify (ηn−1)ρ of each ηn product, because product revenues are correlated with composite shock u . jt To address both challenges simultaneously, we leverage the context of multi-product firms, which provides valuable within-firm variation. We explore the relationship between the revenues of any two products implied by the firm’s static maximization problem, taking into 25More broadly, (33), without logarithms, is also similar to the estimating equations used by Das et al. (2007), Aw et al. (2011), and Li (2018) with data on the firm’s total variable cost to estimate demand elasticities in multiple markets. 26If the elasticities (markups) are the same, then the estimating equation is the same as in Grieco et al. (2016). We also allow for the returns to scale parameter, ρ, to be estimated, while Grieco et al. (2016) assume it to be one. 27To see this, note that (31) is equivalent to P Mjt = (cid:104) α α M L (cid:105) γ 1 (cid:104) E L M jt jt (cid:105)1− γ 1 P L γ 1 jt . Taking logarithm, we obtain (cid:104) (cid:105) (cid:104) (cid:105) ln(P Mjt )= γ 1 ln α α M L +(1− γ 1)ln E L M jt jt + γ 1 ln(P Ljt ). Because we include the logarithm of the wage rate, (cid:104) (cid:105) ln(P ), in the IV set, using ln EMjt is equivalent to using ln(P ) in this setting, although P is not Ljt Ljt Mjt Mjt observable. Our result is quantitatively similar if the expenditure ratio of material and labor is used as an IV. 28Theseadditionalchallengesarisebecauseweareestimatingthereturnstoscaleparameter,ρ,andbecause the standard input data available in the Mexican dataset lacks instrumental variables to estimate the demand function (28) directly. See footnote 29 for further discussion. 27

account that the markets for different products are segmented in our empirical context. Here, η influences the sales of individual products, while ρ represents the returns to scale of the n production transformation function and affects the overall sales of all products. Thus, the firm’s optimal decision on trading off the sales of different products within the firm helps identify η from ρ. In other words, the variation in the sales of a product relative to another n product contains information on how the elasticities of the two products differ. This addresses the first challenge. Meanwhile, the identified relationship between elasticities reduces the number of parameters to be estimated in (33). Consequently, the number of instrumental variables required to estimate the rest of the parameters does not increase with the number of products. This addresses the second challenge. To implement this idea, we take the ratio of (17) of a reference product (i.e., product ˜ ˜ 1) and that of another product n produced by the same firm. Using R = P Q , we jnt jnt jnt obtain:29 1−θ ηn ln(R ˜ ) = c + ηn−1 ln(R ˜ )+ζ , n = 2,...,N, (35) j1t n 1−θ η1 jnt jnt η1−1 where (cid:20) (cid:21) 1 η η −1 1 n c = ln n 1−θ η1 η −1 η η1−1 1 n and   θ  1 ˜ 1 ˜  ζ = (ω˜ + ξ )−(ω˜ + ξ ). jnt 1−θ η1  jnt η −1 jnt j1t η −1 j1t  η1−1 (cid:124) n (cid:123)(cid:122) 1 (cid:125) differenceinTFPR The latter, ζ , contains the difference of the capability (or TFPR, ω˜ + 1 ξ ˜ , as will be jnt η−1 formally defined in Section 6) of producing a product relative to that of the reference product. This equation predicts that the (logarithmic) revenues of two products are linearly related conditional on the difference of production capability. In particular, firm-level inputs are not a part of the equation explicitly. This equation is similar to the estimating equation developed by Grieco et al. (2022), who explore the relationship of revenues of two markets (domestic sales and exports).30 29Note that this approach contrasts with much of the existing literature, which often relies on direct estimation of the demand function (28) using firm-level IVs (e.g., cost shifters). However, these IVs may be correlated with the level of quality, potentially biasing the results. When we estimate the demand function (28) directly using the same firm-level IVs, the resulting demand elasticities are biased downward: the mean elasticities are estimated at 2.539, −2.024, and 0.470 for the three industries, respectively. Of course, when appropriate instrumental variables exist, such as the case in Orr (2022), one can estimate demand function (28), or even a more flexible version of it, directly without resorting to this set of estimating equations. 30One difference is that Grieco et al. (2022) model the error term as an unexpected shock because the productivity and quality of the domestic and export products are assumed to be the same and are canceled. 28

Intuitively, because the demand for each product is segmented in our setting, as discussed in Section 4, the relative revenue of one product over another product in the same firm depends on their own demand elasticities (conditional on their relative levels of TFPR, ζ , as jnt well as the estimated technological substitution parameter θ) rather than on complementarity or substitution between them on the demand side. As a result, the variation of one revenue 1−θ ηn relative to another in (35) provides the identification of the ratio, ηn−1 for n = 2,3,...,N. 1−θ η1 η1−1 In contrast, the variation of revenue levels in (33) identifies (ηn−1)ρ, n = 1,2,...,N. That is, ηn the returns to scale parameter affects the sales of all products but not the relative relationship of sales between different products, while demand elasticities affect both the level and the relative relationship of sales of different products. As a result, ρ and η , n = 1,2,...,N, n are separately identified as long as there are at least two products with different demand elasticities in the industry. More precisely, the elasticities and returns to scale parameter can be identified as long as there is a firm that manufactures two products with different demand elasticities for a number of periods, which is a very mild assumption. The model is over-identified when there are more than two such products produced by the firms in the industry. To estimate (35), we choose the product produced by most firms in the industry as the reference product, in order to maximize the number of observations used in the estimation.31 We treat ζ as an error term. We allow the mean of ζ to vary by product and year and jnt jnt use a set of flexible product-year dummies as controls (which also absorb c ). ζ is likely n jnt ˜ correlated with R – the revenue of product n is lower if the capability of producing n is jnt lower than that of the reference product. We use a set of IVs to address the endogeneity issue. In our implementation, we use the same set of IVs used in estimating (27): the IV set consists of a constant and the logarithm of the wage rate, the capital stock, and the ratio of material expenditure to labor (as a proxy for material prices after conditional on wage rate). Grieco et al. (2022) uses a similar set of firm-level IVs to estimate an equation analogous to (35) in a two-product scenario. The same insight carries over in our context. These firm-level variables ˜ influence the level of revenue (i.e., R ), but they are uncorrelated with the difference of jnt capability (i.e., ζ ) between two products. For example, conditional on everything else, a jnt higher level of capital stock potentially leads to higher revenues of a given product, but it is not necessarily associated with the production capability of one product being larger than that of another product within the same firm. We use these firm-level variables as IVs for all product pairs in (35).32 31In our data, the percentage of firm-year pairs that produce the reference product ranges from 62% in the footwear industry to 72% in printing and 88% in the pharmaceutical industry. 32The model is over-identified if there is more than one IV. For example, if there are 2 IVs, then there are 29

The validity of these IVs relies on the condition that the production of a product is not systematically more intensive in the use of a specific input (e.g., capital) than other products and that the firm-level wage rate and input price are not systematically correlated with the capability differences between products. We use Monte Carlo exercises to demonstrate the performance of our approach and IVs under this condition in Section 5.3. We further discuss this condition and alternative strategies in Online Appendix D. ˆ 1−θ ηn We denote the estimated relationship between elasticities as b = ηn−1, n = 2,...,N, n 1−θ η1 η1−1 and, naturally, ˆ b = 1 by definition. Thus, η = 1+ θ(η1−1) . Substitute it as η in 1 n bn+(1−θ)(η1−1−bnη1) n (33) and solve for u to construct moment conditions for the GMM estimation:33 jt   (cid:88) θ(η −1) (cid:20) (cid:18) α (cid:18) K (cid:19)γ(cid:19)(cid:21) 1 ˜ K jt u jt = lnρ+ln (η −1)+ ˆ b +(θ−1) ˆ b η R jnt−ln E Mjt +E Ljt 1+ α L . n∈Λjt 1 n n 1 L jt (36) There are only four parameters, β ≡ (ρ,η , αK,γ), to be estimated. This means that the 1 αL number of instrumental variables required does not increase with the number of products. Firm-level input choices can serve as valid IVs because they are not correlated with the unexpected shocks. In the implementation, we use Z = (1,E ,E ,L ,K /L ) as IVs. jt Mjt Ljt jt jt jt Equation (36) can only identify αK rather than α , α , and α separately. The full set αL L M K of (α ,α ,α ) can be identified with two constraints naturally implied by the model. The L M K first constraint is a normalization of distribution parameters in the CES production function: α +α +α = 1. The second constraint equalizes the ratio of geometric means of labor L M K expenditure (E ) and material expenditure (E ) to the ratio of distribution parameters in L M the CES production function. That is, αM = EM. This constraint results from taking the αL EL geometric mean of (13), which is implied by the first-order conditions of labor and material 2(N −1) moment equations that can be formed to identify (N −1) coefficients (i.e., η1−1,n=2,...,N). 33The identification of ρ relies on the condition E(eujnt) = 1, as assumed in Sec η t n io − n 1 2.1. Due to the log-additivity of ρ in the main estimating equation (36), the value of ρ does not affect the estimation of the rest of the parameters. Thus, the rest of the parameters can be estimated before ρ is estimated. The following describes how ρ is estimated. Although the composite error term u , defined in (34), does jt not have a zero mean (i.e., E(u ) ̸= 0), the composite error for single-product firms is the same as jt the product-level shock (i.e., u = u ). After the rest of the parameters are estimated, (36) can be jt j1t written for these single-product firms as eu ρ j1t = (cid:20) EMjt +ELj ηˆ t 1 ηˆ (cid:18) − 1 1 1 + R˜ α α ˆ j ˆ 1 K L t (cid:16)K Lj j t t (cid:17)γˆ(cid:19)(cid:21) . Taking the expectation for   both sides, we have ρ 1 =E   (cid:20) EMjt +ELj ηˆ t 1 ηˆ (cid:18) − 1 1 1 + R˜ α α ˆ j ˆ 1 K L t (cid:16)K Lj j t t (cid:17)γˆ(cid:19)(cid:21)   . Therefore, the estimate of ρ can be recovered as   ρˆ=1/E   (cid:20) EMjt +ELj ηˆ t 1 ηˆ (cid:18) − 1 1 1 + R˜ α α ˆ j ˆ 1 K L t (cid:16)K Lj j t t (cid:17)γˆ(cid:19)(cid:21)   . 30

quantities, (11) and (12), of all firms.34 As a result, β can be estimated as: (cid:34) (cid:35)′ (cid:34) (cid:35) 1 (cid:88) 1 (cid:88) ˆ β = argmin u Z W u Z , (37) β N jt jt N jt jt j,t j,t α E M M subject to: α +α +α = 1 and = , L M K α E L L where W is a weight matrix, N is the number of firm-time observations, and u is the jt composite error term (36). ˆ As a summary of the full estimating approach, the first step is to estimate θ from (27) ˆ 1−θ ηn via 2SLS. The second step is to estimate b = ηn−1, n = 2,...,N via 2SLS using the n 1−θ η1 η1−1 relationship imposed by the within-firm relative sales in (35). The third step is to estimate (ρˆ,ηˆ ,αˆ ,αˆ ,αˆ ,γˆ) using (37) via GMM. With these estimates, the demand elasticities can 1 L M K be recovered as ηˆ = 1+ θˆ(ηˆ1−1) . After that, we compute ξ ˜ and ω˜ via (30) n bn+(1−θˆ)(ηˆ1−1−ˆbnηˆ1) jnt jnt and (32), respectively. 5.3 Monte Carlo Exercise In this section, we conduct a Monte Carlo exercise to demonstrate the performance of our estimation method. In the Monte Carlo setting, the choice of product sets is exogenous and random. The productivity and quality levels of each product are not only serially correlated over time but also negatively correlated with each other in the same period. Across products, productivity and quality exhibit different degrees of persistence and dispersion. Consequently, the levels and variability of productivity and quality differ systematically across products, generating heterogeneous revenue shares within a firm, thereby mimicking patterns observed in actual data. Wage rates, material prices, and capital stock are simulated as serially correlated and exogenous AR(1) processes. These variables are correlated with contemporaneous input and output decisions because the firm observes their realized values before choosing inputs and outputs to maximize profit. Our Monte Carlo exercise consists of 200 replications of simulated datasets for J firms observed over T periods, based on a model with a set of true parameters for five products: 34As shown by Grieco et al. (2016), this constraint holds conditional on a normalization of the CES production function. We follow the same procedure to normalize the inputs using their corresponding industry-level geometric means (e.g., Klump and de La Grandville, 2000; Le´on-Ledesma et al., 2010). Nonetheless, to ease our notation, we directly denote the normalized input variables as (L ,M ,K ). As a jt jt jt result, the ratio of the geometric means of material and labor is M =1, which implies αM = EM, by taking L αL EL the geometric mean of (13) across firms. 31

Table 3: Monte Carlo: Estimates of Production and Demand Function Parameters Production Demand Parameter True Estimate Parameter True Estimate α 0.400 0.400 η 3.000 3.022 L 1 (0.005) (0.257) α 0.400 0.400 η 4.000 4.035 M 2 (0.005) (0.363) α 0.200 0.200 η 5.000 5.022 K 3 (0.010) (0.403) σ 2.000 2.000 η 6.000 6.016 4 (0.023) (0.398) ρ 1.100 1.100 η 7.000 7.001 5 (0.024) (0.368) θ 0.900 0.900 (0.004) Note: The parameter estimates are reported as the mean estimates from the Monte Carlo simulations. Standard errors in parentheses are the standard deviations of the estimates. (η ,η ,η ,η ,η ,α ,α ,α ,σ,ρ,θ). In each replication, we simulate productivity (ω˜ ) and 1 2 3 4 5 L M K jnt ˜ quality (ξ ) for each product n, firm j, and period t, as well as wage rates (P ), material jnt Ljt prices (P ), and capital stocks (K ) for each firm j and period t, using AR(1) processes Mjt jt with different persistence parameters and dispersion degrees of innovation shocks. Given these variables, and the production and demand specifications in Section 5, we use the firm’s static profit maximization problem to derive the optimal choices of labor and material inputs (L and M ), as well as the optimal output quantity (Q ) and price jt jt jnt (P ) for firm j and product n in each period t. The observed product price incorporates jnt an idiosyncratic shock: P ˜ = P eujnt, where u is a firm-product-time specific shock. jnt jnt jnt ˜ ˜ Consequently, the observed product revenue is given by R = P Q . jnt jnt jnt The parameter values used in the data-generating process are summarized in Online Appendix Table A4. The variables used in the estimation procedure and the same set of instru- ˜ ˜ mentalvariablesasdetailedinSection5.2are(Q ,...,Q ,R ,...,R ,K ,L ,E ,E ). j1t jnt j1t jnt jt jt Ljt Mjt The simulated data exhibit realistic distributional patterns. First, productivity and quality are negatively correlated (coefficient: –0.2). Second, as shown in Online Appendix Table A6, both the levels and heterogeneity of productivity and quality differ across products, reflecting technological and demand variation. Third, the mean within-firm revenue share varies across products, ranging from 6% to 57%, indicating differences in product importance within firms. Moreover, the dispersion of within-firm revenue shares differs substantially by 32

product, suggesting heterogeneity in the relative importance of each product across firms. Table 3 reports the mean estimates of the key parameters alongside their standard errors. In addition, the estimates for the parameters of (35) also closely match the true values, as shown in Table A5, demonstrating the effectiveness of the IVs proposed in Section 5.2. Overall, the results indicate that the estimation procedure successfully recovers the true parameters of the production and demand functions. 6 Estimation Results This section reports the estimation results, including the production and demand function parameter estimates by industry as well as firm-product level productivity and quality. Because our empirical analysis relies on estimated variables, we employ bootstrapping with 100 samples to compute all standard errors presented in the subsequent tables. Table 4: Production function estimates Parameter Footwear Printing Pharmaceutical α 0.199 0.228 0.218 L (0.014) (0.015) (0.025) α 0.763 0.676 0.574 M (0.039) (0.027) (0.068) α 0.037 0.096 0.208 K (0.049) (0.035) (0.089) σ 1.225 1.264 1.142 (0.516) (0.111) (0.179) ρ 1.054 1.129 1.037 (0.146) (0.123) (0.196) θ 0.950 0.779 0.720 (0.053) (0.066) (0.082) Note: Bootstrapped standard errors clustered at the firm level and stratified by industry and scope are shown in parentheses (100 repetitions). Table 4 presents the production parameters. α is significantly larger than α and α , M L K consistent with the intensive use of intermediate material input across all industries. α in the K pharmaceutical industry is the highest among the three industries, reflecting the importance of capital in this industry. Parameter σ, which is the elasticity of substitution across inputs, i.e., labor, material, and capital, is greater than one across all industries. This is different from those in the classical literature which does not control for heterogeneous material prices. But it is largely consistent with the estimates in Grieco et al. (2016, 2022), Harrigan et al. (2021), and Li and Zhang (2022) based on a similar method but using different datasets from 33

Colombia, France, and China, respectively. It is also close to the average estimate of the elasticity of substitution among Chinese industries by Berkowitz et al. (2017) using a different method. Furthermore, the returns to scale parameter ρ of the three industries is larger than one, but it is not significantly different from one, implying that production is close to constant returns to scale in these industries. Finally, the estimated values of θ range from 0.720 in the pharmaceutical industry to 0.950 in the footwear industry. Taking θ = 1 as the benchmark case where products are perfectly substitutable in production, these estimates suggest that products in the footwear industry (e.g., men’s vs. women’s shoes) are considerably more substitutable in production than those in the pharmaceutical industry (e.g., antiparasitics vs. hormones). Table 5 presents the estimated demand elasticity parameters for different products across the three industries. These estimates generally fall within the range reported in the existing literature (e.g., Roberts et al., 2018; Grieco et al., 2016; Dubois and Lasio, 2018). The estimated variation in demand elasticities across products implies meaningful heterogeneity in product-level markups. In the footwear industry, markups range from 1.218 to 1.400, while in the pharmaceutical industry they are significantly higher, ranging from 1.478 to 1.614. Because firms produce different sets of products in different years, we compute firm-year-level markups as weighted averages of product-level markups, using revenue shares as weights. Across the three industries, the average firm-year markup is 1.403, with a standard deviation of 0.102. This dispersion is smaller than the estimate reported by De Loecker and Warzynski (2012), which reflects a broader range of markup variation. Our measure of markup dispersion reflects only heterogeneity in product demand elasticities and composition across firms and years. Despite this narrower definition, the dispersion in firm-year markups is substantial. Theestimatedqualityandproductivityalsodemonstratesubstantialdispersionacrossfirms, even conditional on a given product. However, if the objective is to compare technological production efficiency, the productivity measure (e.g., TFPQ) is not directly comparable across or within firms, as varieties within the same product category differ in quality levels. In contrast, quality-adjusted output is directly comparable across firms and products, as pointed out by Melitz (2000). Thus, we follow the literature (e.g., Orr, 2022; Li et al., 2025) to construct a revenue-based productivity (TFPR) measure that takes both quality and productivity into account:35 1 ˜ TFPR = ω˜ + ξ . (38) jnt jnt jnt η −1 n As expected, TFPR reflects significant dispersion across firms even within a specific 35Note that quality enters TFPR as 1 ξ˜ . This is due to the demand specification in Section 5.1. ηn−1 jnt 34

Table 5: Demand function estimates Parameter Footwear Printing Pharmaceutical η 4.009 4.128 2.965 1 (1.620) (1.030) (1.166) η 3.497 4.262 2.927 2 (1.777) (0.898) (1.200) η 4.263 3.699 2.998 3 (1.780) (1.192) (1.070) η 5.593 3.890 3.047 4 (1.835) (1.352) (1.274) η 4.276 2.911 5 (0.881) (1.219) η 4.111 2.805 6 (0.931) (1.264) η 3.787 2.923 7 (1.184) (1.158) η 4.016 2.856 8 (1.164) (1.182) η 4.210 2.878 9 (0.909) (1.195) η 4.251 2.866 10 (0.886) (1.188) η 4.004 2.926 11 (1.056) (1.282) η 4.042 2.907 12 (1.070) (1.151) η 4.123 3.176 13 (0.968) (1.395) η 4.200 3.062 14 (0.914) (1.415) η 4.147 2.628 15 (0.952) (1.334) η 2.907 16 (1.138) Note: Elasticity parameters in each column within each industry are ordered to correspond with the column entries of products in Online Appendix Table A1, respectively. Bootstrapped standard errors clustered at the firm level and stratified by industry and scope are shown in parentheses (100 repetitions). 35

product category.36 The mean standard deviation within a product is 2.667 (calculated across all products in the three industries), which is similar in magnitude to that of revenue productivity documented by Grieco et al. (2022) in the Chinese paint industry. Regarding the components of TFPR, the standard deviation of ω˜ within a product has a mean of jnt 2.867, while the standard deviation of 1 ξ ˜ within a product has a mean of 1.474.37 ηn−1 njt Interestingly, our results also reveal that within-firm heterogeneity in TFPR is substantial. Among multi-product firms, the average standard deviation of TFPR across products within a firm is 0.337, which is approximately one-eighth of the standard deviation of TFPR across firms for a given product. This indicates that while across-firm heterogeneity is more prominent, within-firm TFPR dispersion is also economically significant. Overall, our estimation results reflect reasonable parameter estimates and productivity and quality measures at the firm-product level. In the following section, we turn to use these measures to explore the roles of productivity, quality, and within-firm resource reallocation in shaping firm performance. 7 Technological Spillovers & Within-firm Reallocation A key advantage of our estimation method is that we do not need to impose any structure on the dynamic evolution of productivity. When the objective is to study potentially rich interdependencies in productivity dynamics—such as spillovers in the context of multi-product firms—we can estimate these processes after recovering the productivity measure itself rather than jointly estimating the interdependent dynamics with the model parameters. We illustrate this advantage by examining various forms of technological spillovers. In Section 7.1, we consider a conceptually novel within-firm, across-product spillover, which is particularly relevant for multi-product firms, in addition to the traditionally studied within-product, across-firm spillover (e.g., Malikov and Zhao, 2023). In Section 7.2, we show that within-firm reallocation of resources serves as an important mechanism through which multi-product firms are influenced by these technological spillovers. 7.1 Technological Spillovers: Across-firm and Within-firm In this section, we assess spillovers both across firms and within firms. In our context of multi-product firms, an across-firm spillover for a given product is defined as the impact of technical efficiency of the same product produced by other firms on that product’s own 36The distributions of TFPR by product, as well as the distributions of its components, ω˜ and ξ˜ , are njt njt reported in Online Appendix Figures A2, A3 and A4, respectively. 37The standard deviation of ω˜ is slightly larger than that of TFPR because the two components of jnt TFPR, productivity, and quality, are negatively related, as will be clear in Section 7. 36

technical efficiency. A within-firm spillover is defined as the impact of technical efficiency of other products produced by the same firm on the product in question. Formally, we propose an evolution process for firm-product-level technical efficiency ω , jnt following (6), which allows for both across-firm and within-firm spillover components: ω = g ω +g ωf +g ωp +d +ϵ , (39) jnt 1 jnt−1 f jnt−1 p jnt−1 t jnt where d is a time fixed effect and ϵ is an i.i.d. innovation shock. ωf is the acrosst jnt jnt−1 firm, within-product average technical efficiency: ωf = 1 (cid:80) ω , where Nf jnt−1 Nf −1 i̸=j int−1 nt−1 nt−1 is the total number of firms producing product n in period t−1. Similarly, ωp is the jnt−1 across-product, within-firm average technical efficiency: ωp = 1 (cid:80) ω , where jnt−1 Np −1 m̸=n jmt−1 jt−1 Np is the total number of products produced by firm j in period t−1. Both ωf and jt−1 jnt−1 ωp vary at the firm-product-time level. Therefore, the term g ωf captures across-firm, jnt−1 f jnt−1 within-product spillovers in technical efficiency. Similarly, g ωp captures across-product, p jnt−1 within-firm spillovers—that is, the effect of changes in the average technical efficiency of a firm’s other products in period t−1 on the technical efficiency of product n for firm j in period t. Equation (39) can be interpreted as a multi-product extension of the single-product case in Malikov and Zhao (2023), in which we allow for the possibility of spillovers across products within firms. The lagged dependent variable captures persistence within product-firm pairs. In addition, modeling the evolution of technical efficiency rather than TFPQ makes it possible to mitigate differences across products due to different units of measurement. While technical efficiency ω is not directly estimated in our procedure described in jnt Section 5, it is shaped by two key estimated measures of heterogeneity: TFPQ (ω˜ ) and jnt ˜ quality (ξ ), as linked through the TFPQ-quality tradeoff specified in (5). We do not impose jnt such a trade-off in our estimation but after estimation the raw correlation between these two aspects of heterogeneity is negative, as shown in Online Appendix Figure A5. The emerging literature using firm-level data (e.g., Grieco and McDevitt, 2017; Atkin et al., 2019; Li et al., 2025) has documented a cost of quality: conditional on technical efficiency, producing higher-quality products raises marginal costs and therefore reduces measured TFPQ. This cost of quality can generate such a negative correlation between TFPQ and product appeal, a pattern consistent with findings from the broader literature (e.g., Orr, 2022; Forlani et al., 2023; Eslava et al., 2024). We exploit this relationship to characterize technical efficiency ω and estimate its evolution process in (39). jnt 37

Specifically, we adopt a linear version of the TFPQ-quality tradeoff (5):38 ˜ ω˜ = ω −γ ξ , (40) jnt jnt ξ jnt ˜ where γ ξ is interpreted as the cost (in terms of lowering productivity) of increasing quality, ξ jnt holding inputs fixed. γ is the elasticity of productivity with respect to the change in quality. ξ Replacing technical efficiency in (39) by that in (40) gives: ω˜ = g ω˜ −γ ξ ˜ +g γ ξ ˜ +g ω˜f +g γ ξ ˜f +g ω˜p +g γ ξ ˜p +d +ϵ , jnt 1 jnt−1 ξ jnt 1 ξ jnt−1 f jnt−1 f ξ jnt−1 p jnt−1 p ξ jnt−1 t jnt (41) where ω˜f and ξ ˜f are the across-firm, within-product average productivity and qualjnt−1 jnt−1 ity, respectively. Similarly, ω˜p and ξ ˜p are the across-product, within-firm average jnt−1 jnt−1 productivity and quality, respectively.39 Although all variables (except ϵ ) are already estimated from our structural model, the jnt ˜ innovation shock ϵ can be correlated with contemporaneous quality choice ξ . To address jnt jnt such an endogeneity problem, we estimate (41) via GMM using a set of instrumental variables that includes internal instruments in period t−2. According to the timing assumption, these variables are uncorrelated with the i.i.d innovation term ϵ . jnt The estimation results for various specifications of (41) are presented in Table 6. Column (1) reports estimates from a simplified specification that excludes both year fixed effects and spillover terms, while Column (2) adds year fixed effects. Column (3) introduces the across-firm spillover term, g ωf . Finally, Column (4) augments this specification by f jnt−1 including an additional term, g ωp , which captures within-firm technological spillovers. p jnt−1 We treat Column (4) as our main specification for capturing the evolution of firm-productlevel technical efficiency, allowing for a rich pattern of across- and within-firm technological spillovers and the trade-off between productivity and quality. As expected, technical efficiency is highly persistent, as indicated by the estimated value of g . Consistent with the literature, 1 there is a negative trade-off between productivity and quality at the firm-product level: a 1-percent increase in quality lowers productivity by 0.340 percent, holding all else constant. The magnitude is comparable to the estimated productivity-quality trade-off elasticity of 0.2 in the U.S. healthcare industry (Grieco and McDevitt, 2017) and 0.5 in the Chinese steel-making industry (Li et al., 2025). More importantly, the across-firm spillover of technical efficiency is economically significant. For a given product, a 1-percent increase in the average technical efficiency of this product 38Higher order terms of ξ˜ can be added to this relationship to allow for nonlinearity of the tradeoff. jnt 39The within-product and within-firm averages are divided, respectively, by the number of firms and products minus one, since each average excludes its own observation from the sum. 38

Table 6: Productivity, cost of quality, and spillovers Dep. var.: Productivity (1) (2) (3) (4) g 0.905*** 0.903*** 0.884*** 0.842*** 1 (0.035) (0.035) (0.031) (0.030) γ 0.129 0.131 0.337 0.340 ξ (0.201) (0.202) (0.287) (0.242) g 0.145*** 0.137*** f (0.030) (0.028) g 0.056*** p (0.002) Year FE no yes yes yes Observations 4806 4806 4806 4806 Note: The dependent variable is quantity-based productivity at the firm-product-year level. The coefficients are estimated via GMM. The instrument set includes twice-lagged productivity and quality in all columns. In columns (3) and (4), the instrument set also includes the simple average of twice-lagged productivity and quality of the same product produced by other firms. In column (4), the instrument set also includes the simple average of twice-lagged productivity and quality of the other products produced by the same firm. Bootstrapped standard errors clustered at the firm level and stratified by industry and scope are shown in parentheses (100 repetitions). *** p < 0.01, ** p < 0.05. produced by other firms raises the technical efficiency of this product by 0.137 percent. This magnitude is similar to that documented by Malikov and Zhao (2023), who report an acrossfirm spillover elasticity of 0.33 for the Chinese electric machinery manufacturing industry. In addition, our results reveal a within-firm spillover, with an elasticity approximately 40% of the magnitude of the across-firm spillover. This comparison suggests that while spillover effects are larger across firms, within-firm spillovers are also economically meaningful. Our estimates of within-firm spillovers are closely related to economies of scope, an idea emphasized by Koike-Mori and Martner (2024), Argente et al. (2025), Ding (2025), and Khmelnitskaya et al. (2025). The importance of within-firm spillovers has also been noted for multinational firms when studying innovation and the role of intangible assets (see, e.g., Bilir and Morales, 2020; Merlevede and Theodorakopoulos, 2023). While intangible assets (ideas, knowledge) may diffuse more readily within firm boundaries than across them, they can also be rival inputs internally due to limits on managerial attention and information-processing capacity.40 Thus, whether within-firm or across-firm spillovers dominate is an empirical question that depends on the context. Our contribution is to establish the quantitative 40See Crouzet et al. (2022) for a discussion of potential rivalry in intangible capital within the firm. 39

importance of within-firm spillovers—relative to across-firm spillovers—in a context with multi-product firms and substantial product-level heterogeneity in production efficiency. As we demonstrate in Section 7.2, product-specific shocks are amplified into firm-level and industry-level outcomes through internal resource reallocation, operating via both across-firm and within-firm spillovers. 7.2 Within-firm Reallocation in Response to Product-specific Shocks What are the implications of an exogenous, product-specific shock for multi-product firms? Spillovers—both across and within firms—imply that the effects can be complex. Across-firm spillovers reflect the additional impact of a product-specific productivity improvement on all other firms that produce the same product, while within-firm spillovers imply that all other products within the same firm are also directly affected. These direct spillover effects on firmlevel productivity are further amplified by within-firm resource reallocation: a productivity improvement in any product can trigger reallocation of resources across all products within the firm. In this section, we quantify the importance of across-firm and within-firm spillovers in terms of their impacts on social welfare and firm-level productivity via counterfactual analysis, and we highlight within-firm resource reallocation, which is not considered in the single-product firm literature, as a key mechanism through which these effects operate. As a counterfactual exercise, we consider a 1 percentage point exogenous improvement in the technical efficiency of a representative product—denoted without loss of generality as product 1, the reference product produced by the most firms in an industry—in period t for all firms that produce this product.41 Formally, we set ω′ = ω + 0.01 for j1t−1 j1t−1 each firm that produces product 1, while holding the technical efficiency of other products unchanged: ω′ = ω for n ̸= 1. According to the dynamics of technical efficiency in jnt−1 jnt−1 (39), this improvement in period t−1 directly affects the technical efficiency of product 1 in period t through the persistence term, g ω . The across-firm direct spillover effect on 1 j1t−1 all firms that produce product 1 is captured through the term g ωf , while the withinf j1t−1 firm spillover effect on another product n ̸= 1 of firm j operates through g ωp . As a p jnt−1 result, this 1-percent improvement generates differential direct effects on products in period t: ∆ω = g ×0.01+g ×0.01 for the reference product n = 1, and ∆ω = g × 0.01 for j1t 1 f jnt p Np −1 jt−1 other products n ̸= 1, where Np is the number of products produced by firm j in period jt−1 t−1. 41We focus on the short-term effects, holding all dynamic decisions (i.e., product quality, scope, and investment) described in Online Appendix C fixed. The overall long-term impacts of spillovers would likely be even larger, as firms adjust their dynamic decisions in response to spillovers. However, evaluating these long-term effects would require estimating a fully dynamic model, which we leave for future research. 40

These disproportionate impacts across products within a firm not only directly improve the productivity of individual products but also indirectly affect firm-level productivity, a traditional measure of firm performance, through within-firm resource reallocation towards more productive products in multi-product firms: firms allocate a larger share of resources to products whose productivity has improved more. Both the direct and indirect effects contribute to the improvement in firm-level TFPR. To assess the relative importance of these contributions, we aggregate firm-level TFPR from the underlying firm-product-level measures and decompose the overall effect into a direct impact and an indirect impact operating through within-firm reallocation. Inchoosing anaggregation method, we followthe spiritof the standardproductionfunction estimation literature on multi-product firms, where productivity is typically allowed to vary only at the firm level, TFPR , rather than at the firm–product level. This benchmark jt treats firms as multi-product producers but abstracts from within-firm productivity heterogeneity, so reallocation across products does not affect measured firm productivity. Our firm–product–level productivity measure TFPR is closely related to the firm-level concept, jnt but by allowing for within-firm heterogeneity it enables us to study how resource reallocation across products shapes firm-level performance, which is a mechanism that is absent in singleproduct settings or in analyses that rely solely on firm-level productivity. Applying such a concept of firm-level productivity to our model setup in Section 5, Online Appendix E shows that, this firm-level TFPR and our measure of firm-product level TFPR are related as:42 jt jnt  −1 θ eTFPRjt = (cid:88)(cid:0) s e−TFPRjnt (cid:1)θ  , (42) jnt   Λjt 42This relationship is derived under the assumption that the firm produces multiple products, each with its own elasticity parameter in the CES demand function (28), while revenue productivity varies only at the firm level, not by product, as shown in Online Appendix E. A more conventional measure of firm-level TFPR follows the tradition of estimation methods that treat each firm as producing a single (aggregated) product. In that case, with a CES demand function for the aggregated product and elasticity parameter η¯, firm-level TFPR and our firm–product-level TFPR are related by eTFPRjt = (cid:110) (cid:80) Λjt (cid:0) s jnt e−TFPRjnt (cid:1)θ (cid:111)− θ 1 , where ηn R˜ηn−1 s = jnt isaweight. Relativetotherelationshipin(42),theonlydifferenceliesinhowtheshare jnt (cid:104)(cid:80) Λjt R˜ jnt (cid:105) η¯− η¯ 1 s isconstructed. Werefrainfromusingthisaggregationbecausetheelasticityparameterη¯fortheaggregated jnt firm-level product is not a primitive object in our framework. Finally, a more straightforward measure of firm-level TFPR aggregates the firm-product-level measures using sales shares as weights: TFPR = jt (cid:80) s TFPR , where s = R˜ jnt is the within-firm sales share of product n. Accordingly, the n∈Λjt jnt jnt jnt (cid:80) Λjt R˜ jnt within-firm decomposition can be performed in the spirit of the within-industry, across-firm decomposition proposed by Olley and Pakes (1996). Our key result regarding the relative contributions of the direct and indirect impacts remains qualitatively similar under this alternative aggregation. 41

ηn R˜ηn−1 where s = jnt is a weight based on the revenues of different products within jnt (cid:40) (cid:20) ηn (cid:21)θ(cid:41) θ 1 (cid:80) R˜ηn−1 Λjt jnt the firm. Combining the definition of firm-product level TFPR in (38) and the TFPQ-quality trade-off in (40), TFPR can be expressed in terms of technical efficiency and quality: jnt TFPR = ω +( 1 −γ )ξ ˜ .43 The counterfactual exogenous shock affects TFPR jnt jnt ηn−1 ξ jnt jnt ˜ directly through ω , while ξ is held fixed. jnt jnt Given the disproportionate changes in technical efficiency across products (∆ω ), each jnt firm re-optimizes its profit by adjusting its input and product-level outputs. Let R ˜′ jnt denote the resulting product-level revenue in the counterfactual scenario. The corresponding counterfactual firm-level TFPR, computed using (42), is denoted by TFPR′ . The overall jt effect of a 1-percent improvement in the technical efficiency of the reference product is then measured by (TFPR′ −TFPR ). jt jt Table 7: Effects of 1-percent exogenous increase in technical efficiency of the reference product Spillover type No Across-firm Within-firm Both Total welfare, million Pesos 1.543 1.799 1.626 1.881 (0.327) (0.347) (0.327) (0.347) Firm-level TFPR, percentage 0.315 0.368 0.320 0.373 (0.154) (0.158) (0.155) (0.159) – Direct impact 0.126 0.146 0.133 0.154 (0.028) (0.030) (0.028) (0.030) – Within-firm reallocation 0.190 0.222 0.187 0.219 (0.160) (0.163) (0.160) (0.163) Note: The total welfare is measured as the sum of consumer surplus and producer surplus. A detailed decomposition is reported in Online Appendix Table A7. The TFPR improvement is measured in percentage point and calculated as the weighted average of the improvements in TFPR at the firm-year level with firms’ total sales in the baseline scenario as weights. Bootstrapped standard errors, reported in parentheses, are computed based on 100 repetitions. To isolate the role of within-firm resource reallocation, we compute a firm-level TFPR using the updated firm-product-level TFPRs but holding the within-firm revenue shares s jnt fixed at their original values. Denote this measure as TFPR∗ . The difference (TFPR∗ − jt jt TFPR ) reflects the direct impact of the technical efficiency improvement, while the difference jt (TFPR′ −TFPR∗ ) captures the indirect impact operating through within-firm reallocation. jt jt 43Intuitively, this expression implies that while product quality promote TFPR, a sizable portion of its impact is offset by its cost, as documented in Li et al. (2025). 42

We conduct this decomposition for each firm and aggregate the results to the industry level using firm sales as weights. We implement this decomposition for four distinct cases: (a) no spillover (gf = 0,gp = 0); (b) allowing for only across-firm spillover (gf = 0.137,gp = 0); (c) allowing for only within-firm spillover (gf = 0,gp = 0.056); and (d) allowing for both spillovers (gf = 0.137,gp = 0.056). In additiontothedecompositionofTFPRimprovement, wealsoevaluatethewelfareimplications by computing the change in total social welfare. Total welfare is defined as the sum of firm profits and consumer surplus. Given the demand functions in (28), consumer surplus is computed as (cid:80) R˜ jnt. j,n ηn−1 The results are presented in Table 7. A 1-percent increase in the technical efficiency of the reference product leads to an improvement in total welfare of 1.543 million Pesos in the absence of any technological spillover. When only across-firm spillover presents, the welfare gain increases to 1.799 million Pesos—approximately 16.6% higher than in the no-spillover case. In comparison, allowing only within-firm spillovers results in a 5.4% larger welfare improvement relative to the no-spillover baseline. When both spillover channels are active, the total welfare gain is approximately the sum of the net effects from the individual spillover scenarios. This comparison implies that both across-firm and within-firm technological spillovers are economically significant, though the former plays a more dominant role. Figure 1: Contribution of within-firm resource reallocation to TFPR growth 0.5 0.45 0.4 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 1 2 3 4 5 6 7 8 9 10 Within-firm rank of the impacted product tniop egatnecrep ,noitacollaer mrif-nihtiW Notes: Allfirmsproducingmorethan10productsareclusteredinthe“10”group. A similar pattern emerges in the impact on firm-level TFPR. Notably, within-firm resource reallocation accounts for approximately 60% of the overall TFPR improvement across all four scenarios.44 This highlights the importance of within-firm reallocation in shaping how 44This magnitude is based on the ratio of the indirect effect (Row 4) to the total effect (Row 2) across all columns of Table 7. 43

multi-product firms benefit from productivity shocks. While a large literature emphasizes across-firm reallocation as a driver of aggregate productivity growth—showing that resources tend to flow toward more productive firms (e.g., Baily et al., 1992; Bartelsman and Doms, 2000; Baily et al., 2001; Aw et al., 2001; Foster et al., 2006, 2008; Syverson, 2011; Collard- Wexler and De Loecker, 2015)—our firm-product-level analysis demonstrates that within-firm resource reallocation makes a sizable contribution to the firm-level productivity growth. Interestingly, within-firm resource reallocation plays an even greater role when a firm’s topselling products experience a productivity shock. This pattern is illustrated in Figure 1, which shows that the contribution of within-firm reallocation to firm-level TFPR improvement declines steadily with the within-firm rank of the impacted product.45 Specifically, a 1 percentage point increase in the technical efficiency of a firm’s top product (ranked 1st) results in a 0.4 percentage point improvement in firm-level TFPR attributable to within-firm reallocation. Bycontrast,whenthesameimprovementoccursinthefirm’sleast-sellingproduct (ranked 10th), the contribution from reallocation falls to less than 0.05 percentage points. This result has important implications for the endogenous productivity dynamics of multi-product firms. In settings where firms make dynamic decisions about productivity investment, the relative sales performance of products within the firm may be a key determinant of where research efforts are directed—echoing insights from Kim (2024). 8 Conclusion Multi-product firms account for a significant share of our economy. Yet, the traditional firm-level analysis in the literature masks the within-firm heterogeneity. In this paper, we propose a novel method to estimate firm-product-level productivity and quality along with demand and transformation function parameters. Compared with the existing methods in the literature, our method does not impose assumptions on how inputs are allocated across different products within firms, nor does it necessarily restrict how productivity evolves over time. This flexibility allows researchers to explore complex productivity dynamics after estimation. Importantly, the method can be easily scaled up to estimate production functions with a large number of products, without relying on the availability of productivity proxies. Finally, the method accounts for heterogeneous intermediate input prices that are usually unobservable to researchers and lead to biased estimation results when ignored. We apply our method to three major industries in the Mexican manufacturing sector. Our findings reveal substantial heterogeneity in both quality and productivity—even when conditioning on a given product. Moreover, conditional on input usage, firms face a trade-off between quality and productivity. After accounting for this trade-off, we find that the 45A higher rank indicates a product further from the firm’s top-selling product. 44

underlying technical efficiency exhibits both across-firm and within-firm spillovers. This implies that an exogenous improvement in the technical efficiency of a single product not only affects the efficiency of other firms producing the same product, but also influences the efficiency of other products within the same firm. Notably, a large share of the resulting impact on firm-level TFPR is driven by within-firm resource reallocation. This highlights the quantitative importance of within-firm reallocation as a key mechanism through which multi-product firms enhance their performance. Consequently, this channel holds important implications for understanding aggregate productivity growth. References Ackerberg, D. A., K. Caves, and G. Frazer (2015). Identification properties of recent production function estimators. Econometrica 83(6), 2411–2451. Argente, D., S. Moreira, E. Oberfield, and V. Venkateswaran (2025). Scalable expertise: How standardization drives scale and scope. Technical report, National Bureau of Economic Research. Atalay, E. (2014, June). Materials prices and productivity. Journal of the European Economic Association 12(3), 575–611. Atkin, D., A. K. Khandelwal, and A. Osman (2019). Measuring Productivity: Lessons from Tailored Surveys and Productivity Benchmarking. AEA Papers and Proceedings 109, 444–449. Aw,B.Y.,X.Chen,andM.J.Roberts(2001). Firm-levelevidenceonproductivitydifferentials and turnover in taiwanese manufacturing. Journal of Development Economics 66(1), 51–86. Aw, B. Y., M. Roberts, and D. Y. Xu (2011). R&d investment, exporting, and productivity dynamics. American Economic Review 101, 1312–1344. Baily, M. N., E. J. Bartelsman, and J. Haltiwanger (2001). Labor productivity: structural change and cyclical dynamics. Review of Economics and Statistics 83(3), 420–433. Baily, M. N., C. Hulten, D. Campbell, et al. (1992). Productivity dynamics in manufacturing plants. Brookings Papers on Economic Activity 23(1992 Microeconomics), 187–267. Barrows, G., H. Ollivier, and A. Reshef (2024). Production function estimation with multidestination firms. CESifo Working Papers. Bartelsman,E.J.andM.Doms(2000). Understandingproductivity: Lessonsfromlongitudinal microdata. Journal of Economic Literature 38(3), 569–594. Berkowitz, D., H. Ma, and S. Nishioka (2017). Recasting the Iron Rice Bowl: The Evolution of China’s State Owned Enterprises. The Review of Economics and Statistics 99(4), 735–747. Berry, S. (1994, Summer). Estimating discrete-choice models of product differentiation. RAND Journal of Economics 25(2), 242–262. Berry, S., J. Levinsohn, and A. Pakes (1995). Automobile prices in market equilibrium. Econometrica 63(4), 841–890. Bilir, L. K. and E. Morales (2020). Innovation in the global firm. Journal of Political Economy 128(4), 1566–1625. Bond, S., A. Hashemi, G. Kaplan, and P. Zoch (2021). Some unpleasant markup arithmetic: Production function elasticities and their estimation from production data. Journal of 45

Monetary Economics 121, 1–14. Cairncross, J., P. Morrow, S. Orr, and S. Rachapalli (2025). Identifying firm vs. product markups using productiondata: Micro estimates and aggregate implications. mimeo. Caselli, M., A. Chatterjee, and A. Woodland (2017). Multi-product Exporters, Variable Markups and Exchange Rate Fluctuations. Canadian Journal of Economics 50(4), 1130– 1160. Caselli, M., L. Nesta, and S. Schiavo (2021). Imports and labour market imperfections: Firm-level evidence from France. European Economic Review 131, 103632. Chen, Y., M. Igami, M. Sawada, and M. Xiao (2021). Privatization and productivity in china. The RAND Journal of Economics 52(4), 884–916. Chen, Z. and M. Liao (2022). Production Function Estimation for Multi-Product Firms. Working Paper 3968432, SSRN. Collard-Wexler, A. and J. De Loecker (2015). Reallocation and technology: Evidence from the us steel industry. American Economic Review 105(1), 131–71. Crouzet, N., J. C. Eberly, A. L. Eisfeldt, and D. Papanikolaou (2022). The economics of intangible capital. Journal of Economic Perspectives 36(3), 29–52. Das, S., M. J. Roberts, and J. R. Tybout (2007). Market entry costs, producer heterogeneity and export dynamics. Econometrica 75(3), 837–873. De Loecker, J. (2011). Product differentiation, multi-product firms and estimating the impact of trade liberalization on productivity. Econometrica 79, No. 5, 1407–1451. De Loecker, J., P. K. Goldberg, A. K. Khandelwal, and N. Pavcnik (2016). Prices, markups, and trade reform. Econometrica 84(2), 445–510. De Loecker, J. and F. Warzynski (2012). Markups and firm-level export status. American Economic Review 102(6), 2437–71. Demirer, M. (2022). Production Function Estimation with Factor-Augmenting Technology: An Application to Markups. mimeo. Dhyne, E., A. Petrin, V. Smeets, and F. Warzynski (2022). Theory for extending singleproduct production function estimation to multi-product settings. Nber working paper, National Bureau of Economic Research. Diewert, E., K. J. Fox, and L. Ivancic (2009). Scanner Data, Time Aggregation and the Construction of Price Indexes. UBC Discussion Paper 09-09, Department of Economics, University of British Columbia. Ding, X. (2025). Industry linkages from joint production. Working Paper. Doraszelski, U. and J. Jaumandreu (2013). R&D and Productivity: Estimating Endogenous Productivity. Review of Economic Studies 80, 1338 – 1383. Doraszelski, U. and J. Jaumandreu (2016). Measuring the bias of technological change. Journal of Political Economy (forthcoming). Dubois, P.andL.Lasio(2018). Identifyingindustrymarginswithpriceconstraints: Structural estimation on pharmaceuticals. American Economic Review 108(12), 3685–3724. Eslava, M., J. Haltiwanger, and N. Urdaneta (2024). The Size and Life-Cycle Growth of Plants: The Role of Productivity, Demand, and Wedges. The Review of Economic Studies 91(1), 259–300. Forlani, E., R. Martin, G. Mion, and M. Muuˆls (2023). Unraveling Firms: Demand, Productivity and Markups Heterogeneity. The Economic Journal 133(654), 2251–2302. Foster, L., J. Haltiwanger, and C. J. Krizan (2006). Market selection, reallocation, and 46

restructuring in the us retail trade sector in the 1990s. The Review of Economics and Statistics 88(4), 748–758. Foster, L., J. Haltiwanger, and C. Syverson (2008). Reallocation, firm turnover, and efficiency: Selection on productivity or profitability? American Economic Review 98(1), 394–425. Gandhi, A., S. Navarro, and D. A. Rivers (2020). On the identification of gross output production functions. Journal of Political Economy 128(8), 2973–3016. Grieco, P., S. Li, and H. Zhang (2016). Production function estimation with unobserved input price dispersion. International Economic Review 57(2), 665–690. Grieco, P., S. Li, and H. Zhang (2022). Input Prices, Productivity and Trade Dynamics: Long-run Effects of Liberalization on Chinese Paint Manufacturers. The RAND Journal of Economics 53(3), 516–560. Grieco, P. L. and R. C. McDevitt (2017). Productivity and quality in health care: Evidence from the dialysis industry. The Review of Economic Studies 84(3), 1071–1105. Harrigan, J., A. Reshef, and F. Toubal (2021). Techies, Trade, and Skill-Biased Productivity. CEPR Discussion Papers 15815, C.E.P.R. Discussion Papers. Hottman, C. J., S. J. Redding, and D. E. Weinstein (2016). Quantifying the sources of firm heterogeneity. The Quarterly Journal of Economics 131(3), 1291–1364. Khandelwal, A. K. (2010). The long and short (of) quality ladders. Review of Economic Studies 77, 1450–1476. Khmelnitskaya, E., G. Marshall, and S. Orr (2025). Identifying Scale and Scope Economies using Product Market Data. RAND Journal of Economics forthcoming. Kim, C. (2024). From Research to Development: How Globalization Shapes Corporate Innovation. mimeo. Kirov, I. and J. Traina (2023). Labor Market Power and Technological Change in US Manufacturing. mimeo. Klump, R. and O. de La Grandville (2000). Economic growth and the elasticity of subsitituion: Two theorems and some suggestions. American Economic Review 90(1), 282–291. Koh, P. and D. Raval (2025). Economies of scope from shared inputs. Working Paper. Koike-Mori,Y.andA.Martner(2024). Aggregatingdistortionsinnetworkswithmulti-product firms. Available at SSRN 5020563. Kumar,P.andH.Zhang(2019). Productivityorunexpecteddemandshocks: Whatdetermines firms’investment and exit decisions? International Economic Review 60(1), 303–327. Le´on-Ledesma, M. A., P. McAdam, and A. Willman (2010). Identifying the elasticity of substitition with biased technical change. American Economic Review 100(4), 1330–1357. Levinsohn, J. and A. Petrin (2003). Estimating production functions using inputs to control for unobservables. The Review of Economic Studies 70(2), 317–341. Li, J., S. Li, and H. Zhang (2025). Output Quality, Productivity, and Demand Advantage: Evidence from the Chinese Steel Industry. Working paper, University of New South Wales. Li, S. (2018). A structural model of productivity, uncertain demand, and export dynamics. Journal of International Economics 115, 1–15. Li, S. and H. Zhang (2022). Does External Monitoring from the Government Improve the Performance of State-Owned Enterprises? The Economic Journal 132(642), 675–708. Malikov, E. and S. Zhao (2023). On the estimation of cross-firm productivity spillovers with an application to FDI. Review of Economics and Statistics 105(5), 1207–1223. Mayer, T., M. J. Melitz, and G. I. Ottaviano (2021). Product mix and firm productivity 47

responses to trade competition. Review of Economics and Statistics 103(5), 874–891. Melitz, M. J. (2000). Estimating firm-level productivity in differentiated product industries. unpublished paper. Merlevede, B. and A. Theodorakopoulos (2023). Intangibles within firm boundaries. Working Paper. Morlacco, M. (2020). Market Power in Input Markets: Theory and Evidence from French Manufacturing. mimeo. Olley, G. S. and A. Pakes (1996). The dynamics of productivity in the telecommunications equipment industry. Econometrica 64(6), 1263–1297. Ornaghi, C. (2006). Assessing the effects of measurement errors on the estimation of production functions. Journal of Applied Econometrics 21(6), 879–891. Orr, S. (2022). Within-firm productivity dispersion: Estimates and implications. Journal of Political Economy 130(11), 2771–2828. Powell, A. A. and F. Gruen (1968). The constant elasticity of transformation production frontier and linear supply system. International economic review 9(3), 315–328. Pozzi, A. and F. Schivardi (2016). Demand or productivity: What determines firm growth? RAND Journal of Economics 47(3), 608–630. Raval, D. (2023). Testing the Production Approach to Markup Estimation. The Review of Economic Studies 90(5), 2592–2611. Raval, D. R. (2019). The micro elasticity of substitution and non-neutral technology. The RAND Journal of Economics 50(1), 147–167. Roberts, M., D. Y. Xu, X. Fan, and S. Zhang (2018). The Role of Firm Factors in Demand, Cost, and Export Market Selection for Chinese Footwear Producers. Review of Economic Studies 85(4), 2429–2461. Rubens, M., Y. Wu, and M. Xu (2024). Exploiting or augmenting labor. Technical report, Working Paper. Syverson, C. (2011). What determines productivity? Journal of Economic Literature 49(2), 326–65. Valmari, N. (2023). Estimating production functions of multiproduct firms. Review of Economic Studies 130(11), 3315–3342. Zhang, H. (2019). Non-neutral technology, firm heterogeneity, and labor demand. Journal of Development Economics 140, 145–168. 48

Online Appendix A Input Aggregator: Translog Functional Form While Section 5 adopts a CES input aggregator for the empirical implementation, the method is not restricted to that functional form. In this appendix, we outline an estimation strategy when the input aggregator instead takes a translog functional form. We maintain the setup of the demand system and output aggregator as (1) and (4), respectively. As a result, the parameters associated with the demand model and output aggregator are estimated in the same way as described in the paper. In particular, denote the estimated markup from the demand model as µˆ and denote the estimated parameter in njt ˆ the output aggregator as θ. In what follows, we focus on estimating the parameters specific to the translog input aggregator. The input aggregator takes a full translog functional form: (cid:26) F(L ,M ,K ) = exp α lnL +α lnM +α lnK jt jt jt l jt m jt k jt (cid:27) +α (lnK )(lnL )+α (lnK )(lnM )+α (lnL )(lnM ) kl jt jt km jt jt lm jt jt 1 1 1 + α (lnL )2 + α (lnM )2 + α (lnK )2, (A.1) ll jt mm jt kk jt 2 2 2 where the input variables, (L ,M ,K ), are normalized by their geometric means (e.g., jt jt jt 1 (cid:80) lnL = 0) respectively. Such a normalization is analogous to the normalization N j,t jt conducted for the specification of the CES input aggregator, as described in Footnote 34. Applying the methodology described in Section 3.2, we obtain a mapping from observable data to the unobservable variables:46 ξ = P−1(P ,Q ), (A.2) jnt jnt t t ELjt(α +α lnK +α lnL )−(α +α lnL +α lnK ) lnM = EMjt m km jt ml jt l ll jt kl jt , (A.3) jt (α −α ELjt) ml mmEMjt and µ P eθω˜jnt = jnt Ljt Qθ−1[F(L ,M ,K )]1−θ, (A.4) P F (L ,M ,K ) jnt jt jt jt jnt L jt jt jt (cid:124) (cid:123)(cid:122) (cid:125) λjt 46We assume that α α m m m l ̸= E E M Lj j t t , so the unique solution of M jt exists. 49

where µ in (A.4) is the markup specified in (18), M in (A.4) is the function presented jnt jt as (A.3), and F (L ,M ,K ) = ∂F(Ljt,Mjt,Kjt). The above equations explicitly represent the L jt jt jt ∂Ljt mappings of quality (8), material quantity (14), and productivity (19) in the general model. Consequently, the expression of the main estimating equation, (24), in this setup, is:     (cid:88) R˜ jnt E Ljt +E Mjt ln µ jnt  = ln ∂Fjt Ljt + ∂Fjt Mjt −u jt n∈Λjt ∂Ljt Fjt ∂Mjt Fjt = −lnβ +ln[E −β E ]−ln[1+β lnK +β lnL ]−u , (A.5) 0 Mjt 1 Ljt 2 jt 3 jt jt where β = α − α β , β = αmm, β = (α − α β )/β , β = (α − α β )/β , and 0 m l 1 1 α 2 km kl 1 0 3 lm ll 1 0 (cid:26) (cid:20) ml (cid:21)(cid:27) u = ln (cid:80) R˜ jnt/µjnt e−ujnt is a firm-level composite error term. jt n∈Λjt (cid:80) n∈Λjt R˜ jnt/µjnt As with the CES input aggregator in our empirical implementation, (A.5) alone does not identify all parameters of the translog function as the input aggregator. This is because some of the parameters are cancelled out when substituting the unobserved productivity and material input into the translog function using the mapping. Hence, additional conditions are required to identify all the translog parameters. Because the translog function is more flexible than the CES function, we explore both cross-sectional and time-series assumptions. In particular, the first time-series assumption regards material prices. For demonstration purposes, we assume that material prices evolve exogenously according to an AR(1) process: lnP = h +h lnP +ϵ , (A.6) Mjt 0 1 Mjt−1 Mjt where ϵ is an i.i.d. shock. Mjt The second time-series assumption regards technical efficiency. We only need to utilize the evolution of one product (e.g., reference product 1) within each firm. For demonstration purposes, we assume that the technical efficiency evolution of this reference product in (6) is independent of the evolution of other products in the same firm (i.e., abstracts away from spillovers): ω = g +ω +ϵ . (A.7) j1t 0 j1t−1 j1t The full estimation procedure is described by the following three steps. Step 1: Estimate the revenue relationship (A.5). We first estimate (A.5) via GMM with instrumental variables specified when estimating its general version (26) in Section 3.2. This provides an estimate of β ˆ , β ˆ , and β ˆ .47 Another 1 2 3 important output of the estimation is the fitted value of the right-hand side of (A.5), which 47We do not interpret the constant estimated from (A.5) as β because the composite error term u does 0 jt not have a zero mean (i.e., E(u )̸=0). jt 50

ˆ is denoted as Ψ : jt E +E E E ˆ Ljt Mjt Mjt Ljt Ψ ≡ = = , (A.8) jt ∂Fjt Ljt + ∂Fjt Mjt ∂Fjt Mjt ∂Fjt Ljt ∂Ljt Fjt ∂Mjt Fjt ∂Mjt Fjt ∂Ljt Fjt where the last two equalities come from the ratio of first-order conditions (13). Thus, the elasticity of function F with respect to labor can be computed as υˆ : Ljt ∂F L E jt jt Ljt = ≡ υˆ . (A.9) ∂L F Ψ ˆ Ljt jt jt jt Given the translog functional form (A.1), this elasticity can be written as: ∂F L jt jt = α +α lnL +α lnM +α lnK = υˆ . (A.10) l ll jt lm jt kl jt Ljt ∂L F jt jt Similarly, the elasticity of function F with respect to material can be computed as υˆ : Mjt ∂F M E jt jt Mjt = ≡ υˆ , (A.11) ∂M F Ψ ˆ Mjt jt jt jt and this elasticity can be written as: ∂F M jt jt = α +α lnL +α lnM +α lnK = υˆ . (A.12) m lm jt mm jt km jt Mjt ∂M F jt jt Becausetheinputvariablesarenormalizedbytheirgeometricmeans(e.g., 1 (cid:80) lnL = 0) N j,t jt respectively, taking the average of (A.10) and (A.12) we obtain: 1 (cid:88) 1 (cid:88) αˆ = υˆ , αˆ = υˆ , (A.13) l N Ljt m N Mjt j,t j,t where N is the total number of observations. These are the across-sectional restrictions that are analogous to the restriction imposed for the specification of the CES input aggregator in Section 5.2. ˆ ˆ ˆ With these estimates, we can compute β as β = αˆ −αˆ β . 0 0 m l 1 Step 2: Estimate the evolution process of material prices (A.6). Using (A.12), we can express lnM as: jt 1 lnM = (υˆ −αˆ −α lnL −α lnK ). (A.14) jt Mjt m lm jt km jt α mm This is an equivalent expression of (A.3). 51

As a result, lnP can be written as: Mjt (cid:18) (cid:19) 1 αˆ 1 α m km lnP = lnE + lnL + − υˆ + lnK . (A.15) Mjt jt β ˆ jt α α Mjt α jt 1 mm mm mm Therefore, we can estimate (A.6) via GMM using moment conditions E(ϵ Z ) = 0, (A.16) Mjt Mjt where ϵ = lnP − h − h lnP and Z is the instrument variables Z = Mjt Mjt 0 1 Mjt−1 Mjt Mjt (1,lnE ,lnL ,υˆ ,lnK ). Z is uncorrelated with ϵ because ϵ is not in jt−1 jt−1 Mjt−1 jt−1 Mjt Mjt Mjt ˆ ˆ the information set of period t. The estimated parameters are (h ,h ,αˆ ,αˆ ). 0 1 km mm With these estimates, we can recover the following parameters using the estimate in step 1 as: αˆ = αˆmm, αˆ = αˆ km −βˆ 0βˆ 2, and αˆ = αˆ ml −βˆ 0βˆ 3. ml βˆ 1 kl βˆ 1 ll βˆ 1 Step 3: Estimate the technical efficiency evolution process of the reference product (A.7). We re-write the mapping (A.4) for the reference product as Φ ˆ = eω˜1jtF(L ,M ,K ), (A.17) 1jt jt jt jt (cid:104) (cid:105)1 where Φ ˆ ≡ µˆ1jt ELjt θˆ Q , which can be directly computed using the estimates from the 1jt R1jt υˆLjt 1jt previous steps. Substitute the unobserved M in (A.17) by (A.14), and reorganize the terms to obtain: jt lnF(L ,M ,K ) = lnF ˆ +γ lnK + 1γ (lnK )2, (A.18) jt jt jt jt k jt 2 kk jt where γ = α − αˆmαˆ km, γ = α − αˆ2 km, and lnF ˆ = γˆ + γˆ lnL + 1γˆ (lnL )2 + k k αˆmm kk kk αˆmm jt 0 l jt 2 ll jt γˆ (lnL )(lnK ) + 1γˆ (lnυˆ )2 with coefficients γˆ = − αˆ2 m , γˆ = αˆ − αˆmαˆ lm, γˆ = lk jt jt 2 vv Mjt 0 2αˆmm l l αˆmm ll αˆ − αˆ2 lm , γˆ = αˆ − αˆ lm αˆ km, and γˆ = 1 . ll αˆmm lk kl αˆmm vv αˆmm Note that the only parameters unknown are γ and γ in the right-hand side of (A.18) k kk ˆ and lnF is directly computed from the data and the estimated parameters in the previous jt steps. As a result, (A.17) can be used to solve ω˜ as: 1jt ω˜ = lnΦ ˆ −lnF ˆ −γ lnK − 1γ (lnK )2. (A.19) 1jt 1jt jt k jt 2 kk jt According to the evolution of technical efficiency (A.7) of the reference product and the cost of quality specification (40), we derive the explicit evolution process of technical efficiency 52

of the reference product as: ˆ ˆ ω˜ = g +g (ω˜ +γ ξ )−γ ξ +ϵ . (A.20) j1t 0 1 j1t−1 ξ jt−1 ξ j1t j1t This equation can be estimated via GMM using moment conditions E(ϵ Z ) = 0. (A.21) 1jt 1jt ˆ ˆ Note that ϵ = ω˜ −g −g (ω˜ +γ ξ )+γ ξ , where ω˜ and ω˜ are replaced 1jt j1t 0 1 j1t−1 ξ jt−1 ξ j1t j1t−1 j1t by (A.19) for period t − 1 and t, respectively. Z is the instrument variables Z = 1jt 1jt (1,lnΦ ,lnF ,lnK ,(lnK )2,ξ ˆ ). Z is uncorrelated with ϵ because ϵ 1jt−1 jt−1 jt−1 jt−1 j1t−1 Mjt Mjt Mjt is not in the information set of period t. The estimated parameters are (gˆ ,gˆ ,γˆ ,γˆ ). With 0 1 k kk these estimates, the final parameters of the translog function parameters can be recovered as α = γˆ + αˆmαˆ km and α = γ + αˆ2 km. k k αˆmm kk kk αˆmm The choice of input aggregator functional form (CES vs. translog) depends on the empirical context, because each functional form has advantages and disadvantages. Although a CES aggregator is a more restrictive functional form, its estimation does not rely on the technical efficiency evolution process. This allows researchers to specify a more flexible or complex technical efficiency evolution after estimating the rest of the model as shown in the paper. In contrast, the translog aggregator offers greater flexibility in modeling inputs. However, estimating the translog function requires jointly estimating the technical efficiency evolution process and the translog parameters, which can limit the flexibility of the technical efficiency evolution specification in empirical applications. B Multiple Materials Inputs In the paper, we follow standard practice in the literature by assuming that each firm uses a single intermediate input in the production process. In reality, however, especially for multiproduct firms, total intermediate input expenditures encompass a variety of goods that may differ both horizontally (e.g., rubber versus foam) and vertically (e.g., genuine leather versus synthetic leather). Unfortunately, most datasets report only total material expenditures, with no breakdown of types, prices, or quantities. This data limitation constrains researchers’ ability to isolate the effects of individual inputs in the production process. In this context, a key question is: under what conditions can our method accommodate the fact that firms use multiple (i.e., horizontally and vertically differentiated) material inputs, without requiring additional data? More specifically, can we still estimate the model parameters and recover productivity and quality at the firm-product level when only firm-level data on intermediate input expenditure (rather than input-type specific expenditures or more 53

disaggregated data) are available? The Online Appendix of Grieco et al. (2016) provides a positive answer in their context of single-product firms. The key assumption they need is that the effect of different intermediate inputs on production can be summarized through a homogeneous material index function. With this assumption, the production function parameters and thus productivity can be recovered even if the full vector of intermediate input expenditures is not directly observed. Such an idea can be directly applied to our context of multi-product firms. We present the details as follows. Suppose a firm utilizes a vector of material inputs, M = (M ,M ,...,M ), in jt 1jt 2jt Djt production. These inputs may include different input types and variations of the same input at different quality levels.48 However, the researcher observes only the total expenditure on all materials, E = (cid:80)D P M , rather than the quantity of each specific input, M , Mjt d=1 M djt djt djt or its corresponding price, P . M djt We assume that these material inputs enter the transformation function as follows: G(e−ω˜jtQ ) = F(L ,τ(M ),K ), (B.1) jt jt jt jt where τ : RD → R is an index function that aggregates the contribution of all material + + inputs to production.49 We assume that τ(·) is homogeneous of degree κ. As part of the production technology, the firm is assumed to know τ. Of course, without observing individual material inputs, we are not able to estimate the parameters associated within τ(·), although the value of τ(·) can be recovered. Our goal is to show how our method can be extended to such a context of multiple material inputs without estimating the parameters associated within τ(·). Note that this setup allows firms to use different material inputs to produce different products within the same firm, without explicitly modeling the allocation of each input to specific products. Such an assumption aligns with our broader modeling approach in the paper: rather than specifying separate production functions for individual products, we treat production as a transformation process. This approach enables us to account for input differentiation—whether across input types (horizontal differentiation) or quality levels 48An illustrative example in the Online Appendix of Grieco et al. (2016) is as follows. The material vector consists of three components: (M , M , M ). M and M are vertically differentiated versions of 1 2 3 1 2 the same type of input. While the quality for M is normalized to be 1, the quality for M is modeled 1 2 as a scale parameter δ > 1. M is a component that is horizontally differentiated to the other two. 3 For example, consider M , M , and M as foam sole (lower quality), rubber sole (higher quality), and 1 2 3 leather upper, respectively, for footwear industry. The material index function is modeled as: τ(M ) = jt (cid:32) (cid:33) max (cid:104) M 1 γ j 1 t +M 3 γ j 1 t (cid:105)1/γ1 , (cid:104) (δM 2jt )γ2 +M 3 γ j 2 t (cid:105)1/γ2 . 49Some firms may not use certain inputs. The Online Appendix of Grieco et al. (2016) demonstrate how a discrete choice model of input selection can accommodate such cases. 54

(vertical differentiation)—without requiring an allocation rule for how each material input is assigned to each product. In the context that individual material inputs remain unobserved by researchers, this approach makes it possible to estimate the transformation function parameters and recover firm-product-level productivity. In the following, we demonstrate how this estimation is carried out. As described in the paper, the firm’s static optimization problem is now to choose L jt and the vector M to maximize the profit. In the setup with multiple material inputs, the jt Lagrange function is: D (cid:88) (cid:88) L = P (Q ,Q ;ξ )Q −P L − P M jt jnt jt −jt t jnt Ljt jt Mdjt djt n∈Λjt d=1 (cid:40) (cid:41) −λ G(e−ω˜jtQ )−F(L ,τ(M ),K ) , (B.2) jt jt jt jt jt where λ is the Lagrangian multiplier. jt The first-order conditions with respect to labor and each individual material inputs imply: ∂F(L ,τ(M ),K ) jt jt jt λ = P , (B.3) jt Ljt ∂L jt ∂F(L ,τ(M ),K ) jt jt jt λ τ (M ) = P , ∀d = 1,2,...,D. (B.4) jt d jt Mdjt ∂τ jt where τ (M ) = ∂τ(Mjt). d jt ∂M djt Define a material price index as P = EMjt , where ψ(M ) = (cid:80)D M τ (M ). Using τjt ψ(Mjt) jt d=1 djt d jt this price index, the information in (B.4) can be summarized into a single equation by multiplying by M , summing across d, and dividing it by ψ(M ), djt jt ∂F(L ,τ(M ),K ) jt jt jt λ = P . (B.5) jt τjt ∂τ jt This equation, together with (B.3), can be interpreted as the firm’s first-order conditions, as if it were optimizing while facing a wage rate P and a material price index P for a Ljt τjt single aggregated material input, represented by the quantity index τ = τ(M ). This is jt jt analog to the setup described in Section 3 for the baseline case where only a single material input is considered. Importantly, neither P nor τ needs to be observable. τjt jt In this setting, total observed material expenditure is related to the material price index and the material quantity index via: P τ = EMjt, where E is the total expenditure on τjt jt κ Mjt materials, and κ is the degree of homogeneity of the function τ(·). This relationship follows 55

directly from Euler’s Theorem for homogeneous functions: (cid:80)D M τ (M ) = κτ(M ). d=1 djt d jt jt In the special case where the firm uses a single material input, τ(·) reduces to the identity function, which is homogeneous of degree 1, implying κ = 1. Thus, we can treat τ as analogous to M in the single-input case. Thus, the estimation jt jt strategy described in Section 3 remains applicable, with one key modification: material expenditure E is replaced by EMjt, where κ serves as an additional scaling parameter. Mjt κ The identification of κ depends on the specification of the input aggregator function. In some cases (e.g., a translog input aggregator), κ may not be separately identified from the production function parameters. In such cases, it can be normalized to 1 without loss of generality, as it is absorbed into the primary parameters of the production function. In other cases (e.g., a CES input aggregator), κ is identifiable through the revenue function, where it captures the returns to scale of the material aggregator index τ(·). For example, in setup with the CES functional form of input aggregator as specified in Section 5, the main estimating equation (33) becomes:   ln (cid:88) (η n η −1)ρ R ˜ jnt = ln (cid:20) E κ Mjt +E Ljt (cid:18) 1+ α α K (cid:18) K L jt (cid:19)γ(cid:19)(cid:21) −u jt , (B.6) n L jt n∈Λjt where   u = ln  (cid:88) (cid:34) (ηn η − n 1)R ˜ jnt e−ujnt (cid:35)  . (B.7) jt  n∈Λjt (cid:80) n∈Λjt (ηn η − n 1)R ˜ jnt  After the model parameters are estimated, the firm-product level productivity and quality can be computed in the same way as specified in Section 5. In summary, our method extends naturally to the multiple-material input setting (where the production involves different types or quality levels of material inputs) without requiring additional data, provided that the contribution of material inputs to production can be captured by a homogeneous materials index function in the transformation function rather than being modeled as inputs of individual products. While the specific functional form of this index function is not identified without additional information, this lack of identification does not hinder the recovery of the model’s parameters or firm-product level productivity. In fact, neither the precise functional form nor the dimensionality of the index function needs to be explicitly specified for our estimation approach to remain valid. C Firm’s Dynamic Decisions This appendix describes the dynamic decisions made by the firm as a completion of the full model. At the end of each period t, the firm chooses the set of products to produce, their 56

associated quality levels, and investment in technical efficiency improvement (e.g., research and development), for the next period t+1. These decisions are made conditional on the current state and after observing the adjustment costs of product scope and quality levels. Although the evolution of some state variables such as capital stock can be endogenous, we remain agnostic on modeling their exact evolution processes because our estimation method focuses on the static decisions and does not rely on how these variables evolve over time. The adjustment costs of product scope capture the costs incurred by the firm to install and arrange new production lines. The adjustment costs of product quality contain the costs of modifying the production procedure and sourcing new suppliers of the material input to meet the new quality levels. In making decisions regarding product scope, quality levels, and investment, the firm is forward-looking and takes into account the impact of the current decisions on the future paths of the state variables. In particular, the firm knows that the choice of improving the quality of a product for the next period will reduce the associated (quantity-based) productivity in the next period (i.e., due to the cost of quality). Although we do not estimate the full dynamic model in this paper—due to the high dimensionality of the state space—the model plays a crucial role in clarifying the firm’s dynamic decision-making and serves as the conceptual foundation for the static model.50 Specifically, while the firm’s choices regarding product scope, quality levels, and technical efficiency are inherently endogenous in a dynamic setting, we treat them as predetermined and observed at the time the firm chooses inputs and outputs to maximize current-period profit. Our estimation method, presented in Section 3, relies on this assumption to establish the mapping from observed variables to unobserved productivity and quality. D Discussion of the Instrumental Variables Our strategy for estimating the relationship of demand elasticity parameters η in Section n 5.2 exploits the within-firm relationship between the revenues of two products conditional on relative production capability (TFPR), implemented via an instrumental variable (IV) approach. This section discusses the validity of our IVs and outlines alternative methods for estimating η . n A key requirement for IV validity is a non-trivial degree of heterogeneity in production capability (i.e., TFPR) across firms for each product. If the dispersion in TFPR for a given product is extremely small, our identification strategy—relying on within-firm variation 50Forinstance,eveninthefootwearindustrywithonlyfourproducts,thedynamicstateincludesatleast10 continuous variables: four for technical efficiency, four for product quality, and two for input prices (material and labor). 57

in revenues to estimate the elasticity relationship (35)—will fail. As an extreme example, consider an industry where firms produce a primary product (1) and a secondary product (2). If TFPR varies substantially across firms for product 1 but is constant for product 2, then in estimating (35), the firm-level inputs used as IVs would be correlated with the error term because ζ would be mechanically determined by the TFPR of product 1. Thus, a j2t necessary empirical condition for our IV strategy is that all products exhibit sufficiently large TFPR heterogeneity. In our application to Mexican manufacturing industries, this condition is satisfied. First, products at our level of aggregation display substantial across-firm variation in TFPR, as reflected in the dispersion of prices and sales across firms. In particular, the lowest standard deviation of log output prices for any product is 0.27 and the lowest standard deviation of log sales is 1.25. The TFPR dispersion estimates reported in Online Appendix Figure A2 further confirm that heterogeneity is sufficiently large for each product in our sample. Moreover, within a firm, sales are not concentrated entirely in a single product. Online Appendix Table A2 and Online Appendix Figure A1 (discussed in Section 4) show that all products contribute non-trivially to firm-level revenues. For example, Online Appendix Table A2 reports average within-firm product shares by product scope: among firms producing five or more products, the average share of all products other than the top-selling one is 0.556, and the average share of products ranked fifth or lower is 0.147. Online Appendix Figure A1 shows the within-firm Herfindahl–Hirschman Index (HHI), where a lower value indicates greater diversification within a firm. The HHI falls to around 0.3 for firm-year pairs with five products and to about 0.2 for those with ten or more products, indicating that revenues are not dominated by their top product in these multi-products firms. The insights from the above discussion also apply to the validity of the same IVs used in estimating θ from (27), as this estimation likewise exploits the within-firm relationship. If the above heterogeneity condition does not hold, alternative approaches are available to estimate the demand elasticities. First, demand elasticities can, in principle, be identified directly from the demand function (28) using variation in prices and quantities, provided that suitable IVs uncorrelated with product quality are available. For instance, Orr (2022) estimates a demand system by constructing IVs that exploit variation in product sets and input price growth across firms operating in similar input markets but serving different output markets. In this case, one could directly estimate the demand function as discussed in Section 3 without relying on (35). Second, in the context where the assumption of constant returns to scale (i.e., ρ = 1) can be plausibly imposed, the demand elasticities can be identified from (33) alone, bypassing the need for the strategy of estimating (35). In this case, (33) simplifies 58

to the estimating equation used in Das et al. (2007), Aw et al. (2011), and Li (2018), which relates total variable cost (the counterpart to the right-hand side of (33)) to export revenues (the counterpart to the left-hand side of (33)) across multiple export markets for the same firm. E Aggregating to Firm-level TFPR The literature has a tradition of using revenue-based productivity (TFPR) as a measure of firm performance. While our framework yields a measure of TFPR at the firm-product level, these product-level measures can be aggregated when the interest is in evaluating overall firm performance. This appendix derives the aggregation. We begin with our framework and impose the standard assumption implicitly assumed in the literature using firm-level data: the productivity of producing different products within the same firm is identical (i.e., a common firm-level productivity). Specifically, from the demand function (28), the revenue for product n can be written as R ˜ jnt = P ˜ jnt Q jnt = Q j η n n η t n −1 eη 1 n ξ˜ jnt = (cid:2) Q jnt e−ω˜jnteTFPRjnt (cid:3)ηn ηn −1 , (E.1) where TFPR is defined in (38). jnt Rearranging this expression, we obtain: ηn R ˜ηn−1e−TFPRjnt = Q e−ω˜jnt. jnt jnt Raising both sides to the power θ, summing across all products n ∈ Λ , and then taking the jt 1/θ root yields:  1/θ  1/θ  (cid:88) (cid:104) R ˜ηn ηn −1e−TFPRjnt (cid:105)θ =  (cid:88) (cid:2) Q e−ω˜jnt (cid:3)θ  = F(L ,M ,K ), (E.2) jnt jnt jt jt jt     n∈Λjt n∈Λjt where the second equality follows from the transformation function (3). Given the common revenue-based productivity at the firm level, denoted TFPR , we jt replace TFPR with TFPR in the left-hand side to obtain: jnt jt  1/θ  (cid:88) (cid:104) ηn (cid:105)θ R ˜ηn−1 e−TFPRjt = F(L ,M ,K ). (E.3) jnt jt jt jt   n∈Λjt Therefore, comparing the two equations above, the firm-level TFPR can be related to 59

firm-product-level TFPR as:  −1/θ eTFPRjt =  (cid:88) (cid:0) s e−TFPRjnt (cid:1)θ  , (E.4) jnt   n∈Λjt where ηn R ˜ηn−1 jnt s = (E.5) jnt (cid:26) (cid:80) (cid:104) R ˜ηm ηm −1 (cid:105)θ (cid:27)1/θ m∈Λjt jt is a weight that depends on the relative contribution of product n to firm j’s aggregate revenue. For a single-product firm, this relationship degenerates to an identity equation. F Additional Tables and Figures Table A1: Product list by industry Product name (product code) Footwear, leather (324001) Printing and binding (342003) Pharmaceutical products (352100) Cow leather, for men (1) Printing of calendars and almanacs (5) Bactericides (11) Cow leather, for women (2) Folding boxes (6) Antiparasitics (13) Cow leather, for kids (3) Notebooks and pads (7) Dermatological (15) Others (99) Labels and prints (13) Products with specific actions (19) Brochures and catalogs (14) Circulatory system (21) Continuous forms (15) Digestive system and metabolism (22) Accounting/admin/tax forms (16) Musculoskeletal system (23) Telephone directories (17) Respiratory system (24) Books (18) Sensory organs (25) Journals (19) Genitourinary system (26) Checks (21) Blood and hematopoietic organs (27) Commemorative/business cards (23) Central nervous system (28) Commercial flyers (24) Hormones (32) Posters (25) Vitamins and compounds (43) Others (99) Non-therapeutic products (59) Others (99) 60

Table A2: Within-firm product shares by product scope Product rank (by sales level) Product scope 1 2 3 4 5+ 1 1.000 2 0.770 0.230 3 0.670 0.240 0.090 4 0.568 0.283 0.117 0.032 5+ 0.444 0.204 0.123 0.082 0.147 Note: All firm-year pairs producing 5 products or more are clustered in the “5+” group. All products ranked 5 or lower are clustered in the “5+” group. Table A3: Descriptive statistics Variable Footwear Printing Pharmaceutical Revenue per product (R) 70.890 30.713 104.376 (106.830) (75.803) (211.836) Number of workers (L) 248.722 160.826 465.352 (383.053) (157.036) (492.768) Labor expenditure (E ) 14.532 18.238 92.608 L (30.514) (22.983) (112.902) Material expenditure (E ) 53.287 65.952 269.550 M (81.526) (91.617) (384.567) Capital stock (K) 3.413 22.839 23.196 (7.428) (49.486) (32.074) Notes: The table reports the means and standard deviations (in parenthesis) for each variable by industry. R is revenues by product (1 million 2007 Mexican Peso, 1M MXN); L is the number of workers by firm, K is the capital stock by firm (1000 physical units); E is the expenditure on labor (wage bill) by firm (1M MXN); E is the expenditure on L M intermediates by firm (1M MXN). 61

Table A4: Monte Carlo parameter values Parameter Description Value N Number of products 5 T Number of periods 15 J Number of firms 500 η , n=1,...,5 Demand elasticity parameters 3, 4, 5, 6, 7 n α CES parameter of labor 0.4 L α CES parameter of material 0.4 M α CES parameter of capital 0.2 K σ Elasticity of substitution of inputs 2 ρ Returns to scale parameter 1.1 θ Substitution parameter of output 0.9 gω, , n=1,...,5 Persistence parameters in productivity evolution 0.81 0.82 0.83 0.84 0.85 n gξ, , n=1,...,5 Persistence parameter in quality evolution 0.79 0.78 0.77 0.76 0.75 n gl Persistence parameter in wage rate evolution 0.85 gm Persistence parameter in material price evolution 0.8 gk Persistence parameter in capital evolution 0.8 r Productivity and quality shock correlation -0.2 sd(εω), n=1,...,5 S.D. of productivity shock 0.025 0.020 0.015 0.010 0.005 n sd(εξ), n=1,...,5 S.D. of quality shock 0.025 0.020 0.015 0.010 0.005 n sd(εℓ) S.D. of wage rate shock 0.1 sd(εm) S.D. of material price shock 0.1 sd(εk) S.D. of capital stock shock 0.1 sd(u) S.D. of unexpected firm-product price shock (u ) 0.05 jnt Table A5: Monte Carlo: Estimates of within-firm revenue relationship 1−θ η2 1−θ η3 1−θ η4 1−θ η5 η2−1 η3−1 η4−1 η5−1 1−θ η1 1−θ η1 1−θ η1 1−θ η1 η1−1 η1−1 η1−1 η1−1 True 0.571 0.357 0.229 0.143 Estimate 0.569 0.357 0.228 0.143 Standard error (0.033) (0.020) (0.012) (0.007) Note: The estimates, for the parameters of (35), are reported as the mean estimates from the Monte Carlo simulations. Standard errors in parentheses are computed as the standard deviation of the estimates. 62

Table A6: Monte Carlo: distributional characteristics of key simulated variables Productivity ω˜ ω˜ ω˜ ω˜ ω˜ 1 2 3 4 5 Mean 0.921 0.945 0.971 1.000 1.033 Std. deviation 0.040 0.033 0.025 0.017 0.009 Quality ˜ ˜ ˜ ˜ ˜ ξ ξ ξ ξ ξ 1 2 3 4 5 Mean 0.607 0.591 0.576 0.563 0.550 Std. deviation 0.039 0.030 0.022 0.015 0.007 Within-firm revenue share share share share share share 1 2 3 4 5 Mean 0.569 0.371 0.275 0.159 0.057 Std. deviation 0.179 0.091 0.105 0.100 0.070 Note: The reported means and standard deviations are calculated as the average and standard deviation of the key variables across Monte Carlo simulations. Table A7: Welfare improvement of 1-percent increase of technical efficiency, million Pesos No Across-firm With-firm Both Total welfare 1.543 1.799 1.626 1.881 Consumer surplus 0.845 0.984 0.889 1.029 Producer surplus 0.699 0.814 0.736 0.852 Figure A1: Weighted average within-firm HHI, by number of products 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 1 2 3 4 5 6 7 8 9 10+ Number of products IHH mrif-nihtiw egarevA Notes: All firm-year pairs producing 10 products or more are clustered in the “10+” group. Theweightedaverageiscalculatedusingrevenuesasweights. 63

Figure A2: Distribution of TFPR 0.2 0.15 0.1 0.05 -10 -8 -6 -4 -2 0 2 4 6 8 10 ytisneD Footwear 0.2 0.15 0.1 0.05 -10 -8 -6 -4 -2 0 2 4 6 8 10 ytisneD Printing 0.2 0.15 0.1 0.05 -10 -8 -6 -4 -2 0 2 4 6 8 10 ytisneD Pharmaceutical Notes: TFPRisdemeaned,andonlyproductswithatleast100observationsareincluded. Figure A3: Distribution of productivity, ω˜ 0.25 0.2 0.15 0.1 0.05 -10 -8 -6 -4 -2 0 2 4 6 8 10 ytisneD Footwear 0.2 0.15 0.1 0.05 -10 -8 -6 -4 -2 0 2 4 6 8 10 ytisneD Printing 0.2 0.15 0.1 0.05 -10 -8 -6 -4 -2 0 2 4 6 8 10 ytisneD Pharmaceutical Notes: ω˜ isdemeaned,andonlyproductswithatleast100observationsareincluded. 64

˜ Figure A4: Distribution of quality, ξ 0.25 0.2 0.15 0.1 0.05 -25 -20 -15 -10 -5 0 5 10 15 20 25 ytisneD Footwear 0.15 0.1 0.05 -25 -20 -15 -10 -5 0 5 10 15 20 25 ytisneD Printing 0.15 0.1 0.05 -25 -20 -15 -10 -5 0 5 10 15 20 25 ytisneD Pharmaceutical Notes: ξ˜isdemeaned,andonlyproductswithatleast100observationsareincluded. Figure A5: The relationship between productivity and quality 15 firm-product-year fitted slope: -0.30 10 5 0 -5 -10 -15 -20 -25 -25 -20 -15 -10 -5 0 5 10 15 20 25 65

Cite this document

APA

Mauro Caselli, Arpita Chatterjee, & and Shengyu Li (2026). Productivity and Quality of Multi-product Firms (IFDP 2026-1430). Board of Governors of the Federal Reserve System, International Finance Discussion Papers. https://whenthefedspeaks.com/doc/ifdp_2026-1430

BibTeX

@techreport{wtfs_ifdp_2026_1430,
  author = {Mauro Caselli and Arpita Chatterjee and and Shengyu Li},
  title = {Productivity and Quality of Multi-product Firms},
  type = {International Finance Discussion Papers},
  number = {2026-1430},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2026},
  url = {https://whenthefedspeaks.com/doc/ifdp_2026-1430},
  abstract = {This paper introduces a method for estimating productivity and quality at the firm-product level using a transformation function framework. We use firm optimization conditions to establish a one-to-one mapping between observed data and unobserved productivity and quality. We do not need to impute firm-product input shares and can avoid imposing productivity evolution processes. The method is scalable to numerous products and can address the bias caused by unobserved heterogeneous intermediate input prices. We apply the method to a set of Mexican manufacturing industries and examine the roles of across-firm and within-firm technological spillovers, accounting for the trade-off between productivity and quality. Our quantitative analysis shows that an exogenous, product-specific technological improvement generates substantial gains in welfare, amplified by both within-firm and across-firm spillovers by approximately 17 percent and 5 percent, respectively. Moreover, within-firm resource reallocation toward the most productive products accounts for 60 percent of the resulting firm-level productivity gains.},
}