feds · August 5, 2025

Mega Firms and New Technological Trajectories in the U.S.

Abstract

We provide evidence that mega firms have played an increasingly important role in shaping new technological trajectories in recent years. While the share of novel patentsâdefined as patents introducing new combinations of technological componentsâ produced by mega firms declined until around 2000, it has rebounded sharply since then. Furthermore, we find that the technological impact and knowledge diffusion of novel patents by mega firms have grown relative to those by non-mega firms after 2001. We also explore potential drivers of this trend, presenting evidence that the rise in novel patenting by mega firms is tied to their disproportionate increase in cash holdings and the expansion of their technological scope. Our findings highlight an overlooked positive role of mega firms in the economywide innovation process.

Finance and Economics Discussion Series Federal Reserve Board, Washington, D.C. ISSN 1936-2854 (Print) ISSN 2767-3898 (Online) Mega Firms and New Technological Trajectories in the U.S. Joonkyu Choi, Serguey Braguinsky, Yuheng Ding, Karam Jo, Seula Kim 2025-060 Please cite this paper as: Choi,Joonkyu,SergueyBraguinsky,YuhengDing,KaramJo,andSeulaKim(2025). “Mega Firms and New Technological Trajectories in the U.S.,” Finance and Economics Discussion Series 2025-060. Washington: Board of Governors of the Federal Reserve System, https://doi.org/10.17016/FEDS.2025.060. NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

Mega Firms and New Technological Trajectories ∗ in the U.S. † ‡ § ¶ Joonkyu Choi Serguey Braguinsky Yuheng Ding Karam Jo ‖ Seula Kim July 16, 2025 Abstract We provide evidence that mega firms have played an increasingly important role in shaping new technological trajectories in recent years. While the share of novel patents—definedaspatentsintroducingnewcombinationsoftechnologicalcomponents— produced by mega firms declined until around 2000, it has rebounded sharply since then. Furthermore, we find that the technological impact and knowledge diffusion of novel patents by mega firms have grown relative to those by non-mega firms after 2001. We also explore potential drivers of this trend, presenting evidence that the rise in novel patenting by mega firms is tied to their disproportionate increase in cash holdings and the expansion of their technological scope. Our findings highlight an overlooked positive role of mega firms in the economywide innovation process. Keywords: Mega Firms, Innovation, Novel Patents, Knowledge Diffusion JEL Codes: O31, O33, O34, L11, L25 ∗Any opinions and conclusions expressed herein are those of the authors and do not necessarily represent the views of the World Bank, the Federal Reserve Board of Governors, or its staff. Previously circulated under the title “Mega Firms in Recent Trends in the U.S. Innovation: Empirical Evidence from the U.S. PatentData.” WethankSinaAtes, PierreAzoulay, VojislavMaksimovic, andparticipants attheWorkshop on Entrepreneurial Finance and Innovation, Technology & Policy Research Initiative Seminar, 2023 NBER Megafirms and the Economy conference, Ed Snider Center at the University of Maryland, Hitotsubashi University, Federal Reserve Board, and 2025 BSE Summer Forum for helpful comments and suggestions. †Federal Reserve Board of Governors ‡University of Maryland and NBER §The World Bank ¶Pennsylvania State University ‖Pennsylvania State University and IZA

1 Introduction The concentration of economic activities in the largest businesses, so-called mega firms, has been increasing over the past few decades (Autor et al., 2020; Yeh et al., 2022; Hsieh and Rossi-Hansberg, 2023; Kwon et al., 2024). Recent literature explores two broad sets of interpretations for this trend. Some studies have emphasized the rise in market power (De Loecker et al., 2020), possibly driven by increasing entry barriers, regulation, and lobbying activities that stifle competition (Guti´errez and Philippon, 2019; Covarrubias et al., 2020). Other studies have cast doubt on this interpretation (Foster et al., 2022) and instead emphasize increased competition or winner-takes-all dynamics caused by globalization and technological advances that enable large firms to exploit economies of scale (Autor et al., 2020; Hsieh and Rossi-Hansberg, 2023). A key issue in this debate is the role of mega firms in innovation and knowledge diffusion. Several studies raise concerns that mega firms may be strategically building patent thickets to hinder technological competition (Akcigit and Ates, 2023), reallocating inventors toward large incumbents (Akcigit and Goldschlag, 2023), or even acquiring innovative startups to discontinue competing R&D projects to preempt future competition (Cunningham et al., 2021). Taken together, this evidence raises the possibility that mega firms may be slowing the pace of innovation and limiting the spread of technological advances across the broader economy. Against the backdrop of concerns about the negative impact of mega firms, this paper presents new evidence that highlights a different and understudied role of mega firms: their growing contribution to pioneering new technological trajectories that open up opportunities forsubsequentinnovationbyotherfirms. Tothisend,weemploytheconceptofnovel patents, definedaspatentsthatintroducenewcombinationsoftechnologicalcomponentsthathadnot previously been used together (Fleming et al., 2007; Akcigit et al., 2013; Strumsky and Lobo, 2015; Verhoeven et al., 2016; Pezzoni et al., 2022). This definition aligns with Schumpeter’s (1911) view of innovation as new combinations of existing resources (Akcigit et al., 2013) and 1

Weitzman’s (1998) characterization of innovation as a recombination process. While prior studies show that novel patents are more likely to be antecedents of radical breakthrough (Verhoeven et al., 2016), we provide extensive evidence that they are also strongly correlated with a range of indicators of novelty, technological impact, and creative destruction (e.g., Kogan et al., 2017; Akcigit and Kerr, 2018; Arts et al., 2021; Kelly et al., 2021; Kalyani, 2024; Jo and Kim, 2024). We also demonstrate that our main finding is robust to using an alternative, text-based measure of “breakthrough” patents developed by Kelly et al. (2021). To fix ideas, we illustrate an example of a novel patent. U.S. patent No. 8,188,868 titled “Systems for activating and/or authenticating electronic devices for operation with apparel” was filed in 2006 by Nike, which combined CPC group G08C17 with CPC groups A43B3 and A41D1 for the first time.1 This technology integrates wireless transmitting devices into apparel—such as T-shirts and shoes—to allow athletes to monitor vital signs andperformance. Itwasinitiallycommercializedin2006throughacollaborationwithApple, resulting in NIKE+iPod Sports Kit. While it was highly uncertain at the time the extent to which this technology could create a new market or technological domain, ex-post evaluation suggests it laid the groundwork for subsequent innovations in wearable technology, motion sensing, and integrated digital services—including Apple Watch, Fitbit, and Nintendo Wii. A visual depiction of the patent is presented in Figure A1 in the Appendix. Our main finding is that, while the share of novel patents filed by mega firms had been declining since 1980, there has been a robust turnaround since 2001; by the mid-2010s, the sharereacheditshighestlevelsinceoursamplebeganin1980. Alog-differencedecomposition indicates that this rise is driven not only by the faster growth in overall patent applications by mega firms compared to non-mega firms, but also the increasing share of novel patents among mega firms’ total patents. Firm-level panel regressions confirm that mega firms became more likely to apply for novel patents than non-mega firms after 2001, even after 1CPC stands for Cooperative Patent Classification, a system developed by the U.S. Patent and Trademark Office (USPTO). G08C17 is “Arrangements for transmitting signals characterized by the use of a wireless electrical link,” A43B3 is “Footwear characterized by the shape or the use” and A41D1 is “Garments.” 2

controlling for firms’ total number of patent applications and firm size, indicating that this pattern is not solely driven by mega firms becoming larger or by them producing a larger number of patents. This finding also holds within firms: Firms produce more novel patents than before as they become mega firms—particularly after the early-2000s—suggesting that closing on market leadership has become associated with more, not less, novel innovations in recent decades. Furthermore, we show that the share of mega firms in breakthrough patents—those that combine high novelty with high impact—has also risen significantly since the early 2000s. This reinforces our main findings by demonstrating that mega firms are not only generating more novel patents but are also responsible for a growing share of the most influential technological advancements. Importantly, the rebound in the share of novel patents filed by mega firms coincided with the rebound in the share of novel patents in all patent applications in the U.S. While the number of novel patent applications had been steady in the 1990s and until the early 2000s, while their share in total patent applications had been steadily declining due to increase in the number of total patent applications, the number of novel patent applications more than doubled from 2005-2016 and their share in total patent applications in 2016 reached the levels not seen since 1995. To further assess the degree of technological impact of novel patents by mega firms, we adopt the measure suggested by previous studies (e.g., Pezzoni et al., 2022) and track the number of “follow-on patents”—the patents that use the same new technology combination asfirstintroducedbyanovelpatent. Wefindempiricalpatternsconsistentwithnovelpatents being highly experimental: 42% of novel patents do not have any follow-on patents in the first five years since their grant year, while a small fraction become “hits,” i.e., those that generate many follow-on patents and thus have a large impact on shaping new technological trajectories. We also find that, after (and only after) 2001, novel patents generated by mega firms have on average more follow-on patents and are more likely to become hits than those generated by non-mega firms. 3

One straightforward way to examine the degree of knowledge diffusion stemming from novel patents is looking at the dissemination of the follow-on patents beyond the focal firm thatgeneratedthenovelpatent. Wefindthat“self-follow-onrate”—theproportionoffollowon patents generated by the same firm that initially produced the novel patent—is similar for mega firms and non-mega firms prior to 2001. However, after 2001, we find some evidence of a slowdown in knowledge diffusion from novel patents produced by non-mega firms, whereas the trend remains stable for mega firms. Together with mega firms’ novel patents generating more follow-on patents in recent years, this finding suggests that mega firms contribute to knowledge diffusion beyond their boundaries by engaging in technological experiments and generating impactful new combinations, a channel that has been understudied in the literature.2 An important question is what drives the rise in novel innovation by mega firms since the early-2000s. To shed light on this, we develop two hypotheses and explore the supporting empirical evidence. First, we hypothesize that, given the highly experimental nature of novel innovation, firms with larger cash holdings can afford to engage more in such innovation, and that mega firms have been increasingly holding more cash or equivalent market securities relative to non-mega firms since the early-2000s. This hypothesis is consistent with findings in the finance literature that U.S. firms have been increasing their cash holdings in part to fund risky R&D activities (Bates et al., 2009; Brown et al., 2009). We find empirical evidence consistent with this hypothesis. To begin with, firms with larger cash holdings are indeed more likely to engage in novel innovation. Using a local projection method, we show that a one percent increase in cash holdings (controlling for total assets) leads to a gradual rise in novel patent applications, reaching a five percent increase after two years. While total patent applications also increase, the magnitude of the effect is less than half that observed for novel patents, suggesting that cash holdings are 2Patent reassignment and acquisitions may be another way for mega firms to defend their technological leadership (Akcigit and Ates, 2023). Subsequent changes in patent ownership are outside the scope of our analysis as we focus on the initial applicants for novel patents. 4

particularly associated with novel innovation. Examiningaggregatetrends,wefindthatthegapincashholdingsbetweenmegaandnonmega firms exhibits a U-shaped pattern, with a reversal in the late 1990s. This gap narrowed byasmuchas50%ofits1980valuethroughthelate1990s, butexperiencedastrongrebound thereafter. By2016, thefinalyearofour sample, thedifferenceincashholdingshad grownto 15%-30%aboveits1980level. Takentogether, thesefindingssupporttheideathattherisein cash holdings among mega firms after 2001 facilitated their engagement in novel innovation. Second, we investigate whether the increased concentration of inventors in the largest firms noted in the literature (Akcigit and Goldschlag, 2023), may be behind the increase in the share of mega firms in novel patents in recent decades. We find that the average inventor team size has indeed been increasing among mega firms relative to non-mega firms, but this trend is observed across all four decades of our observations and thus cannot, by itself, explain the decline in the share of novel patents produced by mega firms in the 1980s- 1990s, followed by a turnaround after 2000. We then examine the trend in the relative scope (diversity)ofinventorteams’technologicalexpertisebetweenmegaandnon-megafirmswhile controlling for inventor team size and find that mega firms experienced a relative decrease in technology scope of its inventor teams in the first two decades of our data, but their technological scope relative to non-mega firms has been on the increase in the more recent decades. Thus, it appears that broadening technological scope, not sheer inventor team size may have given mega firms a new competitive advantage in terms of novel patents, consistent with the definition of novel patents as those combining different technological components that had not been combined together before. Our findings have important policy implications. If it is true that mega firms are stifling innovation and slowing down knowledge diffusion, there may be a scope for regulatory intervention. If, however, those firms are among the key actors conducting experiments and generating new technological trajectories, then such an approach may backfire. With the U.S. technological dominance facing increasing global challenges, the stakes could not be 5

higher. We provide further discussion in the concluding section. Relation to the Literature Our paper engages with the ongoing debate surrounding the rise of mega firms and its implications for market dynamics and innovation. While priorresearchhasextensivelydocumentedtheincreasingconcentrationofeconomicactivities in large firms (Autor et al., 2020; Yeh et al., 2022; Hsieh and Rossi-Hansberg, 2023) and proposed competing explanations—ranging from increased market power (De Loecker et al., 2020; Guti´errez and Philippon, 2019; Covarrubias et al., 2020) to intensified competition driven by globalization and technology (Autor et al., 2020; Hsieh and Rossi-Hansberg, 2023; Kwon et al., 2024; Foster et al., 2022)—our study shifts the focus to the role of mega firms in innovation, particularly through the lens of novel patents. By analyzing mega firms’ contributions to novel technological combinations, our study sheds new light on whether their growing influence enhances or hinders technological progress and whether their market position is driven by innovative capabilities or by market power that may limit competition. Our paper is also related to the literature on innovation which has recently raised concerns over declining efficiency of R&D investment (Bloom et al., 2020) and the general declining trend in the U.S. innovation creativity (Kalyani, 2024). Since this trend and the afore-mentioned trend toward increasing dominance of mega firms happened concurrently in time, this has led to the examination of the interrelationship between the two trends, with much of the literature focusing on the stifling effect of the rise of mega firms on innovation through strategic patenting and a slowdown in knowledge diffusion (Akcigit and Ates, 2023), increasing concentration of inventors in the largest firms (Akcigit and Goldschlag, 2023), and “killer acquisitions” (Cunningham et al., 2021). Some more recent studies suggest, however, that part of the advantages of mega firms may actually arise from the nature of technology (Gupta et al., 2024). Our main contributions to this body of literature lies in (re-)examining the role of mega firms in innovation through the prism of novel patents and over several decades. 6

Our paper builds upon the strand of the literature which has documented growing importanceofnewcombinationsoftechnologiesininnovationandeconomicgrowth. Thisliterature has examined the technological origins of novel patents that create new knowledge combinations using various measures, most prominently, technological classes assigned to patents by patent examiners (Fleming et al., 2007; Strumsky and Lobo, 2015; Verhoeven et al., 2016; Epicoco et al., 2022). In an unpublished working paper, Akcigit et al. (2013) present broad historical evidence showing the growing importance of new technological combinations (novel patents) over the century and a half in the U.S. innovation. The most closely related paper is Pezzoni et al. (2022) which also looks at follow-on patents (trajectories of new combinations). Additionally, recent studies have leveraged natural language processing (NLP) techniques to develop more nuanced measures of patent novelty and impact, such as those by Arts et al. (2021), Kelly et al. (2021), and Kalyani (2024). These advancements provide valuable benchmarks for assessing the technological significance of patents, which we utilize to validate and compare our measure of novel patents. Most of the extant papers, however, are limited to examining the technological novelty and the role of recombined knowledge in patents’ impact and do not look into who creates new combinations. We link this literature to the above literature strand on the role of mega firms to examine how much such firms contribute to novel patents and what type of new potential technological trajectories they generate to better understand their changing role in the economy-wide innovation process. The remainder of the paper is structured as follows. Section 2 describes the data and key measurement methods. Section 3 examines the aggregate trend in mega firms’ contributions to novel innovation, along with firm-level evidence. Section 4 analyzes the technological impact and knowledge diffusion of novel patents by mega firms compared to non-mega firms over time. Section 5 explores potential drivers of the rise in novel patents by mega firms. Finally, Section 6 concludes. 7

2 Data and Measurement 2.1 Data Construction and Key Measurement Methods The primary data sources are the USPTO PatentsView and S&P’s Compustat. The USPTO PatentsView tracks all patents ultimately granted by the USPTO from 1976 onward. We collectutilitypatentsgrantedtoU.S.assigneesbetween1976and2023totrackeconomy-wide innovation activities, and in particular, the creation and trajectories of new technological combinations. We describe detailed matching procedures in the Appendix A.2.3 To identify technological components underlying an invention, we exploit the detailed information provided by the USPTO patent database on the technological content of inventions. Each patent documentation in the USPTO reports technology classes based on all disclosed information. To conduct an efficient patent search, the USPTO requires patent examiners to objectively classify an invention into technology categories based on “invention information”and“additionalinformation.” Inthispaper, weusetechnologyclassesbasedon “invention information,” which, according to the USPTO, contains “technical information in the total disclosure of a patent document (for example, description, drawings, claims) that represents an addition to the state of the art.” We utilize the Cooperative Patent Classification (CPC) to identify technological components of inventions. The CPC scheme is a hierarchical system with multiple levels of classifications. The level of classification we use in this paper is “Main Group”—the most comparable level of classification to the USPC subclass widely used in the previous literature.4 Hereafter, we use “technological components” and “main group” interchangeably. Whilenewtechnologicalcomponentsareaddedovertime, theUSPTOreclassifiesoldpatents according to the new CPC code, which ensures comparability over time. By 2016, there were 7,246 distinct main groups under the CPC scheme excluding those under CPC Section Y.5 3While we use our own USPTO-Compustat bridge in this paper, we confirmed that our basic findings are robust to using the DISCERN (Duke Innovation & Scientific Enterprises Research Network) bridge. 4Our findings are robust to using different levels of aggregation as well as the IPC classification. 5SectionYrepresentsanewadditiontopatentclassificationsintroducedtogetherwithCPC,forgeneral 8

Following previous studies (Fleming et al., 2007; Akcigit et al., 2013; Strumsky and Lobo, 2015; Verhoeven et al., 2016), we define a new technological combination as a pairwise combination of technological components that appears in a patent for the first time. Patents incorporating such new technological combinations are defined as novel patents. While our analysis is based on utility patents assigned to the U.S. entities, we identify a pair of technological components as a new combination only if it is the first combination that appears among all utility patents granted to both U.S. and non-U.S. entities since 1976. Because the first year available in the USPTO PatentsView data is 1976, we do not observe the history of technological combinations created before then. We use the first three years, 1976-1979, as a buffer period and we track novel combinations starting from 1980. Note that, by construction, a novel patent may contain multiple new pairwise technological components. While novel patents rely on new pairwise combinations of technological components, we aggregate them to the patent level and use novel patents as our measure of novel innovation, in particular because the number of pairwise new combinations is more likely to overestimate the true number of novel innovations than the measure aggregated to the patent level. Nevertheless, we have verified that all our findings remain robust to using new pairwise combinations as the measure of novel innovation.6 To study the diffusion and technological trajectories of new combinations, as well as to develop a measure of novel patents’ impact, we identify the pool of follow-on inventions of a novel patent as subsequent patents that (re-)use the same combination of technological components as introduced by the novel patent. Specifically, we count the cumulative number of patented inventions that re-use the new technological combination in years following the appearance of the focal new combination, up to late 2023. Furthermore, to gauge the extent towhichsubsequentinnovationoccursbeyondtheboundariesofthefocalfirmthatgenerated thenovelpatent, wedifferentiatefollow-onpatentsthataregeneratedbythefocalfirmversus tagging of new technological developments which are already classified or indexed in other sections. We exclude technological components tagged under this section when constructing new combinations. 6Results are available upon request. 9

those generated by other firms. Occasionally, patents are assigned to multiple assignees. In suchcases, wesayafollow-onpatentisgeneratedbythefocalfirmifitisoneoftheassignees. 2.2 Validation of Measurement While the concept of new technological combinations and the measurement of novel patents based on this idea have been developed in the literature since at least 2007, more recent studiesincorporatingNaturalLanguageProcessing(NLP)techniqueshaveintroducedalternative measures of patent novelty and impact, including but not limited to creative destruction. In this section, we briefly examine the relationship between the novel patent measure used in our paper and these alternative measures. Later, we discuss how these measures capture the contributions of mega firms compared to our approach. Specifically, we first investigate the relationship between our measure of novel patents and three alternative measures. The first two are both based on NLP techniques. The first measure, developed by Arts et al. (2021), extracts the number of new bigrams—compared to all patents filed since 1969—from the title, abstract, and claims of a patent. The second measure is the patent creativity measure due to Kalyani (2024), which is defined as the share ofcreativetechnicalbigrams—wherecreativityisdeterminedbythesebigramsnotappearing in any patent filed in the previous five years—relative to all technical bigrams. The third measure we investigate is backward self-citation ratios, defined as the number of citations to a firm’s own previous patents divided by the total number of citations. The endogenous growth literature (e.g., Akcigit and Kerr, 2018; Jo and Kim, 2024) often uses this variable to capture innovation associated with creative destruction—where firms pursue new ideas that expand beyond their existing technological scope to replace incumbents in other markets. A higher self-citation ratio is typically linked to incremental improvements in a firm’s existing technology, whereas a lower self-citation ratio suggests a shift toward novel areas. In Table 1 we use patent-level data from 1980 to 2016 to estimate the relationship between our novel patents measure and those alternative measures. Specifically, we estimate 10

regressions where the dependent variables capture various aspects of patent novelty and creative destruction, and the main independent variable is our novel patent indicator. The results in Column (1) show a strong positive relationship between our measure and that in Arts et al. (2021): we find that while non-novel patents have on average 1.5 new bigrams, novel patents have an additional 0.8 new bigrams.7 Similarly, the results in Column (2) indicate a strong positive relationship between our measure and the patent creativity measure in Kalyani (2024). Lastly, the results in Column (3) show that novel patents are associated with lower backward self-citation ratios. All regressions include CPC group fixed effects and application year fixed effects, but the results remain robust to their exclusion or to alternative specifications with different combinations of fixed effects, as shown in Appendix A.3. Table 1: Novel Patents and Other Measures of Novelty and Creative Destruction (1) (2) (3) # New Bigrams Patent Creativity Backward Self-citation Ratio Novel Patent Indicator 0.769∗∗∗ 0.020∗∗∗ -0.027∗∗∗ (0.019) (0.000) (0.001) CPC Group FE Yes Yes Yes Year FE Yes Yes Yes Mean Dep. Var. (Non-novel) 1.516 0.068 0.121 Obs. 2,630,141 2,575,446 2,657,952 R-sq 0.06 0.12 0.06 Notes: TheestimatesareobtainedfromOLSregressionswithmulti-wayfixedeffects. “#NewBigrams”indicatesthenumber ofnewbigramsinpatenttext,obtainedfromArtsetal.(2021)and“PatentCreativity”istheshareofnewtechnicalbigrams in patents, obtained from Kalyani (2024). Backward self-citation ratio is defined as the number of citations to a firm’s own previous patents divided by the total number of citations. The regression estimates are derived from all patents that were appliedbetween1980and2016. MeanDep. Var. (Non-novel)isthemeanofeachoutcomefornon-novelpatents,providedto aidinterpretationoftheestimatedcoefficients. Robuststandarderrorsareinparentheses. ***p<0.01,**p<0.05,*p<0.1. Inthefollowingsections, wewilldiscusstheroleofmegafirmsingeneratingnovelpatents as well as the impact their patents have on follow-on patents. Anticipating this, in Table 2 we also present the results of estimating the relationship between our measure of novel 7Arts et al. (2021) also extract the number of new words and new trigrams in a patent. The regression coefficients remain similarly large and positive when using these alternative measures. They also develop a measure of backward cosine similarity, for which no meaningful difference is observed. Results are available upon request. 11

patents and several other measures of patents’ impact. Specifically, we once again estimate regressions where the dependent variables capture various measures of a patent’s impact while the main independent variable remains our novel patent indicator. Column (1) uses the stock market valuation of patents (expressed as the log of real dollar values) from Kogan et al. (2017), which reflects the patent’s contribution to a firm’s current and future profits. The results indicate a modest but positive effect, with novel patents valued approximately 2% higher than non-novel patents. Column (2) examines five-year forward citations, the most widely used measure of scientific impact. On average, novel patents receive 0.7 more citations than non-novel patents, which receive an average of 5.5 citations. Lastly, Column (3) utilizes the “breakthrough” patent indicator from Kelly et al. (2021). This measure combines a measure of patent novelty with a measure of its impact. As such, Kelly et al. (2021)’s measure is conceptually most closely related to the discussion of novel patents by mega firms and their impact in Sections 3 and 4 below, while being constructed in a totally different way. In constructing their measure, Kelly et al. (2021) also leverage NLP techniques to first quantify textual similarity between patent pairs, identifying a patent as novel if its content is distinct from prior patents and impactful if it is similar to future patents. They then compute the ratio of impact to backward similarity (i.e., the inverse of novelty), where higher values indicate distinct advancements at the technological frontier that serve as a foundation for subsequent inventions. The top 10% of patents in the resulting importance distribution are classified as breakthrough patents.8 The estimation results show that novel patents are 1.5 percentage points more likely to be breakthrough patents than non-novel patents, which have a 10% probability of being classified as breakthrough. 8Specifically, Kelly et al. (2021) define their measure as the similarity of a patent to all patents filed over the next τ years, divided by its similarity to all patents filed in the previous five years. Their baseline analysis uses τ = 10, though they also construct similar measures for τ = 5 and τ = 1. In our analysis, we use τ =5 to ensure the measure can be calculated for patents filed through 2016. 12

Table 2: Impact of Novel Patents (1) (2) (3) Market Valuation 5yr Forward Citation Breakthrough Novel Patent Indicator 0.021∗∗∗ 0.690∗∗∗ 0.015∗∗∗ (0.005) (0.041) (0.001) CPC Group FE Yes Yes Yes Year FE Yes Yes Yes Mean Dep. Var. (Non-novel) 1.825 5.482 0.102 Obs. 1,504,550 2,838,910 2,836,755 R-sq 0.09 0.04 0.16 Notes: TheestimatesareobtainedfromOLSregressionswithmulti-wayfixedeffects. “MarketValuation”isthestockmarket valueofapatentobtainedfromKoganetal.(2017),and“Breakthrough”isanindicatorofwhetherapatentisabreakthrough patent,obtainedfromKellyetal.(2021). Theregressionestimatesarederivedfromallpatentsthatwereappliedbetween1980 and 2016. Mean Dep. Var. (Non-novel) is the mean of each outcome for non-novel patents, provided to aid interpretation of theestimatedcoefficients. Robuststandarderrorsareinparentheses. ***p<0.01,**p<0.05,*p<0.1. We also examine how the results in Table 2 vary under different combinations of CPC group and year fixed effects. Controlling for year fixed effects is essential for revealing the positive relationship in Column (1) of Table 2, as it accounts for the time trend in the overall stock prices that are driven by macroeconomic factors such as changes in long-term interest rates or equity risk premium. Similarly, controlling for CPC group fixed effects is necessary to uncover the positive relationships in Columns (2) and (3), as it adjusts for systematic differences in the number of patents across technology groups. Further details are provided in Appendix A.4. 3 Contribution of Mega Firms to Novel Innovation 3.1 Aggregate Trend We define mega firms as the top 50 firms by sales each year among all publicly listed firms in the Compustat dataset. Although publicly traded firms represent a selected subset of all firms in the economy, this definition is unlikely to introduce any meaningful selection bias, as the vast majority of the largest firms in the U.S. are publicly traded. While these 50 firms comprise only a tiny fraction of the millions of firms in the U.S., our calculations 13

indicate that, on average, they account for 8% of employment, 14% of sales, and 17% of patent applications annually in the U.S. economy.9 Thus, they represent a sufficiently large share of business activity and innovation to have a meaningful impact at the aggregate level. Figure 1 shows the dynamics of the contribution of mega firms to novel innovation. The solid line indicates the share of novel patent applications by mega firms out of all novel patent applications applied by U.S. patent assignees and the dashed line is its smoothed trend. The share displays a U-shaped trend: There had been a prolonged decline from 16 percent in 1980 to nine percent in 2001, followed by a steep increase afterward reaching its historical high level of 18 percent in 2016. Figure 1: Share of Novel Patent Applications By Mega Firms Source: Author’sowncalculationusingtheUSPTOpatentdatamatchedwithCompustatdata. Notes: Thefigurepresentstheshareofpatentapplicationsfiledbymegafirmsasaproportionofallpatentapplicationsfrom 1980to2016(solidline),alongwithitssmoothedtrendestimatedusingtheHodrick-Prescottfilterwithasmoothing parameterof100(dashedline). Megafirmsaredefinedasthetop50firmsbysalesinCompustatforeachyear. Notethatthe verticalaxisdoesnotstartatzero. We find that this U-shaped trend is not specific to our baseline definition of mega firms. In Appendix A.6, we show that the trend remains robust when we expand the definition 9Aggregate employment data are obtained from the Business Dynamics Statistics of the Census Bureau, andaggregatebusinessreceiptsarefromtheStatisticsofIncomepublishedbytheInternalRevenueService. WithintheCompustatdataset,megafirmsaccountfor23%ofemployment,31%ofsales,and26%ofpatent applications. 14

of mega firms to the top 100 firms by sales each year, or the top 4 firms in each four-digit NAICS industry—an approach comparable to “superstar firms” definition in Autor et al. (2020). The trend also holds when defining mega firms as the top 50 firms by sales in each year, but only among those that file at least one patent, accounting for the fact that patenting activity is highly concentrated in certain industries and firms.10 From an accounting identity perspective, the observed trend in Figure 1 can be driven by two forces. The rise since the early 2000s, for example, may reflect an increase in the total number of patent applications by mega firms or a greater tendency for those patents to be novel. To disentangle these forces, we decompose the share of novel patents produced by mega firms as follows: N N /T T m,t m,t m,t m,t = × (1) N +N (N +N )/(T +T ) T +T m,t o,t m,t o,t m,t o,t m,t o,t (cid:124) (cid:123)(cid:122) (cid:125) (cid:124) (cid:123)(cid:122) (cid:125) (cid:124) (cid:123)(cid:122) (cid:125) Yt:Megafirmshare(novel) X1,t:Relativetendency X2,t:Megafirmshare(total) where N and N are the number of novel patents applied by mega firms and other firms, m,t o,t respectively, and T and T are the total number of patents applied by each group. The m,t o,t first term, (X ), captures the relative tendency of mega firms to engage in novel innovation 1,t compared to all firms, and the second term, (X ), represents the share of total patent 2,t applications accounted for by mega firms. By taking logs on both sides of equation (1) and time-differencing, we can quantify the contribution of each component to changes in the share of novel patents by mega firms over time. Between 1980 and 2000, the share of novel patents by mega firms (Y ) declined by 7.2 t percentage points, with X and X explaining 77 percent and 23 percent of the decline, 1,t 2,t respectively. Between 2000 and 2016, Y increased by 8.5 percentage points, with X and t 1,t X explaining 35 percent and 65 percent of the increase, respectively. These results indicate 2,t that the recent rise in the share of novel patents by mega firms is driven not only by their 10This U-shaped trend is also consistent with findings by Fort et al. (2025), who show that the share of breakthrough patents filed by firms with more than 10,000 employees follows a similar U-shaped pattern. 15

growing share of total patent applications, but also by their increased propensity for novel innovation. The time series for X and X are presented in Appendix A.5. 1,t 2,t Examining the number of novel patents filed by mega firms (N ) and other firms (N ) m,t o,t separately provides useful insight into the underlying trend. One possibility is that the U-shaped pattern in Figure 1 reflects a rise and fall in novel innovation by non-mega firms— perhaps driven by startup activity during the IT boom of the 1990s—while novel patenting by mega firms remained flat. However, Figure 2 below shows that this is not the case. The number of novel patents filed by mega firms (blue circles) declined between 1980 and 2000, but began rising sharply in the mid-2000s. Meanwhile, the number of novel patents filed by non-mega firms (red squares) remained relatively stable during the earlier period and also began increasing in the late 2000s, albeit to a more modest extent than mega firms. Note each series in the figure is normalized to its 2000 value for comparability between the two series. Figure 2: The Number of Novel Patents by Mega vs. Nonmega Firms Source: Author’sowncalculationusingtheUSPTOpatentdatamatchedwithCompustatdata. Notes: The figure presents the number of patent applications filed by mega firms (blue circle) and by non-mega firms (red square)from1980to2016. Eachseriesisnormalizedbyits2000valueforcomparabilitybetweenthetwoseries. It is worth noting that the U-shaped pattern similar to that observed for mega firms 16

in Figure 1 above is also found in the dynamics of novel patents in the U.S. overall. As mentioned, Akcigit et al. (2013) showed the long-term secular trend towards increasing importance of novel combinations in the U.S. innovation since the 1830s. A closer inspection of their evidence the timeline for which ends in 2004 reveals, however, that the share of novel combinations appears to had actually been declining from around the 1980s (Figure 1b in Akcigit et al. (2013)). Consistent with this, in Figure 3 we show that the absolute number of novel patent applications assigned to U.S. entities had been basically flat, while the share of novel patents in total had been declining steadily, from 12% in 1980 to 8% at the start of the 1990s to less than 4% in the early- to mid-2000s. As the same Figure 3 shows, however, the number of novel patent applications doubled to almost 8,000 per year after the mid-2000s and their share in total patent applications had accordingly recovered to almost 6 percent, the level last seen in the mid-1990s. This renewed trend towards increasing share of novel combinations after the mid-2000s suggests that the decline in the share of novel patents from the 1980s to early 2000s may have been an aberration, after all. Furthermore, comparing Figures 1 and 3, we can see that the increase in the share of novel patents in total patent applications by mega firms started increasing already from around 2001, several years before the reversal of the overall trend, while the number of novel patents filed by mega firms has also increased more sharply that the number of novel patents filed by nonmega firms (Figure 2). 17

Figure 3: The Number of Novel Patents and Their Share in Total Patent Applications Source: Author’sowncalculationusingtheUSPTOpatentdata. As the economy evolves, the set of the largest firms changes over time; the largest firms in the 1980s are not the same as those in the 2000s. This raises the question of whether the trend in novel patenting activity by mega firms is driven, at least in part, by changes in their composition over time.11 To address this question, we decompose the annual change in the number of novel patents filed by mega firms, N , into two components: one attributable m,t to firms that remain in the top 50 from year t − 1 to t (continuers), and the other due to turnover in composition—i.e., entrants (newly included mega firms) and exiters (firms that drop out of the top 50). Specifically, we decompose the annual growth rate of novel patents filed by mega firms as g = gcont +gcomp (2) t t t where gcont captures the contribution from continuing mega firms, and gcomp captures the t t 11Importantly, our results are robust to defining mega firms as the top four firms within each four-digit NAICS industry, as shown in Figure A4. This indicates that the U-shaped trend is remains robust when holding the industry composition constant. 18

net effect of firm turnover. A detailed derivation of this decomposition, along with a visual depiction of each component, is provided in Appendix A.7.12 Applying this method, we find that for the vast majority of the 1980-2016 period, the year-to-year growth in novel patent applications by mega firms is predominantly driven by continuers rather than by the change in composition. Using the Shapley-Shorrocks variance decomposition to Equation (2), we find that gcont accounts for 74% of the variation in g , while gcomp explains the remaining t t t 26%. 3.2 Firm-level Evidence We also employ a firm-year level panel regression framework to assess the robustness of the increase in novel innovation by mega firms since 2000, controlling for various firm characteristics. Specifically, we run a set of Poisson regressions of the following form: E(Y |X ) = λ = exp(β ·Mega +β ·Mega ×Post +X′ γ +α +η ) (3) it it it 1 i,t 2 i,t t it i t where the dependent variable Y is the number of novel patent applications by firm i in year it t, Mega is an indicator whether firm i is a mega firm in t, Post is an indicator whether i,t t t ≥ 2001, X is a vector of firm-level characteristics, α is firm fixed effect and λ is industry it i t by year fixed effect (industry notation omitted). The independent variable of interest is β . 2 Naturally, this regression framework uses a subset of patent assignees that are publicly-listed firms (i.e., exclude privately-owned firms and research institutions) in order to measure their firm characteristics and it compares these firms’ novel patenting behavior between 1980-2000 period and 2001-2016 period. Table 3 shows the estimation results. 12The decomposition method is similar to that developed by Sivadasan et al. (2025) and bears some similaritytoFosteretal.(2001), withafocusondecomposinggrowthratesratherthanchangesinaverages. 19

Table 3: Firm-level Regressions of Novel Patents by Mega Firms in 1980-2016 (1) (2) (3) (4) # Novel Patents # Novel Patents # Novel Patents # Novel Patents Mega firm 2.909∗∗∗ 0.185∗∗∗ 0.109∗∗∗ 0.041 (0.062) (0.037) (0.037) (0.050) Mega firm x Post 2001 0.421∗∗∗ 0.291∗∗∗ 0.347∗∗∗ 0.217∗∗∗ (0.123) (0.053) (0.053) (0.056) Lagged IHS(# total patents) 0.896∗∗∗ 0.826∗∗∗ 0.657∗∗∗ (0.005) (0.008) (0.014) Lagged log sales -0.023 -0.049∗ (0.019) (0.026) Lagged log employment 0.119∗∗∗ 0.089∗∗∗ (0.022) (0.030) NAICS4 x Year FE Yes Yes Yes Yes Firm FE No No No Yes Obs. 49,461 38,819 32,879 24,136 R-sq 0.45 0.73 0.74 0.76 Notes: TheestimatesareobtainedfromPoissonpseudo-maximumlikelihoodregressions(PPML)withmulti-wayfixedeffects. Robust standard errors are in parentheses. *** p < 0.01, ** p < 0.05, * p < 0.1. IHS stands for Inverse-Hyperbolic Sine transformation. Column (1) is the benchmark specification that only controls for industry (four-digit NAICS) by year fixed effects, which enables us to verify whether the increasing trend of novel patents by mega firms is driven by their industry composition.13 The estimated coefficients indicate that mega firms produced 18.3 times more novel patents than non-mega firms prior to 2001, while mega firms produce an additional 52 percent more novel patents after 2001.14 In Columns (2) and (3), we examine how much of these differences are explained by observable firm characteristics. Column (2) shows that controlling for the firms’ engagement in overall patenting, measured by the lagged value of total number of patents (applying inverse-hyperbolic sine transformation to accommodate zero patents), reduces the vast majority of the difference in the outcome variable between mega firms and non-mega firms prior to 2001, while a large and significant difference still exists after 2001. In Column 13The results are very similar when we do not include these fixed effects. 14The interpretation of the coefficients are derived from exp(2.909) = 18.3 and exp(0.421)-1 = 0.52. 20

(3), we additionally control for firm sales and employment and find that mega firms produce relatively more novel patents after 2001, suggesting that the trend toward increasing novel patents by mega firms holds independently of the increase in their total number of patents or size. Finally, in Column (4), we include firm fixed effects to also account for time-invariant unobservable factors. We find that mega firms produce 24 percent more novel patents than non-mega firms after 2001 even after controlling for firm fixed effects, suggesting that firms are more (less) likely to produce novel patents as they pass the ranking threshold from (to) a non-mega firm to (from) a mega firm in recent decades.15 3.3 Trends in Mega Firms’ Share of Novel and Impactful Innovation: An Alternative Measure We further examine whether the rise in mega firms’ contributions to novel innovation since 2001 remains evident when using alternative measures of novelty. Specifically, we employ the widely recognized indicator of patent importance developed by Kelly et al. (2021), which leverages natural language processing techniques to establish links between new patents and both existing and subsequent patents. As mentioned in Section 2.2, Kelly et al. identify a patentasimportantifitscontentisdistinctfrompriorpatents(isnovel)butsimilartofuture patents (is impactful). Thus, the indicator of patent importance identifies advancements at the technological frontier that serve as a foundation for subsequent inventions. Since Kelly etal. adoptafundamentallydifferentmeasurementapproachfromours, applyingthismetric enables us to assess the robustness of our findings regarding trends in mega firms’ role in creating novel innovation. Additionally, it sheds light on whether mega firms’ contributions to both novel and impactful patents have increased over time.16 We categorize patents into groups based on their importance, as identified by Kelly 15Over the sample period, there are 233 firm-year observations that switch the mega firm indicator from the previous year. 16Wepresentfurtherevidenceontheimpactofnovelpatentsproducedbymegafirmsusingthefollow-on patents measure in the next section. 21

et al., ranging from the bottom 10 percent (the least novel and least impactful) to the top 10 percent (the most novel and impactful), with the latter classified as breakthrough patents. Figure 4 presents the share of patent applications filed by mega firms as a proportion of all patent applications (solid line) within each group, alongside its smoothed trend (dashed line), estimated using the HP filter. Figure 4: Share of Patents Filed by Mega Firms Across Novelty and Impact Groups Source: Author’sowncalculationusingtheUSPTOpatentdatamatchedwithCompustatdata. Notes: Thefiguredisplaystheshareofpatentapplicationsfiledbymegafirmsasaproportionofallpatentapplicationsfrom 1980 to 2016 (solid line) within each novelty and impact group, along with its smoothed trend (dashed line) estimated using the Hodrick-Prescott filter with a smoothing parameter of 100. The title in each panel indicates the respective novelty and impact group, based on the measure developed by Kelly et al. (2021). For instance, the top-left panel represents the bottom 10% group (i.e., the least novel and least impactful patents), while the bottom-right panel corresponds to the top 10% group (i.e.,themostnovelandimpactfulpatents,labeledasbreakthroughpatentsbyKellyetal.(2021)). Megafirmsaredefinedas thetop50firmsbysalesinCompustatineachyear. We find that the share of patent applications filed by mega firms among the most novel 22

and impactful patents has risen rapidly since the early 2000s. Specifically, the share of breakthrough patents accounted for by mega firms nearly doubled from 21% in 2001 to 39% in 2016, with a large increase observed among patents in the above-median group. By contrast, mega firms’ presence in the less novel and less impactful patents (below median) declined notably in the 1990s and has not rebounded in any meaningful way since. These findings indicatethat the growingimportance of mega firmsextends beyond novel innovation and encompasses both novel and impactful innovation. They also suggest that our main results driven by the specific measure of novelty we use.17 4 Technological Impact and Knowledge Diffusion 4.1 Follow-on Patents Having established the growing contribution of mega firms to novel innovation, we now examine the technological impact and diffusion of knowledge stemming from these innovations. While we have already shown that mega firms’ share of breakthrough patents has increased substantially in recent decades based on Kelly et al.’s measure—which assesses a patent’s novelty and its impact using linguistic similarity—we complement this finding with a measure that captures more direct reuse of technological combinations. Specifically, we draw on the approach of Pezzoni et al. (2022), who measure the impact of a novel patent by counting the number of follow-on patents—defined as patents that reuse the same new combination of technological components first introduced by the novel patent. This measure captures the extent to which a novel patent opens new technological trajectories that are subsequently built upon by others, based on information from technological classification.18 17Wealsoexaminedthetrendintheshareofmegafirmsusingalternativetext-basednoveltyonlymeasures from Arts et al. (2021) and Kalyani (2024). The findings using those alternative measures are, once again, similar to the findings using our measure of novel patents, although the degree of similarity varies according to how many “new bigrams” are used to define novel patents. Details are available upon request. 18Forwardcitations,awidelyusedmeasureofapatent’sinfluenceonfutureinnovations,arelessappealing inourcontextbecauseanovelpatentmaybecitedforreasonsunrelatedtothenewcombinationitintroduces. Nevertheless,ourmainfindingsinSection4remainrobustwhenusingcitation-basedmeasures. SeeAppendix 23

To this end, we employ a difference-in-differences style framework to estimate how the number of follow-on patents and related outcomes evolved over time for novel patents produced by mega firms relative to those by non-mega firms. Table 4 presents the results. Table 4: Follow-on Patents on Novel Patents by Mega Firms (1) (2) (3) # Follow-on Patents No Follow-on Hit Mega firm -0.070∗∗ 0.005 -0.001 (0.027) (0.005) (0.001) Mega firm x Post 2001 0.190∗∗∗ -0.015∗∗ 0.005∗∗∗ (0.037) (0.007) (0.002) Section x Year FE Yes Yes Yes Obs. 147,318 147,318 147,318 Pseudo R-sq 0.07 R-sq 0.04 0.00 Notes: Column (1) shows the result from a Poisson pseudo-maximum likelihood regression (PPML) with multi-way fixed effects,wheretheoutcomevariableisthenumberoffollow-onpatents. Columns(2)and(3)areresultsfromlinearprobability OLSregressions. Robuststandarderrorsareinparentheses. ***p<0.01,**p<0.05,*p<0.1. In Column (1), the outcome variable is the number of follow-on patents filed in the five yearsafterthenovelpatent’sapplicationyear. InColumn(2), theoutcomeisanindicatorfor whether no follow-on patents were filed for within this period. Column (1) reports estimates from a Poisson pseudo-maximum likelihood regression, while Column (2) presents results from a linear probability model. The results in Column (1) indicate that, prior to 2001, novel patents by mega firms generated 7% fewer follow-on patents than those by non-mega firms, but after 2001, they produced 13% more.19 The results in Column (2) suggest that the likelihood of having no follow-on patents within five years was similar for mega and non-mega firms before 2001, but was 1.5 percentage points lower for mega firms thereafter. To assess whether mega firms produce not only novel patents but also the most successful ones, we follow Pezzoni et al. (2022) and identify “hit” novel patents. In the baseline spec- A.8. 19exp(0.190-0.070)-1=0.127. 24

ification, hits are defined as novel patents that rank in the top one percentile of follow-on patent counts within their main CPC section over the first five years (results are robust to alternative thresholds, such as the top five percentile). Column (3) of Table 4 presents estimates from a linear probability model, where the outcome variable is an indicator for whether a novel patent becomes a hit. The results show no significant difference between mega and non-mega firms prior to 2001, but after 2001, novel patents by mega firms were 0.5 percentage points more likely to become hits, compared to a baseline probability of 1.1% for non-mega firms. 4.2 Self-follow-on Patents Pioneering a new combination is inherently experimental and can be viewed as risk-taking reducing the uncertainty in viability of a new technological space and thus facilitating followon innovation. It has been argued that a possible reason for declining business dynamism in the U.S. and elsewhere may be a slowdown in new knowledge diffusion from leading to laggard firms (e.g., Akcigit and Ates, 2021). One rather straightforward way to examine knowledge spillover from leading to laggard firms is by looking at the dissemination of the follow-on patents beyond the focal firm that generated the novel patent—whether follow-on patents are assigned to the same or to different firms. If the share of follow-on patents generated by the same firm that came up with the new combination (self-follow-on rate, hereafter) is increasing over time, it can perhaps be interpreted as evidence of a slowdown in new knowledge diffusion. Table 5 presents regression results where the dependent variable is the self-follow-on rate within the first five years. Column (1) includes an indicator for mega firms as the key independent variable, while Column (2) adds an interaction term between the post-2001 dummy and the mega firm indicator. Columns (3) and (4) replicate this analysis for “hit” novel patents, defined as before.20 20Novel patents without any follow-on patents are excluded from these regressions since the dependent 25

Table 5: Self-follow-on Rates of Novel Patents over the First Five Years (1) (2) (3) (4) Self-follow-on Rate Self-follow-on Rate Self-follow-on Rate Self-follow-on Rate Mega firm -0.003 0.004 0.015 0.016 (0.006) (0.006) (0.030) (0.035) Mega firm x Post 2001 -0.011 -0.002 -0.036 -0.002 (0.008) (0.008) (0.038) (0.042) Post 2001 0.015∗∗∗ -0.010 (0.003) (0.015) Section x Year FE No Yes No Yes Condition on Hits No No Yes Yes Obs. 85,656 85,656 1,763 1,725 R-sq 0.00 0.02 0.00 0.23 Notes: AllcolumnsshowtheresultfromlinearprobabilityOLSregressions,wheretheoutcomevariableistheshareoffollow-on patentsthatareproducedbythefirmthatinitiallyproducedthecorrespondingnovelpatent. Byconstruction,theestimation includes only novel patents that have at least one follow-on patent. Robust standard errors are in parentheses. *** p<0.01, **p<0.05,*p<0.1. The estimation results in Column (1) do provide some evidence of a slowdown in knowledge diffusion for novel patents overall, as indicated by the positive and significant coefficient on the post-2001 dummy variable. However, mega firms exhibit a self-follow-on rate comparable to that of all firms, with no significant change over time. Also, as can be seen from Column (3), there is no evidence of any decline in knowledge diffusion from most important ”hit” novel patents, either for all firms or for the subset of mega firms, before and after 2001. Thus, at this level of analysis, we find no evidence supporting a decline in knowledge diffusion from novel combinations pioneered by mega firms. 5 Potential Driving Factors The time concurrence between the declining trend in the efficiency of R&D investment and businessdynamism, combinedwiththeriseof“superstar”firms, haspromptedeconomiststo considerthepossible relation between the two. Ourempiricalexaminationofthe most recent variable is the share of self-follow-on among all follow-on patents. 26

trends in novel patents has, however, produced evidence that at least the largest among the largest, mega firms have been contributing more, not less to the generation of novel patents compared to other firms, with the turnaround happening sometime in the early 2000s. A key question is what drove this rise in novel innovation by mega firms since the early 2000s. In this section, we propose two hypotheses that can potentially shed light on this issue and present some empirical evidence to support those. 5.1 Cash Holdings by Mega Firms AlargebodyofresearchincorporatefinancehasdocumentedthatR&Disdifficulttofinance externally due to its low pledgeability, uncertain outcomes, and reliance on trial-and-error processes (for survey, see Kerr and Nanda, 2015). As a result, internal sources—particularly cash holdings—play a key role in funding R&D investment (Brown et al., 2009; Hall and Lerner, 2010; He and Wintoki, 2016). Such financial constraints are likely to be especially salient for projects aimed at generating novel technological combinations, which inherently involve high uncertainty. Building on these insights, we investigate whether the dyamics of cash holdings by mega firms can help explain the trend shown in Figure 1. We begin by examining the relationship between cash holdings and novel innovation. Krieger et al. (2022) provide a framework well-suited to this context. They develop a model of costly external financing and risky R&D in which financial frictions induce firms to behave as if they are risk averse, even where shareholders are risk neutral. This endogenous risk aversionarisesbecausethemarginalcostofexternalfundsrisessharplyonceinternalliquidity is depleted, rendering the value function locally concave. As a result, larger cash holdings reduce the likelihood of costly financing shortfalls and increase the firm’s willingness to undertake risky R&D investments. Consistent with this mechanism, Krieger et al. provide empirical evidence in the context of drug development that higher cash flows lead firms to invest more in R&D, particularly in riskier, novel projects. We test whether the predictions of Krieger et al. carry over to our setting. To this end, 27

we examine the relationship between changes in cash holdings and subsequent innovation outcomes—both total and novel—using a local projection method. Specifically, we estimate the following equation: 3 3 Y −Y (cid:88) (cid:88) i,t+k i,t−1 = α+ β ln(cash) + γ′X +θ +η +ϵ (4) (Y +Y )/2 j i,t−j j i,t−j i t t+k i,t+k i,t−1 j=0 j=0 where i indexes firm, t denotes years, Y represents the number of patent applications, and it ln(cash) is the log of cash holdings. The vector X includes control variables such as i,t i,t log assets and total patent applications. Firm and year fixed effects are denoted as θ and i η , respectively. The dependent variable captures the percent change in patent applications t between year t-1 and t+k.21 The coefficient of interest is β , which indicates the percent 0 change in novel patent applications in year t + k following a one percent increase in cash holdings in year t. Figure 5 displays the estimated β for k = 0,1,2,3,4, and 5. The left and right panels 0 display results where Y represents novel patent applications and total patent applications, it respectively. The left panel shows that following a one percent increase in cash holdings in yeart(holdingtotalassetsconstant), thenumberofnovelpatentapplicationsgraduallyrises, reaching a 5% increase by year t+2. In contrast, the right panel indicates a more immediate increase in total patent applications, though the peak effect is small (2%) compared to novel patent applications. Overall, consistent with the findings of Krieger et al. (2022), increases in cash holdings are associated with greater subsequent patenting activity, with more than twice the magnitude for novel patents compared to all patents. 21The dependent variable is a simple relative change measure developed by Tornqvist et al. (1985) and popularized by Davis et al. (1996). This measure is identical to the log difference under a second-order approximation and offers several advantages. First, it is symmetric for increases and decreases, similar to thelog-differencemeasure,butunlikethelogdifference,itaccommodateszeros. Thispropertyisparticularly relevant to this analysis, as many firm-year observations in our dataset have zero novel patent applications. Second, this measure aligns with the recommendations of Chen and Roth (2024) and Mullahy and Norton (2024)whoadvocateforscale-independentchangemeasures. Incontrast, transformationssuchaslog(1+x) or inverse hyperbolic sine are scale-dependent and therefore less suitable for outcome measures. Lastly, this measure provides a better model fit when dealing with highly skewed variables compared to the traditional percent change measure. 28

Figure 5: Percent Change in Patent Applications Following An Increase In Cash Holdings (a) Novel Patent Applications (b) Total Patent Applications Notes: The figures show estimated values of β in Equation (4), where the left panel and right panel use j the percent change in notal patent applications and total patent applications, respectively. Vertical bands represent 95% confidence intervals. Robust standard errors are used. Building on this finding, we investigate the evolution of differences in cash holdings between mega firms and non-mega firms. To do so, we estimate a difference-in-difference style regression using our Compustat firm panel data: 2016 2016 (cid:88) (cid:88) Y = α+β ·Mega + η ·I + δ ·I ·Mega +X′ γ +θ +ϵ (5) it it τ τ=t τ τ=t it it i it τ=1980 τ=1980 where Y represents the log of cash holdings for firm i in year t, and Mega is an indicator it it for mega firms. I are year dummies and the term η denotes year fixed effects, while τ=t τ δ is the coefficient of interest, tracing the evolution of differences in cash holdings between τ mega firms and non-mega firms. The control variables X include log assets as the baseline it control, aligning with the literature that measures cash holdings using the cash-to-asset ratios. Results are robust to additionally controlling for employment or sales. Figure 6 presents the estimated δ using 1980 as the base year. The left panel displays τ the results without firm fixed effects (θ ), while the right panel includes them. We find that i cash holdings of mega firms, relative to non-mega firms, declined from 1980 to the mid-1990s but have steadily increased since then. The sharp rebound since the early 2000s is evident 29

regardless of whether we isolate within-firm changes in cash holdings (i.e., firms holding more cash as they become mega firms) or also account for compositional shifts (i.e., firms with more cash becoming mega firms). Figure 6: Log Difference in Cash Holdings Between Mega Firms and Non-mega Firms (a) Without Firm Fixed Effects (b) With Firm Fixed Effects Notes: The figure shows the estimated difference in log cash holdings between mega firms and non-mega firms from 1980 until 2016 using 1980 as the base year, that is, δ in Equation (5). The shaded area τ displays 95% confidence intervals around the estimated coefficients, constructed using heteroskedasticity robust standard errors. Previous studies document a secular increase in U.S. firms’ cash holdings since the 1980s, largely driven by R&D-intensive firms (Bates et al., 2009; Pinkowitz et al., 2016). Interestingly, our results indicate that the evolution of cash holdings by mega firms, relative to nonmega firms, has followed a U-shaped pattern—rather than a monotonic increase—mirroring the trend observed in Figure 1. Combined with the results in Figure 5, this supports the notion that increased cash holdings by mega firms has been an important driving factor behind the trend in their increased contribution to novel innovation. 5.2 Size versus Technological Scope of Inventor Teams Recent studies find that inventors are becoming increasingly concentrated in large firms (Akcigit and Goldschlag, 2023). One may think that this may also contribute to the increase in the share of mega firms in novel patents as the likelihood of generating new combinations 30

may be increasing almost mechanically with the larger size of inventor teams. To assess whether the evidence aligns with this simple mechanism, we analyze the trends in the differences in inventor team size between mega firms and non-mega firms. Specifically, we once again estimate Equation (5) where Y now represents the average number of inventors asit sociated with patent applications by firm i in year t, while other variables are as defined in the previous subsection. The coefficient of interest, δ captures the evolution of differences τ in the average number of inventor team size between mega firms and non-mega firms. Figure 7 presents the evolution of differences in the average size of inventor teams (i.e., δ in Equation (5)) using 1980 as the base year. The left panel shows results without firm τ fixed effects, while the right panel includes them. We find that the relative size of inventor teams in mega firms, compared to non-mega firms, has grown steadily since 1980. This increasing trend appears more pronounced when focusing on within-firm variations. The relative increase in inventor team size in mega firms, however, is observed throughout the time period, so it is not consistent with the decline in novel patents generated by mega firms from 1980 to 2000. In other words, if increasing the relative size of the inventor teams were all what it takes to increase the output of novel patents, we would expect to see such increase uniformly over all the decades. Instead, the trend in the share of mega firms in novel patents is U-shaped, so clearly, the increase in the team size is not the (whole) story. 31

Figure 7: Log Difference in Inventor Team Size Between Mega Firms and Non-mega Firms (a) Without Firm Fixed Effects (b) With Firm Fixed Effects Notes: The figures present estimated values of δ in Equation (5) from 1980 until 2016, where the τ outcome variable is the log of the average number of inventors associated patent applications at the firm-year level. 1980 is used as the base year. The left panel excludes firm fixed effects, while the right panel includes them. Shaded area represents 95% confidence intervals, and robust standard errors are used. Recall that, by definition, novel innovation involves combining technological components thathavenotpreviouslybeenusedtogether. Thisconceptiscloselyalignedwithrecombinant innovation as described by Weitzman (1998), who shows that the probability of generating new combinations increases with the breadth of the existing knowledge base. Also, as the stock of knowledge accumulates, achieving novel breakthroughs often requires not only larger but also diverse teams of inventors with increasingly specialized expertise (Jones, 2009). We therefore conjecture that what matters for novel patents is not merely the size of the inventor team but, more importantly, the breadth (scope) of the technological expertise embodied in these teams. Based on this idea, we replace the dependent variable Y with the average number of it technological areas in which inventor teams have expertise when estimating Equation (5). Since team size and technological scope may be mechanically correlated, we control for firms’ average inventor team size, though results remain robust to excluding this control. To construct the variable capturing an inventor’s technological expertise in year t, we identify the CPC technology group in which he/she have filed the largest number of patents over his/her 32

entire cumulative patenting history up to year t. We then count the distinct number of technological areas across all members of the inventor team associated with patent applications by firm i in year t. Figure 8 presents the evolution of differences in the technological scope of inventor teams between mega and non-mega firms. The left panel, once again, presents results without firm fixed effects, capturing both within-firm changes and compositional shifts. The relative technologicalscopeofinventorteamsatmegafirmsfollowsaU-shapedpattern,withatrough around 2000. The right panel incorporates firm fixed effects, isolating within-firm variations. From the mid-1990s, firms began broadening the technological scope of their inventor teams as they became mega firms. Thus, comparing the two panels, we can see that the decline in technological scope in the two last decades of the 20th century was driven by compositional shifts—an increasing share of firms with narrower technological scope among mega firms, while after 2000, mega firms with a broader technological scope took the center stage. Figure 8: Log Difference in Technology Scope Between Mega Firms and Non-mega Firms (a) Without Firm Fixed Effects (b) With Firm Fixed Effects Notes: The figures present estimated values of δ in Equation (5) from 1980 until 2016, where the τ outcome variable is the log of the average number of CPC technological groups in which inventor teams have expertise at the firm-year level. 1980 is used as the base year. The left panel excludes firm fixed effects, while the right panel includes them. Shaded area represents 95% confidence intervals, and robust standard errors are used. Thus, the estimation results suggest that the competitive advantage of mega firms in 33

producing novel innovations may be accruing to those of them that assembled not just larger but more diverse, in terms of their technological expertise, inventor teams (recall that we control for firms’ average inventor team size when estimating Equation (5)). The findings are alsorobusttoincludingadditionalfirmsizemeasures,suchasemploymentandsales. Overall, these results support the hypothesis that an increasing scope of technological expertise at mega firms has facilitated the rapid reversal in their contribution to novel innovations since the early 2000s. 5.3 Discussion It is useful to interpret the findings above in light of recent literature on the rise of mega firms, increase in markups, and decline in business dynamism. In particular, a growing body of research highlights the rising use of intangible inputs—such as information technology and software—as a key driver of these trends. For example, De Ridder (2024) emphasizes intangible inputs that are embedded in physical products or services, such as a phone’s operating system or a car’s drive-by-wire system. These technologies typically require substantial upfront investment in development but are nearly costless to replicate once deployed, allowing firmstoscaleatlowmarginalcost. Asaresult, firmsthatcansuccessfullydevelopanddeploy such technologies are able to grow faster and dominate their markets, contributing to rising concentration. Other studies highlight how Information and Communication Technologies (ICT), when integrated into firms’ production processes, improve efficiency and reduce overhead costs, facilitating expansion into new geographic and product markets (Aghion et al., 2023; Hsieh and Rossi-Hansberg, 2023). While the specific mechanisms differ, these studies consistently show that economic activity is increasingly concentrated in firms that are better able to deploy intangible inputs than others, leading to higher market concentration. In fact, there are numerous examples of novel innovation in which firms combine ICT with non-ICT components in their products, services or production processes. For example, AmazonhasfiledseveralnovelpatentsthatcombinetechnologiesinWirelessCommunication 34

Networks (CPC code H04W) with Transport or Storage Devices (B65G) to improve inventory management through robots that receive instructions from a central computer system. Amazon has also filed patents that integrate components from Equipment for Fitting in or to Aircraft (B64D) with Data Processing Systems or Methods (G06Q) to enable drone-based package delivery. Alphabet (Google) has likewise filed numerous novel patents that combine technologies classified under Vehicles (B60) with those under Computing (G06), aimed at developing autonomous vehicles—commercialized through its spin-off subsidiary, Waymo. Hence, a substantial subset of novel innovation is linked to the development and deployment of intangible inputs, and for the reasons discussed in Sections 5.1 and 5.2 above, mega firms are increasingly securing advantageous positions in producing such innovation. The rise in their novel combinations can thus be seen as one channel through which mega firms have expanded their market share, particularly since the early 2000s. Importantly, as shown in Section 4, these innovations also generate meaningful knowledge spillovers beyond firm boundaries—a relatively overlooked but potentially beneficial aspect of mega firms’ growing role. In addition, the findings in Section 5.2 offer a new perspective on the reallocation of inventors toward large incumbents documented in Akcigit and Goldschlag (2023). While they show that inventors have increasingly moved to large firms since 2000, with declines in individual productivity—–measured by the number of patent applications and citations—– our results highlight how mega firms are organizing diverse inventor teams and actively leveraging their expertise. In a related vein, Jin et al. (2023) analyze M&A activity by the five leading technology platforms–—Alphabet (Google), Apple, Facebook, Amazon, and Microsoft (GAFAM)—–and find that GAFAM’s acquisitions span a broader range of technology categories than those of other major acquirers, such as top private equity firms. This evidence suggests that mega firms may be expanding their technological scope and acquiring inventors through firm acquisitions, although more systematic analysis is needed. 35

6 Conclusion Our empirical analysis reveals a U-shaped trend in the share of novel patents produced by mega firms, declining from 1980 to the early 2000s before rebounding sharply to reach its highest level by 2016. This resurgence reflects not only an increase in mega firms’ overall patenting activity but also a heightened propensity to engage in novel innovation, as further evidenced by firm-level analysis. Since 2001, novel patents by mega firms have demonstrated greater technological impact, generating more follow-on patents and being more likely to become “hits” compared to those by non-mega firms. We also find that the share of novel patents in U.S. patent applications overall had declined over several decades until the mid- 2000s, followed by a rebound since then, starting several years after the start of the rebound in the share of novel patents by mega firms. Together, these findings suggest that mega firms might be increasingly shaping new technological trajectories in the U.S. economy. Importantly, ourfindingsonknowledgediffusionindicatethattheself-follow-onrate—the proportion of follow-on patents generated by the same firm—has remained stable for mega firms over time, showing no evidence of impeding knowledge spillover specifically for their novel patents. This stability, combined with the higher volume of follow-on patents post- 2001forthesenovelinnovations, suggeststhatmegafirmsarefacilitatingknowledgediffusion to other firms through this selected subset of innovations. While mega firms may restrict knowledge diffusion tied to other types of patents, our analysis highlights that this pattern does not extend to their novel innovations. We also explore potential drivers of this trend, finding suggestive evidence that the relative increase in cash holdings since the early 2000s may have enabled mega firms to fund risky, experimental R&D, while the broadening technological scope of their inventor teams may have enhanced their capacity to pioneer new combinations. The share of economic activities accounted for by mega firms has dramatically increased over the past several decades, and their innovation behavior has profound implications for economic growth, technological progress, and the appropriate policy response. While mega 36

firms may protect their technological superiority in certain dimensions, our evidence suggests that, especially in recent years, they are leading technological experimentation through novel innovations that enable follow-on innovation by others. If mega firms were predominantly hindering knowledge spillover, there might be a case for considering regulatory intervention. However, given their role as key actors in generating novel technologies, the impact of such measures requires careful consideration to avoid unintended consequences. Understanding the balance between these countervailing forces remains a critical research agenda for informing policy in this debate. References Aghion, Philippe, Antonin Bergeaud, Timo Boppart, Peter J Klenow, and Huiyu Li (2023) “A theory of falling growth and rising rents,” Review of Economic Studies, 90 (6), 2675– 2702. Akcigit, Ufuk and Sina T Ates (2021) “Ten facts on declining business dynamism and lessons from endogenous growth theory,” American Economic Journal: Macroeconomics, 13 (1), 257–298. (2023) “What happened to US business dynamism?” Journal of Political Economy, 131 (8), 2059–2124. Akcigit, Ufuk and Nathan Goldschlag (2023) “Where have all the” creative talents” gone? Employment dynamics of us inventors,” Working Paper No. 31085, National Bureau of Economic Research. Akcigit, Ufuk and William R Kerr (2018) “Growth through heterogeneous innovations,” Journal of Political Economy, 126 (4), 1374–1443. Akcigit, Ufuk, William R Kerr, and Tom Nicholas (2013) “The Mechanics of Endogenous Innovation and Growth: Evidence from Historical U.S. Patents,” Unpublished. Arts, Sam, Jianan Hou, and Juan Carlos Gomez (2021) “Natural language processing to 37

identify the creation and impact of new technologies in patent text: Code, data, and new measures,” Research Policy, 50 (2), 104144. Autor, David, David Dorn, Lawrence F Katz, Christina Patterson, and John Van Reenen (2020) “The fall of the labor share and the rise of superstar firms,” The Quarterly Journal of Economics, 135 (2), 645–709. Bates, Thomas W, Kathleen M Kahle, and Ren´e M Stulz (2009) “Why do US firms hold so much more cash than they used to?” The Journal of Finance, 64 (5), 1985–2021. Bloom, Nicholas, Charles I Jones, John Van Reenen, and Michael Webb (2020) “Are ideas getting harder to find?” American Economic Review, 110 (4), 1104–1144. Brown, James R, Steven M Fazzari, and Bruce C Petersen (2009) “Financing innovation and growth: Cash flow, external equity, and the 1990s R&D boom,” The Journal of Finance, 64 (1), 151–185. Chen, Jiafeng and Jonathan Roth (2024) “Logs with zeros? Some problems and solutions,” The Quarterly Journal of Economics, 139 (2), 891–936. Covarrubias, Matias, Germ´an Guti´errez, and Thomas Philippon (2020) “From good to bad concentration? US industries over the past 30 years,” NBER Macroeconomics Annual, 34 (1), 1–46. Cunningham, Colleen, Florian Ederer, and Song Ma (2021) “Killer acquisitions,” Journal of Political Economy, 129 (3), 649–702. Davis, Steven, John Haltiwanger, and Scott Schuh (1996) Job Creation and Destruction, Cambridge, MA: MIT Press. De Loecker, Jan, Jan Eeckhout, and Gabriel Unger (2020) “The rise of market power and the macroeconomic implications,” The Quarterly Journal of Economics, 135 (2), 561–644. DeRidder, Maarten(2024)“Marketpowerandinnovationintheintangibleeconomy,” American Economic Review, 114 (1), 199–251. Epicoco, Marianna, Magali Jaoul-Grammare, and Anne Plunket (2022) “Radical technologies, recombinant novelty and productivity growth: a cliometric approach,” Journal of 38

Evolutionary Economics, 32 (2), 673–711. Fleming, Lee, Santiago Mingo, and David Chen (2007) “Collaborative brokerage, generative creativity, and creative success,” Administrative Science Quarterly, 52 (3), 443–475. Fort, Teresa C, Nathan Goldschlag, Jack Liang, Peter K Schott, and Nikolas Zolas (2025) “Growth is Getting Harder to Find, Not Ideas,” Working Paper CES-WP-25-21, US Census Bureau Center for Economic Studies. Foster, Lucia, John C Haltiwanger, and Cornell John Krizan (2001) “Aggregate productivity growth: Lessons from microeconomic evidence,” in New developments in productivity analysis, 303–372: University of Chicago Press. Foster, Lucia S, John C Haltiwanger, and Cody Tuttle (2022) “Rising markups or changing technology?” Working Paper 30491, National Bureau of Economic Research. Gupta, Abhinav, Naman Nishesh, and Elena Simintzi (2024) “Big Data and Bigger Firms: A Labor Market Channel,” Working Paper. Guti´errez, Germa´n and Thomas Philippon (2019) “The failure of free entry,” Working Paper No. 26001, National Bureau of Economic Research. Hall, Bronwyn H and Josh Lerner (2010) “The financing of R&D and innovation,” in Handbook of the Economics of Innovation, 1, 609–639: Elsevier. He, Zhaozhao and M Babajide Wintoki (2016) “The cost of innovation: R&D and high cash holdings in US firms,” Journal of Corporate Finance, 41, 280–303. Hsieh,Chang-TaiandEstebanRossi-Hansberg(2023)“Theindustrialrevolutioninservices,” Journal of Political Economy: Macroeconomics, 1 (1), 3–42. Jin, Ginger Zhe, Mario Leccese, and Liad Wagman (2023) “How do top acquirers compare in technology mergers? New evidence from an S&P taxonomy,” International Journal of Industrial Organization, 89, 102891. Jo, Karam and Seula Kim (2024) “Heterogeneous Innovations and Growth Under Imperfect Technology Spillovers,” IZA Discussion Paper No. 17581, IZA. Jones, Benjamin F (2009) “The burden of knowledge and the “death of the renaissance 39

man”: Is innovation getting harder?” The Review of Economic Studies, 76 (1), 283–317. Kalyani, Aakash (2024) The Creativity Decline: Evidence from US Patents: Working Paper. Kelly, Bryan, Dimitris Papanikolaou, Amit Seru, and Matt Taddy (2021) “Measuring technological innovation over the long run,” American Economic Review: Insights, 3 (3), 303–320. Kerr, William R and Ramana Nanda (2015) “Financing innovation,” Annual Review of Financial Economics, 7 (1), 445–462. Kogan,Leonid,DimitrisPapanikolaou,AmitSeru,andNoahStoffman(2017)“Technological innovation, resource allocation, and growth,” The Quarterly Journal of Economics, 132 (2), 665–712. Krieger, Joshua, Danielle Li, and Dimitris Papanikolaou (2022) “Missing novelty in drug development,” The Review of Financial Studies, 35 (2), 636–679. Kwon,SpencerY,YueranMa,andKasparZimmermann(2024)“100yearsofrisingcorporate concentration,” American Economic Review, 114 (7), 2111–2140. Mullahy, John and Edward C Norton (2024) “Why transform y? The pitfalls of transformed regressions with a mass at zero,” Oxford Bulletin of Economics and Statistics, 86 (2), 417–447. Pezzoni, Michele, Reinhilde Veugelers, and Fabiana Visentin (2022) “How fast is this novel technology going to be a hit? Antecedents predicting follow-on inventions,” Research Policy, 51 (3), 104454. Pinkowitz, Lee, Ren´e M Stulz, and Rohan Williamson (2016) “Do US firms hold more cash than foreign firms do?” The Review of Financial Studies, 29 (2), 309–348. Schumpeter, Joseph A. (1911) The Theory of Economic Development: An Inquiry into Profits, Capital, Credit, Interest, and the Business Cycle, Cambridge, MA: Harvard University Press, Translated by Redvers Opie, 1934. Sivadasan, Jagadeesh, Natarajan Balasubramanian, Ravi Dharwadkar, and Charlotte Ren (2025) “How do US firms grow? New evidence from a growth decomposition,” Strategic 40

Management Journal, 46 (1), 49–81. Strumsky, Deborah and Jos´e Lobo (2015) “Identifying the sources of technological novelty in the process of invention,” Research Policy, 44 (8), 1445–1461. Tornqvist, Leo, Pentti Vartia, and O. Yrjo Vartia (1985) “How Should Relative Changes Be Measured?” The American Statistician, 39 (1), 43–46. Verhoeven,Dennis,Jurri¨enBakker,andReinhildeVeugelers(2016)“Measuringtechnological novelty with patent-based indicators,” Research Policy, 45 (3), 707–723. Weitzman, Martin L (1998) “Recombinant growth,” The Quarterly Journal of Economics, 113 (2), 331–360. Yeh, Chen, Claudia Macaluso, and Brad Hershbein (2022) “Monopsony in the US labor market,” American Economic Review, 112 (7), 2099–2138. 41

A Appendix A.1 Examples of Novel Patents Figure A1: Visual image of the novel patent example in the Introduction Source: USPTO. A.2 USPTO-Compustat Matching For the analyses involving mega firms, we use S&P’s Compustat data to track publicly listed firms in the U.S. We created our own bridge between the U.S. patenting firms in the USPTO patentdatabaseandCompustatfirmsthroughastandardname-matchingandinternet-based matching algorithm as in Autor et al. (2020a). First, we standardize firm names in both datasets using the algorithm provided by the NBER PDP and use the standardized names in the matching process. We define the patenting firms as patent assignees that are located in the U.S. with an assignee type equal to 2 (U.S. company or corporation) in the USPTO data. The first match procedure involves identifying firms with precisely the same standardized names in both datasets. Following the previous studies, we do not use address information 42

in Compustat throughout the entire match process as the data only reports information for headquarters, which can be different from the exact address of the establishments that filed patent applications to the USPTO. For the unmatched USPTO firms, we use stem names (standardized firm names without suffixes) to find matches. For the rest of the unmatched U.S. patenting firms after the standard name matching, we apply an internet-based matching algorithm to identify the same firms in Compustat. Specifically, we put every patent assignee and Compustat firm name into the Google.com search engine, collect the URLs of the top five search results, and identify any given pair of the patent assignee and Compustat firm as the same firm if they share at least two identical search results. If any of these patenting firms remain unmatched, we utilize web- URL information in Compustat and find the corresponding firms if the top five search results of the unmatched patenting firms exactly match the web-URL of the Compustat firms. For all the remaining unmatched U.S. patenting firms in the USPTO data after the previous steps, we use the NBER PDP and find matches in Compustat. The NBER PDP did extensive manual matching to identify the same firms across the two datasets. Thus, this procedure helps us to reduce our burdens of manually searching the unmatched USPTO firms. Lastly, we do our own manual matching to identify matches between the USPTO and Compustat firms. We manually inspect the match results to screen out false matches, especially for firms with many patent applications at the end of each procedure. The above procedure matches 68.1% of utility patent applications (granted until Sep. 30, 2023) filed by U.S. patenting firms, and 25.6% of U.S. patenting firms to Compustat firms from 1976 to 2016. A.3 Novel Patent Indicator and Alternative Measures of Novelty This section presents regression results where the dependent variables capture various alternative measures of patent novelty and creative destruction, with the novel patent indicator as the main independent variable. The tables below report results under different specifica- 43

tions, varying the inclusion of CPC group fixed effects and year fixed effects. The findings remain robust across all fixed-effect combinations. Table A1: Novel Patents & Number of New Bigrams (1) (2) (3) (4) (5) # New # New # New # New # New Bigrams Bigrams Bigrams Bigrams Bigrams Novel Patent Indicator 0.777∗∗∗ 0.852∗∗∗ 0.559∗∗∗ 0.769∗∗∗ 0.715∗∗∗ (0.019) (0.019) (0.019) (0.019) (0.019) CPC Group FE No Yes No Yes No Year FE No No Yes Yes No CPC Group x Year FE No No No No Yes Obs. 2,633,463 2,630,141 2,633,463 2,630,141 2,629,129 R-sq 0.00 0.04 0.03 0.06 0.08 Notes: TheestimatesareobtainedfromOLSregressionswithmulti-wayfixedeffects. “#NewBigrams”indicatesthenumber of new bigrams in patent text, obtained from Arts et al. (2021). Robust standard errors are in parentheses. *** p<0.01, ** p<0.05,*p<0.1. Table A2: Novel Patents and Patent Creativity (1) (2) (3) (4) (5) Patent Patent Patent Patent Patent Creativity Creativity Creativity Creativity Creativity Novel Patent Indicator 0.022∗∗∗ 0.022∗∗∗ 0.018∗∗∗ 0.020∗∗∗ 0.019∗∗∗ (0.000) (0.000) (0.000) (0.000) (0.000) CPC Group FE No Yes No Yes No Year FE No No Yes Yes No CPC Group x Year FE No No No No Yes Obs. 2,578,919 2,575,446 2,578,919 2,575,446 2,574,407 R-sq 0.00 0.06 0.06 0.12 0.16 Notes: The estimates are obtained from OLS regressions with multi-way fixed effects. “Patent Creativity” is the share of newtechnicalbigramsinpatents,obtainedfromKalyani(2024). Robuststandarderrorsareinparentheses. ***p<0.01, ** p<0.05,*p<0.1. 44

Table A3: Novel Patents and Backward Self-citation Ratio (1) (2) (3) (4) (5) Backward Backward Backward Backward Backward Self-citation Self-citation Self-citation Self-citation Self-citation Ratio Ratio Ratio Ratio Ratio Novel Patent Indicator -0.031∗∗∗ -0.026∗∗∗ -0.032∗∗∗ -0.027∗∗∗ -0.026∗∗∗ (0.001) (0.001) (0.001) (0.001) (0.001) CPC Group FE No Yes No Yes No Year FE No No Yes Yes No CPC Group x Year FE No No No No Yes Obs. 2,661,756 2,657,952 2,661,756 2,657,952 2,656,902 R-sq 0.00 0.06 0.00 0.06 0.08 Notes: TheestimatesareobtainedfromOLSregressionswithmulti-wayfixedeffects. Backwardself-citationratioisdefined as the number of citations to a firm’s own previous patents divided by the total number of citations. Robust standard errors areinparentheses. ***p<0.01,**p<0.05,*p<0.1. A.4 Novel Patent Indicator and Measures of Impact This section presents the results of regressions in which the dependent variables represent various measures of patent impact, while the key independent variable is the novel patent indicator. Table A4 illustrates the relationshipbetween stockmarket valuation and thenovel patent indicator. Table A4: Novel Patents and Market Valuation (1) (2) (3) (4) (5) Market Market Market Market Market Valuation Valuation Valuation Valuation Valuation Novel Patent Indicator -0.025∗∗∗ -0.025∗∗∗ 0.043∗∗∗ 0.021∗∗∗ 0.006 (0.005) (0.006) (0.005) (0.005) (0.005) CPC Group FE No Yes No Yes No Year FE No No Yes Yes No CPC Group x Year FE No No No No Yes Obs. 1,506,012 1,504,550 1,506,012 1,504,550 1,502,624 R-sq 0.00 0.05 0.04 0.09 0.14 Notes: The estimates are obtained from OLS regressions with multi-way fixed effects. “Patent Creativity” is the share of newtechnicalbigramsinpatents,obtainedfromKalyani(2024). Robuststandarderrorsareinparentheses. ***p<0.01, ** p<0.05,*p<0.1. We find that the relationship is negative when year fixed effects are excluded, as shown 45

in the first two columns. To understand this result, we plot the time series of the average stock market value of patents (solid line) and the share of novel patents (dashed line) in Figure A2. The two series exhibit opposite trends until around 2008, explaining the negative coefficient when time fixed effects are omitted. The share of novel patents has been declining during this period, a trend that is also broadly consistent with the findings of Arts et al. (2021) and Kalyani (2024). Meanwhile, average stock prices have been rising, largely due to factors unrelated to patent novelty, such as the decline in long-term interest rates and the equity risk premium. Therefore, we deem it more appropriate to control for time fixed effects to better isolate the relationship between the two variables. We also find that the estimated relationship is positive but becomes very small and statistically insignificant when controlling for CPC group-by-year fixed effects. This result suggests that the impact of novel patents on firms’ profits may be limited and weak. Figure A2: Trend in the Average Stock Market Valuations and Share of Novel Patents Source: Author’sowncalculationusingthemarketvaluationdatafromKoganetal.(2017)andUPSTOpatentdatabase. Notes: Stockmarketvaluationsarelogofvaluationsdeflatedto1982(million)dollarsusingtheCPI. Tables A5 and A6 present the relationship between the novel patent indicator and two outcomes: the five-year forward citation count and the breakthrough patent indicator developed by Kelly et al. (2021), respectively. In these cases, controlling for CPC group fixed 46

effects is crucial for uncovering the positive relationship. Table A5: Novel Patents and 5-year Forward Citations (1) (2) (3) (4) (5) 5-year 5-year 5-year 5-year 5-year Forward Forward Forward Forward Forward Citation Citation Citation Citation Citation Novel Patent Indicator -0.675∗∗∗ 0.528∗∗∗ -0.428∗∗∗ 0.690∗∗∗ 0.575∗∗∗ (0.041) (0.042) (0.041) (0.041) (0.042) CPC Group FE No Yes No Yes No Year FE No No Yes Yes No CPC Group x Year FE No No No No Yes Obs. 2,842,894 2,838,910 2,842,894 2,838,910 2,837,936 R-sq 0.00 0.03 0.01 0.04 0.07 Notes: TheestimatesareobtainedfromOLSregressionswithmulti-wayfixedeffects. Backwardself-citationratioisdefined as the number of citations to a firm’s own previous patents divided by the total number of citations. Robust standard errors areinparentheses. ***p<0.01,**p<0.05,*p<0.1. Table A6: Novel Patents and Breakthrough Patents (1) (2) (3) (4) (5) Breakthrough Breakthrough Breakthrough Breakthrough Breakthrough Novel Patent Indicator -0.039∗∗∗ 0.015∗∗∗ -0.039∗∗∗ 0.015∗∗∗ 0.004∗∗∗ (0.001) (0.001) (0.001) (0.001) (0.001) CPC Group FE No Yes No Yes No Year FE No No Yes Yes No CPC Group x Year FE No No No No Yes Obs. 2,836,794 2,836,755 2,836,794 2,836,755 2,835,781 R-sq 0.00 0.11 0.02 0.16 0.29 Notes: TheestimatesareobtainedfromOLSregressionswithmulti-wayfixedeffects. Backwardself-citationratioisdefinedas thenumberofcitationstoafirm’sownpreviouspatentsdividedbythetotalnumberofcitations. Notethatthebreakthrough measureinKellyetal.(2021)isalreadyresidualizedwithrespecttoyearfixed-effects,soincludingyearfixedeffectsdoesnot changethecoefficients. Robuststandarderrorsareinparentheses. ***p<0.01,**p<0.05,*p<0.1. A.5 Decomposition of Mega Firms’ Share in Novel Patents Figure A3 below displays the decomposition of novel patent applications filed by mega firms into (i) the share of novel patents within mega firms relative to that share among all firms (solid line) and (ii) the share of total patents accounted for by mega firms (dashed line). Both time series exhibit a U-shaped pattern with the reversal occuring around 2000. 47

Figure A3: Decomposition of Mega Firms’ Share in Novel Patents Source: Author’sowncalculationusingUPSTOpatentdatabaseandCOMPUSTAT. A.6 Trends in Mega Firms’ Share of Novel Patents Under Alternative Definitions of Mega Firms Figure A4 illustrates the share of novel patent applications filed by mega firms under various definitions. The top-left panel reproduces Figure 1 as a benchmark. The top-right panel expands the definition of mega firms to the top 100 firms by sales in each year. The bottomleft panel defines mega firms as the top 4 firms by sales in each four-digit NAICS industry. The bottom-right panel restricts mega firms to the top 50 firms by sales in each year, but only among COMPUSTAT firms that file at least one patent, accounting for the fact that patenting firms represent a highly selected subset. We find that the U-shaped trend reported in Figure 1 remains evident across all definitions. 48

Figure A4: Trends in Mega Firms’ Share in Novel Patents Under Alternative Definitions of Mega Firms Source: Author’sowncalculationusingUPSTOpatentdatabaseandCOMPUSTAT. A.7 Contribution by Continuers vs. Change in Composition of Mega Firms Let N denote the number of novel patent applications filed by mega firms in year t. We m,t decompose the annual change in this quantity, ∆N = N −N , as follows: m,t m,t m,t−1 (cid:88) (cid:88) (cid:88) ∆N = ∆n + (n −0)+ (0−n ), m,t i,t i,t i,t−1 i∈S t cont i∈S t new i∈S t exit where n is the number of novel patent applications by firm i in year t; Scont denotes the i,t t set of firms that are mega firms in both years t−1 and t (continuers); Snew refers to firms t that became mega firms in year t but were not in year t−1 (entrants); and Sexit refers to t firms that were mega firms in year t−1 but not in year t (exiters). 49

Dividing both sides by the average level of novel patenting, N = Nm,t+Nm,t−1, and m,t 2 expressing each term as a weighted average, we obtain: ∆N (cid:88) ∆n n (cid:88) (n −0) n (cid:88) (0−n ) n m,t i,t i,t i,t i,t i,t−1 i,t = · + · + · . N n N n N n N m,t i∈S t cont i,t m,t i∈S t new i,t m,t i∈S t exit i,t m,t Each term on the right-hand side represents a weighted average of firm-level growth rates in novel patent applications, where the weights are given by n /N . While the i,t m,t decomposition treats new mega firms as having zero patenting activity in year t − 1, this simply reflects their zero contribution to the mega firm group in that year. The same logic applies to exiters, whose contribution to the group becomes zero in year t. Therefore, the growth rate of novel patent applications among mega firms can be expressed as g = gcont +gnew +gexit = gcont +gcomp, t t t t t t where gcont captures the contribution of continuing mega firms, and gcomp(= gnew + gexit) t t t t captures the net effect of turnover. Note, by construction, gnew ≥ 0 and gexit ≤ 0. Hence, t t for example, a positive gcomp indicates that novel patenting by entrants in year t exceeded t that of exiters in year t−1. 50

Figure A5: Percent Change in Novel Patent Applications by Mega Firms: Contribution by Continuers vs. Change in Composition Figure A5 presents the decomposition graphically. The solid black line shows the overall growth rate g , while the blue bars represent the contribution from continuers (gcont) and t t the rose bars show the contribution from turnover (gcomp). The results indicate that, for the t vast majority of the sample period, year-to-year changes in novel patenting by mega firms were predominantly driven by continuers, rather than by changes in composition. A notable exception occurred around the year 2000, when new mega firms generated significantly fewer novel patents than the firms they replaced. Using the Shapley-Shorrocks variance decomposition, we find that gcont accounts for 74% of the variation in g , while gcomp explains the t t t remaining 26%. A.8 Forward Citations and Self-citation Rates of Novel Patents Tables A7 and A8 report results from the same regression specifications used in Tables 4 and 5, but with forward citations and self-forward-citation rates as alternative outcome variables. In these tables, “hits” are defined as novel patents that fall within the top one percentile of forward citations within their main CPC section over the first five years. 51

Table A7: Forward Citations on Novel Patents by Mega Firms (1) (2) (3) # Forward Citations No Forward Citation Hit Mega firm 0.025 -0.008∗ 0.001 (0.020) (0.004) (0.001) Mega firm x Post 2001 0.225∗∗∗ 0.009 -0.001 (0.074) (0.007) (0.002) Section x Year FE Yes Yes Yes Obs. 147129 147129 147129 Pseudo R-sq 0.09 R-sq 0.08 0.00 Notes: Column (1) shows the result from a Poisson pseudo-maximum likelihood regression (PPML) with multi-way fixed effects,wheretheoutcomevariableisthenumberoffollow-onpatents. Columns(2)and(3)areresultsfromlinearprobability OLSregressions. Robuststandarderrorsareinparentheses. ***p<0.01,**p<0.05,*p<0.1. Column (1) shows that, consistent with Table 4, novel patents filed by mega firms receive, on average, more forward citations after—and only after—2001. In contrast to Table 4, however, there is little difference between mega firms and non-mega firms in the extreme outcomes shown in Columns (2) and (3). 52

Table A8: Self-forward-citation Rates of Novel Patents by Mega Firms (1) (2) (3) (4) Self-fwd.-cit. Rate Self-fwd.-cit. Rate Self-fwd.-cit. Rate Self-fwd.-cit. Rate Mega firm 0.065∗∗∗ 0.060∗∗∗ 0.134∗∗∗ 0.119∗∗∗ (0.004) (0.004) (0.028) (0.030) Mega firm x Post 2001 0.004 0.009 0.009 -0.045 (0.007) (0.007) (0.047) (0.049) Post 2001 0.114∗∗∗ 0.117∗∗∗ (0.002) (0.015) Section x Year FE No Yes No Yes Condition on Hits No No Yes Yes Obs. 100894 100894 1700 1659 R-sq 0.03 0.07 0.06 0.30 Notes: AllcolumnsshowtheresultfromlinearprobabilityOLSregressions,wheretheoutcomevariableistheshareoffollow-on patentsthatareproducedbythefirmthatinitiallyproducedthecorrespondingnovelpatent. Byconstruction,theestimation includes only novel patents that have at least one follow-on patent. Robust standard errors are in parentheses. *** p<0.01, **p<0.05,*p<0.1. Column (1) indicates that self-forward-citation rates increased overall after 2001, consistent with the findings of Akcigit and Ates (2023). However, in line with Table 5, there is no evidence for a faster increase for novel patents filed by mega firms. As shown in the first rows, mega firms had higher self-forward-citation rates prior to 2001, suggesting slower knowledge diffusion from their novel innovations during this period. 53

Cite this document

APA

Joonkyu Choi, Serguey Braguinsky, Yuheng Ding, Karam Jo, & Seula Kim (2025). Mega Firms and New Technological Trajectories in the U.S. (FEDS 2025-060). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2025-060

BibTeX

@techreport{wtfs_feds_2025_060,
  author = {Joonkyu Choi and Serguey Braguinsky and Yuheng Ding and Karam Jo and Seula Kim},
  title = {Mega Firms and New Technological Trajectories in the U.S.},
  type = {Finance and Economics Discussion Series},
  number = {2025-060},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2025},
  url = {https://whenthefedspeaks.com/doc/feds_2025-060},
  abstract = {We provide evidence that mega firms have played an increasingly important role in shaping new technological trajectories in recent years. While the share of novel patentsâdefined as patents introducing new combinations of technological componentsâ produced by mega firms declined until around 2000, it has rebounded sharply since then. Furthermore, we find that the technological impact and knowledge diffusion of novel patents by mega firms have grown relative to those by non-mega firms after 2001. We also explore potential drivers of this trend, presenting evidence that the rise in novel patenting by mega firms is tied to their disproportionate increase in cash holdings and the expansion of their technological scope. Our findings highlight an overlooked positive role of mega firms in the economywide innovation process.},
}