feds · November 30, 2009

A Dynamic Analysis of Consolidation in the Broadcast Television Industry

Abstract

This paper estimates a dynamic oligopoly model in order to separately identify the demand-side and cost-side advantages of consolidation in the broadcast television industry. I exploit an exogenous change in regulation that led to significant industry consolidation. Using revenue and ownership data for broadcast stations over the past ten years, I estimate the effect of ownership changes on revenue. I recover costs by examining patterns in ownership changes that are left unexplained by revenue estimation. I model firms' purchasing decisions as a dynamic game, and estimate the game using a two-step estimation method recently developed by Bajari, Benkard & Levin (2007). This is the first paper to estimate a model of merger activity in a dynamic, strategic setting. I find that there are both revenue and cost advantages to consolidation, but they operate through different mechanisms. Access to a wider audience enables firms to increase per-station advertising revenue, while simply owning more stations enables firms to reduce per-station operating costs. A firm's ability to realize these benefits is affected by its stations' network affiliations, locations and viewers.

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. A Dynamic Analysis of Consolidation in the Broadcast Television Industry Jessica C. Stahl 2009-48 NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

A Dynamic Analysis of Consolidation in the Broadcast Television Industry ∗ Jessica Calfee Stahl Board of Governors of the Federal Reserve System October 2009 Abstract Thispaperestimatesadynamicoligopolymodelinordertoseparately identify the demand-side and cost-side advantages of consolidation in the broadcast television industry. I exploit an exogenous change in regulation that led to significant industry consolidation. Using revenue and ownership data for broadcast stations over the past ten years, I estimate the effect of ownership changes on revenue. I recover costs by examining patterns in ownership changes that are left unexplained by revenue estimation. I model firms’ purchasing decisions as a dynamic game, and estimatethegameusingatwo-stepestimationmethodrecentlydeveloped by Bajari, Benkard & Levin (2007). This is the first paper to estimate a model of merger activity in a dynamic, strategic setting. I find that there are both revenue and cost advantages to consolidation, but they operate through different mechanisms. Access to a wider audience enables firms toincreaseper-stationadvertisingrevenue,whilesimplyowningmorestations enables firms to reduce per-station operating costs. A firm’s ability to realize these benefits is affected by its stations’ network affiliations, locations and viewers. ∗Email: jessica.c.stahl@frb.gov. This is a revised version of a chapter of my Ph.D. dissertation at Boston University. I would like to thank my advisor Marc Rysman for his continued guidance and support. I would also like to thank Joshua Lustig, Randall Ellis, Iain Cockburn and Jordi Jaumandreu, as well as participants at a number of seminars, for very helpful suggestions. Allerrorsaremyown. Theanalysisandconclusionssetfortharethoseoftheauthor and do not indicate concurrence by other members of the staff, by the Board of Governors, or by the Federal Reserve Banks. 1

1 Introduction Deregulation in 1996 and 1999 has led to substantial consolidation in the local broadcast television industry. While the consolidation has provoked considerable controversy, the forces that drive it are poorly understood. This paper exploits the exogenous change in regulation, and estimates a dynamic oligopoly model so as to separately identify the demand-side and supply-side advantages of consolidation. This is the first paper to estimate a model of consolidation in a dynamic, strategic framework. The rapid consolidation that took place following deregulation suggests that there are competitive advantages to consolidation in the broadcast television industry. Yet the vast majority of consolidation involves stations in different local markets, which are not in direct competition with one another. These markets are often located hundreds of miles apart. Therefore it is not obvious how these firms achieve either market power or economies of scale. How, for instance, does owning an NBC-affiliated station in Hagerstown, Maryland helps Nexstar Broadcasting run its Fox-affiliated station in Fort Wayne, Indiana?1 Itispossiblethatfirmswithmorestationsareatanadvantageinnegotiations withadvertisersiftheyoffermoreviewerspercontract. However,thisonlyworks iffirmsacquireportfoliosofstationsthatareattractivetoadvertisersasagroup. If firms are doing this, it should be reflected in higher per-station revenue for larger firms. On the other hand, it is possible that firms are able to centralize operations of stations even though they are located in separate markets, often many miles apart. If firms are doing this, it should be reflected in lower perstation costs for larger firms. 1Whilemostpeoplethinkoflocalstationsintermsoftheirnetworkaffiliations,stationsare usuallyownedbyanothercompanyaltogether,suchasNextarBroadcasting. Thisisdiscussed in more detail in Section 2 describing the industry. 2

The goal of this paper is to estimate the revenue versus cost advantages of consolidation, and to identify the characteristics of a firm’s portfolio of stations that enable the firm to enjoy these advantages. I consider the number of stations a firm owns, the population coverage of its stations, the average distance between its stations, the demographic heterogeneity of its viewers, and the network affiliations of its stations. These characteristics are likely to impact the attractiveness of a firm’s portfolio of stations to advertisers and/or affect the ability of the firm to centralize operations of its stations. Revenue and ownership data are available for nearly all broadcast stations since deregulation began. This permits direct observation of how the revenue generated by a firm’s stations responds to changes in the firm’s portfolio of stations. However, I do not observe cost, so I infer cost from patterns in how firms change their portfolios. Patterns that cannot be explained by revenue opportunities are attributed to cost savings. For example, if firms cluster their stations close together with no discernable effect on revenue, I infer that doing so reduces costs. This method for recovering costs is complicated by the fact that a firm’s portfolio decision is both dynamic and strategic. Because the buying and selling of stations involves significant sunk costs, the effects of this period’s purchasing decision will likely persist through future periods. For instance, a firm must consider that the purchase of additional stations this period will increase the number of stations that it can expect to own in all future periods, so the firm mustconsidertheeffectofthepurchasenotonlyonnextperiod’sexpectedprofits but on all future periods’ expected profits. In addition, if a firm’s revenue is affected by its competitors’ characteristics, then a firm will consider how its own purchasing decision will affect its competitors’ decisions. Suppose, for example, that a firm hesitates to acquire additional stations even though this would lower 3

per-stationcosts, becausedoingsowouldpromptitscompetitorstodothesame. I want my model to allow for this strategic behavior, so that I do not wrongly attribute it to a lack of cost savings from owning more stations. In order to take into account both the dynamic and strategic elements of a firm’s purchasing decisions, I model the decisions as a dynamic game. I estimate the dynamic game in order to recover costs. Each firm’s portfolio of stations is summarized by a vector of characteristics (number of stations, population coverage,distancebetweenstations,heterogeneityofviewersandnetworkaffiliations). In my model, a firm decides each period how to adjust these characteristics of its portfolio; this acts as an approximation of the firm’s true decision about which stations to buy and sell. The firm adjusts its vector of portfolio characteristics so as to maximize the present discounted value of its expected stream of profits, basing its decision on its own current vector of portfolio characteristics, as well as those of its competitors. The broadcast television industry is well-suited for identifying strategic behavior because firms’ stations are spread across independent local markets. In a given period, different firms are present in different sets of markets, so the nature of competition that they face varies. This feature of the industry allows me to identify the effects of strategic variables on firms’ payoffs, because I observe within-time variation in the values of these strategic variables. Until recently, it was virtually impossible to estimate dynamic games such as this for computational reasons. However, several authors have recently developed methods for estimating the parameters of a dynamic game that avoid computing the equilibrium even once. I adopt the two-step method proposed by Bajari, Benkard&Levin(2007)becauseitallowsforcontinuouschoicevariables. In the first stage, I use revenue data to estimate the effects of firms’ portfolio characteristics on per-station revenue. The second stage examines ownership 4

data to estimate the strategies of firms, that is, how they adjust their portfolio characteristics each period. Given these results, I find the effects that portfolio characteristics must have on per-station costs such that the observed strategies are consistent with profit maximization. The result is a set of estimates of both the revenue and cost effects of consolidation. This paper develops an innovative way to estimate the demand-side and supply-side advantages of consolidation in the absence of cost data. As far as I know, it is the first paper to estimate a model of merger activity in a dynamic, strategic framework. My approach allows me to isolate the relationships of interest in an industry that each year witnesses hundreds of firms considering the purchase or sale of over one thousand stations. I find that there are both cost and revenue advantages to consolidation, but they operate through different mechanisms. Revenue advantages come through greater population coverage, which is probably attractive to advertisers who would like to reach a wider audience for each contract that they negotiate. Cost advantages come through simply owning more stations; firms appear to be able to combine operations of multiple stations so as to cut per-station costs. Cost advantages are greater when a firm’s stations’ viewers are more demographically homogeneous. If most of a firm’s stations have the same network affiliation, the firm’s per-station cost is lower, but so is its per-station revenue. The rest of the paper proceeds as follows. Section 2 describes the relevant literature, Section 3 describes the broadcast television industry and Section 4 describes the data that I use. Section 5 describes the dynamic oligopoly model and Section 6 explains my estimation strategy in detail. Section 7 explains my results and Section 8 concludes the paper. 5

2 Related Literature There is a substantial theoretical literature seeking to understand why some industries initially support many competitors but then undergo significant consolidation or “shake-outs.”2 Some of these studies focus on the evolution of an industry from a highly competitive industry to an oligopoly, while others focus on the features of an industry that characterize its inherent or “natural” level of competitiveness in the long run. Because the broadcast television industry was not allowed to consolidate until the late 1990s, it is impossible to know what its natural evolution would have been. The rapid consolidation that followed deregulation suggests that there are demand-side and/or supply-side features of the industry that make it naturally concentrated rather than highly competitive. This paper takes advantage of a shake-out triggered by the exogenous deregulation in order to identify those features. Shaked & Sutton (1987) develop the idea that certain features of an industry’s production technology can lead the industry to be concentrated regardless of the size of demand. Specifically, they look at the relationship between a firm’s cost and the quality of its product. My results suggest that this relationship is important in the case of the broadcast television industry. Firms with greater population coverage appear to offer advertisers an inherently higherquality product, suggesting that consolidation changes the relationship between cost and quality for broadcasting firms. There is relatively little empirical research on mergers and consolidation despite the obvious practical importance of the topic. A persistent problem is the difficulty of identifying the factors that drive consolidation because merger ac- 2These studies include, but are not limited to, Jovanovic (1982), Shaked & Sutton (1983), Shaked & Sutton (1987), Klepper & Grady (1990), Shaked & Sutton (1990), Hopenhayn (1992), Petrakis & Roy (1999), Klepper & Simons (2000), and Klepper (2002). 6

tivity is endogenous. Event analysis uses the exact timing of the announcement of a merger as a source of exogeneity. Also called event studies, these look at how merger announcements affect the stock market values of the firms affected by the merger, in order to understand the effects of the merger. For instance, Knapp (1990) does an event analysis of the airline industry and concludes that mergers in this industry increase market power. Other papers make use of exogenous sources of variation in merger activity prompted by changes in regulations. For instance, Paul (2003) and Pesendorfer (2003)bothestimatethecoststructureofanindustrybeforeandafterconsolidation, making use of revisions to merger guidelines to argue that merger activity was exogenous. Paul looks at the beef-packing industry, while Pesendorfer looks at the paper industry. Both authors conclude that consolidation alters the cost structure of firms. Kim & Singal (1993) make use of a period of time when airline mergers went uncontested, and find a positive effect of airline mergers on prices. A few of these papers look specifically at consolidation in the broadcast television industry following deregulation. In 2006, the Federal Communications Commission (FCC) commissioned ten studies to estimate the effect of media consolidation on “diversity, competition and localism.”3 Crawford (2007) considers the impact of consolidation on the quantity and quality of local television programming. Shiman (2007) studies the relationship between ownership structure and the quantity of news programming, and Hammond et al. (2007) study the relationship between consolidation and minority and female ownership of television stations. Results are largely inconclusive; they depend on the methods of measurement and estimation, and are often statistically or economically 3“FCC Names Economic Studies to Be Conducted As Part of Media Ownership Rules Review,” November 22, 2006. See http://www.fcc.gov/ownership/actions.html. 7

insignificant. In all of these studies, ownership changes are treated as strictly exogenous, and the mechanisms through which they affect market outcomes are left largely unexplained. There is very little empirical work that endogenizes consolidation or firms’ merger decisions. An exception is the recent literature that models mergers as matching games between firms. The approach is similar to mine in that it uses merger decisions to make inferences about the parameters of firms’ payoff functions or to reveal firms’ incentives. Akkus & Hortacsu (2007) modelbankmergerdecisionstorevealwhatfeaturesofthemergingbanksenable them to create more value through merger. Park (2008) uses a matching model to analyze manager incentives in mutual fund acquisitions.4 However, matching models tend to suffer from a dimensionality problem. As far as I know, no matching model has incorporated dynamics, and even the incorporation of strategic effects has been limited. For instance, Fox (2007) suggests a way to avoid the curse of dimensionality in a matching model; the method essentially involves the assumption that a given pair of matching agents do not care about the other matches that occur in the market, so long as they prefer their own match to any other match they could have made. This assumption is reasonable in the context of, for instance, a marriage market. However, I am not willing to make this assumption, given that broadcast television firms may be directly affected by mergers between other firms in the industry. For this reason, I do not model mergers as a matching game. However, the incorporation of strategic and dynamic effects in the matching model is an interesting area for future research. 4See Sorenson (2007), Fox (2007) and Ho (2007) for other examples of matching game estimation, applied to settings other than mergers. 8

3 Broadcast Television Industry Originally, broadcast television stations could only be viewed by those who lived within reach of the station’s over-the-air signal.5 They tended to be locally owned, but as major national networks emerged, the government became concerned that local programming would be dominated by national interests. The FCC put into place a number of restrictions that were meant to encourage local ownership; these are discussed in detail later. Stations were thought to be more able to respond to the needs of the local community if they were not owned by or beholden to national networks.6 The total number of stations in the country is effectively fixed over time because licenses for new stations are rarely issued, and stations virtually never give up their licenses. There are 1,251 full-power commercial stations in the country, across 210 markets.7 Many stations maintain an affiliation with a national network; the station agrees to broadcast the network’s prime-time programming and advertising, usually in exchange for a fee. By maintaining affiliated stations in most markets across the country, the major networks have national coverage without violating FCC ownership rules. Four networks have essentially national 5Now most households in the United States have cable television, so they view their local broadcast stations through their cable provider; the FCC’s “must-carry” laws force cable providers to carry local stations. With very few exceptions, though, these stations are only provided locally. 6Forexample,theFCC’s1941“ReportonChainBroadcasting”maintainsthat“[a]station licensee must retain sufficient freedom of action to supply the program and advertising needs of the local community. Local program service is a vital part of community life.” 7Nielsen Media Research divides the country into 210 Designated Market Areas (DMAs) that are essentially mutually exclusive. A typical market has between five and eight stations; Boston and Houston have the most with 14 each, whereas a number of small towns have just one station each. 9

coverage: ABC, CBS, Fox and NBC. The WB and UPN merged in 2006 to form CW, which has emerged as a competitor to the major four. There are also a handful of smaller networks, the most significant of which are Ion and the Spanish-language networks Univision, Telemundo and Telefutura. TheTelecommunicationsActof1996wasthefirstmajoroverhauloftelecommunications policy since the Communications Act of 1934 created the FCC nearly 62 years earlier. The 1996 Act came in the wake of the breakup of AT&T, when there seemed to be some consensus among policy-makers that the telecommunications industry was over-regulated. Broadcasting companies argued that they were struggling to compete with cable television and home video, and exerted pressure on the FCC to loosen restrictions on ownership of stations. The 1996 Act removed the limit on the total number of stations that a single entity could own (previously 12) and increased from 25% to 35% the portion of the national population that a single owner’s stations could cover. Further deregulation in 1999 allowed joint ownership of two stations with overlapping coverage8 (previously disallowed) as long as they were not in the same market, and allowed joint ownership of two stations in the same market (previously disallowed) as long as that left eight independent full-powered commercial stations in the market and neither of these two stations were ranked in the top four in the market in terms of viewing share.9 The industry underwent significant consolidation in the ten years following deregulation. Views on the effects of this consolidation differ dramatically. For 8Technically, their “Grade B contours” are now allowed to overlap. “Grade B” is a measurement of the strength of a television signal as it arrives at a viewer’s antenna. A station’s “Grade B contour” is the geographic area in which it is predicted that a consumer with an outdoor rooftop receiving antenna can pick up a signal of Grade B intensity. 9This lastrule isreferred toas a“duopolyrule,” but obviouslydoes not referto a duopoly as economists define one. 10

instance, the Common Cause Education Fund argues “that profit-driven media conglomerates are investing less in news and information, and that local news in particular is failing to provide viewers with the information they need in order to participate in the democracy.”10 On the other hand, the National Association of Broadcasters (NAB) maintains that “[t]he real threat today to locally-oriented services, including costly services such as local news, is not the joint ownership of broadcast stations..., but the stations’ continuing challenge to maintain their economic vibrancy in the face of multichannel and other competitors that are not constrained by restrictions on local ownership structure.”11 Broadcasting companies argue that ownership deregulation was necessary for the survival of broadcast television and that further deregulation is in order. Congress has explicitly allowed for this possibility; it directed the FCC to re-evaluate remaining regulations every two years to “determine whether regulation is no longer necessary in the public interest.”12 Yet the debate rarely addresses the question of how station owners are helped by consolidation; in fact, this is not well understood. This paper estimates the revenue versus cost effects of consolidation. It is generally thought that increased revenue following consolidation has negative welfareimplications, whilelowercostfollowingconsolidationhaspositivewelfare implications. However, those implications do not necessarily hold in this industry. Because remaining FCC regulation prevents individual firms from dominating local markets, increased revenue following consolidation is most likely 10“The Fallout from the Telecommunications Act of 1996: Unintended Consequences and Lessons Learned” by the Common Cause Education Fund, May 9, 2005. 11Comments of the National Association of Broadcasters Before the Federal Communications Commission, October 22, 2007, p.3. 12Section 11(2)(a) of the Telecommunications Act of 1996. This was later changed to every four years. 11

not evidence of market power. Bigger firms are able to offer advertisers more viewers per transaction. The advertiser’s outside option has not changed; the bigger firm is simply offering the advertiser a more valuable product. On the other hand, it is often argued that local television programming is a public good. Thus a reduction in the costs incurred in the production of local programming may reveal under-provision of a public good. For these reasons, I hesitate to use my results to draw strong conclusions about welfare. However, as discussed in Section 2 on related literature, studies of this industry have attempted to directly measure the reduced-form effects of consolidation on particular outcomes so as to draw welfare conclusions, and have been inconclusive. It is difficult to understand the mechanisms through which consolidation affects welfare without understanding why consolidation has taken place. This paper attempts to fill that void, by identifying the demand-side and supply-side factors that have driven consolidation in this industry. 4 Data I acquired a proprietary dataset from BIA Financial Network Inc (BIAfn), a telecommunications research firm. As far as I know, this is the first paper in economics to use this panel dataset on broadcast television stations. I observe each station’s market, owner, revenue, coverage, and network affiliation annually from 1996 through 2007. Revenue is adjusted to constant 2008 dollars. Out of 1,251 full-power commercial stations, 131 stations without revenue data are dropped from my analysis, which leaves 1,120 stations. Each station is uniquely identified by its call letters. Owners do not have a unique identifier, so I look carefully at owner names in order to standardize them and create a unique identifier for each. The dataset as provided by BIAfn is at the level of the 12

station. I construct a panel dataset at the level of the firm. There are a total of 544 firms that were active (owned at least one station) at some point between 1996 and 2007 and for which I have data.13 If a firm has exited or not yet entered, then that firm is coded as having zero stations. A look at the data confirms that the broadcast television industry underwent significant consolidation in the ten years following deregulation. In 1995, just before deregulation, most firms held only one station and only three firms held more than ten stations. Immediately following deregulation, stations began changing hands. Figure 1 shows the number of transactions (sales of stations) on a yearly basis before and after deregulation. Purchasing activity picked up dramatically in 1996 (the dashed line); it saw a decline following 2001, but has shown signs of picking up again in recent years. Firms that already held a relatively large number of stations tended to expand their portfolios. For instance, of the eight firms that owned ten or more stations in 1995, all but three ofthemownedmorethantwentystationsby2007. Ontheotherhand,individual owners tended to exit the industry. Nearly 70 percent of the firms that owned only one station in 1995 had sold their station by 2007; only one owned more than three stations by 2007. By 2007, 25 firms owned more than ten stations each, and the majority of stations were in the hands of these large owners. The greatest number of stations held by one firm in 2007 was 60. Figure 2 and Figure 3 show how the ownership structure of the industry changed over these years. The trend toward consolidation suggests that owning more stations puts firms at a competitive advantage. For each firm in each year, I construct six variables that describe the firm’s 13Some firms were dropped because their stations did not have enough revenue data to enablemetoidentifyfirmfixedeffectsintherevenueregression. Thefirmsthatweredropped are generally the smallest (most often individual owners). 13

portfolio of stations. These are variables that are likely to affect the attractiveness of the stations as a group to advertisers, as well as variables that are likely to affect the extent to which the firm can centralize the operations of its stations. Note that I am trying to capture how joint ownership of these stations affects their profitability. Therefore these variables describe the firm’s stations as a group, rather than characterizing the firm’s stations themselves. These variables are: the number of stations owned by the firm, the total population coverage of its stations, the average distance between its stations in miles, the demographic heterogeneity of its stations’ viewers, the concentration of its stations among network affiliations, and its dominance of networks. Because I have a panel of stations, I will be able to control for individual station heterogeneity. Demographic heterogeneity is measured as the average Euclidean distance between the demographic vectors of a firms’ stations. A demographic vector is defined as, among the station’s viewers: {percent of households that are Caucasian; percent of households that are African-American; percent of households that are Asian; percent of households that are Hispanic}.14 The network concentration variable reflects the degree to which the firm has concentrated its stations’ affiliations among just a few networks. It is the Herfindahl index as applied to the firm’s concentration among eight national networks,15 divided by the expected value of this index given the number of stations held by the firm.16 The network dominance variable reflects the degree 14For instance, if a firm has one station in Chicago, where the demographic vector is {64,18,5,13}, and Houston, where the demographic vector is {56,12,4,28}, then the demographic heterogeneity variables is [(64 − 56)2 + (18 − 12)2 + (5 − 4)2 + (13 − 28)2]2 1 = 18. 15These are the eight networks mentioned earlier: ABC, CBS, CW, Fox, NBC, Ion, Univision, Telemundo and Telefutura. 16For instance, if a firm owns five stations, each affiliated with a different network, the un- 14

to which the firm dominates a particular network’s stations. I look at each network with which the firm has at least one station affiliation, and calculate the percent of the network’s stations that this firm owns. I then take the highest value among all networks to which the firm has affiliations.17 5 Model I estimate a model in which a firm chooses each period how to adjust the characteristics of its portfolio of stations. These characteristics are as listed in the data section: number of stations, population coverage, average distance between stations, demographic heterogeneity among viewers, network concentration and network dominance. The firm’s static per-station revenue and per-station cost are functions of its own portfolio characteristics as well as those of its competitors. I assume that a firm’s payoffs depend on its competitors’ portfolio characteristics only through the average values of these characteristics among all of its direct competitors. A direct competitor is defined as a firm that owns a station in one of i’s markets. A firm incurs sunk costs when it adjusts its portfolio, so the effects of this period’s adjustments persist through all future periods. Therefore firms choose their portfolio each period so as to maximize the present discounted value of their expected stream of profits as a function of their own normalized index is (1)2+(1)2+(1)2+(1)2+(1)2 = 0.2. The expected value of this index 5 5 5 5 5 for an owner of five stations if the owner were to pick its five stations randomly is 0.252. So the final value reflecting this owner’s network concentration is 0.2/0.252=0.79. 17For instance, suppose that a firm owns thirty stations, five of which are affiliated with NBC and twenty-five of which are affiliated with ABC. NBC is affiliated with a total of 191 stations, and ABC is affiliated with a total of 189 stations. So this firm owns 2.6% (5/191) of NBC stations, and 13.2% of ABC stations (25/189). I choose the highest value, which is 13.2%. 15

current portfolio characteristics and their competitors’ current average portfolio characteristics. This framework takes into account the strategic and dynamic elements of the firm’s decision. 5.1 Revenue and Cost Let firms be indexed by i = 1,...,I and years be indexed by t = 1,...,T. A firm’s per-station revenue is a reduced-form function of its own and its competitors’ portfolio characteristics: R = r +r s +r s +µr +ν (1) it 0 1 it 2 −it i it where s is the vector of i’s own portfolio characteristics in period t, s is it −it the vector of the average portfolio characteristics among i’s direct competitors in period t, µr is a firm-specific revenue constant (a firm fixed-effect) and ν i it is a demand shock distributed N(0,σr). The parameters to be estimated are i r={r ,r ,r }, µr and σr. 0 1 2 i i A firm’s per-station cost is a reduced-form function of its own portfolio characteristics, subject to adjustment costs: ∆ n2 ∆ n2 C = c +c s +c + it +c − it +µc +ω (2) it 0 1 it 2 n 3 n i it it it where s and s are as in the revenue equation, n is the firm’s number of it −it it stations, ∆ n2 is a squared positive adjustment in i’s number of stations since + it last period, ∆ n2 is a squared negative adjustment in i’s number of stations − it since last period, µc is a firm-specific cost constant and ω is a cost shock i it distributed N(0,σc). Adjustments are likely to have a linear effect on a firm’s total cost, rather than its per-station cost; for this reason, adjustments are divided by n in the per-station cost equation. The parameters to be estimated it are c={c ,c ,c ,c } and µc. 0 1 2 3 i 16

The firm’s static profit is: π = n (R −C ) it it it it 5.2 Strategies and Equilibrium Let S be a vector consisting of i’s own portfolio characteristics and the averages it of its competitors’ portfolio characteristics: S = {s ,s } it it −it Since a firm’s static profit depends on its competitors’ portfolios only through the average characteristics of its direct competitors s , firm i’s state in period −it t can be fully described by the state vector S . A firm’s strategy maps from it its state vector this period to the vector of its own portfolio characteristics next period: σ : S → s i it it+1 The firm chooses σ to maximize the present discounted value of its expected i stream of profits given the strategies of its competitors. This value can be written: (cid:32) (cid:33) ∞ (cid:88) V(S |σ ,σ ) = E βτπ |S ,σ ,σ (3) it i −i it+τ it i −i τ=0 The equilibrium concept is Markov-perfect Nash equilibrium; a firm’s strategydependsonlyonitscurrentstatevector. Theequilibriumcanthusbedefined as the strategy σ∗ that satisfies: V(S|σ∗,σ∗ ) ≥ V(S|σ(cid:48),σ∗ ) ∀ σ(cid:48) ∀ s (4) i −i i −i i That is, σ∗(s) is the strategy that maximizes the value of state s given σ∗ . i −i 17

6 Estimation 6.1 Empirical Strategy The goal of the paper is to estimate the revenue and cost parameter vectors r and c. Because revenue is observed, r can be estimated by regressing perstation revenue on state variables. Costs are not observed, so I estimate the dynamic game in order to recover the cost parameter vector c. I use a two-step method proposed by Bajari, Benkard & Levin (2007) that avoids computing the equilibriumevenonceandthatallowsforcontinuouschoicevariables. Inthefirst stage, I estimate the strategies governing firms’ adjustments of their portfolio characteristicseachperiod. Inthesecondstage,valuefunctionsareestimatedfor the observed strategies and for randomly drawn alternative strategies; then the costparametersarefoundthatminimizethesumofsquaredprofitabledeviations from the observed strategies. These are the cost parameters that “rationalize” the observed strategies given the estimated revenue parameters. 6.2 Revenue Estimation The first step is to estimate the revenue equation (1).18 The equation is at the level of the firm-year. The dependent variable is the average revenue generated by firm i’s stations in period t. The regressors are the characteristics of i’s own portfolio s and the average characteristics among the portfolios of i’s direct it competitors s . I also include interaction terms between i’s number of stations −it 18Because the results of the revenue equation are inputs in the dynamic model used to estimate the cost equation, I am restricted in the specification of the revenue equation. Most notably, I must use a linear specification, and must keep the number of regressors within reason. This will be discussed in more detail in the section describing the estimation of the cost equation. 18

n and each of the other characteristics of i’s portfolio (except total coverage). it I have a panel of firms, so firm fixed effects are included. The firm-specific standard error of the demand shock σr is estimated as the standard error of the i regression residual across all of firm i’s observations. One complication is that I am eliminating station identity from the model, yet stations are heterogeneous. Failing to take into account station heterogeneity might lead to biased coefficients on portfolio characteristics in the revenue equation, which would in turn lead to biased cost coefficients in the second stage of estimation. This would happen if the revenue potential of a station is correlated with the characteristics of its owner’s portfolio which are included in estimation.19 In order to deal with station heterogeneity, I include as a regressor in the revenue equation the average revenue generated by firm i’s period-t stations across all years and owners. This can be thought of as the average revenue potential of i’s period-t stations, regardless of ownership. I also include this value in the cost equation, to allow for the (likely) possibility that stations that generate more revenue also cost more to operate. Inclusion of the average revenue potential of firm i’s stations is roughly equivalent to including station fixed effects. I also include firm and year fixed effects. Therefore I am effectively looking at variation within the firm-stationyear. Given this, the most important identification assumption is that there is no trend in a station’s revenue which induces a firm with particular portfolio 19For example, suppose that owning more stations increases revenue with no effect on cost. Butalsosupposethatfirmswithmoreofstationstendtoownsmallerstations,whichgenerate less revenue and incur less cost. Then the coefficient on n in the revenue equation will be it biased downward. When looking at firm strategies to infer costs, I will be searching for the costcoefficientthatexplainsthefirm’stendencytobuyupstationsdespitethe(spurious)fact that this does not improve revenue. The cost coefficient on n will be biased downward as it well; it will look like owning more stations lowers costs more than it actually does. 19

characteristics to purchase it (no reverse causality). Since the timing of station purchases was largely the result of an exogenous change in regulation, this is likely to be a reasonable assumption. 6.3 Cost Estimation Estimation of the cost equation (2) is considerably more complicated. Costs are not observed, so I look at the strategies employed by firms - that is, their portfolio adjustments each period - in order to recover costs. If firms are acting optimally, then the ex ante value of a state for a firm if it follows the observed strategy must be greater than it would be if it followed any other strategy. The goal is to find the cost parameter vector c that makes this true, given the estimated revenue parameter vector r and competitor strategies σ . Therefore, −i in order to recover cost estimates, I must calculate the ex ante value of being at each observed state as a function of σ , σ , r and c. The estimation of cost i −i coefficients proceeds as follows: 1) Estimation of strategies 2) Estimation of the ex ante value of each observed state for the observed strategies and for alternative strategies, as a function of c 3) The search for c that makes the ex ante values of states greater for the observed strategies than for the alternative strategies. 6.3.1 Estimation of Strategies The functions governing the evolution of i’s own portfolio characteristics constitute i’s strategy, σ : S → s . The functions governing the evolution i it it+1 of the average portfolio characteristics among i’s competitors s will act as a −it summary of the strategies of i’s competitors, σ . That is, the strategy of com- −i 20

petitors is defined as σ : S → s . The definition of competitors’ strategies −i it −it+1 σ is important when I consider alternative strategies for firm i and calculate −i the ex ante value of each state under these alternatives, keeping competitors’ strategies σ fixed. By keeping fixed the evolution of the average competitors’ −i portfolio characteristics as a function of firm i’s own portfolio characteristics, I am essentially keeping fixed firm i’s competitors’ strategies. I use this technique – rather than simulating what each of i’s competitors do and then calculating the average portfolio characteristics among the competitors – for computational reasons. Let the variables in the state vector be indexed by j. I want to allow next period’s state variable S to be a function of all of the variables in the state jit+1 vector in this period S . I specify the evolution of S as follows: it jit S = α t+α S +η +(cid:15) (5) jit+1 j,t j,s it j,i j,it Recall that the variables in the state vector S include i’s own portfolio characit teristicss aswellastheaveragecharacteristicsamongi’scompetitors’portfolios it s . Therefore the variable S on the left-hand side of (5) can be an element −it jit+1 of s or an element of s . The α’s in the equations for which an element of it −it s is the dependent variable together compose σ , while the α’s in the equations it i for which an element of s is the dependent variable together compose σ . −it −i In (5), each state variable is subject to a time trend and a firm-specific constant. Estimation of (5) is complicated; I have a dynamic panel data model with endogenous explanatory variables, lagged dependent variables and unobserved firm heterogeneity. In the presence of lagged dependent variables and endogenous regressors, η cannot be treated as a constant to be estimated. Firsti differencing removes η , but the endogeneity problem requires instruments. I use i the estimator developed by Arellano & Bond (1991), which uses higher-order 21

lagged differences of the regressors as instruments. This often suffers from a weak instrument problem. Fortunately the F-statistic is greater than 10 in all of the first-stage regressions but one, when it is just under 10. A bigger problem is that the validity of the instruments requires the assumption of no serial correlation, which is strong in this case. The following system of equations is estimated using the Arellano-Bond GMM estimator: {∆S = α +α ∆S +∆(cid:15) } (6) jit+1 j,t j it j,it j=1,...,J Equation (6) is estimated only for firms that are already active in the industry (n > 0). For inactive firms, data are highly skewed, so I estimate it−1 Pr(S |n = 0) non-parametrically. Furthermore, non-parametric estimation it+1 it is done separately for firms that have not yet entered and firms that have exited because their behavior is quite different (once a firm exits, it rarely re-enters). 6.3.2 Value Function Estimation The value function can be written: (cid:32) (cid:33) ∞ (cid:88) E βτ[n (R −C )]|S ,σ ,σ (7) it+τ it+τ it+τ it i −i τ=0 where the expectation is over shocks to demand (ν), cost (ω) and strategy implementation ((cid:15)). State variables are continuous, so I use a method proposed by Bajari, Benkard & Levin (2007) to estimate the value function by simulation. For a given state S , a path of play can be simulated by using σˆ and a set of it shocks drawn from the estimated distribution of ∆(cid:15) . I simulate the evolution i of the state vector well into the future (100-200 periods), until the discount factor will render sufficiently small the present value of any payoffs generated. Given a set of parameters r and c and draws of shocks ν and ω, I can calculate 22

the present value of payoffs associated with this path of play. Repeating this procedurenumerous(1000-2000)timesandaveragingthepresentvalueofpayoffs over all of these paths gives me an estimated ex ante value associated with this state S . it 6.3.3 Search for Cost Estimates The final step is to search over possible cost parameter vectors to find c∗ which minimizes the profitable deviations generated by alternative strategies, and therefore makes σˆ optimal given σˆ and rˆ. The simulation of numerous paths i −i of play for each state that is observed in the data (a total of 12 years * 544 owners = 6528 states) is the computationally expensive part of the estimation process. I impose a few restrictions in order to make estimation feasible. First,Iimposetherestrictionthatthefirm-specificcostconstantµc isalinear i function of the firm-specific revenue constant µr. In other words, I include µr i i as a variable in the cost equation and estimate the cost coefficient on it. This is much less computationally expensive than freely estimating µc, which would i requiretheinclusionof543firmdummiesinsecond-stageestimation. Effectively I am allowing for the possibility that firms that generate systematically higher revenue incur either systematically higher or systematically lower costs. This allows for two scenarios. In one scenario, some firms are simply “better,” both generatinghigherrevenueandincurringlowercosts; thiswouldleadtoanegative relationship between µˆr and µc. In another scenario, the firms that invest more i i in programming tend to draw in more advertising revenue, whereas firms that cut programming costs tend to take a hit to advertising revenue; this would lead to a positive relationship between µˆr and µc. The cost coefficient on µˆr tells us i i i which scenario is a better description of reality. Another restriction is that I must write the ex ante value of a state as a 23

linear function of the cost parameter vector c: V(S |r,c,σ ,σ ) = W (S |r,σ ,σ )−cW (S |σ ,σ ) (8) it i −i 1 it i −i 2 it i −i where: (cid:32) (cid:33) ∞ (cid:88) W (S |r,σ ,σ ) = E βτn (r +rS +µr +ν )|S ,σ ,σ 1 it i −i it+τ 0 it+τ i it+τ it i −i τ=0 (cid:32) (cid:33) ∞ (cid:88) W (S |σ ,σ ) = E βτn {s ,µr}|S ,σ ,σ 2 it i −i it+τ it+τ i it i −i τ=0 W is the present discounted value of the expected stream of revenue and W 1 2 is a vector of the present discounted values of the expected streams of state variables and revenue-constant quantities.20 In other words, the difference in the ex ante values of a state under two different strategies is a linear function of thedifferencesofthesumofthestatevariablesunderthetwodifferentstrategies. This means that when I search over various trial values of c, I do not need to re-simulate the ex ante values of states every time. Otherwise the search for c∗ would be computationally infeasible. This imposes the restriction that c must enter linearly into the firm’s static payoffs. In order to achieve this, I am restricted in the specification of the revenue and cost equations. Most notably, I cannot allow for log-linear relationships between the state variables and perstation revenue and cost.21 ˆ ˆ For each state observed in the data, I estimate W and W under various 1 2 perturbations of the observed strategies (that is, functions governing the evolution of s ), keeping competitors’ strategies (that is, functions governing the it evolution of s ) and the revenue coefficient vector r fixed. I then find the cost −it 20Also included in the vector W is a constant term so that c can be estimated. 2 0 21Piecewise linear functions can be used to capture nonlinearities; I am currently working on this alternative specification. 24

coefficient vector c∗ that must exist in order for the observed strategies to be superior to the perturbations. The equilibrium condition (4) can be written: W (S |r,σ ,σ )−cW (S |σ ,σ ) 1 it i −i 2 it i −i ≥ W (S |r,σ(cid:48),σ )−cW (S |σ(cid:48),σ ) ∀ σ(cid:48) ∀ S (9) 1 it i −i 2 it i −i i it Letthealternativefirmstrategiesthatareconsideredbeindexedbyx = 1,...,X. Then let g(S ,x,c) denote the present discounted value of the stream of addiit tional profits that can be expected to result from a deviation to alternative strategy σ(cid:48) (indexed by x) when starting at state S and given cost coefficient i it vector c: g(S ,x,c) = W (S |r,σ(cid:48),σ )−W (S |r,σ ,σ ) it 1 it i −i 1 it i −i −c[W (S |σ(cid:48),σ )−W (S |σ ,σ )] (10) 2 it i −i 2 it i −i I find c∗ that minimizes the following objective function: X I T 1 (cid:88)(cid:88)(cid:88) Q (c) = 1(g(s ,x,c) > 0)(g(s ,x,c))2 (11) X it it X x=1 i=1 t=1 That is, I find c∗ that minimizes the sum of squared violations of the equilibrium condition. Because the industry remains regulated, firms are not free to choose any portfolio that they want. Therefore I restrict the behavior of firms in simulated paths of play. I do not allow firms to acquire more stations than would realistically be allowed under current regulations, or to choose a smaller average distance between stations than would be allowed. Also, I consider very small perturbations of the observed strategies as alternative strategies; these are alternatives that are presumably available to firms. A strategy perturbation consists of an increase or decrease for each portfolio characteristic in each period of 0.5%, 25

1%, 1.5% or 2%, chosen randomly. For a given state, alternative strategy and portfolio characteristic, the same deviation is made in each successive period. Parameter estimates are not very sensitive to the number of alternative strategies considered. More alternative strategies allows for more precise estimates but this comes at high computational cost. In practice I use X=30. 7 Empirical Results Revenue is observed, so the interpretation of revenue estimates is fairly straightforward. The interpretation of cost estimates is more complicated. Cost is essentially a residual, serving to explain the behavior of firms that is left unexplained by revenue. For this reason, it is unreasonable to interpret cost simply as strictly the cash that flows out of a firm in a given year; it may include the cost of raising capital, the cost of being exposed to risk, and so on. This should be kept in mind when interpreting my results. The equilibrium condition used to recover cost assumes that the present discounted value of the expected stream of revenue from a station minus the present discounted value of the expected stream of cost of a station must be greater than zero in order for a firm to choose to own (rather than sell) the station. In reality, this value must be greater than the market value of the station, because this is what the firm must pay to own the station (or what the firm could earn by selling the station). The annual per-station cost recovered in my estimation therefore includes the annual opportunity cost of holding (not selling) the station at market value. Suppose that the market for broadcast television stations operates perfectly, with efficient credit markets and all agents having full information. Then we would expect the market value of a station to be approximately equal to the 26

present discounted value of the station’s expected stream of profits. Therefore the annual opportunity cost of holding the station would be approximately equal to the annual profit generated by the station. In this case, recovered annual perstation cost would be close to annual per-station revenue.22 Table 1 presents summary statistics for annual per-station revenue and recovered annual perstation cost for portfolios that are observed in the data over the past ten years. The distributions of revenue and cost are highly skewed. For this reason, direct comparisons of the mean and median can be hard to interpret. However, the rough equivalence of the maximum per-station revenue and maximum recovered per-station cost shown in Table 1 is consistent with cost estimates that are relatively clean and that include opportunity cost. Itis worth thinking abouthowinclusionofopportunity cost inrecovered cost may affect the estimated coefficients of interest, that is, the effects of a firm’s portfolio characteristics on its per-station cost. As long as this opportunity cost is exogenous to the variables of interest, the estimated coefficients should not be biased. Theopportunitycostofholdingastationshouldreflectthehighestvalue that any other firm would pay for the station. This value should not be directly affected by the portfolio characteristics of the firm in question. Therefore the coefficients of interest should not be affected. A final point that should be kept in mind when interpreting my results is that this industry remains regulated. Therefore firms are not completely free to adjust the characteristics of their portfolios of stations. Therefore, the 22Consider a simple numerical example. A station generates $10 million in annual revenue, and costs $7 million to operate annually; therefore the station’s annual profit is $3 million. If themarketforthisstationoperatesperfectly,withfullinformation,thentheopportunitycost of holding the station would be approximately equal to $3 million because this is how much other firms would be willing to pay annually in order to own the station. Therefore the total annual cost of the station is $7 million + $3 million = $10 million. 27

estimatedrevenueandcosteffectsshouldbeinterpretedasthosethatexistunder current regulations. They do not tell us what the revenue and cost advantages of consolidation would be under an entirely different set of regulations.23 The estimated revenue coefficients rˆand cost coefficients cˆare shown in Table 2. Note that the adjustment coefficients are effects on the firm’s total (not per-station) cost, and that these are one-time effects that are assumed not to persist through time. All of the other estimates are the effects of portfolio characteristics on annual per-station revenue or cost. Preliminary standard errors for cost estimates are calculated using the method described in Pakes, Porter, Ho & Ishii (2006); this involves estimating the covariance matrix for the value function, taking draws from the sample distribution of the value function, and minimizingtheobjectivefunctiontofindcostestimatesforeachofthesedraws.24 Because the units of the portfolio characteristics vary and are not necessarily intuitive, the results are easier to interpret by looking at Table 3, which shows the dollar effects of standard deviation changes in portfolio characteristics on a firm’s average per-station revenue and cost, and on a firm’s per-station profit (revenue less cost). A standard deviation increase of 11.5 in Total Coverage in Millions (of persons) causes a $2.17 million increase in per-station revenue. That is, increasing the total population coverage of a firm’s stations enables the firm to boost its 23For instance, the revenue and cost coefficients on Number of Stations, Average Distance Between Stations andNumber of Stations * Average Distance Between Stations donottellus what the revenue and cost advantages of these characteristics would be if firms were allowed to, say, own all 14 stations in Boston - because firms cannot do this. 24Ultimately, the second-stage standard errors will be calculated by taking sub-samples of the data and re-estimating both the first and second stage, so that second stage standard errors will take into account first-stage standard errors; therefore the final standard errors are likely to be bigger than the standard errors shown here. 28

per-station revenue, most likely because advertisers pay more to reach more viewers per transaction. Given that median per-station revenue is $6 million, this effect is large in magnitude. A standard deviation change of 5 in Number of Stations owned is associated with a $1.85 million reduction in per-station cost. That is, acquiring more stations gives firms a cost advantage; firms are probably able to combine operations of multiple stations and cut costs. This effect is also quite large in magnitude. While increasing total coverage boosts per-station revenue, it also increases per-station cost (for a given number of stations owned). A standard deviation increase of 11.5 in Total Coverage in Millions is associated with a $713,000 increase in per-station cost. However, the net effect of total coverage on perstation profit is positive and large in magnitude. A standard deviation increase of 11.5 million is associated with a $1.46 million increase in per-station profit. Similarly, while acquiring more stations reduces per-station cost, it also reduces per-station revenue (for a given level of population coverage). A standard deviation increase in Number of Stations is associated with a $1.04 million fall in per-station revenue. It is possible that as firms acquire more stations and combine operations of their stations, the quality of programming declines, which may have a negative effect on advertising revenue. However, the net effect of owning more stations is positive. A standard deviation increase of 5 stations is associated with a $815,000 increase in per-station profit. An increase in network concentration has opposing demand-side and supplyside effects. A standard deviation increase of 1.03 in Network Concentration reduces per-station cost by $772,500.25 An increase in network concentration 25An example of a one-unit increase in the network concentration variable is the following: A firm with 17 stations goes from 1 ABC station, 2 CBS stations, 5 CW stations and 3 Fox stations to 1 ABC station, 10 CW stations and 10 Fox stations. 29

achieved while acquiring additional stations further reduces per-station cost; a standarddeviationincreaseof20inNetwork Concentration * Number of Stations reduces per-station cost by $340,000. This makes sense, because one important aspect of the management of a broadcast television station is negotiation and coordination with the national network to which the station is affiliated. Therefore operating multiple stations is probably less costly if most of the stations are affiliated with one or two networks. Yet at the same time, increased network concentration among a firm’s stations hurts the revenue potential of the stations. A standard deviation increase of 20 in Network Concentration * Number of Stations lowers per-station revenue by $720,000. This is likely because an advertiser has the option of advertising through a national network rather than through the owner of network-affiliated stations. Therefore a firm’s portfolio of stations is probably less attractive to an advertiser if it is essentially just a subset of the stations that are affiliated with one particular national network. I find that the operation of multiple stations is slightly costlier if the stations’ viewers are more demographically heterogeneous. A standard deviation increase of 126 in Average Demographic Heterogeneity * Number of Stations increases per-station cost by $143,640.26 I do not find any statistically significant effect of demographic heterogeneity on per-station revenue. However, existing regulation places a lower bound on this variable for firms. I find that the average distance between a firm’s stations has little effect on its per-station revenue. Again, though, existing regulation places a lower bound on this variable for firms. On the cost side, the evidence is mixed. A standard 26Recall that the demographic vector is: {% Caucasian, % African-American, % Asian, % Hispanic}. To give a sense of the units of this variable: If a firm transitions from owning twostationswiththevectors{25,25,25,25}and{15,15,31,38}toowningtwostationswiththe vectors{25,25,25,25}and{12,13,37,38},thisresultsinafive-unitincreaseinthedemographic heterogeneity variable. 30

deviation decrease of 5221 in Average Distance * Number of Stations causes a $417,680 decrease in per-station cost. A simultaneous increase in number of stations and fall in average distance between stations could be achieved by acquiring an additional station that is between two stations that a firm already owns. It makes sense that this would provide cost advantages. On the other hand, a standard deviation reduction in Average Distance between Stations of 430 miles, for a given number of stations owned, actually raises per-station cost by $3 million. Firms may find that there are advantages to diversification, and spreading stations across the country may be one way of diversifying (again, given the effective lower bound on distance between stations). For instance, Sinclair Broadcast Group declares on its website that it is “one of the largest and most diversified television broadcasting companies in the country today.”27 Thismaybeattractivetoinvestors; whilefewbroadcastingcompaniesarepublic, some such as Sinclair are and many may aspire to be. I find that strategic effects are important in this industry; four of the six strategic variables included in estimation have large and statistically significant effects on a firm’s per-station revenue. A standard deviation increase of 7.2 in Average Number among Competitors causes per-station revenue to fall by almost $1 million. This makes sense, given the finding that a firm faces lower per-station cost if it owns more stations. If a firm faces competitors with more stations, then the firm faces competitors with lower costs. Its competitors can therefore offer lower advertising prices, and draws advertisers away. A firm’s per-station revenue is also lower if its competitors’ stations are close together on average; a standard deviation increase of 381 in Average Distance among Competitors has a positive effect of $647,700. However, a standard deviation increase of 15.6 in Average Coverage among Competitors causes an increase in 27www.sbgi.net/index.shtml 31

per-station revenue of $1.9 million. This means that, given the size of a firms’ competitors in terms of the number of stations they own, an increase in their coverage lessens their competitive effect. A standard deviation increase of 5.7 in Average Network Domination among Competitors hasanegativeeffectofalmost $1 million on per-station revenue. The cost coefficient on the average revenue potential of a firm’s stations is 0.65. The positive sign means that stations that generate more revenue also cost more to operate. The cost coefficient on the firm’s revenue fixed-effect coefficient is 0.6. The positive sign means that firms that generate higher perstation revenue also incur higher per-station cost. It appears that firms that invest more in programming (increasing costs) tend to reap higher revenue in return, while firms that cut costs tend to accept revenue losses as a result. The intuition behind the identification of this parameter is as follows: firms that earn more revenue per station are not systematically seeking out large portfolios as much as we would expect given the higher per-station revenue that they generate; this must be because they also incur higher per-station costs. While consolidation has advantages, the process of purchasing and selling stations incurs adjustment costs. I find that purchasing a station involves a one-time cost of $1.3 million, while selling a station involves a one-time cost of $1.5 million (but the latter estimate is not statistically significant). Note that the adjustment cost is not capturing the market price of the station, which is captured in the constant term in the per-station cost equation. I conduct a crude check of my cost numbers by calculating what they imply for the average opportunity cost of holding a station, and comparing this to typical transaction prices for stations. Based on anecdotal evidence, a typical station generates profit margins of 20%, so I roughly estimate average annual per-station operating cost as revenue divided by 1.2. The implied annual oppor- 32

tunity cost of holding a station is then calculated as the total per-station cost estimated by my model minus this rough estimate of per-station operating cost. The average implied annual opportunity cost of holding a station is $18 million. At an interest rate of 6%, this is consistent with an average station market value of $307 million. Given the highly skewed distribution of revenue and cost in this industry, this is likely to be much greater than the median market value. I do not have data on station sales prices, but anecdotal evidence suggests that this is reasonable. 8 Conclusion This paper estimates the demand-side and supply-side advantages of consolidation in the broadcast television industry. The consolidation has provoked considerable controversy. However, the reasons it occurred are not well understood, because firms mainly purchase stations that are not in direct competition with one another and that are located many miles apart. I exploit an exogenous change in regulation that led to significant consolidation in the industry. Although revenue and ownership data are available for broadcast television stations, cost data are not available. I recover costs by examining patterns in ownership changes that are left unexplained by revenue opportunities. Because the firm’s purchasing decision involves dynamic and strategic elements, I model the decision as a dynamic game. The firm adjusts its portfolio characteristics each period so as to maximize the present discounted value of its expected stream of profits. My approach allows me to isolate the relationships of interest in an industry that each year witnesses hundreds of firms considering the purchase or sale of over one thousand stations. As far as I know, this paper is the first to estimate a model of merger activity in a dynamic, 33

strategic framework. Myresultssuggestthatconsolidationinthisindustryoffersbothrevenueand cost advantages, but that they operate through different mechanisms. Firms are able to generate more revenue per station if the total population coverage of their stations is greater. Advertisers are probably willing to pay more to reach more viewers per contract. On the other hand, the cost advantages of consolidation come not through wider population coverage, but through simply owning a large number of stations. The cost advantages of owning multiple stations are especially great if the firm’s stations are mostly affiliated with just one or two networks and if their viewers are more demographically homogeneous. However, a lack of diversification in network affiliation hurts the revenue potential of the firm’s stations. I find that strategic effects are important in this industry. A firm earns less revenue per station if its competitors own more stations on average and if its competitors’ stations are located close together on average. An interesting next step would be to use the estimated coefficients to solve the dynamic oligopoly model and conduct counterfactuals. I could determine the extent to which consolidation patterns have been driven by cost advantages, revenue advantages or strategic considerations, by simulating firms’ purchasing patterns under alternative values for the revenue and cost coefficients. 34

References Akkus, O. & Hortacsu, A. (2007). The determinants of bank mergers: A revealed preference analysis. Arellano, M. & Bond, S. (1991). Some tests for specification for panel data: Monte carlo evidence and an application to employment equations. The Review of Economic Studies, 58, 277–297. Bajari, P., Benkard, L., & Levin, J. (2007). Estimating dynamic models of imperfect competition. Econometrica, 75, 1331–1370. Crawford, G. (2007). Television station ownership structure and the quantity and qualityoftvprogramming. FederalCommunicationsCommissionMediaOwnership Study No. 3. Fox, J. (2007). Estimating matching games with transfers. Hammond, A., O’Connor, B., & Westin, T. (2007). The Impact of the FCC’s TV DuopolyRuleRelaxationonMinorityandWomenOwnedBroadcastStations1999- 2006. Federal Communications Commission Media Ownership Study No. 8. Ho, K. (2007). Insurer-provider networks in the medical care market. NBER Working Paper. Hopenhayn, H. (1992). Entry, exit and firm dynamics in long run equilibrium. Econometrica, 60, 1127–1150. Jovanovic, B. (1982). Selection and the evolution of industry. Econometrica, 50, 649–670. Kim, E. H. & Singal, V. (1993). Mergers and market power: Evidence from the airline industry. The American Economic Review, 83(3), 549–569. Klepper, S. (2002). Firm survival and the evolution of oligopoly. The RAND Journal of Economics, 33, 37–61. Klepper, S.&Grady, E.(1990). Theevolutionofnewindustriesandthedeterminants of market structure. The RAND Journal of Economics, 21, 27–44. Klepper, S. & Simons, K. (2000). The making of an oligopoly: Firm survival and technological change in the evolution of the u.s. tire industry. The Journal of Political Economy, 108, 728–760. 35

Knapp, W. (1990). Event analysis of air carrier mergers and acquisitions. The Review of Economics and Statistics, 72(4), 703–707. Pakes, A., Porter, J., Ho, K., & Ishii, J. (2006). Moment inequalities and their application. Centre For Microdata Methods and Practices Working Paper. Park, M.(2008). Anempiricaltwo-sidedmatchingmodelofacquisitions: Understanding merger incentives and outcomes in the mutual fund industry. Paul, C. (2003). Cost economies: A driving force for consolidation and concentration? Southern Economic Journal, 70, 110–127. Pesendorfer, M.(2003). Horizontalmergersinthepaperindustry. The RAND Journal of Economics, 34(3), 495–515. Petrakis, E. & Roy, S. (1999). Cost-reducing investment, competition and industry dynamics. International Economic Review, 40, 381–401. Shaked, A. & Sutton, J. (1983). Natural oligopolies. Econometrica, 51, 1469–1483. Shaked, A. & Sutton, J. (1987). Product differentiation and industrial structure. The Journal of Industrial Economics, 36, 131–146. Shaked, A. & Sutton, J. (1990). Multi-product firms and market structure. The RAND Journal of Economics, 21, 45–62. Shiman, D. (2007). The impact of ownership structure on television stations’ news andpublicaffairsprogramming. FederalCommunicationsCommissionMediaOwnership Study No. 4: News Operations. Sorenson, M. (2007). How smart is smart money? a two-sided matching model of venture capital. Journal of Finance, 62, 2725–2762. 36

Figure 1: Transactions Per Year snoitcasnarT fo rebmuN 002 051 001 05 1992 1994 1996 1998 2000 2002 2004 2006 Year fi Figure 2: Distribution of Number of Stations Held Among Firms in 1995 smriF gnomA ycneuqerF 052 002 051 001 05 0 0 5 10 15 20 25 30 35 40 45 50 55 60 Number of Stations Owned fi 37

Figure 3: Distribution of Number of Stations Held Among Firms in 2007 smriF gnomA ycneuqerF 052 002 051 001 05 0 0 5 10 15 20 25 30 35 40 45 50 55 60 Number of Stations Owned fi Table 1: Descriptive Statistics Descriptive Annual Recovered Annual Statistic Per‐Station Revenue† Per‐Station Cost† Mean $12.7 million $15.0 million Standard Deviation $20.1 million $21.1 million Median $6.0 million $17.2 million Minimum $0.06 million $0.01 million Maximum $169 million $151 million Results are preliminary. † Revenue and Cost are in 2008 dollars. 38

Table 2: Revenue and Cost Parameters Portfolio Effect on Effect on Characteristic Per‐Station Revenue† Per‐Station Cost† Number of Stations ‐207* ‐370* (79) (119) Squared Positive Adjustment 1350* (437) Squared Negative Adjustment 1500 (2712) Total Coverage in Millions 189* 62* (44) (11) Avg Distance between Stations 0.57 ‐7* (0.37) (2) Avg Distance * Number of Stations ‐0.04 0.08* (0.08) (0.036) Avg Demographic Heterogeneity ‐17 50 (14) (68) Avg Demographic Heterogeneity * Number of Stations 6.7 1.14* (3.7) (0.41) Network Concentration 315 ‐750* (198) (242) Network Concentration * Number of Stations ‐36* ‐17* (10) (4.5) Network Domination ‐23 ‐54 (39) (31) Network Domination * Number of Stations 1.5 1.18 (1.2) (0.57) Average Revenue Potential 1.01* 0.65* (0.02) (0.055) Average Revenue Potential * Number of Stations ‐0.0057* 0.0037* (0.0019) (0.00018) Avg Number among Competitors ‐137* (25) Avg Coverage in Millions among Competitors 120* (20) Avg Distance among Competitors 1.7* (0.6) Average Demog Heterogeneity among Competitors ‐20 (20) Average Network Concentration among Competitors 198 (224) Average Network Domination among Competitors ‐162* (27) Firm Dummy Coefficient in Revenue Regression 1 0.6* (0.3) † Except for adjustment coefficients, which are effects on a firm's total cost. Revenue and Cost are in thousands of 2008 dollars. * Statistically significant at the 5% level. 39

Table 3: Revenue and Cost Effects Portfolio Standard Effect on Effect on Net Effect on Characteristic Deviation Per‐Station Revenue Per‐Station Cost† Per‐Station Profit† Number of Stations 4.9 ‐$1,035,000* ‐$1,850,000* +$815,000 Squared Positive Adjustment 0.75 +$1,012,000* ‐$1,012,000 Squared Negative Adjustment 1.81 +$2,700,000* ‐$2,700,000 Total Coverage in Millions 11.5 +$2,173,500* +$713,000* +$1,460,500 Avg Distance between Stations 430 +$245,100 ‐$3,010,000* +$3,255,100 Avg Distance * Number of Stations 5221 ‐$208,840 +$417,680* ‐$626,520 Avg Demographic Heterogeneity 12.1 ‐$205,700 +$605,000 ‐$810,700 Avg Demographic Heterogeneity * Number of Stations 126 +$844,000 +$143,640* +$700,360 Network Concentration 1.03 +$324,450 ‐$772,500* +$1,096,950 Network Concentration * Number of Stations 20 ‐$720,000* ‐$340,000* ‐$380,000 Network Domination 5.1 ‐$117,300 ‐$275,400 +$158,100 Network Domination * Number of Stations 202 +$303,000 +$238,360 +$64,640 Average Revenue Potential 15,900 +$16,160,000* +$10,335,000* +$5,825,000 Average Revenue Potential * Number of Stations 197 ‐$1,123* +$729* ‐$1,852 Avg Number among Competitors 7.2 ‐$986,400* ‐$986,400 Avg Total Coverage in Millions among Competitors 15.6 +$1,872,000* +$1,872,000 Avg Distance among Competitors 381 +$647,700* +$647,700 Average Demog Heterogeneity among Competitors 10.4 ‐$208,000 ‐$208,000 Average Network Concentration among Competitors 1.05 +$207,900 +$207,900 Average Network Domination among Competitors 5.66 ‐$916,920* ‐$916,920 † Except for adjustment coefficients, which are effects on a firm's total cost and profit. * Statistically significant at the 5% level. 40

Cite this document

APA

Jessica C. Stahl (2009). A Dynamic Analysis of Consolidation in the Broadcast Television Industry (FEDS 2009-48). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2009-48

BibTeX

@techreport{wtfs_feds_2009_48,
  author = {Jessica C. Stahl},
  title = {A Dynamic Analysis of Consolidation in the Broadcast Television Industry},
  type = {Finance and Economics Discussion Series},
  number = {2009-48},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2009},
  url = {https://whenthefedspeaks.com/doc/feds_2009-48},
  abstract = {This paper estimates a dynamic oligopoly model in order to separately identify the demand-side and cost-side advantages of consolidation in the broadcast television industry. I exploit an exogenous change in regulation that led to significant industry consolidation. Using revenue and ownership data for broadcast stations over the past ten years, I estimate the effect of ownership changes on revenue. I recover costs by examining patterns in ownership changes that are left unexplained by revenue estimation. I model firms' purchasing decisions as a dynamic game, and estimate the game using a two-step estimation method recently developed by Bajari, Benkard & Levin (2007). This is the first paper to estimate a model of merger activity in a dynamic, strategic setting. I find that there are both revenue and cost advantages to consolidation, but they operate through different mechanisms. Access to a wider audience enables firms to increase per-station advertising revenue, while simply owning more stations enables firms to reduce per-station operating costs. A firm's ability to realize these benefits is affected by its stations' network affiliations, locations and viewers.},
}