Every Little Bit Counts: The Impact of High-speed Internet on the Transition to College
Abstract
This paper investigates the effects of high-speed Internet on students' college application decisions. We link the diffusion of zip code-level residential broadband Internet to millions of PSAT and SAT takers' college testing and application outcomes and find that students with access to high-speed Internet in their junior year of high school perform better on the SAT and apply to a higher number and more expansive set of colleges. Effects appear to be concentrated among higher-SES students, indicating that while, on average, high-speed Internet improved students' postsecondary outcomes, it may have increased pre-existing inequities by primarily benefiting those with more resources.
Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. Every Little Bit Counts: The Impact of High-speed Internet on the Transition to College Lisa J. Dettling, Sarena F. Goodman, and Jonathan Smith 2015-108 Please cite this paper as: Dettling, Lisa J., Sarena F. Goodman, and Jonathan Smith (2015). “Every Little Bit Counts: The Impact of High-speed Internet on the Transition to College,” Finance and Economics Discussion Series 2015-108. Washington: Board of Governors of the Federal Reserve System, http://dx.doi.org/10.17016/FEDS.2015.108. NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.
Running head: HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 1 Every Little Bit Counts: The Impact of High-Speed Internet on the Transition to College Lisa J. Dettling1 - Federal Reserve Board of Governors Sarena Goodman - Federal Reserve Board of Governors Jonathan Smith - College Board December 2015 Abstract This paper investigates the effects of high-speed Internet on students’ college application decisions. We link the diffusion of zip code-level residential broadband Internet to millions of PSAT and SAT takers’ college testing and application outcomes and find that students with access to high-speed Internet in their junior year of high school perform better on the SAT and apply to a higher number and more expansive set of colleges. Effects appear to be concentrated among higher-SES students, indicating that while, on average, high-speed Internet improved students’ postsecondary outcomes, it may have increased pre-existing inequities by primarily benefiting those with more resources. 1 Email: lisa.j.dettling@frb.gov, sarena.f.goodman@frb.gov, and jsmith@collegeboard.org The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the Board of Governors, the College Board, or the research staff at either institution. We thank Josh Goodman, Hilary Hoynes, Adam Isen, Melissa Kearney, Michael Lovenheim, Amanda Pallais, Bekah Richards, and seminar participants at the Federal Reserve Board Applied Micro Lunch Seminar, the DC Economics of Education Working Group, and the 2015 Association for Education Finance and Policy and Society for Research and Educational Effectiveness conferences for helpful comments and suggestions.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 2 I. Introduction College planning is complicated: there are thousands of colleges in the United States, each with countless attributes, and students face uncertainty in both admission and completion. Moreover, students encounter hurdles and barriers throughout the lengthy college planning and application process. These barriers could be procedural, such as requirements that students register for exams by certain deadlines, as well as informational, such as the wealth of college characteristics that students are expected to obtain and distill to help produce a better fit. These complications, along with the many steps involved in the application process (Klasik, 2012), have contributed to widespread inequality in college access (Bowen, Chingos, and McPherson, 2009; Hoxby and Avery, 2013; Smith et al, 2013). While college access efforts, programs, and organizations are designed to help students overcome the optimization problem they face, there is an emerging literature that demonstrates that students often do not have full or correct information when making this consequential decision (Dillon and Smith 2013; Hoxby and Turner, 2015). A natural question follows - how can we encourage students to apply to and attend colleges that best fit their needs? Prior successful interventions have aimed to provide students with additional information (Avery and Kane, 2004; Carrell and Sacerdote, 2013) and change the way information is presented (Bettinger et al., 2012; Hoxby and Turner, 2013 and 2015). Other studies have found that students’ decisions are quite sensitive to small changes in information or costs.2 Thus, in theory, a technology that increased the availability or improved the presentation of information could also generate large changes in college-going. This paper investigates whether the dramatic and conditionally-random national diffusion of broadband Internet over the last decade affected students’ college application behavior. Between 2000 and 2013, high-speed Internet usage increased from 3 percent to 70 percent.3 Recent research concludes that the rollout of broadband to households had an impact on a host of outcomes, such as academic achievement (Vigdor et al., 2014; Faber et al., 2015), labor force participation (Dettling, forthcoming), voter turnout (Falck, Gold and Heblich, 2014), and 2 Evidence of students’ responsiveness to small changes in information or costs include interventions that change: 1) rules of thumb (Pallais, 2015), 2) the salience of college rankings (Luca and Smith, 2013), 3) financial aid offers (Cohodes and Goodman, 2014), 4) application fees and essays (Smith et al. 2015), and 5) admissions exam taking (Bulman, forthcoming; Goodman, 2013; Klasik 2014; Hurwitz et al. 2015). 3 High-speed internet usage rates were obtained from PEW Research at www.pewinternet.org/data-trend/internetuse/connection-type/.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 3 criminal behavior (Bhuller et al., 2011). An important mechanism emphasized in this research is ease and speed of information acquisition. In this paper, we tie the two strands of literature together and examine whether the diffusion of high-speed Internet – which could make it easier and less costly to obtain information about the college application process – affected students’ application decisions. Our main empirical strategy links administrative test-taker data from the College Board to a zip code-level measure of residential broadband Internet availability during students’ junior year of high school.4 We derive a new measure of broadband Internet availability from data collected by the Federal Communications Commission (FCC) on the number of broadband Internet Service Providers (ISPs) in each zip code from 1999 to 2007, a time when broadband Internet prevalence was skyrocketing. To our knowledge, ours is the first measure of availability at the zip code-level that reflects aggregate usage patterns.5 Using information on millions of students’ Preliminary SAT/National Merit Scholarship Qualifying Test® (PSAT) scores, SAT scores, and SAT Score Sends (as a proxy for applications), we first probe the extent to which broadband Internet availability induced SAT score changes and then examine a range of college application outcomes. Specifically, we investigate whether students with broadband Internet are more likely to apply to a four-year college, relatively many colleges, relatively selective colleges, an academically matched college, top liberal arts colleges, the flagship university within a student’s state, and an out-of-state college. Our findings indicate that increased availability of broadband Internet in a student’s zip code unambiguously improves her admissions exam scores and her application set. On average, her test score improves by 0.7 scale points and her application portfolio increases in both size and quality, with the magnitude of these changes ranging from 0.2 to 0.4 percentage point, depending on the specific outcome we consider. Some of these results are particularly striking when we scale by mean application rates; for example, about 7.2 percent of the SAT-taking population apply to a very selective liberal arts college, but we find that high-speed Internet availability increased that rate by almost 0.2 percentage point (or 3 percent). Further, while an applicant’s 4 We focus on the effects of residential broadband availability. Other forms of Internet access (e.g., dial-up, schoolbased) could also affect the outcomes we consider; thus, our estimates may understate the full role of Internet technology in college applications. 5 Figure 1 demonstrates that our measure of availability lines up well with trends in survey-reported usage rates. In the appendix, we show that state-level availability implied by our measure is an excellent predictor of teen highspeed Internet usage rates.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 4 performance on the SAT and her application set are undoubtedly intertwined, we find that the improvements in her application portfolio persist beyond what can be explained by her improved score: less than one-fifth of changes in her application outcomes can be traced to Internet-driven increases in average test scores. For a causal interpretation of our estimates, we must assume that high-speed Internet is exogenous to student’s testing and application outcomes, conditional on zip code fixed effects and student and time-varying zip code-level controls. We examine the validity of this assumption in several ways. First, we confirm that early and late adopting zip codes had similar trends in outcomes before high-speed Internet was available in the late 1990s. Second, we show that the availability of high-speed Internet in the student’s home zip code in subsequent years – i.e., when she is presumably in college – has no effect on her application outcomes. Third, we examine broadband availability in an event-study framework and estimate application effects in years prior to availability that are generally indistinguishable from zero, a pronounced jump in the year broadband becomes available, and generally positive and statistically significant effects thereafter. Last, because we observe outcomes only for students who take an exam administered by the College Board, to mitigate selection concerns, we demonstrate that our main estimates are robust to the exclusion of states in which the ACT—a college admissions exam that competes with the SAT—prevails among college-going students. Even as high-speed Internet became more accessible across the United States, it is likely that not every type of student benefited equally from its availability.7 In particular, it is well known that high-speed Internet take-up rates and computer accessibility vary by socioeconomic status.8 That said, the research on college under-match suggests teens from a lower socioeconomic background may be the least informed about college, and hence, may stand to gain the most from high-speed Internet. When we estimate the effects of high-speed Internet availability separately by group, we find that application improvements appear to be driven by higher-income students, 7 Moreover, even if there were equally improved access to the efficiencies of high-speed Internet—which there is not —the general equilibrium effects are less clear, since students may be competing with one another for spots. This is beyond the scope of this paper. 8 For example, the October 2003 CPS indicates that among 15-18 year olds whose mother did not complete high school, 8 percent had access to broadband at home and 55 percent had access to a computer at home. Among 15-18 years olds whose mother has a post-graduate degree, 46 percent had access to broadband at home and 97 percent had access to a computer at home. By 2009, when high speed Internet access was nearly universal, only 58 percent of 15-18 year olds with a mother who did not complete high school had access to broadband at home, compared with 95 percent of 15-18 year olds whose mother has a post-graduate degree. Computer ownership was not asked in 2009.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 5 students in urban areas, white students, and those with more-educated parents. These findings likely reflect an array of known impediments between availability and effective use for lowerresource groups – i.e., differential household adoption rates, within-household access to a computer, and use of the Internet to acquire and distill information on colleges. Altogether, our results indicate that while, on average, high-speed Internet improved students’ postsecondary outcomes, it may have widened existing inequities, favoring those with more resources. The contribution of our paper is fourfold. First, we provide the first causal estimates of the effects of high-speed Internet availability on college admissions testing and application behavior.9 Second, we add to a literature which finds that small cost reductions, and, specifically, improved access to information, can improve postsecondary outcomes. We demonstrate that significantly many students, if given the opportunity, appear to be able to obtain and distill information on colleges and universities for themselves. We also show that some students are potentially being left behind by the information age; because unobserved barriers limit their access, because these students require more guidance on how to benefit from their access, or some combination of the two. Third, given that universal broadband Internet is a central policy goal, our finding that broadband Internet availability can change college-going outcomes, but that the benefits may unequally accrue to higher-resource students, suggests a more intensive intervention than deployment alone may be necessary to realize the full benefits of increased availability in underserved areas. Last, we make a methodological contribution by developing a way to measure broadband Internet availability at the local level that matches aggregate usage patterns. II. Conceptual Framework and Related Literature There is a large and growing literature in economics on college choice from which we derive three key findings that serve as the bedrock of our analysis. One, there are substantial returns to college quality (e.g., Card, 1995; Black and Smith, 2006; and more recently, Zimmerman, 2014), particularly among disadvantaged students (Dale and Krueger, 2002, 2011). Two, despite the potential for large returns, many students, especially disadvantaged students, do not apply to or 9 There exists a related literature on the effects of computer and Internet technology on academic achievement (e.g., Faber et al. 2015; Vigdor, Ladd, and Martinez, 2014; Belo et al 2013; Fairlie and Robinson, forthcoming). We view our paper as complementary to those papers since we examine the role of Internet technology in reducing informational and transactional frictions in the college application process, which could also indirectly improve academic achievement if students attend better matched or higher quality colleges.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 6 attend a college commensurate with their abilities (Bowen, Chingos, and McPherson, 2009; Pallais and Turner, 2006; Spies, 2001). In the literature, this is typically referred to as “undermatching.” Three, information constraints appear to play a sizable role in under-matching (Hoxby and Avery, 2012; Hoxby and Turner, 2013 and 2015). Several recent papers indicate that high school students exhibit large changes in behavior in response to small changes in costs and pieces of information, and the strongest responses usually occur among subsets of students lacking sound pipelines to college. Smith, Hurwitz, and Howell (2015), for instance, find that a small increase in an application fee or an additional essay heavily influences application behavior. Further, students’ application sets are seemingly guided by defaults or perceived “recommendations.” Pallais (2015) finds that students apply to more schools when given an additional free Score Send, a cost savings of $6. Cohodes and Goodman (2014) find that students forego large expected earnings for small offers of financial aid from the state. Several papers also find that state mandated admissions exams and proximity to test centers induce large enrollment changes (Bulman, forthcoming; Goodman, 2013; Klasik 2014; Hurwitz et al. 2015). Such large behavioral responses to defaults, nudges, and small changes in costs suggest students are not well informed of optimal strategies in the application process. Several recent experimental studies have explored this idea by examining whether targeted information provision can improve application behavior. These experiments have directly intervened with the college application process and made tailored recommendations to particular student groups. Examples of interventions include filling out financial aid forms, counseling on the application process generally, and helping students obtain fee waivers that they were already eligible to receive (Avery and Kane, 2004; Bettinger et al., 2012; Carrell and Sacerdote, 2013; Hoxby and Turner, 2013 and 2015).10 Each intervention has generated large improvements in students’ attendance outcomes. The diffusion of broadband Internet serves as a hands-off experiment that could reduce some of the transactional frictions in the college search. High-speed Internet allows prospective 10 Hoxby and Turner’s experiment, in particular, produced large enrollment effects from targeted mailings even though their student sample was drawn from the universe of college admissions test-takers, so that both their treatment and control groups were equally college-bound and eligible to receive less-streamlined marketing materials from colleges. Consistent with prior observational studies, a survey of both groups found that untreated students were dramatically under-informed about college quality – particularly at top-ranked liberal arts schools and out-of-state schools but also at state flagships – relative to students who received their intervention.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 7 applicants to quickly and easily conduct tailored searches for the information they desire – information that once required conversations with high school counselors, parents and peers, who may be differentially knowledgeable of the college-going landscape and/or the students’ needs and interests. Moreover, the time and monetary cost of registering for exams, obtaining study materials, soliciting and submitting applications, and applying for financial aid and scholarships could be reduced. For example, college rankings, prospective college characteristics, admissions requirements, study guides, practice exams, deadlines, costs of attendance, application forms, and information on financial aid and scholarship programs are all available online in some form. In each case, high-speed Internet can make information more accessible and/or reduce the time cost of search and submission. Because we cannot separately identify the impact of each, we remain largely agnostic about how the Internet operates to improve postsecondary outcomes. Compared with the more targeted interventions described above, the nature of the relationship between high-speed Internet access and our outcomes is more complex. Students with Internet access must search for information and distill it largely by themselves. If students do not use the Internet to study for exams and search for information on college application, we would see no effect on student outcomes. If students primarily use the Internet for leisure activities, and the Internet serves as a distraction leading students to substitute time towards leisure, we could see worse outcomes. Ultimately, whether high-speed Internet availability, on average, generates positive, negative, or no effects on student outcomes is an empirical question we strive to answer through the paper.11 Driven in part by this ambiguity, we also investigate heterogeneity by demographics and geography. We consider two broad hypotheses. First, effects could be concentrated among groups found to under-match or who have been otherwise shown to be sensitive to information 11 Indeed, surveys offer ambiguous and inconclusive evidence on whether students use the Internet to effectively search for information on college application. A 2005 survey of Internet users found that 45 percent of Internet users, or 30 percent of all adults, had used the Internet to search for information on prospective college or universities for themselves or a family member (PEW 2005). At that time, this was similar to reported usage rates for banking online (44 percent) and looking for information about a job (44 percent), and higher than reported usage rates for reading a blog (27 percent), playing online games (32 percent), looking at online classifieds (36 percent), or using social networking sites (7 percent) (PEW, 2014). A 2012 survey found that three-fourths of teachers think the Internet and digital search tools have a mostly positive impact on students’ research habits, suggesting students may be able to conduct effective online searches (Purcell et al., 2012). However, 64 percent of teachers think digital technologies do more to distract students than help them academically (Purcell et al., 2012), suggesting that students may use the Internet for counterproductive activities that detract from study and search activities.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 8 nudges. Per the literature, these tend to be students with the fewest inroads to elite colleges. As such, we examine whether minority students or students in low-income neighborhoods, students with less-educated parents, or students from remote places are relatively more affected by highspeed Internet access. Still, the literature on under-match is fairly new, and many of the most relevant studies were conducted after high-speed Internet had become commonplace. This alone suggests that the mere prevalence of high-speed Internet may not have been particularly beneficial to these traditionally- underserved populations, either because of a lack of take-up or differential usage of Internet services. (Certainly, these populations on a whole are demonstrably less likely to have and use Internet services.) Indeed, within North Carolina, Vigdor, Ladd, and Martinez (2014) recently found that for younger students, Internet access increased the wedge in testing outcomes between high- and low-income students, which they attribute to a digital divide in how productively the Internet is used by the two groups at home. Thus, another hypothesis is that the greatest beneficiaries of high-speed Internet as it rolled out were the students with the most initiative and/or the greatest resources, especially if such students tend to use the Internet for education-related activities more than their peers. This might imply a concentration of effects among groups with greater inroads to colleges. III. Data The main empirical approach used in this paper is to relate zip code-level broadband Internet availability to individual-level testing and application data for students who took SAT exams. This section describes our main data sources and how we construct our relevant variables. a. Testing and Application Data Our primary data source is the College Board (CB), an organization that administers the Preliminary SAT/National Merit Scholarship Qualifying Test® (PSAT) and the SAT to high school students. The PSAT is an assessment taken prior to the SAT that serves as a qualifying exam for a nationally competitive scholarship program. Approximately 3.5 million students take the PSAT each year, typically either in the fall of their sophomore or junior year or both. The SAT is primarily a college admissions exam, and it yields a key metric on admissibility as well as application and demographic information for a majority of college-bound students. Over 1.5 million students in the graduating high school class of 2014 took the SAT, typically as juniors or seniors and often as both. Using the population of PSAT takers for our analysis of SAT
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 9 outcomes provides us with a pre-“treatment” ability metric, which can be related to test-taking and application outcomes in the SAT sample. Our sample is composed of test-takers in the graduating high school cohorts of 2001 to 2008, with roughly one million students per cohort.12 CB data contain self-reported information on high school graduation cohort, student race, gender, cumulative high school GPA, home zip code, parental income and education, and high school attended. The SAT contains math and critical reading sections, each of which is scored on a scale between 200 and 800 for a maximum composite score of 1600.13 The PSAT has a similar format, but the section scoring is between 20 and 80. Along with exam scores and basic demographics, the CB data also identify colleges to which students send official copies of their SAT scores (Score Sends), which serve as good proxies for actual college applications (Card and Krueger, 2005; Pallais, 2015). When registering for the SAT, the student has the option to send her scores to four colleges for no additional cost. Scores may also be sent at a later date for a fee of approximately $11 per Score Send.14 For every Score Send, we merge characteristics of each college, including data on quality (average SAT score of incoming freshmen), control (public or private), level (two- or four-year), type (e.g., liberal arts, state public flagship), and location (state), which come from Integrated Postsecondary Education Data System (IPEDS). Our final analytic sample consists of the 7,452,302 students who take both the PSAT and SAT and are in the high school graduating cohorts of 2001 to 2008. The sample only includes students with a valid zip code.15 Table 1 displays the summary statistics on the sample. Table 1 displays scoring and application means of SAT takers. The average combined PSAT score on both sections is just over 99 and, similarly, the average SAT score, which combines the results of these same testing areas, is just over 1000. According to the students’ Score Sends, about 79 percent submitted an application to at least one four-year college. In addition, 40 percent applied to at least five colleges, which, importantly, is one more than the default of four 12 We use the 2001 to 2008 cohort in order to be able to examine Internet access in both junior and senior year of high school. Internet access is available from December 1999 to December 2007. 13 The writing section was introduced in 2005, making the maximum composite score 2400. For consistency across classes, and because colleges typically do not rely on the writing section, we focus only on the math and critical reading sections. 14 The cost changed slightly over the sample period. 15 A few students were excluded because all their demographic data were missing or because they live in a zip code with no information on broadband access.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 10 free Score Sends.16 Approximately 50 percent applied to a selective college (average SAT score greater than 1200) and 30 percent applied to a very selective college (average SAT score greater than 1300), though almost 70 percent of students applied to an academically matched college (average SAT score at least as good as their own or 1300, whichever is lower). Zooming in on school type, about 30 percent of test-takers applied to the state flagship, 50 percent to at least one out-of-state college, and less than 10 percent to a top private liberal arts college. The top of Table 1 displays demographic characteristics of our sample. The sample consists of 67.2 percent who identify as white, 10.7 percent as black, and 9.5 percent as Latino/Hispanic. About 45 percent of the sample is male. High school GPA is a categorical variable where 0 is a non-response, 1 is the lowest, and 12 is the highest. We use the categorical variable in the analyses but present the average of the continuous version (equal to 3.363) in the table. Finally, we add zip code-level economic characteristics to our data in order to control for changes in local economic conditions. Mean adjusted gross income is $77,444, which is measured at the zip code-level and was obtained from the IRS Statistics of Income (SOI) data.17 Population and housing data at the zip code-level were obtained from the 2000 Census and made time-variant by merging with zip code-level trends in SOI counts of filers and households. We capture local labor market trends by including information on county-level unemployment rates, collected from the Bureau of Labor Statistics. We also include trends in county-level house prices, obtained from the FHFA house price index and the 2000 Census.18 b. High-Speed Internet Access Data Our goal is to estimate how broadband Internet affects student testing and application outcomes. Since there is no measure of a student’s ability to use or access broadband in the CB data, we construct a measure of broadband availability in a student’s zip code.19 To do so, we 16 The modal number of Score Sends is four and Pallais (forthcoming) shows that students tend to apply to the number of free Score Sends. Therefore, sending at least five Score Sends is a deliberate act. 17 We interpolate missing years in this data. 18 We construct a county-level measure of house prices by combining information on county-level median home prices from the 2000 Census with the Federal Housing Finance Agency house price index, as in Dettling and Kearney (2014). Urban counties use the Metropolitan Statistical Area (MSA) version of the index and rural counties use the rural index. 19 We might alternatively attempt to derive variation from household or student usage rates. Unfortunately, usage rates cannot be constructed for all years at the subnational level. The PEW data are available frequently but are only available at the national level. The CPS data include state identifiers but are only available in 2000, 2001, 2003, 2007, and 2009. Moreover, a measure of access is preferable to usage rates, which capture a student’s ability to access online content but are also endogenous to our question of interest if, for example, parents take up high-speed Internet in order to improve their children’s educational outcomes. However, in the appendix, we confirm that there
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 11 combine information on zip code-level ISP coverage with national trends in aggregate use of broadband services to produce a binary measure of broadband availability such that any household within a zip code where broadband is available can opt to have it. Because individual households have very little control over whether and when providers enter their zip code, and very little impact on aggregate usage, the measure we derive is interpretable as an exogenous shock to a student’s ability to use high-speed Internet. In this section, we describe the data and construction of our measure and save our examination of its exogeneity for the analysis section. We derive our measure of broadband ISP coverage from FCC Form 477 Filing data.21 The FCC requires every facilities-based provider with at least 250 high-speed lines to report basic information about its service offerings and end users twice a year.22 The FCC releases summary statistics to the public aggregated to the zip code-level, namely a list of zip codes with the number of ISPs who have at least one subscriber within the zip code receiving speeds of 200 kbps or more.23 The data are available bi-annually from December 1999 to June 2008, and to protect confidentiality, do not distinguish between one, two, or three providers in a zip code. Over that time, there is considerable variation both across and within zip codes. Ideally, we would like to operationalize the data on the number of ISPs in a zip code to be able to compare, across zip codes and time, the average resident’s ability to access high-speed Internet in her home. While we do not have a direct measure of zip code-level access from which to derive a correspondence between the number of ISPs in a zip code and accessibility, we can compare nationally aggregated coverage rates implied by the FCC data to survey-reported national usage rates of high-speed Internet. This is a reasonable litmus test for how well the raw FCC data capture market penetration – i.e., how provider entry translates into usage – at least at the national level. Figure 1 compares trends in the fraction of the population residing in a zip code with at least one provider to national trends in survey-reported usage, which were obtained is a strong positive relationship between our measure of access – aggregated to the state-level – and the state-level teen usage rates that can be derived from the CPS. 21 The FCC data can be downloaded from https://www.fcc.gov/encyclopedia/form-477-data-zip-codes-number-highspeed-service-providers. 22 Small providers, many of whom serve sparsely populated areas, are not required to report to the FCC and sometimes provide information on a voluntary basis. In our analysis we will provide separate treatment for rural and urban areas, in part to address concerns of measurement error arising by differential reporting standards across the two densities. 23 A “subscriber” can be either a residential or small business customer. Larger businesses and institutions are not included as they typically use an alternative technology.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 12 from PEW Research.24 There are large discrepancies between the two series, which do not match one another well in either levels or trends.25 Figure 1 also indicates that using the next available cutoff – four or more providers – does little to improve the divergence in levels or trends. The inability of the raw FCC data to capture national trends in usage is not surprising once one considers the vast heterogeneity in geographic and population sizes across the roughly 32,000 zip codes in the United States. Consider, for example, two zip codes which each reported one to three providers in 2000: 82332 is a rural zip code in Savery, WY with 134 residents in 1,422 square miles, and 10030 is an urban zip code in New York, NY with 25,847 residents in 0.30 square mile. By 2008, 82332 had 4 providers and 10030 had 11. Thus, it seems unlikely that all residents of each of these zip codes has equal access to broadband, suggesting a “one size fits all” measure of zip code-level coverage is inappropriate.26 We propose a measure that scales the number of providers in a zip code by its size. A 2006 GAO report which investigated broadband deployment across the United States found that ISPs rarely overlapped service territories (GAO, 2006). Thus, the number of ISPs in a zip code should roughly translate into the extent to which that zip code is covered, relative to its size. Moreover, because the literature on broadband roll-out suggests that supply-side constraints limiting roll-out were structurally different in urban and rural areas, we allow for how we define “size” to vary across these concepts.27 In particular, we scale the number of providers by square mile in rural 24 The PEW data can be found at: www.pewinternet.org/data-trend/internet-use/connection-type/. Note that the these rates of usage are extremely similar to those found in the Current Population Survey, which asked respondents about broadband Internet usage in 2000, 2001, 2003, 2007 and 2009. 25 We are not the first to note that the raw data on provider presence may not be able to accurately capture local access. A 2006 study by the GAO concludes that defining access according to provider presence alone “…may overstate deployment in the sense that it can be taken to imply there is deployment throughout the zip code even when deployment is very localized.” The paper also provides the results of a case study in Kentucky, where it was found that 95 percent of residents had a provider in their zip code, but only 70 percent had access in their area. 26 Prior research using the FCC data attempts to circumvent the comparability issues across zip codes by drawing variation in ISP coverage from fairly homogeneous zip codes, either by investigating outcomes within a single state- -where zip codes are quite similarly structurally – or by removing high- and low-density zip codes (Vigdor, Ladd, and Martinez, 2014; Xiao and Orazem, 2010). Unfortunately, such cuts are less desirable in our setting, because much of the foundational work in higher education suggests that these are precisely the sorts of comparisons we wish to make, as it is at extreme population densities where there is the most substantial divergence in both information and application behavior. 27 In rural areas, where zip codes are much larger and less densely populated, coverage was constrained by the cost of extending additional lines long distances to reach relatively few customers (GAO, 2005). In urban areas, population density can be a problem because too many customers using a single line at once can exhaust the system (Faulhaber, 2002; Greenstein and Price, 2007; Grubesic and Murray, 2002). Thus, to be most flexible in our definition of coverage, we allowed for the possibility that geographic size may be most relevant for rural zip codes and population size may be relevant for urban zip codes. Note that in our empirical match (described in the appendix) we allowed for a single definition but found that the best match was to include different definitions for
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 13 areas and by population in urban areas.28 In the appendix, we present alternative specifications using only the population-scaled measure and only the mileage-scaled measure across all zip code types (as well as separately for urban and rural zip codes). The results become similar to our main estimates the better the measure fits usage trends.29 To operationalize the concept of high-speed Internet availability and facilitate interpretation, we convert the scaled measure into an indicator variable that takes a value of one when it crosses a specific threshold.30 Since there is no theoretical guidance for what an appropriate threshold might be, we identify the threshold empirically by targeting the national trends in surveyreported high-speed Internet usage displayed in figure 1. We construct an algorithm to test varying thresholds and ultimately find that the best-fit measure defines penetration in a rural zip code as “at least one provider per 12 square miles,” and in an urban zip code as “at least one provider for every 2,700 people.”31 More details on the construction of this measure can be found in the data appendix. The red line in figure 1 displays bi-annual trends in our measure of broadband Internet access rates based on our measure of high-speed Internet penetration. We see that, unlike the rural and urban consumers. In the appendix we provide results using a single definition for both urban and rural areas. 28 Zip code population data are based on a combination of Census 2000 population data and SOI population data. SOI data provides a count of the number of income tax filers, which was used to create population trends to move the 2000 Census zip-code population total forward. Zip-code land area estimates are from the 2000 Census. 29 The root mean squared error between our main measure of access and national trends in usage is 0.92. Using just the urban population measure for the whole sample, the root mean squared error is 1.34 and the results are quite similar to our main measure. Using just the rural mileage measure for the whole sample, the root mean square error is much larger at 2.79. In this case, the results begin to differ from our main results. We attribute this to the relatively poor fit of this measure, as a different mileage measure which better fits overall usage patterns leads to more similar results to those obtained from our main measure. We also examined the sensitivity of our results to models using discrete bins characterizing the number of providers (one to three and four or more), with the caveat that we believe this substantially overstates national broadband availability and fits the data poorly (as evidenced by figure 1). The measure, though it is difficult to interpret, offers inconsistent testing results (additional test-taking drawing in equally able students but then lower SAT scores overall) and somewhat inconsistent application results (more applications to selective schools; fewer applications). The measure performs better in both respects when we investigate North Carolina alone, as in Vigdor, Ladd, and Martinez (2014), likely because zip codes within a single state are at least conceptually similar. Results are available upon request. 30 We do so because we wish to interpret our results as the effect of Internet access on student outcomes. Interpretation of the linear measure would be difficult, as the literature provides no guidance on what it means for a zip code to have one additional provider per person or square mile. Additionally, the provider measure is not continuous and is listed in bins; thus, a continuous measure of coverage cannot be used and interpreted as such. While we prefer a dichotomous measure, we note the caveat that we can only interpret our measure as the average resident’s ability to access high-speed Internet. It is possible that, for some parts of a zip code, Internet will be available prior to a zip code being “turned on” by our measure, and for others, Internet may never be available. 31 We define “best fit” as minimizing the root mean squared error between each measure and the survey-reported trends. More details can be found in the appendix.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 14 providers-only-based measures, our measure closely follows levels and trends in the PEW survey reported usage rates. In 2000, both measures are close to zero, and by 2008, both are around 60 percent. To further confirm that our measure of availability translates into teen usage, we turn to the Current Population Survey, which collected information on broadband usage in 2000, 2001, 2003 and 2007 (Appendix Table 4). We find that the fraction of the state population with broadband available in their zip code indeed predicts state-level teen usage rates.32 Figure 2 displays maps of our measure of zip code-level high-speed Internet penetration over time, where dark gray zip codes are those that our measure indicates have access to highspeed Internet and light gray zip codes are those that do not.33 It is clear there is both across-time and across-zip code variation in high-speed Internet coverage. In December 1999, few zip codes have access, and those that do are clustered in the major population centers on the East Coast. By December 2007, coverage has expanded across the United States, although there are still many areas that coverage has not reached. Over the full sample period, around 30 percent of our SAT takers are considered to have high-speed Internet available in their area in their junior year (Table 1). Finally, we note that our measure is designed to capture high-speed Internet “availability” or “access” to the average resident in a zip code, but an individual student’s ability to use the Internet is much more nuanced, whereby household take-up and student usage can play either a mediating or amplifying role. High-speed Internet subscriptions are not free, and there are large observable differences in high-speed Internet take-up by education and income. These gaps remain to the present day when local high-speed Internet access is nearly universal, suggesting these differences are due to differential take-up rates beyond the question of availability.34 Moreover, even if a home has a high-speed Internet subscription, differences in purchased speeds and the number of devices available to a student can lead to differences in a student’s ability to use online services.35 Finally, while survey evidence indicates that searching for college 32 We also consider other aggregated measures of availability, and ours performs the best In fact, some measures (such as the “at least one ISP” measure) are negatively correlated with usage. 33 The white areas on the map are zip codes for which we do not have information. Most represent unpopulated areas like national parks and bodies of water. 34 The 2009 October CPS indicates that 58 percent of 15-18 year olds with a mother who did not complete high school had broadband at home compared with 95 percent of 15-18 years with a mother with a post-graduate degree. 35 The 2003 October CPS indicates that 55 percent of 15-18 year olds with a mother who did not complete high school had a computer at home compared with 97 percent of 15-18 years with a mother with a post-graduate degree.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 15 information is a popular usage of Internet services, if (some) teens do not use Internet for these purposes, we might not see changes in students’ application outcomes.36 IV. Analysis In this section, we investigate whether high-speed Internet access affects a student’s college admissibility and application set, and, to some extent, how the two interact. To separate the two, we need to understand the evolution of a student’s application and distinguish behavioral changes in how she targets her application from structural changes in the quality of her application. Thus, we begin by examining whether our measure of high-speed Internet access coincides with shifts in the quantity and quality of SAT test-takers drawn from that population. Then, in our core analysis, drawing from existing research on college quality and under-match, we consider application outcomes designed to capture behavioral patterns deviating from defaults or broad recommendations. We then examine the validity of the assumptions underlying our interpretation of our estimates as causal. Finally, we extend our analyses in two ways. First, we wed our testing and application results to examine the extent to which observable differences in applicant quality induced by high-speed Internet access can explain the application effects we detect. In addition, we investigate whether our results appear to be concentrated within particular demographic and socioeconomic groups. a. Empirical Specification Our estimating equation is a generalized difference-in-differences: y =α+β*broadband +Aθ+X λ+γ +γ +ε izc izc i zc z c izc where y is a binary variable37 capturing a testing or application outcome for student i from zip izc code z in cohort c; broadband is an indicator for broadband availability in her junior year; A is izc i a set of student-specific demographic and ability controls; X is a set of cohort-varying zip code zc economic conditions coincident with the timing of the broadband availability measure (i.e., also in her junior year);38 and γ and γ are zip code and cohort effects. Note that A includes the z c i 36 A 2005 survey of Internet users found that 45 percent of Internet users had used the Internet to search for information on prospective college or universities for themselves or a family member (PEW, 2005). 37 An exception is when we estimate the effect of Internet availability on PSAT and SAT scores, which is an integerbased variable, ranging in increments of 10 from 400 to 1600. 38 Student-specific controls include dummies for gender, race, and high school GPA, as well as PSAT math and verbal scores; zip code controls include adjusted gross income, population, number of houses, unemployment rate, and median home price. Additionally, in Appendix Table 5 we add a control for student survey responses to parental education and income, questions that do not appear on the PSAT survey and that are associated with considerable nonresponse on the SAT survey. Results are qualitatively unchanged to their inclusion.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 16 student’s PSAT verbal and math scores as well as high school GPA, which capture “latent ability” as well as an early signal of admissibility to selective colleges. Our primary coefficient of interest is β such that when y is “applied to a four-year college,” our estimate represents the increase in the likelihood a student applies to a four-year college if high-speed Internet is available in her home zip code. This characterization of β holds under the assumption that, all else equal, trends in testing and application outcomes in zip codes with and without high-speed Internet would have evolved similarly over time, save for the availability of high-speed Internet. Of course, zip codes which received high-speed Internet access earlier may have been different than zip codes which received high-speed Internet access later. That is why it is imperative that our specification include zip code fixed effects (γ ) so that our estimates are net z of any time-invariant differences across zip codes. X also contains a set of zip-code-level zc economic indicators, including local income, unemployment rates, house prices, and population density in an effort to capture any observable changes in zip-code characteristics that may have been correlated with the availability of high-speed Internet. Our identifying assumption is that, all else equal, our measure of high-speed Internet availability is exogenous to a student’s testing and application outcomes. Recall that we derive our measure from zip-code-level access based on counts of ISPs benchmarked to trends in usage. We might be concerned that usage is endogenous to our outcomes, and, at the individual level, it almost certainly is. However, recall that our specification includes cohort effects (γ ), so that our c estimated effects are net of any national trends in high-speed Internet adoption and the availability of online content. Thus, for identification purposes, we need only assume that provider entry is exogenous to student’s outcomes. This means that threats to identification come in the form of either student-demand-related pressures on Internet service providers to enter their zip code, or any omitted variables that co-vary with student outcomes and provider entry. We view such threats to be small, if not negligible. Per the former, there is abundant evidence that supply-side constraints restricted high-speed Internet access, and that the supply of high-speed Internet lagged consumer demand. To provide high-speed Internet, Internet service providers (ISPs) – typically the existing phone or cable company – had to make substantial infrastructure investments, retrofitting existing phone and cable lines and installing new switches and servers (Faulhaber, 2002; Greenstein and Prince, 2007; Grubesic and Murray, 2002). There is a general consensus that the costs slowed rollout
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 17 and access did not keep up with consumer demand (Greenstein and Prince, 2007; Faulhaber, 2002). Dettling (forthcoming) discusses how variation in the underlying housing infrastructure and the availability and quality of existing telephone and cable wiring made these infrastructure investments differentially costly across locations, which created differences in the timing of the availability of high-speed Internet services across locations that was unrelated to consumer demand-related pressures.39 Based on these known barriers to entry, it seems unlikely, but not impossible, that student-demand-related pressures induced high-speed Internet service provision. Thus, net of state and year fixed effects, and the economic controls included in our model, the roll-out of high-speed Internet is arguably random. For the latter to be plausible, it must be that our extensive set of student- and zip-code-level controls are inadequate. We investigate the exclusion of key variables by re-estimating our main outcomes over other timings of high-speed Internet availability that are less likely to affect our outcomes. For example, as a preview of our results, while we find that junior year high-speed Internet availability leads to SAT score improvements, when we instead use a measure of highspeed Internet availability in December of a student’s senior year of high school – after most students would take the SAT – there is no discernible effect on SAT scores. We view this as suggestive evidence that our estimation framework does not omit key variables, though we acknowledge it is a possibility. Finally, before we turn to results, we make two notes regarding their interpretation. First, the coefficient on broadband captures the effect of potential access, as opposed to high-speed izc 39 There are two main transmission modes for high-speed Internet in the United States: cable-based and telephonebased digital subscriber line (DSL) service. Each service requires the installation of fiber-optic wiring, which provides high-speed Internet service up to a certain point, from which the signal travels over traditional coaxial cable or copper telephone wiring the rest of the way. These fiber-optic lines may reach the ISPs’ central office, some remote terminal in the neighborhood, or the home. The main issue that prevented timely rollout for the cable companies was capacity. Cable companies had installed some fiber lines in the 1980s to provide digital cable service, but each additional customer on a single fiber line reduces the “downstream” capacity, meaning that multiple simultaneous users reduces speeds and could exhaust the system. Thus, to provide reliable, high-speed Internet service, cable companies needed to add more fiber lines that came closer to residences. For DSL Internet from the phone companies, rollout was prevented by the need to upgrade the existing telephone wiring, much of which was old and had been split too often to be capable of carrying high-speed two-way traffic. In either case, the key insight is that existing wiring leading up to the home or apartment building was insufficient to carry high-speed traffic, while the wiring already in a home or building was typically sufficient. Dettling (forthcoming) demonstrates that this incentivized entry into markets where the existing housing infrastructure offered lowered costs of provision. That is, areas with more multiple-family dwellings received access earlier because ISPs could take advantage of economies of scale in the provision by bringing one line to multiple consumers.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 18 Internet use, because broadband is a measure of high-speed Internet penetration over a izc geography rather than student-specific high-speed Internet adoption. Arguably, this coefficient is the most relevant for policy. As noted earlier, the effect of high-speed Internet use will depend on how many students use it, how often they use it, and in what ways – all of which the government is unable to fully control or measure. Moreover, local peer effects, wherein local access changes the college-going culture of students’ neighborhood or high school even for those students who do not personally have access at home, could amplify or reduce the effectiveness of use in ways that are difficult to separate from own access but that a measure of local access will generally subsume. Second, because our analysis focuses on how the prevalence of high-speed Internet in junior year coincides with late high school outcomes and includes both zip code and year fixed effects, our estimates will reflect only the testing and application effects of coverage that are immediately detectable. In practice, reducing the cost of information could affect outcomes at any pre-collegiate stage. Early high school students, for instance, could alter their coursework and career paths based on information gleaned from the Internet, which, in turn, could also affect postsecondary outcomes. While some of our results will hint that gains in applications are indeed larger the longer broadband is available, we generally leave this possibility to future work.40 b. Main Results: Test-Taking, Scoring, and Application Outcomes We begin our analysis by examining whether SAT test-taking rates systematically vary with high-speed Internet availability. Relative to the full population of U.S. students, SAT takers tend to be college-aspiring students, so our sample of testing and application outcomes are likely positively selected from the distribution of student ability. Moreover, there exists a competing exam for college-bound students, the ACT, that students can elect to take instead. Thus, depending on how high-speed Internet is used and by whom, we might see shifts in SAT testtaking and SAT test-taker quality that result from increased broadband availability. Note that while test-taking is a separately interesting outcome that may be affected by highspeed Internet availability, any systematic differences that we detect in the amount or quality of SAT takers would also affect our interpretation of β when the dependent variable is an SAT score or a college application decision. This is because we can only observe these outcomes for 40 For example, Table 3 suggests that the application effects of broadband grow the longer it is available to students.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 19 students who take the SAT, and such results would imply that we do not observe a comparable set of SAT test-takers across zip codes with and without high-speed Internet. Consider, for example, if, when high-speed Internet is available, more students find it worthwhile to take the SAT, but those who elect to take the exam only in this state tend to earn below-average scores. Were we to then observe less-favorable average application outcomes in zip codes where broadband=1, we might incorrectly infer that high-speed Internet negatively affects application outcomes, when instead the composition of students for whom we can observe outcomes has also changed. Thus, if there is observable selection into (or out of) SAT-taking when high-speed Internet is available, estimated βs for outcomes that can only be observed conditional on taking the SAT would reflect a combination of true application effects on students who take the exam in any state and broadband-driven sample selection. To estimate shifts in test-taking and test-taker quality, we leverage information available from the PSAT. The PSAT is the qualifying test for the National Merit Scholarship Program and is often mandatory within a state or district, so that the population of PSAT takers approximates the at-risk (i.e., non-selected) population of SAT takers. Using the full set of PSAT takers, we first examine whether more students elect to take the SAT once broadband is available, so that y is an indicator value that takes the value of 1 if a student takes the SAT exam and 0 izc otherwise. Table 2, column 1 indicates that the estimated effect of high-speed Internet on SATtaking is extremely small (0.00049) and statistically indistinguishable from zero. Thus, on average, the likelihood a student takes the SAT is unchanged by increased high-speed Internet prevalence. Still, there remains the possibility that there are changes in test-taker quality underlying this non-effect if, for instance, more-able students are induced to take the exam and directly offset less-able students who are discouraged from taking the exam. Thus, column 2 considers whether the PSAT score for our analysis sample – i.e., students who take both the PSAT and the SAT – varies systematically with high-speed Internet availability. Again, we obtain results indistinguishable from zero. From this combination of results, we can make two related inferences. First, the availability of high-speed Internet in junior year apparently does not induce students to alter their SAT test-taking decisions. And second, the set of test-takers for whom we
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 20 observe applications and SAT scores does not meaningfully differ in ways that would raise sample selection issues in examining these outcomes.41 Next, we examine whether broadband availability affects outcomes later in the application cycle. We begin by looking to see whether we can detect systematic differences in SAT scores (Table 2, panel 2, column 1). Since, as we have demonstrated, the underlying test-taker populations are equally able, any observable changes would indicate that test-takers become differentially admissible to selective colleges when broadband is available. Indeed, students in zip codes with broadband outperform their academic peers in zip codes without broadband by an average of 0.7 SAT point (or 0.3 percent of a standard deviation). Finally, we examine the effect of broadband on application rates. Drawing from existing research on college quality and under-match, we consider outcomes designed to capture behavioral patterns deviating from defaults or broad recommendations, such as applying to more schools than the number allotted with a test registration (i.e. applying to five or more), applying to a four-year school, applying to a selective school, applying to a match school, applying to a state flagship, applying out of state, and applying to a highly ranked liberal arts school. We construct each application measure as an indicator variable and we test, in line with the literature, whether the likelihood a student will deviate from perceived defaults increases with more information. The remainder of Table 2 displays estimates of the effect of broadband on the application measures. The signs on all of the coefficients are consistent with improved outcomes: we estimate statistically significant gains in the probability a student applies to more than the default number of colleges, a four-year college, a selective college, a college commensurate with her 41 Ideally, to examine selection, we would use all students who are at risk of applying to college when broadband becomes available (e.g., high school juniors by home zip code over time), but we do not observe such a measure in our data and, to our knowledge, these data are not available elsewhere. However, we believe that the population of PSAT-takers is an ample proxy for the most relevant denominator in our setting for several reasons. For one, we find no evidence of changes in the underlying quality of test-takers in our sample (as measured by PSAT scores). In addition, the PSAT is generally administered to high school sophomores and juniors in the fall, and we restrict our analysis sample to only test-takers who took both exams; at best, this leaves a very small window to respond through PSAT-taking. It seems much more likely that, in practice, students who are induced into college-going by broadband availability in their junior year will forego the PSAT entirely and elect only the SAT exam, which is the exam relevant for college admissions. Such students are excluded from our analyses. Relatedly, students induced into PSAT-taking would generally not have been otherwise collegebound, and thus are extremely unlikely to apply broadly or to schools in the selectivity ranges we consider; if anything, including them should bias our estimates toward zero. Finally, when we restrict the sample to SAT states, where selection is less of a concern, our results look quite similar.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 21 own score (i.e., a match college), a top liberal arts college, and an out-of-state college ranging from 0.2 to 0.4 percentage point. In some instances, these changes reflect quite meaningful deviations from mean behavior; for example, only about 7.2 percent of SAT takers apply to top liberal arts colleges, and high-speed Internet availability induces a 0.17 percentage point change over that baseline. Interestingly, while both are positive in sign, the effect of broadband Internet on whether students apply to a very selective college or an in-state flagship are statistically indistinguishable from zero. The first null result could reflect disparities in student groups to whom the gains of high-speed Internet availability tend to accrue – perhaps the most elite students are not the largest beneficiaries. The second null result is consistent with this line of thinking as well, but also with prior work that has found that groups of students who under-match generally tend to favor public, in-state colleges in their applications (Hoxby and Avery, 2013), so we might not anticipate large effects within this class of institutions. Moreover, flagship colleges tend to be large and well-known; thus, if simple awareness was preventing applications, we might not expect this outcome to be affected.42 c. Validity of Research Design Our identifying assumption is that barring the emergence of broadband, trends in our outcomes would have evolved similarly. One way to check this assumption is to examine trends in our main outcomes before broadband became available to residential customers in 1999. Using SAT data for the 1996 to 1998 graduation cohorts, we classify zip codes according to whether high-speed Internet became available during the early (1999–2001), middle (2002– 2005), or late (2005–2007) years of our sample period, and chart the evolution of our main outcomes over this prior period. To conserve space, we limit the discussion of this analysis (and the rest of the validity checks) to the main testing and application outcomes that were statistically significant in Table 2 and present the remaining outcomes in the appendix. Figure 3 displays the results of this analysis, which indicate that there are substantial and persistent level differences by timing in several of our outcomes before broadband technology became commonplace: test-takers from zip codes that received broadband relatively early tend to 42 While Hoxby and Avery (2013) find that, within the state, these groups do not seem to favor flagships and that they often instead apply to less selective public universities in their state, in a related survey, Hoxby and Turner (2015) find that the reasons such students give for not applying to the state flagship seem more related to the social environment at the school than to unawareness of its academic quality.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 22 have better application outcomes. This result is not surprising, since ISPs tended to enter wealthier zip codes earlier than less well-off zip codes. Recall, however, that our specification includes zip code effects. This means we need only assume that trends – not levels – are statistically similar across the timings we consider in the absence of broadband. Figure 3 indicates that this assumption appears to be valid for the 1996–1998 period: there do not appear to be any differential trends in our outcomes by broadband availability category. Thus, prior trends do not appear to be driving our estimates. Another way to examine the validity of our design is to estimate outcomes for students who were older in the application cycle when broadband became available. Specifically, we would not expect to see systematic differences in applications among students who were already freshmen in college by the time broadband became available to their home zip code. Thus, we reestimate our model, adding new indicators for broadband availability in a student’s home zip code during her senior year of high school and freshman year of college. Although obtaining broadband access during one’s freshman year of college should not theoretically affect application outcomes, in practice we have an imperfect measure of individual-level access and it is possible that some students have access to high-speed Internet before our measure officially turns on in their zip code. Thus, we do not necessarily expect to see precise zeroes for freshman year of college, but rather broadly more muted effects for cohorts who generally would have been too old to benefit. Figure 4 summarizes the results of our analyses of zip code broadband access in junior year of high school, senior year of high school, and freshman year of college on our main outcomes. Again, for brevity we present the outcomes that were statistically significant in Table 2. In these figures, the x-axis plots the year of schooling and the y-axis displays the estimated coefficients and confidence intervals on the indicator for zip code-level broadband access in the year listed. In each case, broadband access in junior year has a positive and significant effect on the outcome, and an imprecisely estimated null effect in the freshman year of college. Finally, in order to comprehensively examine the pre-trends together with the dynamics following the introduction of broadband, we follow Autor (2003) and estimate an event-study version of our estimating equation. Specifically, we replace broadband with a series of izc dummies for the years prior to and following the introduction of broadband over our sample
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 23 period, and re-estimate each of our outcomes. For consistency and ease of interpretation, the omitted dummy in each equation corresponds to the year just before broadband is introduced. Table 3 displays the results. The coefficients on the broadband availability leads are generally not significantly different from 0; thus, in this framework as well, there is no evidence of anticipatory effects or important heterogeneity in application pre-trends.43 By contrast, the coefficients on the lag terms are generally all highly significant. Interestingly, though there is a clear, statistically significant jump in each of our application outcomes the year broadband becomes available, the effect sizes appear to swell the longer broadband is available, suggesting that the benefits of broadband may be larger if introduced even earlier in the college application cycle. Finally, we have noted that in our data we can only observe testing and application outcomes for SAT takers, even though there exists a competing admissions exam students can elect to take instead. Thus, we might be concerned that some of the behavior we detect reflects selection into (or out of) our analysis sample, driven by switching between the ACT and SAT. While colleges throughout the United States generally will accept either exam, the tests historically tend to prevail in particular geographic regions; students on the coasts and in the South more often take the SAT and students in the Midwest favor the ACT. In addition, students planning to apply to highly selective universities often take the SAT. Over the time period we consider, as accountability pressures grew, some states, and often those where the ACT had already prevailed, began requiring their students to take an admissions exam. Since the resulting score could be used in the application process, state mandates potentially interfered with some of the competitive dynamics between the two exams. To examine whether the existence of the ACT is introducing selection concerns into our analysis, we restrict our analysis sample to those states in which the SAT historically has prevailed, and re-estimate all of the outcomes (Table 4). The results broadly mirror the main 43 The one exception is that the policy leads for “Apply 5+” appear to indicate pre-trends, which suggests that we may be incorrectly attributing increases in the number of applications to broadband availability. Note that this interpretation is not fully consistent with the evidence brought to bear on this outcome in the prior two exercises, and of the many validity tests we do, it is statistically probable we fail one by random chance. Thus, the preponderance of evidence suggests that broadband availability is exogenous to students’ application outcomes.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 24 results in Table 2.44 The SAT score increase owing to availability is again 0.7 SAT points, and application outcomes increase across the board similar to our main estimates. d. Exploring Mechanisms We showed earlier that application outcomes improved for students with broadband Internet. However, we also detect broadband-driven increases in their average admissibility, as measured by their scores. We have not yet shown whether application improvements occur independent of scoring improvements. In other words, to understand how broadband availability is operating to improve outcomes, we would also like to know whether students who are observably equally admissible apply differently when broadband is available. To probe this, we re-estimate our application outcomes, adding a separate control for the SAT score in addition to the PSAT score. Table 5 displays the results of this exercise and indicates that broadband has an effect on student applications, above and beyond improved admissibility. Dividing the estimates in Table 5 by those in Table 2 indicates that score improvement explains at most 20 percent of changes in the application set (as indicated by the italicized percent changes). While there may be some complementarity between changes in SAT scores and other features of a student’s application that are unobservable to us – for example, a student with a higher score might feel encouraged to enroll in more rigorous courses or participate in additional extra-curricular activities – these results indicate that broadband availability is likely also acting through channels independent of student admissibility, such as reducing the work involved in submissions or making information on schools easier to obtain. We can further examine this separability by considering whether a more short-term treatment – broadband availability in December of senior year of high school – affects our outcomes. In this case, students would have much less time to improve the strength of their application. Table 6 displays the effects of senior year access on application outcomes. Similar to the results for junior year, we find broad improvements in outcomes, although the results are muted.45 Moreover, while still positive, we no longer detect significant effects on the probability a student 44 For completeness, we re-estimate the specifications examining selection into SAT-taking (i.e., the top panel of Table 2). The coefficients (standard errors) on “PSAT Score” and “Took the SAT” are -0.06520 (0.03571) and 0.00175 (0.00098), respectively, neither of which is statistically different from zero (at the 5% level). 45 Because we focus on the same cohorts of students and allow the years in which Internet access was obtained in the zip code to vary, the magnitude of the coefficients are not directly comparable to the junior year results. For instance, the first cohort we use graduates in 2001. That cohort could have obtained Internet access in December of their junior year, or 1999. In the senior year analysis, that cohort could have obtained access in December of 2000.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 25 applies to a selective school or out-of-state school. Importantly, we do not estimate an SAT score change consistent with the later timing. Since students do not typically take the SAT after fall of senior year, we find this (non-)result to be a compelling affirmation of our strategy and indication that broadband availability is acting beyond score increases to improve application outcomes. Overall, Table 6 suggests that admissibility alone is not driving our results. e. Heterogeneous Effects The literature on under-match identifies the existence of substantially many high-achieving students left behind because of a lack of peers, role models, and necessary information that could help them with the application process. Broadly, findings in this area tend to connect measures of relative disadvantage—e.g., lower-income geographies, less-educated parents, students in rural areas, and students who identify as a racial minority—to information constraints limiting students’ postsecondary opportunities. In this regard, it seems plausible that high-speed Internet access might have a larger impact on postsecondary outcomes for students from these groups. However, our setting offers more than just pure information, such that there may also be unobserved countervailing forces, such as whether the student’s family can afford to or wants to purchase a subscription, whether the student has devices at their disposal, and how effectively the Internet is being used. If those differences are correlated with relative disadvantage – which most evidence suggests that they are – high-speed Internet access may not translate into improved academic outcomes according to the groups we consider.46 This would be consistent with recent work which suggests students from lower socioeconomic backgrounds (i.e., lower- SES students) with access to Internet suffer academically relative to their peers, while students from better socioeconomic backgrounds (i.e., higher-SES students) gain (Vigdor, Ladd and Martinez, 2014). Thus, our next exercise is designed to probe whether we observe systematic differences in outcomes by particular student groups, with the caveat that the analyses that follow are somewhat difficult to interpret because the dimensions we consider are also correlated with 46 There is evidence that disadvantaged students are less likely to have broadband at home, have a computer at home and use the Internet for education-related activities: for example, the October 2003 CPS indicates that among 15-18 year olds whose mother did not complete high school, 8 percent had access to broadband at home and 55 percent had access to a computer at home. Among 15-18 years olds whose mother has a post-graduate degree, 46 percent had access to broadband at home and 97 percent had access to a computer at home. Among students with access to Internet at home, 74 percent of 15-18 year old Internet users whose mothers did not finish high-school use the Internet for education-related purposes, compared to 93 percent of Internet users whose mothers had a post-graduate degree.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 26 lower rates of household Internet take-up and device access. As before, our βs are interpretable as the effects of potential access to high-speed Internet; if we observe no effect on the students who are most constrained, we are unable to test whether this is due to lack of take-up or lack of a treatment effect. Table 7 displays estimates of the effect of broadband Internet by group. We find that improvements in applications are concentrated among higher-income, more-educated, white, and urban students. By contrast, our estimates for lower-SES students appear to indicate large changes in SAT scores but a pervasive null effect on applications for these groups.47 These patterns may partly reflect how well such populations are able to use the Internet to improve outcomes on their own. Altogether our findings suggest that the benefits of high-speed Internet may be lopsided and favor those who already had more resources. Still, our set of results may be somewhat specific to our setting, when high-speed Internet was first becoming available to households. It may be that in its early stages, the benefits of high-speed Internet primarily accrue to resource-rich students, but as it becomes more diffuse, lower-resource students experience large benefits as well. We probe this hypothesis in a framework mirroring Table 3 splitting the sample, as above, by parental education (Appendix Table 7). The results generally indicate that low-SES students realize increased score gains over time, but that the gains only translate into improved applications if they essentially grew up with broadband in their zip code. By contrast, among high-SES students, the gains in both scores and applications are immediate and persistent. This set of findings is broadly consistent with the under-match literature and could imply any or all of the following: 1) there is learning among low-SES students in how one can use the Internet to improve outcomes; 2) low-SES students are very late adopters; 3) low-SES students alter their admissibility over time in important ways we cannot observe (e.g., through changes in their coursework); or 4) the benefits of broadband disseminate across zip codes over time, even to non-users. While outside the scope of this paper, future research could explore why lower- 47 In some instances, lower-SES students appear to have lower average PSAT scores once broadband becomes available (available upon request). While interesting from a policy perspective, this result complicates the interpretation of the large SAT score increases and the null application effects, as they might reflect changes in the test-taking population or the characteristics of our applicant sample. Still, in other instances, we observe no such evidence of selection among lower-SES students. The SAT and application effects are similar across the measures of SES we consider.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 27 resource students appear to experience gains in their scores over time, why these gains are not immediately reflected in their applications, and how policy could mitigate the gaps we detect. V. Conclusion This paper examines the effect of high-speed Internet on testing and college application decisions. We find that the availability of high-speed Internet in a student’s zip code unambiguously improves her scores and her application set. Students appear to diverge from defaults and broad recommendations by sending more applications, applying to more-selective schools, applying to schools outside their state or smaller in presence, and applying to schools more commensurate with their abilities. Some of this improvement can be traced to increases in average test scores, also owing to broadband. Moreover, consistent with prior literature, we find that interventions fairly late in adolescence can have considerable effects on students’ postsecondary outcomes. We find that the primary beneficiaries of broadband Internet availability appear to be higherresource student groups, so that the digital divide may be substantially neglecting students who already tend to have fewer inroads to elite academic institutions. If this gap is due to differences in student’s ability to access and use broadband – perhaps because their parents do not take up broadband or because students do not have access to devices – these results suggest that policies aimed at increasing broadband availability and affordability could reduce inequality in postsecondary outcomes. If these gaps are due to differences in the ability to find and digest the relevant online content, policies aimed at providing guidance on how the Internet can enhance opportunities for these students might be effective. Nonetheless, our results imply that students can benefit from content available online to improve their outcomes. And, even though our results cannot speak directly to Internet usage at school, it is possible that in-school programs that encourage and monitor Internet searches could be an effective tool for improving college outcomes. We leave it to future work to uncover which margins are relevant for policy to mitigate these gaps. Of course, an important lingering question is how the application improvements we estimate translate into differences in attendance. We cannot observe enrollment outcomes for our analysis sample, but we can use comparable estimates from the literature to derive anticipated effects. Hoxby and Turner (2013) launched a national information intervention designed to improve application and attendance outcomes. Students who were sent material were 12 percentage points
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 28 more likely to apply to “peer” institutions and then 5.3 percentage points more likely to enroll in them. In other words, in their setting, about 43 percent of the detected change in application behavior effectively translated into improved attendance outcomes. If we apply those estimates to our “match” application coefficient, we can expect a 0.12 percentage point increase in the likelihood that college-bound students enroll at schools commensurate with their ability. Assuming no supply-side constraints or general equilibrium effects, this decreases the extent of under-match by about 3,500 students per year.48 48 Statistic derived from the Digest of Education Statistics 2013 count of first-time freshmen.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 29 References Autor, D. (2003). “Outsourcing at Will: The Contribution of Unjust Dismissal Doctrine to the Growth of Employment Outsourcing,” Journal of Labor Economics, 21(1): 1-42. Avery, C. and T. Kane (2004). Student Perceptions of College Opportunities: The Boston COACH Program. In C. Hoxby, ed. College Choices: The Economics of Where to Go, When to Go, and How to Pay for It. University of Chicago Press. Bailey M. and S. Dynarski (2011, December). Gains and Gaps: Changing Inequality in U.S. College Entry and Completion. Working Paper 17633, NBER. Belo, R., P. Ferreira, and R. Telang (2013). “Broadband in School: Impact on Student Performance,” Management Science. 60(2): 265-282. Bettinger, E. P., Long, B. T., Oreopoulos, P., & Sanbonmatsu, L. (2012). “The role of application assistance and information in college decisions: Results from the H&R Block FAFSA experiment,” Quarterly Journal of Economics, 127, 1205–1242. Bhuller, M., T. Havnes, E. Leuven and M. Mogstad (2011) “Broadband Internet: An Information Superhighway to Sex Crime?” IZA Discussion Papers Series No. 5675. Bowen, W., M. Chingos, and M. McPherson (2009). Crossing the Finish Line: Completing College at America’s Public Universities. Princeton University Press. Bulman, G. (forthcoming). “The Effect of Access to College Assessments on Enrollment and Attainment.” American Economic Journal: Applied Economics. Card, D (1995). “Using Geographic Variation in College Proximy to Estimate the Return to Schooling,” in Aspects of Labour Market Behaviour: Essays in Honour of John Vanderkamp, ed. by Louis N. Christofides, E. Kenneth Grant, and Robert Swidinsky. Toronto: University of Toronto Press, 201-222. Carrell, S. E. and B. Sacerdote (2013). “Late Interventions Matter Too: The Case of College Coaching in New Hampshire,” NBER Working Paper, 19031. Cohodes, S. and J. Goodman (2014). “Merit Aid, College Quality and College Completion: Massachusetts' Adams Scholarship as an In-Kind Subsidy,” American Economic Journal: Applied Economics. Dale, S. and A. Krueger (2002). “Estimating the Payoff to Attending a More Selective College: An Application of Selection on Observables and Unobservables.” Quarterly Journal of Economics 117(4), 1491–1527. Dale, S. and A. Krueger (2011, June). Estimating the Return to College Selectivity over the Career Using Administrative Earnings Data. Working Paper 17159, NBER.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 30 Dettling, L. (forthcoming) “Broadband in the labor market: The Impact of High-Speed Internet on Married Women’s Labor Force Participation,” Industrial and Labor Relations Review. Dillon, E. and J. Smith (2013). “The Determinants of Mismatch Between Students and Colleges,” NBER Working Paper. Fairlie, R. and J. Robinson (forthcoming). “Experimental Evidence on the Effects of Home Computers on Academic Achievement among Schoolchildren,” American Economic Journal: Applied Economics. Falck, O., R. Gold, and S. Heblich (2014). “E-lections: Voting Behavior and the Internet,” American Economic Review 104(1) pp.329-389 Faulhaber, G. R. (2002). “Broadband deployment: Is policy in the way?” In R. H. Crandall and J. H. Alleman (Eds.), Broadband: Should We Regulate High-Speed Internet Access, AEI-Brookings Joint Center for Regulatory Studies. pp. 223–244. Goodman, S. (2013) “Learning from the Test: Raising Selective College Enrollment by Providing Information” Finance and Economics Discussion Series, 2013-69. Greenstein, S. and J. Prince (2007) “The diffusion of the Internet and the geography of the digital divide in the United States” In D.Q. Mansell, Robin and R. Silverstone (Eds.), Oxford Handbook on ICTs, Oxford University Press. Grubesic, T. H. and A. T. Murray (2002). “Constructing the divide: Spatial disparities in Broadband access.” Papers in Regional Science 81, 197–221. Hoxby, C. and C. Avery (2012, December). The Missing "One-Offs": The Hidden Supply of High- Achieving, Low Income Students. Working Paper 18586, NBER. Hoxby, C. and S. Turner (2013). Expanding College Opportunities for High-Achieving, Low Income Students. Discussion Paper 12-014. Stanford Institute for Economic Policy Research. Hoxby, C. and S. Turner (2015). What High-Achieving Low-Income Students Know About College. Working Paper 20861, NBER. Hurwitz, M., J. Smith, S. Niu, and J. Howell (2015). “The Maine Question: How is Four-Year College Enrollment Affected by Mandatory College Entrance Exams?” Educational Evaluation and Policy Analysis. 37(1): 138-159. Kane, T. (1995, July). Rising Public College Tuition and College Entry: How Well Do Public Subsidies Promote Access to College? Working Paper 5164, NBER. Klasik, D. (2012). The college application gauntlet: A systematic analysis of the steps to four-year college enrollment. Research in Higher Education, 53(5), 506-549.
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 31 Klasik, D. (2013). “The ACT of enrollment: The college enrollment effects of required college entrance exam taking,” Educational Researcher, 42(3), 151-160. Luca, M. and J. Smith (2013). “Salience in Quality Disclosure: Evidence From U.S. News College Rankings.” Journal of Economics & Management Strategy. Vol. 22(1): 58-77. Pallais A (2015). “Small Differences that Matter: Mistakes in Applying to College.” Journal of Labor Economics 33(2):493-520. Pallais A. and S. Turner (2006). Opportunities for Low Income Students at Top Colleges and Universities: Policy Initiatives and the Distribution of Students. National Tax Journal 59(2), 357- 386. PEW Research (2014) “Internet use over time” http://www.pewinternet.org/data-trend/internetuse/internet-use-over-time/ Accessed September 30, 2014 Smith, J. M. Hurwitz, and J. Howell (2015). “Screening Mechanisms and Student Responses in the College Market,” Economics of Education Review, 44:17-28. Spies, R. (2001). “The Effect of Rising Costs on College Choice: The Fourth and Final in a Series of Studies on This Subject.” Princeton University Research Report Series, No. 117. United States Government Accountability Office (2006) “ Broadband deployment is extensive throughout the United States, but it is difficult to assess the extent of deployment gaps in rural areas” GAO Report 06-426 Vigdor, J., H. Ladd, and E. Martinez (2014). "Scaling The Digital Divide: Home Computer Technology And Student Achievement." Economic Inquiry 52(3): 1103-1119. Zimmerman, S. (2014). The Returns to College Admission for Academically Marginal Students Journal of Labor Economics 32(4), 711-754. Zucker, K. and A. Smith (2013). “Home Broadband 2013” August 26, 2013 http://www.pewinternet.org/2013/08/26/home-broadband-2013/
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 32 Tables and Figures Figure 1: Measures of High Speed Internet Availability and Usage Notes: Displayed are trends in high-speed Internet availability and usage according to different measures. PEW refers to surveyreported national usage rates. The two provider measures (>1 and >4) refer to implied aggregate availability rates, if availability is defined as having at least one or at least four providers in the zip code. These rates are weighted by zip code population. The coverage measure refers to our preferred measure of high-speed Internet availability, as described in the text. Source: Pew Research Center, Broadband vs. Dial-up Adoption Over Time; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 33 Figure 2: Zip Code Internet Penetration in 1999 and 2007 (A) December 1999 (B) December 2007 Notes: Displayed are zip code-level maps of Internet penetration. The light gray shading represents zip codes without coverage and the dark gray shading represents zip codes with coverage, where coverage is defined as our indicator for Internet availability, as described in the text. White areas are not included in the sample. Sources: FCC, Form 477 Data; U.S. Census Bureau, 2000 Census
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 34 Figure 3: Means of Selected Application Outcomes in 1996-1998 by Year High-Speed Internet Entered Zip Code Notes: Displayed are population-weighted means of the main application outcomes in the years just prior to the introduction of residential broadband Internet (1996 to 1998). Means were constructed by classifying zip codes according to the year in which high-speed Internet became available in the zip code. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 35 Figure 4: Estimated Effects of Broadband Availability in Junior Year of High School, Senior Year of High School, and Freshman Year of College on Selected Application Outcomes Notes: Displayed are coefficients and confidence intervals for the effect of broadband availability on application outcomes by year of schooling, estimated as described in the main text. The short bars bookend the 95% confidence interval, and the long bars bookend the 99% confidence interval. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 36 Table 1: Summary Statistics All Students (obs = 7,452,302) Std. Mean Dev. Min Max Student Demographics and Exam Scores White 0.672 0.470 0 1 Black 0.107 0.309 0 1 Latino/Hispanic 0.095 0.293 0 1 Other Race 0.126 0.332 0 1 Male 0.447 0.497 0 1 Parental Education (0: HS; 1: Some college) 0.841 0.365 0 1 Cumulative GPA 3.363 0.610 0 4.33 PSAT Score 99.40 19.83 40 160 SAT score 1049.63 204.85 400 1600 Zip Code/County Level Variables Broadband Availability - Dec. of Junior Year 0.3177 0.4656 0 1 Mean Adjusted Gross Income 76,946 53,646 13,370 2,791,601 Population 4,111 9,799 - 132,944 Unemployment Rate 5 2 - 31 Median Home Price 192,769 118,034 - 943,877 Application Outcomes Applied to a 4-Year College 0.7897 0.4075 0 1 Applied to at least 5 Colleges 0.4023 0.4904 0 1 Applied to a College with an Avg. SAT of 1200 0.5032 0.5000 0 1 Applied to a College with an Avg. SAT of 1300 0.2972 0.4570 0 1 Applied to a Selective Liberal Arts College 0.1226 0.3280 0 1 Applied to a Top Liberal Arts College 0.0719 0.2583 0 1 Applied to a Match College 0.7071 0.4551 0 1 Applied to an In-State Flagship 0.2956 0.4563 0 1 Applied to an Out-of-State College 0.4920 0.4999 0 1 Notes: Sample is all SAT takers, high school graduating cohorts 2001 to 2008. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 37 Table 2: Effect of Broadband Internet Availability in Junior Year on Testing and Application Outcomes Selection into SAT-taking Took the SAT PSAT Score (among (among PSAT-takers) SAT-takers) Broadband Access 0.00049 -0.01789 (0.00081) (0.03232) Observations 12,556,552 7,452,302 Main Testing and Application Outcomes Applied to Applied to a College a College Applied to Applied to a Applied to a with an with an at least 5 4-Year Match Avg. SAT Avg. SAT SAT Score Colleges College College of 1200 of 1300 Broadband 0.69798*** 0.00327*** 0.00357*** 0.00292*** 0.00361*** 0.00055 Access (0.16392) (0.00084) (0.00081) (0.00086) (0.00103) (0.00074) Observations 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 Applied to a Applied to Applied to an Top Liberal the Instate Out-of-State Arts College Flagship College Broadband Access 0.00172*** 0.00109 0.00321*** (0.00040) (0.00080) (0.00083) Observations 7,452,302 7,452,302 7,452,302 Notes: Standard errors adjusted for clustering at the zip code-level are in parentheses (* p<.10 ** p<.05 *** p<.01). Sample includes students who took both the PSAT and SAT tests in the high school graduating cohorts of 2001 to 2008, except in the first column of the first panel for which the sample is students who took the PSAT test. Estimates are from separate regressions that include zip code and year fixed effects, as well as controls for gender, race, high school GPA, PSAT verbal and math sections, and time-varying zip code characteristics. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
Running head: HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 1 Table 3: Event-Study Estimates of the Effect of Broadband Availability on Selected Application Outcomes (1) (2) (3) (4) (5) (6) (7) Apply to a Apply 5+ Apply to a Apply to a Apply to a Highly Apply Out SAT Score Schools 4 Year Selective Match Ranked LA of State Broadband Available… …in 8 years 0.21219 -0.00496 0.00048 0.00117 0.00030 -0.00312 0.00208 (0.88115) (0.00529) (0.00578) (0.00591) (0.00576) (0.00266) (0.00524) …in 7 years -0.14927 -0.00143 0.00340 0.01729*** 0.00094 0.00164 0.00200 (0.62943) (0.00351) (0.00333) (0.00447) (0.00362) (0.00151) (0.00371) …in 6 years 0.05230 -0.00382* -0.00312 0.00275 -0.00267 -0.00113 0.00099 (0.44878) (0.00216) (0.00214) (0.00259) (0.00227) (0.00096) (0.00225) - …in 5 years -0.38327 0.00525*** -0.00281 0.00445* -0.00173 -0.00010 0.00045 (0.38061) (0.00184) (0.00183) (0.00233) (0.00196) (0.00091) (0.00189) - …in 4 years -0.10911 0.00505*** -0.00338** 0.00039 -0.00233 0.00030 -0.00102 (0.30570) (0.00153) (0.00147) (0.00186) (0.00158) (0.00072) (0.00153) - …in 3 years -0.07731 0.00480*** -0.00229* 0.00482*** -0.00169 0.00109 -0.00064 (0.25603) (0.00127) (0.00121) (0.00161) (0.00129) (0.00066) (0.00126) …in 2 years -0.18200 -0.00190* 0.00125 0.00248** 0.00132 0.00033 -0.00040 (0.22331) (0.00109) (0.00098) (0.00125) (0.00108) (0.00057) (0.00107) …in 1 year -- -- -- -- -- -- -- -- -- -- -- -- -- -- …for 0 years 0.44932** 0.00192* 0.00339*** 0.00553*** 0.00287*** 0.00120** 0.00363*** (0.20466) (0.00100) (0.00092) (0.00119) (0.00100) (0.00054) (0.00099) …for 1 year 1.43066*** 0.00305*** 0.00507*** 0.00657*** 0.00492*** 0.00338*** 0.00464*** (0.23567) (0.00115) (0.00110) (0.00144) (0.00119) (0.00060) (0.00113) …for 2 years 1.58830*** 0.00525*** 0.00626*** 0.01168*** 0.00658*** 0.00465*** 0.00693*** (0.28819) (0.00142) (0.00134) (0.00170) (0.00145) (0.00069) (0.00138) …for 3 years 1.86798*** 0.00954*** 0.00985*** 0.00840*** 0.00955*** 0.00488*** 0.00996*** (0.32671) (0.00166) (0.00160) (0.00195) (0.00172) (0.00077) (0.00161) …for 4 years 3.40615*** 0.01503*** 0.01402*** 0.01144*** 0.01373*** 0.00528*** 0.01633*** (0.38963) (0.00201) (0.00191) (0.00227) (0.00206) (0.00089) (0.00191) …for 5 years 2.45957*** 0.02514*** 0.02547*** 0.01729*** 0.02408*** 0.00304*** 0.02534*** (0.45137) (0.00234) (0.00228) (0.00258) (0.00245) (0.00100) (0.00222) …for 6 years 3.53663*** 0.03145*** 0.03433*** 0.02391*** 0.03320*** 0.00721*** 0.02903*** (0.55269) (0.00289) (0.00275) (0.00299) (0.00295) (0.00130) (0.00268) …for 7 years 4.38627*** 0.02475*** 0.02600*** 0.01900*** 0.02491*** 0.00526** 0.02662*** (0.86324) (0.00456) (0.00418) (0.00460) (0.00455) (0.00232) (0.00444) Observ ations 7,452,302 7,452 ,302 7,452 ,302 7,452 ,302 7,452 ,302 7,452 ,302 7,452 ,302 Notes: Each column represents a separate OLS regression whereby the specification is as in Table 2, except the single broadband availability dummy is replaced by a series of dummies for the years prior to and following the introduction of the broadband. The dummy for the year immediately preceding introduction is excluded. Standard errors adjusted for clustering at the zip code-level are in parentheses (* p<.10 ** p<.05 *** p<.01). Sample includes students who took both the PSAT and SAT tests in the high school graduating cohorts of 2001 to 2008. Regressions include zip code and year fixed effects, as well as controls for gender, race, high school GPA, PSAT verbal and math sections, and time-varying zip code characteristics and a constant. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 2 Table 4: Effect of Broadband Internet Availability in Junior Year on Testing and Application Outcomes, SAT States Only Applied to a Applied to at College with Applied to a Applied to an least 5 Applied to a 4- an Avg. SAT Applied to a Top Liberal Out-of-State SAT Score Colleges Year College of 1200 Match College Arts College College Broadband Access 0.70417*** 0.00348*** 0.00430*** 0.00200* 0.00091 0.00148*** 0.00408*** (0.18552) (0.00094) (0.00092) (0.00116) (0.00083) (0.00044) (0.00092) Observations 6,080,456 6,080,456 6,080,456 6,080,456 6,080,456 6,080,456 6,080,456 Notes: Standard errors adjusted for clustering at the zip code-level are in parentheses (* p<.10 ** p<.05 *** p<.01). Sample includes students who took both the PSAT and SAT tests in the high school graduating cohorts of 2001 to 2008. Estimates are from separate regressions that include zip code and year fixed effects, as well as controls for gender, race, high school GPA, PSAT verbal and math sections, and time-varying zip code characteristics. SAT states are defined as states in which the SAT is the dominant exam. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 3 Table 5: Effect of Broadband Internet Availability in Junior Year on Testing and Application Outcomes, Controlling for SAT Score Applied to Applied to a a Top Applied to Applied to a Applied to at College with Liberal an Out-of- Applied to at least 5 4-Year least 5 an Avg. SAT Applied to Arts State Colleges College Colleges of 1200 Match College College Broadband Access 0.00286*** 0.00325*** 0.00286*** 0.00313*** 0.00292*** 0.00154*** 0.00280*** (0.00083) (0.00081) (0.00083) (0.00102) (0.00086) (0.00040) (0.00082) Percent of initial coefficient 80 99 80 87 100 90 87 Observations 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 Notes: Standard errors adjusted for clustering at the zip code-level are in parentheses (* p<.10 ** p<.05 *** p<.01). Sample includes students who took both the PSAT and SAT tests in the high school graduating cohorts of 2001 to 2008. Estimates are from separate regressions that include zip code and year fixed effects, as well as controls for gender, race, high school GPA, PSAT verbal and math sections, and time-varying zip code characteristics. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 4 Table 6: Effect of Broadband Internet Availability in Senior Year on Testing and Application Outcomes Applied to a Applied to at College with Applied to a Applied to an least 5 Applied to a 4- an Avg. SAT Applied to a Top Liberal Out-of-State SAT Score Colleges Year College of 1200 Match College Arts College College Broadband Access -0.00338 0.00336*** 0.00182** -0.00036 0.00298*** 0.00001 0.00010 (0.17111) (0.00085) (0.00083) (0.00101) (0.00076) (0.00039) (0.00085) Observations 7,452,121 7,452,121 7,452,121 7,452,121 7,452,121 7,452,121 7,452,121 Notes: Standard errors adjusted for clustering at the zip code-level are in parentheses (* p<.10 ** p<.05 *** p<.01). Sample includes students who took both the PSAT and SAT tests in the high school graduating cohorts of 2001 to 2008. Estimates are from separate regressions that include zip code and year fixed effects, as well as controls for gender, race, high school GPA, PSAT verbal and math sections, and time-varying zip code characteristics. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 5 Table 7: Effects of Broadband Internet Availability in Junior Year on Testing and Application Outcomes by Group Applied to Applied to a College a Top Applied to Applied to Applied to with an Applied to Liberal an Out-of- SAT a 4-Year at least 5 Avg. SAT a Match Arts State Score College Colleges of 1200 College College College Below median income zip Broadband Access 0.55126** -0.00048 0.00054 0.00319* -0.00109 0.00018 -0.00025 (0.26502) (0.00135) (0.00135) (0.00167) (0.00150) (0.00051) (0.00135) Observations 3,065,569 3,065,569 3,065,569 3,065,569 3,065,569 3,065,569 3,065,569 Above median income zip Broadband Access 0.22062 0.00369*** 0.00230** 0.00326** 0.00348*** 0.00169*** 0.00286*** (0.20831) (0.00099) (0.00106) (0.00133) (0.00105) (0.00056) (0.00103) Observations 4,381,979 4,381,979 4,381,979 4,381,979 4,381,979 4,381,979 4,381,979 Parent’s education is high school or less Broadband Access 1.18410*** -0.00009 0.00231 0.00200 -0.00160 0.00114* 0.00206 (0.37447) (0.00189) (0.00192) (0.00206) (0.00204) (0.00061) (0.00186) Observations 980,650 980,650 980,650 980,650 980,650 980,650 980,650 Parent’s education is some college or more Broadband Access 0.48472*** 0.00313*** 0.00288*** 0.00384*** 0.00291*** 0.00165*** 0.00261*** (0.17461) (0.00082) (0.00092) (0.00109) (0.00089) (0.00047) (0.00090) Observations 5,202,386 5,202,386 5,202,386 5,202,386 5,202,386 5,202,386 5,202,386
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 6 Table 7 Cont: Effects of Broadband Internet Availability in Junior Year on Testing and Application Outcomes by Group Applied to a College Applied to Applied to Applied to a Applied to with an Applied to a Top an Out-of- 4-Year at least 5 Avg. SAT a Match Liberal Arts State SAT Score College Colleges of 1200 College College College Black/Hispanic Broadband Access 0.50816 -0.00083 0.00203 -0.00074 -0.00139 -0.00038 0.00007 (0.36226) (0.00168) (0.00181) (0.00199) (0.00190) (0.00062) (0.00178) Observations 1,504,678 1,504,678 1,504,678 1,504,678 1,504,678 1,504,678 1,504,678 White/Other Broadband Access 0.46961*** 0.00320*** 0.00298*** 0.00422*** 0.00274*** 0.00208*** 0.00271*** (0.16955) (0.00085) (0.00089) (0.00109) (0.00090) (0.00045) (0.00088) Observations 5,947,624 5,947,624 5,947,624 5,947,624 5,947,624 5,947,624 5,947,624 Rural Broadband Access 0.60706 -0.00176 0.00286 -0.00034 -0.00333 -0.00038 0.00071 (0.40139) (0.00219) (0.00212) (0.00231) (0.00228) (0.00101) (0.00218) Observations 887,587 887,587 887,587 887,587 887,587 887,587 887,587 Urban Broadband Access 0.65184*** 0.00393*** 0.00302*** 0.00425*** 0.00345*** 0.00196*** 0.00302*** (0.17790) (0.00087) (0.00091) (0.00113) (0.00093) (0.00044) (0.00089) Observations 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 Notes: Standard errors adjusted for clustering at the zip code-level are in parentheses (* p<.10 ** p<.05 *** p<.01). Sample includes students who took both the PSAT and SAT tests in the high school graduating cohorts of 2001 to 2008. Estimates are from separate regressions that include zip code and year fixed effects, as well as controls for gender, race, high school GPA, PSAT verbal and math sections, and time-varying zip code characteristics. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 7 For Online Publication Appendix Appendix 1. Broadband Internet Penetration Variable Creation and Alternative Specifications To construct our measure of zip code-level broadband Internet access, we merged the zip codelevel FCC data on the number of ISPs in a zip code, to zip code-level information on (1) land area and (2) population. The land area data are from the 2000 Census. The population data come from a combination of 2000 census data and SOI income tax data, which we used to construct zip-codelevel population counts over time. We then constructed ratios of the number of ISPs to zip-code geographic size and population size. We assigned each zip code as urban or rural based on the fraction of its population which was urban, defining an urban zip code as one where more than 50 percent of the population is urban. We also collected national and sub-national trends broadband Internet usage from the CPS/PEW data. The CPS data come from the CPS computer and Internet supplements/school enrollment surveys (August 2000, September 2001, October 2003, 2007, 2009). The PEW Survey is collected at least annually and the month of the survey was recorded. We interpolated the between months so the timing was equivalent with the FCC data (which were collected in June/December). We constructed national trends out of both series, and a separate urban and rural trends from the CPS series based on whether an individual lived in a metropolitan statistical area. We then conduct the following exercise: 1) Identify a threshold for “coverage” (one provider per x square miles or one provider per y thousand people) 2) Construct an indicator which takes a value of one if threshold is in each zip code-year 3) Aggregate up using zip code population weights to construct national time series of high-speed Internet penetration 4) Estimate root mean squared error between CPS or PEW measure and #3 We then incrementally increase the threshold in step 1 and repeat steps 2 through 4. We tried the following combinations: one provider per 1-10,000 people (in intervals of 500 people), and one provider per 1-40 square miles (in intervals of one mile). We also allowed different thresholds for urban and rural zip codes (although we did not impose that they had to be different) by comparing to the CPS data. Finally, we minimized the root mean squared error (RMSE) and identified that threshold as the preferred definition of zip code broadband coverage. This is to define a rural zip code as “covered” when there is at least one provider per 12 square miles and an urban zip code as “covered” when there is at least one provider for every 2,700 people. For exposition, we also present results for alternative internet penetration concepts, applying each of the population-based (i.e. “one provider per 2700 people”) and mileage-based (i.e., one provider per 12 square miles) measures to all zip codes (Appendix Table 1), as well as for the urban and rural subsamples separately (Appendix Table 2). Appendix Table 1 demonstrates that
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 8 the results using the population-based measure (second panel) are qualitatively similar to those obtained using our main measure (first panel). The RMSE for this single measure, however, is about 45 percent larger than the RMSE for our preferred measure, suggesting the fit to usage is worse. The third panel presents the results using the mileage-based measure (i.e. “one provider per 12 square miles”). In this case, the results are quite different from the results obtained using our main measure. However, this could be attributable to the fact that this measure fits the usage data poorly, with a RMSE of 2.79 (300 percent larger than the RMSE for our preferred measure). To further examine this, in the fourth panel of Appendix Table 1 we alternatively estimate the model using a mileage-based measure that better-fits the data (one provider per one square mile, which has an RMSE of 1.39). In this case, we find more similar results to both the preferred measure and the population-based measure. Taken together, these results are affirming and supportive of our strategy: alternative measures that more closely match national usage trends tend to lead to more consistent and similar results. In Appendix Tables 2 and 3, we consider the alternative three measures from the prior table separately within the urban and rural zip code subsamples. First, it is clear from the RMSEs that the separate population and mileage-based measures used in the preferred measure fit the urban and rural samples better than the alternative measures do, with RMSEs of 0.85 and 0.5, respectively. In addition, Appendix Table 2 indicates that the mileage-based measure (i.e., “one provider per 12 square miles”) performs poorly for the urban sample, with an RMSE of 2.89, and leads to results that differ from the population-based results. However, as in the pooled results, the mileage-based measure that fits the usage data more closely (i.e., “one provider per one square mile”) leads to results which are more consistent with the main results. Finally, Appendix Table 3 confirms the null results for the rural subsample in Table 6. In each case, regardless of the fit of the measure, we see little to no effect of Internet access on our outcomes for the rural subsample. In Appendix Table 4, we test the strength of the relationship between the various measures of broadband availability described in the body of the paper (and above) and teen high-speed Internet usage. We obtained teen high-speed Internet usage rates from the CPS, which interviewed respondents about their broadband usage in August 2000, September 2001, October 2003, and October 2007. Since zip codes are not available for all CPS respondents, we aggregate our measures of availability to the state-level using population weights. We then match this to collapsed CPS data for 15-18 year old respondents, including their high speed Internet usage and demographic characteristics. To mimic our main estimating equation, we control for similar characteristics, including year and state fixed effects, race/ethnicity, sex, family income, and parental education. We also include measures of population density, home prices, and unemployment rates at the state-level, again to emulate our main equation. Appendix Table 4 indicates that the fraction of the state for which high-speed Internet is available by our main measure is a positive and significant predictor of teen high-speed Internet usage. Of the additional measures from Appendix Tables 2 and 3, the “improved fit” mileage measure (one provider per one square mile) is the only other statistically significant predictor of internet usage among teens. Moreover, of the measures we consider, our main measure, given the relatively small standard error, appears to admit the least noise. Columns 4 and 5 present estimates using the “at least one provider” in a zip code dummy variable and a linear measure of
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 9 the number of providers, respectively. Neither are significantly associated with high-speed Internet usage among teens (and the “at least one provider” measure is actually wrong-signed).
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 10 Appendix Figure 1a: Means of Additional Application Outcomes in 1996-1998 by Year High- Speed Internet Entered Zip Code Notes: Displayed are population weighted means of the main application outcomes in the years just prior to the introduction of residential broadband Internet (1996 to 1998). Means were constructed by classifying zip codes according to the year in which high-speed Internet became available in the zip code. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 11 Appendix Figure 1b: Estimated Effects of Broadband Availability in Junior Year, Senior Year and Freshman Year of College on Additional Application Outcomes. Notes: Displayed are coefficients and 99 percent confidence intervals for the effect of broadband availability on application outcomes by year of schooling, estimated according to equation 1. The short bars bookend the 95% confidence interval, and the long bars bookend the 99% confidence interval. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 12 Appendix Table 1: Alternative Measures of Zip Code-Level Broadband Access Applied to Applied to Applied to a College a College Applied to a Top Applied to Applied to Applied to with an with an Applied to a Selective Liberal Applied to an Out-ofat least 5 a 4-Year Avg. SAT Avg. SAT a Match Liberal Arts Arts the Instate State Colleges College of 1200 of 1300 College College College Flagship College Main Measure of Broadband Access = 1 provider per 2700 people if urban and 1 provider per 12 square miles if rural (RMSE=0.92) Broadband Access 0.00327*** 0.00357*** 0.00361*** 0.00055 0.00292*** -0.00041 0.00172*** 0.00109 0.00321*** (0.00084) (0.00081) (0.00103) (0.00074) (0.00086) (0.00050) (0.00040) (0.00080) (0.00083) Observations 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 Alternate Measure of Broadband Access = 1 provider per 2700 people (RMSE=1.34) Broadband Access 0.00134 0.00276*** 0.00342*** 0.00059 0.00228*** -0.00044 0.00139*** 0.00073 0.00236*** (0.00082) (0.00079) (0.00101) (0.00073) (0.00085) (0.00049) (0.00039) (0.00078) (0.00081) Observations 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 Alternate Measure of Broadband Access = 1 provider per 12 square miles (RMSE=2.79) Broadband Access -0.00351*** -0.00526*** -0.00114 -0.00028 -0.00586*** -0.00318*** -0.00148*** 0.00187* -0.00281** (0.00110) (0.00112) (0.00133) (0.00094) (0.00118) (0.00065) (0.00050) (0.00110) (0.00113) Observations 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 Alternate Measure of Broadband Access = 1 provider per 1 square mile (RMSE=1.39) Broadband Access 0.00359*** 0.00226** 0.00299** -0.00179* 0.00209* 0.00265*** 0.00056 -0.00151 -0.00063 (0.00118) (0.00115) (0.00140) (0.00097) (0.00120) (0.00067) (0.00054) (0.00114) (0.00114) Observations 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 Notes: Standard errors adjusted for clustering at the zip code-level are in parentheses (* p<.10 ** p<.05 *** p<.01). Sample includes students who took both the PSAT and SAT tests in the high school graduating cohorts of 2001 to 2008. Estimates are from separate regressions that include zip code and year fixed effects, as well as controls for gender, race, high school GPA, PSAT verbal and math sections, and time-varying zip code characteristics. RMSE refers to the fit of the measure with national trends in Internet usage from PEW. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index; PEW Research Center, Broadband vs. Dial-up Adoption Over Time
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 13 Appendix Table 2: Alternative Measures of Zip Code-Level Broadband Access for Urban Subsample Applied to Applied to Applied to a College a College Selective Applied Applied to Applied to Applied to with an with an Applied to Liberal Applied to a to the an Out-ofat least 5 a 4-Year Avg. SAT Avg. SAT a Match Arts Top Liberal Instate State Colleges College of 1200 of 1300 College College Arts College Flagship College Urban Students - Measure of Broadband Access = 1 provider per 2700 people (RMSE=0.85) Broadband Access 0.00302*** 0.00393*** 0.00425*** 0.00064 0.00345*** -0.00014 0.00196*** 0.00077 0.00302*** (0.00091) (0.00087) (0.00113) (0.00080) (0.00093) (0.00054) (0.00044) (0.00087) (0.00089) Observations 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 Measure of Broadband Access = 1 provider per 12 square miles (RMSE=2.89) Broadband Access -0.00468*** -0.00680*** -0.00067 -0.00061 -0.00666*** -0.00292*** -0.00156*** 0.00143 -0.00469*** (0.00128) (0.00130) (0.00158) (0.00110) (0.00138) (0.00075) (0.00058) (0.00130) (0.00131) Observations 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 Measure of Broadband Access = 1 provider per 1 square mile (RMSE=1.45) Broadband Access 0.00255** 0.00214* 0.00330** -0.00118 0.00189 0.00235*** 0.00020 -0.00111 -0.00081 (0.00119) (0.00117) (0.00143) (0.00098) (0.00122) (0.00068) (0.00055) (0.00116) (0.00115) Observations 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 6,564,715 Notes: Standard errors adjusted for clustering at the zip code-level are in parentheses (* p<.10 ** p<.05 *** p<.01). Sample includes students who took both the PSAT and SAT tests in the high school graduating cohorts of 2001 to 2008. Estimates are from separate regressions that include zip code and year fixed effects, as well as controls for gender, race, high school GPA, PSAT verbal and math sections, and time-varying zip code characteristics. RMSE refers to the fit of the measure with separate urban and rural national trends in usage from the CPS. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index; U.S. Census Bureau, CPS
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 14 Appendix Table 3: Alternative Measures of Zip Code-Level Broadband Access for Rural Subsample Applied to Applied to Applied to a College a College Selective Applied Applied to Applied to Applied to with an with an Applied to Liberal Applied to a to the an Out-ofat least 5 a 4-Year Avg. SAT Avg. SAT a Match Arts Top Liberal Instate State Colleges College of 1200 of 1300 College College Arts College Flagship College Alternate Measure of Broadband Access = 1 provider per 2700 people (RMSE=2.36) Broadband Access -0.00390** -0.00298 -0.00102 -0.00328** -0.00343* -0.00087 -0.00091 -0.00061 -0.00167 (0.00180) (0.00182) (0.00200) (0.00155) (0.00194) (0.00108) (0.00084) (0.00165) (0.00189) Observations 887,587 887,587 887,587 887,587 887,587 887,587 887,587 887,587 887,587 Main Measure of Broadband Access = 1 provider per 12 square miles (RMSE=0.50) Broadband Access 0.00286 -0.00176 -0.00034 -0.00117 -0.00333 -0.00178 -0.00038 0.00430** 0.00071 (0.00212) (0.00219) (0.00231) (0.00178) (0.00228) (0.00126) (0.00101) (0.00192) (0.00218) Observations 887,587 887,587 887,587 887,587 887,587 887,587 887,587 887,587 887,587 Alternate Measure of Broadband Access = 1 provider per 1 square miles (RMSE=2.06) Broadband Access 0.01195 0.00963 0.02461* 0.01168 0.01992 -0.00724 -0.00385 -0.00064 0.02048 (0.01556) (0.01351) (0.01328) (0.01343) (0.01346) (0.00872) (0.00695) (0.01141) (0.01359) Observations 887,587 887,587 887,587 887,587 887,587 887,587 887,587 887,587 887,587 Notes: Standard errors adjusted for clustering at the zip code-level are in parentheses (* p<.10 ** p<.05 *** p<.01). Sample includes students who took both the PSAT and SAT tests in the high school graduating cohorts of 2001 to 2008. Estimates are from separate regressions that include zip code and year fixed effects, as well as controls for gender, race, high school GPA, PSAT verbal and math sections, and time-varying zip code characteristics. RMSE refers to the fit of the measure with separate urban and rural national trends in usage from the CPS. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index; U.S. Census Bureau, CPS Appendix Table 4: Relationship between State-Level Broadband Availability and Teen High-Speed Internet Usage
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 15 1 Provider Per 1 Provider Per 12 Square 1 Provider Per At Least 1 Number of Main Measure 2700 People Miles Square Mile Provider Providers Fraction of State with Broadband 0.1447582** 0.101977 0.0428876 .2231739* -0.22539 0.0044884 (0.06821) (0.06945) (0.11755) (0.12215) (0.19505) (0.00555) N 204 204 204 204 204 204 Notes: Standard errors adjusted for clustering at the state-level are in parentheses (* p<.10 ** p<.05 *** p<.01). Sample includes 15-18 year old CPS respondents in 2000, 2001, 2003 and 2007. Estimates are from separate regressions that include state and year fixed effects, as well as controls for the fraction of teens in the state belonging to each gender, race/ethnicity, family income, and parental education group. Regressions are weighted by the sum of the CPS-provided weight in each cell. Also includes controls for time-varying state characteristics. Fraction of state with broadband is calculated by collapsing the zip code-level measures of availability denoted in the column headings, weighting by zip code population. Sources: U.S. Census Bureau, CPS; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 16 Appendix Table 5: Effects of Broadband Internet Availability in Junior Year on Testing and Application Outcomes, Controlling for Self-Reported Parental Education and Parental Income Applied Applied to to a a College College Applied to Applied Applied Applied to Applied to Applied to with an with an Applied to a Top to the to the In- an Out-ofat least 5 a 4-Year Avg. SAT Avg. SAT a Match Liberal Arts Instate State State SAT Score Colleges College of 1200 of 1300 College College Flagship Flagship College Broadband Access 0.64913*** 0.00289*** 0.00302*** 0.00325*** 0.00035 0.00242*** 0.00172*** 0.00095 0.00095 0.00271*** (0.16265) (0.00083) (0.00080) (0.00103) (0.00074) (0.00086) (0.00040) (0.00080) (0.00080) (0.00082) Observations 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 7,452,302 Notes: Standard errors adjusted for clustering at the zip code-level are in parentheses (* p<.10 ** p<.05 *** p<.01). Sample includes students who took both the PSAT and SAT tests in the high school graduating cohorts of 2001 to 2008. Estimates are from separate regressions that include zip code and year fixed effects, as well as controls for gender, race, high school GPA, PSAT verbal and math sections, and time-varying zip code characteristics. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
Running head: HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 17 Appendix Table 6: Event-Study Estimates of the Effect of Broadband Availability on Additional Application Outcomes (1) (2) Apply to a Very Selective Apply to In-State Flagship Broadband A vailable… …in 8 years -0.01039** 0.00183 (0.00420) (0.00526) …in 7 years -0.00791*** 0.00460 (0.00295) (0.00319) …in 6 years -0.00808*** -0.00137 (0.00200) (0.00218) …in 5 years -0.00472*** 0.00053 (0.00166) (0.00183) …in 4 years -0.00517*** -0.00141 (0.00139) (0.00149) …in 3 years -0.00365*** -0.00038 (0.00120) (0.00120) …in 2 years -0.00271*** 0.00133 (0.00104) (0.00100) …in 1 year -- -- -- -- …for 0 years -0.00215** 0.00170* (0.00101) (0.00093) …for 1 year 0.00007 0.00019 (0.00100) (0.00112) …for 2 years 0.00311** 0.00077 (0.00127) (0.00133) …for 3 years 0.00257* 0.00130 (0.00143) (0.00159) …for 4 years 0.00280* 0.00206 (0.00168) (0.00186) …for 5 years 0.00669*** 0.00658*** (0.00196) (0.00225) …for 6 years 0.01371*** 0.00963*** (0.00242) (0.00252) …for 7 years 0.01214*** 0.00504 (0.00379) (0.00407) Observations 7,452,302 7,452 ,302 Notes: Each column represents a separate OLS regression whereby the specification is as in Table 2, except the single broadband availability dummy is replaced by a series of dummies for the years prior to and following the introduction of the broadband. The dummy for the year immediately preceding introduction is excluded. Standard errors adjusted for clustering at the zip code-level are in parentheses (* p<.10 ** p<.05 *** p<.01). Sample includes students who took both the PSAT and SAT tests in the high school graduating cohorts of 2001 to 2008. Regressions include zip code and year fixed effects, as well as controls for gender, race, high school GPA, PSAT verbal and math sections, and time-varying zip code characteristics and a constant. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index
HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 18 Appendix Table 7: Event-Study Estimates of the Effect of Broadband Availability on Selected Application Outcomes, by Parent’s Education (1) (2) (3) (4) (5) (6) (7) Apply to a Apply 5+ Apply to a Apply to a Apply to a Highly Apply Out SAT Score Schools 4 Year Selective Match Ranked LA of State Parent’s Education is high school or less Broadband Available… …in 8 years -2.69527 0.00273 0.00393 0.00436 0.01214 0.00206 0.00931 (2.06484) (0.00979) (0.00999) (0.00985) (0.01079) (0.00315) (0.01039) …in 7 years 1.22467 0.00583 0.00766 0.02895*** 0.00783 -0.00079 -0.00007 (1.24179) (0.00691) (0.00682) (0.00867) (0.00737) (0.00214) (0.00718) …in 6 years 0.27380 0.00193 0.00041 0.00777 0.00111 -0.00251* 0.00310 (0.88926) (0.00451) (0.00445) (0.00492) (0.00483) (0.00145) (0.00467) …in 5 years 0.05135 0.00210 0.00247 0.01058** 0.00572 -0.00243* 0.00602 (0.79357) (0.00399) (0.00389) (0.00457) (0.00428) (0.00134) (0.00410) …in 4 years -0.43937 -0.00096 -0.00010 0.00090 0.00258 -0.00167 0.00022 (0.69606) (0.00349) (0.00341) (0.00381) (0.00371) (0.00114) (0.00359) …in 3 years -0.11566 -0.00306 0.00097 0.00560* 0.00171 -0.00052 -0.00299 (0.60716) (0.00303) (0.00300) (0.00339) (0.00324) (0.00104) (0.00319) …in 2 years -1.03429* -0.00473* 0.00349 0.00089 0.00542* -0.00059 -0.00353 (0.55573) (0.00274) (0.00270) (0.00299) (0.00291) (0.00095) (0.00283) …in 1 year -- -- -- -- -- -- -- -- -- -- -- -- -- -- …for 0 years 0.90140* 0.00057 0.00130 0.00408 0.00045 0.00067 0.00185 (0.53009) (0.00264) (0.00261) (0.00283) (0.00282) (0.00090) (0.00271) …for 1 year 0.57946 -0.00115 0.00388 0.00561* 0.00320 0.00030 0.00373 (0.56718) (0.00288) (0.00282) (0.00312) (0.00299) (0.00097) (0.00290) …for 2 years 0.67338 -0.00071 0.00333 0.01028*** 0.00242 0.00237** 0.00298 (0.65275) (0.00334) (0.00329) (0.00352) (0.00355) (0.00110) (0.00332) …for 3 years 1.45484** 0.00310 0.00724** 0.00329 0.00390 0.00250** 0.00419 (0.72561) (0.00362) (0.00359) (0.00387) (0.00385) (0.00120) (0.00358) …for 4 years 3.56488*** 0.00817* 0.00669 0.00216 0.00475 0.00519*** 0.01071*** (0.82624) (0.00430) (0.00417) (0.00454) (0.00450) (0.00137) (0.00410) …for 5 years 2.70453*** 0.02324*** 0.02226*** 0.01019* 0.01859*** 0.00405*** 0.02452*** (0.94430) (0.00497) (0.00496) (0.00523) (0.00533) (0.00154) (0.00491) …for 6 years 2.44160** 0.01964*** 0.03208*** 0.01793*** 0.02732*** 0.00373* 0.02190*** (1.15563) (0.00628) (0.00597) (0.00624) (0.00643) (0.00201) (0.00592) …for 7 years 6.73415*** -0.00054 0.02437** 0.01047 0.01219 -0.00128 0.02683** (2.34714) (0.01143) (0.01075) (0.01090) (0.01130) (0.00445) (0.01177) Observ ations 7,452,302 7,452 ,302 7,452 ,302 7,452 ,302 7,452 ,302 7,452 ,302 7,452 ,302
Running head: HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 19 Appendix Table 7 Cont.: Event-Study Estimates of the Effect of Broadband Availability on Selected Application Outcomes, by Parent’s Education (1) (2) (3) (4) (5) (6) (7) Apply to a Apply 5+ Apply to a 4 Apply to a Apply to a Highly Apply Out SAT Score Schools Year Selective Match Ranked LA of State Parent’s education is some college or more Broadband Available… …in 8 years 0.48685 -0.00991 -0.00192 -0.00236 -0.00437 -0.00476 -0.00221 (0.96073) (0.00612) (0.00566) (0.00688) (0.00590) (0.00313) (0.00580) …in 7 years -0.02511 -0.00407 0.00042 0.01367*** -0.00376 0.00209 0.00024 (0.67865) (0.00397) (0.00328) (0.00455) (0.00363) (0.00185) (0.00392) …in 6 years 0.14667 -0.00589** -0.00479** 0.00041 -0.00520** -0.00130 0.00036 (0.47089) (0.00242) (0.00217) (0.00279) (0.00233) (0.00113) (0.00246) …in 5 years -0.27409 -0.00743*** -0.00461** 0.00227 -0.00427** 0.00011 -0.00041 (0.40158) (0.00208) (0.00186) (0.00247) (0.00203) (0.00107) (0.00212) …in 4 years 0.03056 -0.00588*** -0.00478*** -0.00070 -0.00409** -0.00001 -0.00100 (0.33162) (0.00176) (0.00152) (0.00202) (0.00166) (0.00086) (0.00172) …in 3 years 0.04875 -0.00513*** -0.00274** 0.00451** -0.00214 0.00101 -0.00013 (0.27824) (0.00146) (0.00126) (0.00176) (0.00139) (0.00079) (0.00142) …in 2 years -0.06748 -0.00192 0.00045 0.00255* 0.00037 0.00006 -0.00079 (0.24856) (0.00127) (0.00106) (0.00141) (0.00120) (0.00068) (0.00124) …in 1 year -- -- -- -- -- -- -- -- -- -- -- -- -- -- …for 0 years 0.30206 0.00228** 0.00326*** 0.00582*** 0.00290*** 0.00095 0.00360*** (0.22960) (0.00115) (0.00098) (0.00131) (0.00111) (0.00065) (0.00114) …for 1 year 1.38254*** 0.00393*** 0.00574*** 0.00742*** 0.00596*** 0.00324*** 0.00446*** (0.25977) (0.00131) (0.00115) (0.00157) (0.00128) (0.00072) (0.00126) …for 2 years 1.57484*** 0.00770*** 0.00691*** 0.01328*** 0.00781*** 0.00422*** 0.00753*** (0.31215) (0.00158) (0.00137) (0.00184) (0.00151) (0.00082) (0.00152) …for 3 years 1.67708*** 0.01207*** 0.01079*** 0.01103*** 0.01065*** 0.00303*** 0.01202*** (0.34181) (0.00182) (0.00162) (0.00208) (0.00174) (0.00090) (0.00176) …for 4 years 3.09785*** 0.01821*** 0.01615*** 0.01517*** 0.01574*** 0.00404*** 0.01880*** (0.40655) (0.00221) (0.00192) (0.00238) (0.00206) (0.00105) (0.00208) …for 5 years 2.29152*** 0.02825*** 0.02649*** 0.02146*** 0.02567*** 0.00200* 0.02765*** (0.46924) (0.00256) (0.00229) (0.00271) (0.00244) (0.00119) (0.00244) …for 6 years 2.49761*** 0.03608*** 0.03342*** 0.02652*** 0.03194*** 0.00342** 0.02968*** (0.56957) (0.00316) (0.00279) (0.00316) (0.00295) (0.00152) (0.00295) …for 7 years 2.74529*** 0.03229*** 0.02714*** 0.02358*** 0.02617*** 0.00265 0.03010*** (0.92528) (0.00513) (0.00451) (0.00522) (0.00478) (0.00267) (0.00500) Observ ations 7,452,302 7,452 ,302 7,452 ,302 7,452 ,302 7,452 ,302 7,452 ,302 7,452 ,302 Notes: Each column represents a separate OLS regression whereby the specification is as in Table 2, except the single broadband availability dummy is replaced by a series of dummies for the years prior to and following the introduction of the broadband. The dummy for the year immediately preceding introduction is excluded. Standard errors adjusted for clustering at the zip code-level are in parentheses (* p<.10 ** p<.05 *** p<.01). Sample includes students who took both the PSAT and SAT tests in the high school graduating cohorts of 2001 to 2008. Regressions include zip code and year fixed effects, as well as controls for gender, race, high school GPA, PSAT verbal and math sections, and time-varying zip code characteristics and a constant. Sources: College Board data; FCC, Form 477 Data; U.S. Census Bureau, 2000 Census; IRS, SOI Tax Statistics; BLS, Local Area Unemployment Statistics; FHFA Housing Price Index.
Running head: HIGH-SPEED INTERNET AND COLLEGE APPLICATIONS 20
Cite this document
Lisa J. Dettling, Sarena F. Goodman, & and Jonathan Smith (2015). Every Little Bit Counts: The Impact of High-speed Internet on the Transition to College (FEDS 2015-108). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2015-108
@techreport{wtfs_feds_2015_108,
author = {Lisa J. Dettling and Sarena F. Goodman and and Jonathan Smith},
title = {Every Little Bit Counts: The Impact of High-speed Internet on the Transition to College},
type = {Finance and Economics Discussion Series},
number = {2015-108},
institution = {Board of Governors of the Federal Reserve System},
year = {2015},
url = {https://whenthefedspeaks.com/doc/feds_2015-108},
abstract = {This paper investigates the effects of high-speed Internet on students' college application decisions. We link the diffusion of zip code-level residential broadband Internet to millions of PSAT and SAT takers' college testing and application outcomes and find that students with access to high-speed Internet in their junior year of high school perform better on the SAT and apply to a higher number and more expansive set of colleges. Effects appear to be concentrated among higher-SES students, indicating that while, on average, high-speed Internet improved students' postsecondary outcomes, it may have increased pre-existing inequities by primarily benefiting those with more resources.},
}