Search for EdWorkingPapers here by author, title, or keywords.
Can schools that boost student outcomes reproduce their success at new campuses? We study a policy reform that allowed effective charter schools in Boston, Massachusetts to replicate their school models at new locations. Estimates based on randomized admission lotteries show that replication charter schools generate large achievement gains on par with those produced by their parent campuses. The average effectiveness of Boston’s charter middle school sector increased after the reform despite a doubling of charter market share. An exploration of mechanisms shows that Boston charter schools reduce the returns to teacher experience and compress the distribution of teacher effectiveness, suggesting the highly standardized practices in place at charter schools may facilitate replicability.
Only half of SAT-takers retake the exam, with even lower retake rates among low income and underrepresented minority (URM) students. We exploit discontinuous jumps in retake probabilities at multiples of 100, driven by left-digit bias, to estimate retaking’s causal effects. Retaking substantially improves SAT scores and increases four-year college enrollment rates, particularly for low income and URM students. Eliminating disparities in retake rates could close up to 10 percent of the income-based gap and up to seven percent of the race-based gap in four-year college enrollment rates of high school graduates.
We investigate the determinants of high school completion and college attendance, the likelihood of taking science, technology, engineering or math (STEM) courses in the first year of college and the probability of earning a degree in a STEM field. The focus is on women, who tend to be under-represented in STEM fields. Tracking four cohorts of students throughout Florida, women perform nearly as well as men on math achievement tests through high school and are more likely to finish high school and attend college than males. Among college students, however, women are less likely than are men to take courses in the physical sciences in their first year and are less likely to earn a degree in physics or engineering, even after adjusting for pre-college test scores. Gender matching of students and math/science teachers in middle and high school tends to increase the likelihood that female college freshman will take at least one STEM course. However, conditional on first-year coursework, neither gender matching at the secondary or college levels appears to have any effect on the likelihood of completing a major in a STEM field. For all students, having high school math and physics teachers with a degree in math or physics, respectively, (as opposed to education) is associated with a higher likelihood of taking STEM courses as college freshmen.
We demonstrate that heat inhibits learning and that school air-conditioning may mitigate this effect. Student fixed effects models using 10 million PSAT-retakers show hotter school days in years before the test reduce scores, with extreme heat being particularly damaging. Weekend and summer temperature has little impact, suggesting heat directly disrupts learning time. New nationwide, school-level measures of air-conditioning penetration suggest patterns consistent with such infrastructure largely offsetting heat’s effects. Without air-conditioning, a 1°F hotter school year reduces that year’s learning by one percent. Hot school days disproportionately impact minority students, accounting for roughly five percent of the racial achievement gap.
Free and reduced-price meal (FRM) data are used ubiquitously to proxy for student disadvantage in education research and policy applications. The Community Eligibility Provision (CEP)—a recently-implemented policy change to the federally-administered National School Lunch Program—allows schools serving low-income populations to identify all students as FRM-eligible regardless of individual circumstances. We study the CEP’s effect on FRM eligibility as a proxy for student disadvantage, and relatedly, we examine the viability of direct certification (DC) status as an alternative disadvantage measure. Our findings on whether the CEP degrades the informational content of FRM data are mixed. At the individual level there is essentially no effect, but the CEP does meaningfully change the information conveyed by the FRM-eligible share of students in a school. Our comparison of FRM and DC data in the post-CEP era shows that these measures are similarly informative as proxies for disadvantage, despite the CEP-induced information loss in FRM data. Using both measures together can improve the identification of disadvantaged students, but only marginally.
Teachers’ impact on student long-run success is only partially explained by their contributions to students’ short-run academic performance. For this study, we explore a second dimension of teacher effectiveness by creating measures of teachers’ contributions to student class-attendance. We find systematic variation in teacher effectiveness at reducing unexcused class absences at the middle and high school level. These differences across teachers are as stable as those for student achievement, but teacher effectiveness on attendance only weakly correlates with their effects on achievement. We link these measures of teacher effectiveness to students’ long-run outcomes. A high value-added to attendance teacher has a stronger impact on students’ likelihood of finishing high school than does a high value-added to achievement teacher. Moreover, high value-added to attendance teachers can motivate students to pursue higher academic goals as measured by Advanced Placement course taking. These positive effects are particularly salient for low-achieving and low-attendance students.
We present results from a meta-analysis of 95 experimental and quasi-experimental preK-12 science, technology, engineering, and mathematics (STEM) professional development and curriculum programs, seeking to understand what content, activities and formats relate to stronger student outcomes. Across rigorously conducted studies, we found an average weighted impact estimate of +0.21 standard deviations. Programs saw stronger outcomes when they helped teachers learn to use curriculum materials; focused on improving teachers' content knowledge, pedagogical content knowledge and/or understanding of how students learn; incorporated summer workshops; and included teacher meetings to troubleshoot and discuss classroom implementation. We discuss implications for policy and practice.
Sixty-seven school finance reforms (SFRs) in 26 states have taken place since 1990; however, there is little empirical evidence on the heterogeneity of SFR effects. We provide a comprehensive description of how individual reforms affected resource allocation to low- and high-income districts within states, including both financial and non-financial outcomes. After summarizing the heterogeneity of individual SFR impacts, we then examine its correlates, identifying both policy and legislative/political factors. Taken together, this research aims to provide a rich description of variation in states’ responses to SFRs, as well as explanation of this heterogeneity as it relates to contextual factors.
Text-message based parenting programs have proven successful in improving parental engagement and preschoolers’ literacy development. The tested programs have provided a combination of (a) general information about important literacy skills, (b) actionable advice (i.e., specific examples of such activities), and (c) encouragement. The regularity of the texts – each week throughout the school year – also provided nudges to focus parents’ attention on their children. This study seeks to identify mechanisms of the overall effect of such programs. It investigates whether the actionable advice alone drives previous study’s results and whether additional texts of actionable advice improve program effectiveness. The findings provide evidence that text messaging programs can supply too little or too much information. A single text per week is not as effective at improving parenting practices as a set of three texts that also include information and encouragement, but a set of five texts with additional actionable advice is also not as effective as the three-text approach. The results on children’s literacy development depend strongly on the child’s pre-intervention literacy skills. For children in the lowest quarter of the pre-treatment literacy assessments, only providing one example of an activity decreases literacy scores by 0.15 standard deviations relative to the original intervention. Literacy scores of children in higher quarters are marginally higher with only one tip per week. We find no positive effects of increasing to five texts per week.
Researchers commonly interpret effect sizes by applying benchmarks proposed by Cohen over a half century ago. However, effects that are small by Cohen’s standards are often large in the context of field-based education interventions. This focus on magnitude also obscures important differences in study features, program costs, and scalability. In this paper, I propose a new framework for interpreting effect sizes of education interventions, which consists of five broadly applicable guidelines and a detailed schema for interpreting effects from causal studies with standardized achievement outcomes. The schema introduces new effect-size and cost benchmarks, while also considering program scalability. Together, the framework provides scholars and research consumers with an empirically-based, practical approach for interpreting the policy importance of effect sizes from education interventions.