Search EdWorkingPapers

Search EdWorkingPapers by author, title, or keywords.

K-12 Education

Alvin Makori, Patricia Burch, Susanna Loeb.

High-impact tutoring has emerged as a primary school district investment for addressing learning loss that occurred during the COVID-19 pandemic. While existing research shows that high-impact tutoring is effective for accelerating student learning, this study examined the school-level facilitators and barriers to scaling high-impact tutoring. Situated in an urban traditional school district and an urban charter management organization, we collected survey and interview data from teachers and administrators to identify scaling challenges. Major barriers to scaling included time and space constraints, tutor supply and quality, updated data systems, and school level costs, while a key facilitator was teacher buy-in. We end the paper with recommendations for how districts can strategically grow their high-impact tutoring efforts.

More →


David M. Houston, Alyssa Barone.

What happens to public opinion when prominent partisan officials intervene in education policy debates? We analyzed the results of 18 survey experiments conducted between 2009 and 2021 with nationally representative samples of U.S. adults. Each experiment explored the effect of an endorsement of a specific education policy by a high-profile partisan official on the public’s attitudes toward that policy. Our results indicated that the engagement of such officials in education policy issues typically did little to move public opinion in the direction of the cue-giver’s preferred policies. Instead, the chief consequence was increased polarization among the public along partisan lines. A key exception applied to endorsements of policies that diverged from the traditional position of the cue-giver’s own party, which tended to shift aggregate public opinion modestly in favor of those policies. Such cross-party cues also had minor de-polarizing consequences.

More →


Christopher D. Brooks, Matthew G. Springer.

We analyzed the proposed spending data for the American Recovery Plan’s Elementary and Secondary Emergency Relief III (ESSER III) fund from the spring of 2021 of nearly 3,000 traditional public-school districts in the United States to (1) identify trends in the strategies adopted and (2) to test whether spending strategies were observably heterogeneous across district characteristics. We found that districts proposed a breadth of spending patterns with ESSER III. Moreover, there was a clear prioritization on spending related to academic learning recovery and facilities and operations spending, with the latter being particularly emphasized in higher-poverty districts. This divergent spending pattern may have important equity implications for short-term academic learning recovery for students affected by the COVID-19 pandemic.

More →


Sarah Ruth Morris, Andy Parra-Martinez, Jonathan Wai, Robert Maranto.

This mixed-methods study synthesizes Standards-Based Grading (SBG) literature, analyzes 249 Arkansas administrators' survey responses using OLS regressions, and identifies themes through in-vivo coding of qualitative feedback. Results show more SBG support among liberal, elementary-level administrators in larger, economically diverse districts. Qualitative insights highlight structural barriers and mindsets against SBG, emphasizing its importance for mastery-focused assessment and grading alignment. These findings underscore the influence of principals' beliefs on SBG support and suggest researching the contextual and ideological factors influencing SBG's implementation.

More →


Joshua Bleiberg, Eric Brunner, Erica Harbatkin, Matthew A. Kraft, Matthew G. Springer.

Federal incentives and requirements under the Obama administration spurred states to adopt major reforms to their teacher evaluation systems. We examine the effects of these reforms on student achievement and attainment at a national scale by exploiting their staggered implementation across states. We find precisely estimated null effects, on average, that rule out impacts as small as 0.017 standard deviations for achievement and 1.2 percentage points for high school graduation and college enrollment. We highlight five factors that likely limited the efficacy of teacher evaluation at scale: political opposition, decentralization, capacity constraints, limited generalizability, and the absence of compensating wages.

More →


Matthew A. Kraft, Sarah Novicoff.

We examine the fundamental and complex role that time plays in the learning process. We begin by developing a conceptual framework to elucidate the multiple obstacles schools face in converting total time in school into active learning time. We then synthesize the causal research and document a clear positive effect of additional time on student achievement typically of small to medium magnitude depending on dosage, use, and context. Further descriptive analyses reveal how large differences in the length of the school day and year across public schools are an underappreciated dimension of educational inequality in the United States. Finally, our case study of time loss in one urban district demonstrates the potential to substantially increase instructional time within existing constraints.

More →


Isaac M. Opper, Umut Özek.

We use a marginal treatment effect (MTE) representation of a fuzzy regression discontinuity setting to propose a novel estimator. The estimator can be thought of as extrapolating the traditional fuzzy regression discontinuity estimate or as an observational study that adjusts for endogenous selection into treatment using information at the discontinuity. We show in a frequentest framework that it is consistent under weaker assumptions than existing approaches and then discuss conditions in a Bayesian framework under which it can be considered the posterior mean given the observed conditional moments. We then use this approach to examine the effects of early grade retention. We show that the benefits of early grade retention policies are larger for students with lower baseline achievement and smaller for low-performing students who are exempt from retention. These findings imply that (1) the benefits of early grade retention policies are larger than have been estimated using traditional fuzzy regression discontinuity designs but that (2) retaining additional students would have a limited effect on student outcomes.

More →


Deven Carlson, Adam Shepardson.

As students are exposed to extreme temperatures with ever-increasing frequency, it is important to understand how such exposure affects student learning. In this paper we draw upon detailed student achievement data, combined with high-resolution weather records, to paint a clear portrait of the effect of temperature on student learning across a six-year period for students in Tulsa, Oklahoma. The detailed, longitudinal nature of our data allows us to estimate the effects of both test-day and longer-term temperature on student test performance, and to examine how the effects of both temperature measures vary across seasons, student background, and the distribution of student achievement. Our results show that test-day temperature has no significant effect on student test performance in fall or winter, but a clear negative effect on students’ spring performance, particularly in math. Second, we find that summer temperature has a positive, statistically significant, and substantively meaningful effect on student performance on the fall MAP assessment—these effects appear in both math and reading. The results also illustrate that 90-day temperature affects math performance in winter and spring, but these estimates are modest in substantive magnitude.

More →


Brian A. Jacob.

Media reports suggest that parent frustration with COVID school policies and the growing politicization of education have increased community engagement with local public schools. However, there is no evidence to date on whether these factors have translated into greater engagement at the ballot box. This paper uses a novel data set to explore how school board elections changed following the start of the COVID-19 pandemic. I find that school board elections post-COVID were more likely to be contested, and that voter turnout in contested elections increased. These changes were large in magnitude and varied with several district characteristics.

More →


Joshua B. Gilbert.
When analyzing treatment effects on test scores, researchers face many choices and competing guidance for scoring tests and modeling results. This study examines the impact of scoring choices through simulation and an empirical application. Results show that estimates from multiple methods applied to the same data will vary because two-step models using sum or factor scores provide attenuated standardized treatment effects compared to latent variable models. This bias dominates any other differences between models or features of the data generating process, such as the use of scoring weights. An errors-in-variables (EIV) correction removes the bias from two-step models. An empirical application to data from a randomized controlled trial demonstrates the sensitivity of the results to model selection. This study shows that the psychometric principles most consequential in causal inference are related to attenuation bias rather than optimal scoring weights.

More →