EdWorkingPapers
Remote Learning in 2020-21 and Student Attendance Since the COVID-19 Pandemic
Student attendance declined during the COVID-19 pandemic and remains lower than pre-pandemic levels. This study examines the role of remote learning in these post-pandemic declines in student attendance. I find that remote learning in 2020-21 led to persistent declines in post-pandemic attendance, with generally larger negative effects for students exposed to longer periods of remote learning… more →
The Effect of Air Pollution on Student Achievement: A Systematic Review and Meta-Analysis of the Causal Evidence
Air pollution is one of the most pressing global public health challenges of the 21st century. This article presents a systematic review and meta-analysis of the best available evidence of the effect of air pollution on student achievement. A meta-analysis of 28 causal studies around the world yielding 62 effect sizes estimates that air pollution, across many contexts and pollutants, decreases… more →
Gifted Identification Across the Distribution of Family Income
Currently, 6.1 percent of K-12 students in the United States receive gifted education. Using education and IRS data that provide information on students and their family income, we show pronounced differences in who schools identify as gifted across the distribution of family income. Under 4 percent of students in the lowest income percentile are identified as gifted, compared with 20 percent… more →
School-Based Disability Identification Varies by Student Family Income
Currently, 18 percent of K-12 students in the United States receive additional supports through the identification of a disability. Socioeconomic status is viewed as central to understanding who gets identified as having a disability, yet limited large-scale evidence examines how disability identification varies for students from different income backgrounds. Using unique data linking… more →
A Longitudinal Study of External Contract Teacher Employment in Washington State School Districts
This study examines the phenomenon of external teacher contracting in Washington State schools. Using administrative data, we analyze shifting patterns of employment among external contract teachers. External contract teachers now represent a significant portion of the workforce in a few districts, but a very small portion statewide. These districts have formed robust online programs that may… more →
The Nation’s Achievement Inequality Report Card: An Assessment of Test Score and Equality Trends in Traditional Public, Charter, Catholic, and Department of Defense Schools
We present a descriptive comparison of trends in achievement and inequality in traditional public, public charter, Catholic, and Department of Defense schools in the U.S. Our sample includes 6,155,570 observations for 4th and 8th graders in math and reading between 2005 and 2024. We focus on changes in the 90th and 10th percentile scores of the students in those school sectors on the National… more →
The Chronic(les) of Absenteeism Measurement: Unpacking the Many Measures of Attendance and Evidence for a Lower Chronic Absenteeism Threshold
Chronic absenteeism has surged in recent years, drawing growing policy and research attention. However, a complicating factor often overlooked is that the measurement of absenteeism is inconsistent, with substantial researcher degrees of freedom. This study investigates how researchers’ measurement choices shape predictions of academic risk and how absenteeism can be more effectively… more →
COVID-19, School District Operations, and Student Academic Performance in Virginia
We use longitudinal student-level data and interrupted time series methods to examine the impact of the COVID-19 pandemic on mathematics achievement among 3rd-8th grade students in Virginia, a state that offered particularly low levels of access to in-person learning in the school reopening period. We find notably large negative initial effects on math in 2020-21, much greater in magnitude… more →
The Trade-off between Quality and Quantity: Evidence from a Field Experiment on Tutoring
High-dosage tutoring has the potential to substantially raise adolescent academic achievement. However, at scale, schools may not have the financial ability to deliver small-group tutoring frequently. In this paper, I test the relative importance of group size (quality) versus tutoring frequency (quantity). I evaluate the impact of an in-school math tutoring program in a middle school in the… more →
Running a Business in High School: Selection into the Virtual Enterprises Program
To better prepare high school students for the workforce, many schools and districts are building career and technical education coursework that provides students with the opportunity to deeply engage in work-based learning. Virtual Enterprises (VE) is a program where students open school-based enterprises, hold positions in the company (e.g., Chief Executive Officer, Marketing Director), sell… more →
Using experimental variation to examine the (co-)development of cognitive and social-emotional skills in early childhood
Questions about the stability of psychological constructs, skill generalization, and transfer have long motivated psychological research. Despite a proliferation of theory, the field has rarely established causal effects. We employed a novel approach to test the stability and codevelopment of cognitive and social-emotional skills in early childhood using longitudinal randomized controlled… more →
Predicting Persistence and Fadeout Across Multi-Site RCTs of an Early Childhood Mathematics Curriculum Intervention
This study examined predictors of persistence and fadeout across multiple cluster RCTs that evaluated a preschool mathematics curriculum. We used meta-analytic methods to explore how impacts on student mathematics achievement faded between post-test (i.e., endline) and one-year follow-up. We found that the magnitude of the impact at post-test was a strong predictor of the one-year follow-up… more →
Mapping the Mechanisms of Interdisciplinary Learning Transfer from Reading to Math Achievement: Evidence from a Large-Scale Randomized Controlled Trial
Far transfer---the application of learning across distant domains---remains elusive in intervention research, and even when it is found, its mechanisms remain unclear or unexplored. This study analyzes data from the Model of Reading Engagement (MORE), a sustained content literacy intervention implemented in Grades 1-3 that demonstrated positive treatment effects on both near transfer reading… more →
The Effect of Four-Day School Week Adoption on Teacher Retention and Sorting
As teacher shortages worsen across the U.S., many school districts have implemented a unique solution to attract and retain effective teachers: switching from the traditional five-day school week to a four-day school week (4DSW). I use 17 years of teacher-level employment data from Texas in a difference-in-differences analysis to examine whether the 4DSW truly affects teacher retention and… more →
When and Why Does College Advising “Work:” Evidence from Advise TN
College advising programs increase the likelihood students apply to and enroll in higher education. However, few are proven effective at scale. We leverage the rollout of Advise TN across 33 communities to estimate causal impacts of a novel advising program on college enrollment, persistence, degree completion, and workforce participation. With complementary event-study and robust difference-… more →
Childhood Interventions and Life Course Development
A paradox has perplexed researchers studying childhood interventions: although program impacts on children’s skills often fade, some interventions have nonetheless produced long-run impacts on adult outcomes. Existing developmental theory does not provide a straightforward explanation. The fadeout-emergence paradox spotlights our limited understanding of how early skill gains shape long-run… more →
Bulwark or Barrier? The Effect of Academic Criteria-Based Reclassification on the High School Outcomes of Multilingual Learners in Texas
English learner (EL) classification can provide multilingual students (MLs) with key supports while simultaneously limiting access to important educational opportunities. To determine when students are ready to exit EL status, some states require students to meet academic criteria in addition to demonstrated English proficiency. However, few studies empirically examine these criteria, which… more →
How General is Educational Intervention Fadeout? A Meta-Analysis of Educational RCTs with Follow-Up
Researchers and policymakers pursue educational interventions with the goal of altering children’s long-term trajectories. However, many effects fade quickly after interventions end. Researchers have sought to address the fadeout problem by identifying characteristics of interventions that lead to persistent effects, though reliable answers have been elusive. We present evidence from 87… more →
Absent and Afraid? Immigration Enforcement and Student Attendance in the Second Trump Administration
Intensified immigration enforcement activity under the second Trump administration has increased anxiety for immigrants in the United States, including many families with school-age children. This study provides early evidence on the effects of the second Trump presidency on the attendance of students who may be from immigrant families. Using a difference-in-differences design, I estimate the… more →
The Effect of Centralized-Admission School Lotteries on Between-School Segregation: Evidence from 300 Largest School Districts in the United States
This study examines how centralized-admission school lotteries affect between-school racial and ethnic segregation in the largest U.S. public school districts. Using original nationwide panel data and a difference-in-differences design with staggered adoption, the research analyzes effects on school composition, intergroup exposure, and distribution evenness. The findings reveal that… more →
The West Texas Measles Outbreak and Student Absences
Declining child-vaccination rates are driving a measles resurgence in the US, yet little evidence documents how these outbreaks may disrupt schooling. Using daily absence data from a school district at the center of the West Texas outbreak, this preregistered analysis finds absences increased 41 percent relative to the within-year variation from two prior years, with larger effects among… more →
No Pay? No Way! Teacher Compensation Reforms and the Market for Graduate Degrees
Graduate degrees in education provide financial stability for many institutions, yet reformers have sought to decouple teacher pay from these credentials. Without a wage premium, educators may skip advanced study, reducing enrollment at nearby universities. Using a natural experiment in Tennessee, we show that eliminating a graduate degree wage premium for teachers led to a 27% (140 student)… more →
More Often or Longer? The Effects of the Academic Schedule on Postsecondary Academic Outcomes
One of the most common scheduling decisions in higher education is the determination of biweekly or triweekly classes. On the surface, these two formats are equivalent in terms of the number of minutes in a course (75 minutes twice a week or 50 minutes three times a week). However, the two structures may have different pros and cons for both students and faculty and it is ambiguous which… more →
Effects of a non-traditional teacher preparation program on non-test outcomes: evidence from relay graduate school of education in New York City
This study examines the effects of a non-traditional teacher preparation program, the Relay Graduate School of Education, on non-test outcomes for New York City public school students in Grades 3–8. By controlling for student and school fixed effects, I use plausibly random variation in Relay teacher assignments within students over time to identify causal Relay program effects. Results… more →
The Influence of Partisanship in Local School Board Elections: Evidence from Exit Polling in Michigan & Rhode Island
Education in the U.S. has long been shaped by local school boards elected in nonpartisan contests, a structure intended to shield schools from broader political forces. Today, many states are considering reforms to make school board elections partisan, yet the impact on voters remains unclear. Using exit poll data from 839 voters in Michigan (nonpartisan elections) and Rhode Island (partisan… more →
Assessing Permanent School Closures: A Conceptual Framework
Amid widespread declining enrollment, the expiration of COVID-19 ESSER funding, and looming uncertainty in federal P-12 education involvement, many school districts may soon consider permanent school closures. While extant permanent school closure literature provides a starting point for future analyses, it often fails to advise the breadth of contexts in which future closures may occur,… more →
Online Tutoring, School Performance, and School-to-Work Transitions: Evidence from a Randomized Controlled Trial
Tutoring programs for low-performing students, delivered in-person or online, effectively enhance school performance, yet their medium- and longer-term impacts on labor market outcomes remain less understood. To address this gap, we conduct a randomized controlled trial with 839 secondary school students in Germany to examine the effects of an online tutoring program for low-performing… more →
Understanding the decision (not) to become a teacher: evidence from survey experiments with undergraduates in the UK and US
Teacher shortages are widespread, yet the reasons people choose (not) to enter the profession remain poorly understood. We conducted two survey experiments in which thousands of undergraduates chose between pairs of hypothetical jobs. This allowed us to evaluate the effects of differences in pay, working patterns and other job attributes on job choices, as well as explore how personality type… more →
Does State-Mandated Third-Grade Reading Retention Policy Improve Achievement? Evidence from a Staggered-Adoption Difference-in-Differences Design
This paper investigates whether the state-mandated third-grade reading retention policy autonomously enhances student achievement or depends on broader literacy reforms. Using district-level data from the Stanford Education Data Archive (2010–2019), I employ a staggered-adoption Difference-in-Differences design, as per Callaway and Sant’Anna (2021), to assess heterogeneous treatment effects… more →
Values, Visions, and Variation in American School Districts: A Computational Mixed Methods Analysis of School District Strategic Plans
The decentralization of power is a defining feature of the American education system, allowing schools to reflect community values and needs. Yet, little is known about how values and visions for education hold constant or vary across districts. Through an analysis of 617 district strategic plans, combining qualitative coding and computational topic modeling, we provide insight into how local… more →
Marginal Returns to Public Universities
This paper studies the returns to enrolling in American public universities by comparing the long-term outcomes of barely admitted versus barely rejected applicants. I use administrative admission records spanning all 35 public universities in Texas, which collectively enroll 10 percent of all American public university students, to systematically identify and employ decentralized cutoffs in… more →
Variations in Pre-Primary Education Infrastructure Within and Across Administrative Sectors in Rwanda
This study examines disparities in structural quality across Rwanda’s pre-primary modalities—centre-based, community-based, and home-based—operating under a single policy framework. Using data from 4,875 settings across 91 administrative sectors in seven districts, we applied multilevel models to separate within-sector differences by modality from between-sector variation, associated with… more →
Selling Student Success: A Critical Analysis of Predictive Analytics Vendors in Higher Education
As predictive analytics become increasingly embedded in higher education, commercial vendors offering these tools play a growing role in shaping institutional decision making, particularly through identifying students deemed “at risk.” In this qualitative study, we analyzed 161 publicly available materials from 15 vendors to examine these companies’ marketing of predictive analytics. Drawing… more →
Ready for What? School and District Responses to State College and Career Readiness Accountability in Tennessee
Tennessee’s K-12 accountability system incorporates three distinct measures of college and career readiness (CCR) for state and federal accountability. Each of these indicators applies its own set of metrics and performance benchmarks, but they all consistently draw upon similar components including participation in Early Postsecondary Opportunities (EPSOs), standardized tests like the ACT and… more →
Fast Track to Success? A Mixed Methods Evaluation of Condensed Course Formats at Tennessee Community Colleges
As colleges face increasing pressure to improve student outcomes, one solution gaining traction is the adoption of condensed courses (i.e., shortened academic terms). We employ quasi-experimental methods to estimate the effect of enrolling in a condensed course on course- and student-level outcomes at all public community colleges in Tennessee. We also leverage interviews with college faculty… more →
The Role of Education-Industry Match in College Earnings Premia
Many states incentivize college students to major in fields aligned with specific, often “in-demand” industries. While their goal is often to raise students’ labor market outcomes, little is known about whether matching one’s degree with an industry of work improves employment and earnings. We leverage a novel education-industry crosswalk applied to student and worker panel data covering over… more →
Beyond the Classroom: Impact of a High-Dosage Tutoring Program on Student Literacy Achievement
This study examines the impact of a high-dosage tutoring program, characterized by low tutor-to-student-ratio, on the literacy achievement of students in grades two through five in a midsized suburban school district in the southeastern United States. Using a student-level randomized controlled trial, 333 students were randomly assigned to either receive tutoring during the intervention period… more →
The Early Intervention and Early Childhood Special Education Workforce: Descriptive Evidence on Demographics and Turnover from Oregon
Early intervention (EI) and early childhood special education (ECSE) services for children with disabilities have expanded substantially across the U.S. over the past few decades, necessitating efforts to recruit and retain a qualified workforce to meet their needs. Despite widespread reports of staffing challenges in this sector, few contemporary studies provide large-scale evidence on this… more →
Removing Barriers to College Credits: Where and for Whom AP Exam Fee Waivers Work
Do policies that broaden educational access also foster success? We study this question in the context of North Carolina’s universal Advanced Placement (AP) exam fee waiver policy. Using student-course level administrative data, we exploit within-student variation on a sample of students who took multiple AP courses to estimate the policy’s effect on exam participation (access) and pass rates… more →
Learning to Work Towards Goals: A Sequential Evaluation of the Effect of Goal-Setting Course on Academic and Soft Skills
This study sequentially evaluates a soft-skills course implemented in Ugandan and Kenyan primary schools that replaced academic review time with lessons on goal-setting and related skills as students prepared for high-stakes primary school-leaving exams. An exploratory evaluation in Uganda provided evidence of positive impacts on girls' test scores. A confirmatory evaluation in Kenya found… more →
Beyond the One-Teacher Model: Experimental Evidence on Using Embedded Paraprofessionals as Personalized Instructors
Using embedded paraprofessionals to provide personalized instruction is a promising model for differentiating instruction within the classroom. This study examines two randomized controlled trials of paraprofessional-led tutoring in early-grade math and literacy. However, intent-to-treat (ITT) analyses revealed no overall achievement impacts for either program. We then explore two mechanisms… more →
Do Test Scores Misrepresent Test Results? An Item-by-Item Analysis
Much of the data collected in education is effectively thrown away. Students answer individual test questions, but administrators and researchers only see aggregate performance. All the item-level data are lost. Ex ante it is not clear this destroys much useful information, since the aggregate might be a sufficient statistic. Using data from Texas for 5 million students and 1.31 billion… more →
The Architecture of Expected Wage Gaps: Between- and Within-School Sources of Career Education Inequality
This study investigates how school-level variation contributes to social stratification even before labor market entry by examining Career and Technical Education (CTE) as a key mechanism for sorting students into pathways with unequal economic returns. Using Delaware administrative data and Bureau of Labor Statistics occupational wage data, we introduce “expected wage” as a measure to… more →
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise
Generative AI, particularly Large Language Models (LLMs), can expand access to expert guidance in domains like education, where such support is often limited. We introduce Tutor CoPilot, a Human-AI system that models expert thinking to assist tutors in real time. In a randomized controlled trial involving more than 700 tutors and 1,000 students from underserved communities, students with… more →
Labor Market Strength and Declining Community College Enrollment
Declining U.S. college enrollments have triggered questions about the health of the postsecondary sector. Using institution-level data, we make four points. First, such declines are driven not by the four-year sector but by two-year community colleges, which have apparently shrunk by over 30% since the peak of the Great Recession. Second, over one-third of this apparent decline is an artifact… more →
Creating Short Forms of Early Childhood Development Measures: A Framework for Quantifying Statistical, Conceptual, and Practical Tradeoffs in Direct Assessment
Direct assessments of early childhood development (ECD) are a cornerstone of research in developmental psychology and are increasingly used to evaluate programs and policies in lower- and middle-income countries. Despite strong psychometric properties, these assessments are too expensive and time consuming for use in large-scale monitoring or national-level surveys. Short forms of direct… more →
The Effects of Immigration Enforcement on Student Outcomes in a New Era of Immigration Policy in the United States
This study presents the first evidence, to our knowledge, of the effects of the surge in interior immigration apprehensions in 2025 in the United States on student academic performance using detailed student-level administrative records from Florida. We find evidence that immigration enforcement reduced test scores for both U.S.-born and foreign-born Spanish-speaking students while also… more →
How do place-based scholarships affect student borrowing and academic outcomes? Lessons from Atlanta
Previous research shows that Achieve Atlanta’s placed-based scholarship and associated services meaningfully improve college persistence and completion. In this follow up study that uses similar methods but additional and more detailed data, we examine whether scholarship recipients exhibit different student loan portfolios, course-taking patterns, or academic performance. Using regression… more →
Do As I Say: What Teachers’ Language Reveals About Classroom Management Practices
Classroom management critically affects students’ academic and behavioral outcomes, yet we lack quantitative methods for observing these practices at scale. This study develops and validates language-based measures of classroom management—such as responding to student behavior and issuing verbal or material sanctions—using natural language processing (NLP) on 1,652 elementary mathematics… more →
ChatGPT vs. Machine Learning: Assessing the Efficacy and Accuracy of Large Language Models for Automated Essay Scoring
Automated Essay Scoring (AES) is a critical tool in education that aims to enhance the efficiency and objectivity of educational assessments. Recent advancements in Large Language Models (LLMs), such as ChatGPT, have sparked interest in their potential for AES. However, comprehensive comparisons of LLM-based methods with traditional machine learning (ML) methods across different assessment… more →