K-12 Education

Steven Michael Carlo.

The National Assessment of Educational Progress (NAEP) has tested the civic, or citizenship, knowledge of students across the nation at irregular intervals since its inception. Despite advancements in reading and mathematics evidenced by NAEP results, civics proficiency has remained consistently low, raising concerns among educators and policymakers. This study attempts to provide those educators and policymakers with state-level predictions, which are not currently available for the civics assessment. It addresses this gap in state-level civics education data by applying multilevel regression with poststratification (MRP) to NAEP's nationally representative civics scores, yielding state-specific estimates that account for student demographics. A historical analysis of NAEP's development underscores its significance in national education and highlights the challenges of transitioning to state-level reporting, particularly for civics, which lacks state-level generalizability. Furthermore, this paper evaluates NAEP's frameworks, questioning their alignment with civics education's evolving needs, and investigates the presence of opportunity gaps in civics knowledge across gender and racial/ethnic lines. By comparing MRP estimates with published NAEP results, the study validates the method's credibility and emphasizes the potential of MRP in educational research. The findings reveal persistent racial/ethnic disparities in civic knowledge, with profound implications for civics instruction and policy. The research concludes by stressing the necessity for state-specific data to inform education policy and practice, advocating for teaching methods that enhance civic understanding and engagement, and suggesting future research directions to address the uncovered disparities.
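
As a rough illustration of the poststratification half of MRP, the sketch below weights cell-level predictions by each cell's population share within a state. It is not the paper's implementation: the multilevel regression step is skipped entirely, and the function name, state labels, demographic cells, predicted scores, and population counts are all hypothetical.

```python
from collections import defaultdict


def poststratify(cell_predictions, cell_counts):
    """Return a population-weighted mean prediction per state.

    cell_predictions: {(state, cell): predicted mean score for that cell}
    cell_counts:      {(state, cell): number of students in that cell}
    Both inputs are assumed to come from elsewhere (a fitted multilevel
    model and census-style counts); neither is produced here.
    """
    weighted_sum = defaultdict(float)
    total = defaultdict(float)
    for (state, cell), prediction in cell_predictions.items():
        n = cell_counts[(state, cell)]
        weighted_sum[state] += n * prediction
        total[state] += n
    return {state: weighted_sum[state] / total[state] for state in weighted_sum}


# Hypothetical inputs: two states, two demographic cells each.
predictions = {
    ("A", "group1"): 150.0,
    ("A", "group2"): 140.0,
    ("B", "group1"): 160.0,
    ("B", "group2"): 130.0,
}
counts = {
    ("A", "group1"): 300,
    ("A", "group2"): 100,
    ("B", "group1"): 50,
    ("B", "group2"): 150,
}

estimates = poststratify(predictions, counts)
print(estimates)
```

Because the weights are population counts rather than sample counts, a state's estimate reflects its own demographic mix even when few of its students appear in the national sample.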

David Blazar, Max Anthenelli, Wenjing Gao, Ramon Goings, Seth Gershenson.

Mounting evidence supporting the advantages of a diverse teacher workforce prompts policymakers to scrutinize existing recruitment pathways. Following four cohorts of Maryland public high-school students over 12 years reveals several insights. Early barriers require timely interventions, aiding students of color in achieving educational milestones that are prerequisites for teacher candidacy (high school graduation, college enrollment). While alternative pathways that bypass traditional undergraduate teacher preparation may help, current approaches still show persistent racial disparities. Data simulations underscore the need for race-conscious policies specifically targeting or differentially benefiting students of color, as race-neutral strategies have minimal impact. Ultimately, multiple race-conscious policy solutions addressing various educational milestones must demonstrate significant effects (approximately 30% increases) to reshape the teacher workforce to align with student body demographics.

Matthew A. Kraft, Melissa Arnold Lyon.

We examine the state of the U.S. K-12 teaching profession over the last half century by compiling nationally representative time-series data on four interrelated constructs: occupational prestige, interest among students, the number of individuals preparing for entry, and on-the-job satisfaction. We find a consistent and dynamic pattern across every measure: a rapid decline in the 1970s, a swift rise in the 1980s extending into the mid-1990s, relative stability, and then a sustained decline beginning around 2010. The current state of the teaching profession is at or near its lowest levels in 50 years. We identify and explore a range of hypotheses that might explain these historical patterns, including economic and sociopolitical factors, education policies, and school environments.

Jason Fontana, Jennifer L. Jennings.

Does state implementation of Education Savings Accounts (ESAs), which are voucher-like taxpayer-funded subsidies for children to attend private schools, increase tuition prices? We analyze a novel longitudinal dataset for all private schools in Iowa and Nebraska, neighboring states that adopted ESAs in the same legislative session, with Iowa’s implementation beginning first. By leveraging state and grade-level variation in eligibility, we provide new causal evidence that ESAs led Iowa private schools to increase tuition. Increases varied by the percentage of the grade eligible for ESAs. When eligibility was universal (kindergarten), private schools increased prices 21-25%, compared with 10-16% in grades with partial eligibility. In contrast, private schools did not increase tuition in pre-K, which was ineligible for ESAs. If a goal of ESAs is to extend private school access to new families, the substantial tuition increases they produce may limit access.

Paiheng Xu, Jing Liu, Nathan Jones, Julie Cohen, Wei Ai.

Assessing instruction quality is a fundamental component of any improvement effort in the education system. However, traditional manual assessments are expensive, subjective, and heavily dependent on observers’ expertise and idiosyncratic factors, preventing teachers from getting timely and frequent feedback. Unlike prior research that focuses on low-inference instructional practices, this paper presents the first study that leverages Natural Language Processing (NLP) techniques to assess multiple high-inference instructional practices in two distinct educational settings: in-person K-12 classrooms and simulated performance tasks for pre-service teachers. This is also the first study that applies NLP to measure a teaching practice that has been demonstrated to be particularly effective for students with special needs. We confront two challenges inherent in NLP-based instructional analysis: noisy and long input data, and highly skewed distributions of human ratings. Our results suggest that pretrained Language Models (PLMs) perform comparably to the agreement level of human raters for variables that are more discrete and require lower inference, but their efficacy diminishes with more complex teaching practices. Interestingly, using only teachers’ utterances as input yields strong results for student-centered variables, alleviating common concerns over the difficulty of collecting and transcribing high-quality student speech data in in-person teaching settings. Our findings highlight both the potential and the limitations of current NLP techniques in the education domain, opening avenues for further exploration.

Ellen Sahlström, Mikko Silliman.

We study the extent and consequences of biases against immigrants exhibited by high school teachers in Finland. Compared to native students, immigrant students receive 0.06 standard deviation units lower scores from teachers than from blind graders. This effect is almost entirely driven by grading penalties incurred by high-performing immigrant students and is largest in subjects where teachers have more discretion in grading. While teacher-assigned grades on the matriculation exam are not used for tertiary enrollment decisions, we show that immigrant students who attend schools with biased teachers are less likely to continue to higher education.
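
The comparison behind the 0.06 standard deviation figure can be read as a difference of differences: the teacher-minus-blind score gap for immigrant students, minus the same gap for native students. The sketch below shows only that arithmetic on made-up records. It is not the authors' estimator (no controls, fixed effects, or standard errors), and the function name, field names, and values are hypothetical.

```python
from statistics import mean


def relative_grading_gap(records):
    """Mean (teacher - blind) score for immigrants minus the same for natives.

    Each record is a dict with hypothetical keys:
      "immigrant": bool, "teacher": float, "blind": float
    Scores are assumed to be in standard deviation units already.
    """
    diffs = {True: [], False: []}
    for record in records:
        diffs[record["immigrant"]].append(record["teacher"] - record["blind"])
    return mean(diffs[True]) - mean(diffs[False])


# Made-up illustrative data, not drawn from the study.
records = [
    {"immigrant": True, "teacher": 0.40, "blind": 0.50},
    {"immigrant": True, "teacher": 0.10, "blind": 0.14},
    {"immigrant": False, "teacher": 0.30, "blind": 0.30},
    {"immigrant": False, "teacher": 0.58, "blind": 0.60},
]

gap = relative_grading_gap(records)
print(round(gap, 2))
```

A negative value means teachers score immigrant students lower, relative to blind graders, than they score native students.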

Douglas D. Ready, Sierra G. McCormick, Rebecca J. Shmoys.

This paper describes a 12-week cluster randomized controlled trial that examined the efficacy of BookNook, a virtual tutoring platform focused on reading. Cohorts of first- through fourth-grade students attending six Rocketship public charter schools in Northern California were randomly assigned within grades to receive BookNook. Intent-to-Treat models indicate that students in cohorts assigned to BookNook outperformed their control-group peers by roughly 0.05 SDs. Given the substantial variability in usage rates among students enrolled in BookNook cohorts, we also leveraged Treatment-on-the-Treated approaches. These models suggest that students who completed 10 or more BookNook sessions experienced a reading advantage of 0.08 SDs, while those who completed 20 or more sessions—the recommended dosage—experienced a 0.26 SD developmental advantage.

NaYoung Hwang.

This study examines the impact of special education on academic and behavioral outcomes for students with learning disabilities (LD) by using statewide Indiana data covering kindergarten through eighth grade. The results from student fixed effects models show that special education services improve achievement in math and English Language Arts, but they also increase suspensions and absences for students with LD. These effects vary across student subgroups, including gender, race/ethnicity, eligibility for free or reduced-price lunch, and English language learner status. The findings reveal both the significant benefits and unintended consequences of special education services for students with LD, highlighting the complex dynamics and varying effects of special education.

Jesper Eriksen, Shaun M. Dougherty.

Vocational Education and Training (VET) programs are prevalent in Europe, but often struggle with drop-out rates that exceed those of general upper-secondary education. Using Danish administrative data, we study how reform-induced reductions in the share of VET students who did not pass their lower secondary final exams affect VET students with passing GPAs. We find that passing students have a higher probability of remaining enrolled in VET after the first year of studies when entering a VET school with a higher share of below-passing peers. Studying outside options, we find that students become less likely to drop out of education entirely. The results are consistent with models of peer effects in which particularly unmotivated students become points of comparison for their peers, increasing their motivation and likelihood of remaining enrolled.

Arielle Boguslav.

Despite the common title of “coach,” definitions of high-quality coaching vary tremendously across models and programs. Yet, few studies make comparisons across different models to understand what is most helpful, for whom, and under what circumstances. As a result, practitioners are left with many options and little evidence-based direction. This is exacerbated by the literature’s focus on more abstract features of coaching practice (e.g. building trust), leaving practitioners to figure out what concrete discourse strategies support these goals. This paper begins to address these challenges by introducing a taxonomy of coaching “moves,” parsing the concrete details of coach discourse. While the taxonomy is informed by the literature, it highlights conceptual possibilities rather than providing a list of empirically-grounded or “evidence-based” strategies. In doing so, this taxonomy may serve as a common language to guide future work exploring how coach discourse shapes teacher development, synthesizing across studies, and supporting coach practice.
