Assessment
Students' Grade Satisfaction Influences Evaluations of Teaching: Evidence from Individual-level Data and an Experimental Intervention
Student surveys are widely used to evaluate university teaching and increasingly adopted at the K-12 level, although there remains considerable debate about what they measure. Much disagreement focuses on the well-documented correlation between student grades and their evaluations of instructors…
Bridging human and machine scoring in experimental assessments of writing: tools, tips, and lessons learned from a field trial in education
Topics: Methods
In a randomized trial that collects text as an outcome, traditional approaches for assessing treatment impact require that each document first be manually coded for constructs of interest by human raters. An impact analysis can then be conducted to compare treatment and control groups, using the…
Measuring and Summarizing the Multiple Dimensions of Teacher Effectiveness
Topics: Teacher and Leader Development
Tags: Assessment
There is an emerging consensus that teachers impact multiple student outcomes, but it remains unclear how to measure and summarize the multiple dimensions of teacher effectiveness into simple metrics for research or personnel decisions. We present a multidimensional empirical Bayes framework and…
Teacher Preparation Programs and Graduates' Growth in Instructional Effectiveness
Topics: Teacher and Leader Development
Many prior studies have examined whether there are average differences in levels of teaching effectiveness among graduates from different teacher preparation programs (TPPs); other studies have investigated which features of preparation predict graduates’ average levels of teaching effectiveness…
College Entrance Exam-Taking Strategies in Georgia
Using administrative data from Georgia, we provide the first study of the full set of college entrance exam-taking strategies, including who takes the ACT and the SAT (or both), when they take the exams, and how many times they take each exam. We have several main findings. First, one-third of…
Characterizing Cross-Site Variation in Local Average Treatment Effects in Multisite Regression Discontinuity Design Contexts with an Application to Massachusetts High School Exit Exam
Topics: Methods
Tags: Assessment, High schools
In multisite experiments, we can quantify treatment effect variation with the cross-site treatment effect variance. However, there is no standard method for estimating cross-site treatment effect variance in multisite regression discontinuity designs (RDD). This research rectifies this gap in…
Using Implementation Fidelity to Aid in Interpreting Program Impacts: A Brief Review
Topics: Methods
Tags: Assessment, Curriculum
Poor program implementation constitutes one explanation for null results in trials of educational interventions. For this reason, researchers often collect data about implementation fidelity when conducting such trials. In this article, we document whether and how researchers report and measure…
New Schools and New Classmates: The Disruption and Peer Group Effects of School Reassignment
Topics: Policy, Politics, and Governance
Policy makers periodically consider using student assignment policies to improve educational outcomes by altering the socio-economic and academic skill composition of schools. We exploit the quasi-random reassignment of students across schools in the Wake County Public School System to estimate…
Understanding Performance in Test Taking: The Role of Question Difficulty Order
Tags: Assessment
Standardized assessments are widely used to determine access to educational resources with important consequences for later economic outcomes in life. However, many design features of the tests themselves may lead to psychological reactions influencing performance. In particular, the level of…
Measuring Teaching Practices at Scale: A Novel Application of Text-as-Data Methods
Topics: Methods
Valid and reliable measurements of teaching quality facilitate school-level decision-making and policies pertaining to teachers. Using nearly 1,000 word-to-word transcriptions of 4th- and 5th-grade English language arts classes, we apply novel text-as-data methods to develop automated measures…
A Half Century of Progress in U. S. Student Achievement: Agency and Flynn Effects; Ethnic and SES Differences
Topics: Student Learning
Principals (policymakers) disagree as to whether U. S. student performance has changed over the past half century. To inform conversations, agents administered seven million psychometrically linked tests in math (m) and reading (rd) in 160 survey waves to national probability samples of cohorts…
The Distribution of School Spending Impacts
Tags: Efficacy, Assessment
We examine all known "credibly causal" studies to explore the distribution of the causal effects of public K-12 school spending on student outcomes in the United States. For each of the 31 included studies, we compute the same marginal spending effect parameter estimate. Precision-weighted…
Achievement Gaps in the Wake of COVID-19
Topics: Student Learning
A survey targeting education researchers conducted in November 2020 provides both short- and longer-term predictions of how much achievement gaps between low- and high-income students in U.S. elementary schools will change as a result of COVID-related disruptions to schooling and family life.…
How Much Does Teacher Quality Vary Across Teacher Preparation Programs? Reanalyses from Six States
Topics: Methods
At least sixteen US states have taken steps toward holding teacher preparation programs (TPPs) accountable for teacher value-added to student test scores. Yet it is unclear whether teacher quality differences between TPPs are large enough to make an accountability system worthwhile. Several…
Higher-Quality Elementary Schools Sustain the Prekindergarten Boost: Evidence from an Exploration of Variation in the Boston Prekindergarten Program’s Impacts
Topics: Student Learning
While there is a consensus that attending preschool better prepares children for kindergarten, evidence on the factors that sustain the preschool boost into the early elementary years is still emerging. To add to this literature, we use lottery data from applicants to oversubscribed…
The Effects of Financial Aid Loss on Persistence and Graduation: A Multi-Dimensional Regression Discontinuity Approach
Tags: Higher education, Assessment
For years Georgia's HOPE Scholarship program provided full tuition scholarships to high achieving students. State budgetary shortfalls reduced its generosity in 2011. Under the new rules, only students meeting more rigorous merit-based criteria would retain the original scholarship covering full…
An Evaluation of Credit Recovery as an Intervention for High School Students Who Fail Courses
Credit recovery (CR) refers to online courses that high school students take after previously failing the course. Many have suggested that CR courses are helping students to graduate from high school without corresponding increases in academic skills. This study analyzes administrative data from…
Education Leaders’ Knowledge of Causal Research Design: A Measurement Challenge
Federal policy has both incentivized and supported better use of research evidence by educational leaders. However, the extent to which these leaders are well-positioned to understand foundational principles from research design and statistics, including those that underlie the What Works…
The role of student effort on performance in PISA: Revisiting the gender gap in achievement
International assessments are important to benchmark the quality of education across countries. However, on low-stakes tests, students’ incentives to invest their maximum effort may be minimal. Research stresses that ignoring students’ effort when interpreting results from low-stakes assessments…
How Can Released State Test Items Support Interim Assessment Purposes in an Educational Crisis?
Tags: Covid-19 recovery, Assessment
State testing programs regularly release previously administered test items to the public. We provide an open-source recipe for state, district, and school assessment coordinators to combine these items flexibly to produce scores linked to established state score scales. These would enable…