Search EdWorkingPapers


Educator preparation, professional development, performance and evaluation

Kylie L. Anglin, Vivian C. Wong.

Researchers are rarely satisfied to learn only whether an intervention works; they also want to understand why and under what circumstances interventions produce their intended effects. These questions have led to increasing calls for implementation research to be included in high-quality studies with strong causal claims. Of critical importance is determining whether an intervention can be delivered with adherence to a standardized protocol, and the extent to which an intervention protocol can be replicated across sessions, sites, and studies. When an intervention protocol is highly standardized and delivered through verbal interactions with participants, a set of natural language processing (NLP) techniques termed semantic similarity can be used to provide quantitative summary measures of how closely intervention sessions adhere to a standardized protocol, as well as how consistently the protocol is replicated across sessions. Given the intense methodological, budgetary, and logistical challenges of conducting implementation research, semantic similarity approaches have the benefit of being low-cost, scalable, and context-agnostic. In this paper, we demonstrate how semantic similarity approaches may be utilized in an experimental evaluation of a coaching protocol on teacher pedagogical skills in a simulated classroom environment. We discuss strengths and limitations of the approach, and the most appropriate contexts for applying this method.
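
As a rough illustration of the idea described in this abstract, the sketch below scores session transcripts against a protocol script using cosine similarity over bag-of-words counts. The abstract does not say which text representation or similarity measure the authors use, so the bag-of-words choice, the function names (`term_counts`, `adherence_scores`, `replication_score`), and the sample texts are all assumptions made for illustration, not the authors' method.

```python
import math
import re
from collections import Counter
from itertools import combinations


def term_counts(text):
    """Lowercase the text and count its word tokens (a bag-of-words vector)."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine_similarity(a, b):
    """Cosine between two sparse count vectors; 0.0 if either is empty."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def adherence_scores(protocol, transcripts):
    """Similarity of each session transcript to the protocol script."""
    protocol_vector = term_counts(protocol)
    return [cosine_similarity(protocol_vector, term_counts(t)) for t in transcripts]


def replication_score(transcripts):
    """Mean pairwise similarity among sessions (requires at least two)."""
    vectors = [term_counts(t) for t in transcripts]
    pairs = list(combinations(vectors, 2))
    return sum(cosine_similarity(a, b) for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    # Hypothetical protocol text and session transcripts, invented for this sketch.
    protocol = "Name one strength, then give one specific next step for redirection."
    sessions = [
        "First I will name one strength, then give you one specific next step.",
        "Let's talk about how your week went before we get started.",
    ]
    for score in adherence_scores(protocol, sessions):
        print(f"adherence: {score:.2f}")
    print(f"replication: {replication_score(sessions):.2f}")
```

Raw word counts only capture shared vocabulary; a representation that weights rare terms or encodes meaning more directly may be a better fit for real transcripts, and would slot into `term_counts` without changing the scoring functions.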



Matthew A. Kraft, Alexander Bolves.

We study the adoption and implementation of a new mobile communication app among a sample of 132 New York City public schools. The app provides a platform for sharing general announcements and news as well as engaging in personalized two-way communication with individual parents. We provide participating schools with free access to the app and randomize schools to receive intensive support (training, guidance, monitoring, and encouragement) for maximizing the efficacy of the app. Although user supports led to higher levels of communication within the app in the treatment year, overall usage remained low and declined in the following year when treatment schools no longer received intensive supports. We find few subsequent effects on perceptions of communication quality or student outcomes. We leverage rich internal user data to explore how take-up and usage patterns varied across staff and school characteristics. These analyses help to identify early adopters and reluctant users, revealing both opportunities and obstacles to engaging parents through new communication technology.



Philip Oreopoulos.

This article takes stock of where the field of behavioral science applied to education policy stands, which avenues seem promising, and which seem like dead ends. I present a curated set of studies rather than an exhaustive literature review, categorizing interventions by whether they nudge (keep options intact) or “shove” (restrict choice), and by whether they are high or low touch (that is, whether or not they use face-to-face interaction). Many recent attempts to test large-scale low-touch nudges find precisely estimated null effects, suggesting we should not expect letters, text messages, and online exercises to serve as panaceas for addressing education policy’s key challenges. Programs that impose more choice-limiting structure on a youth’s routine, like mandated tutoring, or programs that nudge parents, appear more promising.



Jason A. Grissom, David S. Woo, Brendan Bartanen.

High rates of principal turnover nationally mean that school districts are constantly called on to recruit and select new principals. The importance of a school’s principal makes choosing candidates who will be effective paramount, yet we have little evidence linking information known to school districts at the time of selection to principals’ future job performance. Using data from Tennessee, we test the degree to which observable information about novice principals from prior to entry, including qualifications, work history information, and effectiveness in prior roles, predicts practice ratings assigned to them in their initial years in the principalship. We find that educational attainment and years of experience in other jobs hold little predictive power. Performance ratings received as an assistant principal (AP) or teacher, however, do predict principal effectiveness. Moreover, APs who previously worked in schools with highly rated principals are more likely to be effective upon transitioning into the principalship.

 



Brendan Bartanen, Laura K. Rogers, David S. Woo.

Assistant principals are important education personnel, both as essential members of school leadership teams and apprentice principals. However, empirical evidence on their career outcomes remains scarce. Using statewide administrative data from Tennessee and Missouri, we provide the first comprehensive analysis of AP mobility. While prior work focuses only on AP promotions into principal positions, we also account for APs who exit school leadership and transfer to a different school. We find yearly mobility rates of 25–28%, with 10% of APs leaving school leadership, 7.5% changing schools, and 7.5–10% becoming principals. We also document a strong relationship between AP mobility and principal turnover, where higher-performing APs are substantially more likely to replace their departing principal. Principal transitions also appear to increase the likelihood that APs exit school leadership and change schools, highlighting an additional cost of high rates of principal churn.



David D. Liebowitz.

Teacher evaluation policies seek to improve student outcomes by increasing the effort and skill levels of current and future teachers. Current policy and most prior research treat teacher evaluation as balancing two aims: accountability and skill development. Proper teacher evaluation design has been understood as successfully weighting the accountability and professional growth dimensions of policy and practice. I develop a model of teacher effectiveness that incorporates improvement from evaluation and detail the conditions that determine the effectiveness of teacher evaluation for growth and accountability at improving student outcomes. Drawing on empirical evidence from the personnel economics, economics of education, and measurement literatures, I simulate the long-term effects of a set of teacher evaluation policies. I find that policies that treat evaluation for accountability and evaluation for growth as substitutes outperform policies that treat them as complements. I conclude that optimal teacher evaluation policies would impose accountability on teachers performing below a defined level, while teachers above that level would face no accountability pressure but would receive intensive instructional supports.



Andre Joshua Nickow, Philip Oreopoulos, Vincent Quan.

Tutoring—defined here as one-on-one or small-group instructional programming by teachers, paraprofessionals, volunteers, or parents—is one of the most versatile and potentially transformative educational tools in use today. Within the past decade, dozens of preK-12 tutoring experiments have been conducted, varying widely in their approach, context, and cost. Our study represents the first systematic review and meta-analysis of these and earlier studies. We develop a framework for considering different types of programs to not only examine overall effects, but also explore how these effects vary by program characteristics and intervention context. We find that tutoring programs yield consistent and substantial positive impacts on learning outcomes, with an overall pooled effect size estimate of 0.37 SD. Effects are stronger, on average, for teacher and paraprofessional tutoring programs than for nonprofessional and parent tutoring. Effects also tend to be strongest among the earlier grades. While overall effects for reading and math interventions are similar, reading tutoring tends to yield higher effect sizes in earlier grades, while math tutoring tends to yield higher effect sizes in later grades. Tutoring programs conducted during school tend to have larger impacts than those conducted after school.



David D. Liebowitz, Lorna Porter.

Many education policymakers and system leaders prioritize recruiting and developing effective school leaders as key mechanisms to improve school climate and student learning. Despite efforts to select and support successful school leaders, however, relatively little is understood about the prior professional experiences and skillsets that principals possess upon entry into their positions. In this descriptive paper, we use 14 years of administrative data on all educators in Oregon to trace the prior professional experiences and instructional effectiveness of those who become school leaders. We highlight that many principals in Oregon acquire educational leadership experience outside the assistant principal role and outside of the school district in which they serve. We also find that when future school leaders were teachers, they improved student achievement at modestly higher rates than their peers. Insight into these topics has the potential to inform the pre-service training, recruitment and professional development of school leaders.



Stephen B. Holt, Rui Wang, Seth Gershenson.

Teaching is often assumed to be a relatively stressful occupation, and occupational stress among teachers has been linked to poor mental health, attrition from the profession, and decreased effectiveness in the classroom. Despite widespread concern about teachers’ mental health, however, little empirical evidence exists on long-run trends in teachers’ mental health or the prevalence of mental health problems in teaching relative to other professions. We address this gap in the literature using nationally representative data from the 1979 and 1997 cohorts of the National Longitudinal Survey of Youth (NLSY). In the 1979 cohort, women who become teachers have similar mental health to non-teachers prior to teaching but enjoy better mental health than their non-teaching peers, on average, while working as teachers. However, in the 1997 cohort teachers self-report worse mental health, on average, than the 1979 cohort and fare no better than their non-teaching professional peers while teaching. Overall, teachers seem to enjoy mental health outcomes that are as good as or better than those of their peers in other professions.



Marcos A. Rangel, Ying Shi.

We study racial bias and the persistence of first impressions in the context of education. Teachers who begin their careers in classrooms with large black-white score gaps carry negative views into evaluations of future cohorts of black students. Our evidence is based on novel data on blind evaluations and non-blind public school teacher assessments of fourth and fifth graders in North Carolina. Negative first impressions lead teachers to be significantly less likely to over-rate but not more likely to under-rate black students’ math and reading skills relative to their white classmates. Teachers’ perceptions are sensitive to the lowest-performing black students in early classrooms, but non-responsive to the highest-performing ones. This is consistent with the operation of confirmatory biases. Since teacher expectations can shape grading patterns and sorting into academic tracks as well as students’ own beliefs and behaviors, these findings suggest that novice teachers’ initial experiences may contribute to the persistence of racial gaps in educational achievement and attainment.
