Learning from Evaluation when Context Matters

Lant Pritchett
Harvard Kennedy School and Center for Global Development
At "Evidence on a silver platter: Evaluation results for policy making in Development Cooperation", November 5, 2015

Four big problems with the RCT revolution, or "randomistas"
1. It is not good science: experiments without theory cannot have external validity.
2. It is not a good theory of organizational learning about implementation-intensive programs (which is most everything).
3. It is not able to focus on development and hence is mainly useful for charity work.
4. It is not based on a realistic positive model, or theory of change, of policy adoption.

Every bubble has a transition from "we" to "they": from "We have to get in on this" to "What were they thinking?"

Getting past the hype cycle about RCTs to the slope of enlightenment

Doing science via RCTs embedded as independent impact evaluation of projects
Stage of hype cycle, for "Doing science with RCTs":
- Peak of inflated expectations: Build RCTs into donor/NGO-financed projects which create treatments to look at the impact of interventions. After many RCTs are done, do a systematic review of "what works" to guide policy.
- Trough of disillusionment: If doing experiments were science there would be a Nobel Prize for alchemy. This is not science; it is the parody of science. This cannot work (and we knew that). This does not work (we can now show that).
- Slope of enlightenment: All methods have to be more embedded in a theory that provides a notion of context, and hence invariance laws that are able to encompass all empirical findings.

Cannot work and does not work
- Cannot work as a method: if the bad studies show heterogeneity (e.g. impacts differ across contexts), then good studies cannot logically be expected to reduce that heterogeneity.
- Does not work:
  - Eva Vivalt's survey of 600 RCT studies
  - Evans on education
  - Bold et al. on replication
  - Pritchett and Sandefur (2015)

Cannot work: no claim to external validity is coherent, because the gap between observational and RCT results is the result of behavior
[Figure: distribution of the impact on test scores of reduced class size from non-experimental studies (i.e. the non-RCT literature), centered near zero with a typical 2σ spread. Marked estimates: β_OLS(j) = 0, β_OLS(i) = 0.2, β_RCT(i) = 0.3, β_OLS(k) = 0.4. The "gold standard" RCT comes from one specific context (country, region, grade, range of class sizes).]
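The incoherence can be made concrete with the four point estimates on the slide. The sketch below is only an illustration, not the author's calculation: it treats "OLS minus RCT" in context i as the selection bias there, and the variable names are assumptions. It then shows that one RCT forces a choice. Either the true effect is the same everywhere, in which case the bias must differ by context, or the bias is the same everywhere, in which case the true effect must differ by context.

```python
# Point estimates read off the slide; i, j, k are the slide's context labels.
ols = {"j": 0.0, "i": 0.2, "k": 0.4}
rct_i = 0.3  # the single RCT estimate, available only for context i

# Gap between observational and experimental estimates where both exist.
bias_i = ols["i"] - rct_i

# Reading A: the RCT effect is externally valid (same true effect everywhere).
# Then the OLS bias in each context is whatever reconciles OLS with 0.3.
implied_bias = {c: round(b - rct_i, 2) for c, b in ols.items()}

# Reading B: the OLS bias is the same in every context.
# Then the true effect in each context is OLS minus that common bias.
implied_effect = {c: round(b - bias_i, 2) for c, b in ols.items()}

print("bias if effect is constant:", implied_bias)
print("effect if bias is constant:", implied_effect)
```

Under reading A the implied bias ranges from -0.3 to 0.1 across contexts; under reading B the implied effect ranges from 0.1 to 0.5. Neither reading lets the RCT from context i stand in for contexts j and k without an extra assumption about behavior that the experiment itself does not test.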

Does not work: "rigorous evidence" isn't
- Vivalt (2015) shows the evaluations of impacts of similar programs have very high variance
- Bold et al. show context includes the implementing organization
- Evans shows systematic reviews are not so systematic
- Pritchett and Sandefur (2015) show OLS from the same context beats an RCT from another context at predicting impact
- Muralidharan (2015) shows even lots of zero estimated impacts do reveal why they failed

Not a good model of how organizations actually learn to do things better
Stage of hype cycle, for "Forcing organizations to acknowledge failure":
- Peak of inflated expectations: Introduce rigorous independent evaluation into programs, and organizations (donor and other) will learn "what works".
- Trough of disillusionment: Organizations resist evaluations that come anywhere near core beliefs/practices. Without organizational cooperation, "what works" gets subverted.
- Slope of enlightenment: Learning during implementation can be enhanced by working with implementing organizations, but only at the expense of the independence of the evaluation.

Example: Cameras in classrooms
- Duflo, Hanna et al. show cameras in classrooms of a small NGO in Rajasthan work to increase teacher attendance and raise learning performance of students
- Duflo et al. (2008) show introducing better biometrics (and pay incentives) in Rajasthan reduces presence of ANMs in health sub-centers
- Dhaliwal and Hanna (2013) show biometrics in Karnataka, India, intended to increase attendance, doesn't change attendance of doctors and reduces PHC use

A study of implementing biometrics to track attendance of medical personnel at PHCs in Karnataka
- Use of biometrics to track attendance in treatment, with some threat of docking days missed
- Attendance of doctors tracked was around 30 percent
- Program made patient perception worse
- The program had positive effects on health status because they used the PHC less
[Figure: treatment impact on patient perceptions (staff availability, staff quality, knowledge of entitlements).]
[Figure: percentage change in place of delivery (PHC; large hospital, public or private).]
Source: Dhaliwal and Hanna 2013

Learning when the fitness function is rugged and contextual over a hyperdimensional design space

Not a good model of how to do better development (as opposed to charity mitigating the consequences of lack of development)
Stage of hype cycle, for "Evaluating what is possible to be evaluated":
- Peak of inflated expectations: The RCT would improve evidence and lead to better development outcomes.
- Trough of disillusionment: Most program evaluations want large N to generate statistical power and hence focus on individualized treatments (e.g. CCTs, deworming, livelihood programs). Most deep causes of development are ontologically at the social/political/economic level, not the individual level, so national development ultimately matters more for all outcomes than program design.
- Slope of enlightenment: RCTs are only a small part of the development agenda.

Trillion dollar questions versus million dollar questions
- The gain from India's growth performance after an incipient crisis in 1991 (for which policy response appeared to matter, and the policy response was influenced by donor action) cumulatively added trillions of dollars to Indian output.
- The latest Science magazine paper shows replication of an approach to livelihood programs in six countries saw gains in per capita consumption of PPP$54 per year, and that is gross, not net. Even applied to all the billion extreme poor on the planet, that is a gross, not net, gain of 54 billion.
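The arithmetic behind the 54 billion figure is easy to check. The sketch below assumes the number of extreme poor is about one billion, which is what the slide's total implies; the variable names are illustrative. A single trillion is used only as a yardstick: the slide speaks of cumulative trillions while the livelihood gain is per year, so this is a rough order-of-magnitude comparison rather than a like-for-like one.

```python
gain_per_person = 54              # PPP$ per person per year, from the slide
extreme_poor = 1_000_000_000      # assumption: about one billion people

# Gross (not net) yearly gain if the program reached every extreme-poor person.
gross_gain = gain_per_person * extreme_poor

# Yardstick only: one trillion dollars.
one_trillion = 1_000_000_000_000
share_of_trillion = gross_gain / one_trillion

print(f"gross yearly gain: PPP${gross_gain:,}")
print(f"share of one trillion: {share_of_trillion:.1%}")
```

Even before subtracting program costs, the universal-coverage figure is about a twentieth of a single trillion.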

Not a good model of how to do better development (as opposed to charity mitigating the consequences of lack of development)
Stage of hype cycle, for "Political economy of policy making and policy adoption":
- Peak of inflated expectations: By providing unambiguously rigorous and easy-to-understand information to policy makers, they will adopt new programs.
- Trough of disillusionment: Many government mistakes are due to deeply embedded ideas and interests. The idea that policy makers are simply waiting for new evidence is a fanciful view of the political process.
- Slope of enlightenment: RCTs can be built into the experimentation and scaling of policies and programs that policy makers otherwise want to adopt, but this is more experiential learning than impact evaluation.

Don't get me wrong
There should be enormously more, not less, use of randomization. But to achieve this requires not RCTs used for independent evaluation of impact (from inputs to outcomes) but making randomization inside organizations possible, to explore a hyper-dimensional and rugged design space using within-treatment variation to achieve organizational goals (with some small use of impact evaluation).
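One way to picture randomization inside an organization is to assign design variants, rather than treatment versus control, at random across the organization's own sites. The sketch below is a minimal illustration under stated assumptions: the design dimensions, site names, and the `assign_variants` helper are all hypothetical and do not come from the slides.

```python
import itertools
import random

# Hypothetical design dimensions; none of these come from the slides.
design_space = {
    "monitoring": ["camera", "biometric", "none"],
    "incentive": ["pay_linked", "not_linked"],
    "feedback": ["weekly", "monthly"],
}

def assign_variants(units, design_space, seed=0):
    """Randomly assign one design variant to each unit, as evenly as possible."""
    rng = random.Random(seed)  # fixed seed so the assignment is reproducible
    names = list(design_space)
    # Every combination of design choices is one variant (3 * 2 * 2 = 12 here).
    variants = [dict(zip(names, combo))
                for combo in itertools.product(*design_space.values())]
    shuffled = list(units)
    rng.shuffle(shuffled)
    # Deal shuffled units round-robin so variant group sizes differ by at most one.
    return {unit: variants[i % len(variants)] for i, unit in enumerate(shuffled)}

sites = [f"site_{n}" for n in range(24)]
assignment = assign_variants(sites, design_space)
print(assignment["site_0"])
```

Every site receives some version of the program, so the comparison is within treatment. A full factorial like this is only feasible when the design space is small; in a genuinely high-dimensional space an organization would have to sample variants and iterate.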