AI Fairness 360. Kush R. Varshney


IBM Research AI
AI Fairness 360
Kush R. Varshney
krvarshn@us.ibm.com
http://krvarshney.github.io
@krvarshney
http://aif360.mybluemix.net
https://github.com/ibm/aif360
https://pypi.org/project/aif360
© 2018 International Business Machines Corporation

AI is now used in many high-stakes decision-making applications: credit, employment, admission, and sentencing.

What does it take to trust a decision made by a machine (other than that it is 99% accurate)?
Is it fair?
Is it easy to understand?
Did anyone tamper with it?
Is it accountable?

Unwanted bias and algorithmic fairness
Machine learning, by its very nature, is always a form of statistical discrimination.
Discrimination becomes objectionable when it places certain privileged groups at a systematic advantage and certain unprivileged groups at a systematic disadvantage.
Such discrimination is illegal in certain contexts.
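One way to make "systematic advantage" concrete is to compare how often each group receives the favorable outcome. The sketch below computes two commonly used group measures, statistical parity difference and disparate impact, in plain Python. The toy data, the group names, and the function names are hypothetical and are not taken from the slides or from any particular library.

```python
def favorable_rate(labels, groups, group):
    """Fraction of favorable outcomes (label 1) among members of `group`."""
    selected = [y for y, g in zip(labels, groups) if g == group]
    return sum(selected) / len(selected)


def statistical_parity_difference(labels, groups, unprivileged, privileged):
    """Favorable rate of the unprivileged group minus that of the privileged group."""
    return (favorable_rate(labels, groups, unprivileged)
            - favorable_rate(labels, groups, privileged))


def disparate_impact(labels, groups, unprivileged, privileged):
    """Ratio of the unprivileged favorable rate to the privileged favorable rate."""
    return (favorable_rate(labels, groups, unprivileged)
            / favorable_rate(labels, groups, privileged))


# Hypothetical toy data: 1 is the favorable outcome, "a" is treated as privileged.
labels = [1, 1, 1, 0, 1, 0, 0, 0, 1, 0]
groups = ["a"] * 5 + ["b"] * 5

spd = statistical_parity_difference(labels, groups, unprivileged="b", privileged="a")
di = disparate_impact(labels, groups, unprivileged="b", privileged="a")
print(f"statistical parity difference: {spd:.2f}")
print(f"disparate impact: {di:.2f}")
```

A difference of zero, or a ratio of one, means both groups receive the favorable outcome at the same rate; values far from those indicate a gap worth investigating.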

Unwanted bias and algorithmic fairness
Unwanted bias in training data yields models with unwanted bias that scale out:
Prejudice in labels
Undersampling or oversampling

Fairness in building and deploying models (d'Alessandro et al., 2017)

Metrics, Algorithms
[Diagram: dataset metric, pre-processing algorithm, in-processing algorithm, post-processing algorithm, classifier metric]

Metrics, Algorithms, and Explainers
[Diagram: dataset metric, dataset metric explainer, pre-processing algorithm, in-processing algorithm, post-processing algorithm, classifier metric, classifier metric explainer]
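To illustrate how a dataset metric and a pre-processing algorithm fit together, the sketch below applies the reweighing idea (Kamiran and Calders): each training example gets a weight so that group membership and label look statistically independent, and the dataset metric is then recomputed with those weights. This is a minimal plain-Python sketch on hypothetical toy data; the function names are illustrative and it is not presented as the toolkit's own implementation.

```python
from collections import Counter


def reweighing_weights(labels, groups):
    """Weight each example by P(group) * P(label) / P(group, label)."""
    n = len(labels)
    group_counts = Counter(groups)
    label_counts = Counter(labels)
    joint_counts = Counter(zip(groups, labels))
    return [
        (group_counts[g] * label_counts[y]) / (n * joint_counts[(g, y)])
        for g, y in zip(groups, labels)
    ]


def weighted_favorable_rate(labels, groups, weights, group):
    """Weighted fraction of favorable outcomes (label 1) within `group`."""
    favorable = sum(w for y, g, w in zip(labels, groups, weights)
                    if g == group and y == 1)
    total = sum(w for g, w in zip(groups, weights) if g == group)
    return favorable / total


# Hypothetical toy data: 1 is the favorable outcome.
labels = [1, 1, 1, 0, 1, 0, 0, 0, 1, 0]
groups = ["a"] * 5 + ["b"] * 5

uniform = [1.0] * len(labels)
weights = reweighing_weights(labels, groups)

# Dataset metric before and after the pre-processing step.
gap_before = (weighted_favorable_rate(labels, groups, uniform, "b")
              - weighted_favorable_rate(labels, groups, uniform, "a"))
gap_after = (weighted_favorable_rate(labels, groups, weights, "b")
             - weighted_favorable_rate(labels, groups, weights, "a"))
print(f"gap before reweighing: {gap_before:.2f}")
print(f"gap after reweighing:  {gap_after:.2f}")
```

The weights would then be passed to any learner that accepts per-sample weights, and a classifier metric would be computed on its predictions in the same way.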

21 (or more) definitions of fairness and the need for a toolbox with guidance
There is no one definition of fairness applicable in all contexts.
Some definitions even conflict.
This requires a comprehensive set of fairness metrics and bias mitigation algorithms.
It also requires some guidance for industry practitioners.
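A small example shows how two definitions can conflict. When the groups have different base rates, a classifier that predicts every label correctly has equal true positive rates across groups (one notion of fairness) yet unequal selection rates (a violation of statistical parity). The data and function names below are hypothetical.

```python
def selection_rate(y_pred, groups, group):
    """Fraction of members of `group` that receive a positive prediction."""
    preds = [p for p, g in zip(y_pred, groups) if g == group]
    return sum(preds) / len(preds)


def true_positive_rate(y_true, y_pred, groups, group):
    """Fraction of truly positive members of `group` predicted positive."""
    preds = [p for t, p, g in zip(y_true, y_pred, groups)
             if g == group and t == 1]
    return sum(preds) / len(preds)


# Hypothetical toy data with different base rates: 4/5 in group "a", 1/5 in group "b".
groups = ["a"] * 5 + ["b"] * 5
y_true = [1, 1, 1, 1, 0, 1, 0, 0, 0, 0]
y_pred = list(y_true)  # a classifier that is always correct

tpr_gap = (true_positive_rate(y_true, y_pred, groups, "b")
           - true_positive_rate(y_true, y_pred, groups, "a"))
selection_gap = (selection_rate(y_pred, groups, "b")
                 - selection_rate(y_pred, groups, "a"))
print(f"true positive rate gap: {tpr_gap:.2f}")
print(f"selection rate gap:     {selection_gap:.2f}")
```

Closing the selection rate gap here would require misclassifying some individuals, which is why the choice of metric depends on the application.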

Bias mitigation is not easy
One cannot simply drop protected attributes, because other features are correlated with them.
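The point about correlated features can be checked directly: if a remaining feature predicts the protected attribute well, a model can still act on that attribute after it has been dropped. The sketch below measures how accurately a single hypothetical proxy feature (a neighborhood) recovers a hypothetical protected attribute by majority vote; all names and data are illustrative.

```python
from collections import Counter, defaultdict


def proxy_accuracy(proxy, protected):
    """Accuracy of guessing the protected attribute from each proxy value's majority group."""
    counts_by_value = defaultdict(Counter)
    for value, group in zip(proxy, protected):
        counts_by_value[value][group] += 1
    correct = sum(counts.most_common(1)[0][1]
                  for counts in counts_by_value.values())
    return correct / len(proxy)


# Hypothetical toy data: the protected attribute is removed from the features,
# but neighborhood remains and is strongly associated with it.
neighborhood = ["north"] * 5 + ["south"] * 5
protected = ["a", "a", "a", "a", "b", "b", "b", "b", "b", "a"]

accuracy = proxy_accuracy(neighborhood, protected)
print(f"protected attribute recovered from proxy with accuracy {accuracy:.2f}")
```

With two equally sized groups, an accuracy well above 0.5 signals that the feature carries information about group membership.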

Research
Algorithmic fairness is one of the hottest topics in the ML/AI research community (Hardt, 2017).

05/03/18: Facebook says it has a tool to detect bias in its artificial intelligence (Quartz)
05/25/18: Microsoft is creating an oracle for catching biased AI algorithms (MIT Technology Review)
05/31/18: Pymetrics open-sources Audit AI, an algorithm bias detection tool (VentureBeat)
06/07/18: Google Education Guide to Responsible AI Practices: Fairness (Google)
06/09/18: Accenture wants to beat unfair AI with a professional toolkit (TechCrunch)

Fairness Measures: Framework to test a given algorithm on a variety of datasets and fairness metrics. https://github.com/megantosh/fairness_measures_code
Fairness Comparison: Extensible test-bed to facilitate direct comparisons of algorithms with respect to fairness measures. Includes raw and preprocessed datasets. https://github.com/algofairness/fairness-comparison
Themis-ML: Python library built on scikit-learn that implements fairness-aware machine learning algorithms. https://github.com/cosmicbboy/themis-ml
FairML: Looks at the significance of model inputs to quantify prediction dependence on inputs. https://github.com/adebayoj/fairml
Aequitas: Web audit tool as well as a Python library. Generates a bias report for a given model and dataset. https://github.com/dssg/aequitas
Fairtest: Tests for associations between algorithm outputs and protected populations. https://github.com/columbia/fairtest
Themis: Takes a black-box decision-making procedure and designs test cases automatically to explore where the procedure might be exhibiting group-based or causal discrimination. https://github.com/laser-umass/themis
Audit-AI: Python library built on top of scikit-learn with various statistical tests for classification and regression tasks. https://github.com/pymetrics/audit-ai

AI Fairness 360: Differentiation
Datasets
Toolbox: fairness metrics (30+), fairness metric explanations, bias mitigation algorithms (9+)
Guidance: industry-specific tutorials
Comprehensive bias mitigation toolbox (including unique algorithms from IBM Research)
Several metrics and algorithms that have no available implementations elsewhere
Extensible
Designed to translate new research from the lab to industry practitioners (e.g. scikit-learn's fit/predict paradigm)

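Optimized Preprocessing (NIPS 2017)
1. Group discrimination: control the dependence $p_{\hat{Y}|D}$ of the transformed outcome $\hat{Y}$ on $D$.
2. Individual distortion: avoid large changes in individual features.
3. Utility preservation: retain the joint distribution $p_{X,Y}$ so the model can still learn the task.

\[
\begin{aligned}
\min_{p_{\hat{X},\hat{Y} \mid X,Y,D}} \quad & \Delta\left(p_{\hat{X},\hat{Y}},\; p_{X,Y}\right) \\
\text{s.t.} \quad & J\left(p_{\hat{Y}|D}(y \mid d_1),\; p_{\hat{Y}|D}(y \mid d_2)\right) \le \epsilon \quad \forall\, y, d_1, d_2 \\
& \mathbb{E}\left[\delta\left((x,y),(\hat{X},\hat{Y})\right) \mid D=d,\, X=x,\, Y=y\right] \le c_{d,x,y} \quad \forall\, (d,x,y)
\end{aligned}
\]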
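The group discrimination condition can be checked empirically on a transformed dataset. The sketch below estimates the conditional outcome distribution for each group and reports the largest value of J over all outcomes and ordered group pairs. It assumes J(p, q) = |p/q - 1| (a probability ratio measure); that choice, the function names, and the toy data are assumptions made for illustration, and the code only verifies the constraint rather than solving the optimization.

```python
from collections import Counter
from itertools import permutations


def conditional_outcome_dist(outcomes, groups):
    """Estimate p(y | d) as a nested dict {d: {y: probability}}."""
    group_sizes = Counter(groups)
    joint = Counter(zip(groups, outcomes))
    values = sorted(set(outcomes))
    return {d: {y: joint[(d, y)] / group_sizes[d] for y in values}
            for d in group_sizes}


def max_discrimination(outcomes, groups):
    """Largest J(p, q) = |p/q - 1| over outcomes and ordered group pairs (assumed form of J)."""
    dist = conditional_outcome_dist(outcomes, groups)
    worst = 0.0
    for d1, d2 in permutations(dist, 2):
        for y in dist[d1]:
            p, q = dist[d1][y], dist[d2][y]
            if q == 0:
                if p > 0:
                    return float("inf")  # ratio is unbounded
                continue  # both probabilities are zero: no discrepancy
            worst = max(worst, abs(p / q - 1.0))
    return worst


# Hypothetical toy data: outcomes before and after some transformation.
groups = ["a"] * 5 + ["b"] * 5
original = [1, 1, 1, 1, 0, 1, 0, 0, 0, 0]
transformed = [1, 1, 1, 0, 0, 1, 1, 0, 0, 0]

epsilon = 0.6  # hypothetical tolerance
j_original = max_discrimination(original, groups)
j_transformed = max_discrimination(transformed, groups)
print(f"original:    J = {j_original:.2f}, within epsilon: {j_original <= epsilon}")
print(f"transformed: J = {j_transformed:.2f}, within epsilon: {j_transformed <= epsilon}")
```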