INTRODUCTORY STATISTICS LECTURE 4 PROBABILITY

Similar documents
Discrete Random Variables Day 1

Such a description is the basis for a probability model. Here is the basic vocabulary we use.

Module 4 Project Maths Development Team Draft (Version 2)

Probability. Ms. Weinstein Probability & Statistics

Random Variables. A Random Variable is a rule that assigns a number to each outcome of an experiment.

Stat 20: Intro to Probability and Statistics

Random Variables. Outcome X (1, 1) 2 (2, 1) 3 (3, 1) 4 (4, 1) 5. (6, 1) (6, 2) (6, 3) (6, 4) (6, 5) (6, 6) }

Class XII Chapter 13 Probability Maths. Exercise 13.1

Chapter 1. Probability

S = {(1, 1), (1, 2),, (6, 6)}

Chapter 1. Probability

Probability MAT230. Fall Discrete Mathematics. MAT230 (Discrete Math) Probability Fall / 37

4.1 Sample Spaces and Events

CHAPTERS 14 & 15 PROBABILITY STAT 203

Exam III Review Problems

Chapter 11: Probability and Counting Techniques

CHAPTER 2 PROBABILITY. 2.1 Sample Space. 2.2 Events

PROBABILITY. 1. Introduction. Candidates should able to:

November 6, Chapter 8: Probability: The Mathematics of Chance

The Teachers Circle Mar. 20, 2012 HOW TO GAMBLE IF YOU MUST (I ll bet you $5 that if you give me $10, I ll give you $20.)

APPENDIX 2.3: RULES OF PROBABILITY

, x {1, 2, k}, where k > 0. (a) Write down P(X = 2). (1) (b) Show that k = 3. (4) Find E(X). (2) (Total 7 marks)

November 11, Chapter 8: Probability: The Mathematics of Chance

Mathematical Foundations HW 5 By 11:59pm, 12 Dec, 2015

Chapter 6: Probability and Simulation. The study of randomness

Probability (Devore Chapter Two)

STAT 155 Introductory Statistics. Lecture 11: Randomness and Probability Model

Classical Definition of Probability Relative Frequency Definition of Probability Some properties of Probability

3 The multiplication rule/miscellaneous counting problems

Combinatorics: The Fine Art of Counting

Section Introduction to Sets

Week 3 Classical Probability, Part I

November 8, Chapter 8: Probability: The Mathematics of Chance

Probability. Dr. Zhang Fordham Univ.

Intermediate Math Circles November 1, 2017 Probability I

Lesson 4: Chapter 4 Sections 1-2

4.3 Rules of Probability

Contents 2.1 Basic Concepts of Probability Methods of Assigning Probabilities Principle of Counting - Permutation and Combination 39

STOR 155 Introductory Statistics. Lecture 10: Randomness and Probability Model

Midterm 2 Practice Problems

ECON 214 Elements of Statistics for Economists

Total. STAT/MATH 394 A - Autumn Quarter Midterm. Name: Student ID Number: Directions. Complete all questions.

MATH 215 DISCRETE MATHEMATICS INSTRUCTOR: P. WENG

North Seattle Community College Winter ELEMENTARY STATISTICS 2617 MATH Section 05, Practice Questions for Test 2 Chapter 3 and 4

Chapter 6: Probability and Simulation. The study of randomness

RANDOM EXPERIMENTS AND EVENTS

MTH 103 H Final Exam. 1. I study and I pass the course is an example of a. (a) conjunction (b) disjunction. (c) conditional (d) connective

AP Statistics Ch In-Class Practice (Probability)

Discrete Structures for Computer Science

Introduction to probability

Probability and Counting Techniques

3 The multiplication rule/miscellaneous counting problems

Math 1313 Section 6.2 Definition of Probability

The study of probability is concerned with the likelihood of events occurring. Many situations can be analyzed using a simplified model of probability

Probability and Randomness. Day 1

Conditional Probability Worksheet

7.1 Experiments, Sample Spaces, and Events

Outcome X (1, 1) 2 (2, 1) 3 (3, 1) 4 (4, 1) 5 {(1, 1) (1, 2) (1, 3) (1, 4) (1, 5) (1, 6) (6, 1) (6, 2) (6, 3) (6, 4) (6, 5) (6, 6)}

Beginnings of Probability I

Week 1: Probability models and counting

Define and Diagram Outcomes (Subsets) of the Sample Space (Universal Set)

1. A factory makes calculators. Over a long period, 2 % of them are found to be faulty. A random sample of 100 calculators is tested.

Review Questions on Ch4 and Ch5

STAT 430/510 Probability Lecture 3: Space and Event; Sample Spaces with Equally Likely Outcomes

Statistics Intermediate Probability

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Review of Probability

23 Applications of Probability to Combinatorics

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Math Exam 2 Review. NOTE: For reviews of the other sections on Exam 2, refer to the first page of WIR #4 and #5.

Math Exam 2 Review. NOTE: For reviews of the other sections on Exam 2, refer to the first page of WIR #4 and #5.

Math 14 Lecture Notes Ch. 3.3

CSC/MTH 231 Discrete Structures II Spring, Homework 5

Conditional Probability Worksheet

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except on problems 1 & 2. Work neatly.

The probability set-up

Textbook: pp Chapter 2: Probability Concepts and Applications

Chapter 3: Elements of Chance: Probability Methods

Lecture 21/Chapter 18 When Intuition Differs from Relative Frequency

( ) Online MC Practice Quiz KEY Chapter 5: Probability: What Are The Chances?

Math 1342 Exam 2 Review

Math 227 Elementary Statistics. Bluman 5 th edition

Section 6.1 #16. Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Module 5: Probability and Randomness Practice exercises

SALES AND MARKETING Department MATHEMATICS. Combinatorics and probabilities. Tutorials and exercises

Chapter 11: Probability and Counting Techniques

Grade 7/8 Math Circles February 25/26, Probability

STAT Statistics I Midterm Exam One. Good Luck!

1. An office building contains 27 floors and has 37 offices on each floor. How many offices are in the building?

The next several lectures will be concerned with probability theory. We will aim to make sense of statements such as the following:

Simple Probability. Arthur White. 28th September 2016

I. WHAT IS PROBABILITY?

a) Getting 10 +/- 2 head in 20 tosses is the same probability as getting +/- heads in 320 tosses

Chapter 11. Sampling Distributions. BPS - 5th Ed. Chapter 11 1

Grade 6 Math Circles Fall Oct 14/15 Probability

ABC High School, Kathmandu, Nepal. Topic : Probability

1. How to identify the sample space of a probability experiment and how to identify simple events

The probability set-up

Unit 9: Probability Assignments

Transcription:

INTRODUCTORY STATISTICS LECTURE 4 PROBABILITY

THE GREAT SCHLITZ CAMPAIGN 1981 Superbowl Broadcast of a live taste pitting Against key competitor: Michelob Subjects: 100 Michelob drinkers REF: SCHLITZBREWING.COM

MICHELOB VS. SCHLITZ VS. REF: DAILYHIIT.COM.

MICHELOB VS. SCHLITZ Average person prefer with 0.5 probability VS. REF: DAILYHIIT.COM.

MICHELOB VS. SCHLITZ Average person prefer with 0.5 probability VS. probability of least say 40% of Michelob drinkers prefer Schlitz? REF: DAILYHIIT.COM.

MICHELOB VS. SCHLITZ Average person prefer with 0.5 probability VS. probability of least say 40% of Michelob drinkers prefer Schlitz? 98% REF: DAILYHIIT.COM.

PROBABILITY GAMES REF:WIKIPEDIA.ORG

PROBABILITY GAMES Blackjack Yahtzee Backgammon Poker and many others REF:WIKIPEDIA.ORG

REF: NJAROUNDTHEWORLD.COM/

MIT blackjack team Students and professors from MIT Basic probability in 1980s, profit: 170$/hour Max in a single trip: 500K $ REF: NJAROUNDTHEWORLD.COM/

THE HISTORY Starts with Cardano Medical doctor Gambling addict Book on Games of Chance REF:WIKIPEDIA.ORG

FIRST PROBABILITY MODELS A coin is flipped What is the probability that a Tails show? We can say 0.5, if and only if the coin is unbiased What if outcomes are not equally likely?

REVIEW Which of the following events would you be most surprised by? a. exactly 3 heads in 10 coin flips b. exactly 3 heads in 100 coin flips c. exactly 3 heads in 1000 coin flips

REVIEW Which of the following events would you be most surprised by? a. exactly 3 heads in 10 coin flips b. exactly 3 heads in 100 coin flips c. exactly 3 heads in 1000 coin flips

REVIEW Which of the following events would you be most surprised by? a. exactly 3 heads in 10 coin flips b. exactly 3 heads in 100 coin flips c. exactly 3 heads in 1000 coin flips

REVIEW Which of the following events would you be most surprised by? a. exactly 3 heads in 10 coin flips b. exactly 3 heads in 100 coin flips c. exactly 3 heads in 1000 coin flips Total Outcomes: 2number of flips

REVIEW Which of the following events would you be most surprised by? a. exactly 3 heads in 10 coin flips C(10,3) / 2 10 b. exactly 3 heads in 100 coin flips C(100,3) / 2 100 c. exactly 3 heads in 1000 coin flips C(1000,3) / 2 100 a = 0.11, b = 9.46 x 10-29, c =1.11x 10-299

REVIEW Which of the following events would you be most surprised by? a. exactly 3 heads in 10 coin flips C(10,3) / 2 10 b. exactly 3 heads in 100 coin flips C(100,3) / 2 100 c. exactly 3 heads in 1000 coin flips C(1000,3) / 2 100 a = 0.11, b = 9.46 x 10-29, c =1.11x 10-299

REVIEW Which of the following events would you be most surprised by? a. exactly 3 heads in 10 coin flips C(10,3) / 2 10 b. exactly 3 heads in 100 coin flips C(100,3) / 2 100 c. exactly 3 heads in 1000 coin flips C(1000,3) / 2 100 a = 0.11, b = 9.46 x 10-29, c =1.11x 10-299

LAW OF LARGE NUMBERS Law of large numbers states that as more observations are collected, the proportion of occurrences with a particular outcome, A, converges to the probability of that outcome, P(A).

DISJOINT AND NON-DISJOINT OUTCOMES

DISJOINT AND NON-DISJOINT OUTCOMES Disjoint (mutually exclusive) outcomes: Cannot happen at the same time.

DISJOINT AND NON-DISJOINT OUTCOMES Disjoint (mutually exclusive) outcomes: Cannot happen at the same time. The outcome of a single coin toss cannot be a head and a tail.

DISJOINT AND NON-DISJOINT OUTCOMES Disjoint (mutually exclusive) outcomes: Cannot happen at the same time. The outcome of a single coin toss cannot be a head and a tail. A student both cannot fail and pass a class.

DISJOINT AND NON-DISJOINT OUTCOMES Disjoint (mutually exclusive) outcomes: Cannot happen at the same time. The outcome of a single coin toss cannot be a head and a tail. A student both cannot fail and pass a class. Non-disjoint outcomes: Can happen at the same time.

DISJOINT AND NON-DISJOINT OUTCOMES Disjoint (mutually exclusive) outcomes: Cannot happen at the same time. The outcome of a single coin toss cannot be a head and a tail. A student both cannot fail and pass a class. Non-disjoint outcomes: Can happen at the same time. A student can get an A in Stats and A in Econ in the same semester.

COMPLEMENTARY EVENTS

COMPLEMENTARY EVENTS Complementary events are two mutually exclusive events whose probabilities that add up to 1.

COMPLEMENTARY EVENTS Complementary events are two mutually exclusive events whose probabilities that add up to 1. A couple has one kid. If we know that the kid is not a boy, what is gender of this kid? {M,F} > Boy and girl are complementary outcomes.

COMPLEMENTARY EVENTS Complementary events are two mutually exclusive events whose probabilities that add up to 1. A couple has one kid. If we know that the kid is not a boy, what is gender of this kid? {M,F} > Boy and girl are complementary outcomes. A couple has two kids, if we know that they are not both girls, what are the possible gender combinations for these kids? {MM, MF, FM, FF}

INDEPENDENCE

INDEPENDENCE Two events are independent if knowing the outcome of one provides no useful information about the outcome of the other.

INDEPENDENCE Two events are independent if knowing the outcome of one provides no useful information about the outcome of the other. Knowing that the coin landed on a head on the first toss does not provide any useful information for determining what the coin will land on in the second toss.

INDEPENDENCE Two events are independent if knowing the outcome of one provides no useful information about the outcome of the other. Knowing that the coin landed on a head on the first toss does not provide any useful information for determining what the coin will land on in the second toss. Outcomes of two tosses of a coin are independent.

INDEPENDENCE Two events are independent if knowing the outcome of one provides no useful information about the outcome of the other. Knowing that the coin landed on a head on the first toss does not provide any useful information for determining what the coin will land on in the second toss. Outcomes of two tosses of a coin are independent. Knowing that the first card drawn from a deck is an ace does provide useful information for determining the probability of drawing an ace in the second draw.

INDEPENDENCE Two events are independent if knowing the outcome of one provides no useful information about the outcome of the other. Knowing that the coin landed on a head on the first toss does not provide any useful information for determining what the coin will land on in the second toss. Outcomes of two tosses of a coin are independent. Knowing that the first card drawn from a deck is an ace does provide useful information for determining the probability of drawing an ace in the second draw. Outcomes of two draws from a deck of cards (without replacement) are dependent.

EXERCISE Between January 9-12, 2013, SurveyUSA interviewed a random sample of 500 NC residents asking them whether they think widespread gun ownership protects law abiding citizens from crime, or makes society more dangerous. 58% of all respondents said it protects citizens. 67% of White respondents, 28% of Black respondents, and 64% of Hispanic respondents shared this view. Which of the below is true? Opinion on gun ownership and race ethnicity are most likely (a) complementary (b) mutually exclusive (c) independent (d) dependent (e) disjoint

EXERCISE Between January 9-12, 2013, SurveyUSA interviewed a random sample of 500 NC residents asking them whether they think widespread gun ownership protects law abiding citizens from crime, or makes society more dangerous. 58% of all respondents said it protects citizens. 67% of White respondents, 28% of Black respondents, and 64% of Hispanic respondents shared this view. Which of the below is true? Opinion on gun ownership and race ethnicity are most likely (a) complementary (b) mutually exclusive (c) independent (d) dependent (e) disjoint

REVIEW

REVIEW

REVIEW Do the sum of probabilities of two disjoint events always add up to 1?

REVIEW Do the sum of probabilities of two disjoint events always add up to 1? Not necessarily, there may be more than 2 events in the sample space, e.g. party affiliation.

REVIEW Do the sum of probabilities of two disjoint events always add up to 1? Not necessarily, there may be more than 2 events in the sample space, e.g. party affiliation. Do the sum of probabilities of two complementary events always add up to 1?

REVIEW Do the sum of probabilities of two disjoint events always add up to 1? Not necessarily, there may be more than 2 events in the sample space, e.g. party affiliation. Do the sum of probabilities of two complementary events always add up to 1? Yes, that s the definition of complementary, e.g. heads and tails.

EXERCISE

EXERCISE If we were to randomly select 5 Texans, what is the probability that at least one is uninsured?

EXERCISE If we were to randomly select 5 Texans, what is the probability that at least one is uninsured? The sample space for the number of Texans who are uninsured would be: S = {0,1,2,3,4,5}

EXERCISE If we were to randomly select 5 Texans, what is the probability that at least one is uninsured? The sample space for the number of Texans who are uninsured would be: We are interested in instances where at least one person is uninsured: S = {0,1,2,3,4,5}

EXERCISE If we were to randomly select 5 Texans, what is the probability that at least one is uninsured? The sample space for the number of Texans who are uninsured would be: We are interested in instances where at least one person is uninsured: So we can divide up the sample space intro two categories: S = {0,at least one}

EXERCISE If we were to randomly select 5 Texans, what is the probability that at least one is uninsured? The sample space for the number of Texans who are uninsured would be: We are interested in instances where at least one person is uninsured: So we can divide up the sample space intro two categories:

EXERCISE (CONTINUED ) Since the probability of the sample space must add up to 1: Prob(at least 1 uninsured) = 1 Prob(none uninsured) = 1 - [ (1-0.255) 5 ] = 0.77

EXERCISE Roughly 20% of undergraduates at a university are vegetarian or vegan. What is the probability that, among a random sample of 3 undergraduates, at least one is vegetarian or vegan? A. 1 0.2 3 B. 1 0.2 3 C. 0.8 3 D. 1 0.8 3 E. 1 0.8 3

EXERCISE Roughly 20% of undergraduates at a university are vegetarian or vegan. What is the probability that, among a random sample of 3 undergraduates, at least one is vegetarian or vegan? A. 1 0.2 3 B. 1 0.2 3 C. 0.8 3 D. 1 0.8 3 E. 1 0.8 3

CONDITIONAL PROBABILITY Researchers randomly assigned 72 chronic users of cocaine into three groups: desipramine (antidepressant), lithium (standard treatment for cocaine) and placebo. Results of the study are summarized below. REF: HTTP://WWW.OSWEGO.EDU/ SRP/STATS/2_WAY_TBL_1.HTM

CONDITIONAL PROBABILITY Researchers randomly assigned 72 chronic users of cocaine into three groups: desipramine (antidepressant), lithium (standard treatment for cocaine) and placebo. Results of the study are summarized below. Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72 REF: HTTP://WWW.OSWEGO.EDU/ SRP/STATS/2_WAY_TBL_1.HTM

CONDITIONAL PROBABILITY What is the probability a patient relapsed? Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72

CONDITIONAL PROBABILITY What is the probability a patient relapsed? Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72

CONDITIONAL PROBABILITY What is the probability a patient relapsed? Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72

CONDITIONAL PROBABILITY What is the probability a patient relapsed? Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72 P(relapsed) = 48/72 = 0.67

CONDITIONAL PROBABILITY What is the probability that a patient received desipramin and relapsed? Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72

CONDITIONAL PROBABILITY What is the probability that a patient received desipramin and relapsed? Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72 P(relapsed) = 10/72 = 0.14

CONDITIONAL PROBABILITY What is the probability that a patient received desipramin and relapsed? Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72 P(relapsed) = 10/72 = 0.14

CONDITIONAL PROBABILITY What is the probability that a patient received desipramin and relapsed? Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72 P(relapsed) = 10/72 = 0.14

CONDITIONAL PROBABILITY Conditional probability The conditional probability of the outcome of interest A, given condition B, is calculated as P (A B) = P (A and B) P (B)

Probability that patients took desipramin relapsed P( relapsed disepramin) = Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72

Probability that patients took desipramin relapsed P( relapsed disepramin) = P (relapse and disepramin) P (disepramin) Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72

Probability that patients took desipramin relapsed P( relapsed disepramin) = P (relapse and disepramin) P (disepramin) Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72

Probability that patients took desipramin relapsed P( relapsed disepramin) = P (relapse and disepramin) P (disepramin) Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72

Probability that patients took desipramin relapsed P( relapsed disepramin) = P (relapse and disepramin) P (disepramin) Relapse No Relapse Total Desipramin 10 14 24 Lithium e 18 6 24 Placebo 20 4 24 Total 48 24 72 10/24 = 0.42

INDEPENDENT EVENTS Two events are independent if knowing the outcome of one provides no useful information about the outcome of the other. If A and B are independent events, P(A B) = P(A) If P(A B) = P(A), then A and B are independent events

BREAST CANCER American Cancer Society estimates that about 1.7% of women have breast cancer. HTTP:// WWW.CANCER.ORG/ CANCER/ CANCERBASICS/ CANCER-PREVALENCE Susan G. Komen For The Cure Foundation states that mammography correctly identifies about 78% of women who truly have breast cancer. HTTP:// WW5.KOMEN.ORG/ BREASTCANCER/ ACCURACYOFMAMMOGRAMS.HTML An article published in 2003 suggests that up to 10% of all mammograms result in false positives for patients who do not have cancer. HTTP:// WWW.NCBI.NLM.NIH.GOV/ PMC/ ARTICLES/ PMC136094

BREAST CANCER When a patient goes through breast cancer screening there are two competing claims: patient had cancer and patient doesn t have cancer. If a mammogram yields a positive result, what is the probability that patient actually has cancer? Cancer status Test result cancer, 0.017 positive, 0.78 negative, 0.22 0.017*0.78 = 0.0133 0.017*0.22 = 0.0037 no cancer, 0.983 positive, 0.1 negative, 0.9 0.983*0.1 = 0.0983 0.983*0.9 = 0.8847

BREAST CANCER When a patient goes through breast cancer screening there are two competing claims: patient had cancer and patient doesn t have cancer. If a mammogram yields a positive result, what is the probability that patient actually has cancer? Cancer status Test result cancer, 0.017 positive, 0.78 negative, 0.22 0.017*0.78 = 0.0133 0.017*0.22 = 0.0037 no cancer, 0.983 positive, 0.1 negative, 0.9 0.983*0.1 = 0.0983 0.983*0.9 = 0.8847

BREAST CANCER When a patient goes through breast cancer screening there are two competing claims: patient had cancer and patient doesn t have cancer. If a mammogram yields a positive result, what is the probability that patient actually has cancer? Cancer status Test result cancer, 0.017 positive, 0.78 negative, 0.22 0.017*0.78 = 0.0133 0.017*0.22 = 0.0037 no cancer, 0.983 positive, 0.1 negative, 0.9 0.983*0.1 = 0.0983 0.983*0.9 = 0.8847

BREAST CANCER When a patient goes through breast cancer screening there are two competing claims: patient had cancer and patient doesn t have cancer. If a mammogram yields a positive result, what is the probability that patient actually has cancer? Cancer status Test result cancer, 0.017 positive, 0.78 negative, 0.22 0.017*0.78 = 0.0133 0.017*0.22 = 0.0037 no cancer, 0.983 positive, 0.1 negative, 0.9 0.983*0.1 = 0.0983 0.983*0.9 = 0.8847

BREAST CANCER When a patient goes through breast cancer screening there are two competing claims: patient had cancer and patient doesn t have cancer. If a mammogram yields a positive result, what is the probability that patient actually has cancer? Cancer status Test result cancer, 0.017 positive, 0.78 negative, 0.22 0.017*0.78 = 0.0133 0.017*0.22 = 0.0037 no cancer, 0.983 positive, 0.1 negative, 0.9 0.983*0.1 = 0.0983 0.983*0.9 = 0.8847

BREAST CANCER When a patient goes through breast cancer screening there are two competing claims: patient had cancer and patient doesn t have cancer. If a mammogram yields a positive result, what is the probability that patient actually has cancer? Cancer status Test result cancer, 0.017 positive, 0.78 negative, 0.22 0.017*0.78 = 0.0133 0.017*0.22 = 0.0037 no cancer, 0.983 positive, 0.1 negative, 0.9 0.983*0.1 = 0.0983 0.983*0.9 = 0.8847

BREAST CANCER When a patient goes through breast cancer screening there are two competing claims: patient had cancer and patient doesn t have cancer. If a mammogram yields a positive result, what is the probability that patient actually has cancer? Cancer status Test result cancer, 0.017 no cancer, 0.983 positive, 0.78 negative, 0.22 positive, 0.1 0.017*0.78 = 0.0133 0.017*0.22 = 0.0037 0.983*0.1 = 0.0983 P(C +) P(C and +) = P(+) 0.0133 = 0.0133 + 0.0983 = 0.12 negative, 0.9 0.983*0.9 = 0.8847

BAYES THEOREM Bayes Theorem: P(outcome A of variable 1 outcome B of variable 2) = P(B A 1 )P(A 1 ) P(B A 1 )P(A 1 ) + P(B A 2 )P(A 2 ) + + P(B A k )P(A k ) where A1, A2,, Ak represent all other possible outcomes of variable 1.

EXAMPLE

EXAMPLE A common epidemiological model for the spread of diseases is the SIR model, where the population is partitioned into three groups: Susceptible, Infected, and Recovered. This is a reasonable model for diseases like chickenpox where a single infection usually provides immunity to subsequent infections. Sometimes these diseases can also be difficult to detect.

EXAMPLE A common epidemiological model for the spread of diseases is the SIR model, where the population is partitioned into three groups: Susceptible, Infected, and Recovered. This is a reasonable model for diseases like chickenpox where a single infection usually provides immunity to subsequent infections. Sometimes these diseases can also be difficult to detect. Imagine a population in the midst of an epidemic where 60% of the population is considered susceptible, 10% is infected, and 30% is recovered. The only test for the disease is accurate 95% of the time for susceptible individuals, 99% for infected individuals, but 65% for recovered individuals. (Note: In this case accurate means returning a negative result for susceptible and recovered individuals and a positive result for infected individuals).

EXAMPLE A common epidemiological model for the spread of diseases is the SIR model, where the population is partitioned into three groups: Susceptible, Infected, and Recovered. This is a reasonable model for diseases like chickenpox where a single infection usually provides immunity to subsequent infections. Sometimes these diseases can also be difficult to detect. Imagine a population in the midst of an epidemic where is considered susceptible, for the disease is accurate infected individuals accurate means returning a negative result for susceptible and recovered individuals and a positive result for infected individuals). Draw a probability tree to reflect the information given above. If the individual has tested positive, what is the probability that they are actually infected?

` 60%: susceptible 10%: infected 30%:recovered. TEST 95% : susceptible 99% : infected 65% : recovered Group Test result susceptible, 0.6 infected, 0.1 recovered, 0.3 positive, 0.05 negative, 0.95 positive, 0.99 negative, 0.01 positive, 0.35 negative, 0.65 0.03 0.57 0.099 0.001 0.105 0.195

` 60%: susceptible 10%: infected 30%:recovered. TEST 95% : susceptible 99% : infected 65% : recovered Group Test result susceptible, 0.6 infected, 0.1 recovered, 0.3 positive, 0.05 negative, 0.95 positive, 0.99 negative, 0.01 positive, 0.35 negative, 0.65 0.03 0.57 0.099 0.001 0.105 0.195

` 60%: susceptible 10%: infected 30%:recovered. TEST 95% : susceptible 99% : infected 65% : recovered Group Test result susceptible, 0.6 infected, 0.1 recovered, 0.3 positive, 0.05 negative, 0.95 positive, 0.99 negative, 0.01 positive, 0.35 negative, 0.65 0.03 0.57 0.099 0.001 0.105 0.195

` 60%: susceptible 10%: infected 30%:recovered. TEST 95% : susceptible 99% : infected 65% : recovered Group Test result susceptible, 0.6 infected, 0.1 recovered, 0.3 positive, 0.05 negative, 0.95 positive, 0.99 negative, 0.01 positive, 0.35 negative, 0.65 0.03 0.57 0.099 0.001 0.105 0.195

EXERCISE (CONT.) Group Test result susceptible, 0.6 infected, 0.1 recovered, 0.3 positive, 0.05 negative, 0.95 positive, 0.99 negative, 0.01 positive, 0.35 negative, 0.65 0.03 0.57 0.099 0.001 0.105 0.195

EXERCISE (CONT.) Group Test result susceptible, 0.6 infected, 0.1 recovered, 0.3 positive, 0.05 negative, 0.95 positive, 0.99 negative, 0.01 positive, 0.35 negative, 0.65 0.03 0.57 0.099 0.001 0.105 0.195

EXERCISE (CONT.) P(inf +) = P(inf and +) P(+) = 0.099 0.03 + 0.099 + 0.105 0.423 Group Test result susceptible, 0.6 infected, 0.1 recovered, 0.3 positive, 0.05 negative, 0.95 positive, 0.99 negative, 0.01 positive, 0.35 negative, 0.65 0.03 0.57 0.099 0.001 0.105 0.195

EXERCISE (CONT.) P(inf +) = P(inf and +) P(+) = 0.099 0.03 + 0.099 + 0.105 0.423 Group Test result susceptible, 0.6 infected, 0.1 recovered, 0.3 positive, 0.05 negative, 0.95 positive, 0.99 negative, 0.01 positive, 0.35 negative, 0.65 0.03 0.57 0.099 0.001 0.105 0.195

EXERCISE (CONT.) P(inf +) = P(inf and +) P(+) = 0.099 0.03 + 0.099 + 0.105 0.423 Group Test result susceptible, 0.6 infected, 0.1 recovered, 0.3 positive, 0.05 negative, 0.95 positive, 0.99 negative, 0.01 positive, 0.35 negative, 0.65 0.03 0.57 0.099 0.001 0.105 0.195

EXERCISE (CONT.) P(inf +) = P(inf and +) P(+) = 0.099 0.03 + 0.099 + 0.105 0.423 Group Test result susceptible, 0.6 infected, 0.1 recovered, 0.3 positive, 0.05 negative, 0.95 positive, 0.99 negative, 0.01 positive, 0.35 negative, 0.65 0.03 0.57 0.099 0.001 0.105 0.195

RANDOM VARIABLES

RANDOM VARIABLES A random variable is a numeric quantity whose value depends on the outcome of a random event

RANDOM VARIABLES A random variable is a numeric quantity whose value depends on the outcome of a random event We use a capital letter, like X, to denote a random variable.

RANDOM VARIABLES A random variable is a numeric quantity whose value depends on the outcome of a random event We use a capital letter, like X, to denote a random variable. The values of a random variable are denoted with a lower case letter, in this case x For example, P( X=x)

EXPECTATION We are often interested in the average outcome of a random variable. We call this the expected value (mean), and it is a weighted average of the possible outcomes µ = E(X) = kx xp (X = x i ) i=1

A DICE GAME

A DICE GAME Enter to game: 3$

A DICE GAME Enter to game: 3$ You roll a fair dice, whatever value you roll, you get its dollar equivalent

A DICE GAME Enter to game: 3$ You roll a fair dice, whatever value you roll, you get its dollar equivalent E.g: If you roll a 2, you get 2$, if you roll 5 you get 5$

A DICE GAME Enter to game: 3$ You roll a fair dice, whatever value you roll, you get its dollar equivalent E.g: If you roll a 2, you get 2$, if you roll 5 you get 5$ If you play this game 3000 times, how much money would you expect to lose or win?

A DICE GAME Outcomes = {1,2,3,4,5,6} After 1000 times, you ll get almost equal amounts of 1 s, 2 s, 6 s, that is 3000/6 = 500 times Total Earning = (1 x 500 + 2x500 +.. +6 x 500)= 10,500 Total Cost = 3 x 3000 = 9,000 Expected Profit = 10,500-9,000 = 1,500 $

EXPECTATION

EXPECTATION

EXPECTATION Dice X P(X) X P(X)

EXPECTATION Dice X P(X) X P(X) 1 $1 1/6 1/6

EXPECTATION Dice X P(X) X P(X) 1 $1 1/6 1/6 2 $2 1/6 2/6

EXPECTATION Dice X P(X) X P(X) 1 $1 1/6 1/6 2 $2 1/6 2/6 3 $3 1/6 3/6

EXPECTATION Dice X P(X) X P(X) 1 $1 1/6 1/6 2 $2 1/6 2/6 3 $3 1/6 3/6 4 $4 1/6 4/6

EXPECTATION Dice X P(X) X P(X) 1 $1 1/6 1/6 2 $2 1/6 2/6 3 $3 1/6 3/6 4 $4 1/6 4/6 5 $5 1/6 5/6

EXPECTATION Dice X P(X) X P(X) 1 $1 1/6 1/6 2 $2 1/6 2/6 3 $3 1/6 3/6 4 $4 1/6 4/6 5 $5 1/6 5/6 6 $6 1/6 6/6

EXPECTATION Dice X P(X) X P(X) 1 $1 1/6 1/6 2 $2 1/6 2/6 3 $3 1/6 3/6 4 $4 1/6 4/6 5 $5 1/6 5/6 6 $6 1/6 6/6 Total E(X) = 21/6 = 3.5

EXPECTED VALUE OF A DISCRETE RANDOM VARIABLE In a game of cards you win $1 if you draw a heart, $5 if you draw an ace (including the ace of hearts), $10 if you draw the king of spades and nothing for any other card you draw. Write the probability model for your winnings, and calculate your expected winning.

EXPECTED VALUE OF A DISCRETE RANDOM VARIABLE $1, a heart- $5, an ace $10, king of spades - $0, any other card Event X P(X) XP(X) Heart (not ace) 1 12 52 Ace 5 4 52 King of spades 10 1 52 All else 0 35 52 0 Total E(X) = 42 52 0.81 12 52 20 52 10 52

EXPECTED VALUE OF A DISCRETE RANDOM VARIABLE $1, a heart- $5, an ace $10, king of spades - $0, any other card Event X P(X) XP(X) Heart (not ace) 1 12 52 Ace 5 4 52 King of spades 10 1 52 All else 0 35 52 0 Total E(X) = 42 52 0.81 12 52 20 52 10 52

EXPECTED VALUE OF A DISCRETE RANDOM VARIABLE Event X P(X) XP(X) Heart (not ace) 1 12 52 Ace 5 4 52 King of spades 10 1 52 12 52 20 52 10 52 All else 0 35 52 0 Total E(X) = 42 52 0.81

EXPECTED VALUE OF A DISCRETE RANDOM VARIABLE 0.7 0.525 Probability 0.35 0.175 0 0 1 2 3 5 6 7 8 9 10 X Event X P(X) XP(X) Heart (not ace) 1 12 52 Ace 5 4 52 King of spades 10 1 52 12 52 20 52 10 52 All else 0 35 52 0 Total E(X) = 42 52 0.81

EXPECTED VALUE OF A DISCRETE RANDOM VARIABLE 0.7 0.525 Probability 0.35 0.175 0 0 1 2 3 5 6 7 8 9 10 X Event X P(X) XP(X) Heart (not ace) 1 12 52 Ace 5 4 52 King of spades 10 1 52 12 52 20 52 10 52 All else 0 35 52 0 Total E(X) = 42 52 0.81

EXPECTED VALUE OF A DISCRETE RANDOM VARIABLE 0.7 0.525 Probability 0.35 0.175 0 0 1 2 3 5 6 7 8 9 10 X Event X P(X) XP(X) Heart (not ace) 1 12 52 Ace 5 4 52 King of spades 10 1 52 12 52 20 52 10 52 All else 0 35 52 0 Total E(X) = 42 52 0.81

EXPECTED VALUE OF A DISCRETE RANDOM VARIABLE 0.7 0.525 Probability 0.35 0.175 0 0 1 2 3 5 6 7 8 9 10 X Event X P(X) XP(X) Heart (not ace) 1 12 52 Ace 5 4 52 King of spades 10 1 52 12 52 20 52 10 52 All else 0 35 52 0 Total E(X) = 42 52 0.81

EXPECTED VALUE OF A DISCRETE RANDOM VARIABLE 0.7 0.525 Probability 0.35 0.175 0 0 1 2 3 5 6 7 8 9 10 X Event X P(X) XP(X) Heart (not ace) 1 12 52 Ace 5 4 52 King of spades 10 1 52 12 52 20 52 10 52 All else 0 35 52 0 Total E(X) = 42 52 0.81

VARIANCE OF A RANDOM VARIABLE

VARIANCE OF A RANDOM VARIABLE The variable in winnings

VARIANCE OF A RANDOM VARIABLE The variable in winnings

VARIANCE OF A RANDOM VARIABLE The variable in winnings

VARIANCE OF A RANDOM VARIABLE The variable in winnings

VARIANCE OF A RANDOM VARIABLE The variable in winnings

LINEAR COMBINATIONS a linear combination of random variables X and Y is ax + by such as, 3X + 5Y 0.23 X - 32Y What is E(aX + by)=?

LINEAR COMBINATIONS a linear combination of random variables X and Y is ax + by such as, 3X + 5Y 0.23 X - 32Y What is E(aX + by)=? E(aX + by )=ae(x)+be(y )

LINEAR COMBINATIONS The variability of linear combinations of two independent random variables

LINEAR COMBINATIONS The variability of linear combinations of two independent random variables Var(aX + by )=a 2 Var(X)+b 2 Var(Y )

EXERCISE A casino game costs $5 to play. If you draw first a red card, then you get to draw a second card. If the second card is the ace of hearts, you win $500. If not, you don t win anything, i.e. lose your $5. What is your expected profits/losses from playing this game?

EXERCISE A casino game costs $5 to play. If you draw first a red card, then you get to draw a second card. If the second card is the ace of hearts, you win $500. If not, you don t win anything, i.e. lose your $5. What is your expected profits/losses from playing this game?

EXERCISE A casino game costs $5 to play. If you draw first a red card, then you get to draw a second card. If the second card is the ace of hearts, you win $500. If not, you don t win anything, i.e. lose your $5. What is your expected profits/losses from playing this game?

EXERCISE A casino game costs $5 to play. If you draw first a red card, then you get to draw a second card. If the second card is the ace of hearts, you win $500. If not, you don t win anything, i.e. lose your $5. What is your expected profits/losses from playing this game?

EXERCISE A casino game costs $5 to play. If you draw first a red card, then you get to draw a second card. If the second card is the ace of hearts, you win $500. If not, you don t win anything, i.e. lose your $5. What is your expected profits/losses from playing this game?

SIMPLIFYING RANDOM VARIABLES

OVERVIEW Probability Random Variables Continuous Distributions MOST OF THE SLIDES ADOPTED FROM OPENINTRO STATS BOOK.

CONTINOUS DISTRIBUTIONS Below is a histogram of the distribution of heights of US adults. The proportion of data that falls in the shaded bins gives the probability that a randomly sampled US adult is between 180 cm and 185 cm (about 5 11 to 6 1 ).

FROM HISTOGRAMS TO CONTINUOUS DISTRIBUTIONS Since height is a continuous numerical variable, its probability density function is a smooth curve.

FROM HISTOGRAMS TO CONTINUOUS DISTRIBUTIONS Therefore, the probability that a randomly sampled US adult is between 180 cm and 185 cm can also be estimated as the shaded area under the curve.

FROM HISTOGRAMS TO CONTINUOUS DISTRIBUTIONS Therefore, the probability that a randomly sampled US adult is between 180 cm and 185 cm can also be estimated as the shaded area under the curve.

FROM HISTOGRAMS TO CONTINUOUS DISTRIBUTIONS Since continuous probabilities are estimated as the area under the curve, the probability of a person being exactly 180 cm (or any exact value) is defined as 0.