Hypergeometric Probability Distribution

Similar documents
Section 6.5 Conditional Probability

Important Distributions 7/17/2006

Unit 9: Probability Assignments

Section : Combinations and Permutations

Week in Review #5 ( , 3.1)

Math 1313 Conditional Probability. Basic Information

Review Questions on Ch4 and Ch5

Key Concepts. Theoretical Probability. Terminology. Lesson 11-1

Probabilities and Probability Distributions

Math 58. Rumbos Fall Solutions to Exam Give thorough answers to the following questions:

Spring 2016 Math 54 Test #2 Name: Write your work neatly. You may use TI calculator and formula sheet. Total points: 103

Lenarz Math 102 Practice Exam # 3 Name: 1. A 10-sided die is rolled 100 times with the following results:

Unit 1B-Modelling with Statistics. By: Niha, Julia, Jankhna, and Prerana

Intermediate Math Circles November 1, 2017 Probability I. Problem Set Solutions

Math 447 Test 1 February 25, Spring 2016

Geometric Distribution

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Math Exam 2 Review. NOTE: For reviews of the other sections on Exam 2, refer to the first page of WIR #4 and #5.

Math Exam 2 Review. NOTE: For reviews of the other sections on Exam 2, refer to the first page of WIR #4 and #5.

1) If P(E) is the probability that an event will occur, then which of the following is true? (1) 0 P(E) 1 (3) 0 P(E) 1 (2) 0 P(E) 1 (4) 0 P(E) 1

INDEPENDENT AND DEPENDENT EVENTS UNIT 6: PROBABILITY DAY 2

McGraw Hill Ryerson Data Management 12. Comparing and Selecting Discrete Probability Distributions

Exam III Review Problems

1 2-step and other basic conditional probability problems

Name Class Date. Introducing Probability Distributions

Class XII Chapter 13 Probability Maths. Exercise 13.1

Discrete probability and the laws of chance

Lesson Sampling Distribution of Differences of Two Proportions

Independence Is The Word

Determine whether the given events are disjoint. 4) Being over 30 and being in college 4) A) No B) Yes

MAT 1272 STATISTICS LESSON STATISTICS AND TYPES OF STATISTICS

Describe the variable as Categorical or Quantitative. If quantitative, is it discrete or continuous?

CSE 312: Foundations of Computing II Quiz Section #2: Inclusion-Exclusion, Pigeonhole, Introduction to Probability

Gathering information about an entire population often costs too much or is virtually impossible.

, x {1, 2, k}, where k > 0. (a) Write down P(X = 2). (1) (b) Show that k = 3. (4) Find E(X). (2) (Total 7 marks)

Probability and Randomness. Day 1

Probability: Anticipating Patterns

Contents 2.1 Basic Concepts of Probability Methods of Assigning Probabilities Principle of Counting - Permutation and Combination 39

Chapter 11: Probability and Counting Techniques

Basic Practice of Statistics 7th

Math 1313 Section 6.5 Section 6.5: Conditional Probability

Fundamentals of Probability

Raise your hand if you rode a bus within the past month. Record the number of raised hands.

10.2.notebook. February 24, A standard deck of 52 playing cards has 4 suits with 13 different cards in each suit.

Case 1: If Denver is the first city visited, then the outcome looks like: ( D ).

MAT104: Fundamentals of Mathematics II Summary of Counting Techniques and Probability. Preliminary Concepts, Formulas, and Terminology

Probability of Independent and Dependent Events. CCM2 Unit 6: Probability

Introduction to probability

Mr. Orchard s Math 141 WIR Test 3 Review Week 11

Independent and Mutually Exclusive Events

heads 1/2 1/6 roll a die sum on 2 dice 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 1, 2, 3, 4, 5, 6 heads tails 3/36 = 1/12 toss a coin trial: an occurrence

3 The multiplication rule/miscellaneous counting problems

Math March 12, Test 2 Solutions

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Module 4 Project Maths Development Team Draft (Version 2)

Assignment 4: Permutations and Combinations

North Seattle Community College Winter ELEMENTARY STATISTICS 2617 MATH Section 05, Practice Questions for Test 2 Chapter 3 and 4

For question 1 n = 5, we let the random variable (Y) represent the number out of 5 who get a heart attack, p =.3, q =.7 5

, -the of all of a probability experiment. consists of outcomes. (b) List the elements of the event consisting of a number that is greater than 4.

Problem Set 2. Counting

Probability Distributions. Probability Distributions. J. Boulton. May 08, 2013 MDM 4U1. Where are we?

1 2-step and other basic conditional probability problems

Probability. Ms. Weinstein Probability & Statistics

Distribution of Aces Among Dealt Hands

Sec 4.4. Counting Rules. Bluman, Chapter 4

CSE 312: Foundations of Computing II Quiz Section #2: Inclusion-Exclusion, Pigeonhole, Introduction to Probability (solutions)

MTH 103 H Final Exam. 1. I study and I pass the course is an example of a. (a) conjunction (b) disjunction. (c) conditional (d) connective

Chapter 5 - Elementary Probability Theory

MAT Midterm Review

1. An office building contains 27 floors and has 37 offices on each floor. How many offices are in the building?

Discrete Random Variables Day 1

Such a description is the basis for a probability model. Here is the basic vocabulary we use.

Math 1313 Section 6.2 Definition of Probability

MATH CALCULUS & STATISTICS/BUSN - PRACTICE EXAM #2 - FALL DR. DAVID BRIDGE

Algebra 2 Notes Section 10.1: Apply the Counting Principle and Permutations

Math 227 Elementary Statistics. Bluman 5 th edition

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

Section 5.4 Permutations and Combinations

Permutations and Combinations

RANDOM EXPERIMENTS AND EVENTS

Probability and Counting Techniques

Statistics 101: Section L Laboratory 10

Section 5.4 Permutations and Combinations

Midterm (Sample Version 3, with Solutions)

Well, there are 6 possible pairs: AB, AC, AD, BC, BD, and CD. This is the binomial coefficient s job. The answer we want is abbreviated ( 4

Week 3 Classical Probability, Part I

2. Combinatorics: the systematic study of counting. The Basic Principle of Counting (BPC)

Probability. The Bag Model

Probability Distributions for Discrete Variables. a) Four of the 52 cards in a standard deck are kings. The probability of selecting a king is

Basic Probability Ideas. Experiment - a situation involving chance or probability that leads to results called outcomes.

Probability Homework

MATH , Summer I Homework - 05

Basic Probability & Statistics Exam 2 { Part I { Sections (Chapter 4, Chapter 5) March 19, 2009

Key Concept Probability of Independent Events. Key Concept Probability of Mutually Exclusive Events. Key Concept Probability of Overlapping Events

Intermediate Math Circles November 1, 2017 Probability I

Chapter 3: Probability (Part 1)

Math 4610, Problems to be Worked in Class

3 The multiplication rule/miscellaneous counting problems

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. B) Blood type Frequency

Nagel Langdons Gender Pay Gap Statement

Transcription:

Hypergeometric Probability Distribution Example problem: Suppose 30 people have been summoned for jury selection, and that 12 people will be chosen entirely at random (not how the real process works!). Also, suppose that there are 17 candidates that are less than 40 years old, and 13 candidates that are at least 40 years old. What is the probability that exactly 5 of the candidates chosen are less than 40 years old? : We must choose 5 younger candidates from the 17 available, and 7 older candidates from the 13 available. This will be a total of 12 jurors. Since each person is equally likely to be chosen, the probability that exactly 5 younger candidates are chosen is The Distribution This is an example of the hypergeometric distribution: ( 17 P(5) = (13 7 ) ( 30 12 ) = 28.9% there are n possible outcomes. This is sometimes called the population size. there are a outcomes which are classified as successes (and therefore n a failures ) there are r trials. This is sometimes called the sample size. the trials are dependent the random variable X measures the number of successes Then the probability that exactly k successes occur in the r trials is In the jury example above, we have the following parameters: n = 30 a = 17 r = 12 k = 5

Expected Value The expected value for a hypergeometric distribution is the number of trials multiplied by the proportion of the population that is successes: Example 1: Drawing 2 Face Cards Suppose you draw 5 cards from a standard, shuffled deck of 52 cards. What is the probability that you draw exactly 2 face cards? What is the expected number of face cards? This is a hypergeometric distribution with the following values: n = 52 possible outcomes a = 12 successes (face cards) r = 5 trials (cards drawn) k = 2 required successes (face cards) ( 12 12 ) (52 2 5 2 ) ( 52 ( 12 2 ) (40 3 ) ( 52 P(X = 2) 0.2509 The probability of getting exactly two face cards is about 25%. The expected number of face cards in a hand of 5 cards is = 5 12 52 = 15 13 = 1.15

Example 2: Gender split for hiring There are 85 people who interview for 4 data science positions at a prestigious company. The company ranks the applicants and decides there are 12 candidates who are all equally suited for the 4 positions. Since they are concerned about unfair biases in the selection process, the company decides to choose 4 people at random from the 12 best candidates. 7 of the 12 candidates identify as female, and 5 of the 12 candidates identify as male. What is the probability that there will be exactly 2 people hired who identify with each gender? What is the expected number hired by gender? This is a hypergeometric distribution with the following values: n = 12 (total number of candidates) a = 7 (candidates identifying as female) r = 4 (required number of candidates) k = 2 (required number of candidates identifying as female) Note that we have arbitrarily selected candidates who identify as female as the success category; the same procedure works by selecting candidates who identify as male as the success category. ( 7 2 ) (12 7 4 2 ) ( 12 4 ) ( 7 2 ) (5 2 ) ( 12 4 ) P(X = 2) 0.424 There is about a 42.4% probability that there will be 2 candidates hired who identify with each gender. = 4 7 12 = 7 3 2.33 The expected number of candidates hired who identify as female is about 2.33, and the expected number of candidates hired who identify as male is about 4 2.33 = 1.67.

Example 3: M:tG Suppose you have a Magic: the Gathering deck of 60 cards, of which 22 are lands and 38 are non-lands. Make a probability distribution table for lands drawn in the opening hand of 7 cards. Use the table to calculate the probability of drawing 2 or 3 lands in the opening hand. This is a hypergeometric distribution, with the following values (counting land cards as successes): n = 60 (total number of cards) a = 22 (land cards) r = 7 (cards drawn) We need to calculate P(X = k) for each k {0,1,2,,7}. k P(X = k) 0 3.27% 1 15.73% 2 30.02% 3 29.43% 4 15.98% 5 4.79% 6 0.73% 7 0.04% The probability of drawing an opening hand with 2 or 3 land cards is about 30.02% + 29.43% = 59.45%. Example 4: k can t exceed r There are 15 players available for a tournament. 5 players will be selected at random to participate. Only 3 players are experienced players. What is the probability that at least 2 players selected will be experienced? Even though there are 5 trials (r = 5) it is not possible to have more than 3 successes (since a = 3). P(X 2) = P(X = 2) + P(X = 3) = = ( 3 2 ) (15 3 5 2 ) ( 15 + ( 3 2 ) (12 3 ) ( 15 + = 660 3003 + 66 3003 ( 3 3 ) (15 3 5 3 ) ( 15 ( 3 3 ) (12 2 ) ( 15

= 22 91 = 24.2% It s not possible to have k = 4, for example. If you were to try to substitute the value k = 4 into the probability formula, you would have the combination ( 3 ), which has no value. Instead, we say that P(X = 4) = 0 in this 4 situation. Hypergeometric Distribution vs. Binomial Distribution Both distributions have a fixed number of trials. The trials in hypergeometric distributions are dependent, and the trials in binomial distributions are independent. In a hypergeometric distribution there is a fixed number of possible successes available, and they re used up as trials occur. In a binomial distribution the successes are not used up. We sometimes call this drawing without replacement (hypergeometric) versus with replacement (binomial).