APPENDIX 2.3: RULES OF PROBABILITY

Similar documents
CHAPTER 6 PROBABILITY. Chapter 5 introduced the concepts of z scores and the normal curve. This chapter takes

Such a description is the basis for a probability model. Here is the basic vocabulary we use.

Grade 6 Math Circles Fall Oct 14/15 Probability

Grade 7/8 Math Circles February 25/26, Probability

Simple Probability. Arthur White. 28th September 2016

Probably About Probability p <.05. Probability. What Is Probability? Probability of Events. Greg C Elvers

1MA01: Probability. Sinéad Ryan. November 12, 2013 TCD

Chapter 5: Probability: What are the Chances? Section 5.2 Probability Rules

Probability. The MEnTe Program Math Enrichment through Technology. Title V East Los Angeles College

Define and Diagram Outcomes (Subsets) of the Sample Space (Universal Set)

c. If you roll the die six times what are your chances of getting at least one d. roll.

INDEPENDENT AND DEPENDENT EVENTS UNIT 6: PROBABILITY DAY 2

(a) Suppose you flip a coin and roll a die. Are the events obtain a head and roll a 5 dependent or independent events?

CHAPTERS 14 & 15 PROBABILITY STAT 203

Chapter 4: Introduction to Probability

4.1 Sample Spaces and Events

Probability. Ms. Weinstein Probability & Statistics

Lenarz Math 102 Practice Exam # 3 Name: 1. A 10-sided die is rolled 100 times with the following results:

Classical vs. Empirical Probability Activity

ECON 214 Elements of Statistics for Economists

PROBABILITY. 1. Introduction. Candidates should able to:

November 6, Chapter 8: Probability: The Mathematics of Chance

STANDARD COMPETENCY : 1. To use the statistics rules, the rules of counting, and the characteristic of probability in problem solving.

Topic : ADDITION OF PROBABILITIES (MUTUALLY EXCLUSIVE EVENTS) TIME : 4 X 45 minutes

Chapter 4 Student Lecture Notes 4-1

Intermediate Math Circles November 1, 2017 Probability I

More Probability: Poker Hands and some issues in Counting

Key Concepts. Theoretical Probability. Terminology. Lesson 11-1

Chapter 1: Sets and Probability

November 11, Chapter 8: Probability: The Mathematics of Chance

Probability is often written as a simplified fraction, but it can also be written as a decimal or percent.

Probability of Independent and Dependent Events. CCM2 Unit 6: Probability

ABE/ASE Standards Mathematics

Review. Natural Numbers: Whole Numbers: Integers: Rational Numbers: Outline Sec Comparing Rational Numbers

When combined events A and B are independent:

7.1 Experiments, Sample Spaces, and Events

I. WHAT IS PROBABILITY?

Chapter 1. Probability

Counting Methods and Probability

Math 14 Lecture Notes Ch. 3.3

0-5 Adding Probabilities. 1. CARNIVAL GAMES A spinner has sections of equal size. The table shows the results of several spins.

Name: Class: Date: 6. An event occurs, on average, every 6 out of 17 times during a simulation. The experimental probability of this event is 11

Unit 11 Probability. Round 1 Round 2 Round 3 Round 4

Section 6.5 Conditional Probability

Textbook: pp Chapter 2: Probability Concepts and Applications

Independent and Mutually Exclusive Events

Objective 1: Simple Probability

Developed by Rashmi Kathuria. She can be reached at

Def: The intersection of A and B is the set of all elements common to both set A and set B

Probability. The Bag Model

Grade 8 Math Assignment: Probability

Statistics Intermediate Probability

ABC High School, Kathmandu, Nepal. Topic : Probability

Example 1. An urn contains 100 marbles: 60 blue marbles and 40 red marbles. A marble is drawn from the urn, what is the probability that the marble

North Seattle Community College Winter ELEMENTARY STATISTICS 2617 MATH Section 05, Practice Questions for Test 2 Chapter 3 and 4

"Well, statistically speaking, you are for more likely to have an accident at an intersection, so I just make sure that I spend less time there.

Exam III Review Problems

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Outcomes: The outcomes of this experiment are yellow, blue, red and green.

Introduction to Probability and Statistics I Lecture 7 and 8

7 5 Compound Events. March 23, Alg2 7.5B Notes on Monday.notebook

Empirical (or statistical) probability) is based on. The empirical probability of an event E is the frequency of event E.

Objectives. Determine whether events are independent or dependent. Find the probability of independent and dependent events.

If you roll a die, what is the probability you get a four OR a five? What is the General Education Statistics

Before giving a formal definition of probability, we explain some terms related to probability.

Unit 6: Probability. Marius Ionescu 10/06/2011. Marius Ionescu () Unit 6: Probability 10/06/ / 22

Normal Distribution Lecture Notes Continued

Unit 6: Probability. Marius Ionescu 10/06/2011. Marius Ionescu () Unit 6: Probability 10/06/ / 22

Probability (Devore Chapter Two)

Contents 2.1 Basic Concepts of Probability Methods of Assigning Probabilities Principle of Counting - Permutation and Combination 39

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except on problems 1 & 2. Work neatly.

INTRODUCTORY STATISTICS LECTURE 4 PROBABILITY

CHAPTER 2 PROBABILITY. 2.1 Sample Space. 2.2 Events

Simulations. 1 The Concept

Business Statistics. Chapter 4 Using Probability and Probability Distributions QMIS 120. Dr. Mohammad Zainal

The Teachers Circle Mar. 20, 2012 HOW TO GAMBLE IF YOU MUST (I ll bet you $5 that if you give me $10, I ll give you $20.)

Chapter 4: Probability and Counting Rules

Unit 1 Day 1: Sample Spaces and Subsets. Define: Sample Space. Define: Intersection of two sets (A B) Define: Union of two sets (A B)

Week 3 Classical Probability, Part I

Probability 1. Joseph Spring School of Computer Science. SSP and Probability

Laboratory 1: Uncertainty Analysis

Probability - Chapter 4

Chapter 1. Probability

Chapter 6: Probability and Simulation. The study of randomness

Chapter 5 - Elementary Probability Theory

Raise your hand if you rode a bus within the past month. Record the number of raised hands.

7.1 Chance Surprises, 7.2 Predicting the Future in an Uncertain World, 7.4 Down for the Count

Stat 20: Intro to Probability and Statistics

Theory of Probability - Brett Bernstein

STATISTICS and PROBABILITY GRADE 6

Chapter 2. Permutations and Combinations

Conditional Probability Worksheet

Statistics, Probability and Noise

Conditional Probability Worksheet

Independence Is The Word

Probability, Continued

Class XII Chapter 13 Probability Maths. Exercise 13.1

Probability MAT230. Fall Discrete Mathematics. MAT230 (Discrete Math) Probability Fall / 37

Mathematics 'A' level Module MS1: Statistics 1. Probability. The aims of this lesson are to enable you to. calculate and understand probability

Chapter 3: Elements of Chance: Probability Methods

Transcription:

The frequentist notion of probability is quite simple and intuitive. Here, we ll describe some rules that govern how probabilities are combined. Not all of these rules will be relevant to the rest of this book. However, describing these will help to make sure that we are using the concepts of probability correctly as we move on to more advanced topics. We will begin with some notation. We can denote the probability of a flipped coin coming up heads as p(heads) =.5 and the probability of it coming up tails as p(tails) =.5. Or we can say that the probability of a rolled die coming up 1 is p(1) =.1667 and the probability of it coming up 3 is p(3) =.1667. However, we want to think about the general case of outcomes and events, not just those associated with coin flips or die rolls. Therefore, we will use letters to define arbitrary events. For example, we can use A, B, and C to denote three different events, no matter what variable we might be considering. The OR Rule for Mutually Exclusive Events: p(a or B) = p(a) + p(b) A critical concept for us is the probability of A or B occurring. We ve seen this question before, but now we can provide a bit more detail about how this is computed and what assumptions must be true for our calculation to be valid. Events are mutually exclusive if they cannot co-occur. For example, a flipped coin can come up heads or tails, but not both. Therefore, the possible outcomes of a coin flip are mutually exclusive. Similarly, a rolled die can be one, and only one, of the following: 1, 2, 3, 4, 5, or 6. Therefore, these are mutually exclusive events. When we draw cards from a deck, the four suits are mutually exclusive. A drawn card can be a heart, but it can t simultaneously be a spade. When events A and B are mutually exclusive, the probability of A or B occurring is the sum of their separate probabilities: p(a or B) = p(a) + p(b). (2.A3.1) For example, if A and B are heads and tails, respectively, then the probability of a flipped coin being either a head (A) or a tail (B) is p(a or B) = p(a) + p(b) =.5 +.5 = 1. If we consider the role of a die, and A and B are 4 and 6, respectively, then the probability of a rolled die coming up 4 or 6 is APPENDIX 2.3: RULES OF PROBABILITY 1 1 p( Aor B) = pa ( ) + pb ( ) = + =. 33. 6 6 Or if we consider the role of a die and A, B, and C are 1, 4, and 6, respectively, then the probability of a rolled die coming up 1 or 4 or 6 is 1 1 1 p( Aor Bor C) = p( A) + p( B) + p( C) = + + =.. 5 6 6 6 The OR rule is the most important rule of probability for much of what follows in subsequent chapters. The AND Rule for Independent Events: p(a and B) = p(a)p(b) Two events (or outcomes) are independent if the occurrence of one does not affect the probability that the other will occur. For example, if two coins are flipped, the outcomes are independent. In other words, if one coin comes up heads, it has no effect on whether the other coin will come up heads. Or if the same coin is flipped twice, coming up heads on the first flip has no effect on the probability of it coming up heads on the second flip. Each time a coin is flipped, the outcome is independent of the outcomes of all previous flips. When events A and B are independent, the probability of A and B occurring is the product of their separate probabilities: p(a and B) = p(a) p(b). (2.A3.2) For example, if A and B are heads and tails, respectively, then the probability of flipping a coin twice and getting a head (A) on the first flip and a head (B) on the second flip is p( Aand B) = papb ( ) ( ) = (. )(. ).. 1 1 2 = 5 5 2 = 25 Notice that each of the following events has the same probability of occurrence: (head and tail), (head and head), (tail and tail), and (tail and head). These are the four possible outcomes for two flips of a coin, and each has a probability of.25. The sum of these four probabilities is 1, because no other outcomes are possible. In this example, we ve considered two successive flips of the same coin, but the result would be exactly the same if we considered flipping two coins simultaneously. The OR Rule for Events That Are Not Mutually Exclusive: p(a or B) = p(a) + p(b) - p(a)p(b) Some events are not mutually exclusive. For example, a card drawn from a deck can be both a Heart and a 1

2 Statistics for Research in Psychology King. A student can be both female and in psychology. A person can be both anxious and depressed. When events are not mutually exclusive, the OR rule is modified as follows: p(a or B) = p(a) + p(b) - p(a) p(b). (2.A3.3) Equation 2.A3.3 differs from equation 2.A3.1 only in the last term, p(a)p(b), which denotes the probability of both A and B occurring. Let s consider drawing a card from a 52-card deck that has four suits (Clubs, Spades, Hearts, and Diamonds) and 13 ranks (Ace, 2, 3, 4, 5, 6, 7, 8, 9, 10, Jack, Queen, and King). If event A is drawing a red card and event B is drawing a King, then we can ask about the probability of A or B. These events are not mutually exclusive. If you draw a red card, it could be a King. Conversely, if you draw a King, it could be red. Figure 2.A3.1 shows a full deck of playing cards to help us think about this question. The bottom two rows show all the red cards; diamonds and hearts. These represent half the deck, so the probability of drawing a red card is p(a) =.5. The right column shows the four Kings. Because four of the 52 cards are Kings, the probability of drawing a King is p(a) = 4/52 = 1/13 =.07692. Because A and B are not mutually exclusive, we have to take into account the probability that a card is both red and a King. The probability of being red and being a King is p( ApB ) ( ) =.. 1 1 2 = 1 13 26 = 03846 Another way to say this is that red Kings compose 1/26th of the deck. Equation 2.A3.3 tells us that we should do the following to calculate the probability of drawing a card that is red or a King: p( Aor B) = pa ( ) + pb ( ) papb ( ) ( ) 1 1 = + 1 1 2 13 2 = 1 13 2 + 1 13 1 26 =. 5 +. 07692. 03846 =. 53846. We can confirm that this is the correct answer by counting the number of cards that satisfy our two constraints of being red or being a King. There are 26 red cards, including the red Kings. When we add in the two black Kings, we now have 28 cards altogether. Therefore, the proportion of cards that satisfy conditions A or B is 28/52 = 7/13 =.53846. We can now see that subtracting the third term in equation 2.A3.3, p(a)p(b), from the first two serves to prevent red Kings from being counted twice. The AND Rule for Dependent Events: p(a and B) = p(a)p(b A) Not all events are independent; some are dependent. To understand dependence, let s first think about independent events. Let s say we draw a card from a shuffled deck, put it back in, reshuffle, and then draw again. This is called sampling with replacement. What is the probability of drawing two aces in two successive draws when sampling with replacement? Well, there are two events (A = drawing an Ace on the first draw, B = drawing an Ace on the second draw). The probability of A is p(a) = 1/13 and the probability of B is p(b) = 1/13. Therefore, using the AND rule (from equation 2.A3.2), we find that the probability of A and B is p(a and B) = p(a)p(b) = 1/(13 * 13) = 1/169 =.00592. Now, let s change the example slightly and imagine drawing two cards without replacing the first one before the second one is drawn. This is called sampling without replacement. What is the probability now of drawing two aces? If an Ace had been drawn on the first draw, then the probability of an Ace on the second draw has changed. If an Ace was the first card drawn, then only 51 cards remain and only three of these are aces. Therefore, the probability of drawing an Ace on the second draw depends on whether an Ace was drawn on the first draw. Therefore, we can t use equation 2.A3.2. Rather, we use equation 2.A3.4 as follows: p(a and B) = p(a) p(b A). (2.A3.4) The term p(b A) should be read as the probability of event B occurring, given that event A has occurred. In our example, this means the probability of drawing an Ace on the second draw, given that an Ace was drawn on the first draw. We call p(b A) a conditional probability. 1 Because there are four aces in the deck, the probability of the first card drawn being an Ace is p(a) = 4/52 = 1/13 =.07692. As we noted, if the first card drawn was an Ace, then there are only three aces in the remaining 51 cards. So, when the second card is drawn, the probability of drawing an Ace is only p(b A) = 3/51 1. Please note, we will return to the important issue of conditional probabilities in Chapter 7, where we discuss significance tests. If you hear that a result is statistically significant, this means someone has conducted a significance test. You may be surprised to learn that psychologists are often harshly criticized for misinterpreting the results of significances tests. Many of these misinterpretations arise from not understanding the concept of conditional probability. Therefore, conditional probability is not a minor concept. It is hugely important for the correct interpretation of significance tests. See you in Chapter 7.

Chapter 2 Online Appendices 3 FIGURE 2.A3.1 A Deck of Playing Cards There are four suits (Spades, Clubs, Diamonds, and Hearts) and 13 ranks (Ace, 2, 3, 4, 5, 6, 7, 8, 9, 10, Jack, Queen, and King). istock.com/imannaggia = 1/17 =.05882. If we now work through equation 2.A3.4, we will find that p( Aand B) = papb ( ) ( A) = 1 3 13 51 3 = =. 00452. 663 So, the probability of drawing two aces is greater if we draw with replacement than if we draw without replacement. Another way to say this is that the probability of drawing two aces is greater when the draws are independent versus dependent. LEARNING CHECK 1 1. What is the probability that a card drawn from a 52-card deck will be an 8 or a 9? 2. What is the probability that in two independent draws from a 52-card deck, the first card will be an 8 and the second card will be a 9? 3. What is the probability that a card drawn from a 52-card deck will be an 8 or red? 4. What is the probability that in two successive draws from a 52-card deck, the first card will be an 8 and the second will be a 9 when sampling is without replacement? Answers 1. p = p(8) + p(9) = 4/52 + 4/52 = 8/52 = 2/13 =.1538. 2. p = p(8) * p(9) = 4/52 * 4/52 = 1/13 * 1/13 = 1/169 =.0059. 3. p = p(8) + p(red) - p(8)p(red) = 1/13-1/2 - (1/2 * 1/13) =.5385. 4. p = p(8) * p(9 8) = 1/13 * 4/51 =.0769 *.0784 =.006.

4 Statistics for Research in Psychology APPENDIX 2.4: PROBABILITY DENSITY FUNCTIONS Functions You probably encountered functions in high school mathematics. If not, then you almost certainly recognize this: y = x 2. This is the square function. Functions are like black boxes. You put a number in, and you get a number out. For this reason, it s common to express functions like this: y = f(x). The f means function, x is the input, and y is the output. Something goes on inside the black box called f, and a number pops out, which we call y. In the case of the square function, you put in some number x, and you get out the square of the number, which we call y. The defining feature of a function is that there is a single y value for every possible x value. Therefore, y is said to be a function of x. Probability density functions are functions for this reason; there is a single y value for each x value, as shown in Figure 2.4. But what is the y value in Figure 2.4? Density The term density should be familiar. When we talk about population density, for example, we mean the number of people per square mile or square kilometer. Population density is greater in cities than in rural areas. Density usually refers to the number of things (people, trees, worms, neurons) per unit measure (square mile, acre, cubic foot, cubic millimeter). In a grouped frequency table (e.g., Table 2.8), we can think of the number of scores per interval as density. The more scores per interval, the greater the density. So, the raw frequency counts tell us something about the density of scores in an interval. The notion of density is more abstract for mathematicians and statisticians. It differs from the usual notion of density in that it is defined at a point rather than for some width, area, or volume. How can density be defined at a point? Let s start by thinking about a traffic jam that stretches for 5 miles, or 8.05 kilometers. Cars are packed bumper to bumper, so the density of cars is the same at each point along the highway. If you count the number of cars in a 1-kilometer stretch (interval), you might find that there are 400 cars in this interval. So, the density is 400 per kilometer. If you count the number of cars in a half-kilometer interval, you would find 200 per half kilometer. Now, 400 per kilometer is the same density as 200 per half kilometer, and it is also the same as 100 per quarter kilometer. All of these measures of density can be put on the same scale by dividing the number of cars in an interval by the interval width. The interval widths for this example are 1 kilometer,.5 kilometers, and.25 kilometers. So, if we divide the counts (400, 200, and 100) by the corresponding interval widths, we obtain 400/1 = 400, 200/.5 = 400, and 100/.25 = 400. In this way, density can be computed independently of interval width. So, how does this relate to specifying density at a point? We will now return to the distribution of heights that we discussed in Chapter 2. Figure 2.A4.1 shows histograms of 1,000,000 heights drawn from a known distribution. The widths of the intervals decrease from 5.33 inches (Figure 2.A4.1a) to.67 inches (Figure 2.A4.1f). As interval width decreases, fewer scores fall in each interval. Therefore, the heights of the histogram bars decrease as interval width decreases. In our traffic jam example, we noted that density involves dividing the number or proportion of scores in each interval by the interval width. This has been done in Figure 2.A4.2, in which the bar heights (p = n/n) from Figure 2.A4.1 are divided by the interval width (p/ width) to yield density. As interval width decreases, the tops of the histogram bars become indistinguishable from the solid line, which represents the probability density function of the distribution from which the scores were drawn. Let s now think of a theoretical population with an infinite number of scores, rather than just 1,000,000. As interval width becomes smaller and smaller, two things happen. First, density converges to a single unambiguous value. (To see this, think about the histogram bar centered on 65 in Figures 2.A4.2a through 2.A4.2f.) Second, in the limit, the width becomes zero. This means that (i) density can be defined at a point and (ii) the probability of any specific score actually occurring is 0. The result is the continuous line (function) that defines a y value (density) for each x value. We call this a probability density function. It might seem like a bit of a paradox that as interval width decreases, the density of scores in a small region of the distribution approaches a constant value, whereas the proportion of scores in each interval approaches zero. This is something we just have to live with. Probability Density So far, we ve seen the following things. The curve in Figure 2.4 is a function. The y values represent the abstract notion of density defined at a point. Density does not mean probability. So why is this called a probability density function? Let s see if we can answer this.

Chapter 2 Online Appendices 5 FIGURE 2.A4.1 Histograms of 1,000,000 Heights (a) 0.50 (b) Proportion = (n/n) 0.40 0.30 0.20 (c) 0.50 (d) Proportion = (n/n) 0.40 0.30 0.20 (e) 0.50 (f) Proportion = (n/n) 0.40 0.30 0.20 (a through f) Each panel shows a histogram of 1,000,000 heights. The interval widths range from 5.33 inches (a) to.67 inches (f). The y-axis represents the proportion of scores ( p = n/n) in each interval. As interval width decreases, fewer scores fall in each interval. In Chapter 2, we considered distributions defined by the categories of qualitative variables, discrete values of quantitative variables, and intervals of quantitative variables. Each of these categories or intervals was associated with a probability, and the sum of all these probabilities is 1. Something very similar is true of a probability density function. As interval widths get narrower, the number of intervals increases while the proportion of scores in each interval decreases. This means that no matter how narrow the intervals, the sum of the proportions in the intervals will be 1. So, here is another oddity for us. As the interval width approaches zero, the sum of the proportions associated with the intervals remains 1. At the same time, no matter how narrow the intervals are, some will contain more scores than others. This is another seeming paradox that we just have to live with. If you have taken a calculus course, you will recognize that I ve just described integration. Therefore, we can say that density functions are probability functions, because the area under the curve is 1. For this reason, the function in Figure 2.A4.2 (the curved line) is a probability function. If we compute the area under the curve between any two values of x, we obtain the probability that a randomly chosen score will fall in that interval. And that s all I have to say about that.

6 Statistics for Research in Psychology FIGURE 2.A4.2 An Illustration of Density (a) 0.12 Density = (n/n)/width 0.08 0.06 0.04 0.02 (c) 0.12 Density = (n/n)/width 0.08 0.06 0.04 0.02 (e) 0.12 Density = (n/n)/width 0.08 0.06 0.04 0.02 (a through f) Densities computed for 1,000,000 heights. The interval widths range from 5.33 inches (a) to.67 inches (f). The y-axis represents the proportion of scores (p = n/n) in each interval divided by the width of the interval p/width. The solid line is the mathematical density function associated with the distribution from which the scores were drawn. As the interval width approaches 0, the heights of the histogram bars increasingly resemble the continuous probability density function or pdf. (b) (d) (f)