Lesson Sampling Distribution of Differences of Two Proportions

Similar documents
Laboratory 1: Uncertainty Analysis

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. B) Blood type Frequency

Such a description is the basis for a probability model. Here is the basic vocabulary we use.

FALL 2015 STA 2023 INTRODUCTORY STATISTICS-1 PROJECT INSTRUCTOR: VENKATESWARA RAO MUDUNURU

2.2 More on Normal Distributions and Standard Normal Calculations

Math 58. Rumbos Fall Solutions to Exam Give thorough answers to the following questions:

Math Exam 2 Review. NOTE: For reviews of the other sections on Exam 2, refer to the first page of WIR #4 and #5.

Math Exam 2 Review. NOTE: For reviews of the other sections on Exam 2, refer to the first page of WIR #4 and #5.

Sampling Terminology. all possible entities (known or unknown) of a group being studied. MKT 450. MARKETING TOOLS Buyer Behavior and Market Analysis

Chapter 5: Probability: What are the Chances? Section 5.2 Probability Rules

Session 5 Variation About the Mean

Example 1. An urn contains 100 marbles: 60 blue marbles and 40 red marbles. A marble is drawn from the urn, what is the probability that the marble

4 th Grade Mathematics Learning Targets By Unit

Spring 2016 Math 54 Test #2 Name: Write your work neatly. You may use TI calculator and formula sheet. Total points: 103

Section 1.5 Graphs and Describing Distributions

6.1 (CD-ROM TOPIC) USING THE STANDARDIZED NORMAL DISTRIBUTION TABLE

GREATER CLARK COUNTY SCHOOLS PACING GUIDE. Algebra I MATHEMATICS G R E A T E R C L A R K C O U N T Y S C H O O L S

CHAPTER 6 PROBABILITY. Chapter 5 introduced the concepts of z scores and the normal curve. This chapter takes

Independence Is The Word

Chapter 19. Inference about a Population Proportion. BPS - 5th Ed. Chapter 19 1

Real Numbers and the Number Line. Unit 1 Lesson 3

Spring 2017 Math 54 Test #2 Name:

Section 6.5 Conditional Probability

Hypergeometric Probability Distribution

Chapter 4. Displaying and Summarizing Quantitative Data. Copyright 2012, 2008, 2005 Pearson Education, Inc.

a) Getting 10 +/- 2 head in 20 tosses is the same probability as getting +/- heads in 320 tosses

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Chapter 20. Inference about a Population Proportion. BPS - 5th Ed. Chapter 19 1

NUMBERS & OPERATIONS. 1. Understand numbers, ways of representing numbers, relationships among numbers and number systems.

Exam III Review Problems

Raise your hand if you rode a bus within the past month. Record the number of raised hands.

Square Roots and the Pythagorean Theorem

MATH-1110 FINAL EXAM FALL 2010

Fundamentals of Probability

USE OF BASIC ELECTRONIC MEASURING INSTRUMENTS Part II, & ANALYSIS OF MEASUREMENT ERROR 1

Unit 11 Probability. Round 1 Round 2 Round 3 Round 4

INDEPENDENT AND DEPENDENT EVENTS UNIT 6: PROBABILITY DAY 2

SAMPLING DISTRIBUTION MODELS TODAY YOU WILL NEED: PENCIL SCRATCH PAPER A PARTNER (YOUR CHOICE) ONE THUMBTACK PER GROUP Z-SCORE CHART

Mathematics Success Grade 8

Probability and Statistics

Chapter 4. September 08, appstats 4B.notebook. Displaying Quantitative Data. Aug 4 9:13 AM. Aug 4 9:13 AM. Aug 27 10:16 PM.

C) 1 4. Find the indicated probability. 2) A die with 12 sides is rolled. What is the probability of rolling a number less than 11?

12.1 The Fundamental Counting Principle and Permutations

Univariate Descriptive Statistics

NSCAS - Math Table of Specifications

Please Turn Over Page 1 of 7

Grades 6 8 Innoventure Components That Meet Common Core Mathematics Standards

CALCULATING SQUARE ROOTS BY HAND By James D. Nickel

The rectangle above has been divided into squares. Assume that the length of each side of a small square is 1 cm.

Objective: Plot points, using them to draw lines in the plane, and describe

Statistics 101: Section L Laboratory 10

What is the expected number of rolls to get a Yahtzee?

11-1 Practice. Designing a Study

Math + 4 (Red) SEMESTER 1. { Pg. 1 } Unit 1: Whole Number Sense. Unit 2: Whole Number Operations. Unit 3: Applications of Operations

MAT Midterm Review

**Gettysburg Address Spotlight Task

Unit Nine Precalculus Practice Test Probability & Statistics. Name: Period: Date: NON-CALCULATOR SECTION

Statistical Hypothesis Testing

Residential Paint Survey: Report & Recommendations MCKENZIE-MOHR & ASSOCIATES

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.2- #

1. How to identify the sample space of a probability experiment and how to identify simple events

GREATER CLARK COUNTY SCHOOLS PACING GUIDE. Grade 4 Mathematics GREATER CLARK COUNTY SCHOOLS

GRADE 4. M : Solve division problems without remainders. M : Recall basic addition, subtraction, and multiplication facts.

MANIPULATIVE MATHEMATICS FOR STUDENTS

Mathematics (Project Maths)

Game Mechanics Minesweeper is a game in which the player must correctly deduce the positions of

7 th grade Math Standards Priority Standard (Bold) Supporting Standard (Regular)

Proportions. Chapter 19. Inference about a Proportion Simple Conditions. Inference about a Proportion Sampling Distribution

1 2-step and other basic conditional probability problems

Ten Calculator Activities- Teacher s Notes

Revised Elko County School District 2 nd Grade Math Learning Targets

North Seattle Community College Winter ELEMENTARY STATISTICS 2617 MATH Section 05, Practice Questions for Test 2 Chapter 3 and 4

Name Class Date. Introducing Probability Distributions

1 = 3 2 = 3 ( ) = = = 33( ) 98 = = =

It is important that you show your work. The total value of this test is 220 points.

MATRIX SAMPLING DESIGNS FOR THE YEAR2000 CENSUS. Alfredo Navarro and Richard A. Griffin l Alfredo Navarro, Bureau of the Census, Washington DC 20233

A natural number is called a perfect cube if it is the cube of some. some natural number.

Math 1313 Conditional Probability. Basic Information

An Overview of Mathematics 4

Lesson 11: Linear and Exponential Investigations

Review of Probability

Gathering information about an entire population often costs too much or is virtually impossible.

Math 247: Continuous Random Variables: The Uniform Distribution (Section 6.1) and The Normal Distribution (Section 6.2)

Exam #1. Good luck! Page 1 of 7

TJP TOP TIPS FOR IGCSE STATS & PROBABILITY

California 1 st Grade Standards / Excel Math Correlation by Lesson Number

Warm Up The following table lists the 50 states.

CHAPTER 2 PROBABILITY. 2.1 Sample Space. 2.2 Events

Chapter 1. Statistics. Individuals and Variables. Basic Practice of Statistics - 3rd Edition. Chapter 1 1. Picturing Distributions with Graphs

Confidence Intervals. Class 23. November 29, 2011

German Tanks: Exploring Sampling Distributions Name

Key Concepts. Theoretical Probability. Terminology. Lesson 11-1

Key Concept Probability of Independent Events. Key Concept Probability of Mutually Exclusive Events. Key Concept Probability of Overlapping Events

Probability. Ms. Weinstein Probability & Statistics

STAT 100 Fall 2014 Midterm 1 VERSION B

SAMPLE. This chapter deals with the construction and interpretation of box plots. At the end of this chapter you should be able to:

1. Let X be a continuous random variable such that its density function is 8 < k(x 2 +1), 0 <x<1 f(x) = 0, elsewhere.

MAT 1272 STATISTICS LESSON STATISTICS AND TYPES OF STATISTICS

This page intentionally left blank

Austin and Sara s Game

Transcription:

STATWAY STUDENT HANDOUT STUDENT NAME DATE INTRODUCTION The GPS software company, TeleNav, recently commissioned a study on proportions of people who text while they drive. The study suggests that there are differences in the texting-while-driving habits of men and women. 1 We will compare the proportions of male and female texters who text frequently while driving. Our primary tool for this comparison is subtraction. That sounds simple enough, but before we can really know when one of these differences between two proportions is significant, we must understand the nature of the distribution of such differences. This understanding allows us to apply the methods of inference we have learned margins of error, confidence intervals, and hypothesis tests to the comparisons between two population proportions. For any two distributions of sample proportions, the distribution of differences between sample proportions can be very large and difficult to picture. To ease the comprehension of what a distribution of differences looks like, we will turn to two very small populations of texting drivers. The first population consists of two males who text while driving and the second population consists of three females who text while driving. Within these populations, frequent texting while driving is defined to be at least seven texts while driving per week. Rarely texting is defined as less than seven texts per week while driving. These are represented as follows: F = frequently texting (at least seven texts while driving per week) R = rarely texting (less than seven texts while driving per week) From these small populations, we will collect sample proportions of drivers who text frequently. We will then create a distribution of differences between male and female proportions. This small distribution will help us find a way to quantify the center and spread of such distributions of differences. 1 TeleNav, Inc. (2010). TeleNav-commissioned survey suggests both genders have similar views on abiding by and breaking the rules of the road. Retrieved July 14, 2010, from www.telenav.com/about/driving-behavior.

STATWAY STUDENT HANDOUT 2 TRY THESE As mentioned, the first population we will consider consists of two men who text while driving, where one of these men texts frequently. The observations are {Frequent, Rare}. This is a small population of one frequent texter out of two, with a population proportion of 1/2, which is represented as π = ½ = 0.5. Now construct a sampling distribution of sample proportions from this population. 1 Treating this collection of two texting men as a small population, we will compute all sample proportions from samples of size n = 2. Sampling is done with replacement to preserve the independence of the trials (so some samples will include the same observation twice). In the following table, all possible samples are listed. For the case the frequent texter is chosen twice, the proportion is 2/2 = 1.0. For the case where the first pick is a frequent texter but the second is not, the proportion is 1/2 = 0.5. A Fill in the remaining sample proportions of men who text frequently, each of which are denoted as p m. Sample Drawn With Replacement Sample Proportion, B We have previously given formulas for the mean, standard error, and variance of a sampling distribution of sample proportions. These formulas require the population proportion of men

STATWAY STUDENT HANDOUT 3 who frequently text while driving, (π = ½ = 0.50) and the sample sizes (n = 2). Evaluate these formulas for the current sampling distribution. 2 Next, suppose we have a population of three women who text while driving and only one of these women texts frequently. These are represented as {Frequent, Rare, Rare} A What is the population proportion of frequent texters in the population of women? B Compute all sample proportions from samples of women texters of size n = 2. Denote the sample proportion of frequently texting women as p w. Sample Drawn With Replacement Sample Proportion, 2/2 = 1.0 1/2 = 0.5 0/2 = 0.0 1/2 = 0.50 0/2 = 0.0

Proportions of Women Who Text Frequently STATWAY STUDENT HANDOUT 4 C For this collection of proportions from the women drivers whose population proportion is π w = 1/3 = 0.333, compute the mean, standard error, and variance. Remember that the samples are all of size n = 2. 3 Our goal is to discover the nature of the distribution of all differences (p m p w ). To accomplish this, consider the difference of every value of p m minus every value of p w. A In the top row of the following table, all men s proportions (p m ) found in Question 1b are listed. In the left column, all women s proportions (p w ) found in Question 2b are listed. Most of the differences (p m p w ) are listed as well, but seven are missing. In any cell where no difference is given, record the value of p m p w, where p m is recorded at the top of the cell s column and p w is recorded at the left of the cell s row. Proportions of Men Who Text Frequently 0.00 0.50 0.50 0.00 0.50 0.50 0.00 0.00 0.50 0.50 0.00 0.00 0.50 0.50 0.50 1.00 0.50 0.50 0.00 0.00 0.00 1.00 0.50 0.50 0.00 1.00 0.50 0.50 0.00 The total number of differences is 9 4 = 36.

STATWAY STUDENT HANDOUT 5 B Fill in the frequency table with the frequencies and their corresponding probabilities as relative frequencies for the various values of p m p w. Record the probabilities as unreduced fractions whose denominators are each 36. The distribution of probabilities is the sampling distribution of differences. Frequency Probability 1.0 0.5 0.0 0.5 1.0 Total 36 36/36 = 1 C Sketch a histogram of the probability distribution of p m p w on the graph below. The Sampling Distribution of Differences, 12/36 10/36 8/36 6/36 4/36 2/36-1.0-0.5 0.0 0.5 1.0 This distribution gives probabilities for every possible difference between male proportions of frequent texters and female proportions of frequent texters. Is the distribution approximately symmetric? Is the distribution approximately bell-shaped?

STATWAY STUDENT HANDOUT 6 Recalling that the mean is a similar to a balancing point for a probability distribution, give a visual estimate of the mean. 4 Use technology to compute the mean and standard deviation of all differences summarized by the frequency table in Question 3 (or your instructor can provide these). A What is the mean of differences? Is this value close to the estimate made in Question 3c? B Compute the difference of the means of the men and women from Questions 1b and 2c. C Comparing your answers in Questions 4a and 4b, complete the statement below. The mean of differences in Question 4a is the of the means from Questions 1b and 2c. D Compute the variance of all differences (by squaring the standard deviation) summarized by the frequency table in Question 3b (or your instructor can provide this). E Compute the sum of the variances of men and women texters from Questions 1b and 2c. F Comparing your answers in Questions 4d and 4e, complete the following statement. The variance of differences in Question 4d is the of the variances from Questions 1b and 2c. G Compute the standard deviation (or error) of all differences of proportions by taking the square root of the variance from Question 4d. Round this answer to the nearest tenth.

STATWAY STUDENT HANDOUT 7 H Now that you have quantified the mean and standard deviation (or error) of differences, you can begin to make inferences regarding extreme values. Do you consider a difference of p m p w = 1.0 to be unusually low? Why? Do you consider a difference of p m p w = 0.5 to be unusually high? Why? NEXT STEPS For this part of the lesson, we will expand our simulated populations of men and women using decks of cards. Your instructor will provide a deck of black cards and a deck of blue cards. These decks represent the following. The black deck is the population of males who text while driving. The blue deck is the population of females who text while driving. Do not count the cards in the decks! Just as the exact truth is always unknown in large populations, the truth must remain unknown with the decks of cards. In this way, you can experience the uncertainty that occurs when working with samples. Within each deck, there are red cards and black cards. The color of each card s front side represents the following: A card with a red front is a person who sends texts frequently while driving. A card with a black front is a person who sends texts rarely while driving. Your instructor has manipulated the decks of cards so that the proportions of red cards in each deck roughly match the proportions of men and women drivers who text frequently in the real world.

STATWAY STUDENT HANDOUT 8 TRY THESE 5 From each deck of cards, you and a partner will sample 10 cards, randomly replacing each card after it is drawn before drawing the next card to preserve the independence of the trials. A Shuffle the black deck (representing the men who text while driving) and draw 10 cards with replacement. Count the red front cards drawn. What is the sample proportion of frequent texters for the men (the number of red front cards divided by the sample size n = 10)? The dotplot below includes a simulation of 30 additional proportions (p m ) generated by drawing cards from a deck, just as you have done. Add your data point to this dotplot. Men who Text Frequently Proportions of Men who Text Frequently While Driving (Seven or More Texts Per Week) 0.0 0.1 0.2 0.3 0.4 0.5 Men Proportions 0.6 0.7 Population includes only men who admit to texting while driving. B Shuffle the blue deck (representing the women who text while driving) and draw 10 cards with replacement. Count the red front cards drawn. What is the sample proportion of frequent texters for the women (the number of red front cards divided by the sample size n = 10)?

STATWAY STUDENT HANDOUT 9 Once again, a dotplot is provided below which includes 30 proportions (p w ) generated in the same way. Add your proportion to this dotplot. Women who Text Frequently Proportions of Women who Text Frequently While Driving (Seven or More Texts Per Week) 0.0 0.1 0.2 0.3 0.4 0.5 Women Proportions 0.6 0.7 Population includes only women who admit to texting while driving. C Compute the difference between the proportions you found in Questions 5a and 5b. The dotplot below is generated by taking differences between the sample proportions p m and p w in the previous dotplots. Add your difference to the dotplot of differences below. Differences Between Men and Women Proportions For Frequent Texters (Seven or More Texts Per Week) -0.3-0.2-0.1 0.0 0.1 0.2 Differences 0.3 0.4 0.5 0.6 Populations include only people who admit to texting while driving.

STATWAY STUDENT HANDOUT 10 D Add any additional differences (p m p w ) generated by others in your class to this dotplot. From the dotplot of texting men, estimate the mean by picking one proportion that is representative of the group s center. E From the dotplot of texting women, estimate the mean by picking one proportion that is representative of that group s center. F The difference of means is the mean of the differences (p m p w ). Thus, an estimate of the mean difference can be found by subtracting the estimated means from Questions 5d and 5e. Subtract these values. Does this value make a good representation of the center of the dotplot of differences? G Using complete sentences, refer to the dotplot in Question 5c to describe the distribution of differences (p m p w ) in terms of shape, symmetry, and center. 6 The variances of sample proportions (p m and p w ) are each computed as the square of the standard error, When the population proportions (π) are unknown, estimate them using sample proportions p m and p w ). The variance of the differences (p m p w ) is the sum of the individual variances. This variance is therefore estimated as a sum,

STATWAY STUDENT HANDOUT 11 Applying a square root gives an estimate of the standard error of differences (p m p w ). A Use your representative values for p m and p w (from Questions 5d and 5e) to estimate the standard error of differences between all such proportions. B Using the estimated mean of p m p w values (from Question 5f) and the standard error (from Question 6a), do you consider a difference of p m p w = 0.6 to be unusually high? C Using the estimated mean of the representative value of p m p w (from Question 5f) and the standard error (from Question 6a), do you consider a difference of p m p w = 0 to be unusually low? D Suppose that the mean value of p m is the population proportion, π m, of men who text successively. Suppose also that the mean value of p w is the population proportion, π w, of women who text successively. What is implied by the approximate difference between π m and π w given in your answer from 6c above?

STATWAY STUDENT HANDOUT 12 TAKE IT HOME To compare the proportion of men who are left-handed to the proportion of women who are left-handed, twenty-five samples, each containing 575 men each, were gathered. From each of these samples of men, a sample proportion, p m, of those who are left-handed was computed. Additionally, twenty samples of 815 women each were gathered, and from each, a sample proportion, p w, of women who are left-handed was computed. Thus, twenty-five sample proportions, p m, of left-handed men, and twenty sample proportions, p w, of lefthanded women were gathered. From these, the collection of all possible differences, p m p w, was constructed. The distribution of differences is plotted below. Differences of Left Handed Proportions For Men and Women -0.01 0.00 0.01 0.02 0.03 0.04 Male Proportion Minus Female Proportion 0.05 0.06 Twenty-five samples of men gathered with 575 members in each. Twenty Samples of women gathered with 815 members each. 1 Do you consider an 8% difference between the proportions of men and women who are lefthanded likely? If unlikely, is the difference too high or too low? 2 If you assume an 8% difference, is the proportion of left-handedness higher for men or women?

STATWAY STUDENT HANDOUT 13 3 Do you consider a 1% difference likely? If unlikely, is the difference too high or too low? 4 If you assume a 1% difference, is the proportion of left-handedness higher for men or women? 5 Pick a difference that you consider a representative value of the differences on the dotplot. 6 Does your chosen difference allow for p m and p w to be equal? If not, then which is greater? 7 Suppose that a random proportion of left-handedness for men, p m = 0.10, is chosen from a sample of size n m = 575. Suppose also that a random proportion of left-handedness for women, p w = 0.08, is chosen from a sample of size n w = 815. Estimate the standard error in the differences of sample proportions, p m p w, rounded to two places after the decimal. 8 Estimate, roughly, the number of standard errors that the difference, p m p w = 0.08, lies from your representative difference in Question 5. Does the estimated value support your answer from Question 1? 9 Estimate, roughly, the number of standard errors that the difference, p m p w = 0.01, lies from your representative difference in Question 5. Does the estimated value support your answer from Question 3?

STATWAY STUDENT HANDOUT 14 +++++ This lesson is part of STATWAY, A Pathway Through College Statistics, which is a product of a Carnegie Networked Improvement Community that seeks to advance student success. Version 1.0, A Pathway Through Statistics, Statway was created by the Charles A. Dana Center at the University of Texas at Austin under sponsorship of the Carnegie Foundation for the Advancement of Teaching. This version 1.5 and all subsequent versions, result from the continuous improvement efforts of the Carnegie Networked Improvement Community. The network brings together community college faculty and staff, designers, researchers and developers. It is an open-resource research and development community that seeks to harvest the wisdom of its diverse participants in systematic and disciplined inquiries to improve developmental mathematics instruction. For more information on the Statway Networked Improvement Community, please visit carnegiefoundation.org. For the most recent version of instructional materials, visit Statway.org/kernel. +++++ STATWAY and the Carnegie Foundation logo are trademarks of the Carnegie Foundation for the Advancement of Teaching. A Pathway Through College Statistics may be used as provided in the CC BY license, but neither the Statway trademark nor the Carnegie Foundation logo may be used without the prior written consent of the Carnegie Foundation.