Statistical Hypothesis Testing

Similar documents
Statistical tests. Paired t-test

Hypothesis Tests. w/ proportions. AP Statistics - Chapter 20

Please Turn Over Page 1 of 7

Chapter 20. Inference about a Population Proportion. BPS - 5th Ed. Chapter 19 1

Chapter 19. Inference about a Population Proportion. BPS - 5th Ed. Chapter 19 1

Proportions. Chapter 19. Inference about a Proportion Simple Conditions. Inference about a Proportion Sampling Distribution

Probability - Introduction Chapter 3, part 1

MITOCW mit_jpal_ses06_en_300k_512kb-mp4

Name: Exam 01 (Midterm Part 2 take home, open everything)

OFF THE WALL. The Effects of Artist Eccentricity on the Evaluation of Their Work ROUGH DRAFT

Introduction to Chi Square

One-Sample Z: C1, C2, C3, C4, C5, C6, C7, C8,... The assumed standard deviation = 110

Before giving a formal definition of probability, we explain some terms related to probability.

Unit 1B-Modelling with Statistics. By: Niha, Julia, Jankhna, and Prerana

Lesson Sampling Distribution of Differences of Two Proportions

Chapter 1. Probability

Chapter 1. Probability

If a fair coin is tossed 10 times, what will we see? 24.61% 20.51% 20.51% 11.72% 11.72% 4.39% 4.39% 0.98% 0.98% 0.098% 0.098%

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except on problems 1 & 2. Work neatly.

Player Speed vs. Wild Pokémon Encounter Frequency in Pokémon SoulSilver Joshua and AP Statistics, pd. 3B

Probability Rules. 2) The probability, P, of any event ranges from which of the following?

How Many Imputations are Really Needed? Some Practical Clarifications of Multiple Imputation Theory

Algebra II Journal. Module 4: Inferences. Predicting the Future

The genealogical history of a population The coalescent process. Identity by descent Distribution of pairwise coalescence times

Raise your hand if you rode a bus within the past month. Record the number of raised hands.

ECON 214 Elements of Statistics for Economists

Game Mechanics Minesweeper is a game in which the player must correctly deduce the positions of

Math 1313 Section 6.2 Definition of Probability

Jednoczynnikowa analiza wariancji (ANOVA)

x y

MATH 1115, Mathematics for Commerce WINTER 2011 Toby Kenney Homework Sheet 6 Model Solutions

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Lectures 15/16 ANOVA. ANOVA Tests. Analysis of Variance. >ANOVA stands for ANalysis Of VAriance >ANOVA allows us to:

Name: Unit 7 Study Guide 1. Use the spinner to name the color that fits each of the following statements.

Geometric Distribution

Nessie is alive! Gerco Onderwater. Role of statistics, bias and reproducibility in scientific research

On a loose leaf sheet of paper answer the following questions about the random samples.

Ex 1: A coin is flipped. Heads, you win $1. Tails, you lose $1. What is the expected value of this game?

Contents 2.1 Basic Concepts of Probability Methods of Assigning Probabilities Principle of Counting - Permutation and Combination 39

a) Getting 10 +/- 2 head in 20 tosses is the same probability as getting +/- heads in 320 tosses

Laboratory 1: Uncertainty Analysis

Math 58. Rumbos Fall Solutions to Exam Give thorough answers to the following questions:

If a fair coin is tossed 10 times, what will we see? 24.61% 20.51% 20.51% 11.72% 11.72% 4.39% 4.39% 0.98% 0.98% 0.098% 0.098%

Unit 6: What Do You Expect? Investigation 2: Experimental and Theoretical Probability

Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 13

7.1 Chance Surprises, 7.2 Predicting the Future in an Uncertain World, 7.4 Down for the Count

A Probability Work Sheet

Probability. March 06, J. Boulton MDM 4U1. P(A) = n(a) n(s) Introductory Probability

Chapter 5 - Elementary Probability Theory

1. The chance of getting a flush in a 5-card poker hand is about 2 in 1000.

7.1 Experiments, Sample Spaces, and Events

Sampling Terminology. all possible entities (known or unknown) of a group being studied. MKT 450. MARKETING TOOLS Buyer Behavior and Market Analysis

Lecture 21/Chapter 18 When Intuition Differs from Relative Frequency

Multivariate Permutation Tests: With Applications in Biostatistics

Author Manuscript Behav Res Methods. Author manuscript; available in PMC 2012 September 01.

8.6 Jonckheere-Terpstra Test for Ordered Alternatives. 6.5 Jonckheere-Terpstra Test for Ordered Alternatives

EXACT P-VALUES OF SAVAGE TEST STATISTIC

Key Concepts. Theoretical Probability. Terminology. Lesson 11-1

Sampling distributions and the Central Limit Theorem

The topic for the third and final major portion of the course is Probability. We will aim to make sense of statements such as the following:

The next several lectures will be concerned with probability theory. We will aim to make sense of statements such as the following:

TEST A CHAPTER 11, PROBABILITY

Probabilities and Probability Distributions

Lesson 1: Chance Experiments

Confidence Intervals. Class 23. November 29, 2011

Why Randomize? Jim Berry Cornell University

AP Statistics Ch In-Class Practice (Probability)

Name. Is the game fair or not? Prove your answer with math. If the game is fair, play it 36 times and record the results.

The Coin Toss Experiment

What Do You Expect? Concepts

4.1 Sample Spaces and Events

CORRELATION BASED CLASSIFICATION OF COMPLEX PRI MODULATION TYPES

Statistical Methods in Computer Science

There is no class tomorrow! Have a good weekend! Scores will be posted in Compass early Friday morning J

Tree and Venn Diagrams

Class XII Chapter 13 Probability Maths. Exercise 13.1

Probability. Dr. Zhang Fordham Univ.

GROUND DATA PROCESSING & PRODUCTION OF THE LEVEL 1 HIGH RESOLUTION MAPS

Exam III Review Problems

1) What is the total area under the curve? 1) 2) What is the mean of the distribution? 2)

Math 1313 Conditional Probability. Basic Information

Comparing Means. Chapter 24. Case Study Gas Mileage for Classes of Vehicles. Case Study Gas Mileage for Classes of Vehicles Data collection

II/IV B.Tech (Supplementary) DEGREE EXAMINATION

Basic Probability Concepts

What is the probability Jordan will pick a red marble out of the bag and land on the red section when spinning the spinner?

Lecture 13: Physical Randomness and the Local Uniformity Principle

Module 4 Project Maths Development Team Draft (Version 2)

Mathematical Foundations HW 5 By 11:59pm, 12 Dec, 2015

6. Methods of Experimental Control. Chapter 6: Control Problems in Experimental Research

Name: Exam 01 (Midterm Part 2 Take Home, Open Everything)

MITOCW watch?v=krzi60lkpek

SUPPLEMENT TO THE PAPER TESTING EQUALITY OF SPECTRAL DENSITIES USING RANDOMIZATION TECHNIQUES

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. B) Blood type Frequency

Chapter 3: Elements of Chance: Probability Methods

WEEK 7 REVIEW. Multiplication Principle (6.3) Combinations and Permutations (6.4) Experiments, Sample Spaces and Events (7.1)

Genealogical trees, coalescent theory, and the analysis of genetic polymorphisms

Unit 6: Probability Summative Assessment. 2. The probability of a given event can be represented as a ratio between what two numbers?

Probability. Ms. Weinstein Probability & Statistics

Unit 8: Sample Surveys

More on games (Ch )

Transcription:

Statistical Hypothesis Testing Statistical Hypothesis Testing is a kind of inference Given a sample, say something about the population Examples: Given a sample of classifications by a decision tree, test the hypothesis that classification performance is no better than chance (testing that a population parameter has a value) Given a sample of KOSO and KOSO* trials, test the hypothesis that KOSO* generally has shorter run-times than KOSO (comparison of two populations based on samples) Given a sample of knowledge engineers each entering x pieces of knowledge in a fixed time t, how what is the value of x/t for any knowledge engineer, and what is your confidence in it (parameter estimation)

Statistical Hypothesis Testing More examples Given runtimes for KOSO and KOSO* under a factorial arrangement of conditions four levels of number of processors, four levels of alpha test the hypothesis that the superiority of KOSO* increases as the number of processors decreases (looking for interaction effects) Given a correlation between log(runtime) and the size of a job, test whether the population value of the correlation is ρ Calculate a confidence interval around your estimate of the population value of the correlation

Statistical Inference Let s start with an example... Fred tosses a coin 20 times and it comes up heads 15 times. Fred wonders whether the coin is fair. The Logic: Let R be the result 15 out of 20 heads Assume the coin is fair (Π =.5) If R is very unlikely under the assumption that the coin is fair, Pr(R Π =.5) α then Fred is inclined to reject the hypothesis that the coin is fair. Fred picks a value of α (say,.01) and calculates the conditional probability p = Pr(R Π =.5) Fred s residual uncertainty that the coin is fair is less than or equal to α p is not the probability that the coin is fair, nor is 1 - p the probability that the coin is not fair

Running the example Write down the null and alternative hypotheses Ho: Π =.5 H1: Π.5 Decide on a level of confidence α =.5 Calculate the sampling distribution of the number of heads in 20 tosses under the assumption that the coin is fair Calculate the probability p of the sample result from the sampling distribution If p α reject the null hypothesis

Running the example Write down the null and alternative hypotheses Ho: Π =.5 H1: Π.5 Decide on a level of confidence α Calculate the sampling distribution of the number of heads in 20 tosses under the assumption that the coin is fair Calculate the probability p of the sample result from the sampling distribution If p α reject the null hypothesis

The probability of 15 out of 20 tosses coming up heads under the assumption that the coin is fair (defun toss-coin (p) "Returns 1 with probability p, 0 with probability 1-p" (if (< (random 1.0) p) 1 0)) (defun sampling-distribution-of-heads (p n k) "p is the bias of the coin, n is the number of tosses in a sample, and k is the size of the sampling distribution the number of sample values of the number of heads" (loop repeat k collect (apply #'+ (loop repeat n collect (toss-coin p)))))

The probability of 15 out of 20 tosses coming up heads under the assumption that the coin is fair (defun toss-coin (p) "Returns 1 with probability p, 0 with probability 1-p" (if (< (random 1.0) p) 1 0)) (defun sampling-distribution-of-heads (p n k) "p is the bias of the coin, n is the number of tosses in a sample, and k is the size of the sampling distribution the number of sample values of the number of heads" (loop repeat k collect (apply #'+ (loop repeat n collect (toss-coin p))))) 90 80 70 (sampling-distribution-of-heads.5 20 500) 60 50 40 30 20 10 4 5 6 7 8 9 10 11 12 13 14 15 16

The probability of 15 out of 20 tosses coming up heads under the assumption that the coin is fair (sampling-distribution-of-heads.5 20 500) 90 80 70 60 50? (setf sorted-dist (sort dist '<))? (what-percentile? 15 sorted-dist) 0.0040000000000000036? (setf critical-value (quantile sorted-dist.95 )) 13.0 40 30 20 10 4 5 6 7 8 9 10 11 12 13 14 15 16

Running the example Write down the null and alternative hypotheses Ho: Π =.5 H1: Π.5 Decide on a level of confidence α Calculate the sampling distribution of the number of heads in 20 tosses under the assumption that the coin is fair Calculate the probability p of the sample result from the sampling distribution If p α reject the null hypothesis

Statistical Inference Fred tosses a coin 20 times and it comes up heads 15 times. Fred wonders whether the coin is fair. The Logic: Let R be the result 15 out of 20 heads Assume the coin is fair (Π =.5) If R is very unlikely under the assumption that the coin is fair, Pr(R Π =.5) α then Fred is inclined to reject the hypothesis that the coin is fair. Fred picks a value of α (say,.01) and calculates the conditional probability p = Pr(R Π =.5) =.004 Fred s residual uncertainty that the coin is fair is less than or equal to α.004 is not the probability that the coin is fair, nor is 1 -.004 the probability that the coin is not fair

Statistical Hypothesis Testing Sampling Distributions Suppose you have a sample of size N and you calculate a statistic φ on that sample (e.g., the mean) If you had drawn a different sample of size N, you would have a different value of φ The distribution of φ for all possible samples of size N is called the sampling distribution of φ The distribution of some possible samples of size N, obtained by Monte Carlo sampling (or other computer-intensive methods) is called the empirical sampling distribution of φ The empirical sampling distribution estimates the sampling distribution

Statistical Hypothesis Testing Sampling Distributions and the Null Hypothesis When we get a sampling distribution for hypothesis testing, it should be the sampling distribution for φ under the assumption that Ho is true That way, we can calculate the probability p of the sample result the particular value of φ under Ho We reject Ho if p α α Sampling distribution of φ under Ho Pr ( φ Ho) φ