CHAPTER 1 Exploring Data


1.1 Analyzing Categorical Data
The Practice of Statistics, 5th Edition
Starnes, Tabor, Yates, Moore
Bedford Freeman Worth Publishers

Analyzing Categorical Data: Learning Objectives
After this section, you should be able to:
- DISPLAY categorical data with a bar graph
- IDENTIFY what makes some graphs of categorical data deceptive
- CALCULATE and DISPLAY the marginal distribution of a categorical variable from a two-way table
- CALCULATE and DISPLAY the conditional distribution of a categorical variable for a particular value of the other categorical variable in a two-way table
- DESCRIBE the association between two categorical variables

Categorical Variables
Categorical variables place individuals into one of several groups or categories. In the tables below, the variable is radio station format and its values are the individual formats. The frequency table gives the count of stations in each format; the relative frequency table gives the percent of stations.

Format               Count of Stations   Percent of Stations
Adult Contemporary   1556                11.2
Adult Standards      1196                8.6
Contemporary Hit     569                 4.1
Country              2066                14.9
News/Talk            2179                15.7
Oldies               1060                7.7
Religious            2014                14.6
Rock                 869                 6.3
Spanish Language     750                 5.4
Other Formats        1579                11.4
Total                13838               99.9
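The step from the frequency table to the relative frequency table can be sketched in a few lines of Python. This is only an illustration, not something the slides provide: the counts are copied from the table above, and the variable names (`counts`, `percents`) are assumptions made for the example.

```python
# Station counts by radio format, copied from the frequency table.
counts = {
    "Adult Contemporary": 1556,
    "Adult Standards": 1196,
    "Contemporary Hit": 569,
    "Country": 2066,
    "News/Talk": 2179,
    "Oldies": 1060,
    "Religious": 2014,
    "Rock": 869,
    "Spanish Language": 750,
    "Other Formats": 1579,
}

# Total number of stations across all formats.
total = sum(counts.values())

# Relative frequency: each count as a percent of the total, to one decimal.
percents = {fmt: round(100 * n / total, 1) for fmt, n in counts.items()}

for fmt, n in counts.items():
    print(f"{fmt:<20}{n:>6}{percents[fmt]:>7.1f}")

# The rounded percents need not add up to exactly 100.
percent_total = round(sum(percents.values()), 1)
print(f"{'Total':<20}{total:>6}{percent_total:>7.1f}")
```

The percent column totals 99.9 rather than 100 because each entry is rounded to one decimal place before the entries are added.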

Displaying Categorical Data
Frequency tables can be difficult to read. Sometimes it is easier to analyze a distribution by displaying it with a bar graph or pie chart.
[Figure: bar graph of the frequency table, count of stations by radio format; vertical axis from 0 to 2500.]

[Figure: pie chart of the relative frequency table, percent of stations by radio format.]

Graphs: Good and Bad
Bar graphs compare several quantities by comparing the heights of bars that represent those quantities. Our eyes, however, react to the area of the bars as well as to their height.
- When you draw a bar graph, make the bars equally wide.
It is tempting to replace the bars with pictures for greater eye appeal.
- Don't do it!
There are two important lessons to keep in mind: (1) beware the pictograph, and (2) watch those scales.
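Both lessons come down to simple arithmetic, which the following Python sketch tries to make concrete. Everything in it is illustrative: the function names and the numbers (heights 1 and 2, values 50 and 55, an axis starting at 45) are hypothetical and do not come from the slides.

```python
def bar_area(height, width=1.0):
    """Area of a bar with a fixed width: grows in proportion to height."""
    return height * width


def pictograph_area(height, aspect=1.0):
    """Area of a picture scaled in both directions (width = aspect * height)."""
    return height * (aspect * height)


def apparent_ratio(a, b, axis_start):
    """Ratio of the drawn bar heights when the vertical axis starts at axis_start."""
    return (b - axis_start) / (a - axis_start)


# Lesson 1: beware the pictograph (hypothetical heights 1 and 2).
small, large = 1.0, 2.0
bar_ratio = bar_area(large) / bar_area(small)                  # doubles
picto_ratio = pictograph_area(large) / pictograph_area(small)  # quadruples
print(bar_ratio, picto_ratio)

# Lesson 2: watch those scales (hypothetical values 50 and 55).
true_ratio = 55 / 50                        # the second value is 10% larger
drawn_ratio = apparent_ratio(50, 55, 45)    # but its bar is drawn twice as tall
print(true_ratio, drawn_ratio)
```

With equally wide bars, doubling a quantity doubles the area the eye sees. A picture scaled in both directions quadruples it, and a vertical axis that does not start at zero exaggerates small differences in the same way.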

Two-Way Tables and Marginal Distributions
When a dataset involves two categorical variables, we begin by examining the counts or percents in various categories for one of the variables. A two-way table describes two categorical variables, organizing counts according to a row variable and a column variable.

Young adults by gender and chance of getting rich
Opinion                         Female   Male   Total
Almost no chance                96       98     194
Some chance, but probably not   426      286    712
A 50-50 chance                  696      720    1416
A good chance                   663      758    1421
Almost certain                  486      597    1083
Total                           2367     2459   4826

What are the variables described by this two-way table? How many young adults were surveyed?

Two-Way Tables and Marginal Distributions
The marginal distribution of one of the categorical variables in a two-way table of counts is the distribution of values of that variable among all individuals described by the table.
Note: Percents are often more informative than counts, especially when comparing groups of different sizes.
How to examine a marginal distribution:
1) Use the data in the table to calculate the marginal distribution (in percents) of the row or column totals.
2) Make a graph to display the marginal distribution.

Two-Way Tables and Marginal Distributions
Examine the marginal distribution of chance of getting rich, using the row totals of the two-way table of young adults by gender and chance of getting rich.

Response           Percent
Almost no chance   194/4826 = 4.0%
Some chance        712/4826 = 14.8%
A 50-50 chance     1416/4826 = 29.3%
A good chance      1421/4826 = 29.4%
Almost certain     1083/4826 = 22.4%

[Figure: bar graph titled "Chance of being wealthy by age 30"; horizontal axis: Survey Response (Almost none, Some chance, 50-50 chance, Good chance, Almost certain); vertical axis: Percent, from 0 to 35.]
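The marginal distribution above can be reproduced from the cell counts with a short Python sketch. The nested-dictionary layout and the names `table`, `row_totals`, and `marginal` are assumptions chosen for illustration; only the counts come from the two-way table.

```python
# Two-way table of counts: opinion (rows) by gender (columns).
table = {
    "Almost no chance": {"Female": 96, "Male": 98},
    "Some chance, but probably not": {"Female": 426, "Male": 286},
    "A 50-50 chance": {"Female": 696, "Male": 720},
    "A good chance": {"Female": 663, "Male": 758},
    "Almost certain": {"Female": 486, "Male": 597},
}

# Step 1: row totals, then the grand total of all young adults surveyed.
row_totals = {resp: sum(cells.values()) for resp, cells in table.items()}
grand_total = sum(row_totals.values())

# Marginal distribution of opinion: each row total as a percent of everyone.
marginal = {
    resp: round(100 * n / grand_total, 1) for resp, n in row_totals.items()
}

for resp, pct in marginal.items():
    print(f"{resp:<32}{row_totals[resp]:>5}/{grand_total} = {pct:.1f}%")
```

Gender plays no role in this calculation: the marginal distribution uses only the totals in the margin of the table.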

Relationships Between Categorical Variables
A conditional distribution of a variable describes the values of that variable among individuals who have a specific value of another variable.
How to examine or compare conditional distributions:
1) Select the row(s) or column(s) of interest.
2) Use the data in the table to calculate the conditional distribution (in percents) of the row(s) or column(s).
3) Make a graph to display the conditional distribution. Use a side-by-side bar graph or segmented bar graph to compare distributions.

Relationships Between Categorical Variables
Calculate the conditional distribution of opinion among males. Examine the relationship between gender and opinion.

Response           Male                Female
Almost no chance   98/2459 = 4.0%      96/2367 = 4.1%
Some chance        286/2459 = 11.6%    426/2367 = 18.0%
A 50-50 chance     720/2459 = 29.3%    696/2367 = 29.4%
A good chance      758/2459 = 30.8%    663/2367 = 28.0%
Almost certain     597/2459 = 24.3%    486/2367 = 20.5%

[Figure: segmented bar graph titled "Chance of being wealthy by age 30", with one bar for Males and one for Females; vertical axis: Percent, from 0% to 100%; segments by Opinion (Almost no chance, Some chance, 50-50 chance, Good chance, Almost certain).]
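As a rough sketch of the calculation above, the Python below computes the conditional distribution of opinion for each gender. The helper name `conditional_distribution` and the dictionary layout are hypothetical; the counts are those of the two-way table.

```python
# Two-way table of counts: opinion (rows) by gender (columns).
table = {
    "Almost no chance": {"Female": 96, "Male": 98},
    "Some chance, but probably not": {"Female": 426, "Male": 286},
    "A 50-50 chance": {"Female": 696, "Male": 720},
    "A good chance": {"Female": 663, "Male": 758},
    "Almost certain": {"Female": 486, "Male": 597},
}


def conditional_distribution(table, column):
    """Percent of each response among individuals in one column (one gender)."""
    column_total = sum(row[column] for row in table.values())
    return {
        resp: round(100 * row[column] / column_total, 1)
        for resp, row in table.items()
    }


male = conditional_distribution(table, "Male")
female = conditional_distribution(table, "Female")

# Print the two distributions side by side for comparison.
print(f"{'Response':<32}{'Male':>8}{'Female':>8}")
for resp in table:
    print(f"{resp:<32}{male[resp]:>7.1f}%{female[resp]:>7.1f}%")
```

Each percent is taken out of that gender's column total (2459 males, 2367 females) rather than the grand total. That is what makes the two groups comparable even though they differ in size.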

Relationships Between Categorical Variables
Can we say there is an association between gender and opinion in the population of young adults? Making this determination requires formal inference, which will have to wait a few chapters.
Caution! Even a strong association between two categorical variables can be influenced by other variables lurking in the background.

Data Analysis: Making Sense of Data
Section Summary
In this section, we learned how to:
- DISPLAY categorical data with a bar graph
- IDENTIFY what makes some graphs of categorical data deceptive
- CALCULATE and DISPLAY the marginal distribution of a categorical variable from a two-way table
- CALCULATE and DISPLAY the conditional distribution of a categorical variable for a particular value of the other categorical variable in a two-way table
- DESCRIBE the association between two categorical variables