Chapter Displaying Graphical Data. Frequency Distribution Example. Graphical Methods for Describing Data. Vision Correction Frequency Relative

Similar documents
Chapter 3. Graphical Methods for Describing Data. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

Notes: Displaying Quantitative Data

Business Statistics:

Chapter 10. Definition: Categorical Variables. Graphs, Good and Bad. Distribution

Outline. Drawing the Graph. 1 Homework Review. 2 Introduction. 3 Histograms. 4 Histograms on the TI Assignment

Section 1.5 Graphs and Describing Distributions

Chapter 4. September 08, appstats 4B.notebook. Displaying Quantitative Data. Aug 4 9:13 AM. Aug 4 9:13 AM. Aug 27 10:16 PM.

Frequency Distribution and Graphs

Going back to the definition of Biostatistics. Organizing and Presenting Data. Learning Objectives. Nominal Data 10/10/2016. Tabulation and Graphs

Name: Date: Period: Histogram Worksheet

Describing Data Visually. Describing Data Visually. Describing Data Visually 9/28/12. Applied Statistics in Business & Economics, 4 th edition

DESCRIBING DATA. Frequency Tables, Frequency Distributions, and Graphic Presentation

Displaying Distributions with Graphs

Chapter 2 Frequency Distributions and Graphs

STK110. Chapter 2: Tabular and Graphical Methods Lecture 1 of 2. ritakeller.com. mathspig.wordpress.com

Chapter 1. Picturing Distributions with Graphs

NCSS Statistical Software

Chapter 4 Displaying and Describing Quantitative Data

Stat 20: Intro to Probability and Statistics

Chpt 2. Frequency Distributions and Graphs. 2-3 Histograms, Frequency Polygons, Ogives / 35

LESSON 2: FREQUENCY DISTRIBUTION

Organizing Data 10/11/2011. Focus Points. Frequency Distributions, Histograms, and Related Topics. Section 2.1

Chapter 1. Statistics. Individuals and Variables. Basic Practice of Statistics - 3rd Edition. Chapter 1 1. Picturing Distributions with Graphs

Numerical: Data with quantity Discrete: whole number answers Example: How many siblings do you have?

Chapter 4. Displaying and Summarizing Quantitative Data. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Elementary Statistics. Graphing Data

PSY 307 Statistics for the Behavioral Sciences. Chapter 2 Describing Data with Tables and Graphs

Section 1: Data (Major Concept Review)

Statistics for Managers using Microsoft Excel 3 rd Edition

Review. In an experiment, there is one variable that is of primary interest. There are several other factors, which may affect the measured result.

10 Wyner Statistics Fall 2013

Chapter 2. Organizing Data. Slide 2-2. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Variables. Lecture 13 Sections Wed, Sep 16, Hampden-Sydney College. Displaying Distributions - Quantitative.

Statistics. Graphing Statistics & Data. What is Data?. Data is organized information. It can be numbers, words, measurements,

11 Wyner Statistics Fall 2018

Important Considerations For Graphical Representations Of Data

Learning Log Title: CHAPTER 2: ARITHMETIC STRATEGIES AND AREA. Date: Lesson: Chapter 2: Arithmetic Strategies and Area

Chapter 1: Stats Starts Here Chapter 2: Data

Sections Descriptive Statistics for Numerical Variables

1.1 Displaying Distributions with Graphs, Continued

AP Statistics Composition Book Review Chapters 1 2

TOPIC 4 GRAPHICAL PRESENTATION

Notes 5C: Statistical Tables and Graphs

Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation. Chapter 2

Descriptive Statistics II. Graphical summary of the distribution of a numerical variable. Boxplot

Principles of Graphical Excellence Best Paper: ALAIR April 5 6, 2001 AIR: June 2-5, 2002, Toronto Focus-IR, February 21, 2003

Purpose. Charts and graphs. create a visual representation of the data. make the spreadsheet information easier to understand.

Tables and Figures. Germination rates were significantly higher after 24 h in running water than in controls (Fig. 4).

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.2- #

Chapter 2. The Excel functions, Excel Analysis ToolPak Add-ins or Excel PHStat2 Add-ins needed to create frequency distributions are:

Why Should We Care? Everyone uses plotting But most people ignore or are unaware of simple principles Default plotting tools are not always the best

Introduction. Descriptive Statistics. Problem Solving. Inferential Statistics. Chapter1 Slides. Maurice Geraghty

(3 pts) 1. Which statements are usually true of a left-skewed distribution? (circle all that are correct)

Office 2016 Excel Basics 24 Video/Class Project #36 Excel Basics 24: Visualize Quantitative Data with Excel Charts. No Chart Junk!!!

Statistics is the study of the collection, organization, analysis, interpretation and presentation of data.

2.2 More on Normal Distributions and Standard Normal Calculations

Business Statistics. Lecture 2: Descriptive Statistical Graphs and Plots

A Visual Display. A graph is a visual display of information or data. This is a graph that shows a girl walking her dog. Communicating with Graphs

Learning Objectives. Describing Data: Displaying and Exploring Data. Dot Plot. Dot Plot 12/9/2015

Describing Data: Displaying and Exploring Data. Chapter 4

PLC Papers Created For:

This Chapter s Topics

Chapter 2 Descriptive Statistics: Tabular and Graphical Methods

Mrs. Ambre s Math Notebook

Univariate Descriptive Statistics

Graphing Guidelines. Controlled variables refers to all the things that remain the same during the entire experiment.

Sidcot intranet - Firefly. Useful links: Instant classroom. MyMaths. Objectives

Interval of Head Circumferences (mm) XS 510 < 530 S 530 < 550 M 550 < 570 L 570 < 590 XL 590 < 610 XXL 610 < 630. Hat Sizes.

3. Data and sampling. Plan for today

TJP TOP TIPS FOR IGCSE STATS & PROBABILITY

Using Charts and Graphs to Display Data

AWM 11 UNIT 1 WORKING WITH GRAPHS

General tips for all graphs Choosing the right kind of graph scatter graph bar graph

PASS Sample Size Software

Core Connections, Course 2 Checkpoint Materials

Why Should We Care? More importantly, it is easy to lie or deceive people with bad plots

Describing Data. Presenting Categorical Data Graphically. Describing Data 143

!"#$%&'("&)*("*+,)-(#'.*/$'-0%$1$"&-!!!"#$%&'(!"!!"#$%"&&'()*+*!

Algebra I. Measures of Central Tendency: Mean, Median, Mode & Additional Measures of Data. Slide 1 / 141 Slide 2 / 141. Slide 4 / 141.

Name: Date: Class: Lesson 3: Graphing. a. Useful for. AMOUNT OF HEAT PRODUCED IN KJ. b. Difference between a line graph and a scatter plot:

Lecture 2: Chapter 2

CHAPTER 13A. Normal Distributions

Infographics at CDC for a nonscientific audience

ESSENTIAL MATHEMATICS 1 WEEK 17 NOTES AND EXERCISES. Types of Graphs. Bar Graphs

Data Presentation. Esra Akdeniz. February 12th, 2016

Excel Manual X Axis Scale Start At Graph

TO PLOT OR NOT TO PLOT?

Mathematics Success Grade 8

TEKSING TOWARD STAAR MATHEMATICS GRADE 6. Student Book

Problem Solving with Length, Money, and Data

What Is a Histogram? A bar graph that shows the distribution of data A snapshot of data taken from a process HISTOGRAM VIEWGRAPH 1

Data and its representation

To describe the centre and spread of a univariate data set by way of a 5-figure summary and visually by a box & whisker plot.

CS 147: Computer Systems Performance Analysis

Exploring Data Patterns. Run Charts, Frequency Tables, Histograms, Box Plots

SAMPLE. This chapter deals with the construction and interpretation of box plots. At the end of this chapter you should be able to:

Using Graphing Skills

Using Figures - The Basics

BE540 - Introduction to Biostatistics Computer Illustration. Topic 1 Summarizing Data Software: STATA. A Visit to Yellowstone National Park, USA

4 th Grade Curriculum Map

Transcription:

Chapter 3 Graphical Methods for Describing 3.1 Displaying Graphical Distribution Example The data in the column labeled vision for the student data set introduced in the slides for chapter 1 is the answer to the question, What is your principle means of correcting your vision? The results are tabulated below Vision Correction None 38 38/79 =.481 Glasses 31 31/79 =.392 Contacts 1 1/79 =.127 Total 79 1. 1

Histogram Chart Examples 3 Contacts Glasses None Count of Gender 2 1 Female Gender Male This comparative bar chart is based on frequencies and it can be difficult to interpret and misleading. Would you mistakenly interpret this to mean that the females and males use contacts equally often? You shouldn t. The picture is distorted because the frequencies of males and females are not equal. Histogram Chart Examples Percent Count of Gender 1 5 Contacts Glasses None Female Gender Male When the comparative bar chart is based on percents (or relative frequencies) (each group adds up to 1%) we can clearly see a difference in pattern for the eye correction proportions for each of the genders. Clearly for this sample of students, the proportion of female students with contacts is larger then the proportion of males with contacts. Bar Chart Examples Male Contacts Glasses None Gender Female 5 Percent Count of Gender 1 Stacking the bar chart can also show the difference in distribution of eye correction method. This graph clearly shows that the females have a higher proportion using contacts and both the no correction and glasses group have smaller proportions then for the males. 2

Pie Charts - Procedure 1. Draw a circle to represent the entire data set. 2. For each category, calculate the slice size. Slice size = 36(category relative frequency) 3. Draw a slice of appropriate size for each category. Pie Chart - Example Using the vision correction data we have: Pie Chart of Eye Correction All Students Glasses (31, 39.2%) Contacts (1, 12.7%) None (38, 48.1%) Pie Chart - Example Using side-by-side pie charts we can compare the vision correction for males and females. Pie Chart for Eye Corrections for Females Contacts, 5, 2% Pie Chart for Eye Corrections for Males Contacts, 5, 9% None, 11, 44% None, 28, 52% Glasses, 21, 39% Glasses, 9, 36% 3

Another Example This data constitutes the grades earned by the distance learning students during one term in the Winter of 22. Student Grade Students Proportion A 454.414 B 293.267 C 113.13 D 35.32 F 32.29 I 92.84 W 78.71 Pie Chart Another Example Using the grade data from the previous slide we have: F 3% D 3% C 1% I 8% W 7% A 42% B 27% Grade Distribution Pie Chart Another Example Using the grade data we have: C 1% D F 3% 3% I 8% W 7% B 27% Grade Distribution A 42% By pulling a slice (exploding) we can accentuate and make it clearing how A was the predominate grade for this course. 4

3.2: Numerical : Stem and Leaf A quick technique for picturing the distributional pattern associated with numerical data is to create a picture called a stem-and-leaf diagram (Commonly called a stem plot). 1. We want to break up the data into a reasonable number of groups. 2. Looking at the range of the data, we choose the stems (one or more of the leading digits) to get the desired number of groups. 3. The next digits (or digit) after the stem become(s) the leaf. 4. Typically, we truncate (leave off) the remaining digits. Stem and Leaf For our first example, we use the weights of the 25 female students. Choosing the 1 st two digits as the stem and the 3 rd digit as the leaf we have the following 15 14 155 195 139 2 157 13 113 13 121 14 14 15 125 135 124 13 15 125 12 13 17 124 16 1 11 12 13 14 15 16 17 18 19 2 3 3 15454 95 57 5 Stem and Leaf Typically we sort the order the stems in increasing order. We also note on the diagram the units for stems and leaves 1 11 12 13 14 15 16 17 18 19 2 3 3 14455 59 57 5 Probable outliers Stem: Tens and hundreds digits Leaf: Ones digit 5

Stem-and-leaf GPA example The following are the GPAs for the 2 advisees of a faculty member. GPA 3.9 2.4 2.27 3.98 3.7 2.99 3.72 3.23 3.13 3.5 2.26 3.15 2.8 1.75 3.89 3.38 2.74 1.65 2.22 2.66 If the ones digit is used as the stem, you only get three groups. You can expand this a little by breaking up the stems by using each stem twice letting the 2 nd digits -4 go with the first and the 2 nd digits 5-9 with the second. Call it Low & High The next slide gives two versions of the stem-and-leaf diagram. 1L 1H 2L 2H 3L 3H Stem-and-leaf GPA example 1L 1H 2L 2H 3L 3H 67 222 6978 1123 57789 65,75 4,22,26,27 66,99,74,8 9,13,15,23,38 5,7,72,89,98 Stem: Ones digit Leaf: Tenths digits Stem: Ones digit Leaf: Tenths and hundredths digits Note: The characters in a stem-and-leaf diagram must all have the same width, so if typing a fixed character width font such as courier. Comparative Stem & Leaf Diagram Student Wt (Comparing 2 groups) When it is desirable to compare two groups, back-to-back stem and leaf diagrams are useful. Here is the result from the student weights. From this comparative stem and leaf diagram, it is clear that the males weigh more (as a group not necessarily as individuals) than the females. 3 1 3 11 7 55441 12 145 95 13 4558 14 555 75 15 5556 16 5558 17 5555 18 358 5 19 2 21 22 55 23 79 6

From this comparative stem and leaf diagram, it is clear that the male ages are all more closely grouped then the females. Also the females had a number of outliers. Comparative Stem & Leaf Diagram Student Age female male 7 1 9999 1 888889999999999999999 1111 2 1111111111 3322222 2 2222223333 4 2 445 2 6 2 88 3 3 3 7 3 8 3 4 4 4 4 7 4 3.3: Distributions & Histograms When working with discrete data, the frequency tables are similar to those produced for qualitative data. For example, a survey of local law firms in a medium sized town gave Number of Lawyers 1 11.44 2 7.28 3 4.16 4 2.8 5 1.4 Distributions & Histograms When working with discrete data, the steps to construct a histogram are 1. Draw a horizontal scale, and mark the possible values. 2. Draw a vertical scale and mark it with either frequencies or relative frequencies (usually start at ). 3. Above each possible value, draw a rectangle whose height is the frequency (or relative frequency) centered at the data value with a width chosen appropriately. Typically if the data values are integers then the widths will be one. 7

Distributions & Histograms Look for a central or typical value, extent of spread or variation, general shape, location and number of peaks, and presence of gaps and outliers. Distributions & Histograms The number of lawyers in the firm will have the following histogram. 12 1 8 6 4 2 1 2 3 4 5 # of Lawyers Clearly, the largest group are single member law firms and the frequency decreases as the number of lawyers in the firm increases. Distributions & Histograms 5 students were asked the question, How many textbooks did you purchase last term? The result is summarized below and the histogram is on the next slide. # of Textbooks 1 or 2 4.8 3 or 4 16.32 5 or 6 24.48 7 or 8 6.12 8

Distributions & Histograms How many textbooks did you purchase last term?.6.5 Proportion of Students.4.3.2.1. 1 or 2 3 or 4 5 or 6 7 or 8 # of Textbooks The largest group of students bought 5 or 6 textbooks with 3 or 4 being the next largest frequency. Distributions & Histograms Another version with the scales produced differently. Distributions & Histograms When working with continuous data, the steps to construct a histogram are 1. Decide into how many groups or classes you want to break up the data. Typically somewhere between 5 and 2. A good rule of thumb is to think having an average of more than 5 per group.* 2. Use your answer to help decide the width of each group. 3. Determine the starting point for the lowest group. *A quick estimate for a reasonable number of intervals is number of observations 9

Example of Distribution Consider the student weights in the student data set. The data values fall between 13 (lowest) and 239 (highest). The range of the dataset is 239-13=136. There are 79 data values, so to have an average of at least 5 per group, we need 16 or fewer groups. We need to choose a width that breaks the data into 16 or fewer groups. Any width 1 or large would be reasonable. Example of Distribution Choosing a width of 15 we have the following frequency distribution. Class Interval 1 to <115 2.25 115 to <13 1.127 13 to <145 21.266 145 to <16 15.19 16 to <175 15.19 175 to <19 8.11 19 to <25 3.38 25 to <22 1.13 22 to <235 2.25 235 to <25 2.25 79 1. Histogram for Continuous Mark the boundaries of the class intervals on a horizontal axis Use frequency or relative frequency on the vertical scale. 1

Histogram for Continuous The following histogram is for the frequency table of the weight data. Histogram for Continuous The following histogram is the Minitab output of the relative frequency histogram. Notice that the relative frequency scale is in percent. Cumulative Table If we keep track of the proportion of that data that falls below the upper boundaries of the classes, we have a cumulative relative frequency table. Cumulative Class Interval 1 to < 115.25.25 115 to < 13.127.152 13 to < 145.266.418 145 to < 16.19.68 16 to < 175.19.797 175 to < 19.11.899 19 to < 25.38.937 25 to < 22.13.949 22 to < 235.25.975 235 to < 25.25 1. 11

Cumulative Plot If we graph the cumulative relative frequencies against the upper endpoint of the corresponding interval, we have a cumulative relative freq plot. Crumulative 1..8.6.4.2 Cumulative Plot for the Student Weights. 1 115 13 145 16 175 19 25 22 235 25 Weight (pounds) Histogram for Continuous Another version of a frequency table and histogram for the weight data with a class width of 2. Class Interval 1 to <12 3.38 12 to <14 21.266 14 to <16 24.34 16 to <18 19.241 18 to <2 5.63 2 to <22 3.38 22 to <24 4.51 79 1.1 Histogram for Continuous The resulting histogram. 12

Histogram for Continuous The resulting cumulative relative frequency plot. Cumulative Plot for the Student Weights Cum Rel 1..8.6.4.2. 1 115 13 145 16 175 19 25 22 235 Weight (pounds) Histogram for Continuous Yet, another version of a frequency table and histogram for the weight data with a class width of 2. Class Interval 95 to <115 2.25 115 to <135 17.215 135 to <155 23.291 155 to <175 21.266 175 to <195 8.11 195 to <215 4.51 215 to <235 2.25 235 to <255 2.25 79.999 Histogram for Continuous The corresponding histogram. 13

Histogram for Continuous A class width of 15 or 2 seems to work well because all of the pictures tell the same story. The bulk of the weights appear to be centered around 15 lbs with a few values substantially large. The distribution of the weights is unimodal and is positively skewed. Illustrated Distribution Shapes Unimodal Bimodal Multimodal Skew negatively Symmetric Skew positively Histograms with uneven class widths Consider the following frequency histogram of ages based on A with class widths of 2. Notice it is a bit choppy. Because of the positively skewed data, sometimes frequency distributions are created with unequal class widths. 14

Histograms with uneven class widths For many reasons, either for convenience or because that is the way data was obtained, the data may be broken up in groups of uneven width as in the following example referring to the student ages. Class Interval 18 to <2 26.329 2 to <22 24.34 22 to <24 17.215 24 to <26 4.51 26 to <28 1.13 28 to <4 5.63 4 to <5 2.25 Histograms with uneven class widths If a frequency (or relative frequency) histogram is drawn with the heights of the bars being the frequencies (relative frequencies), the result is distorted. Notice that it appears that there are a lot of people over 28 when there is only a few. Histograms with uneven class widths To correct the distortion, we create a density histogram. The vertical scale is called the density and the density of a class is calculated by density = rectangle height = relative frequency of class class width This choice for the density makes the area of the rectangle equal to the relative frequency. 15

Histograms with uneven class widths Continuing this example we have Class Interval Density 18 to <2 26.329.165 2 to <22 24.34.152 22 to <24 17.215.18 24 to <26 4.51.26 26 to <28 1.13.7 28 to <4 5.63.5 4 to <5 2.25.3 Histograms with uneven class widths The resulting histogram is now a reasonable representation of the data. 3.4: Displaying Bivariate Scatterplots A scatterplot is a plot of pairs of observed values (both quantitative) of two different variables. It s plotting the (x, y) ordered pair on coordinate plane like you have did in Alg/Geo When one of the variables is considered to be a response variable (y) and the other an explanatory variable (x). The explanatory variable is usually plotted on the x axis 16

A sample of one-way Greyhound bus fares from Rochester, NY to cities less than 75 miles was taken by going to Greyhound s website. The following table gives the destination city, the distance and the one-way fare. Distance should be the x axis and the Fare should be the y axis. Example Standard One-Way Fare Destination City Distance Albany, NY 24 39 Baltimore, MD 43 81 Buffalo, NY 69 17 Chicago, IL 67 96 Cleveland, OH 257 61 Montreal, QU 48 7.5 New York City, NY 34 65 Ottawa, ON 467 82 Philadelphia, PA 335 67 Potsdam, NY 239 47 Syracuse, NY 95 2 Toronto, ON 178 35 Washington, DC 496 87 Example Scatterplot $1 $9 Greyhound Bus Fares Vs. Distance $8 Standard One-Way Fare $7 $6 $5 $4 $3 $2 $1 5 15 25 35 45 55 65 Distance from Rochester, NY (miles) Displaying Bivariate Time series Just plot most any univariate data against time (if applicable). As an example you could measure your height every hour during the day (interested in height), then plot with time on the x axis to see the change in ht as the day goes by. 17