TO PLOT OR NOT TO PLOT?

Similar documents
Using Figures - The Basics

Graphing Techniques. Figure 1. c 2011 Advanced Instructional Systems, Inc. and the University of North Carolina 1

Appendix III Graphs in the Introductory Physics Laboratory

Page 21 GRAPHING OBJECTIVES:

Chapter 10. Re-expressing Data: Get it Straight! Copyright 2012, 2008, 2005 Pearson Education, Inc.

Why Should We Care? Everyone uses plotting But most people ignore or are unaware of simple principles Default plotting tools are not always the best

Laboratory 2: Graphing

Year 11 Graphing Notes

Chapter 2: PRESENTING DATA GRAPHICALLY

LINEAR EQUATIONS IN TWO VARIABLES

Why Should We Care? More importantly, it is easy to lie or deceive people with bad plots

Appendix C: Graphing. How do I plot data and uncertainties? Another technique that makes data analysis easier is to record all your data in a table.

PASS Sample Size Software

Graphs. This tutorial will cover the curves of graphs that you are likely to encounter in physics and chemistry.

Experiment G: Introduction to Graphical Representation of Data & the Use of Excel

MATHEMATICAL FUNCTIONS AND GRAPHS

Describing Data Visually. Describing Data Visually. Describing Data Visually 9/28/12. Applied Statistics in Business & Economics, 4 th edition

EXPERIMENTAL ERROR AND DATA ANALYSIS

Constructing Line Graphs*

Graphing with Excel. Data Table

Science Binder and Science Notebook. Discussions

Statistics. Graphing Statistics & Data. What is Data?. Data is organized information. It can be numbers, words, measurements,

AP* Environmental Science Grappling with Graphics & Data

Important Considerations For Graphical Representations Of Data

Name: Date: Class: Lesson 3: Graphing. a. Useful for. AMOUNT OF HEAT PRODUCED IN KJ. b. Difference between a line graph and a scatter plot:

General tips for all graphs Choosing the right kind of graph scatter graph bar graph

ESSENTIAL MATHEMATICS 1 WEEK 17 NOTES AND EXERCISES. Types of Graphs. Bar Graphs

A-level Physics. PHY6T/Q14 Final Marking Guidelines. 2450/2455 June 2014 PMT. Version/Stage: 1.0 Final Marking Guidelines

Chapter 3. Graphical Methods for Describing Data. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

10 GRAPHING LINEAR EQUATIONS

Tables and Figures. Germination rates were significantly higher after 24 h in running water than in controls (Fig. 4).

Engineering Fundamentals and Problem Solving, 6e

This is Appendix A: Graphs in Economics, appendix 1 from the book Economics Principles (index.html) (v. 1.0).

Statistics, Probability and Noise

Physics 2310 Lab #5: Thin Lenses and Concave Mirrors Dr. Michael Pierce (Univ. of Wyoming)

Constructing Line Graphs Appendix B AP Biology Investigative Lab Essentials

Applied Linear Algebra in Geoscience Using MATLAB

CS 147: Computer Systems Performance Analysis

Understanding Apparent Increasing Random Jitter with Increasing PRBS Test Pattern Lengths

Excel Manual X Axis Label Below Chart 2010 >>>CLICK HERE<<<

Line Graphs. Name: The independent variable is plotted on the x-axis. This axis will be labeled Time (days), and

Experiment 3. Ohm s Law. Become familiar with the use of a digital voltmeter and a digital ammeter to measure DC voltage and current.

Experiment 2. Ohm s Law. Become familiar with the use of a digital voltmeter and a digital ammeter to measure DC voltage and current.

Univariate Descriptive Statistics

8.EE. Development from y = mx to y = mx + b DRAFT EduTron Corporation. Draft for NYSED NTI Use Only

Contents. An introduction to MATLAB for new and advanced users

Hyperbolas Graphs, Equations, and Key Characteristics of Hyperbolas Forms of Hyperbolas p. 583

file:///d:/mohammad 1/New Folder/Freeman/Microeconomics Paul Krug...

4.4 Slope and Graphs of Linear Equations. Copyright Cengage Learning. All rights reserved.

Rec. ITU-R F RECOMMENDATION ITU-R F *

Image Enhancement (from Chapter 13) (V6)

Using Voltage Dividers to Design a Photo-Sensitive LED Circuit. ( Doug Oliver & Jackie Kane. May be reproduced for non-profit classroom use.

AP Physics Problems -- Waves and Light

Scientific Investigation Use and Interpret Graphs Promotion Benchmark 3 Lesson Review Student Copy

Chapter Displaying Graphical Data. Frequency Distribution Example. Graphical Methods for Describing Data. Vision Correction Frequency Relative

Addendum COLOR PALETTES

Chapter 10. Definition: Categorical Variables. Graphs, Good and Bad. Distribution

Section 1.5 Graphs and Describing Distributions

How to define Graph in HDSME

Lesson 6.1 Linear Equation Review

Infographics at CDC for a nonscientific audience

Computer Tools for Data Acquisition

Algebra. Teacher s Guide

WELCOME TO LIFE SCIENCES

A Visual Display. A graph is a visual display of information or data. This is a graph that shows a girl walking her dog. Communicating with Graphs

Graphing Guidelines. Controlled variables refers to all the things that remain the same during the entire experiment.

Patterns and Graphing Year 10

Chapter 17 Waves in Two and Three Dimensions

Purpose. Charts and graphs. create a visual representation of the data. make the spreadsheet information easier to understand.

USE OF BASIC ELECTRONIC MEASURING INSTRUMENTS Part II, & ANALYSIS OF MEASUREMENT ERROR 1

Laboratory 1: Uncertainty Analysis

Nonuniform multi level crossing for signal reconstruction

Tennessee Senior Bridge Mathematics

Oscilloscope Measurements

The 34th International Physics Olympiad

Drawing Bode Plots (The Last Bode Plot You Will Ever Make) Charles Nippert

Performance Characteristics

10.2 Images Formed by Lenses SUMMARY. Refraction in Lenses. Section 10.1 Questions

Business Statistics:

2.3 Quick Graphs of Linear Equations

6. Multivariate EDA. ACE 492 SA - Spatial Analysis Fall 2003

Comparison of FRD (Focal Ratio Degradation) for Optical Fibres with Different Core Sizes By Neil Barrie

ECE 3155 Experiment I AC Circuits and Bode Plots Rev. lpt jan 2013

PASS Sample Size Software. These options specify the characteristics of the lines, labels, and tick marks along the X and Y axes.

Name Date: Course number: MAKE SURE TA & TI STAMPS EVERY PAGE BEFORE YOU START EXPERIMENT 10. Electronic Circuits

MP211 Principles of Audio Technology

STAB22 section 2.4. Figure 2: Data set 2. Figure 1: Data set 1

Experiment 2: Transients and Oscillations in RLC Circuits

A To draw a line graph showing the connection between the time and cost

Plotting Points & The Cartesian Plane. Scatter Plots WS 4.2. Line of Best Fit WS 4.3. Curve of Best Fit WS 4.4. Graphing Linear Relations WS 4.

GRAPHS & CHARTS. Prof. Rahul C. Basole CS/MGT 8803-DV > January 23, 2017 INFOVIS 8803DV > SPRING 17

Magnitude Scaling. Observations: 1. (Ordinal) Homer Simpson is more humorous than any other character from The Simpsons

Section 3 Correlation and Regression - Worksheet

2.3 BUILDING THE PERFECT SQUARE

Data Presentation. Esra Akdeniz. February 12th, 2016

Chpt 2. Frequency Distributions and Graphs. 2-3 Histograms, Frequency Polygons, Ogives / 35

Elementary Plotting Techniques

Honors Chemistry Summer Assignment

Scientific Measurement

Section 3. Imaging With A Thin Lens

Transcription:

Graphic Examples This document provides examples of a number of graphs that might be used in understanding or presenting data. Comments with each example are intended to help you understand why the data were plotted in a certain fashion, or why it should have been done differently. TO PLOT OR NOT TO PLOT? The purpose of plotting scientific data is to visualize variation or show relationships between variables, but not all data sets require a plot. If there are only one or two points, it is easy to examine the numbers directly, and little or nothing is gained by putting them on a graph. Similarly, if there is no variation in the data, it is easy enough to see or state the fact without using a graph of any sort. When a graph is appropriate, it must be of an appropriate type to avoid misleading the reader. Fig. 1. Research expenditures for various scientific fields. Both plots in figure 1 show US research expenditures by discipline in 2000. The scatter plot on the left is incorrect because it implies a relationship between the variables on the two axes, further reinforced by the connecting lines. Since the horizontal axis is just a list of disciplines with no inherent ordering, no relationship can exist. Categorical data of this sort are better plotted as a bar graph, as on the right, since such a graph displays the relative magnitudes without implying a functional relationship. (Pie charts are often seen in the popular press for financial data, in order to emphasize the relative size of the allocations. Pie charts are rarely used in technical fields.) A SET OF COMMON MISTAKES It has been argued that smoking causes lung cancer. One way to test this hypothesis is to look for a relation between tobacco smoking and lung cancer. The figure below plots Graphic Examples 1

data for cigarette consumption in 1930 and male death rate from lung cancer in 1950 for several countries. Fig. 2. Deaths due to cigarette consumption. There are a lot of mistakes in figure 2, including Missing units is it total consumption and deaths, or normalized for population? Reversed axes we suspect that smoking leads to cancer, not the converse. The independent or causal variable goes on the x-axis. The jagged line connecting the points has no basis. The scatter of the data suggests large random effects, not real changes from point to point. A caption that is not particularly helpful. Redrawing the graph produces the following result Fig. 3. Death rate from lung cancer vs cigarette consumption for several countries. The solid line is a linear fit to the data. Graphic Examples 2

The straight line represents a simple model, which may be the best that these badlyscattered data can support. The extrapolation to zero consumption may or may not be valid, and would have to be tested with other data. MISLEADING SCALES An experiment is conducted to determine how much a solute contributes to the volume of the resulting solution. The procedure is to add weighed amounts of a salt, KCl, to 100 ml samples of water. After allowing the system to come to equilibrium the solution is filtered to remove any residual solid and the volume of solution is measured. Fig. 4. Solution volume as a function of KCl mass. The data are plotted in the figure above, in a manner which is worse than useless. Note the following problems: Axes are not labeled with the quantity measured, nor are units identified. The axes are very unequal in length, for no visible reason. The vertical scale has too wide a range to display the range of the data. The horizontal scale is also too long, extending well beyond the data range. Grid lines add clutter but not information. A fitted straight line is shown, but the scales make it hard to tell if it is accurate. The fit extends far beyond the data, without justification. Fig. 5. Solution volume as a function of KCl mass. Graphic Examples 3

Fixing these errors produces the plot above. It is now clear that the solution volume increases with added solute mass, but only until the solution becomes saturated, so a linear fit to the whole data set is just nonsense. Below saturation, the scales now allow the reader to evaluate the data accurately, for example to see if the volume increase is linear below saturation or if more data are required to decide. COMPUTER FITS Experimenters often use computer-generated best-fit lines to demonstrate agreement with some model or theory. For example, a student has data from a radiation experiment which consisted of observing the number of gamma rays emitted in a fixed time interval. Counts were obtained for many time intervals and the results plotted as a histogram. The next step is to see if the distribution of counts follows the expected Gaussian distribution. Using the defaults in a poor fitting program might produce this result. Fig. 6. Comparison of data and theory for counting experiment. One line consists of data, the other theory, but it is hard to follow either one (impossible with a monochrome version). Connecting the data dots is also incorrect because it implies that there are more data than actually present. A better presentation would look like figure 7. The actual data points are now clearly distinguishable symbols, showing that the raw data for four count values has been binned together and that there is some scatter around the theory curve. The theory itself is displayed as a smooth solid line because the values can be calculated everywhere and there are no uncertainties in the calculated numbers. Any program to be used for scientific graphing must be able to produce a similar plot. Graphic Examples 4

Fig. 7. Histogram of interval counting data. The solid line is the expected Gaussian distribution, squares are observations. GUIDING EXPERIMENTATION Data plots are often a useful guide to experimentation. A plot will quickly show if parameters are varying as expected, and may indicate regimes where more or less data are needed. Fig. 8. Density of liquid mercury as a function of temperature. The solid line is a linear fit to the observations. The plot in figure 8 was obtained by measuring the density of liquid mercury as a function of temperature. Over the range shown, the density decreases linearly with Graphic Examples 5

temperature, to a very good approximation, and one could define a volume expansion coefficient from the slope of the line. Fig. 9. Density of liquid water as a function of temperature. The next liquid measured was water, which is clearly a much more complex substance. The total variation from 0 to 100 C is only about 5%, showing the need for good precision of measurement, and is certainly not linear. In fact, it might be useful to get more data in the region around 0 C, to find out if the density approaches zero uniformly or has a maximum in the region. Fig. 10. Density of liquid water as a function of temperature, expanded scale. The results of the additional measurements are plotted above, clearly showing a peak in the density as a function of temperature. Note that these data are plotted on even more Graphic Examples 6

expanded scales, with a vertical range of only 0.4%. Since this is needed to show the small maximum, both plots would probably be included in a report of this experiment. TRANSFORMATION OF VARIABLES It is sometimes helpful to mathematically transform one or both of the variables before plotting. The technique can be used to linearize data to simplify model fitting, or to change the way data are distributed to clarify display. The exact procedure will depend on the situation, but two examples will show the process. Making a relationship linear A beam of light is bent when it is incident on a plane surface between different substances. The angle of refraction is related to the angle of incidence by Snell s law, n 1 sinθ i = n 2 sinθ r A student measures the incident and refracted angles for an air to glass interface, and wants to find the index of refraction for the glass, n 2, knowing that the index for air n 1 = 1.000. Believing that a graph would be a good way to analyze the data, the student solves for the refracted angle in terms of the incident angle θ r = arcsin 1 sinθ i n 2 A computer program might fit this function, but the available program won t do the arcsin function, so the student tries to be more clever. Noting that sinθ r = 1 n 2 sinθ i she plots sinθ r vs sinθ i, with the following result Graphic Examples 7

Fig. 11. Linearized refraction data. The solid line is a fit, assuming Snell s law. It is now easy to see that the data are well described by the expected straight line and to obtain the slope, which is 1/n 2. Changing the distribution Next, consider the graph below, which plots the number of state employees vs the total population for the 50 US states in the year 2000. Evidently, there are a lot more small states than large ones, so the data are bunched near the origin. The straight line is drawn on the assumption that the size of the bureaucracy is simply proportional to the number of citizens. Unfortunately, this assumption does not appear to be valid, since there seem to be systematic deviations at the low end, where the data are hard to distinguish, and the intercept is not zero. Another approach is needed. Graphic Examples 8

Fig. 12. Number of state government employees vs total state population in 2000. The next plot uses the same data, but they are displayed as the logarithm of both employee and population numbers. The effect of taking a log is to spread out the small values and compress the larger ones, causing the data to be more uniformly distributed on the axes. This often aids visualization of deviations or other problems. Fig. 13. Number of state government employees vs total state population in 2000. The line represents a power-law fit to the data. Log-log plots are also useful for demonstrating power-law or scaling relations. A power law, y = ax b in which the exponent b is not necessarily one, is a generalization of the familiar proportionality. Taking the logarithm of both sides, we get log y = blog x + loga so a power-law relationship is a straight line on a logy vs logx plot, with slope of b. Referring back to the bureaucracy example, the slope of the line shown is 0.79, indicating that the number of state employees increases somewhat less rapidly than the population. An economist would note that this is an example of economy of scale. Graphic Examples 9