BENFORD S LAW AND NATURALLY OCCURRING PRICES IN CERTAIN ebay AUCTIONS*

Similar documents
On the Peculiar Distribution of the U.S. Stock Indeces Digits

Exact Asymptotic Goodness-of-Fit Testing For Discrete Circular Data, With Applications

ABSTRACT. The probability that a number in many naturally occurring tables

An Empirical Non-Parametric Likelihood Family of. Data-Based Benford-Like Distributions

The A pplicability Applicability o f of B enford's Benford's Law Fraud detection i n in the the social sciences Johannes Bauer

Benford s Law: Tables of Logarithms, Tax Cheats, and The Leading Digit Phenomenon

DETECTING FRAUD USING MODIFIED BENFORD ANALYSIS

Benford s Law, data mining, and financial fraud: a case study in New York State Medicaid data

Not the First Digit! Using Benford s Law to Detect Fraudulent Scientific Data* Andreas Diekmann Swiss Federal Institute of Technology Zurich

The Political Economy of Numbers: John V. C. Nye - Washington University. Charles C. Moul - Washington University

Fundamental Flaws in Feller s. Classical Derivation of Benford s Law

Exact asymptotic goodness-of-fit testing for discrete circular data with applications

Benford s Law. David Groce Lyncean Group March 23, 2005

Modelling Conformity of Nigeria s Recent Population Censuses With Benford s Distribution

arxiv: v2 [math.pr] 20 Dec 2013

Activity Sheet #1 Presentation #617, Annin/Aguayo,

Do Populations Conform to the Law of Anomalous Numbers?

The First Digit Phenomenon

BENFORD S LAW, FAMILIES OF DISTRIBUTIONS AND A TEST BASIS. This Draft: October 9, 2010 First Draft: August 6, 2006

BENFORD S LAW IN THE CASE OF HUNGARIAN WHOLE-SALE TRADE SECTOR

log

Connectivity in Social Networks

Detecting Evidence of Non-Compliance In Self-Reported Pollution Emissions Data: An Application of Benford's Law

Research Article n-digit Benford Converges to Benford

A STUDY OF BENFORD S LAW, WITH APPLICATIONS TO THE ANALYSIS OF CORPORATE FINANCIAL STATEMENTS

Faculty Forum You Cannot Conceive The Many Without The One -Plato-

Analysis of Top 500 Central and East European Companies Net Income Using Benford's Law

Characterization of noise in airborne transient electromagnetic data using Benford s law

Fraud Detection using Benford s Law

Some Fine Combinatorics

USING BENFORD S LAW IN THE ANALYSIS OF SOCIO-ECONOMIC DATA

Benford Distribution in Science. Fabio Gambarara & Oliver Nagy

Modulation Classification based on Modified Kolmogorov-Smirnov Test

Laboratory 1: Uncertainty Analysis

PERUVIAN ECONOMIC ASSOCIATION. Tracking the Exchange Rate Management in Latin. America

Benford s Law A Powerful Audit Tool

Intuitive Considerations Clarifying the Origin and Applicability of the Benford Law. Abstract

CONTRIBUTIONS TO THE TESTING OF BENFORD S LAW

Newcomb, Benford, Pareto, Heaps, and Zipf Are arbitrary numbers random?

Visualizing Patterns: Fibonacci Numbers and 1,000 Pointed Stars. Abstract. Agenda. MAA sponsored since 1950

Newcomb, Benford, Pareto, Heaps, and Zipf Are arbitrary numbers random?

Chapter 1. Probability

Detecting fraud in financial data sets

IBM Research Report. Audits and Business Controls Related to Receipt Rules: Benford's Law and Beyond

Naked-Eye Quantum Mechanics: Practical Applications of Benford's Law for Integer Quantities

Non-overlapping permutation patterns

The Benford paradox. Johan Fellman 1. Abstract

NON-OVERLAPPING PERMUTATION PATTERNS. To Doron Zeilberger, for his Sixtieth Birthday

What is counting? (how many ways of doing things) how many possible ways to choose 4 people from 10?

Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 13

Benford's Law. Theory, the General Law of Relative Quantities, and Forensic Fraud Detection Applications. Alex Ely Kossovsky.

TECHNOLOGY YOU CAN USE AGAINST THOSE WHO USE TECHNOLOGY BENFORD S LAW: THE FUN, THE FACTS, AND THE FUTURE

Date: Tuesday, 1 February :00PM. Location: Museum of London

Statistical Hypothesis Testing

CCST9017 Hidden Order in Daily Life: A Mathematical Perspective. Lecture 8. Statistical Frauds and Benford s Law

LUCAS-SIERPIŃSKI AND LUCAS-RIESEL NUMBERS

DATA DIAGNOSTICS USING SECOND ORDER TESTS OF BENFORD S LAW

Enumeration of Two Particular Sets of Minimal Permutations

Greedy Flipping of Pancakes and Burnt Pancakes

Lossy Compression of Permutations

Benford s Law Applies to Online Social Networks

On uniquely k-determined permutations

Lecture 3 - Regression

Is everything stochastic?

Research Methodologies for Management Sciences & Interdisciplinary Research in Contemporary World

Social Studies 201 Notes for November 8, 2006 Sampling distributions Rest of semester For the remainder of the semester, we will be studying and

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Stock Market Indices Prediction Using Time Series Analysis

A Comparative Analysis of the Bootstrap versus Traditional Statistical Procedures Applied to Digital Analysis Based on Benford s Law

Combinatorics in the group of parity alternating permutations

SMT 2014 Advanced Topics Test Solutions February 15, 2014

Two Perspectives on Logic

SOLUTIONS TO PROBLEM SET 5. Section 9.1

SYMMETRIES OF FIBONACCI POINTS, MOD m

Testing Benford s Law with the First Two Significant Digits

Empirical Information on the Small Size Effect Bias Relative to the False Positive Rejection Error for Benford Test-Screening

18.204: CHIP FIRING GAMES

Submitted November 19, 1989 to 2nd Conference Economics and Artificial Intelligence, July 2-6, 1990, Paris

Game Theory and Randomized Algorithms

Benford s Law Applied to Hydrology Data Results and Relevance to Other Geophysical Data

Sec 5.1 The Basics of Counting

"È$ß#È"ß$È#ß%È% This same mapping could also be represented in the form

SAR ADC Algorithm with Redundancy Based on Fibonacci Sequence

On repdigits as product of consecutive Fibonacci numbers 1

Benford s Law and articles of scientific journals: comparison of JCR Ò and Scopus data

A CONJECTURE ON UNIT FRACTIONS INVOLVING PRIMES


Fast Sorting and Pattern-Avoiding Permutations

Probability Theory. POLI Mathematical and Statistical Foundations. Sebastian M. Saiegh

STRATEGY AND COMPLEXITY OF THE GAME OF SQUARES

University of British Columbia. Math 312, Midterm, 6th of June 2017

The tenure game. The tenure game. Winning strategies for the tenure game. Winning condition for the tenure game

APPLYING BENFORD S LAW BY TESTING THE GOVERNMENT MACROECONOMICS DATA. [Využití Benfordova zákona při testování makroekonomických dat vlády]

The Strong Finiteness of Double Mersenne Primes and the Infinity of Root Mersenne Primes and Near-square Primes of Mersenne Primes

Sequential games. We may play the dating game as a sequential game. In this case, one player, say Connie, makes a choice before the other.

Dynamic Programming in Real Life: A Two-Person Dice Game

Primitive Roots. Chapter Orders and Primitive Roots

HEALTH CARE EXPENDITURE IN AFRICA AN APPLICATION OF SHRINKAGE METHODS

Please Turn Over Page 1 of 7

CHAPTER-4 FRUIT QUALITY GRADATION USING SHAPE, SIZE AND DEFECT ATTRIBUTES

Transcription:

Econometrics Working Paper EWP0505 ISSN 1485-6441 Department of Economics BENFORD S LAW AND NATURALLY OCCURRING PRICES IN CERTAIN ebay AUCTIONS* David E. Giles Department of Economics, University of Victoria Victoria, B.C., Canada V8W 2Y2 May, 2005 Abstract We show that certain the winning bids for certain ebay auctions obey Benford s Law. One implication of this is that it is unlikely that these bids are subject to collusion among bidders, or shilling on the part of sellers. Parenthetically, we also show that numbers from the naturally occurring Fibonacci and Lucas sequences also obey Benford s Law. Keywords: JEL Classifications: Benford s Law; auction prices; football; Kuiper test; Watson test C12; C14; C16; D44 Author Contact: Table 2: Uniformity Tests for Growth Cyc David Giles, Dept. of Economics, University of Victoria, P.O. Box 1700, STN CSC, Victoria, B.C., Canada V8W 2Y2; e-mail: dgiles@uvic.ca; FAX: (250) 721-6214 1

1. Introduction Consider the daily return on a stock on the NYSE. What is the probability that the first significant digit in the return is a 1, or a 9, for example? Contrary to what one might think, the answer is not (1/9) in either case. Newcomb (1881) appears to have been the first to notice that many naturally occurring decimal numbers have a first significant digit that is 1 with probability approximately 31%, while the probability of this digit being 9 is less than 5%. His examination of this phenomenon was motivated by observing that the first few pages of books of logarithm tables were generally dirtier (as a result of being thumbed more often) than were later pages. His results were not widely appreciated, and were unwittingly re-discovered by Benford (1938). Benford provided numerous examples of data that obey a particular non-uniform distribution with respect to first significant digits, and the phenomenon became known as Benford s Law. His examples included the areas of lakes, the lengths of rivers, and the molecular weights of compounds. Subsequently, a number of studies (e.g., Pinkham, 1961; Raimi, 1969, 1976; Ley, 1996; and de Ceuster et al., 1998) have confirmed that Benford s Law holds for a very large range of phenomena, including certain economic data. The purpose of this note is to examine whether Benford s Law holds in the context of certain ebay auction prices. The importance of the results that we obtain is that they can be used as a test of collusion in that market place In the next section we discuss Benford s Law and illustrate its application to two important naturally occurring data-sets. The main results of the paper are given in section 3, where we consider an analysis of ebay auction prices; and section 4 concludes. 2. Benford s Law and Natural Phenomena Let d denote the first significant digit in a random decimal number, N. So, for example, if N = 194.23 then d = 1; and if N = 0.00476 then d = 4. Benford s Law dictates that Pr.[ d = k] = log10[1 + (1/ k)] ; k = 1,.., 9. This result can be established via combinatorial arguments (e.g., Cohen, 1976), and straightforward statistical derivations of this result are also available (e.g., Hill, 1995). Further motivation is provided by De Ceuster et al. (1998), for example. Studies that have drawn attention to the relevance of Benford s Law to economic data include those of Ley and Varian (1994); Ley (1996), who investigated one-day returns on the Dow-Jones 2

and Standard and Poor indices for U.S. stocks; and Pietronero et al. (2001), who considered prices on the Madrid, Vienna and Zurich stock exchanges. One application of Benford s Law that is relevant to the present paper arises in the context of auditing tax data. Fraudulent activity tends to result in figures that are too uniform, relative to Benford s Law, as was discussed by Nigrini (1996). This application of Benford s Law is surveyed by Geyer and Williamson (2004). To lay the basis for our subsequent analysis, let us consider two specific examples of Benford s Law. As this law apparently occurs widely in the context of natural phenomena, one might speculate that it will be satisfied by the well-known Fibonacci series, {1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, }, where the terms satisfy the recurrence relationship: F(n+1) = F(n) + F(n-1), and where the irrational number φ = Limit n [ F( n + 1) / F( n)] 1.61803... is the co-called golden section that occurs widely in nature. Figure 1 compares the empirical frequency distribution of first digits for the first 1,476 Fibonacci numbers with the corresponding distribution predicted by Benford s Law. 1 Recently, Canessa (2003) has proposed a general statistical thermodynamic theory that explains, inter alia, why Fibonacci sequences should obey Benford s Law. 2 The visual similarity between the two frequency distributions in Figure 1 is apparent, and a formal statistical test of their equality (and hence of Canessa s theory) is discussed in the next section. The series of Lucas (1878) numbers is {1, 3, 4, 7, 11, 18, 29, 47, 76, 123,. }, where the terms follow the same recurrence relationship as for the Fibonacci sequence. Interestingly, Limit n [ L( n + 1) / L( n)] = φ, and in fact this is true for any sequence of integers of the form {1, x, (1+x), [(1+x) +x],.}. There are numerous relationships between Fibonacci and Lucas numbers. While the Lucas numbers are less well known than those of Fibonacci s sequence, they occur extensively in nature, such as in lunar eclipse cycles, and in the properties of electrical circuits, for example. 4 Figure 1 also compares the empirical frequency distribution of first significant digits for the first 1,475 Lucas numbers with the distribution predicted by Benford s Law. 5 The extent to which the Lucas sequence accords with this law is even more striking than for the Fibonacci sequence, but this result does not appear to have been observed previously in the literature. 3

3. Analysis of Some ebay Auction Prices The question we wish to address is whether or not the closing prices of successful auctions on the ebay internet auction site are naturally occurring numbers. If they are not, then this may be the result of collusion among bidders and/or interference ( shilling ) on the part of the seller. In our investigation we have considered all of the successful auctions for tickets for professional football games in the event tickets category on ebay, for the period 25 November to 3 December, 2004. 6 Auctions that ended with the Buy-it-Now option, and all Dutch auctions, were excluded from the data. This resulted in 1,161 successful auction completions. By not considering college football games we avoided the distortions that might arise in those cases where there is a two-tier ticketing system. By this we mean the situation where there are student only tickets and regular tickets. Information was recorded for each winning bid. All prices are in U.S. dollars as this is the official ebay currency. 7 Figure 2 compares the empirical distributions of the first significant digits of prices per lot and prices per ticket with the distribution predicted by Benford s Law. Although these results are less striking than those in Figure 1, they still suggest that the auction prices accord with Benford s Law. Of course, a formal test of the equivalence of the distributions is required, and we now turn to this issue. Typically, chi-square goodness-of-fit tests have been used in association with Benford s Law. Various other nonparametric tests, such as integrated deviations tests of the Cramér-von Mises type, or supremum tests of the Kolmogorov-Smirnov type, could be considered here. These tests involve comparing the empirical distribution function of the data with the hypothesized distribution, and they rely on the Glivenko-Cantelli Theorem for their justification. 8 However, none of these tests are actually appropriate here. It is important to note that our data have a special characteristic that must be taken into account when considering this testing problem they follow a circular distribution. This is clear if we consider a pair of prices such as $99.99 and $100, say, whose first significant digits are 9 and 1 respectively. The values 9 and 1 are as far apart as is possible within the set of first significant digits, yet the associated prices are as close as is possible in an ebay auction (where bids are in dollars and cents). Recently, Giles (2005) has recognized the relevance of distributional circularity in the dating of business cycles. The phenomenon of circularity is common with many types of statistical data, and there is an extensive literature dealing with testing for the underlying distribution in such cases (e.g., Fisher, 1993). One well-known example that meets our needs here is Kuiper s (1959) V N test, which is a modified Kolmogorov-Smirnov test. The crucial point is that such tests are invariant to the 4

location of the origin and the scale on the circle, whereas conventional such tests are not. 9 Suppose that the empirical distribution function for a sample of size N is F N (x), and that the null population distribution function is F 0 (x). Then, Kuiper s basic test statistic is given by: V = D + D + N N N, where the two components are the one-sided Kolmogorov-Smirnov statistics: + D = sup. [ F ( x) F ( x)] N < x< N D = sup. [ F ( x) F ( x)]. N < x< 0 0 N Another important property of Kuiper s test is that the null distribution of the test statistic is invariant to the hypothesized distribution, regardless of the sample size. Critical values for the * null distribution of [ 1/ 2 2 V = V N + 0.155 + 0.24N 1/ N N ] are tabulated for by Stephens (1970). 10 These critical values are provided together with our test results in Table 1. As can be seen there, the inferences that were drawn visually from Figures 1 and 2 are very strongly supported by the formal tests we cannot reject the hypothesis that the data for the Fibonacci and Lucas sequences, and for the ebay auction prices obey Benford s Law. 4. Conclusions In this paper we have tested Benford s Law against a small set of winning prices for ebay auctions for pro-football tickets, and shown that this law cannot be rejected. Had there been collusion among bidders, or shilling on the part of sellers in these auctions, it is unlikely that this positive result would have been achieved. This demonstrates the usefulness of Benford s Law in a different auditing context than has been considered previously, and suggests that further research along these lines would be valuable. It also sets the scene for work in progress that investigates psychological barriers in ebay auctions. Peripherally, we have also confirmed Canessa s (2003) result that numbers from the Fibonacci sequence obey Benford s Law, and we have shown that this also holds for numbers from the important and closely related Lucas sequence. 5

0.35 Figure 1: Relative Frequency Distributions For Fibonacci Digits, Lucas Digits, and Benford's Law 0.3 0.25 Relative Frequency 0.2 0.15 Fibonacci Benford Lucas 0.1 0.05 0 1 2 3 4 5 6 7 8 9 First Significant Digit 0.35 Figure 2: Relative Frequency Distributions for ebay Auction Prices for Pro-Football Tickets 0.30 0.25 Relative Frequency 0.20 0.15 ebay Price Benford's Law 0.10 0.05 0.00 1 2 3 4 5 6 7 8 9 First Significant Digit 6

Table 1 Kuiper Test Results* Fibonacci Lucas ebay Numbers Numbers Prices N 1,476 1,475 1,161 V N 0.0217 0.0007 0.0465 V* N 0.8370 0.0270 1.5919 * 10%, 5% and 1% critical values for V* N are 1.620, 1.747 and 2.001, regardless of the sample size. 7

References Benford, F., 1938. The law of anomalous numbers. Proceedings of the American Philosophical Society 78, 551-572. Canessa, E., 2003. Theory of analogous force in number sets. Physica A 328, 44-52. Cohen, D., 1976. An explanation of the first digit phenomenon. Journal of Combinatorial Theory 20, 367-370. De Ceuster, M. K. J., G. Dhaene and T. Schatteman, 1998. On the hypothesis of psychological barriers in stock markets and Benford s Law. Journal of Empirical Finance 5, 263-279. Durbin, J., 1973. Distribution Theory for Tests Based on the Sample Distribution Function. SIAM, Philadelphia PA. Fisher, N.I., 1993. Statistical Analysis of Circular Data. Cambridge University Press, Cambridge. Geyer, C. L. and P. P. Williamson, 2004. Detecting fraud in data sets using Benford s Law. Communications in Statistics B 33, 229-246. Giles, D. E., 2005. Testing for a Santa Claus effect in business cycles. Economics Letters, in press. Gottwald, G. A. and M. Nicol, 2002. On the nature of Benford s Law. Physica A 303, 387-396. Hill, T. P., 1995. A statistical derivation of the first digit phenomenon. Statistical Science 10, 354-363, Kuiper, N. H., 1959. Alternative proof of a theorem of Birnhaum and Pyke. Annals of Mathematical Statistics 30, 251-252. Ley, E., 1996. On the peculiar distribution of the U.S. stock indexes digits. American Statistician 50, 311-313. Ley, E. and H. Varian, 1994. Are there psychological barriers in the Dow-Jones index? Applied Financial Economics 4, 217-224. Lucas, E., 1878. The theory of simply periodic numerical functions. American Journal of Mathematics 1, 184-240, 289-321. McMinn, D., 2003. Fibonacci Lucas numbers and moon-sun cycles. http://www.davidmcminn.com/pages/fiblucas.htm Newcomb, S., 1881. Note on the frequency of use of the different digits in natural numbers. American Journal of Mathematics 4, 39-40. Nigrini, M., 1996. A taxpayer compliance application of Benford s Law. Journal of the American Taxation Association 18, 72-91. Pietronero, L., E. Tosatti, V. Tosatti, and A. Vespignani, 2001. Explaining the uneven distribution of numbers in nature: the laws of Benford and Zipf. Physica A 293, 297-304. 8

Pinkham, R. S., 1961. On the distribution of first significant digits. Annals of Mathematical Statistics 32, 1223-1230. Raimi, R. A., 1969. The peculiar distribution of first digits. Scientific American, 109-119. Raimi, R. A., 1976.The first digit problem. American Mathematical Monthly 83, 521-538. Scott, L. H., undated. Electrical equivalence of discrete cascaded T sections, electrical transmission lines, and Fibonacci and Lucas numbers. http://www.lhsorg.org/ Stephens, M. A., 1965. The goodness-of-fit statistic, V N : Distribution and significance points. Biometrika 52, 309-321. 9

Footnotes * I am very grateful to Sophie Hesling for her excellent research assistance. 1. This number of observations is dictated by machine precision. The last number in the sequence is F(1,476) = 0.1306989E+309. 2. Pietronero et al. (2001) provide an explanation for Benford s Law in terms of multiplicative dynamical processes, and derive a generalization in terms of a power law. Gottwald and Nicol (2002) prove that systems satisfying Benford s Law do not have to be random or chaotic they may be deterministic, and are more related to the (near) multiplicative nature of the process than to its chaoticity. 3. For example, for n > 1, every Lucas number is the sum of two Fibonacci numbers, and for all n, L(n) = [F(n+2) F(n-2)] and [L(n-1)L(n+1) + F(n-1)F(n+1)] = 6[F(n)] 2. n Moreover, Limitn [ L( n)] = φ, and this relationship is approximated well even when n is quite small. For example, for n = 8, 9, 10 we have L = 47, 76, 123; and φ n = 46.971, 75.999 and 123.046. 4. See McMinn (2003), and Scott (undated). Also, in a sequence of n equal resistors, connected alternately in series and in parallel, then Limit n [Total resistance] = φ. If the wiring is alternately in parallel and in sequence, Limit [Total resistance] = 1 / φ. 5. The last number in the sequence is L(1,475) = 0.111630E+309. 6. More specifically, the time-span is from 00:00:01 PST on 25 November, to 24:00:00 PST on 3 December, 2004. 7. Strictly, it is possible for other currencies to be used, but there were no such examples in our population. In any case, Benford s Law is scale and base invariant, so the choice of (a common) currency does not affect our results. 8. For a detailed technical discussion, see Durbin (1973), for example. The Glivenko- Cantelli Theorem implies that, under quite mild conditions, the empirical distribution function converges uniformly and with probability one to the population distribution function. 9. Note that if the data are circular, as is the case with the digits 1 through 9, the distribution of the test statistic should not depend on whether we begin counting at 1 and end at 9 (say), or alternatively begin counting at 6 and end at 5 (say). 10. See Stephens (197, p.118). The purpose of transforming from V N to V* N is to create a statistic whose null distribution is independent of the sample size. This is especially helpful for large N. n 10