The Originative Statistical Regression Models: Are They Too Old and Untenable? To Fit or Not to Fit Data to a Model: That is the Question.
|
|
- Evelyn Parker
- 5 years ago
- Views:
Transcription
1 1
2 Objectives 1.To poll the titled and untitled questions. 2.To offer my answer with illustrative examples (2) and recent projects (2). The Originative Statistical Regression Models: Are They Too Old and Untenable? To Fit or Not to Fit Data to a Model: That is the Question.
3 Are They Too Old and Untenable? For big data apps, yes. And small data apps, as well. Statistical ordinary least squares (OLS) regression, and logistic regression (LR) models are popular techniques for prediction (of a continuous dependent variable) or classification (of a categorical dependent variable). They are the workhorses of statistical modeling as their results are considered the gold standard. Moreover, they are used as the benchmark for assessing the superiority of newer techniques. OLS regression method was first published by Legendre on March 6, The LR model was developed by Berkson in Something old is not necessarily useless today, and something new is not necessarily better than something old is. Lest one forgets, "the wheel and the printed word." The statistical regression paradigm, which dictates "fitting the data to a pre-specified model," is old as it was developed, and tested within the small data setting of the day. It would seem that a newer machine-learning method, say, GenIQ, which "lets the data define the model," is especially effective for today's big data, and should nicely outdo the originative statistical regression model.
4 To Fit Data to a Model? For experimental-design and clinical data, yes. For post-hoc, secondary and big data, I do not think so. The linear model for one and for all, no.
5 In this illustration, GenIQs optimizing of the deciles is equivalent to predicting the best possible ranking of the target variable based on the GenIQ score GenIQvar. Bruce Ratner, Ph.D. 5
6 OBJECTIVE #1: To build a binary response model with data from Table 1. I built a logistic regression model (LRM) for predicting the target variable Response using two predictor variables, XX1 and XX2. The LRM equation is: Logit of Response (=Yes) = * XX * XX2 Table 1 ID XX1 XX2 Response Yes No Yes No No No Yes Yes Yes No Bruce Ratner, Ph.D. 6
7 Bruce Ratner, Ph.D. 7
8
9 Bruce Ratner, Ph.D. 9
10
11 OBJECTIVE: To build a continuous profit model with data from Table 1. I built an ordinary least squares regression model (OLS) for predicting the target variable Profit using two predictor variables, XX1 and XX2. The OLS equation is: Profit_est = * XX * XX2 Table 1 ID XX1 XX2 Profit Bruce Ratner, Ph.D. 11
12 OLS RESULTS: The Profit ranking is not perfect. The ranking of Profit based on the OLS Profit score, Profit_est, is in Table 2, below. Clearly, the Profit ordering is not perfect. Table 2 ID XX1 XX2 Profit Profit_est Bruce Ratner, Ph.D. 12
13
14
15 I built a GenIQ Model to predict Profit using XX1 and XX2 with data in Table 1. The GenIQ Model tree display and model equation (code) are below. Note: There are two GenIQ variables, GenIQvar_1, GenIQvar_2 (defined later). x1 = GenIQvar_1; x1 = Cos(x1); x2 = GenIQvar_2; x1 = x1 * x2; x2 = XX2; x2 = Cos(x2); x2 = Cos(x2); x3 = GenIQvar_2; x4 = GenIQvar_2; x3 = x3 * x4; x2 = x3 - x2; x1 = x1 * x2; GenIQvar = x1; Bruce Ratner, Ph.D. 15
16
17 RECENT PROJECT #1 Objective Dec 06 Sep 08 Aug 10 Dec 11 To build a LTV5 Model based on sales transaction data from Dec., 2006 Dec., 2011 The window of Sep 08 Aug 10 is for defining the ATTRITION and LVT5 variables ATTRITION = 1 if there are no sales in the monthly window = 0 if there is at least one sale in the mos wndw LTV5 = LVT2*2.5 where LTV2 = total sales in the mos wndw In addition to monthly sales, which reflect few big-ticket items and many $0s, the three basic variables are also put in play for machine-learning data mining (Predictors are defined in Dec 06 Aug 08): 1. RECENCY = last month in which a sale occurrence was observed for a rep. 2. TRX = total number of monthly sales transactions observed 3. BEGYR = first year in which a sales occurrence was observed
18 Identification of the Relationships that Define Attrition(=1-Retention) and LTV5 Machine-learning data mining identifies surprising relationships ATTRITION_var = RECENCY TRX*BEGYR LTV5_var = TRX*BEGYR RECENCY
19 x1 = TRX; x2 = begyr; x1= x1 * x2; x2 = Recency; If x1 NE 0 Then x1 = x2 / x1; Else x1 = 1; GenIQvar = x1; GenIQ_ATTRITION_probability = 1 / (1 + Exp(-( GenIQvar * )));
20 x1 = TRX; x2 = TRX; x3 = begyr; x2 = x2 * x3; If x1 NE 0 Then x1 = x2 / x1; Else x1 = 1; x2 = Recency; x3 = TRX; If x2 NE 0 Then x2 = x3 / x2; Else x2 = 1; x1 = x1 * x2; GenIQvar = x1; GenIQ_Profit_estimate = * GenIQvar;
21 Identification of the Relationships that Define Attrition(=1-Retention) and LTV5 Estimating the vars, we have Logit_ATTRITION = (2.1622*ATTRITION_var) PROB_ATTRIT_est = exp(logit_attrition)/(1+ exp(logit_attrition) ) LTV5_var = (TRX*BEGYR)/RECENCY LTV5_estimate = ( *LTV5_var)
22
23
24
25
26 RECENT PROJECT #2 Objective To build a classification model based on survey data, where the dependent variable focuses on three segments of the original six segments (1, 2,, 6) _3, _4 and _1256
27
28
29 Defining the Classification Model Prob_3_1256 = 1 / (1 + Exp(-( GenIQvar * ))); logit_3_1256 = GenIQvar * ; PROB_4_1256 = 1 / (1 + Exp(-( GenIQvar * E-02))); logit_4_1256= GenIQvar * E-02; Prob_3 = exp(logit_3_1256)/ (1 + exp(logit_3_1256) + exp(logit_4_1256) ); Prob_4 = exp(logit_4_1256)/ (1 + exp(logit_3_1256) + exp(logit_4_1256) ); Prob_1256 = 1 - ( Prob_3 + Prob_4);
30 Classification Model Accuracy
31 Marginal Model (set Prob_1256=0)
32
33
34 End Page 34
Chapter 8. Using the GLM
Chapter 8 Using the GLM This chapter presents the type of change products that can be derived from a GLM enhanced change detection procedure. One advantage to GLMs is that they model the probability of
More informationKnowledge discovery & data mining Classification & fraud detection
Knowledge discovery & data mining Classification & fraud detection Knowledge discovery & data mining Classification & fraud detection 5/24/00 Click here to start Table of Contents Author: Dino Pedreschi
More informationGraph-of-word and TW-IDF: New Approach to Ad Hoc IR (CIKM 2013) Learning to Rank: From Pairwise Approach to Listwise Approach (ICML 2007)
Graph-of-word and TW-IDF: New Approach to Ad Hoc IR (CIKM 2013) Learning to Rank: From Pairwise Approach to Listwise Approach (ICML 2007) Qin Huazheng 2014/10/15 Graph-of-word and TW-IDF: New Approach
More informationWhat Drives Innovation Choices in The Small Satellite Industry? The Role of Technological Resources and Managerial Experience
What Drives Innovation Choices in The Small Satellite Industry? The Role of Technological Resources and Managerial Experience Yue Song, Devi Gnyawali Virginia Polytechnic Institute and State University
More informationPatent Mining: Use of Data/Text Mining for Supporting Patent Retrieval and Analysis
Patent Mining: Use of Data/Text Mining for Supporting Patent Retrieval and Analysis by Chih-Ping Wei ( 魏志平 ), PhD Institute of Service Science and Institute of Technology Management National Tsing Hua
More informationContents. List of Figures List of Tables. Structure of the Book How to Use this Book Online Resources Acknowledgements
Contents List of Figures List of Tables Preface Notation Structure of the Book How to Use this Book Online Resources Acknowledgements Notational Conventions Notational Conventions for Probabilities xiii
More informationConcerted actions program. Appendix to full research report. Jeffrey Derevensky, Rina Gupta. Institution managing award: McGill University
Concerted actions program Appendix to full research report Jeffrey Derevensky, Rina Gupta Institution managing award: McGill University Gambling and video game playing among adolescents (French title:
More informationThe following definitions are derived from the online help of VentureSource. 1023
Appendix A Definitions from VentureSource The following definitions are derived from the online help of VentureSource. 1023 A.1 Venture Financing Round Types Seed Round: Seed rounds are initial rounds
More informationBuilding a Computer Mahjong Player Based on Monte Carlo Simulation and Opponent Models
Building a Computer Mahjong Player Based on Monte Carlo Simulation and Opponent Models Naoki Mizukami 1 and Yoshimasa Tsuruoka 1 1 The University of Tokyo 1 Introduction Imperfect information games are
More informationSSC Case Study Competition: Solving a Puzzle with Multiple Solutions
SSC Case Study Competition: Solving a Puzzle with Multiple Solutions Biostatistics Seminar Oct 7, 2014 Shahriar Shams, Changchang Xu SSC case study competition: FREE FOOD on top of knowledge transfer Venue
More informationSession 124TS, A Practical Guide to Machine Learning for Actuaries. Presenters: Dave M. Liner, FSA, MAAA, CERA
Session 124TS, A Practical Guide to Machine Learning for Actuaries Presenters: Dave M. Liner, FSA, MAAA, CERA SOA Antitrust Disclaimer SOA Presentation Disclaimer A practical guide to machine learning
More informationCRF and Structured Perceptron
CRF and Structured Perceptron CS 585, Fall 2015 -- Oct. 6 Introduction to Natural Language Processing http://people.cs.umass.edu/~brenocon/inlp2015/ Brendan O Connor Viterbi exercise solution CRF & Structured
More informationpage 2 / 5
MIXED EFFECTS MODELS FOR COMPLEX DATA CHAPMAN HALL CRC MONOGRAPHS ON STATISTICS APPLIED PROBABILITY APPLIED STATISTICS PROBABILITY 5E B W WITH WILEYPLUS SET page 1 / 5 page 2 / 5 mixed effects models for
More informationBuilding a more stable predictive logistic regression model. Anna Elizabeth Campain
Building a more stable predictive logistic regression model Anna Elizabeth Campain Common problems when working with clinical data Missing data Imbalanced class distribution Unstable logistic regression
More informationBeyond Reliability: Advanced Analytics for Predicting Quality
Beyond Reliability: Advanced Analytics for Predicting Quality William J. Goodrum, Jr., PhD Elder Research, Inc. william.goodrum@elderresearch.com Headquarters 300 W. Main Street, Suite 301 Charlottesville,
More informationWe Value Your Business
We Value Your Business 1 Freemium Business Model Description Fred Wilson, Venture Capitalist Give your service away for free, possibly ad supported but maybe not, acquire a lot of customers very efficiently
More informationProbability WS 1 Counting , , , a)625 b)1050c) a)20358,520 b) 1716 c) 55,770
Probability WS 1 Counting 1.28 2.13,800 3.5832 4.30 5.. 15 7.72 8.33, 5 11. 15,504 12. a)25 b)1050c)2275 13. a)20358,520 b) 171 c) 55,770 d) 12,271,512e) 1128 f) 17 14. 438 15. 2,000 1. 11,700 17. 220,
More informationPREDICTING ASSEMBLY QUALITY OF COMPLEX STRUCTURES USING DATA MINING Predicting with Decision Tree Algorithm
PREDICTING ASSEMBLY QUALITY OF COMPLEX STRUCTURES USING DATA MINING Predicting with Decision Tree Algorithm Ekaterina S. Ponomareva, Kesheng Wang, Terje K. Lien Department of Production and Quality Engieering,
More informationAn Introduction to Machine Learning for Social Scientists
An Introduction to Machine Learning for Social Scientists Tyler Ransom University of Oklahoma, Dept. of Economics November 10, 2017 Outline 1. Intro 2. Examples 3. Conclusion Tyler Ransom (OU Econ) An
More informationWireless Purchase Experience Customer Satisfaction Increases Across the Board, J.D. Power Finds
Wireless Purchase Experience Customer Satisfaction Increases Across the Board, Finds T-Mobile, Metro by T-Mobile and Consumer Cellular Rank Highest in Respective Segments COSTA MESA, Calif.: 14 Feb. 2019
More informationSeries and parallel resistances
Series and parallel resistances Objectives Calculate the equivalent resistance for resistors connected in both series and parallel combinations. Construct series and parallel circuits of lamps (resistors).
More informationDicing The Data from NAB/RAB Radio Show: Sept. 7, 2017 by Jeff Green, partner, Stone Door Media Lab
Dicing The Data from NAB/RAB Radio Show: Sept. 7, 2017 by Jeff Green, partner, Stone Door Media Lab SLIDE 2: Dicing the Data to Predict the Hits Each week you re at your desk considering new music. Maybe
More informationThe method requires foreground and background sequence datasets. The users can use fasta files as input.
1 Introduction he emergence of hip-seq technology for genome-wide profiling of transcription factor binding sites (FBS) has made it possible to categorize very precisely the FBS motifs. How to harness
More informationTextual Characteristics based High Quality Online Reviews Evaluation and Detection
2013 Submitted on: October 30, Textual Characteristics based High Quality Online Reviews Evaluation and Detection Hui Nie School of Information Management, Sun Yat-sen University, Guangzhou, China. E-mail
More informationPatent portfolio audits. Cost-effective IP management. Vashe Kanesarajah Manager, Europe & Asia Clarivate Analytics
Patent portfolio audits Cost-effective IP management Vashe Kanesarajah Manager, Europe & Asia Clarivate Analytics Clarivate Analytics Patent portfolio audits 3 Introduction The world today is in a state
More informationArtificial Bandwidth Extension Using Deep Neural Networks for Spectral Envelope Estimation
Platzhalter für Bild, Bild auf Titelfolie hinter das Logo einsetzen Artificial Bandwidth Extension Using Deep Neural Networks for Spectral Envelope Estimation Johannes Abel and Tim Fingscheidt Institute
More informationBAYESIAN STATISTICAL CONCEPTS
BAYESIAN STATISTICAL CONCEPTS A gentle introduction Alex Etz @alxetz ß Twitter (no e in alex) alexanderetz.com ß Blog November 5 th 2015 Why do we do statistics? Deal with uncertainty Will it rain today?
More informationISSN: (Online) Volume 4, Issue 4, April 2016 International Journal of Advance Research in Computer Science and Management Studies
ISSN: 2321-7782 (Online) Volume 4, Issue 4, April 2016 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online
More informationPatents as Indicators
Patents as Indicators Prof. Bronwyn H. Hall University of California at Berkeley and NBER Outline Overview Measures of innovation value Measures of knowledge flows October 2004 Patents as Indicators 2
More informationAutomating NSF HERD Reporting Using Machine Learning and Administrative Data
Automating NSF HERD Reporting Using Machine Learning and Administrative Data Rodolfo H. Torres CIMA Session: The Use of Advance Analytics to Drive Decisions 2018 APLU Annual Meeting New Orleans Marriott,
More informationCompound Object Detection Using Region Co-occurrence Statistics
Compound Object Detection Using Region Co-occurrence Statistics Selim Aksoy 1 Krzysztof Koperski 2 Carsten Tusk 2 Giovanni Marchisio 2 1 Department of Computer Engineering, Bilkent University, Ankara,
More informationUSE OF PROFILE WAVE AMPLITUDE ESTIMATES FOR PAVEMENT SERVICEABILITY MEASURES
USE OF PROFILE WAVE AMPLITUDE ESTIMATES FOR PAVEMENT SERVICEABILITY MEASURES Roger S. Walker and W. Ronald Hudson, Center for Highway Research, University of Texas at Austin For a number of years, engineers
More informationPredicting the outcome of NFL games using machine learning Babak Hamadani bhamadan-at-stanford.edu cs229 - Stanford University
Predicting the outcome of NFL games using machine learning Babak Hamadani bhamadan-at-stanford.edu cs229 - Stanford University 1. Introduction: Professional football is a multi-billion industry. NFL is
More informationThe Game-Theoretic Approach to Machine Learning and Adaptation
The Game-Theoretic Approach to Machine Learning and Adaptation Nicolò Cesa-Bianchi Università degli Studi di Milano Nicolò Cesa-Bianchi (Univ. di Milano) Game-Theoretic Approach 1 / 25 Machine Learning
More informationAssessing FP7-ICT research. Performance indicators: patents and publications
Assessing FP7-ICT research Performance indicators: patents and publications Carlos OLIVEIRA DG Information Society and Media Evaluation and Monitoring Unit (DG INFSO C3) FP7-ICT assessment Overview 1)
More informationStatistics 1040 Summer 2009 Exam III
Statistics 1040 Summer 2009 Exam III 1. For the following basic probability questions. Give the RULE used in the appropriate blank (BEFORE the question), for each of the following situations, using one
More informationGlobal Journal of Engineering Science and Research Management
A KERNEL BASED APPROACH: USING MOVIE SCRIPT FOR ASSESSING BOX OFFICE PERFORMANCE Mr.K.R. Dabhade *1 Ms. S.S. Ponde 2 *1 Computer Science Department. D.I.E.M.S. 2 Asst. Prof. Computer Science Department,
More informationINTELLIGENT APRIORI ALGORITHM FOR COMPLEX ACTIVITY MINING IN SUPERMARKET APPLICATIONS
Journal of Computer Science, 9 (4): 433-438, 2013 ISSN 1549-3636 2013 doi:10.3844/jcssp.2013.433.438 Published Online 9 (4) 2013 (http://www.thescipub.com/jcs.toc) INTELLIGENT APRIORI ALGORITHM FOR COMPLEX
More informationVEHICULAR SPEED DETERMINATION USING CELLULAR AND GEO-SPATIAL INFORMATION
International Journal of Mechanical Engineering and Technology (IJMET) Volume 1, Issue 1, January 219, pp. 1496-156, Article ID: IJMET_1_1_152 Available online at http://www.iaeme.com/ijmet/issues.asp?jtype=ijmet&vtype=1&itype=1
More information2016 SAS Analytics Day
About the presenter John Harden MBA with Marketing Analytics Concentration Candidate Professional Experience: U.S. Air Force Computer Systems Operator U.S. Air Force Public Affairs Officer Educational
More informationLecture 3 - Regression
Lecture 3 - Regression Instructor: Prof Ganesh Ramakrishnan July 25, 2016 1 / 30 The Simplest ML Problem: Least Square Regression Curve Fitting: Motivation Error measurement Minimizing Error Method of
More informationSubjective Evaluation Survey
Subjective Evaluation Survey Please answer the following set of questions for each negotiation counterpart. Comment boxes are available for each question to provide additional feedback. 1. Your first name
More informationDR. S OFFICE S BIG YEAR. game plan
DR. S OFFICE S BIG YEAR game plan CLARIFYING DOCTOR S ROLE 1. Check your primary desired role: CEO Treating Doctor Clinical Director 2. What is the number of treatment days per week you are planning for
More informationConnecting Commerce. Mining industry confidence in the digital environment. Written by
Connecting Commerce Mining industry confidence in the digital environment Written by About the research This article is part of the Connecting Commerce research programme from The Economist Intelligence
More informationDiscussion: Does Scientific Innovation Lead to Academic Entrepreneurship?
Discussion: Does Scientific Innovation Lead to Academic Entrepreneurship? Andrew A. Toole Deputy Chief Economist, US Patent and Trademark Office Research Associate, ZEW, Mannheim, Germany Labor Market
More informationToday. CS 395T Visual Recognition. Course content. Administration. Expectations. Paper reviews
Today CS 395T Visual Recognition Course logistics Overview Volunteers, prep for next week Thursday, January 18 Administration Class: Tues / Thurs 12:30-2 PM Instructor: Kristen Grauman grauman at cs.utexas.edu
More informationCHAPTER-4 FRUIT QUALITY GRADATION USING SHAPE, SIZE AND DEFECT ATTRIBUTES
CHAPTER-4 FRUIT QUALITY GRADATION USING SHAPE, SIZE AND DEFECT ATTRIBUTES In addition to colour based estimation of apple quality, various models have been suggested to estimate external attribute based
More informationNot the First Digit! Using Benford s Law to Detect Fraudulent Scientific Data* Andreas Diekmann Swiss Federal Institute of Technology Zurich
Not the First! Using Benford s Law to Detect Fraudulent Scientific Data* Andreas Diekmann Swiss Federal Institute of Technology Zurich October 2004 diekmann@soz.gess.ethz.ch *For data collection I would
More informationChapter 1: Stats Starts Here Chapter 2: Data
Chapter 1: Stats Starts Here Chapter 2: Data Statistics data, datum variation individual respondent subject participant experimental unit observation variable categorical quantitative Calculator Skills:
More informationShervin Rahimzadeh Arashloo
Shervin Rahimzadeh Arashloo Contact Details Department of Medical Informatics Faculty of Medical Sciences Tarbiat Modares University Tehran, Iran S.Rahimzadeh@modares.ac.ir Research Interests Computer
More informationInformation Management course
Università degli Studi di Mila Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 19: 10/12/2015 Data Mining: Concepts and Techniques (3rd ed.) Chapter 8 Jiawei
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Practice for Final Exam Name Identify the following variable as either qualitative or quantitative and explain why. 1) The number of people on a jury A) Qualitative because it is not a measurement or a
More informationLaser Printer Source Forensics for Arbitrary Chinese Characters
Laser Printer Source Forensics for Arbitrary Chinese Characters Xiangwei Kong, Xin gang You,, Bo Wang, Shize Shang and Linjie Shen Information Security Research Center, Dalian University of Technology,
More informationIBM SPSS Neural Networks
IBM Software IBM SPSS Neural Networks 20 IBM SPSS Neural Networks New tools for building predictive models Highlights Explore subtle or hidden patterns in your data. Build better-performing models No programming
More informationCROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS. Kuan-Chuan Peng and Tsuhan Chen
CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS Kuan-Chuan Peng and Tsuhan Chen Cornell University School of Electrical and Computer Engineering Ithaca, NY 14850
More informationAnalysis of Ternary and Binary High Resolution Codes Using MATLAB
Analysis of Ternary and Binary High Resolution Codes Using MATLAB Annepu.Venkata NagaVamsi Dept of E.I.E, AITAM, Tekkali -532201, India. Dr.D.Elizabeth Rani Dept of E.I.E,Gitam university, Vishakapatnam-45,
More informationSocial Network Analysis in HCI
Social Network Analysis in HCI Derek L. Hansen and Marc A. Smith Marigold Bays-Muchmore (baysmuc2) Hang Cui (hangcui2) Contents Introduction ---------------- What is Social Network Analysis? How does it
More informationSolution Manual Fundamentals Of Statistical Signal Processing Detection Theory
Solution Manual Fundamentals Of Statistical Signal Processing Detection Theory Solutions Manual - Fundamentals of Statistical Signal Procession -Estimation Fundamentals of Statistical Signal Processing,
More informationSoftware as a Medical Device (SaMD)
Software as a Medical Device () Working Group Status Application of Clinical Evaluation Working Group Chair: Bakul Patel Center for Devices and Radiological Health US Food and Drug Administration NWIE
More informationLive esport-analytics
Live esport-analytics Solving the Informational Fairness Conundrum Lukas N.P. Egger Head of Research, Dojo Madness DOJO MADNESS - esports tools - Help gamers to master their play - Gaming enthusiasm and
More informationHamming net based Low Complexity Successive Cancellation Polar Decoder
Hamming net based Low Complexity Successive Cancellation Polar Decoder [1] Makarand Jadhav, [2] Dr. Ashok Sapkal, [3] Prof. Ram Patterkine [1] Ph.D. Student, [2] Professor, Government COE, Pune, [3] Ex-Head
More informationSaying Thank You 4/24/17. The Art of Saying Thank You Eric D. Garcia, IT & Digital Marketing Consultant
The Art of Saying Thank You Eric D. Garcia, IT & Digital Marketing Consultant Clients/Our Pet Owners Employees Saying Thank You It s what you do to keep clients coming back Marketing is not just about
More informationResearch Challenges in Forecasting Technical Emergence. Dewey Murdick, IARPA 25 September 2013
Research Challenges in Forecasting Technical Emergence Dewey Murdick, IARPA 25 September 2013 1 Invests in high-risk/high-payoff research programs that have the potential to provide our nation with an
More informationBasics of Probability
Basics of Probability Dublin R May 30, 2013 1 Overview Overview Basics of Probability (some definitions, the prob package) Dice Rolls and the Birthday Distribution ( histograms ) Gambler s Ruin ( plotting
More information15-388/688 - Practical Data Science: Visualization and Data Exploration. J. Zico Kolter Carnegie Mellon University Spring 2018
15-388/688 - Practical Data Science: Visualization and Data Exploration J. Zico Kolter Carnegie Mellon University Spring 2018 1 Outline Basics of visualization Data types and visualization types Software
More informationSkills we've learned. Skills we need. 7 3 Independent and Dependent Events. March 17, Alg2 Notes 7.3.notebook
7 3 Independent and Dependent Events Skills we've learned 1. In a box of 25 switches, 3 are defective. What is the probability of randomly selecting a switch that is not defective? 2. There are 12 E s
More informationWeb Appendix: Online Reputation Mechanisms and the Decreasing Value of Chain Affiliation
Web Appendix: Online Reputation Mechanisms and the Decreasing Value of Chain Affiliation November 28, 2017. This appendix accompanies Online Reputation Mechanisms and the Decreasing Value of Chain Affiliation.
More informationUnit 7 Central Tendency and Probability
Name: Block: 7.1 Central Tendency 7.2 Introduction to Probability 7.3 Independent Events 7.4 Dependent Events 7.1 Central Tendency A central tendency is a central or value in a data set. We will look at
More informationChapter 5: Game Analytics
Lecture Notes for Managing and Mining Multiplayer Online Games Summer Semester 2017 Chapter 5: Game Analytics Lecture Notes 2012 Matthias Schubert http://www.dbs.ifi.lmu.de/cms/vo_managing_massive_multiplayer_online_games
More informationCHAPTER 144. Interpretation of Shoreline Position from Aerial Photographs John S. Fisher 1 Margery F. Overton 2
CHAPTER 144 Interpretation of Shoreline Position from Aerial Photographs John S. Fisher 1 Margery F. Overton 2 Abstract A review of some of the potential sources of error associated with the use of aerial
More informationComparing the Design Cognition of Concept Design Reviews of Industrial and Mechanical Engineering Designers
Comparing the Design Cognition of Concept Design Reviews of Industrial and Mechanical Engineering Designers John S. Gero George Mason University and UNCC, USA john@johngero.com Hao Jiang Zhejiang University,
More informationCrop Area Estimation with Remote Sensing
Boogta 25-28 November 2008 1 Crop Area Estimation with Remote Sensing Some considerations and experiences for the application to general agricultural statistics Javier.gallego@jrc.it Some history: MARS
More informationDota2 is a very popular video game currently.
Dota2 Outcome Prediction Zhengyao Li 1, Dingyue Cui 2 and Chen Li 3 1 ID: A53210709, Email: zhl380@eng.ucsd.edu 2 ID: A53211051, Email: dicui@eng.ucsd.edu 3 ID: A53218665, Email: lic055@eng.ucsd.edu March
More informationClever YouTube Profits
1 2 Clever YouTube Profits With Declan Mc First I would like to thank and congratulate you for purchasing Clever YouTube Profits, you won t be sorry that you did :). I ve been online now about a year and
More informationSimulated Statistics for the Proposed By-Division Design In the Consumer Price Index October 2014
Simulated Statistics for the Proposed By-Division Design In the Consumer Price Index October 2014 John F Schilp U.S. Bureau of Labor Statistics, Office of Prices and Living Conditions 2 Massachusetts Avenue
More informationDemand for Commitment in Online Gaming: A Large-Scale Field Experiment
Demand for Commitment in Online Gaming: A Large-Scale Field Experiment Vinci Y.C. Chow and Dan Acland University of California, Berkeley April 15th 2011 1 Introduction Video gaming is now the leisure activity
More informationGAMBLING ( ) Name: Partners: everyone else in the class
Name: Partners: everyone else in the class GAMBLING Games of chance, such as those using dice and cards, oporate according to the laws of statistics: the most probable roll is the one to bet on, and the
More informationCELLULAR COVERAGE IN UNDERGROUND TRANSPORT SYSTEMS: A CASE STUDY THE RIO DE JANEIRO METROPOLITAN
CELLULAR COVERAGE IN UNDERGROUND TRANORT SYSTEMS: A CASE STUDY THE RIO DE JANEIRO METROPOLITAN Marcio Rodrigues * Bruno Maia * Luiz Silva Mello ** Marlene Pontes * ** * WiNGS Telecom ** CETUC-PUC/Rio INTRODUCTION
More informationNumber patterns on a spreadsheet
A1 SS Number patterns on a spreadsheet This sheet will help you to create your own number patterns on a spreadsheet. Do the steps one at a time. You will soon feel more confident with a spreadsheet program.
More informationLearning and Using Models of Kicking Motions for Legged Robots
Learning and Using Models of Kicking Motions for Legged Robots Sonia Chernova and Manuela Veloso Computer Science Department Carnegie Mellon University Pittsburgh, PA 15213 {soniac, mmv}@cs.cmu.edu Abstract
More informationReload & Relaunch Strategic Governance of Platform Ecosystems
Reload & Relaunch Strategic Governance of Platform Ecosystems Joost Rietveld - Erasmus University Melissa Schilling - New York University Cristiano Bellavitis - University of Auckland Platform Strategy
More informationBasic Analysis of Computer-Mediated Communication: Mean Length of Utterance Waseda University Eiichiro Tsutsui
Basic Analysis of Computer-Mediated Communication: Mean Length of Utterance Waseda University Eiichiro Tsutsui 1. Purpose The purpose of this chapter is to investigate whether or not our students can make
More informationAn Introduction to Poker Opponent Modeling
An Introduction to Poker Opponent Modeling Peter Chapman Brielin Brown University of Virginia 1 March 2011 It is not my aim to surprise or shock you-but the simplest way I can summarize is to say that
More informationInformation Retrieval Evaluation
Information Retrieval Evaluation (COSC 416) Nazli Goharian nazli@cs.georgetown.edu Goharian, Grossman, Frieder, 2002, 2010 Measuring Effectiveness An algorithm is deemed incorrect if it does not have a
More informationScatter Plots, Correlation, and Lines of Best Fit
Lesson 7.3 Objectives Interpret a scatter plot. Identify the correlation of data from a scatter plot. Find the line of best fit for a set of data. Scatter Plots, Correlation, and Lines of Best Fit A video
More informationMachine Learning. Classification, Discriminative learning. Marc Toussaint University of Stuttgart Summer 2014
Machine Learning Classification, Discriminative learning Structured output, structured input, discriminative function, joint input-output features, Likelihood Maximization, Logistic regression, binary
More informationViewing Environments for Cross-Media Image Comparisons
Viewing Environments for Cross-Media Image Comparisons Karen Braun and Mark D. Fairchild Munsell Color Science Laboratory, Center for Imaging Science Rochester Institute of Technology, Rochester, New York
More informationMULTIPLE ORGANISATION ( MULTI ORG )
MULTIPLE ORGANISATION ( MULTI ORG ) Oracle Financials R12 www.erpstuff.com INDEX MULTI ORGANISATION Introduction Benefits Multi Org Structure in Release 11 and Release 12 Release 11 Versus Release 12 Important
More informationCHAPTER 9 THE EFFECTS OF GAUGE LENGTH AND STRAIN RATE ON THE TENSILE PROPERTIES OF REGULAR AND AIR JET ROTOR SPUN COTTON YARNS
170 CHAPTER 9 THE EFFECTS OF GAUGE LENGTH AND STRAIN RATE ON THE TENSILE PROPERTIES OF REGULAR AND AIR JET ROTOR SPUN COTTON YARNS 9.1 INTRODUCTION It is the usual practise to test the yarn at a gauge
More informationCLASSIFICATION OF VEGETATION AREA FROM SATELLITE IMAGES USING IMAGE PROCESSING TECHNIQUES ABSTRACT
CLASSIFICATION OF VEGETATION AREA FROM SATELLITE IMAGES USING IMAGE PROCESSING TECHNIQUES Arpita Pandya Research Scholar, Computer Science, Rai University, Ahmedabad Dr. Priya R. Swaminarayan Professor
More informationAt The Crossroads Jacobs Media 2015
At The Crossroads Jacobs Media 2015 Methodology 54 public radio stations N = 19,730 Interview dates: June 15-July 20, 2015 Most respondents are members of station email databases. Some responses were gathered
More informationR&D Policy and Technological Trajectories of Regions: Evidence from the EU Framework Programmes
R&D Policy and Technological Trajectories of Regions: Evidence from the EU Framework Programmes Wolf-Hendrik Uhlbach 1 Pierre-Alexandre Balland 2 Thomas Scherngell 3 1 Copenhagen Business School 2 Utrecht
More information1 Dr. Norbert Steigenberger Reward-based crowdfunding. On the Motivation of Backers in the Video Gaming Industry. Research report
1 Dr. Norbert Steigenberger Reward-based crowdfunding On the Motivation of Backers in the Video Gaming Industry Research report Dr. Norbert Steigenberger Seminar for Business Administration, Corporate
More informationGames and Big Data: A Scalable Multi-Dimensional Churn Prediction Model
Games and Big Data: A Scalable Multi-Dimensional Churn Prediction Model Paul Bertens, Anna Guitart and África Periáñez (Silicon Studio) CIG 2017 New York 23rd August 2017 Who are we? Game studio and graphics
More informationipad Total Cost of Ownership: the Cost Savings and of a Mid-Year Refresh
ipad Total Cost of Ownership: the and Cost Savings of a Mid-Year Refresh All technologies have a shelf life and a perceived value at each birthday. What that value is and how quickly it depreciates depends
More informationJohn Forth and Geoff Mason. National Institute of Economic and Social Research, London. NIESR Discussion Paper No March 2004
Information and Communication Technology (ICT) Adoption and Utilisation, Skill Constraints and Firm- Level Performance: Evidence from UK Benchmarking Surveys John Forth and Geoff Mason National Institute
More informationAdditional file 1: Cleaning, Geocoding and Weighting
Additional file 1: Cleaning, Geocoding and Weighting Contents 1 Introduction... 2 2 Address Accuracy and Cleaning... 2 2.1 Sources... 2 2.2 Address Linking... 3 2.3 Cleaning Summary... 3 3 Time Consistency
More informationFindings. A Number of Candles Do Not Work as Expected
1 Findings Arguably, you are reading the most important chapter because it discusses the discoveries I made about candles while researching this book. You may already know some of them, but the others
More informationReflections on the First Man vs. Machine No-Limit Texas Hold 'em Competition
Reflections on the First Man vs. Machine No-Limit Texas Hold 'em Competition Sam Ganzfried Assistant Professor, Computer Science, Florida International University, Miami FL PhD, Computer Science Department,
More information2. What is Text Mining? There is no single definition of text mining. In general, text mining is a subdomain of data mining that primarily deals with
1. Title Slide 1 2. What is Text Mining? There is no single definition of text mining. In general, text mining is a subdomain of data mining that primarily deals with textual documents rather than discrete
More information