On Feature Selection, Bias-Variance, and Bagging
|
|
- Timothy Berry
- 6 years ago
- Views:
Transcription
1 On Feature Selection, Bias-Variance, and Bagging Art Munson 1 Rich Caruana 2 1 Department of Computer Science Cornell University 2 Microsoft Corporation ECML-PKDD 2009 Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
2 Task: Model Presence/Absence of Birds Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
3 Task: Model Presence/Absence of Birds Tried: SVMs boosted decision trees bagged decision trees neural networks Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
4 Task: Model Presence/Absence of Birds Tried: SVMs boosted decision trees bagged decision trees neural networks Ultimate goal: understand avian population dynamics Ran feature selection to find smallest feature set with excellent performance. Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
5 Bagging Likes Many Noisy Features (?) RMS European Starling bagging all features # features Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
6 Surprised Reviewers Reviewer A [I] also found that the results reported in Figure 2 [were] strange, where the majority [of] results show that classifiers built from selected features are actually inferior to the ones trained from the whole feature [set]. Reviewer B It is very surprising that the performance of all methods improves (or stays constant) when the number of features is increased. Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
7 Purpose of this Study Does bagging often benefit from many features? If so, why? Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
8 Outline 1 Story Behind the Paper 2 Background 3 Experiment 1: FS and Bias-Variance 4 Experiment 2: Weak, Noisy Features Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
9 Review of Bagging Bagging: simple ensemble learning algorithm [Bre96]: draw random sample of training data train a model using sample (e.g. decision tree) repeat N times (e.g. 25 times) bagged predictions: average predictions of N models Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
10 Facts about Bagging Surprisingly competitive performance & rarely overfits [BK99]. Main benefit is reducing variance of constituent models [BK99]. Improves ability to ignore irrelevant features [AP96]. Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
11 Review of Bias-Variance Decomposition Error of learning algorithm on example x comes from 3 sources: noise intrinsic error / uncertainty for x s true label bias how close, on average, is algorithm to optimal prediction variance how much does prediction change if change training set Error decomposes as: error(x) = noise(x) + bias(x) + variance(x) On real problems, cannot separately measure bias and noise. Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
12 Measuring Bias & Variance (Squared Error) Generate empirical distribution of the algorithm s predictions [BK99]: Randomly sample 1 2 of the training data. Train model using sample and make predictions y for test data. Repeat R times (e.g. 20 times). Compute average prediction y m for every test example. Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
13 Measuring Bias & Variance (Squared Error) Generate empirical distribution of the algorithm s predictions [BK99]: Randomly sample 1 2 of the training data. Train model using sample and make predictions y for test data. Repeat R times (e.g. 20 times). Compute average prediction y m for every test example. For each test example x with true label t: bias(x) = (t y m ) 2 variance(x) = 1 R R (y m y i ) 2 Average over test cases to get expected bias & variance for algorithm. i=1 Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
14 Review of Feature Selection Forward Stepwise Feature Selection Start from empty selected set. Evaluate benefit of selecting each non-selected feature (train model for each choice). Select most beneficial feature. Repeat search until stopping criteria. Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
15 Review of Feature Selection Forward Stepwise Feature Selection Start from empty selected set. Evaluate benefit of selecting each non-selected feature (train model for each choice). Select most beneficial feature. Repeat search until stopping criteria. Correlation-based Feature Filtering Rank features by individual correlation with class label. Choose cutoff point (by statistical test or cross-validation). Keep features above cutoff point. Discard rest. Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
16 Experiment 1: Bias-Variance of Feature Selection Summary: 19 datasets order features using feature selection forward stepwise feature selection or correlation feature filtering, depending on dataset size estimate bias & variance at multiple feature set sizes 5-fold cross-validation # Features 1e Dataset Sizes # Samples Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
17 Case 1: No Improvement from Feature Selection covtype variance bias/noise 0.06 MSE single decision tree bagged decision tree # features Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
18 Case 2: FS Improves Non-Bagged Model MSE variance bias/noise medis overfits with too many features # features Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
19 Take Away Points More features lower bias/noise, higher variance. Feature selection does not improve bagged model performance (1 exception). Best subset size corresponds to best bias/variance tradeoff point. Algorithm dependant Relevant features may be discarded if variance increase outweighs extra information Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
20 Why Does Bagging Benefit from so Many Features? MSE cryst # features 200 variance bias/noise ,341 Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
21 Why Does Bagging Benefit from so Many Features? MSE cryst # features 200 variance bias/noise ,341 Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
22 Why Does Bagging Benefit from so Many Features? MSE cryst # features 200 variance bias/noise ,341 Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
23 Hypothesis Bagging improves base learner s ability to benefit from weak, noisy features. Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
24 Experiment 2: Noisy Informative Features Summary: generate synthetic data (6 features) duplicate 1/2 of the features 20 times corrupt X% of values in duplicated features train single and bagged trees with corrupted features and 3 non-duplicated features compare to: ideal, unblemished feature set, and no noisy features (3 non-duplicated only) Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
25 Bagging Extracts More Info from Noisy Features damaged variance bias/noise MSE core fraction feature values corrupted Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
26 Bagging Extracts More Info from Noisy Features damaged variance bias/noise MSE original features (ideal) core fraction feature values corrupted Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
27 Bagging Extracts More Info from Noisy Features non-duplicated 0.25 features (baseline) 0.2 MSE 0.15 damaged variance bias/noise 6 original features (ideal) core fraction feature values corrupted Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
28 Bagging Extracts More Info from Noisy Features non-duplicated 0.25 features (baseline) 0.2 MSE 0.15 damaged variance bias/noise everything else: 3 non-duplicated features + 60 noisy features 6 original features (ideal) core fraction feature values corrupted Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
29 Conclusions After training 9,060,936 decision trees... Experiment 1: More features lower bias/noise, higher variance. Feature selection does not improve bagged model performance. Best subset size corresponds to best bias/variance tradeoff point. Experiment 2: Bagged trees surprisingly good at extracting useful information from noisy features. Different weak features in different trees. Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
30 Bibliography Kamal M. Ali and Michael J. Pazzani. Error reduction through learning multiple descriptions. Machine Learning, 24(3): , Eric Bauer and Ron Kohavi. An empirical comparison of voting classification algorithms: Bagging, boosting, and variants. Machine Learning, 36(1-2): , Leo Breiman. Bagging predictors. Machine Learning, 24(2): , Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
31 Exception: Overfitting Pseudo-Identifiers bunting variance bias/noise MSE # features Munson; Caruana (Cornell; Microsoft) On Feature Selection ECML-PKDD / 22
SMILe: Shuffled Multiple-Instance Learning
SMILe: Shuffled Multiple-Instance Learning Gary Doran and Soumya Ray Department of Electrical Engineering and Computer Science Case Western Reserve University Cleveland, OH 44106, USA {gary.doran,sray}@case.edu
More informationSELECTING RELEVANT DATA
EXPLORATORY ANALYSIS The data that will be used comes from the reviews_beauty.json.gz file which contains information about beauty products that were bought and reviewed on Amazon.com. Each data point
More informationApplications of Machine Learning Techniques in Human Activity Recognition
Applications of Machine Learning Techniques in Human Activity Recognition Jitenkumar B Rana Tanya Jha Rashmi Shetty Abstract Human activity detection has seen a tremendous growth in the last decade playing
More informationImage analysis. CS/CME/BioE/Biophys/BMI 279 Oct. 31 and Nov. 2, 2017 Ron Dror
Image analysis CS/CME/BioE/Biophys/BMI 279 Oct. 31 and Nov. 2, 2017 Ron Dror 1 Outline Images in molecular and cellular biology Reducing image noise Mean and Gaussian filters Frequency domain interpretation
More informationDETECTION AND CLASSIFICATION OF POWER QUALITY DISTURBANCES
DETECTION AND CLASSIFICATION OF POWER QUALITY DISTURBANCES Ph.D. THESIS by UTKARSH SINGH INDIAN INSTITUTE OF TECHNOLOGY ROORKEE ROORKEE-247 667 (INDIA) OCTOBER, 2017 DETECTION AND CLASSIFICATION OF POWER
More informationVoice Activity Detection
Voice Activity Detection Speech Processing Tom Bäckström Aalto University October 2015 Introduction Voice activity detection (VAD) (or speech activity detection, or speech detection) refers to a class
More informationLearning to Play like an Othello Master CS 229 Project Report. Shir Aharon, Amanda Chang, Kent Koyanagi
Learning to Play like an Othello Master CS 229 Project Report December 13, 213 1 Abstract This project aims to train a machine to strategically play the game of Othello using machine learning. Prior to
More informationKnowledge discovery & data mining Classification & fraud detection
Knowledge discovery & data mining Classification & fraud detection Knowledge discovery & data mining Classification & fraud detection 5/24/00 Click here to start Table of Contents Author: Dino Pedreschi
More informationSSB Debate: Model-based Inference vs. Machine Learning
SSB Debate: Model-based nference vs. Machine Learning June 3, 2018 SSB 2018 June 3, 2018 1 / 20 Machine learning in the biological sciences SSB 2018 June 3, 2018 2 / 20 Machine learning in the biological
More informationLearning Deep Networks from Noisy Labels with Dropout Regularization
Learning Deep Networks from Noisy Labels with Dropout Regularization Ishan Jindal*, Matthew Nokleby*, Xuewen Chen** *Department of Electrical and Computer Engineering **Department of Computer Science Wayne
More informationStacking Ensemble for auto ml
Stacking Ensemble for auto ml Khai T. Ngo Thesis submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the degree of Master
More informationMULTIPLE CLASSIFIERS FOR ELECTRONIC NOSE DATA
MULTIPLE CLASSIFIERS FOR ELECTRONIC NOSE DATA M. Pardo, G. Sberveglieri INFM and University of Brescia Gas Sensor Lab, Dept. of Chemistry and Physics for Materials Via Valotti 9-25133 Brescia Italy D.
More informationClassification of Hand Gestures using Surface Electromyography Signals For Upper-Limb Amputees
Classification of Hand Gestures using Surface Electromyography Signals For Upper-Limb Amputees Gregory Luppescu Stanford University Michael Lowney Stanford Univeristy Raj Shah Stanford University I. ITRODUCTIO
More informationClassification of photographic images based on perceived aesthetic quality
Classification of photographic images based on perceived aesthetic quality Jeff Hwang Department of Electrical Engineering, Stanford University Sean Shi Department of Electrical Engineering, Stanford University
More informationBuilding a Computer Mahjong Player Based on Monte Carlo Simulation and Opponent Models
Building a Computer Mahjong Player Based on Monte Carlo Simulation and Opponent Models Naoki Mizukami 1 and Yoshimasa Tsuruoka 1 1 The University of Tokyo 1 Introduction Imperfect information games are
More informationCSE 258 Winter 2017 Assigment 2 Skill Rating Prediction on Online Video Game
ABSTRACT CSE 258 Winter 2017 Assigment 2 Skill Rating Prediction on Online Video Game In competitive online video game communities, it s common to find players complaining about getting skill rating lower
More informationIJITKMI Volume 7 Number 2 Jan June 2014 pp (ISSN ) Impact of attribute selection on the accuracy of Multilayer Perceptron
Impact of attribute selection on the accuracy of Multilayer Perceptron Niket Kumar Choudhary 1, Yogita Shinde 2, Rajeswari Kannan 3, Vaithiyanathan Venkatraman 4 1,2 Dept. of Computer Engineering, Pimpri-Chinchwad
More informationFreeCiv Learner: A Machine Learning Project Utilizing Genetic Algorithms
FreeCiv Learner: A Machine Learning Project Utilizing Genetic Algorithms Felix Arnold, Bryan Horvat, Albert Sacks Department of Computer Science Georgia Institute of Technology Atlanta, GA 30318 farnold3@gatech.edu
More informationCSE 255 Assignment 1: Helpfulness in Amazon Reviews
CSE 255 Assignment 1: Helpfulness in Amazon Reviews Kristján Jónsson University of California, San Diego 9500 Gilman Dr La Jolla, CA 92093 USA kjonsson@eng.ucsd.edu Devin Platt University of California,
More informationDynamic Throttle Estimation by Machine Learning from Professionals
Dynamic Throttle Estimation by Machine Learning from Professionals Nathan Spielberg and John Alsterda Department of Mechanical Engineering, Stanford University Abstract To increase the capabilities of
More information신경망기반자동번역기술. Konkuk University Computational Intelligence Lab. 김강일
신경망기반자동번역기술 Konkuk University Computational Intelligence Lab. http://ci.konkuk.ac.kr kikim01@kunkuk.ac.kr 김강일 Index Issues in AI and Deep Learning Overview of Machine Translation Advanced Techniques in
More informationProject summary. Key findings, Winter: Key findings, Spring:
Summary report: Assessing Rusty Blackbird habitat suitability on wintering grounds and during spring migration using a large citizen-science dataset Brian S. Evans Smithsonian Migratory Bird Center October
More informationCAT Training CNNs for Image Classification with Noisy Labels
yclic Annealing Training (AT) NNs for Image lassification with Noisy Labels JiaWei Li, Tao Dai, QingTao Tang, YeLi Xing, Shu-Tao Xia Tsinghua University li-jw15@mailstsinghuaeducn October 8, 2018 AT Training
More informationAn Introduction to Machine Learning for Social Scientists
An Introduction to Machine Learning for Social Scientists Tyler Ransom University of Oklahoma, Dept. of Economics November 10, 2017 Outline 1. Intro 2. Examples 3. Conclusion Tyler Ransom (OU Econ) An
More informationNeurocomputing 73 (2010) Contents lists available at ScienceDirect. Neurocomputing. journal homepage:
Neurocomputing 73 (2010) 3039 3065 Contents lists available at ScienceDirect Neurocomputing journal homepage: www.elsevier.com/locate/neucom A neural network-based framework for the reconstruction of incomplete
More informationOutcome Forecasting in Sports. Ondřej Hubáček
Outcome Forecasting in Sports Ondřej Hubáček Motivation & Challenges Motivation exploiting betting markets performance optimization Challenges no available datasets difficulties with establishing the state-of-the-art
More informationFriends don t let friends deploy Black-Box models The importance of transparency in Machine Learning. Rich Caruana Microsoft Research
Friends don t let friends deploy Black-Box models The importance of transparency in Machine Learning Rich Caruana Microsoft Research Friends Don t Let Friends Deploy Black-Box Models The Importance of
More informationClassification of Voltage Sag Using Multi-resolution Analysis and Support Vector Machine
Journal of Clean Energy Technologies, Vol. 4, No. 3, May 2016 Classification of Voltage Sag Using Multi-resolution Analysis and Support Vector Machine Hanim Ismail, Zuhaina Zakaria, and Noraliza Hamzah
More informationAdversarial Examples and Adversarial Training. Ian Goodfellow, OpenAI Research Scientist Presentation at Quora,
Adversarial Examples and Adversarial Training Ian Goodfellow, OpenAI Research Scientist Presentation at Quora, 2016-08-04 In this presentation Intriguing Properties of Neural Networks Szegedy et al, 2013
More informationThe Game-Theoretic Approach to Machine Learning and Adaptation
The Game-Theoretic Approach to Machine Learning and Adaptation Nicolò Cesa-Bianchi Università degli Studi di Milano Nicolò Cesa-Bianchi (Univ. di Milano) Game-Theoretic Approach 1 / 25 Machine Learning
More informationHow Many Imputations are Really Needed? Some Practical Clarifications of Multiple Imputation Theory
Prev Sci (2007) 8:206 213 DOI 10.1007/s11121-007-0070-9 How Many Imputations are Really Needed? Some Practical Clarifications of Multiple Imputation Theory John W. Graham & Allison E. Olchowski & Tamika
More informationEfficient Target Detection from Hyperspectral Images Based On Removal of Signal Independent and Signal Dependent Noise
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 9, Issue 6, Ver. III (Nov - Dec. 2014), PP 45-49 Efficient Target Detection from Hyperspectral
More informationMultimedia Forensics
Multimedia Forensics Using Mathematics and Machine Learning to Determine an Image's Source and Authenticity Matthew C. Stamm Multimedia & Information Security Lab (MISL) Department of Electrical and Computer
More informationCover Page. The handle holds various files of this Leiden University dissertation.
Cover Page The handle http://hdl.handle.net/17/55 holds various files of this Leiden University dissertation. Author: Koch, Patrick Title: Efficient tuning in supervised machine learning Issue Date: 13-1-9
More informationHuman or Robot? Robert Recatto A University of California, San Diego 9500 Gilman Dr. La Jolla CA,
Human or Robot? INTRODUCTION: With advancements in technology happening every day and Artificial Intelligence becoming more integrated into everyday society the line between human intelligence and computer
More informationObject Category Detection using Audio-visual Cues
Object Category Detection using Audio-visual Cues Luo Jie 1,2, Barbara Caputo 1,2, Alon Zweig 3, Jörg-Hendrik Bach 4, and Jörn Anemüller 4 1 IDIAP Research Institute, Centre du Parc, 1920 Martigny, Switzerland
More informationPatent Mining: Use of Data/Text Mining for Supporting Patent Retrieval and Analysis
Patent Mining: Use of Data/Text Mining for Supporting Patent Retrieval and Analysis by Chih-Ping Wei ( 魏志平 ), PhD Institute of Service Science and Institute of Technology Management National Tsing Hua
More informationOptimal Load Shedding Using an Ensemble of Artificial Neural Networks
Optimal Load Shedding Using an Ensemble of Artificial Neural Networks Original Scientific Paper Muhammad FaizanTahir faizantahir_2k7@yahoo.com Hafiz Teheeb-Ul-Hassan tehzibulhasan@gmail.com Kashif Mehmood
More informationStatistical Hypothesis Testing
Statistical Hypothesis Testing Statistical Hypothesis Testing is a kind of inference Given a sample, say something about the population Examples: Given a sample of classifications by a decision tree, test
More informationTODAY, wireless communications are an integral part of
CS229 FINAL PROJECT - FALL 2010 1 Predicting Wireless Channel Utilization at the PHY Jeffrey Mehlman, Stanford Networked Systems Group, Aaron Adcock, Stanford E.E. Department Abstract The ISM band is an
More informationIntroduction to Machine Learning
Introduction to Machine Learning Deep Learning Barnabás Póczos Credits Many of the pictures, results, and other materials are taken from: Ruslan Salakhutdinov Joshua Bengio Geoffrey Hinton Yann LeCun 2
More informationQuick, Draw! Doodle Recognition
Quick, Draw! Doodle Recognition Kristine Guo Stanford University kguo98@stanford.edu James WoMa Stanford University jaywoma@stanford.edu Eric Xu Stanford University ericxu0@stanford.edu Abstract Doodle
More information2010 Census Coverage Measurement - Initial Results of Net Error Empirical Research using Logistic Regression
2010 Census Coverage Measurement - Initial Results of Net Error Empirical Research using Logistic Regression Richard Griffin, Thomas Mule, Douglas Olson 1 U.S. Census Bureau 1. Introduction This paper
More informationAutocomplete Sketch Tool
Autocomplete Sketch Tool Sam Seifert, Georgia Institute of Technology Advanced Computer Vision Spring 2016 I. ABSTRACT This work details an application that can be used for sketch auto-completion. Sketch
More informationImage analysis. CS/CME/BioE/Biophys/BMI 279 Oct. 31 and Nov. 2, 2017 Ron Dror
Image analysis CS/CME/BioE/Biophys/BMI 279 Oct. 31 and Nov. 2, 2017 Ron Dror 1 Outline Images in molecular and cellular biology Reducing image noise Mean and Gaussian filters Frequency domain interpretation
More informationConstructing local discriminative features for signal classification
Constructing local discriminative features for signal classification Local features for signal classification Outline Motivations Problem formulation Lifting scheme Local features Conclusions Toy example
More informationSampling distributions and the Central Limit Theorem
Sampling distributions and the Central Limit Theorem Johan A. Elkink University College Dublin 14 October 2013 Johan A. Elkink (UCD) Central Limit Theorem 14 October 2013 1 / 29 Outline 1 Sampling 2 Statistical
More informationCampus Location Recognition using Audio Signals
1 Campus Location Recognition using Audio Signals James Sun,Reid Westwood SUNetID:jsun2015,rwestwoo Email: jsun2015@stanford.edu, rwestwoo@stanford.edu I. INTRODUCTION People use sound both consciously
More informationClassification of Digital Photos Taken by Photographers or Home Users
Classification of Digital Photos Taken by Photographers or Home Users Hanghang Tong 1, Mingjing Li 2, Hong-Jiang Zhang 2, Jingrui He 1, and Changshui Zhang 3 1 Automation Department, Tsinghua University,
More informationFeature Reduction and Payload Location with WAM Steganalysis
Feature Reduction and Payload Location with WAM Steganalysis Andrew Ker & Ivans Lubenko Oxford University Computing Laboratory contact: adk @ comlab.ox.ac.uk SPIE/IS&T Electronic Imaging, San Jose, CA
More informationCROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS. Kuan-Chuan Peng and Tsuhan Chen
CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS Kuan-Chuan Peng and Tsuhan Chen Cornell University School of Electrical and Computer Engineering Ithaca, NY 14850
More informationMapping Open Water Bodies with Optical Remote Sensing
Mapping Open Water Bodies with Optical Remote Sensing M. O Donnell 1,2 and E. Podest 1 1.Jet Propulsion Laboratory, California Institute of Technology 2 Alliance Gertz-Ressler High School, Los Angeles,
More informationEstimating Single-Channel Source Separation Masks: Relevance Vector Machine Classifiers vs. Pitch-Based Masking
Estimating Single-Channel Source Separation Masks: Relevance Vector Machine Classifiers vs. Pitch-Based Masking Ron J. Weiss and Daniel P. W. Ellis LabROSA, Dept. of Elec. Eng. Columbia University New
More informationarxiv: v2 [cs.mm] 12 Jan 2018
Paper accepted to Media Watermarking, Security, and Forensics, IS&T Int. Symp. on Electronic Imaging, SF, California, USA, 14-18 Feb. 2016. Deep learning is a good steganalysis tool when embedding key
More informationAdaBoost based EMD as a De-Noising Technique in Time Delay Estimation Application
International Journal of Computer Applications (975 8887) Volume 78 No.12, September 213 AdaBoost based EMD as a De-Noising Technique in Time Delay Estimation Application Kusma Kumari Cheepurupalli Dept.
More informationPredicting outcomes of professional DotA 2 matches
Predicting outcomes of professional DotA 2 matches Petra Grutzik Joe Higgins Long Tran December 16, 2017 Abstract We create a model to predict the outcomes of professional DotA 2 (Defense of the Ancients
More informationSTARCRAFT 2 is a highly dynamic and non-linear game.
JOURNAL OF COMPUTER SCIENCE AND AWESOMENESS 1 Early Prediction of Outcome of a Starcraft 2 Game Replay David Leblanc, Sushil Louis, Outline Paper Some interesting things to say here. Abstract The goal
More informationDetection and Classification of Power Quality Event using Discrete Wavelet Transform and Support Vector Machine
Detection and Classification of Power Quality Event using Discrete Wavelet Transform and Support Vector Machine Okelola, Muniru Olajide Department of Electronic and Electrical Engineering LadokeAkintola
More informationA TWO-PART PREDICTIVE CODER FOR MULTITASK SIGNAL COMPRESSION. Scott Deeann Chen and Pierre Moulin
A TWO-PART PREDICTIVE CODER FOR MULTITASK SIGNAL COMPRESSION Scott Deeann Chen and Pierre Moulin University of Illinois at Urbana-Champaign Department of Electrical and Computer Engineering 5 North Mathews
More informationRELEASING APERTURE FILTER CONSTRAINTS
RELEASING APERTURE FILTER CONSTRAINTS Jakub Chlapinski 1, Stephen Marshall 2 1 Department of Microelectronics and Computer Science, Technical University of Lodz, ul. Zeromskiego 116, 90-924 Lodz, Poland
More informationWheel Health Monitoring Using Onboard Sensors
Wheel Health Monitoring Using Onboard Sensors Brad M. Hopkins, Ph.D. Project Engineer Condition Monitoring Amsted Rail Company, Inc. 1 Agenda 1. Motivation 2. Overview of Methodology 3. Application: Wheel
More informationReal-Time Tracking via On-line Boosting Helmut Grabner, Michael Grabner, Horst Bischof
Real-Time Tracking via On-line Boosting, Michael Grabner, Horst Bischof Graz University of Technology Institute for Computer Graphics and Vision Tracking Shrek M Grabner, H Grabner and H Bischof Real-time
More informationCamera Model Identification Framework Using An Ensemble of Demosaicing Features
Camera Model Identification Framework Using An Ensemble of Demosaicing Features Chen Chen Department of Electrical and Computer Engineering Drexel University Philadelphia, PA 19104 Email: chen.chen3359@drexel.edu
More informationContents. List of Figures List of Tables. Structure of the Book How to Use this Book Online Resources Acknowledgements
Contents List of Figures List of Tables Preface Notation Structure of the Book How to Use this Book Online Resources Acknowledgements Notational Conventions Notational Conventions for Probabilities xiii
More informationFeature Engineering. Digging into Data: Jordan Boyd-Graber. University of Maryland. March 4, 2013
Feature Engineering Digging into Data: Jordan Boyd-Graber University of Maryland March 4, 2013 Digging into Data: Jordan Boyd-Graber (UMD) Feature Engineering March 4, 2013 1 / 30 Roadmap How to split
More informationSimulate IFFT using Artificial Neural Network Haoran Chang, Ph.D. student, Fall 2018
Simulate IFFT using Artificial Neural Network Haoran Chang, Ph.D. student, Fall 2018 1. Preparation 1.1 Dataset The training data I used is generated by the trigonometric functions, sine and cosine. There
More informationAn Introduction to Poker Opponent Modeling
An Introduction to Poker Opponent Modeling Peter Chapman Brielin Brown University of Virginia 1 March 2011 It is not my aim to surprise or shock you-but the simplest way I can summarize is to say that
More information2007 Census of Agriculture Non-Response Methodology
2007 Census of Agriculture Non-Response Methodology Will Cecere National Agricultural Statistics Service Research and Development Division, U.S. Department of Agriculture, 3251 Old Lee Highway, Fairfax,
More informationLearning Strategies for Opponent Modeling in Poker
Computer Poker and Imperfect Information: Papers from the AAAI 2013 Workshop Learning Strategies for Opponent Modeling in Poker Ömer Ekmekci Department of Computer Engineering Middle East Technical University
More informationCC4.5: cost-sensitive decision tree pruning
Data Mining VI 239 CC4.5: cost-sensitive decision tree pruning J. Cai 1,J.Durkin 1 &Q.Cai 2 1 Department of Electrical and Computer Engineering, University of Akron, U.S.A. 2 Department of Electrical Engineering
More information10:00-10:30 HOMOGENIZATION OF THE GLOBAL TEMPERATURE Victor Venema, University of Bonn
10:00-10:30 HOMOGENIZATION OF THE GLOBAL TEMPERATURE Victor Venema, University of Bonn The comments in these notes are only intended to clarify the slides and should be seen as informal, just like words
More informationAN ABSTRACT OF THE THESIS OF
AN ABSTRACT OF THE THESIS OF Paul Lewis for the degree of Master of Science in Computer Science presented on June 1, 2010. Title: Ensemble Monte-Carlo Planning: An Empirical Study Abstract approved: Alan
More informationLearning Dota 2 Team Compositions
Learning Dota 2 Team Compositions Atish Agarwala atisha@stanford.edu Michael Pearce pearcemt@stanford.edu Abstract Dota 2 is a multiplayer online game in which two teams of five players control heroes
More informationWe Know Where You Are : Indoor WiFi Localization Using Neural Networks Tong Mu, Tori Fujinami, Saleil Bhat
We Know Where You Are : Indoor WiFi Localization Using Neural Networks Tong Mu, Tori Fujinami, Saleil Bhat Abstract: In this project, a neural network was trained to predict the location of a WiFi transmitter
More informationLaser Printer Source Forensics for Arbitrary Chinese Characters
Laser Printer Source Forensics for Arbitrary Chinese Characters Xiangwei Kong, Xin gang You,, Bo Wang, Shize Shang and Linjie Shen Information Security Research Center, Dalian University of Technology,
More informationCS229: Machine Learning
CS229: Machine Learning Event Identification in Continues Seismic Data Please print out, fill in and include this cover sheet as the first page of your submission. We strongly recommend that you use this
More informationLecture 4 Biosignal Processing. Digital Signal Processing and Analysis in Biomedical Systems
Lecture 4 Biosignal Processing Digital Signal Processing and Analysis in Biomedical Systems Contents - Preprocessing as first step of signal analysis - Biosignal acquisition - ADC - Filtration (linear,
More informationBlack Box Machine Learning
Black Box Machine Learning David S. Rosenberg Bloomberg ML EDU September 20, 2017 David S. Rosenberg (Bloomberg ML EDU) September 20, 2017 1 / 67 Overview David S. Rosenberg (Bloomberg ML EDU) September
More informationPOKER AGENTS LD Miller & Adam Eck April 14 & 19, 2011
POKER AGENTS LD Miller & Adam Eck April 14 & 19, 2011 Motivation Classic environment properties of MAS Stochastic behavior (agents and environment) Incomplete information Uncertainty Application Examples
More informationServer-side Early Detection Method for Detecting Abnormal Players of StarCraft
KSII The 3 rd International Conference on Internet (ICONI) 2011, December 2011 489 Copyright c 2011 KSII Server-side Early Detection Method for Detecting bnormal Players of StarCraft Kyung-Joong Kim 1
More informationDeep Learning for Infrastructure Assessment in Africa using Remote Sensing Data
Deep Learning for Infrastructure Assessment in Africa using Remote Sensing Data Pascaline Dupas Department of Economics, Stanford University Data for Development Initiative @ Stanford Center on Global
More informationDetecting spread spectrum pseudo random noise tags in EEG/MEG using a structure-based decomposition
Detecting spread spectrum pseudo random noise tags in EEG/MEG using a structure-based decomposition P Desain 1, J Farquhar 1,2, J Blankespoor 1, S Gielen 2 1 Music Mind Machine Nijmegen Inst for Cognition
More informationStock Market Indices Prediction Using Time Series Analysis
Stock Market Indices Prediction Using Time Series Analysis ALINA BĂRBULESCU Department of Mathematics and Computer Science Ovidius University of Constanța 124, Mamaia Bd., 900524, Constanța ROMANIA alinadumitriu@yahoo.com
More informationPerformance of Specific vs. Generic Feature Sets in Polyphonic Music Instrument Recognition
Performance of Specific vs. Generic Feature Sets in Polyphonic Music Instrument Recognition Igor Vatolkin 1, Anil Nagathil 2, Wolfgang Theimer 3, Rainer Martin 2 1 ChairofAlgorithmEngineering, TU Dortmund
More informationSelection of Significant Features Using Monte Carlo Feature Selection
Selection of Significant Features Using Monte Carlo Feature Selection Susanne Bornelöv and Jan Komorowski Abstract Feature selection methods identify subsets of features in large datasets. Such methods
More informationSearching Optimal Movements in Multi-Player Games with Imperfect Information
1 Searching Optimal Movements in Multi-Player Games with Imperfect Information Kenshi Yoshimura Teruhisa Hochin Hiroki Nomiya Department of Information Science Kyoto Institute of Technology Kyoto, Japan
More informationThe game of Bridge: a challenge for ILP
The game of Bridge: a challenge for ILP S. Legras, C. Rouveirol, V. Ventos Véronique Ventos LRI Univ Paris-Saclay vventos@nukk.ai 1 Games 2 Interest of games for AI Excellent field of experimentation Problems
More informationInformation Management course
Università degli Studi di Mila Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 19: 10/12/2015 Data Mining: Concepts and Techniques (3rd ed.) Chapter 8 Jiawei
More informationEmpirical Assessment of Classification Accuracy of Local SVM
Empirical Assessment of Classification Accuracy of Local SVM Nicola Segata Enrico Blanzieri Department of Engineering and Computer Science (DISI) University of Trento, Italy. segata@disi.unitn.it 18th
More informationWavelet-based Voice Morphing
Wavelet-based Voice orphing ORPHANIDOU C., Oxford Centre for Industrial and Applied athematics athematical Institute, University of Oxford Oxford OX1 3LB, UK orphanid@maths.ox.ac.u OROZ I.. Oxford Centre
More informationFault Detection Using Hilbert Huang Transform
International Journal of Research in Advent Technology, Vol.6, No.9, September 2018 E-ISSN: 2321-9637 Available online at www.ijrat.org Fault Detection Using Hilbert Huang Transform Balvinder Singh 1,
More informationSome Challenging Problems in Mining Social Media
Some Challenging Problems in Mining Social Media Huan Liu Joint work with Shamanth Kumar Ali Abbasi Reza Zafarani Fred Morstatter Jiliang Tang Data Mining and Machine Learning Lab May 17, 2014 AI Forum
More informationEnsemble Empirical Mode Decomposition: An adaptive method for noise reduction
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735. Volume 5, Issue 5 (Mar. - Apr. 213), PP 6-65 Ensemble Empirical Mode Decomposition: An adaptive
More informationDeep Neural Networks (2) Tanh & ReLU layers; Generalisation and Regularisation
Deep Neural Networks (2) Tanh & ReLU layers; Generalisation and Regularisation Steve Renals Machine Learning Practical MLP Lecture 4 9 October 2018 MLP Lecture 4 / 9 October 2018 Deep Neural Networks (2)
More informationStock Price Prediction Using Multilayer Perceptron Neural Network by Monitoring Frog Leaping Algorithm
Stock Price Prediction Using Multilayer Perceptron Neural Network by Monitoring Frog Leaping Algorithm Ahdieh Rahimi Garakani Department of Computer South Tehran Branch Islamic Azad University Tehran,
More informationBAYESIAN STATISTICAL CONCEPTS
BAYESIAN STATISTICAL CONCEPTS A gentle introduction Alex Etz @alxetz ß Twitter (no e in alex) alexanderetz.com ß Blog November 5 th 2015 Why do we do statistics? Deal with uncertainty Will it rain today?
More informationNEW ASSOCIATION IN BIO-S-POLYMER PROCESS
NEW ASSOCIATION IN BIO-S-POLYMER PROCESS Long Flory School of Business, Virginia Commonwealth University Snead Hall, 31 W. Main Street, Richmond, VA 23284 ABSTRACT Small firms generally do not use designed
More informationOthello/Reversi using Game Theory techniques Parth Parekh Urjit Singh Bhatia Kushal Sukthankar
Othello/Reversi using Game Theory techniques Parth Parekh Urjit Singh Bhatia Kushal Sukthankar Othello Rules Two Players (Black and White) 8x8 board Black plays first Every move should Flip over at least
More informationOpleiding Informatica
Opleiding Informatica Agents for the card game of Hearts Joris Teunisse Supervisors: Walter Kosters, Jeanette de Graaf BACHELOR THESIS Leiden Institute of Advanced Computer Science (LIACS) www.liacs.leidenuniv.nl
More informationDota2 is a very popular video game currently.
Dota2 Outcome Prediction Zhengyao Li 1, Dingyue Cui 2 and Chen Li 3 1 ID: A53210709, Email: zhl380@eng.ucsd.edu 2 ID: A53211051, Email: dicui@eng.ucsd.edu 3 ID: A53218665, Email: lic055@eng.ucsd.edu March
More information