Knowledge discovery & data mining Classification & fraud detection

Size: px
Start display at page:

Download "Knowledge discovery & data mining Classification & fraud detection"

Transcription

1 Knowledge discovery & data mining Classification & fraud detection Knowledge discovery & data mining Classification & fraud detection 5/24/00 Click here to start Table of Contents Author: Dino Pedreschi Knowledge discovery & data mining Classification & fraud detection Module outline The classification task Classification systems and inductive learning Train & test Train & test Training step Test step Prediction Machine learning terminology Comparing classifiers Classical example: play tennis? Module outline Bayesian classification Estimating a-posteriori probabilities Naïve Bayesian Classification Play-tennis example: estimating P(xi C) file:///c /My Documents/2DM_class/index.htm (1 of 3) [5/24/ :49:42 PM]

2 Knowledge discovery & data mining Classification & fraud detection Play-tennis example: classifying X The independence hypothesis Module outline Decision trees Classical example: play tennis? Decision tree obtained with ID3 (Quinlan 86) From decision trees to classification rules Decision tree induction Generate_DT(samples, attribute_list) = Criteria for finding the best split Information gain (ID3 C4.5) Information gain (ID3 C4.5) Information gain - play tennis example Gini index (CART) Gini index - play tennis example Entropy vs. Gini (on continuous attributes) Other criteria in decision tree construction The overfitting problem Stopping vs. pruning If dataset is large If data set is not so large Categorical vs. continuous attributes Summarizing Scalability to large databases Module outline Backpropagation file:///c /My Documents/2DM_class/index.htm (2 of 3) [5/24/ :49:42 PM]

3 Knowledge discovery & data mining Classification & fraud detection Backpropagation Prediction and (statistical) regression Other methods (not covered) Classification with decision trees What have we achieved? References - classification References - classification file:///c /My Documents/2DM_class/index.htm (3 of 3) [5/24/ :49:42 PM]

4 Knowledge discovery & data mining Slide 1 of 50 file:///c /My Documents/2DM_class/sld001.htm [5/24/ :50:12 PM]

5 Module outline Slide 2 of 50 file:///c /My Documents/2DM_class/sld002.htm [5/24/ :50:13 PM]

6 The classification task Slide 3 of 50 file:///c /My Documents/2DM_class/sld003.htm [5/24/ :50:14 PM]

7 Classification systems and inductive learning Slide 4 of 50 file:///c /My Documents/2DM_class/sld004.htm [5/24/ :50:15 PM]

8 Train & test Slide 5 of 50 file:///c /My Documents/2DM_class/sld005.htm [5/24/ :50:16 PM]

9 Train & test Slide 6 of 50 file:///c /My Documents/2DM_class/sld006.htm [5/24/ :50:17 PM]

10 Training step Slide 7 of 50 file:///c /My Documents/2DM_class/sld007.htm [5/24/ :50:18 PM]

11 Test step Slide 8 of 50 file:///c /My Documents/2DM_class/sld008.htm [5/24/ :50:19 PM]

12 Prediction Slide 9 of 50 file:///c /My Documents/2DM_class/sld009.htm [5/24/ :50:20 PM]

13 Machine learning terminology Slide 10 of 50 file:///c /My Documents/2DM_class/sld010.htm [5/24/ :50:20 PM]

14 Comparing classifiers Slide 11 of 50 file:///c /My Documents/2DM_class/sld011.htm [5/24/ :50:21 PM]

15 Classical example: play tennis? Slide 12 of 50 file:///c /My Documents/2DM_class/sld012.htm [5/24/ :50:23 PM]

16 Module outline Slide 13 of 50 file:///c /My Documents/2DM_class/sld013.htm [5/24/ :50:24 PM]

17 Bayesian classification Slide 14 of 50 file:///c /My Documents/2DM_class/sld014.htm [5/24/ :50:24 PM]

18 Estimating a-posteriori probabilities Slide 15 of 50 file:///c /My Documents/2DM_class/sld015.htm [5/24/ :50:25 PM]

19 Naïve Bayesian Classification Slide 16 of 50 file:///c /My Documents/2DM_class/sld016.htm [5/24/ :50:26 PM]

20 Play-tennis example: estimating P(xi C) Slide 17 of 50 file:///c /My Documents/2DM_class/sld017.htm [5/24/ :50:28 PM]

21 Play-tennis example: classifying X Slide 18 of 50 file:///c /My Documents/2DM_class/sld018.htm [5/24/ :50:29 PM]

22 The independence hypothesis Slide 19 of 50 file:///c /My Documents/2DM_class/sld019.htm [5/24/ :50:30 PM]

23 Module outline Slide 20 of 50 file:///c /My Documents/2DM_class/sld020.htm [5/24/ :50:31 PM]

24 Decision trees Slide 21 of 50 file:///c /My Documents/2DM_class/sld021.htm [5/24/ :50:32 PM]

25 Classical example: play tennis? Slide 22 of 50 file:///c /My Documents/2DM_class/sld022.htm [5/24/ :50:33 PM]

26 Decision tree obtained with ID3 (Quinlan 86) Slide 23 of 50 file:///c /My Documents/2DM_class/sld023.htm [5/24/ :50:34 PM]

27 From decision trees to classification rules Slide 24 of 50 file:///c /My Documents/2DM_class/sld024.htm [5/24/ :50:35 PM]

28 Decision tree induction Slide 25 of 50 file:///c /My Documents/2DM_class/sld025.htm [5/24/ :50:36 PM]

29 Generate_DT(samples, attribute_list) = Slide 26 of 50 file:///c /My Documents/2DM_class/sld026.htm [5/24/ :50:37 PM]

30 Criteria for finding the best split Slide 27 of 50 file:///c /My Documents/2DM_class/sld027.htm [5/24/ :50:38 PM]

31 Information gain (ID3 C4.5) Slide 28 of 50 file:///c /My Documents/2DM_class/sld028.htm [5/24/ :50:39 PM]

32 Information gain (ID3 C4.5) Slide 29 of 50 file:///c /My Documents/2DM_class/sld029.htm [5/24/ :50:40 PM]

33 Information gain - play tennis example Slide 30 of 50 file:///c /My Documents/2DM_class/sld030.htm [5/24/ :50:42 PM]

34 Gini index (CART) Slide 31 of 50 file:///c /My Documents/2DM_class/sld031.htm [5/24/ :50:43 PM]

35 Gini index - play tennis example Slide 32 of 50 file:///c /My Documents/2DM_class/sld032.htm [5/24/ :50:44 PM]

36 Entropy vs. Gini (on continuous attributes) Slide 33 of 50 file:///c /My Documents/2DM_class/sld033.htm [5/24/ :50:45 PM]

37 Other criteria in decision tree construction Slide 34 of 50 file:///c /My Documents/2DM_class/sld034.htm [5/24/ :50:46 PM]

38 The overfitting problem Slide 35 of 50 file:///c /My Documents/2DM_class/sld035.htm [5/24/ :50:47 PM]

39 Stopping vs. pruning Slide 36 of 50 file:///c /My Documents/2DM_class/sld036.htm [5/24/ :50:48 PM]

40 If dataset is large Slide 37 of 50 file:///c /My Documents/2DM_class/sld037.htm [5/24/ :50:49 PM]

41 If data set is not so large Slide 38 of 50 file:///c /My Documents/2DM_class/sld038.htm [5/24/ :50:50 PM]

42 Categorical vs. continuous attributes Slide 39 of 50 file:///c /My Documents/2DM_class/sld039.htm [5/24/ :50:51 PM]

43 Summarizing Slide 40 of 50 file:///c /My Documents/2DM_class/sld040.htm [5/24/ :50:52 PM]

44 Scalability to large databases Slide 41 of 50 file:///c /My Documents/2DM_class/sld041.htm [5/24/ :50:53 PM]

45 Module outline Slide 42 of 50 file:///c /My Documents/2DM_class/sld042.htm [5/24/ :50:53 PM]

46 Backpropagation Slide 43 of 50 file:///c /My Documents/2DM_class/sld043.htm [5/24/ :50:54 PM]

47 Backpropagation Slide 44 of 50 file:///c /My Documents/2DM_class/sld044.htm [5/24/ :50:55 PM]

48 Prediction and (statistical) regression Slide 45 of 50 file:///c /My Documents/2DM_class/sld045.htm [5/24/ :50:56 PM]

49 Other methods (not covered) Slide 46 of 50 file:///c /My Documents/2DM_class/sld046.htm [5/24/ :50:57 PM]

50 Classification with decision trees Slide 47 of 50 file:///c /My Documents/2DM_class/sld047.htm [5/24/ :50:58 PM]

51 What have we achieved? Slide 48 of 50 file:///c /My Documents/2DM_class/sld048.htm [5/24/ :50:59 PM]

52 References - classification Slide 49 of 50 file:///c /My Documents/2DM_class/sld049.htm [5/24/ :51:01 PM]

53 References - classification Slide 50 of 50 file:///c /My Documents/2DM_class/sld050.htm [5/24/ :51:02 PM]

Information Management course

Information Management course Università degli Studi di Mila Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 19: 10/12/2015 Data Mining: Concepts and Techniques (3rd ed.) Chapter 8 Jiawei

More information

Contents. List of Figures List of Tables. Structure of the Book How to Use this Book Online Resources Acknowledgements

Contents. List of Figures List of Tables. Structure of the Book How to Use this Book Online Resources Acknowledgements Contents List of Figures List of Tables Preface Notation Structure of the Book How to Use this Book Online Resources Acknowledgements Notational Conventions Notational Conventions for Probabilities xiii

More information

Classification rules for Indian Rice diseases

Classification rules for Indian Rice diseases www.ijcsi.org 444 Classification rules for Indian Rice diseases A.Nithya 1 and Dr.V.Sundaram 2 1 Asst Professor in Computer Applications, Nehru Arts and Science College, Coimbatore, Tamil Nadu, India.

More information

Machine Learning, Data Mining, and Knowledge Discovery: An Introduction

Machine Learning, Data Mining, and Knowledge Discovery: An Introduction Machine Learning, Data Mining, and Kwledge Discovery: An Introduction Outline Data Mining Application Examples Data Mining & Kwledge Discovery Data Mining with Weka AHPCRC Workshop - 8/16/11 - Dr. Martin

More information

Energy modeling/simulation Using the BIM technology in the Curriculum of Architectural and Construction Engineering and Management

Energy modeling/simulation Using the BIM technology in the Curriculum of Architectural and Construction Engineering and Management Paper ID #7196 Energy modeling/simulation Using the BIM technology in the Curriculum of Architectural and Construction Engineering and Management Dr. Hyunjoo Kim, The University of North Carolina at Charlotte

More information

DECISION TREE TUTORIAL

DECISION TREE TUTORIAL Kardi Teknomo DECISION TREE TUTORIAL Revoledu.com Decision Tree Tutorial by Kardi Teknomo Copyright 2008-2012 by Kardi Teknomo Published by Revoledu.com Online edition is available at Revoledu.com Last

More information

PREDICTING ASSEMBLY QUALITY OF COMPLEX STRUCTURES USING DATA MINING Predicting with Decision Tree Algorithm

PREDICTING ASSEMBLY QUALITY OF COMPLEX STRUCTURES USING DATA MINING Predicting with Decision Tree Algorithm PREDICTING ASSEMBLY QUALITY OF COMPLEX STRUCTURES USING DATA MINING Predicting with Decision Tree Algorithm Ekaterina S. Ponomareva, Kesheng Wang, Terje K. Lien Department of Production and Quality Engieering,

More information

Feature Engineering. Digging into Data: Jordan Boyd-Graber. University of Maryland. March 4, 2013

Feature Engineering. Digging into Data: Jordan Boyd-Graber. University of Maryland. March 4, 2013 Feature Engineering Digging into Data: Jordan Boyd-Graber University of Maryland March 4, 2013 Digging into Data: Jordan Boyd-Graber (UMD) Feature Engineering March 4, 2013 1 / 30 Roadmap How to split

More information

Games and Big Data: A Scalable Multi-Dimensional Churn Prediction Model

Games and Big Data: A Scalable Multi-Dimensional Churn Prediction Model Games and Big Data: A Scalable Multi-Dimensional Churn Prediction Model Paul Bertens, Anna Guitart and África Periáñez (Silicon Studio) CIG 2017 New York 23rd August 2017 Who are we? Game studio and graphics

More information

Implementation of a New Recommendation System Based on Decision Tree Using Implicit Relevance Feedback

Implementation of a New Recommendation System Based on Decision Tree Using Implicit Relevance Feedback Implementation of a New Recommendation System Based on Decision Tree Using Implicit Relevance Feedback Anıl Utku*, Hacer Karacan, Oktay Yıldız, M. Ali Akcayol Gazi University Computer Engineering Department,

More information

Privacy preserving data mining multiplicative perturbation techniques

Privacy preserving data mining multiplicative perturbation techniques Privacy preserving data mining multiplicative perturbation techniques Li Xiong CS573 Data Privacy and Anonymity Outline Review and critique of randomization approaches (additive noise) Multiplicative data

More information

A novel feature selection algorithm for text categorization

A novel feature selection algorithm for text categorization Expert Systems with Applications Expert Systems with Applications 33 (2007) 1 5 www.elsevier.com/locate/eswa A novel feature selection algorithm for text categorization Wenqian Shang a, *, Houkuan Huang

More information

Patent Mining: Use of Data/Text Mining for Supporting Patent Retrieval and Analysis

Patent Mining: Use of Data/Text Mining for Supporting Patent Retrieval and Analysis Patent Mining: Use of Data/Text Mining for Supporting Patent Retrieval and Analysis by Chih-Ping Wei ( 魏志平 ), PhD Institute of Service Science and Institute of Technology Management National Tsing Hua

More information

SSB Debate: Model-based Inference vs. Machine Learning

SSB Debate: Model-based Inference vs. Machine Learning SSB Debate: Model-based nference vs. Machine Learning June 3, 2018 SSB 2018 June 3, 2018 1 / 20 Machine learning in the biological sciences SSB 2018 June 3, 2018 2 / 20 Machine learning in the biological

More information

An Introduction to Machine Learning for Social Scientists

An Introduction to Machine Learning for Social Scientists An Introduction to Machine Learning for Social Scientists Tyler Ransom University of Oklahoma, Dept. of Economics November 10, 2017 Outline 1. Intro 2. Examples 3. Conclusion Tyler Ransom (OU Econ) An

More information

Survey on Needs, Applications and Agorithms of Data Mining for Healthcare

Survey on Needs, Applications and Agorithms of Data Mining for Healthcare Survey on Needs, Applications and Agorithms of Data Mining for Healthcare Sharad Mathur 1, Dr. Bhavesh Joshi 2 ABSTRACT Data mining is one of the essential areas of research that is more popular in health

More information

CC4.5: cost-sensitive decision tree pruning

CC4.5: cost-sensitive decision tree pruning Data Mining VI 239 CC4.5: cost-sensitive decision tree pruning J. Cai 1,J.Durkin 1 &Q.Cai 2 1 Department of Electrical and Computer Engineering, University of Akron, U.S.A. 2 Department of Electrical Engineering

More information

Decision Tree Based Online Voltage Security Assessment Using PMU Measurements

Decision Tree Based Online Voltage Security Assessment Using PMU Measurements Decision Tree Based Online Voltage Security Assessment Using PMU Measurements Vijay Vittal Ira A. Fulton Chair Professor Arizona State University Seminar, January 27, 29 Project Team Ph.D. Student Ruisheng

More information

Norsk Regnesentral (NR) Norwegian Computing Center

Norsk Regnesentral (NR) Norwegian Computing Center Norsk Regnesentral (NR) Norwegian Computing Center Petter Abrahamsen Joining Forces 2018 www.nr.no NUSSE: - 512 9-digit numbers - 200 additions/second Our latest servers: - Four Titan X GPUs - 14 336 cores

More information

ArcGIS Pro: What s New in Analysis. Rob Elkins

ArcGIS Pro: What s New in Analysis. Rob Elkins ArcGIS Pro: What s New in Analysis Rob Elkins ArcGIS Pro Welcome ArcGIS Pro: Analysis Rob Elkins ArcGIS Pro 1.0 Now Available = + Includes the complete ArcGIS Platform Application fusion Single, always

More information

IMPLEMENTATION OF NAÏVE BAYESIAN DATA MINING ALGORITHM ON DECEASED REGISTRATION DATA

IMPLEMENTATION OF NAÏVE BAYESIAN DATA MINING ALGORITHM ON DECEASED REGISTRATION DATA International Journal of Computer Engineering & Technology (IJCET) Volume 10, Issue 1, January February 2019, pp. 32 37, Article ID: IJCET_10_01_004 Available online at http://www.iaeme.com/ijcet/issues.asp?jtype=ijcet&vtype=10&itype=1

More information

An FPGA Implementation of Decision Tree Classification

An FPGA Implementation of Decision Tree Classification An FPGA Implementation of Decision Tree Classification Ramanathan Narayanan Daniel Honbo Gokhan Memik Alok Choudhary Joseph Zambreno Electrical Engineering and Computer Science Electrical and Computer

More information

On Feature Selection, Bias-Variance, and Bagging

On Feature Selection, Bias-Variance, and Bagging On Feature Selection, Bias-Variance, and Bagging Art Munson 1 Rich Caruana 2 1 Department of Computer Science Cornell University 2 Microsoft Corporation ECML-PKDD 2009 Munson; Caruana (Cornell; Microsoft)

More information

Anticipation of Winning Probability in Poker Using Data Mining

Anticipation of Winning Probability in Poker Using Data Mining Anticipation of Winning Probability in Poker Using Data Mining Shiben Sheth 1, Gaurav Ambekar 2, Abhilasha Sable 3, Tushar Chikane 4, Kranti Ghag 5 1, 2, 3, 4 B.E Student, SAKEC, Chembur, Department of

More information

Learning to Play like an Othello Master CS 229 Project Report. Shir Aharon, Amanda Chang, Kent Koyanagi

Learning to Play like an Othello Master CS 229 Project Report. Shir Aharon, Amanda Chang, Kent Koyanagi Learning to Play like an Othello Master CS 229 Project Report December 13, 213 1 Abstract This project aims to train a machine to strategically play the game of Othello using machine learning. Prior to

More information

Using Decision Trees for Identification of Most Relevant Indicators for Effective ICT Utilization

Using Decision Trees for Identification of Most Relevant Indicators for Effective ICT Utilization BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 13, No 1 Sofia 2013 Print ISSN: 1311-9702; Online ISSN: 1314-4081 DOI: 10.2478/cait-2013-0008 Using Decision Trees for Identification

More information

Machine Learning and Data Mining Course Summary

Machine Learning and Data Mining Course Summary Machine Learning and Data Mining Course Summary Outline Data Mining and Society Discrimination, Privacy, and Security Hype Curve Future Directions Course Summary 2 Controversial Issues Data mining (or

More information

ArcGIS Pro: What s New in Analysis

ArcGIS Pro: What s New in Analysis Federal GIS Conference February 9 10, 2015 Washington, DC ArcGIS Pro: What s New in Analysis James Sullivan What is analysis? Analysis transforms raw data into information or knowledge. Spatial analysis

More information

SELECTING RELEVANT DATA

SELECTING RELEVANT DATA EXPLORATORY ANALYSIS The data that will be used comes from the reviews_beauty.json.gz file which contains information about beauty products that were bought and reviewed on Amazon.com. Each data point

More information

Performance of Specific vs. Generic Feature Sets in Polyphonic Music Instrument Recognition

Performance of Specific vs. Generic Feature Sets in Polyphonic Music Instrument Recognition Performance of Specific vs. Generic Feature Sets in Polyphonic Music Instrument Recognition Igor Vatolkin 1, Anil Nagathil 2, Wolfgang Theimer 3, Rainer Martin 2 1 ChairofAlgorithmEngineering, TU Dortmund

More information

Lesson 08. Convolutional Neural Network. Ing. Marek Hrúz, Ph.D. Katedra Kybernetiky Fakulta aplikovaných věd Západočeská univerzita v Plzni.

Lesson 08. Convolutional Neural Network. Ing. Marek Hrúz, Ph.D. Katedra Kybernetiky Fakulta aplikovaných věd Západočeská univerzita v Plzni. Lesson 08 Convolutional Neural Network Ing. Marek Hrúz, Ph.D. Katedra Kybernetiky Fakulta aplikovaných věd Západočeská univerzita v Plzni Lesson 08 Convolution we will consider 2D convolution the result

More information

Data Mining for Healthcare Data: A Comparison of Neural Networks Algorithms

Data Mining for Healthcare Data: A Comparison of Neural Networks Algorithms Data Mining for Healthcare 10 Data Mining for Healthcare Data: A Comparison of Neural Networks Algorithms 1 Debby E. Sondakh Universitas Klabat, Jln. Arnold Mononutu, Airmadidi Minahasa Utara 1 Program

More information

IBM SPSS Neural Networks

IBM SPSS Neural Networks IBM Software IBM SPSS Neural Networks 20 IBM SPSS Neural Networks New tools for building predictive models Highlights Explore subtle or hidden patterns in your data. Build better-performing models No programming

More information

Kalman Filters and Adaptive Windows for Learning in Data Streams

Kalman Filters and Adaptive Windows for Learning in Data Streams Kalman Filters and Adaptive Windows for Learning in Data Streams Albert Bifet Ricard Gavaldà Universitat Politècnica de Catalunya DS 06 Barcelona A. Bifet, R. Gavaldà (UPC) Kalman Filters and Adaptive

More information

Decision Tree Analysis in Game Informatics

Decision Tree Analysis in Game Informatics Decision Tree Analysis in Game Informatics Masato Konishi, Seiya Okubo, Tetsuro Nishino and Mitsuo Wakatsuki Abstract Computer Daihinmin involves playing Daihinmin, a popular card game in Japan, by using

More information

Workshop on anonymization Berlin, March 19, Basic Knowledge Terms, Definitions and general techniques. Murat Sariyar TMF

Workshop on anonymization Berlin, March 19, Basic Knowledge Terms, Definitions and general techniques. Murat Sariyar TMF Workshop on anonymization Berlin, March 19, 2015 Basic Knowledge Terms, Definitions and general techniques Murat Sariyar TMF Workshop Anonymisation, March 19, 2015 Outline Background Aims of Anonymization

More information

Comparative Study of various Surveys on Sentiment Analysis

Comparative Study of various Surveys on Sentiment Analysis Comparative Study of various Surveys on Milanjit Kaur 1, Deepak Kumar 2. 1 Student (M.Tech Scholar), Computer Science and Engineering, Lovely Professional University, Punjab, India. 2 Assistant Professor,

More information

The Game-Theoretic Approach to Machine Learning and Adaptation

The Game-Theoretic Approach to Machine Learning and Adaptation The Game-Theoretic Approach to Machine Learning and Adaptation Nicolò Cesa-Bianchi Università degli Studi di Milano Nicolò Cesa-Bianchi (Univ. di Milano) Game-Theoretic Approach 1 / 25 Machine Learning

More information

STARCRAFT 2 is a highly dynamic and non-linear game.

STARCRAFT 2 is a highly dynamic and non-linear game. JOURNAL OF COMPUTER SCIENCE AND AWESOMENESS 1 Early Prediction of Outcome of a Starcraft 2 Game Replay David Leblanc, Sushil Louis, Outline Paper Some interesting things to say here. Abstract The goal

More information

Application of Data Mining in Multiobjective Optimization Problems

Application of Data Mining in Multiobjective Optimization Problems Int. J. Simul. Multisci. Des. Optim. 2014, 5, A15 A. Mosavi, Published by EDP Sciences, 2014 DOI: 10.1051/smdo/2013002 ASMDO 2011 Available online at: http://www.ijsmdo.org Application of Data Mining in

More information

Learning Deep Networks from Noisy Labels with Dropout Regularization

Learning Deep Networks from Noisy Labels with Dropout Regularization Learning Deep Networks from Noisy Labels with Dropout Regularization Ishan Jindal*, Matthew Nokleby*, Xuewen Chen** *Department of Electrical and Computer Engineering **Department of Computer Science Wayne

More information

Nicolas Lachiche (Université de Strasbourg, France)

Nicolas Lachiche (Université de Strasbourg, France) http://www.reframe-d2k.org/ Rethinking the Essence, Flexibility and Reusability of Advanced Model Exploitation Nicolas Lachiche (Université de Strasbourg, France) Bern, 28 April 2016 Scientific background

More information

Wireless Sensor Network Assited Fire Detection And Prevention With Classification Algorithms

Wireless Sensor Network Assited Fire Detection And Prevention With Classification Algorithms International Journal of Emerging Trends in Science and Technology Wireless Sensor Network Assited Fire Detection And Prevention With Classification Algorithms Brinda.s Student of M.Tech Information and

More information

NEURAL NETWORK DEMODULATOR FOR QUADRATURE AMPLITUDE MODULATION (QAM)

NEURAL NETWORK DEMODULATOR FOR QUADRATURE AMPLITUDE MODULATION (QAM) NEURAL NETWORK DEMODULATOR FOR QUADRATURE AMPLITUDE MODULATION (QAM) Ahmed Nasraden Milad M. Aziz M Rahmadwati Artificial neural network (ANN) is one of the most advanced technology fields, which allows

More information

CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS. Kuan-Chuan Peng and Tsuhan Chen

CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS. Kuan-Chuan Peng and Tsuhan Chen CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS Kuan-Chuan Peng and Tsuhan Chen Cornell University School of Electrical and Computer Engineering Ithaca, NY 14850

More information

Frictional Force (32 Points)

Frictional Force (32 Points) Dual-Range Force Sensor Frictional Force (32 Points) Computer 19 Friction is a force that resists motion. It involves objects in contact with each other, and it can be either useful or harmful. Friction

More information

Stacking Ensemble for auto ml

Stacking Ensemble for auto ml Stacking Ensemble for auto ml Khai T. Ngo Thesis submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the degree of Master

More information

TD-Leaf(λ) Giraffe: Using Deep Reinforcement Learning to Play Chess. Stefan Lüttgen

TD-Leaf(λ) Giraffe: Using Deep Reinforcement Learning to Play Chess. Stefan Lüttgen TD-Leaf(λ) Giraffe: Using Deep Reinforcement Learning to Play Chess Stefan Lüttgen Motivation Learn to play chess Computer approach different than human one Humans search more selective: Kasparov (3-5

More information

TAKING ACTION: FRAUD DETECTION, INVESTIGATION AND RESOLUTION USING DATA WAREHOUSE AND DATA MINING TECHNIQUES TO FIGHT FRAUD

TAKING ACTION: FRAUD DETECTION, INVESTIGATION AND RESOLUTION USING DATA WAREHOUSE AND DATA MINING TECHNIQUES TO FIGHT FRAUD TAKING ACTION: FRAUD DETECTION, INVESTIGATION AND RESOLUTION USING DATA WAREHOUSE AND DATA MINING TECHNIQUES TO FIGHT FRAUD In this session, we will use the data warehouse model to illustrate fraud investigation

More information

A Study on Forecasting System of Patent Registration Based on Bayesian Network

A Study on Forecasting System of Patent Registration Based on Bayesian Network Intelligent Information Management, 2012, 4, 284-290 http://dx.doi.org/10.4236/iim.2012.425040 Published Online October 2012 (http://www.scirp.org/journal/iim) A Study on Forecasting System of Patent Registration

More information

IJITKMI Volume 7 Number 2 Jan June 2014 pp (ISSN ) Impact of attribute selection on the accuracy of Multilayer Perceptron

IJITKMI Volume 7 Number 2 Jan June 2014 pp (ISSN ) Impact of attribute selection on the accuracy of Multilayer Perceptron Impact of attribute selection on the accuracy of Multilayer Perceptron Niket Kumar Choudhary 1, Yogita Shinde 2, Rajeswari Kannan 3, Vaithiyanathan Venkatraman 4 1,2 Dept. of Computer Engineering, Pimpri-Chinchwad

More information

Small power load disaggregation in office buildings based on electrical signature classification

Small power load disaggregation in office buildings based on electrical signature classification Small power load disaggregation in office buildings based on electrical signature classification Conference or Workshop Item Accepted Version Rodriguez, A., Smith, S. T., Kiff, A. and Potter, B. (2016)

More information

Microsoft Excel: Data Analysis & Graphing. College of Engineering Engineering Education Innovation Center

Microsoft Excel: Data Analysis & Graphing. College of Engineering Engineering Education Innovation Center Microsoft Excel: Data Analysis & Graphing College of Engineering Engineering Education Innovation Center Objectives Use relative, absolute, and mixed cell referencing Identify the types of graphs and their

More information

A Metric-Based Machine Learning Approach to Genealogical Record Linkage

A Metric-Based Machine Learning Approach to Genealogical Record Linkage A Metric-Based Machine Learning Approach to Genealogical Record Linkage S. Ivie, G. Henry, H. Gatrell and C. Giraud-Carrier Department of Computer Science, Brigham Young University Abstract Genealogical

More information

Techniques for Sentiment Analysis survey

Techniques for Sentiment Analysis survey I J C T A, 9(41), 2016, pp. 355-360 International Science Press ISSN: 0974-5572 Techniques for Sentiment Analysis survey Anu Sharma* and Savleen Kaur** ABSTRACT A Sentiment analysis is a technique to analyze

More information

Analysis of Data Mining Methods for Social Media

Analysis of Data Mining Methods for Social Media 65 Analysis of Data Mining Methods for Social Media Keshav S Rawat Department of Computer Science & Informatics, Central university of Himachal Pradesh Dharamshala (Himachal Pradesh) Email:Keshav79699@gmail.com

More information

Ambient Water Usage Sensor for the Identification of Daily Activities

Ambient Water Usage Sensor for the Identification of Daily Activities Ambient Water Usage Sensor for the Identification of Daily Activities Dipl.-Ing. Alexander Gerka OFFIS Institute for Information Technology, Oldenburg, Germany 2 Agenda Introduction Detection of Activitities

More information

The Originative Statistical Regression Models: Are They Too Old and Untenable? To Fit or Not to Fit Data to a Model: That is the Question.

The Originative Statistical Regression Models: Are They Too Old and Untenable? To Fit or Not to Fit Data to a Model: That is the Question. 1 Objectives 1.To poll the titled and untitled questions. 2.To offer my answer with illustrative examples (2) and recent projects (2). The Originative Statistical Regression Models: Are They Too Old and

More information

Information Sociology

Information Sociology Information Sociology Educational Objectives: 1. To nurture qualified experts in the information society; 2. To widen a sociological global perspective;. To foster community leaders based on Christianity.

More information

Step-by-Step Guide Query Studio Grouping, Sorting, Sectioning, Filtering and Calculations. Grouping

Step-by-Step Guide Query Studio Grouping, Sorting, Sectioning, Filtering and Calculations. Grouping There are several ways that data contained in reports run using Query Studio can be formatted. Data can be auto-summarized by grouping the data based on specific criteria, data can be sorted in ascending

More information

Application of machine learning techniques for solving real world business problems The case study target marketing of insurance policies

Application of machine learning techniques for solving real world business problems The case study target marketing of insurance policies Application of machine learning techniques for solving real world business problems The case study target marketing of insurance policies Ineta Juozenaite Project Work presented as partial requirement

More information

Autocomplete Sketch Tool

Autocomplete Sketch Tool Autocomplete Sketch Tool Sam Seifert, Georgia Institute of Technology Advanced Computer Vision Spring 2016 I. ABSTRACT This work details an application that can be used for sketch auto-completion. Sketch

More information

A Machine Learning Based Approach for Predicting Undisclosed Attributes in Social Networks

A Machine Learning Based Approach for Predicting Undisclosed Attributes in Social Networks A Machine Learning Based Approach for Predicting Undisclosed Attributes in Social Networks Gergely Kótyuk Laboratory of Cryptography and Systems Security (CrySyS) Budapest University of Technology and

More information

Latest trends in sentiment analysis - A survey

Latest trends in sentiment analysis - A survey Latest trends in sentiment analysis - A survey Anju Rose G Punneliparambil PG Scholar Department of Computer Science & Engineering Govt. Engineering College, Thrissur, India anjurose.ar@gmail.com Abstract

More information

The method requires foreground and background sequence datasets. The users can use fasta files as input.

The method requires foreground and background sequence datasets. The users can use fasta files as input. 1 Introduction he emergence of hip-seq technology for genome-wide profiling of transcription factor binding sites (FBS) has made it possible to categorize very precisely the FBS motifs. How to harness

More information

Inference of Opponent s Uncertain States in Ghosts Game using Machine Learning

Inference of Opponent s Uncertain States in Ghosts Game using Machine Learning Inference of Opponent s Uncertain States in Ghosts Game using Machine Learning Sehar Shahzad Farooq, HyunSoo Park, and Kyung-Joong Kim* sehar146@gmail.com, hspark8312@gmail.com,kimkj@sejong.ac.kr* Department

More information

COMPARISON OF MACHINE LEARNING ALGORITHMS IN WEKA

COMPARISON OF MACHINE LEARNING ALGORITHMS IN WEKA COMPARISON OF MACHINE LEARNING ALGORITHMS IN WEKA Clive Almeida 1, Mevito Gonsalves 2 & Manimozhi R 3 International Journal of Latest Trends in Engineering and Technology Special Issue SACAIM 2017, pp.

More information

Automating NSF HERD Reporting Using Machine Learning and Administrative Data

Automating NSF HERD Reporting Using Machine Learning and Administrative Data Automating NSF HERD Reporting Using Machine Learning and Administrative Data Rodolfo H. Torres CIMA Session: The Use of Advance Analytics to Drive Decisions 2018 APLU Annual Meeting New Orleans Marriott,

More information

Lecture 3 - Regression

Lecture 3 - Regression Lecture 3 - Regression Instructor: Prof Ganesh Ramakrishnan July 25, 2016 1 / 30 The Simplest ML Problem: Least Square Regression Curve Fitting: Motivation Error measurement Minimizing Error Method of

More information

Classification of photographic images based on perceived aesthetic quality

Classification of photographic images based on perceived aesthetic quality Classification of photographic images based on perceived aesthetic quality Jeff Hwang Department of Electrical Engineering, Stanford University Sean Shi Department of Electrical Engineering, Stanford University

More information

A TWO-PART PREDICTIVE CODER FOR MULTITASK SIGNAL COMPRESSION. Scott Deeann Chen and Pierre Moulin

A TWO-PART PREDICTIVE CODER FOR MULTITASK SIGNAL COMPRESSION. Scott Deeann Chen and Pierre Moulin A TWO-PART PREDICTIVE CODER FOR MULTITASK SIGNAL COMPRESSION Scott Deeann Chen and Pierre Moulin University of Illinois at Urbana-Champaign Department of Electrical and Computer Engineering 5 North Mathews

More information

Image Extraction using Image Mining Technique

Image Extraction using Image Mining Technique IOSR Journal of Engineering (IOSRJEN) e-issn: 2250-3021, p-issn: 2278-8719 Vol. 3, Issue 9 (September. 2013), V2 PP 36-42 Image Extraction using Image Mining Technique Prof. Samir Kumar Bandyopadhyay,

More information

League of Legends: Dynamic Team Builder

League of Legends: Dynamic Team Builder League of Legends: Dynamic Team Builder Blake Reed Overview The project that I will be working on is a League of Legends companion application which provides a user data about different aspects of the

More information

A Machine Learning Approach to Real Time Earthquake Classification for the Southern California Early Response Warning System

A Machine Learning Approach to Real Time Earthquake Classification for the Southern California Early Response Warning System A Machine Learning Approach to Real Time Earthquake Classification for the Southern California Early Response Warning System Anshul Ramachandran (aramacha@caltech.edu) Suraj Nair (snair@caltech.edu) Ashwin

More information

Using a genetic algorithm for mining patterns from Endgame Databases

Using a genetic algorithm for mining patterns from Endgame Databases 0 African Conference for Sofware Engineering and Applied Computing Using a genetic algorithm for mining patterns from Endgame Databases Heriniaina Andry RABOANARY Department of Computer Science Institut

More information

CS221 Project Final Report Gomoku Game Agent

CS221 Project Final Report Gomoku Game Agent CS221 Project Final Report Gomoku Game Agent Qiao Tan qtan@stanford.edu Xiaoti Hu xiaotihu@stanford.edu 1 Introduction Gomoku, also know as five-in-a-row, is a strategy board game which is traditionally

More information

An Investigation of Scalable Anomaly Detection Techniques for a Large Network of Wi-Fi Hotspots

An Investigation of Scalable Anomaly Detection Techniques for a Large Network of Wi-Fi Hotspots An Investigation of Scalable Anomaly Detection Techniques for a Large Network of Wi-Fi Hotspots Pheeha Machaka 1 and Antoine Bagula 2 1 Council for Scientific and Industrial Research, Modelling and Digital

More information

Constructing local discriminative features for signal classification

Constructing local discriminative features for signal classification Constructing local discriminative features for signal classification Local features for signal classification Outline Motivations Problem formulation Lifting scheme Local features Conclusions Toy example

More information

LifeCLEF Bird Identification Task 2016

LifeCLEF Bird Identification Task 2016 LifeCLEF Bird Identification Task 2016 The arrival of deep learning Alexis Joly, Inria Zenith Team, Montpellier, France Hervé Glotin, Univ. Toulon, UMR LSIS, Institut Universitaire de France Hervé Goëau,

More information

Building a more stable predictive logistic regression model. Anna Elizabeth Campain

Building a more stable predictive logistic regression model. Anna Elizabeth Campain Building a more stable predictive logistic regression model Anna Elizabeth Campain Common problems when working with clinical data Missing data Imbalanced class distribution Unstable logistic regression

More information

Social Media Intelligence in Practice: The NEREUS Experimental Platform. Dimitris Gritzalis & Vasilis Stavrou June 2015

Social Media Intelligence in Practice: The NEREUS Experimental Platform. Dimitris Gritzalis & Vasilis Stavrou June 2015 Social Media Intelligence in Practice: The NEREUS Experimental Platform Dimitris Gritzalis & Vasilis Stavrou June 2015 Social Media Intelligence in Practice: The NEREUS Experimental Platform 3 rd Hellenic

More information

2007 Census of Agriculture Non-Response Methodology

2007 Census of Agriculture Non-Response Methodology 2007 Census of Agriculture Non-Response Methodology Will Cecere National Agricultural Statistics Service Research and Development Division, U.S. Department of Agriculture, 3251 Old Lee Highway, Fairfax,

More information

Information Infrastructure II (Data Mining) I211

Information Infrastructure II (Data Mining) I211 Information Infrastructure II (Data Mining) I211 Spring 2010 Basic Information Class meets: Time: MW 9:30am 10:45am Place: I2 130 Instructor: Predrag Radivojac Office: Informatics 219 Email: predrag@indiana.edu

More information

Supervised Versus Unsupervised Binary-Learning by Feedforward Neural Networks

Supervised Versus Unsupervised Binary-Learning by Feedforward Neural Networks Machine Learning, 42, 97 122, 2001 c 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. Supervised Versus Unsupervised Binary-Learning by Feedforward Neural Networks NATHALIE JAPKOWICZ nat@site.uottawa.ca

More information

Analysis of Economic Data

Analysis of Economic Data Analysis of Economic Data CHUNG-MING KUAN Department of Finance & CRETA National Taiwan University September 14, 2014 C.-M. Kuan (Finance & CRETA, NTU) Analysis of Economic Data September 14, 2014 1 /

More information

IES, Faculty of Social Sciences, Charles University in Prague

IES, Faculty of Social Sciences, Charles University in Prague IMPACT OF INTELLECTUAL PROPERTY RIGHTS AND GOVERNMENTAL POLICY ON INCOME INEQUALITY. Ing. Oksana Melikhova, Ph.D. 1, 1 IES, Faculty of Social Sciences, Charles University in Prague Faculty of Mathematics

More information

Image Finder Mobile Application Based on Neural Networks

Image Finder Mobile Application Based on Neural Networks Image Finder Mobile Application Based on Neural Networks Nabil M. Hewahi Department of Computer Science, College of Information Technology, University of Bahrain, Sakheer P.O. Box 32038, Kingdom of Bahrain

More information

MatMap: An OpenSource Indoor Localization System

MatMap: An OpenSource Indoor Localization System MatMap: An OpenSource Indoor Localization System Richard Ižip and Marek Šuppa Faculty of Mathematics, Physics and Informatics, Comenius University, Bratislava, Slovakia izip1@uniba.sk, suppa1@uniba.sk,

More information

An Optimization of Audio Classification and Segmentation using GASOM Algorithm

An Optimization of Audio Classification and Segmentation using GASOM Algorithm An Optimization of Audio Classification and Segmentation using GASOM Algorithm Dabbabi Karim, Cherif Adnen Research Unity of Processing and Analysis of Electrical and Energetic Systems Faculty of Sciences

More information

Lecture5: Lossless Compression Techniques

Lecture5: Lossless Compression Techniques Fixed to fixed mapping: we encoded source symbols of fixed length into fixed length code sequences Fixed to variable mapping: we encoded source symbols of fixed length into variable length code sequences

More information

IDENTICAL AND FRATERNAL TWIN RECOGNITION USING PHOTOPLETHYSMOGRAM SIGNALS

IDENTICAL AND FRATERNAL TWIN RECOGNITION USING PHOTOPLETHYSMOGRAM SIGNALS IDENTICAL AND FRATERNAL TWIN RECOGNITION USING PHOTOPLETHYSMOGRAM SIGNALS NurIzzati Mohammed Nadzri and Khairul Azami Sidek Department of Electrical and Computer Engineering, Faculty of Engineering, International

More information

Predicting Video Game Popularity With Tweets

Predicting Video Game Popularity With Tweets Predicting Video Game Popularity With Tweets Casey Cabrales (caseycab), Helen Fang (hfang9) December 10,2015 Task Definition Given a set of Twitter tweets from a given day, we want to determine the peak

More information

Machine Learning and Decision Making for Sustainability

Machine Learning and Decision Making for Sustainability Machine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford University April 12 Overview Stanford Artificial Intelligence Lab Fellow, Woods Institute for

More information

k-nearest Neighbors Algorithm in Profiling Power Analysis Attacks

k-nearest Neighbors Algorithm in Profiling Power Analysis Attacks RADIOENGINEERING, VOL. 25, NO. 2, JUNE 2016 365 k-nearest Neighbors Algorithm in Profiling Power Analysis Attacks Zdenek MARTINASEK 1, Vaclav ZEMAN 1, Lukas MALINA 1, Josef MARTINASEK 2 1 Dept. of Telecommunications,

More information

Supervisors: Rachel Cardell-Oliver Adrian Keating. Program: Bachelor of Computer Science (Honours) Program Dates: Semester 2, 2014 Semester 1, 2015

Supervisors: Rachel Cardell-Oliver Adrian Keating. Program: Bachelor of Computer Science (Honours) Program Dates: Semester 2, 2014 Semester 1, 2015 Supervisors: Rachel Cardell-Oliver Adrian Keating Program: Bachelor of Computer Science (Honours) Program Dates: Semester 2, 2014 Semester 1, 2015 Background Aging population [ABS2012, CCE09] Need to

More information

GE 113 REMOTE SENSING

GE 113 REMOTE SENSING GE 113 REMOTE SENSING Topic 8. Image Classification and Accuracy Assessment Lecturer: Engr. Jojene R. Santillan jrsantillan@carsu.edu.ph Division of Geodetic Engineering College of Engineering and Information

More information

Prediction of Electromagnetic Fields around High Voltage Transmission Lines

Prediction of Electromagnetic Fields around High Voltage Transmission Lines Acta Technica Jaurinensis Vol. 10, No. 1, pp. 50-58, 2017 DOI: 10.14513/actatechjaur.v10.n1.414 Available online at acta.sze.hu Prediction of Electromagnetic Fields around High Voltage Transmission Lines

More information

COMPARATIVE ANALYSIS OF ACCURACY ON MISSING DATA USING MLP AND RBF METHOD V.B. Kamble 1, S.N. Deshmukh 2 1

COMPARATIVE ANALYSIS OF ACCURACY ON MISSING DATA USING MLP AND RBF METHOD V.B. Kamble 1, S.N. Deshmukh 2 1 COMPARATIVE ANALYSIS OF ACCURACY ON MISSING DATA USING MLP AND RBF METHOD V.B. Kamble 1, S.N. Deshmukh 2 1 P.E.S. College of Engineering, Aurangabad. (M.S.) India. 2 Dr. Babasaheb Ambedkar Marathwada University,

More information

Department of Statistics and Operations Research Undergraduate Programmes

Department of Statistics and Operations Research Undergraduate Programmes Department of Statistics and Operations Research Undergraduate Programmes OPERATIONS RESEARCH YEAR LEVEL 2 INTRODUCTION TO LINEAR PROGRAMMING SSOA021 Linear Programming Model: Formulation of an LP model;

More information

The Art of Neural Nets

The Art of Neural Nets The Art of Neural Nets Marco Tavora marcotav65@gmail.com Preamble The challenge of recognizing artists given their paintings has been, for a long time, far beyond the capability of algorithms. Recent advances

More information