Introduction to Machine Learning

Introduction to Machine Learning: Perceptron
Barnabás Póczos

Contents
  History of Artificial Neural Networks
  Definitions: Perceptron, Multi-Layer Perceptron
  Perceptron algorithm

Short History of Artificial Neural Networks

Short History: Progression (1943-1960)
  First mathematical model of neurons: Pitts & McCulloch (1943)
  Beginning of artificial neural networks
  Perceptron, Rosenblatt (1958)
    A single neuron for classification
    Perceptron learning rule
    Perceptron convergence theorem

Short History: Degression (1960-1980)
  The perceptron can't even learn the XOR function
  We don't know how to train an MLP
  1963: Backpropagation, but it received little attention
    Bryson, A.E.; W.F. Denham; S.E. Dreyfus. Optimal programming problems with inequality constraints. I: Necessary conditions for extremal solutions. AIAA J. 1, 11 (1963) 2544-2550

Short History: Progression (1980-)
  1986: Backpropagation reinvented
    Rumelhart, Hinton, Williams: Learning representations by back-propagating errors. Nature, 323, 533-536, 1986
  Successful applications: character recognition, autonomous cars, ...
  Open questions: Overfitting? Network structure? Neuron number? Layer number? Bad local minimum points? When to stop training?
  Hopfield nets (1982), Boltzmann machines, ...

Short History: Degression (1993-)
  SVM: Vapnik and his co-workers developed the Support Vector Machine (1993). It is a shallow architecture.
  SVMs and graphical models almost killed ANN research.
  Training deeper networks consistently yielded poor results.
  Exception: deep convolutional neural networks, Yann LeCun 1998 (discriminative model).

Short History: Progression (2006-)
  Deep Belief Networks (DBN)
    Hinton, G. E., Osindero, S., and Teh, Y. W. (2006). A fast learning algorithm for deep belief nets. Neural Computation, 18:1527-1554.
    Generative graphical model
    Based on restricted Boltzmann machines
    Can be trained efficiently
  Deep autoencoder-based networks
    Bengio, Y., Lamblin, P., Popovici, D., Larochelle, H. (2007). Greedy Layer-Wise Training of Deep Networks. Advances in Neural Information Processing Systems 19
  Convolutional neural networks running on GPUs
    Alex Krizhevsky, Ilya Sutskever, Geoffrey Hinton. Advances in Neural Information Processing Systems, 2012

The Neuron
  Each neuron has a body, an axon, and many dendrites.
  A neuron can fire or rest.
  If the sum of weighted inputs is larger than a threshold, then the neuron fires.
  Synapses: the gaps between the axon and other neurons' dendrites. They determine the weights in the sum.

The Mathematical Model of a Neuron
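
The firing rule described above (weighted sum of inputs compared against a threshold) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names neuron_output and threshold, the bias convention (bias = minus the firing threshold), and the AND-gate weights are chosen for the example and do not come from the slides.

```python
def neuron_output(inputs, weights, bias, activation):
    """Weighted sum of the inputs plus a bias, passed through an activation."""
    z = sum(w * x for w, x in zip(weights, inputs)) + bias
    return activation(z)


def threshold(z):
    """Fire (1) when the net input is positive, rest (0) otherwise."""
    return 1 if z > 0 else 0


# Hypothetical example: a single neuron computing logical AND.
# The bias -1.5 plays the role of a firing threshold of 1.5.
and_weights = [1.0, 1.0]
and_bias = -1.5
outputs = [
    neuron_output([a, b], and_weights, and_bias, threshold)
    for a in (0, 1)
    for b in (0, 1)
]
print(outputs)  # [0, 0, 0, 1]
```

Folding the threshold into a bias term is a common convention because it lets the firing condition be written as "net input greater than zero".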

Typical activation functions
  Identity function
  Threshold function (perceptron)
  Ramp function
  Logistic function
  Hyperbolic tangent function
  Rectified Linear Unit (ReLU)
  Softplus function (a smooth approximation of ReLU)
  Leaky ReLU
  Exponential Linear Unit (ELU)
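
The formulas for these activation functions did not survive in the transcript, so the following Python sketch uses their commonly cited definitions. The parameter defaults (slope=0.01, alpha=1.0) and the choice to clip the ramp function to the interval [0, 1] are assumptions; the slides may define them differently.

```python
import math


def identity(z):
    return z


def threshold(z):
    # Step function used by the perceptron: 1 if the net input is positive.
    return 1.0 if z > 0 else 0.0


def ramp(z):
    # Assumption: linear between 0 and 1, clipped outside that range.
    return max(0.0, min(1.0, z))


def logistic(z):
    # Not numerically hardened: very negative z overflows math.exp.
    return 1.0 / (1.0 + math.exp(-z))


def hyperbolic_tangent(z):
    return math.tanh(z)


def relu(z):
    return max(0.0, z)


def softplus(z):
    # Smooth approximation of ReLU: log(1 + exp(z)).
    return math.log1p(math.exp(z))


def leaky_relu(z, slope=0.01):
    # Small non-zero slope for negative inputs (slope value is an assumption).
    return z if z > 0 else slope * z


def elu(z, alpha=1.0):
    # Exponential Linear Unit (alpha value is an assumption).
    return z if z > 0 else alpha * (math.exp(z) - 1.0)
```

For large positive inputs softplus(z) approaches relu(z), which is the sense in which it is a smooth approximation.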


Structure of Neural Networks

Fully Connected Neural Network
  Input neurons, hidden neurons, output neurons

Layers, Feedforward Neural Networks
  Convention: the input layer is Layer 0.

Multilayer Perceptron
  Multilayer perceptron: connections only between Layer i and Layer i+1.
  The most popular architecture.
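
As a rough illustration of "connections only between Layer i and Layer i+1", a forward pass through a multilayer perceptron can be sketched as below. The function names, the logistic activation, and the weights in the example are all hypothetical; only the layer-to-layer structure and the "input layer is Layer 0" convention come from the slides.

```python
import math


def logistic(z):
    return 1.0 / (1.0 + math.exp(-z))


def layer_forward(inputs, weights, biases, activation):
    """One fully connected layer: each row of `weights` feeds one neuron."""
    return [
        activation(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


def mlp_forward(x, layers, activation=logistic):
    """Propagate x through the layers in order; Layer 0 is the input itself."""
    a = list(x)
    for weights, biases in layers:
        # Layer i+1 only sees the outputs of Layer i.
        a = layer_forward(a, weights, biases, activation)
    return a


# Hypothetical network: 2 inputs -> 3 hidden neurons -> 1 output neuron.
layers = [
    ([[0.5, -0.5], [1.0, 1.0], [-1.0, 0.5]], [0.0, -1.0, 0.5]),
    ([[1.0, -1.0, 0.5]], [0.0]),
]
y = mlp_forward([1.0, 2.0], layers)
print(y)
```

Each layer is stored as a (weights, biases) pair, so adding a hidden layer only means appending another pair to the list.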


Recurrent Neural Networks
  Recurrent NN: there are backward connections too.

The Perceptron

The Training Set

The Perceptron

Matlab: opengl hardwarebasic, nnd4pr

Matlab demos: nnd3pc

The Perceptron Algorithm

The perceptron learning algorithm

The perceptron algorithm: Observation

The Perceptron Algorithm
  How can we remember this rule?
  An interesting property: we do not require the learning rate to go to zero!
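
The update rule itself is not preserved in the transcript, so the following is a minimal sketch of the classical perceptron learning algorithm, assuming labels in {-1, +1}, zero initial weights, and the mistake-driven update w <- w + learning_rate * y * x. The function names and the AND training set are illustrative choices, not the slides' own example. The learning rate stays constant throughout, in line with the remark that it need not go to zero.

```python
def predict(w, b, x):
    """Return +1 if the net input is positive, otherwise -1."""
    s = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if s > 0 else -1


def train_perceptron(samples, learning_rate=1.0, max_epochs=100):
    """Classical perceptron rule: update only on misclassified samples."""
    n = len(samples[0][0])
    w = [0.0] * n
    b = 0.0
    for _ in range(max_epochs):
        mistakes = 0
        for x, y in samples:
            if predict(w, b, x) != y:
                # Move the separating hyperplane toward the correct side.
                w = [wi + learning_rate * y * xi for wi, xi in zip(w, x)]
                b += learning_rate * y
                mistakes += 1
        if mistakes == 0:
            break  # every training sample is classified correctly
    return w, b


# Hypothetical, linearly separable training set: logical AND with +/-1 labels.
data = [([0, 0], -1), ([0, 1], -1), ([1, 0], -1), ([1, 1], 1)]
w, b = train_perceptron(data)
print(all(predict(w, b, x) == y for x, y in data))  # True
```

On data that is not linearly separable (XOR, for example) the loop would simply run until max_epochs without reaching zero mistakes.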

Perceptron Convergence

Perceptron Convergence Lemma
  Using this notation, the update rule can be written as [equation lost in extraction]
  Proof: [lost in extraction]

Lower bound: [derivation lost in extraction]

Upper bound: [derivation lost in extraction]
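
For orientation, the standard textbook form of the perceptron convergence argument combines a lower bound and an upper bound in the following way. The notation here is an assumption and may differ from the slides: training pairs (x_i, y_i) with y_i in {-1, +1}, radius R, margin \gamma, a unit-norm separator w^*, learning rate 1, and w_k denoting the weights after k mistakes.

```latex
% Assumptions: \|x_i\| \le R for all i, and there is a w^* with \|w^*\| = 1
% such that y_i \langle w^*, x_i \rangle \ge \gamma > 0 for all i.
% Start from w_0 = 0; on the k-th mistake, w_k = w_{k-1} + y_i x_i.
\begin{align*}
\text{Lower bound:}\quad
  \langle w_k, w^* \rangle
  &= \langle w_{k-1}, w^* \rangle + y_i \langle x_i, w^* \rangle
   \ge \langle w_{k-1}, w^* \rangle + \gamma
   \;\Longrightarrow\; \langle w_k, w^* \rangle \ge k\gamma \\
\text{Upper bound:}\quad
  \|w_k\|^2
  &= \|w_{k-1}\|^2 + 2 y_i \langle w_{k-1}, x_i \rangle + \|x_i\|^2
   \le \|w_{k-1}\|^2 + R^2
   \;\Longrightarrow\; \|w_k\|^2 \le k R^2 \\
\text{Therefore:}\quad
  k\gamma
  &\le \langle w_k, w^* \rangle \le \|w_k\| \le \sqrt{k}\, R
   \;\Longrightarrow\; k \le \frac{R^2}{\gamma^2}
\end{align*}
```

The upper bound uses the fact that an update only happens on a mistake, so y_i \langle w_{k-1}, x_i \rangle \le 0. The conclusion is that the number of mistakes is finite on linearly separable data, whatever the order of the samples.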

The Perceptron Algorithm

Take me home!
  History of Neural Networks
  Mathematical model of the neuron
  Activation functions
  Perceptron definition
  Perceptron algorithm
  Perceptron Convergence Theorem