Spatial Transformer Networks

Size: px
Start display at page:

Download "Spatial Transformer Networks"

Transcription

1 Spatial Transformer Networks Kaichun Mo Shanghai Jiao Tong University July 28, 2015 Kaichun Mo STN July 28, / 29

2 Overview 1 Spatial Transformer Network 2 Caffe: A popular deep learning framework 3 Future Work Kaichun Mo (SJTU@Cornell) STN July 28, / 29

3 Start from the interesting part: Spatial Transformer Network! Kaichun Mo STN July 28, / 29

4 Spatial Transformer Network: Ideal Images Kaichun Mo STN July 28, / 29

5 Spatial Transformer Network: Ideal Images Kaichun Mo STN July 28, / 29

6 Spatial Transformer Network: ideal Images Kaichun Mo STN July 28, / 29

7 Spatial Transformer Network: ideal Images Kaichun Mo STN July 28, / 29

8 Spatial Transformer Network: Real Images Kaichun Mo STN July 28, / 29

9 Spatial Transformer Network: Spatial Variance Kaichun Mo STN July 28, / 29

10 Introduction: Spatial Transformer Layer A new type of layer that is designed to perserve the spatial invariance It is expected to be more powerful than pooling layer Kaichun Mo (SJTU@Cornell) STN July 28, / 29

11 Spatial Transformer Layer: Algorithm Localisation Network: For each input i, output its specific transform matrix θ i Grid generator and Sampler: Compute the transformed result using θ i Kaichun Mo (SJTU@Cornell) STN July 28, / 29

12 Spatial Transformer Network: Localisation Net Localisation Network: For each input image, try to learn the specific transformation matrix θ i by which it was deformed After learning that, we can reverse it as possible as we can Kaichun Mo (SJTU@Cornell) STN July 28, / 29

13 Spatial Transformer Network: Algorithm Localisation Network: For each input i, output its specific transform parameter θ i Grid generator and Sampler: Compute the transformed result using parameter θ i Kaichun Mo (SJTU@Cornell) STN July 28, / 29

14 Spatial Transformer Network: Grid Generator For each input, using localisation network, we know the transformation matrix to reverse the deformation Then, we need to perform this matrix on the input image by first generating the grid Kaichun Mo STN July 28, / 29

15 Spatial Transformer Network: Sampler Some projected point on the input image may not be sampled We need to use interpolation technique to sample it Ex bilinear Kaichun Mo STN July 28, / 29

16 Spatial Transformer Layer: Algorithm Localisation Network: For each input i, output its specific transform matrix θ i Grid generator and Sampler: Compute the transformed result using θ i Kaichun Mo (SJTU@Cornell) STN July 28, / 29

17 Spatial Transformer Network: Differentiability Kaichun Mo STN July 28, / 29

18 Spatial Transformer Network: Differentiability In order to learn the weights in localisation network and perform back propogation during training, we need the forwarding function to be differentiable Fortunately, it is, at least in sense of subgradient! More detail in paper Kaichun Mo STN July 28, / 29

19 Spatial Transformer Network: Advantages Efficiency: highly localized highly parallelizable GPU acceleration End-to-end training: can be seamlessly incorporated into neural network no pre-training is required Spatial Invariance: Make neural network to be less vulnerable to spatial transformations Kaichun Mo STN July 28, / 29

20 Introduction to Caffe! Kaichun Mo STN July 28, / 29

21 Caffe: Website Kaichun Mo STN July 28, / 29

22 Caffe: Why Use It? Kaichun Mo STN July 28, / 29

23 Caffe: Why Use It? Kaichun Mo STN July 28, / 29

24 Caffe: Network Definition Kaichun Mo STN July 28, / 29

25 Caffe: Layer Definition Kaichun Mo STN July 28, / 29

26 Caffe: Solver Definition Kaichun Mo STN July 28, / 29

27 Future Work Finish implementing Spatial Transformer Layer on Caffe Test its performance on different vision task This layer should be powerful whenever images are not spatially aligned and attention or localisation is necessary Kaichun Mo STN July 28, / 29

28 References Official Website: Official Tutorial: DIY Deep Learning for Vision with Caffe Spatial Transformer Networks, by Max Jaderberg, et al Kaichun Mo STN July 28, / 29

29 Thank you for listening! Q&A Kaichun Mo STN July 28, / 29

CSC321 Lecture 11: Convolutional Networks

CSC321 Lecture 11: Convolutional Networks CSC321 Lecture 11: Convolutional Networks Roger Grosse Roger Grosse CSC321 Lecture 11: Convolutional Networks 1 / 35 Overview What makes vision hard? Vison needs to be robust to a lot of transformations

More information

Lecture 23 Deep Learning: Segmentation

Lecture 23 Deep Learning: Segmentation Lecture 23 Deep Learning: Segmentation COS 429: Computer Vision Thanks: most of these slides shamelessly adapted from Stanford CS231n: Convolutional Neural Networks for Visual Recognition Fei-Fei Li, Andrej

More information

Convolutional neural networks

Convolutional neural networks Convolutional neural networks Themes Curriculum: Ch 9.1, 9.2 and http://cs231n.github.io/convolutionalnetworks/ The simple motivation and idea How it s done Receptive field Pooling Dilated convolutions

More information

GPU ACCELERATED DEEP LEARNING WITH CUDNN

GPU ACCELERATED DEEP LEARNING WITH CUDNN GPU ACCELERATED DEEP LEARNING WITH CUDNN Larry Brown Ph.D. March 2015 AGENDA 1 Introducing cudnn and GPUs 2 Deep Learning Context 3 cudnn V2 4 Using cudnn 2 Introducing cudnn and GPUs 3 HOW GPU ACCELERATION

More information

Deformable Convolutional Networks

Deformable Convolutional Networks Deformable Convolutional Networks Jifeng Dai^ With Haozhi Qi*^, Yuwen Xiong*^, Yi Li*^, Guodong Zhang*^, Han Hu, Yichen Wei Visual Computing Group Microsoft Research Asia (* interns at MSRA, ^ equal contribution)

More information

TOOLS AND PROCESSORS FOR COMPUTER VISION. Selected Results from the Embedded Vision Alliance s Spring 2017 Computer Vision Developer Survey

TOOLS AND PROCESSORS FOR COMPUTER VISION. Selected Results from the Embedded Vision Alliance s Spring 2017 Computer Vision Developer Survey TOOLS AND PROCESSORS FOR COMPUTER VISION Selected Results from the Embedded Vision Alliance s Spring 2017 Computer Vision Developer Survey 1 EXECUTIVE SUMMARY Since 2015, the Embedded Vision Alliance has

More information

Convolutional Networks Overview

Convolutional Networks Overview Convolutional Networks Overview Sargur Srihari 1 Topics Limitations of Conventional Neural Networks The convolution operation Convolutional Networks Pooling Convolutional Network Architecture Advantages

More information

Deep Learning. Dr. Johan Hagelbäck.

Deep Learning. Dr. Johan Hagelbäck. Deep Learning Dr. Johan Hagelbäck johan.hagelback@lnu.se http://aiguy.org Image Classification Image classification can be a difficult task Some of the challenges we have to face are: Viewpoint variation:

More information

Biologically Inspired Computation

Biologically Inspired Computation Biologically Inspired Computation Deep Learning & Convolutional Neural Networks Joe Marino biologically inspired computation biological intelligence flexible capable of detecting/ executing/reasoning about

More information

6. Convolutional Neural Networks

6. Convolutional Neural Networks 6. Convolutional Neural Networks CS 519 Deep Learning, Winter 2016 Fuxin Li With materials from Zsolt Kira Quiz coming up Next Tuesday (1/26) 15 minutes Topics: Optimization Basic neural networks No Convolutional

More information

Understanding Neural Networks : Part II

Understanding Neural Networks : Part II TensorFlow Workshop 2018 Understanding Neural Networks Part II : Convolutional Layers and Collaborative Filters Nick Winovich Department of Mathematics Purdue University July 2018 Outline 1 Convolutional

More information

An Introduction to Convolutional Neural Networks. Alessandro Giusti Dalle Molle Institute for Artificial Intelligence Lugano, Switzerland

An Introduction to Convolutional Neural Networks. Alessandro Giusti Dalle Molle Institute for Artificial Intelligence Lugano, Switzerland An Introduction to Convolutional Neural Networks Alessandro Giusti Dalle Molle Institute for Artificial Intelligence Lugano, Switzerland Sources & Resources - Andrej Karpathy, CS231n http://cs231n.github.io/convolutional-networks/

More information

Accelerated Impulse Response Calculation for Indoor Optical Communication Channels

Accelerated Impulse Response Calculation for Indoor Optical Communication Channels Accelerated Impulse Response Calculation for Indoor Optical Communication Channels M. Rahaim, J. Carruthers, and T.D.C. Little Department of Electrical and Computer Engineering Boston University, Boston,

More information

Image Manipulation Detection using Convolutional Neural Network

Image Manipulation Detection using Convolutional Neural Network Image Manipulation Detection using Convolutional Neural Network Dong-Hyun Kim 1 and Hae-Yeoun Lee 2,* 1 Graduate Student, 2 PhD, Professor 1,2 Department of Computer Software Engineering, Kumoh National

More information

CPSC 340: Machine Learning and Data Mining. Convolutional Neural Networks Fall 2018

CPSC 340: Machine Learning and Data Mining. Convolutional Neural Networks Fall 2018 CPSC 340: Machine Learning and Data Mining Convolutional Neural Networks Fall 2018 Admin Mike and I finish CNNs on Wednesday. After that, we will cover different topics: Mike will do a demo of training

More information

Semantic Segmentation on Resource Constrained Devices

Semantic Segmentation on Resource Constrained Devices Semantic Segmentation on Resource Constrained Devices Sachin Mehta University of Washington, Seattle In collaboration with Mohammad Rastegari, Anat Caspi, Linda Shapiro, and Hannaneh Hajishirzi Project

More information

TOOLS & PROCESSORS FOR COMPUTER VISION. Selected Results from the Embedded Vision Alliance s Fall 2017 Computer Vision Developer Survey

TOOLS & PROCESSORS FOR COMPUTER VISION. Selected Results from the Embedded Vision Alliance s Fall 2017 Computer Vision Developer Survey TOOLS & PROCESSORS FOR COMPUTER VISION Selected Results from the Embedded Vision Alliance s Fall 2017 Computer Vision Developer Survey ABOUT THE EMBEDDED VISION ALLIANCE EXECUTIVE SUMMA Y Since 2015, the

More information

Colorful Image Colorizations Supplementary Material

Colorful Image Colorizations Supplementary Material Colorful Image Colorizations Supplementary Material Richard Zhang, Phillip Isola, Alexei A. Efros {rich.zhang, isola, efros}@eecs.berkeley.edu University of California, Berkeley 1 Overview This document

More information

A Geometric Correction Method of Plane Image Based on OpenCV

A Geometric Correction Method of Plane Image Based on OpenCV Sensors & Transducers 204 by IFSA Publishing, S. L. http://www.sensorsportal.com A Geometric orrection Method of Plane Image ased on OpenV Li Xiaopeng, Sun Leilei, 2 Lou aiying, Liu Yonghong ollege of

More information

TOOLS & PROCESSORS FOR COMPUTER VISION. Selected Results from the Embedded Vision Alliance s Computer Vision Developer Survey

TOOLS & PROCESSORS FOR COMPUTER VISION. Selected Results from the Embedded Vision Alliance s Computer Vision Developer Survey TOOLS & PROCESSORS FOR COMPUTER VISION Selected Results from the Embedded Vision Alliance s Computer Vision Developer Survey JANUARY 2019 EXECUTIVE SUMMA Y Since 2015, the Embedded Vision Alliance has

More information

AI Frontiers. Dr. Dario Gil Vice President IBM Research

AI Frontiers. Dr. Dario Gil Vice President IBM Research AI Frontiers Dr. Dario Gil Vice President IBM Research 1 AI is the new IT MIT Intro to Machine Learning course: 2013 138 students 2016 302 students 2017 700 students 2 What is AI? Artificial Intelligence

More information

INFORMATION about image authenticity can be used in

INFORMATION about image authenticity can be used in 1 Constrained Convolutional Neural Networs: A New Approach Towards General Purpose Image Manipulation Detection Belhassen Bayar, Student Member, IEEE, and Matthew C. Stamm, Member, IEEE Abstract Identifying

More information

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition. ECE 289G: Paper Presentation #3 Philipp Gysel

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition. ECE 289G: Paper Presentation #3 Philipp Gysel DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition ECE 289G: Paper Presentation #3 Philipp Gysel Autonomous Car ECE 289G Paper Presentation, Philipp Gysel Slide 2 Source: maps.google.com

More information

Application of Maxwell Equations to Human Body Modelling

Application of Maxwell Equations to Human Body Modelling Application of Maxwell Equations to Human Body Modelling Fumie Costen Room E, E0c at Sackville Street Building, fc@cs.man.ac.uk The University of Manchester, U.K. February 5, 0 Fumie Costen Room E, E0c

More information

The analysis of multi-channel sound reproduction algorithms using HRTF data

The analysis of multi-channel sound reproduction algorithms using HRTF data The analysis of multichannel sound reproduction algorithms using HRTF data B. Wiggins, I. PatersonStephens, P. Schillebeeckx Processing Applications Research Group University of Derby Derby, United Kingdom

More information

Mobile SuDoKu Harvesting App

Mobile SuDoKu Harvesting App Mobile SuDoKu Harvesting App Benjamin Zwiener Department of Computer Science Doane University 1014 Boswell Ave, Crete, NE, 68333 benjamin.zwiener@doane.edu Abstract The purpose of this project was to create

More information

UNDERSTANDING LTE WITH MATLAB

UNDERSTANDING LTE WITH MATLAB UNDERSTANDING LTE WITH MATLAB FROM MATHEMATICAL MODELING TO SIMULATION AND PROTOTYPING Dr Houman Zarrinkoub MathWorks, Massachusetts, USA WILEY Contents Preface List of Abbreviations 1 Introduction 1.1

More information

WPI Precision Personnel Locator: Inverse Synthetic Array Reconciliation Tomography Performance. Co-authors: M. Lowe, D. Cyganski, R. J.

WPI Precision Personnel Locator: Inverse Synthetic Array Reconciliation Tomography Performance. Co-authors: M. Lowe, D. Cyganski, R. J. WPI Precision Personnel Locator: Inverse Synthetic Array Reconciliation Tomography Performance Presented by: Andrew Cavanaugh Co-authors: M. Lowe, D. Cyganski, R. J. Duckworth Introduction 2 PPL Project

More information

Coursework 2. MLP Lecture 7 Convolutional Networks 1

Coursework 2. MLP Lecture 7 Convolutional Networks 1 Coursework 2 MLP Lecture 7 Convolutional Networks 1 Coursework 2 - Overview and Objectives Overview: Use a selection of the techniques covered in the course so far to train accurate multi-layer networks

More information

Microphone Array Design and Beamforming

Microphone Array Design and Beamforming Microphone Array Design and Beamforming Heinrich Löllmann Multimedia Communications and Signal Processing heinrich.loellmann@fau.de with contributions from Vladi Tourbabin and Hendrik Barfuss EUSIPCO Tutorial

More information

Thermal Image Enhancement Using Convolutional Neural Network

Thermal Image Enhancement Using Convolutional Neural Network SEOUL Oct.7, 2016 Thermal Image Enhancement Using Convolutional Neural Network Visual Perception for Autonomous Driving During Day and Night Yukyung Choi Soonmin Hwang Namil Kim Jongchan Park In So Kweon

More information

PLazeR. a planar laser rangefinder. Robert Ying (ry2242) Derek Xingzhou He (xh2187) Peiqian Li (pl2521) Minh Trang Nguyen (mnn2108)

PLazeR. a planar laser rangefinder. Robert Ying (ry2242) Derek Xingzhou He (xh2187) Peiqian Li (pl2521) Minh Trang Nguyen (mnn2108) PLazeR a planar laser rangefinder Robert Ying (ry2242) Derek Xingzhou He (xh2187) Peiqian Li (pl2521) Minh Trang Nguyen (mnn2108) Overview & Motivation Detecting the distance between a sensor and objects

More information

Image representation, sampling and quantization

Image representation, sampling and quantization Image representation, sampling and quantization António R. C. Paiva ECE 6962 Fall 2010 Lecture outline Image representation Digitalization of images Changes in resolution Matlab tutorial Lecture outline

More information

Radio Deep Learning Efforts Showcase Presentation

Radio Deep Learning Efforts Showcase Presentation Radio Deep Learning Efforts Showcase Presentation November 2016 hume@vt.edu www.hume.vt.edu Tim O Shea Senior Research Associate Program Overview Program Objective: Rethink fundamental approaches to how

More information

Transformation to Artificial Intelligence with MATLAB Roy Lurie, PhD Vice President of Engineering MATLAB Products

Transformation to Artificial Intelligence with MATLAB Roy Lurie, PhD Vice President of Engineering MATLAB Products Transformation to Artificial Intelligence with MATLAB Roy Lurie, PhD Vice President of Engineering MATLAB Products 2018 The MathWorks, Inc. 1 A brief history of the automobile First Commercial Gas Car

More information

Automatic tumor segmentation in breast ultrasound images using a dilated fully convolutional network combined with an active contour model

Automatic tumor segmentation in breast ultrasound images using a dilated fully convolutional network combined with an active contour model Automatic tumor segmentation in breast ultrasound images using a dilated fully convolutional network combined with an active contour model Yuzhou Hu Departmentof Electronic Engineering, Fudan University,

More information

Harnessing the Power of AI: An Easy Start with Lattice s sensai

Harnessing the Power of AI: An Easy Start with Lattice s sensai Harnessing the Power of AI: An Easy Start with Lattice s sensai A Lattice Semiconductor White Paper. January 2019 Artificial intelligence, or AI, is everywhere. It s a revolutionary technology that is

More information

3D Data Navigation via Natural User Interfaces

3D Data Navigation via Natural User Interfaces 3D Data Navigation via Natural User Interfaces Francisco R. Ortega PhD Candidate and GAANN Fellow Co-Advisors: Dr. Rishe and Dr. Barreto Committee Members: Dr. Raju, Dr. Clarke and Dr. Zeng GAANN Fellowship

More information

Computing for Engineers in Python

Computing for Engineers in Python Computing for Engineers in Python Lecture 10: Signal (Image) Processing Autumn 2011-12 Some slides incorporated from Benny Chor s course 1 Lecture 9: Highlights Sorting, searching and time complexity Preprocessing

More information

신경망기반자동번역기술. Konkuk University Computational Intelligence Lab. 김강일

신경망기반자동번역기술. Konkuk University Computational Intelligence Lab.  김강일 신경망기반자동번역기술 Konkuk University Computational Intelligence Lab. http://ci.konkuk.ac.kr kikim01@kunkuk.ac.kr 김강일 Index Issues in AI and Deep Learning Overview of Machine Translation Advanced Techniques in

More information

Research on Hand Gesture Recognition Using Convolutional Neural Network

Research on Hand Gesture Recognition Using Convolutional Neural Network Research on Hand Gesture Recognition Using Convolutional Neural Network Tian Zhaoyang a, Cheng Lee Lung b a Department of Electronic Engineering, City University of Hong Kong, Hong Kong, China E-mail address:

More information

Decoding Brainwave Data using Regression

Decoding Brainwave Data using Regression Decoding Brainwave Data using Regression Justin Kilmarx: The University of Tennessee, Knoxville David Saffo: Loyola University Chicago Lucien Ng: The Chinese University of Hong Kong Mentor: Dr. Xiaopeng

More information

arxiv: v1 [cs.cv] 13 Aug 2017

arxiv: v1 [cs.cv] 13 Aug 2017 arxiv:1708.03898v1 [cs.cv] 13 Aug 2017 AN EXTREMELY EFFICIENT CHESS-BOARD DETECTION FOR NON-TRIVIAL PHOTOS Maciej A. Czyzewski mail@maciejczyzewski.me August 15, 2017 Abstract. We present a set of algorithms

More information

Supplementary Material for Generative Adversarial Perturbations

Supplementary Material for Generative Adversarial Perturbations Supplementary Material for Generative Adversarial Perturbations Omid Poursaeed 1,2 Isay Katsman 1 Bicheng Gao 3,1 Serge Belongie 1,2 1 Cornell University 2 Cornell Tech 3 Shanghai Jiao Tong University

More information

A Neural Algorithm of Artistic Style (2015)

A Neural Algorithm of Artistic Style (2015) A Neural Algorithm of Artistic Style (2015) Leon A. Gatys, Alexander S. Ecker, Matthias Bethge Nancy Iskander (niskander@dgp.toronto.edu) Overview of Method Content: Global structure. Style: Colours; local

More information

CS 229 Final Project: Using Reinforcement Learning to Play Othello

CS 229 Final Project: Using Reinforcement Learning to Play Othello CS 229 Final Project: Using Reinforcement Learning to Play Othello Kevin Fry Frank Zheng Xianming Li ID: kfry ID: fzheng ID: xmli 16 December 2016 Abstract We built an AI that learned to play Othello.

More information

SPACE TRANSPORTATION RESEARCH & ORGANIZATION THEORY RESEARCH

SPACE TRANSPORTATION RESEARCH & ORGANIZATION THEORY RESEARCH SPACE TRANSPORTATION RESEARCH & ORGANIZATION THEORY RESEARCH ENGAGING DISPARATE COMMUNITIES FOR BUSINESS & ECONOMIC RESEARCH IN THE SPACE INDUSTRY Ken Davidian, FAA AST Director of Research, presentation

More information

Comprehensive GD&T Evaluation Software for Manufacturing Quality Control

Comprehensive GD&T Evaluation Software for Manufacturing Quality Control Comprehensive GD&T Evaluation Software for Manufacturing Quality Control Model-Based Family of Software EVOLVE SmartProfile Comprehensive GD&T Evaluation Software for Manufacturing Quality Control Easy

More information

ROAD RECOGNITION USING FULLY CONVOLUTIONAL NEURAL NETWORKS

ROAD RECOGNITION USING FULLY CONVOLUTIONAL NEURAL NETWORKS Bulletin of the Transilvania University of Braşov Vol. 10 (59) No. 2-2017 Series I: Engineering Sciences ROAD RECOGNITION USING FULLY CONVOLUTIONAL NEURAL NETWORKS E. HORVÁTH 1 C. POZNA 2 Á. BALLAGI 3

More information

Process Planning - The Link Between Varying Products and their Manufacturing Systems p. 37

Process Planning - The Link Between Varying Products and their Manufacturing Systems p. 37 Definitions and Strategies Changeability - An Introduction p. 3 Motivation p. 3 Evolution of Factories p. 7 Deriving the Objects of Changeability p. 8 Elements of Changeable Manufacturing p. 10 Factory

More information

ECC419 IMAGE PROCESSING

ECC419 IMAGE PROCESSING ECC419 IMAGE PROCESSING INTRODUCTION Image Processing Image processing is a subclass of signal processing concerned specifically with pictures. Digital Image Processing, process digital images by means

More information

Detection of Obscured Targets: Signal Processing

Detection of Obscured Targets: Signal Processing Detection of Obscured Targets: Signal Processing James McClellan and Waymond R. Scott, Jr. School of Electrical and Computer Engineering Georgia Institute of Technology Atlanta, GA 30332-0250 jim.mcclellan@ece.gatech.edu

More information

Listening with Headphones

Listening with Headphones Listening with Headphones Main Types of Errors Front-back reversals Angle error Some Experimental Results Most front-back errors are front-to-back Substantial individual differences Most evident in elevation

More information

Joint Demosaicing and Super-Resolution Imaging from a Set of Unregistered Aliased Images

Joint Demosaicing and Super-Resolution Imaging from a Set of Unregistered Aliased Images Joint Demosaicing and Super-Resolution Imaging from a Set of Unregistered Aliased Images Patrick Vandewalle a, Karim Krichane a, David Alleysson b, and Sabine Süsstrunk a a School of Computer and Communication

More information

Carnegie Mellon University, University of Pittsburgh

Carnegie Mellon University, University of Pittsburgh Carnegie Mellon University, University of Pittsburgh Carnegie Mellon University, University of Pittsburgh Artificial Intelligence (AI) and Deep Learning (DL) Overview Paola Buitrago Leader AI and BD Pittsburgh

More information

A GPU-Based Real- Time Event Detection Framework for Power System Frequency Data Streams

A GPU-Based Real- Time Event Detection Framework for Power System Frequency Data Streams Engineering Conferences International ECI Digital Archives Modeling, Simulation, And Optimization for the 21st Century Electric Power Grid Proceedings Fall 10-24-2012 A GPU-Based Real- Time Event Detection

More information

Blind Single-Image Super Resolution Reconstruction with Defocus Blur

Blind Single-Image Super Resolution Reconstruction with Defocus Blur Sensors & Transducers 2014 by IFSA Publishing, S. L. http://www.sensorsportal.com Blind Single-Image Super Resolution Reconstruction with Defocus Blur Fengqing Qin, Lihong Zhu, Lilan Cao, Wanan Yang Institute

More information

Cantag: an open source software toolkit for designing and deploying marker-based vision systems. Andrew Rice. Computer Laboratory

Cantag: an open source software toolkit for designing and deploying marker-based vision systems. Andrew Rice. Computer Laboratory Cantag: an open source software toolkit for designing and deploying marker-based vision systems Andrew Rice University of Cambridge Marker Based Vision Systems MBV systems track specific marker tags in

More information

Color Filter Array Interpolation Using Adaptive Filter

Color Filter Array Interpolation Using Adaptive Filter Color Filter Array Interpolation Using Adaptive Filter P.Venkatesh 1, Dr.V.C.Veera Reddy 2, Dr T.Ramashri 3 M.Tech Student, Department of Electrical and Electronics Engineering, Sri Venkateswara University

More information

CMSC 372 Artificial Intelligence. Fall Administrivia

CMSC 372 Artificial Intelligence. Fall Administrivia CMSC 372 Artificial Intelligence Fall 2017 Administrivia Instructor: Deepak Kumar Lectures: Mon& Wed 10:10a to 11:30a Labs: Fridays 10:10a to 11:30a Pre requisites: CMSC B206 or H106 and CMSC B231 or permission

More information

Intelligent Technology for More Advanced Autonomous Driving

Intelligent Technology for More Advanced Autonomous Driving FEATURED ARTICLES Autonomous Driving Technology for Connected Cars Intelligent Technology for More Advanced Autonomous Driving Autonomous driving is recognized as an important technology for dealing with

More information

ABSTRACT ADAPTIVE SPACE-TIME PROCESSING FOR WIRELESS COMMUNICATIONS. by Xiao Cheng Bernstein

ABSTRACT ADAPTIVE SPACE-TIME PROCESSING FOR WIRELESS COMMUNICATIONS. by Xiao Cheng Bernstein Use all capitals, single space inside the title, followed by double space. Write by in separate line, followed by a single space: Use all capitals followed by double space.. ABSTRACT ADAPTIVE SPACE-TIME

More information

CONVOLUTIONAL NEURAL NETWORKS: MOTIVATION, CONVOLUTION OPERATION, ALEXNET

CONVOLUTIONAL NEURAL NETWORKS: MOTIVATION, CONVOLUTION OPERATION, ALEXNET CONVOLUTIONAL NEURAL NETWORKS: MOTIVATION, CONVOLUTION OPERATION, ALEXNET MOTIVATION Fully connected neural network Example 1000x1000 image 1M hidden units 10 12 (= 10 6 10 6 ) parameters! Observation

More information

DeepStack: Expert-Level AI in Heads-Up No-Limit Poker. Surya Prakash Chembrolu

DeepStack: Expert-Level AI in Heads-Up No-Limit Poker. Surya Prakash Chembrolu DeepStack: Expert-Level AI in Heads-Up No-Limit Poker Surya Prakash Chembrolu AI and Games AlphaGo Go Watson Jeopardy! DeepBlue -Chess Chinook -Checkers TD-Gammon -Backgammon Perfect Information Games

More information

PRACTICAL IMAGE AND VIDEO PROCESSING USING MATLAB

PRACTICAL IMAGE AND VIDEO PROCESSING USING MATLAB PRACTICAL IMAGE AND VIDEO PROCESSING USING MATLAB OGE MARQUES Florida Atlantic University *IEEE IEEE PRESS WWILEY A JOHN WILEY & SONS, INC., PUBLICATION CONTENTS LIST OF FIGURES LIST OF TABLES FOREWORD

More information

Filters. Materials from Prof. Klaus Mueller

Filters. Materials from Prof. Klaus Mueller Filters Materials from Prof. Klaus Mueller Think More about Pixels What exactly a pixel is in an image or on the screen? Solid square? This cannot be implemented A dot? Yes, but size matters Pixel Dots

More information

ONE of the most common and robust beamforming algorithms

ONE of the most common and robust beamforming algorithms TECHNICAL NOTE 1 Beamforming algorithms - beamformers Jørgen Grythe, Norsonic AS, Oslo, Norway Abstract Beamforming is the name given to a wide variety of array processing algorithms that focus or steer

More information

23270: AUGMENTED REALITY FOR NAVIGATION AND INFORMATIONAL ADAS. Sergii Bykov Technical Lead Machine Learning 12 Oct 2017

23270: AUGMENTED REALITY FOR NAVIGATION AND INFORMATIONAL ADAS. Sergii Bykov Technical Lead Machine Learning 12 Oct 2017 23270: AUGMENTED REALITY FOR NAVIGATION AND INFORMATIONAL ADAS Sergii Bykov Technical Lead Machine Learning 12 Oct 2017 Product Vision Company Introduction Apostera GmbH with headquarter in Munich, was

More information

Presenter s biographies

Presenter s biographies 9:15 9:30 Welcome from INSPER Presenter: Luciano Soares - INSPER Presenter s biographies 9:30 10:00 Presenters: Marcio Aguiar - NVIDIA & Esteban Clua - UFF Title: CUDA 8 and Pascal Bio: Esteban Clua is

More information

Filtering Images in the Spatial Domain Chapter 3b G&W. Ross Whitaker (modified by Guido Gerig) School of Computing University of Utah

Filtering Images in the Spatial Domain Chapter 3b G&W. Ross Whitaker (modified by Guido Gerig) School of Computing University of Utah Filtering Images in the Spatial Domain Chapter 3b G&W Ross Whitaker (modified by Guido Gerig) School of Computing University of Utah 1 Overview Correlation and convolution Linear filtering Smoothing, kernels,

More information

FSI Machine Vision Training Programs

FSI Machine Vision Training Programs FSI Machine Vision Training Programs Table of Contents Introduction to Machine Vision (Course # MVC-101) Machine Vision and NeuroCheck overview (Seminar # MVC-102) Machine Vision, EyeVision and EyeSpector

More information

Real- Time Computer Vision and Robotics Using Analog VLSI Circuits

Real- Time Computer Vision and Robotics Using Analog VLSI Circuits 750 Koch, Bair, Harris, Horiuchi, Hsu and Luo Real- Time Computer Vision and Robotics Using Analog VLSI Circuits Christof Koch Wyeth Bair John. Harris Timothy Horiuchi Andrew Hsu Jin Luo Computation and

More information

AI-Driven QA: Simulating Massively Multiplayer Behavior for Debugging Games. Shuichi Kurabayashi, Ph.D. Cygames, Inc.

AI-Driven QA: Simulating Massively Multiplayer Behavior for Debugging Games. Shuichi Kurabayashi, Ph.D. Cygames, Inc. AI-Driven QA: Simulating Massively Multiplayer Behavior for Debugging Games Shuichi Kurabayashi, Ph.D. Cygames, Inc. Keio University Summary We disclose know-hows to develop an AI-driven automatic quality

More information

Deformable Deep Convolutional Generative Adversarial Network in Microwave Based Hand Gesture Recognition System

Deformable Deep Convolutional Generative Adversarial Network in Microwave Based Hand Gesture Recognition System arxiv:1711.01968v2 [stat.ml] 22 Nov 2017 Deformable Deep Convolutional Generative Adversarial Network in Microwave Based Hand Gesture Recognition System Abstract Traditional vision-based hand gesture recognition

More information

Job Description. Commitment: Must be available to work full-time hours, M-F for weeks beginning Summer of 2018.

Job Description. Commitment: Must be available to work full-time hours, M-F for weeks beginning Summer of 2018. Research Intern Director of Research We are seeking a summer intern to support the team to develop prototype 3D sensing systems based on state-of-the-art sensing technologies along with computer vision

More information

LIGHT FIELD (LF) imaging [2] has recently come into

LIGHT FIELD (LF) imaging [2] has recently come into SUBMITTED TO IEEE SIGNAL PROCESSING LETTERS 1 Light Field Image Super-Resolution using Convolutional Neural Network Youngjin Yoon, Student Member, IEEE, Hae-Gon Jeon, Student Member, IEEE, Donggeun Yoo,

More information

Google DeepMind s AlphaGo vs. world Go champion Lee Sedol

Google DeepMind s AlphaGo vs. world Go champion Lee Sedol Google DeepMind s AlphaGo vs. world Go champion Lee Sedol Review of Nature paper: Mastering the game of Go with Deep Neural Networks & Tree Search Tapani Raiko Thanks to Antti Tarvainen for some slides

More information

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition Hetal R. Thaker Atmiya Institute of Technology & science, Kalawad Road, Rajkot Gujarat, India C. K. Kumbharana,

More information

Improved Locations Through Waveform Cross-Correlation Within the Antelope Environment

Improved Locations Through Waveform Cross-Correlation Within the Antelope Environment Improved Locations Through Waveform Cross-Correlation Within the Antelope Environment David von Seggern Nevada Seismological Laboratory Antelope Users Group Meeting June 7, 2008 Outline of This Talk history

More information

Chapter 3: Alarm correlation

Chapter 3: Alarm correlation Chapter 3: Alarm correlation Algorithmic Methods of Data Mining, Fall 2005, Chapter 3: Alarm correlation 1 Part II. Episodes in sequences Chapter 3: Alarm correlation Chapter 4: Frequent episodes Chapter

More information

SWITCHED-CURRENTS an analogue technique for digital technology

SWITCHED-CURRENTS an analogue technique for digital technology SWITCHED-CURRENTS an analogue technique for digital technology Edited by С Toumazou, ]. B. Hughes & N. C. Battersby Supported by the IEEE Circuits and Systems Society Technical Committee on Analog Signal

More information

How does prism technology help to achieve superior color image quality?

How does prism technology help to achieve superior color image quality? WHITE PAPER How does prism technology help to achieve superior color image quality? Achieving superior image quality requires real and full color depth for every channel, improved color contrast and color

More information

Deep Learning for Autonomous Driving

Deep Learning for Autonomous Driving Deep Learning for Autonomous Driving Shai Shalev-Shwartz Mobileye IMVC dimension, March, 2016 S. Shalev-Shwartz is also affiliated with The Hebrew University Shai Shalev-Shwartz (MobilEye) DL for Autonomous

More information

STREAK DETECTION ALGORITHM FOR SPACE DEBRIS DETECTION ON OPTICAL IMAGES

STREAK DETECTION ALGORITHM FOR SPACE DEBRIS DETECTION ON OPTICAL IMAGES STREAK DETECTION ALGORITHM FOR SPACE DEBRIS DETECTION ON OPTICAL IMAGES Alessandro Vananti, Klaus Schild, Thomas Schildknecht Astronomical Institute, University of Bern, Sidlerstrasse 5, CH-3012 Bern,

More information

isudoku Computing Solutions to Sudoku Puzzles w/ 3 Algorithms by: Gavin Hillebrand Jamie Sparrow Jonathon Makepeace Matthew Harris

isudoku Computing Solutions to Sudoku Puzzles w/ 3 Algorithms by: Gavin Hillebrand Jamie Sparrow Jonathon Makepeace Matthew Harris isudoku Computing Solutions to Sudoku Puzzles w/ 3 Algorithms by: Gavin Hillebrand Jamie Sparrow Jonathon Makepeace Matthew Harris What is Sudoku? A logic-based puzzle game Heavily based in combinatorics

More information

The Art of Neural Nets

The Art of Neural Nets The Art of Neural Nets Marco Tavora marcotav65@gmail.com Preamble The challenge of recognizing artists given their paintings has been, for a long time, far beyond the capability of algorithms. Recent advances

More information

Table of Contents HOL EMT

Table of Contents HOL EMT Table of Contents Lab Overview - - Machine Learning Workloads in vsphere Using GPUs - Getting Started... 2 Lab Guidance... 3 Module 1 - Machine Learning Apps in vsphere VMs Using GPUs (15 minutes)...9

More information

A New Scheme for No Reference Image Quality Assessment

A New Scheme for No Reference Image Quality Assessment Author manuscript, published in "3rd International Conference on Image Processing Theory, Tools and Applications, Istanbul : Turkey (2012)" A New Scheme for No Reference Image Quality Assessment Aladine

More information

CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS. Kuan-Chuan Peng and Tsuhan Chen

CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS. Kuan-Chuan Peng and Tsuhan Chen CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS Kuan-Chuan Peng and Tsuhan Chen Cornell University School of Electrical and Computer Engineering Ithaca, NY 14850

More information

The Greater Hamilton Region - a Smart Connected Rural Community. Russell Bennett Manager Business Systems

The Greater Hamilton Region - a Smart Connected Rural Community. Russell Bennett Manager Business Systems The Greater Hamilton Region - a Smart Connected Rural Community Russell Bennett Manager Business Systems Setting the scene - SGSC Population: 15,751 Largest Industry by jobs: Agriculture Total Council

More information

CS 229, Project Progress Report SUNet ID: Name: Ajay Shanker Tripathi

CS 229, Project Progress Report SUNet ID: Name: Ajay Shanker Tripathi CS 229, Project Progress Report SUNet ID: 06044535 Name: Ajay Shanker Tripathi Title: Voice Transmogrifier: Spoofing My Girlfriend s Voice Project Category: Audio and Music The project idea is an easy-to-state

More information

Convolution Pyramids. Zeev Farbman, Raanan Fattal and Dani Lischinski SIGGRAPH Asia Conference (2011) Julian Steil. Prof. Dr.

Convolution Pyramids. Zeev Farbman, Raanan Fattal and Dani Lischinski SIGGRAPH Asia Conference (2011) Julian Steil. Prof. Dr. Zeev Farbman, Raanan Fattal and Dani Lischinski SIGGRAPH Asia Conference (2011) presented by: Julian Steil supervisor: Prof. Dr. Joachim Weickert Fig. 1.1: Gradient integration example Seminar - Milestones

More information

Ground-Based Magnetometer Arrays and Geomagnetically Induced Current in Power Grids: Science and Operations

Ground-Based Magnetometer Arrays and Geomagnetically Induced Current in Power Grids: Science and Operations Ground-Based Magnetometer Arrays and Geomagnetically Induced Current in Power Grids: Science and Operations Alan W P Thomson (awpt@bgs.ac.uk), Ciarán Beggan and Gemma Kelly Introduction What is this hazard

More information

An Array Feed Radial Basis Function Tracking System for NASA s Deep Space Network Antennas

An Array Feed Radial Basis Function Tracking System for NASA s Deep Space Network Antennas An Array Feed Radial Basis Function Tracking System for NASA s Deep Space Network Antennas Ryan Mukai Payman Arabshahi Victor A. Vilnrotter California Institute of Technology Jet Propulsion Laboratory

More information

Announcements. Image Processing. What s an image? Images as functions. Image processing. What s a digital image?

Announcements. Image Processing. What s an image? Images as functions. Image processing. What s a digital image? Image Processing Images by Pawan Sinha Today s readings Forsyth & Ponce, chapters 8.-8. http://www.cs.washington.edu/education/courses/49cv/wi/readings/book-7-revised-a-indx.pdf For Monday Watt,.3-.4 (handout)

More information

Machine Learning and Decision Making for Sustainability

Machine Learning and Decision Making for Sustainability Machine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford University April 12 Overview Stanford Artificial Intelligence Lab Fellow, Woods Institute for

More information

Convolutional Networks for Image Segmentation: U-Net 1, DeconvNet 2, and SegNet 3

Convolutional Networks for Image Segmentation: U-Net 1, DeconvNet 2, and SegNet 3 Convolutional Networks for Image Segmentation: U-Net 1, DeconvNet 2, and SegNet 3 1 Olaf Ronneberger, Philipp Fischer, Thomas Brox (Freiburg, Germany) 2 Hyeonwoo Noh, Seunghoon Hong, Bohyung Han (POSTECH,

More information

UNIT II IIR FILTER DESIGN

UNIT II IIR FILTER DESIGN UNIT II IIR FILTER DESIGN Structures of IIR Analog filter design Discrete time IIR filter from analog filter IIR filter design by Impulse Invariance, Bilinear transformation Approximation of derivatives

More information

Using Artificial intelligent to solve the game of 2048

Using Artificial intelligent to solve the game of 2048 Using Artificial intelligent to solve the game of 2048 Ho Shing Hin (20343288) WONG, Ngo Yin (20355097) Lam Ka Wing (20280151) Abstract The report presents the solver of the game 2048 base on artificial

More information