Fully Convolutional Networks for Semantic Segmentation
|
|
- Sherman Richards
- 6 years ago
- Views:
Transcription
1 Fully Convolutional Networks for Semantic Segmentation Jonathan Long* Evan Shelhamer* Trevor Darrell UC Berkeley Presented by: Gordon Christie 1
2 Overview Reinterpret standard classification convnets as Fully convolutional networks () for semantic segmentation Use AlexNet, VGG, and GoogleNet in experiments Novel architecture: combine information from different layers for segmentation State-of-the-art segmentation for PASCAL VOC 2011/2012, NYUDv2, and SIFT Flow at the time Inference less than one fifth of a second for a typical image 2
3 pixels in, pixels out monocular depth estimation (Liu et al. 2015) semantic segmentation boundary prediction (Xie & Tu 2015) 3
4 convnets perform classification < 1 millisecond tabby cat 1000-dim vector end-to-end learning 4
5 R-CNN does detection many seconds dog R-CNN cat 5
6 R-CNN figure: Girshick et al. 6
7 < 1/5 second??? end-to-end learning 7
8 a classification network tabby cat 8
9 becoming fully convolutional 9
10 becoming fully convolutional 10
11 upsampling output 11
12 end-to-end, pixels-to-pixels network conv, pool, nonlinearity upsampling pixelwise output + loss 12
13 Dense Predictions Shift-and-stitch: trick that yields dense predictions without interpolation Upsampling via deconvolution Shift-and-stitch used in preliminary experiments, but not included in final model Upsampling found to be more effective and efficient 13
14 Classifier to Dense Convolutionalize proven classification architectures: AlexNet, VGG, and GoogLeNet (reimplementation) Remove classification layer and convert all fully connected layers to convolutions Append 1x1 convolution with channel dimensions and predict scores at each of the coarse output locations (21 categories + background for PASCAL) 14
15 Classifier to Dense Cast ILSVRC classifiers into s and compare performance on validation set of PASCAL 2011 learning rate, not best performance possible.) - AlexNet - VGG16 mean IU forward time 50 ms 210 ms 59 ms conv. layers parameters 57M 134M 6M rf size max stride GoogLeNet 4 15
16 spectrum of deep features combine where (local, shallow) with what (global, deep) fuse features into deep jet (cf. Hariharan et al. CVPR15 hypercolumn ) 16
17 skip layers interp + sum interp + sum end-to-end, joint learning of semantics and location dense output 17
18 skip layers 32x upsampled prediction (-32s) 2x upsampled prediction 16x upsampled prediction (-16s) 2x upsampled prediction 8x upsampled prediction (-8s) pool3 pool4 pool5 pool4 prediction P pool3 prediction P n to combine coarse, high layer information with fine, low layer information. Layers are shown as grids that 18
19 Comparison of skip s Results on subset of validation set of PASCAL VOC VGG16, renamed to highlight stride. pixel acc. mean acc. mean IU f.w. IU -32s-fixed s s s
20 skip layer refinement input image stride 32 stride 16 stride 8 ground truth no skips 1 skip 2 skips 20
21 training + testing - train full image at a time without patch sampling - reshape network to take input of any size - forward time is ~150ms for 500 x 500 x 21 output 21
22 Results PASCAL VOC 2011/12 VOC 2011: 8498 training images (from additional labeled data test sets, and reduces inference time. mean IU mean IU inference VOC2011 test VOC2012 test time R-CNN [12] SDS [16] s -8s ms 22
23 Table 4. Results on NYUDv2. RGBD is early-fusion of the RGB and depth channels at the input. HHA is the depth embedding of [14] as horizontal disparity, height above ground, and the angle of the local surface normallabels à with the 40 categories inferred gravity 1449 RGB- D images with pixelwise direction. RGB-HHA is the jointly trained late fusion model that sums RGB and HHA predictions. Results NYUDv2 Gupta et al. [14] -32s RGB -32s RGBD -32s HHA -32s RGB-HHA -16s RGB-HHA pixel acc mean acc mean IU f.w. IU Table 5. (center) a a non-par SVM whi vnet traine samples ( noted RCN Li Tigh Tighe Tighe Farabe Farabe 23
24 Results SIFT Flow is early-fusion of the Table 5. Results on SIFT Flow10 with class segmentation 2688 images with pixel labels HA is the depth embed(center) and geometric segmentation (right). Tighe [33] is à 33 semantic categories, 3 geometric categories ght above ground, and a non-parametric transfer method. Tighe 1 is an exemplar th the inferred gravity SVM while 2 is SVM + MRF. Farabet is a multi-scale conlearn both label spaces jointly ned late fusion model vnet trained on class-balanced samples (1) or natural frequency à learning and inference have similar performance and samples (2). Pinheiro is a multi-scale, recurrent convnet, deean mean f.w. computation as independent models noted RCNN3 ( 3 ). The metric for geometry is pixel accuracy. c IU IU D images, with pixel- Liu et al. [23] Tighe et al. [33] Tighe et al. [34] 1 Tighe et al. [34] 2 Farabet et al. [8] 1 Farabet et al. [8] 2 Pinheiro et al. [28] -16s pixel acc mean mean f.w. acc. IU IU geom. acc
25 SDS* Truth Input Relative to prior state-of-theart SDS: - 20% relative improvement for mean IoU faster *Simultaneous Detection and Segmentation Hariharan et al. ECCV14 25
26 leaderboard == segmentation with Caffe 26
27 conclusion fully convolutional networks are fast, endto-end models for pixelwise problems - code in Caffe branch (merged soon) - models for PASCAL VOC, NYUDv2, SIFT Flow, PASCAL-Context caffe.berkeleyvision.org fcn.berkeleyvision.org github.com/bvlc/caffe 27
A Fuller Understanding of Fully Convolutional Networks. Evan Shelhamer* Jonathan Long* Trevor Darrell UC Berkeley in CVPR'15, PAMI'16
A Fuller Understanding of Fully Convolutional Networks Evan Shelhamer* Jonathan Long* Trevor Darrell UC Berkeley in CVPR'15, PAMI'16 1 pixels in, pixels out colorization Zhang et al.2016 monocular depth
More informationLecture 23 Deep Learning: Segmentation
Lecture 23 Deep Learning: Segmentation COS 429: Computer Vision Thanks: most of these slides shamelessly adapted from Stanford CS231n: Convolutional Neural Networks for Visual Recognition Fei-Fei Li, Andrej
More informationDetection and Segmentation. Fei-Fei Li & Justin Johnson & Serena Yeung. Lecture 11 -
Lecture 11: Detection and Segmentation Lecture 11-1 May 10, 2017 Administrative Midterms being graded Please don t discuss midterms until next week - some students not yet taken A2 being graded Project
More informationCS 7643: Deep Learning
CS 7643: Deep Learning Topics: Toeplitz matrices and convolutions = matrix-mult Dilated/a-trous convolutions Backprop in conv layers Transposed convolutions Dhruv Batra Georgia Tech HW1 extension 09/22
More informationColorful Image Colorizations Supplementary Material
Colorful Image Colorizations Supplementary Material Richard Zhang, Phillip Isola, Alexei A. Efros {rich.zhang, isola, efros}@eecs.berkeley.edu University of California, Berkeley 1 Overview This document
More informationConvolutional Networks for Image Segmentation: U-Net 1, DeconvNet 2, and SegNet 3
Convolutional Networks for Image Segmentation: U-Net 1, DeconvNet 2, and SegNet 3 1 Olaf Ronneberger, Philipp Fischer, Thomas Brox (Freiburg, Germany) 2 Hyeonwoo Noh, Seunghoon Hong, Bohyung Han (POSTECH,
More informationAn Introduction to Convolutional Neural Networks. Alessandro Giusti Dalle Molle Institute for Artificial Intelligence Lugano, Switzerland
An Introduction to Convolutional Neural Networks Alessandro Giusti Dalle Molle Institute for Artificial Intelligence Lugano, Switzerland Sources & Resources - Andrej Karpathy, CS231n http://cs231n.github.io/convolutional-networks/
More informationDeep Learning. Dr. Johan Hagelbäck.
Deep Learning Dr. Johan Hagelbäck johan.hagelback@lnu.se http://aiguy.org Image Classification Image classification can be a difficult task Some of the challenges we have to face are: Viewpoint variation:
More informationarxiv: v1 [cs.cv] 9 Nov 2015 Abstract
Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene Understanding Alex Kendall Vijay Badrinarayanan University of Cambridge agk34, vb292, rc10001 @cam.ac.uk
More informationConvolutional Neural Networks
Convolutional Neural Networks Convolution, LeNet, AlexNet, VGGNet, GoogleNet, Resnet, DenseNet, CAM, Deconvolution Sept 17, 2018 Aaditya Prakash Convolution Convolution Demo Convolution Convolution in
More informationOn the Use of Fully Convolutional Networks on Evaluation of Infrared Breast Image Segmentations
17º WIM - Workshop de Informática Médica On the Use of Fully Convolutional Networks on Evaluation of Infrared Breast Image Segmentations Rafael H. C. de Melo, Aura Conci, Cristina Nader Vasconcelos Computer
More informationROAD RECOGNITION USING FULLY CONVOLUTIONAL NEURAL NETWORKS
Bulletin of the Transilvania University of Braşov Vol. 10 (59) No. 2-2017 Series I: Engineering Sciences ROAD RECOGNITION USING FULLY CONVOLUTIONAL NEURAL NETWORKS E. HORVÁTH 1 C. POZNA 2 Á. BALLAGI 3
More informationarxiv: v1 [cs.cv] 15 Apr 2016
High-performance Semantic Segmentation Using Very Deep Fully Convolutional Networks arxiv:1604.04339v1 [cs.cv] 15 Apr 2016 Zifeng Wu, Chunhua Shen, Anton van den Hengel The University of Adelaide, SA 5005,
More informationSemantic Segmentation on Resource Constrained Devices
Semantic Segmentation on Resource Constrained Devices Sachin Mehta University of Washington, Seattle In collaboration with Mohammad Rastegari, Anat Caspi, Linda Shapiro, and Hannaneh Hajishirzi Project
More informationNU-Net: Deep Residual Wide Field of View Convolutional Neural Network for Semantic Segmentation
NU-Net: Deep Residual Wide Field of View Convolutional Neural Network for Semantic Segmentation Mohamed Samy 1 Karim Amer 1 Kareem Eissa Mahmoud Shaker Mohamed ElHelw Center for Informatics Science Nile
More informationLecture 11-1 CNN introduction. Sung Kim
Lecture 11-1 CNN introduction Sung Kim 'The only limit is your imagination' http://itchyi.squarespace.com/thelatest/2012/5/17/the-only-limit-is-your-imagination.html Lecture 7: Convolutional
More informationDeep Multispectral Semantic Scene Understanding of Forested Environments using Multimodal Fusion
Deep Multispectral Semantic Scene Understanding of Forested Environments using Multimodal Fusion Abhinav Valada, Gabriel L. Oliveira, Thomas Brox, and Wolfram Burgard Department of Computer Science, University
More informationContinuous Gesture Recognition Fact Sheet
Continuous Gesture Recognition Fact Sheet August 17, 2016 1 Team details Team name: ICT NHCI Team leader name: Xiujuan Chai Team leader address, phone number and email Address: No.6 Kexueyuan South Road
More informationCascaded Feature Network for Semantic Segmentation of RGB-D Images
Cascaded Feature Network for Semantic Segmentation of RGB-D Images Di Lin1 Guangyong Chen2 Daniel Cohen-Or1,3 Pheng-Ann Heng2,4 Hui Huang1,4 1 Shenzhen University 2 The Chinese University of Hong Kong
More informationCROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS. Kuan-Chuan Peng and Tsuhan Chen
CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS Kuan-Chuan Peng and Tsuhan Chen Cornell University School of Electrical and Computer Engineering Ithaca, NY 14850
More informationCONVOLUTIONAL NEURAL NETWORKS: MOTIVATION, CONVOLUTION OPERATION, ALEXNET
CONVOLUTIONAL NEURAL NETWORKS: MOTIVATION, CONVOLUTION OPERATION, ALEXNET MOTIVATION Fully connected neural network Example 1000x1000 image 1M hidden units 10 12 (= 10 6 10 6 ) parameters! Observation
More informationAutomatic tumor segmentation in breast ultrasound images using a dilated fully convolutional network combined with an active contour model
Automatic tumor segmentation in breast ultrasound images using a dilated fully convolutional network combined with an active contour model Yuzhou Hu Departmentof Electronic Engineering, Fudan University,
More informationSketch-a-Net that Beats Humans
Sketch-a-Net that Beats Humans Qian Yu SketchLab@QMUL Queen Mary University of London 1 Authors Qian Yu Yongxin Yang Yi-Zhe Song Tao Xiang Timothy Hospedales 2 Let s play a game! Round 1 Easy fish face
More informationEn ny æra for uthenting av informasjon fra satellittbilder ved hjelp av maskinlæring
En ny æra for uthenting av informasjon fra satellittbilder ved hjelp av maskinlæring Mathilde Ørstavik og Terje Midtbø Mathilde Ørstavik and Terje Midtbø, A New Era for Feature Extraction in Remotely Sensed
More informationGenerating an appropriate sound for a video using WaveNet.
Australian National University College of Engineering and Computer Science Master of Computing Generating an appropriate sound for a video using WaveNet. COMP 8715 Individual Computing Project Taku Ueki
More informationGESTURE RECOGNITION WITH 3D CNNS
April 4-7, 2016 Silicon Valley GESTURE RECOGNITION WITH 3D CNNS Pavlo Molchanov Xiaodong Yang Shalini Gupta Kihwan Kim Stephen Tyree Jan Kautz 4/6/2016 Motivation AGENDA Problem statement Selecting the
More informationIntroduction to Machine Learning
Introduction to Machine Learning Deep Learning Barnabás Póczos Credits Many of the pictures, results, and other materials are taken from: Ruslan Salakhutdinov Joshua Bengio Geoffrey Hinton Yann LeCun 2
More informationCS688/WST665 Student presentation Learning Fine-grained Image Similarity with Deep Ranking CVPR Gayoung Lee ( 이가영 )
CS688/WST665 Student presentation Learning Fine-grained Image Similarity with Deep Ranking CVPR 2014 Gayoung Lee ( 이가영 ) Contents 1. Background knowledge 2. Proposed method 3. Experimental Result 4. Conclusion
More informationSemantic Localization of Indoor Places. Lukas Kuster
Semantic Localization of Indoor Places Lukas Kuster Motivation GPS for localization [7] 2 Motivation Indoor navigation [8] 3 Motivation Crowd sensing [9] 4 Motivation Targeted Advertisement [10] 5 Motivation
More informationLearning to Predict Indoor Illumination from a Single Image. Chih-Hui Ho
Learning to Predict Indoor Illumination from a Single Image Chih-Hui Ho 1 Outline Introduction Method Overview LDR Panorama Light Source Detection Panorama Recentering Warp Learning From LDR Panoramas
More informationLesson 08. Convolutional Neural Network. Ing. Marek Hrúz, Ph.D. Katedra Kybernetiky Fakulta aplikovaných věd Západočeská univerzita v Plzni.
Lesson 08 Convolutional Neural Network Ing. Marek Hrúz, Ph.D. Katedra Kybernetiky Fakulta aplikovaných věd Západočeská univerzita v Plzni Lesson 08 Convolution we will consider 2D convolution the result
More informationDeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition. ECE 289G: Paper Presentation #3 Philipp Gysel
DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition ECE 289G: Paper Presentation #3 Philipp Gysel Autonomous Car ECE 289G Paper Presentation, Philipp Gysel Slide 2 Source: maps.google.com
More informationarxiv: v1 [cs.cv] 19 Jun 2017
Satellite Imagery Feature Detection using Deep Convolutional Neural Network: A Kaggle Competition Vladimir Iglovikov True Accord iglovikov@gmail.com Sergey Mushinskiy Open Data Science cepera.ang@gmail.com
More informationGoing Deeper into First-Person Activity Recognition
Going Deeper into First-Person Activity Recognition Minghuang Ma, Haoqi Fan and Kris M. Kitani Carnegie Mellon University Pittsburgh, PA 15213, USA minghuam@andrew.cmu.edu haoqif@andrew.cmu.edu kkitani@cs.cmu.edu
More informationtsushi Sasaki Fig. Flow diagram of panel structure recognition by specifying peripheral regions of each component in rectangles, and 3 types of detect
RECOGNITION OF NEL STRUCTURE IN COMIC IMGES USING FSTER R-CNN Hideaki Yanagisawa Hiroshi Watanabe Graduate School of Fundamental Science and Engineering, Waseda University BSTRCT For efficient e-comics
More informationBiologically Inspired Computation
Biologically Inspired Computation Deep Learning & Convolutional Neural Networks Joe Marino biologically inspired computation biological intelligence flexible capable of detecting/ executing/reasoning about
More informationVideo Object Segmentation with Re-identification
Video Object Segmentation with Re-identification Xiaoxiao Li, Yuankai Qi, Zhe Wang, Kai Chen, Ziwei Liu, Jianping Shi Ping Luo, Chen Change Loy, Xiaoou Tang The Chinese University of Hong Kong, SenseTime
More informationCSC 578 Neural Networks and Deep Learning
CSC 578 Neural Networks and Deep Learning Fall 2018/19 6. Convolutional Neural Networks (Some figures adapted from NNDL book) 1 Convolution Neural Networks 1. Convolutional Neural Networks Convolution,
More informationarxiv: v1 [cs.ce] 9 Jan 2018
Predict Forex Trend via Convolutional Neural Networks Yun-Cheng Tsai, 1 Jun-Hao Chen, 2 Jun-Jie Wang 3 arxiv:1801.03018v1 [cs.ce] 9 Jan 2018 1 Center for General Education 2,3 Department of Computer Science
More informationSCENE SEMANTIC SEGMENTATION FROM INDOOR RGB-D IMAGES USING ENCODE-DECODER FULLY CONVOLUTIONAL NETWORKS
SCENE SEMANTIC SEGMENTATION FROM INDOOR RGB-D IMAGES USING ENCODE-DECODER FULLY CONVOLUTIONAL NETWORKS Zhen Wang *, Te Li, Lijun Pan, Zhizhong Kang China University of Geosciences, Beijing - (comige@gmail.com,
More informationConvolutional Neural Network-Based Infrared Image Super Resolution Under Low Light Environment
Convolutional Neural Network-Based Infrared Super Resolution Under Low Light Environment Tae Young Han, Yong Jun Kim, Byung Cheol Song Department of Electronic Engineering Inha University Incheon, Republic
More informationarxiv: v1 [stat.ml] 10 Nov 2017
Poverty Prediction with Public Landsat 7 Satellite Imagery and Machine Learning arxiv:1711.03654v1 [stat.ml] 10 Nov 2017 Anthony Perez Department of Computer Science Stanford, CA 94305 aperez8@stanford.edu
More informationAutomatic point-of-interest image cropping via ensembled convolutionalization
1 Automatic point-of-interest image cropping via ensembled convolutionalization Andrea Asperti and Pietro Battilana University of Bologna Department of informatics: Science and Engineering (DISI) Abstract
More informationDEEP LEARNING ON RF DATA. Adam Thompson Senior Solutions Architect March 29, 2018
DEEP LEARNING ON RF DATA Adam Thompson Senior Solutions Architect March 29, 2018 Background Information Signal Processing and Deep Learning Radio Frequency Data Nuances AGENDA Complex Domain Representations
More informationUnderstanding Convolution for Semantic Segmentation
Understanding Convolution for Semantic Segmentation Panqu Wang 1, Pengfei Chen 1, Ye Yuan 2, Ding Liu 3, Zehua Huang 1, Xiaodi Hou 1, Garrison Cottrell 4 1 TuSimple, 2 Carnegie Mellon University, 3 University
More informationیادآوری: خالصه CNN. ConvNet
1 ConvNet یادآوری: خالصه CNN شبکه عصبی کانولوشنال یا Convolutional Neural Networks یا نوعی از شبکههای عصبی عمیق مدل یادگیری آن باناظر.اصالح وزنها با الگوریتم back-propagation مناسب برای داده های حجیم و
More informationEE-559 Deep learning 7.2. Networks for image classification
EE-559 Deep learning 7.2. Networks for image classification François Fleuret https://fleuret.org/ee559/ Fri Nov 16 22:58:34 UTC 2018 ÉCOLE POLYTECHNIQUE FÉDÉRALE DE LAUSANNE Image classification, standard
More informationOn Emerging Technologies
On Emerging Technologies 9.11. 2018. Prof. David Hyunchul Shim Director, Korea Civil RPAS Research Center KAIST, Republic of Korea hcshim@kaist.ac.kr 1 I. Overview Recent emerging technologies in civil
More informationTiny ImageNet Challenge Investigating the Scaling of Inception Layers for Reduced Scale Classification Problems
Tiny ImageNet Challenge Investigating the Scaling of Inception Layers for Reduced Scale Classification Problems Emeric Stéphane Boigné eboigne@stanford.edu Jan Felix Heyse heyse@stanford.edu Abstract Scaling
More informationAn energy-efficient coarse grained spatial architecture for convolutional neural networks AlexNet
LETTER IEICE Electronics Express, Vol.14, No.15, 1 12 An energy-efficient coarse grained spatial architecture for convolutional neural networks AlexNet Boya Zhao a), Mingjiang Wang b), and Ming Liu Harbin
More informationSemantic Segmentation in Red Relief Image Map by UX-Net
Semantic Segmentation in Red Relief Image Map by UX-Net Tomoya Komiyama 1, Kazuhiro Hotta 1, Kazuo Oda 2, Satomi Kakuta 2 and Mikako Sano 2 1 Meijo University, Shiogamaguchi, 468-0073, Nagoya, Japan 2
More informationDSNet: An Efficient CNN for Road Scene Segmentation
DSNet: An Efficient CNN for Road Scene Segmentation Ping-Rong Chen 1 Hsueh-Ming Hang 1 1 National Chiao Tung University {james50120.ee05g, hmhang}@nctu.edu.tw Sheng-Wei Chan 2 Jing-Jhih Lin 2 2 Industrial
More informationVehicle Color Recognition using Convolutional Neural Network
Vehicle Color Recognition using Convolutional Neural Network Reza Fuad Rachmadi and I Ketut Eddy Purnama Multimedia and Network Engineering Department, Institut Teknologi Sepuluh Nopember, Keputih Sukolilo,
More informationMultispectral Pedestrian Detection using Deep Fusion Convolutional Neural Networks
Multispectral Pedestrian Detection using Deep Fusion Convolutional Neural Networks Jo rg Wagner1,2, Volker Fischer1, Michael Herman1 and Sven Behnke2 1- Robert Bosch GmbH - 70442 Stuttgart - Germany 2-
More informationTRANSFORMING PHOTOS TO COMICS USING CONVOLUTIONAL NEURAL NETWORKS. Tsinghua University, China Cardiff University, UK
TRANSFORMING PHOTOS TO COMICS USING CONVOUTIONA NEURA NETWORKS Yang Chen Yu-Kun ai Yong-Jin iu Tsinghua University, China Cardiff University, UK ABSTRACT In this paper, inspired by Gatys s recent work,
More informationUnderstanding Convolution for Semantic Segmentation
Understanding Convolution for Semantic Segmentation Panqu Wang 1, Pengfei Chen 1, Ye Yuan 2, Ding Liu 3, Zehua Huang 1, Xiaodi Hou 1, Garrison Cottrell 4 1 TuSimple, 2 Carnegie Mellon University, 3 University
More informationDeep filter banks for texture recognition and segmentation
Deep filter banks for texture recognition and segmentation Mircea Cimpoi, University of Oxford Subhransu Maji, UMASS Amherst Andrea Vedaldi, University of Oxford Texture understanding 2 Indicator of materials
More informationVisualizing and Understanding. Fei-Fei Li & Justin Johnson & Serena Yeung. Lecture 12 -
Lecture 12: Visualizing and Understanding Lecture 12-1 May 16, 2017 Administrative Milestones due tonight on Canvas, 11:59pm Midterm grades released on Gradescope this week A3 due next Friday, 5/26 HyperQuest
More informationSynthetic View Generation for Absolute Pose Regression and Image Synthesis: Supplementary material
Synthetic View Generation for Absolute Pose Regression and Image Synthesis: Supplementary material Pulak Purkait 1 pulak.cv@gmail.com Cheng Zhao 2 irobotcheng@gmail.com Christopher Zach 1 christopher.m.zach@gmail.com
More information6. Convolutional Neural Networks
6. Convolutional Neural Networks CS 519 Deep Learning, Winter 2016 Fuxin Li With materials from Zsolt Kira Quiz coming up Next Tuesday (1/26) 15 minutes Topics: Optimization Basic neural networks No Convolutional
More informationImpact of Automatic Feature Extraction in Deep Learning Architecture
Impact of Automatic Feature Extraction in Deep Learning Architecture Fatma Shaheen, Brijesh Verma and Md Asafuddoula Centre for Intelligent Systems Central Queensland University, Brisbane, Australia {f.shaheen,
More informationA COMPARATIVE ANALYSIS OF IMAGE SEGMENTATION TECHNIQUES
International Journal of Computer Engineering & Technology (IJCET) Volume 9, Issue 5, September-October 2018, pp. 64 69, Article ID: IJCET_09_05_009 Available online at http://www.iaeme.com/ijcet/issues.asp?jtype=ijcet&vtype=9&itype=5
More informationUnderstanding Neural Networks : Part II
TensorFlow Workshop 2018 Understanding Neural Networks Part II : Convolutional Layers and Collaborative Filters Nick Winovich Department of Mathematics Purdue University July 2018 Outline 1 Convolutional
More informationLANDMARK recognition is an important feature for
1 NU-LiteNet: Mobile Landmark Recognition using Convolutional Neural Networks Chakkrit Termritthikun, Surachet Kanprachar, Paisarn Muneesawang arxiv:1810.01074v1 [cs.cv] 2 Oct 2018 Abstract The growth
More informationWhat Is And How Will Machine Learning Change Our Lives. Fair Use Agreement
What Is And How Will Machine Learning Change Our Lives Raymond Ptucha, Rochester Institute of Technology 2018 Engineering Symposium April 24, 2018, 9:45am Ptucha 18 1 Fair Use Agreement This agreement
More informationMulti-task Learning of Dish Detection and Calorie Estimation
Multi-task Learning of Dish Detection and Calorie Estimation Department of Informatics, The University of Electro-Communications, Tokyo 1-5-1 Chofugaoka, Chofu-shi, Tokyo 182-8585 JAPAN ABSTRACT In recent
More informationThe Cityscapes Dataset for Semantic Urban Scene Understanding SUPPLEMENTAL MATERIAL
The Cityscapes Dataset for Semantic Urban Scene Understanding SUPPLEMENTAL MATERIAL Marius Cordts 1,2 Mohamed Omran 3 Sebastian Ramos 1,4 Timo Rehfeld 1,2 Markus Enzweiler 1 Rodrigo Benenson 3 Uwe Franke
More informationTransformation to Artificial Intelligence with MATLAB Roy Lurie, PhD Vice President of Engineering MATLAB Products
Transformation to Artificial Intelligence with MATLAB Roy Lurie, PhD Vice President of Engineering MATLAB Products 2018 The MathWorks, Inc. 1 A brief history of the automobile First Commercial Gas Car
More informationDeformable Convolutional Networks
Deformable Convolutional Networks Jifeng Dai^ With Haozhi Qi*^, Yuwen Xiong*^, Yi Li*^, Guodong Zhang*^, Han Hu, Yichen Wei Visual Computing Group Microsoft Research Asia (* interns at MSRA, ^ equal contribution)
More informationMultimedia Forensics
Multimedia Forensics Using Mathematics and Machine Learning to Determine an Image's Source and Authenticity Matthew C. Stamm Multimedia & Information Security Lab (MISL) Department of Electrical and Computer
More informationConvolutional neural networks
Convolutional neural networks Themes Curriculum: Ch 9.1, 9.2 and http://cs231n.github.io/convolutionalnetworks/ The simple motivation and idea How it s done Receptive field Pooling Dilated convolutions
More informationMultiple Kernels for Object Detection. Andrea Vedaldi Varun Gulshan Manik Varma Andrew Zisserman
Multiple Kernels for Object Detection Andrea Vedaldi Varun Gulshan Manik Varma Andrew Zisserman MK classification PHOW Gray MK SVM PHOW Color combine one kernel per histogram PHOG PHOG Sym Feature vector
More informationFree-hand Sketch Recognition Classification
Free-hand Sketch Recognition Classification Wayne Lu Stanford University waynelu@stanford.edu Elizabeth Tran Stanford University eliztran@stanford.edu Abstract People use sketches to express and record
More informationA Fast Method for Estimating Transient Scene Attributes
A Fast Method for Estimating Transient Scene Attributes Ryan Baltenberger, Menghua Zhai, Connor Greenwell, Scott Workman, Nathan Jacobs Department of Computer Science, University of Kentucky {rbalten,
More informationDynamic Scene Deblurring Using Spatially Variant Recurrent Neural Networks
Dynamic Scene Deblurring Using Spatially Variant Recurrent Neural Networks Jiawei Zhang 1,2 Jinshan Pan 3 Jimmy Ren 2 Yibing Song 4 Linchao Bao 4 Rynson W.H. Lau 1 Ming-Hsuan Yang 5 1 Department of Computer
More informationFully Convolutional Network with dilated convolutions for Handwritten
International Journal on Document Analysis and Recognition manuscript No. (will be inserted by the editor) Fully Convolutional Network with dilated convolutions for Handwritten text line segmentation Guillaume
More informationLecture 7: Scene Text Detection and Recognition. Dr. Cong Yao Megvii (Face++) Researcher
Lecture 7: Scene Text Detection and Recognition Dr. Cong Yao Megvii (Face++) Researcher yaocong@megvii.com Outline Background and Introduction Conventional Methods Deep Learning Methods Datasets and Competitions
More informationAutomated Image Timestamp Inference Using Convolutional Neural Networks
Automated Image Timestamp Inference Using Convolutional Neural Networks Prafull Sharma prafull7@stanford.edu Michel Schoemaker michel92@stanford.edu Stanford University David Pan napdivad@stanford.edu
More informationArtistic Image Colorization with Visual Generative Networks
Artistic Image Colorization with Visual Generative Networks Final report Yuting Sun ytsun@stanford.edu Yue Zhang zoezhang@stanford.edu Qingyang Liu qnliu@stanford.edu 1 Motivation Visual generative models,
More informationLearning Pixel-Distribution Prior with Wider Convolution for Image Denoising
Learning Pixel-Distribution Prior with Wider Convolution for Image Denoising Peng Liu University of Florida pliu1@ufl.edu Ruogu Fang University of Florida ruogu.fang@bme.ufl.edu arxiv:177.9135v1 [cs.cv]
More informationObject Detection in Wide Area Aerial Surveillance Imagery with Deep Convolutional Networks
Object Detection in Wide Area Aerial Surveillance Imagery with Deep Convolutional Networks Gregoire Robinson University of Massachusetts Amherst Amherst, MA gregoirerobi@umass.edu Introduction Wide Area
More informationPhysically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks SUPPLEMENTAL MATERIAL
Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks SUPPLEMENTAL MATERIAL Yinda Zhang Shuran Song Ersin Yumer Manolis Savva Joon-Young Lee Hailin Jin Thomas Funkhouser
More informationCoursework 2. MLP Lecture 7 Convolutional Networks 1
Coursework 2 MLP Lecture 7 Convolutional Networks 1 Coursework 2 - Overview and Objectives Overview: Use a selection of the techniques covered in the course so far to train accurate multi-layer networks
More informationFood Image Recognition Using Deep Convolutional Network with Pre-training and Fine-tuning
Food Image Recognition Using Deep Convolutional Network with Pre-training and Fine-tuning ICME Workshop on Multimedia for Cooking and Eating Activities (CEA) July 3 th 2015 Keiji Yanai and Yoshiyuki Kawano
More informationStudy Impact of Architectural Style and Partial View on Landmark Recognition
Study Impact of Architectural Style and Partial View on Landmark Recognition Ying Chen smileyc@stanford.edu 1. Introduction Landmark recognition in image processing is one of the important object recognition
More informationRecognition: Overview. Sanja Fidler CSC420: Intro to Image Understanding 1/ 83
Recognition: Overview Sanja Fidler CSC420: Intro to Image Understanding 1/ 83 Textbook This book has a lot of material: K. Grauman and B. Leibe Visual Object Recognition Synthesis Lectures On Computer
More informationSketch-R2CNN: An Attentive Network for Vector Sketch Recognition
Sketch-R2CNN: An Attentive Network for Vector Sketch Recognition sketch-based retrieval [4, 38, 30, 42] and modeling [26], etc. In this paper, we focus on developing a novel learning-based method for freehand
More informationSketch-a-Net: a Deep Neural Network that Beats Humans
Sketch-a-Net: a Deep Neural Network that Beats Humans Yu, Q; Yang, Y; Liu, F; SONG, Y; Xiang, T; Hospedales, T DOI: 0.00/s-0-0- For additional information about this publication click this link. http://qmro.qmul.ac.uk/xmlui/handle//
More informationThermal Image Enhancement Using Convolutional Neural Network
SEOUL Oct.7, 2016 Thermal Image Enhancement Using Convolutional Neural Network Visual Perception for Autonomous Driving During Day and Night Yukyung Choi Soonmin Hwang Namil Kim Jongchan Park In So Kweon
More informationPark Smart. D. Di Mauro 1, M. Moltisanti 2, G. Patanè 2, S. Battiato 1, G. M. Farinella 1. Abstract. 1. Introduction
Park Smart D. Di Mauro 1, M. Moltisanti 2, G. Patanè 2, S. Battiato 1, G. M. Farinella 1 1 Department of Mathematics and Computer Science University of Catania {dimauro,battiato,gfarinella}@dmi.unict.it
More informationCompositing-aware Image Search
Compositing-aware Image Search Hengshuang Zhao 1, Xiaohui Shen 2, Zhe Lin 3, Kalyan Sunkavalli 3, Brian Price 3, Jiaya Jia 1,4 1 The Chinese University of Hong Kong, 2 ByteDance AI Lab, 3 Adobe Research,
More informationCSC321 Lecture 11: Convolutional Networks
CSC321 Lecture 11: Convolutional Networks Roger Grosse Roger Grosse CSC321 Lecture 11: Convolutional Networks 1 / 35 Overview What makes vision hard? Vison needs to be robust to a lot of transformations
More informationFrame-Based Classification of Operation Phases in Cataract Surgery Videos
Frame-Based Classification of Operation Phases in Cataract Surgery Videos Manfred Jüergen Primus 1, Doris Putzgruber-Adamitsch 2 Mario Taschwer 1, Bernd Münzer 1, Yosuf El-Shabrawi 2, Laszlo Böszörmenyi
More informationIMAGE RESTORATION WITH NEURAL NETWORKS. Orazio Gallo Work with Hang Zhao, Iuri Frosio, Jan Kautz
IMAGE RESTORATION WITH NEURAL NETWORKS Orazio Gallo Work with Hang Zhao, Iuri Frosio, Jan Kautz MOTIVATION The long path of images Bad Pixel Correction Black Level AF/AE Demosaic Denoise Lens Correction
More informationA Geometry-Sensitive Approach for Photographic Style Classification
A Geometry-Sensitive Approach for Photographic Style Classification Koustav Ghosal 1, Mukta Prasad 1,2, and Aljosa Smolic 1 1 V-SENSE, School of Computer Science and Statistics, Trinity College Dublin
More informationUniversity of Bristol - Explore Bristol Research. Peer reviewed version. Link to publication record in Explore Bristol Research PDF-document
Hepburn, A., McConville, R., & Santos-Rodriguez, R. (2017). Album cover generation from genre tags. Paper presented at 10th International Workshop on Machine Learning and Music, Barcelona, Spain. Peer
More informationDYNAMIC CONVOLUTIONAL NEURAL NETWORK FOR IMAGE SUPER- RESOLUTION
Journal of Advanced College of Engineering and Management, Vol. 3, 2017 DYNAMIC CONVOLUTIONAL NEURAL NETWORK FOR IMAGE SUPER- RESOLUTION Anil Bhujel 1, Dibakar Raj Pant 2 1 Ministry of Information and
More informationConvolutional Neural Networks. Fei-Fei Li & Justin Johnson & Serena Yeung. Lecture 5-1
Lecture 5: Convolutional Neural Networks Lecture 5-1 Administrative Assignment 1 due Thursday April 20, 11:59pm on Canvas Assignment 2 will be released Thursday Lecture 5-2 Last time: Neural Networks Linear
More informationImproving a real-time object detector with compact temporal information
Improving a real-time object detector with compact temporal information Martin Ahrnbom Lund University martin.ahrnbom@math.lth.se Morten Bornø Jensen Aalborg University mboj@create.aau.dk Håkan Ardö Lund
More informationScene Perception based on Boosting over Multimodal Channel Features
Scene Perception based on Boosting over Multimodal Channel Features Arthur Costea Image Processing and Pattern Recognition Research Center Technical University of Cluj-Napoca Research Group Technical University
More information