Supplementary Material: Deep Photo Enhancer: Unpaired Learning for Image Enhancement from Photographs with GANs
|
|
- Darlene Jackson
- 6 years ago
- Views:
Transcription
1 Supplementary Material: Deep Photo Enhancer: Unpaired Learning for Image Enhancement from Photographs with GANs Yu-Sheng Chen Yu-Ching Wang Man-Hsin Kao Yung-Yu Chuang National Taiwan University 1 More comparisons on the MIT-Adobe 5K dataset In the paper, we proposed three models for enhancing images, SL (supervised learning using the proposed generator trained on the MIT-Adobe 5K dataset), UL (the proposed 2-way GAN trained on the MIT-Adobe 5K dataset) and HDR (the proposed 2-way GAN trained on the collected HDR dataset). We compare the proposed models with five state-of-the-art methods, including Cycle- GAN [10], DPED [5], CLHE [8], NPEA [9] and FLLF [1]. Note that our model can be taken as an enhanced CycleGAN with three proposed improvements, a better generator, a better WGAN model with the adaptive weighting scheme and a better 2-way GAN model with individual batch normalization layers. The CycleGAN was trained on the collected HDR dataset. The DPED models are tailored with di erent mobile phones. Here, we show results of the DPED models for iphone6, iphone7 and Nexus 5x. This section compares these methods on some images from the MIT-Adobe- 5K [2] testing dataset. In general, we have the following observations. Our models trained with the photographer s labels approximate the labels reasonably well. DPED models vary a lot with di erent phone models. Though trained on the same HDR dataset, CycleGAN cannot capture the characteristics of the dataset as well as our HDR model. It shows that the proposed improvements are e ective and important, at least for this application. CLHE, NPEA and FLLF are not robust and could generate unnatural enhanced images at times. Our HDR model captures the characteristics of the collected HDR dataset well and generates the most natural enhanced images.
2 2 Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, and Yung-Yu Chuang Input Label Our (SL) Our (UL) Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CycleGAN (HDR) CLHE NPEA FLLF Fig. 1: Comparisons of di erent methods on a MIT-adobe-5K testing image, a3552.
3 Supplementary Material: Deep Photo Enhancer 3 Input Label Our (SL) Our (UL) Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CycleGAN (HDR) CLHE NPEA FLLF Fig. 2: Comparisons of di erent methods on a MIT-adobe-5K testing image, a0212
4 4 Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, and Yung-Yu Chuang Input Label Our (SL) Our (UL) Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CycleGAN (HDR) CLHE NPEA FLLF Fig. 3: Comparisons of di erent methods on a MIT-adobe-5K testing image, a0481
5 Supplementary Material: Deep Photo Enhancer 5 Input Label Our (SL) Our (UL) Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CycleGAN (HDR) CLHE NPEA FLLF Fig. 4: Comparisons of di erent methods on a MIT-adobe-5K testing image, a3203
6 6 Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, and Yung-Yu Chuang Input Label Our (HDR) Our (SL) Our (UL) CycleGAN (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE NPEA FLLF Fig. 5: Comparisons of di erent methods on a MIT-adobe-5K testing image, a0535
7 Supplementary Material: Deep Photo Enhancer 7 Input Label Our (HDR) Our (SL) Our (UL) CycleGAN (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE NPEA FLLF Fig. 6: Comparisons of di erent methods on a MIT-adobe-5K testing image, a1305
8 8 Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, and Yung-Yu Chuang Input Label Our (HDR) Our (SL) Our (UL) CycleGAN (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE NPEA FLLF Fig. 7: Comparisons of di erent methods on a MIT-adobe-5K testing image, a4963
9 Supplementary Material: Deep Photo Enhancer 9 2 More comparisons on images from the Internet We show results of di erent methods on enhancing images collected from the Internet. We compare our HDR model with DPED and CLHE. For DPED, we show the results using all three phone models. Although with good enhancement in general, the results of CLHE sometimes look unnatural, particularly on colors. The results of DPED vary among phone models, showing its dependence to phone models. In general, our results give natural results with enhanced color, contrast and details. Noe that the model was trained on HDR images. Thus, the results are HDR-like. Sometime, it could look too prominent. It is however possible to train a modest model with a set of images with that style. We also provide an accompanying video showing the results of our HDR model on a video. Each frame is processed independently. However, there is no obvious temporal flick. It shows that our model is quite stable. The video also demonstrates that our model can be used for a wide variety of images.
10 10 Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, and Yung-Yu Chuang Input Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE Fig. 8: Comparisons of di erent methods on an Internet image.
11 Supplementary Material: Deep Photo Enhancer 11 Input Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE Fig. 9: Comparisons of di erent methods on an Internet image.
12 12 Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, and Yung-Yu Chuang Input Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE Fig. 10: Comparisons of di erent methods on an Internet image.
13 Supplementary Material: Deep Photo Enhancer 13 Input Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE Fig. 11: Comparisons of di erent methods on an Internet image.
14 14 Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, and Yung-Yu Chuang Input Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE Fig. 12: Comparisons of di erent methods on an Internet image.
15 Supplementary Material: Deep Photo Enhancer 15 Input Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE Fig. 13: Comparisons of di erent methods on an Internet image.
16 16 Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, and Yung-Yu Chuang Input Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE Fig. 14: Comparisons of di erent methods on an Internet image.
17 Supplementary Material: Deep Photo Enhancer 17 Input Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE Fig. 15: Comparisons of di erent methods on an Internet image.
18 18 Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, and Yung-Yu Chuang Input Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE Fig. 16: Comparisons of di erent methods on an Internet image.
19 Supplementary Material: Deep Photo Enhancer 19 Input Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE Fig. 17: Comparisons of di erent methods on an Internet image.
20 20 Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, and Yung-Yu Chuang Input Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE Fig. 18: Comparisons of di erent methods on an Internet image.
21 Supplementary Material: Deep Photo Enhancer 21 Input Our (HDR) DPED (iphone6) DPED (iphone7) DPED (Nexus 5x) CLHE Fig. 19: Comparisons of di erent methods on an Internet image.
22 22 Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, and Yung-Yu Chuang 3 The collected HDR dataset We show sample images of the collected HDR dataset below. Fig. 20: Sample images from the collected HDR dataset
23 Supplementary Material: Deep Photo Enhancer 23 4 Training This section shows some figures about the training process. Figure 21 shows the training progress of di erent GAN models, the proposed A-WGAN, WGAN- GP, DRAGAN, LSGAN (local D), LSGAN and GAN. It can be seen that the proposed A-WGAN generates the best results while some GAN models could totally collapse. Figure 22 shows the discriminator loss along the training process for di erent one-way GAN models trained on the MIT-Adobe 5K dataset. For the proposed model, the discriminator loss can be used as a good indicator for convergence. Figure 23 shows the discriminator loss for the proposed two-way GAN model with and without the individual batch normalization layers on the MIT-Adobe 5K and the collected HDR datasets. Although training on the MIT- Adobe 5K dataset is e ective without individual batch normalization, individual batch normalization layers play an important role on the training with the HDR dataset. PSNR Ours (A-WGAN) WGAN-GP DRAGAN LSGAN(Local D) LSGAN GAN epoch Fig. 21: PNSR values of testing on the MIT-Adobe-5K daatset with di erent oneway GAN architectures which use di erent GAN formulas, GAN [3], LSGAN [7], DRAGAN [6] and WGAN-GP [4]. (Local D: using the local discriminator proposed by CycleGAN [10].)
24 24 Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, and Yung-Yu Chuang Discriminator loss Ours (A-WGAN) WGAN-GP DRAGAN LSGAN (Local D) LSGAN GAN epoch Fig. 22: Discriminator loss of training on the MIT-Adobe-5K dataset for several one-way GAN architectures, GAN [3], LSGAN [7], DRAGAN [6] and WGAN- GP [4]. The value can be used as a good indicator of convergence for our model. Discriminator loss two-way trained on HDR (without ibn) two-way trained on HDR (with ibn) two-way trained on MIT-Adobe-5K (without ibn) two-way trained on MIT-Adobe-5K (with ibn) epoch Fig. 23: Discriminator loss of training on the MIT-Adobe-5K dataset and our HDR dataset for the proposed two-way GAN architecture with and without individual BN. It shows that individual BN is crucial for training on the HDR dataset.
25 Supplementary Material: Deep Photo Enhancer 25 References 1. Aubry, M., Paris, S., Hasino, S.W., Kautz, J., Durand, F.: Fast local laplacian filters: Theory and applications. ACM Transactions on Graphics (TOG) 33(5), 167 (2014) 2. Bychkovsky, V., Paris, S., Chan, E., Durand, F.: Learning photographic global tonal adjustment with a database of input/output image pairs. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (June 2011) 3. Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y.: Generative adversarial nets. In: Advances in neural information processing systems (NIPS). pp (2014) 4. Gulrajani, I., Ahmed, F., Arjovsky, M., Dumoulin, V., Courville, A.: Improved training of Wasserstein GANs. In: Advances in neural information processing systems (NIPS) (2017) 5. Ignatov, A., Kobyshev, N., Vanhoey, K., Timofte, R., Van Gool, L.: DSLR-quality photos on mobile devices with deep convolutional networks. In: IEEE International Conference on Computer Vision (ICCV) (Oct 2017) 6. Kodali, N., Abernethy, J., Hays, J., Kira, Z.: On convergence and stability of GANs. In: arxiv preprint arxiv: (2017) 7. Mao, X., Li, Q., Xie, H., Lau, R.Y., Wang, Z., Paul Smolley, S.: Least squares generative adversarial networks. In: The IEEE International Conference on Computer Vision (ICCV) (Oct 2017) 8. Wang, S., Cho, W., Jang, J., Abidi, M.A., Paik, J.: Contrast-dependent saturation adjustment for outdoor image enhancement. JOSA A 34(1), (2017) 9. Wang, S., Zheng, J., Hu, H.M., Li, B.: Naturalness preserved enhancement algorithm for non-uniform illumination images. IEEE Transactions on Image Processing (TIP) 22(9), (2013) 10. Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: The IEEE International Conference on Computer Vision (ICCV) (2017)
Fast Perceptual Image Enhancement
Fast Perceptual Image Enhancement Etienne de Stoutz [0000 0001 5439 3290], Andrey Ignatov [0000 0003 4205 8748], Nikolay Kobyshev [0000 0001 6456 4946], Radu Timofte [0000 0002 1478 0402], and Luc Van
More informationArtistic Image Colorization with Visual Generative Networks
Artistic Image Colorization with Visual Generative Networks Final report Yuting Sun ytsun@stanford.edu Yue Zhang zoezhang@stanford.edu Qingyang Liu qnliu@stanford.edu 1 Motivation Visual generative models,
More informationEnhancing Symmetry in GAN Generated Fashion Images
Enhancing Symmetry in GAN Generated Fashion Images Vishnu Makkapati 1 and Arun Patro 2 1 Myntra Designs Pvt. Ltd., Bengaluru - 560068, India vishnu.makkapati@myntra.com 2 Department of Electrical Engineering,
More informationDeep Recursive HDRI: Inverse Tone Mapping using Generative Adversarial Networks
Deep Recursive HDRI: Inverse Tone Mapping using Generative Adversarial Networks Siyeong Lee, Gwon Hwan An, Suk-Ju Kang Department of Electronic Engineering, Sogang University {siyeong, ghan, sjkang}@sogang.ac.kr
More informationUniversity of Bristol - Explore Bristol Research. Peer reviewed version. Link to publication record in Explore Bristol Research PDF-document
Hepburn, A., McConville, R., & Santos-Rodriguez, R. (2017). Album cover generation from genre tags. Paper presented at 10th International Workshop on Machine Learning and Music, Barcelona, Spain. Peer
More informationCombination of Single Image Super Resolution and Digital Inpainting Algorithms Based on GANs for Robust Image Completion
SERBIAN JOURNAL OF ELECTRICAL ENGINEERING Vol. 14, No. 3, October 2017, 379-386 UDC: 004.932.4+004.934.72 DOI: https://doi.org/10.2298/sjee1703379h Combination of Single Image Super Resolution and Digital
More informationarxiv: v2 [cs.lg] 7 May 2017
STYLE TRANSFER GENERATIVE ADVERSARIAL NET- WORKS: LEARNING TO PLAY CHESS DIFFERENTLY Muthuraman Chidambaram & Yanjun Qi Department of Computer Science University of Virginia Charlottesville, VA 22903,
More informationarxiv: v2 [cs.cv] 3 Mar 2018
WESPE: Weakly Supervised Photo Enhancer for Digital Cameras Andrey Ignatov, Nikolay Kobyshev, Kenneth Vanhoey, Radu Timofte, Luc Van Gool ETH Zurich {andrey, nk, vanhoey, timofter, vangool}@vision.ee.ethz.ch
More informationLEARNING AN INVERSE TONE MAPPING NETWORK WITH A GENERATIVE ADVERSARIAL REGULARIZER
LEARNING AN INVERSE TONE MAPPING NETWORK WITH A GENERATIVE ADVERSARIAL REGULARIZER Shiyu Ning, Hongteng Xu,3, Li Song, Rong Xie, Wenjun Zhang School of Electronic Information and Electrical Engineering,
More informationarxiv: v1 [cs.cv] 4 Sep 2017
WESPE: Weakly Supervised Photo Enhancer for Digital Cameras Andrey Ignatov, Nikolay Kobyshev, Kenneth Vanhoey, Radu Timofte, Luc Van Gool ETH Zurich andrey.ignatoff@gmail.com, {nk, vanhoey, timofter, vangool}@vision.ee.ethz.ch
More informationDerek Allman a, Austin Reiter b, and Muyinatu Bell a,c
Exploring the effects of transducer models when training convolutional neural networks to eliminate reflection artifacts in experimental photoacoustic images Derek Allman a, Austin Reiter b, and Muyinatu
More informationThe Threshold Between Human and Computational Creativity. Pindar Van Arman
The Threshold Between Human and Computational Creativity Pindar Van Arman cloudpainter.com @vanarman One of Them is Human #1 Photo by Maiji Tammi that was recently shortlisted for the Taylor Wessing Prize.
More informationarxiv: v1 [cs.cv] 22 Sep 2018
Parametric Synthesis of Text on Stylized Backgrounds using PGGANs Mayank Gupta Conduent Labs mayank.gupta3@conduent.com Abhinav Kumar Conduent Labs abhinav.kumar@conduent.com Sriganesh Madhvanath ebay,
More informationDSLR-Quality Photos on Mobile Devices with Deep Convolutional Networks
DSLR-Quality Photos on Mobile Devices with Deep Convolutional Networks Andrey Ignatov 1, Nikolay Kobyshev 1, Radu Timofte 1, Kenneth Vanhoey 1, Luc Van Gool 1,2 1 Computer Vision Laboratory, ETH Zürich,
More informationVISUAL ANALOGIES BETWEEN ATARI GAMES FOR STUDYING TRANSFER LEARNING IN RL
VISUAL ANALOGIES BETWEEN ATARI GAMES FOR STUDYING TRANSFER LEARNING IN RL Doron Sobol 1, Lior Wolf 1,2 & Yaniv Taigman 2 1 School of Computer Science, Tel-Aviv University 2 Facebook AI Research ABSTRACT
More informationRefocusGAN: Scene Refocusing using a Single Image
RefocusGAN: Scene Refocusing using a Single Image Parikshit Sakurikar 1, Ishit Mehta 1, Vineeth N. Balasubramanian 2 and P. J. Narayanan 1 1 Center for Visual Information Technology, Kohli Center on Intelligent
More informationLearning Pixel-Distribution Prior with Wider Convolution for Image Denoising
Learning Pixel-Distribution Prior with Wider Convolution for Image Denoising Peng Liu University of Florida pliu1@ufl.edu Ruogu Fang University of Florida ruogu.fang@bme.ufl.edu arxiv:177.9135v1 [cs.cv]
More informationFilmy Cloud Removal on Satellite Imagery with Multispectral Conditional Generative Adversarial Nets
Filmy Cloud Removal on Satellite Imagery with Multispectral Conditional Generative Adversarial Nets Kenji Enomoto 1 Ken Sakurada 1 Weimin Wang 1 Hiroshi Fukui 2 Masashi Matsuoka 3 Ryosuke Nakamura 4 Nobuo
More informationWaveNet Vocoder and its Applications in Voice Conversion
The 2018 Conference on Computational Linguistics and Speech Processing ROCLING 2018, pp. 96-110 The Association for Computational Linguistics and Chinese Language Processing WaveNet WaveNet Vocoder and
More informationFace Recognition in Low Resolution Images. Trey Amador Scott Matsumura Matt Yiyang Yan
Face Recognition in Low Resolution Images Trey Amador Scott Matsumura Matt Yiyang Yan Introduction Purpose: low resolution facial recognition Extract image/video from source Identify the person in real
More informationUpscaling Beyond Super Resolution Using a Novel Deep Learning System
Upscaling Beyond Super Resolution Using a Novel Deep Learning System Pablo Navarrete Michelini pnavarre@boe.com.cn Hanwen Liu lhw@boe.com.cn BOE Technology Group Co., Ltd. BOE Technology Group Co., Ltd.
More informationfast blur removal for wearable QR code scanners
fast blur removal for wearable QR code scanners Gábor Sörös, Stephan Semmler, Luc Humair, Otmar Hilliges ISWC 2015, Osaka, Japan traditional barcode scanning next generation barcode scanning ubiquitous
More informationColorful Image Colorizations Supplementary Material
Colorful Image Colorizations Supplementary Material Richard Zhang, Phillip Isola, Alexei A. Efros {rich.zhang, isola, efros}@eecs.berkeley.edu University of California, Berkeley 1 Overview This document
More informationECE 599/692 Deep Learning Lecture 19 Beyond BP and CNN
ECE 599/692 Deep Learning Lecture 19 Beyond BP and CNN Hairong Qi, Gonzalez Family Professor Electrical Engineering and Computer Science University of Tennessee, Knoxville http://www.eecs.utk.edu/faculty/qi
More informationMISLGAN: AN ANTI-FORENSIC CAMERA MODEL FALSIFICATION FRAMEWORK USING A GENERATIVE ADVERSARIAL NETWORK
MISLGAN: AN ANTI-FORENSIC CAMERA MODEL FALSIFICATION FRAMEWORK USING A GENERATIVE ADVERSARIAL NETWORK Chen Chen *, Xinwei Zhao * and Matthew C. Stamm Dept. of Electrical and Computer Engineering, Drexel
More informationarxiv: v1 [cs.cv] 31 Dec 2018
Do GANs leave artificial fingerprints? arxiv:1812.11842v1 [cs.cv] 31 Dec 2018 Francesco Marra, Diego Gragnaniello, Luisa Verdoliva, Giovanni Poggi DIETI University Federico II of Naples Via Claudio 21,
More informationSemantic Segmentation in Red Relief Image Map by UX-Net
Semantic Segmentation in Red Relief Image Map by UX-Net Tomoya Komiyama 1, Kazuhiro Hotta 1, Kazuo Oda 2, Satomi Kakuta 2 and Mikako Sano 2 1 Meijo University, Shiogamaguchi, 468-0073, Nagoya, Japan 2
More informationarxiv: v1 [cs.lg] 2 Jan 2018
Deep Learning for Identifying Potential Conceptual Shifts for Co-creative Drawing arxiv:1801.00723v1 [cs.lg] 2 Jan 2018 Pegah Karimi pkarimi@uncc.edu Kazjon Grace The University of Sydney Sydney, NSW 2006
More informationCS354 Computer Graphics Computational Photography. Qixing Huang April 23 th 2018
CS354 Computer Graphics Computational Photography Qixing Huang April 23 th 2018 Background Sales of digital cameras surpassed sales of film cameras in 2004 Digital Cameras Free film Instant display Quality
More informationDeformable Deep Convolutional Generative Adversarial Network in Microwave Based Hand Gesture Recognition System
arxiv:1711.01968v2 [stat.ml] 22 Nov 2017 Deformable Deep Convolutional Generative Adversarial Network in Microwave Based Hand Gesture Recognition System Abstract Traditional vision-based hand gesture recognition
More informationIMAGE RESTORATION WITH NEURAL NETWORKS. Orazio Gallo Work with Hang Zhao, Iuri Frosio, Jan Kautz
IMAGE RESTORATION WITH NEURAL NETWORKS Orazio Gallo Work with Hang Zhao, Iuri Frosio, Jan Kautz MOTIVATION The long path of images Bad Pixel Correction Black Level AF/AE Demosaic Denoise Lens Correction
More informationDeep Learning Basics Lecture 9: Recurrent Neural Networks. Princeton University COS 495 Instructor: Yingyu Liang
Deep Learning Basics Lecture 9: Recurrent Neural Networks Princeton University COS 495 Instructor: Yingyu Liang Introduction Recurrent neural networks Dates back to (Rumelhart et al., 1986) A family of
More informationLearning to See in the Dark
Learning to See in the Dark Chen Chen UIUC Qifeng Chen Intel Labs Jia Xu Intel Labs Vladlen Koltun Intel Labs (a) Camera output with ISO 8,000 (b) Camera output with ISO 409,600 (c) Our result from the
More informationPredicting Range of Acceptable Photographic Tonal Adjustments
Predicting Range of Acceptable Photographic Tonal Adjustments Ronnachai Jaroensri Sylvain Paris Aaron Hertzmann Vladimir Bychkovsky Frédo Durand MIT CSAIL Adobe Research Adobe Research Facebook, Inc. MIT
More informationarxiv: v1 [cs.cv] 15 Nov 2018
IMAGE DECLIPPING WITH DEEP NETWORKS Shachar Honig & Michael Werman Department of Computer Science, The Hebrew University of Jerusalem arxiv:1811.06277v1 [cs.cv] 15 Nov 2018 ABSTRACT We present a deep network
More informationNumber Plate Detection with a Multi-Convolutional Neural Network Approach with Optical Character Recognition for Mobile Devices
J Inf Process Syst, Vol.12, No.1, pp.100~108, March 2016 http://dx.doi.org/10.3745/jips.04.0022 ISSN 1976-913X (Print) ISSN 2092-805X (Electronic) Number Plate Detection with a Multi-Convolutional Neural
More informationCROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS. Kuan-Chuan Peng and Tsuhan Chen
CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS Kuan-Chuan Peng and Tsuhan Chen Cornell University School of Electrical and Computer Engineering Ithaca, NY 14850
More informationSemantic Segmentation on Resource Constrained Devices
Semantic Segmentation on Resource Constrained Devices Sachin Mehta University of Washington, Seattle In collaboration with Mohammad Rastegari, Anat Caspi, Linda Shapiro, and Hannaneh Hajishirzi Project
More informationarxiv: v1 [cs.cv] 23 Dec 2017
Aerial Spectral Super-Resolution using Conditional Adversarial Networks Aneesh Rangnekar Nilay Mokashi Emmett Ientilucci Christopher Kanan Matthew Hoffman Rochester Institute of Technology {aneesh.rangnekar,
More informationDYNAMIC CONVOLUTIONAL NEURAL NETWORK FOR IMAGE SUPER- RESOLUTION
Journal of Advanced College of Engineering and Management, Vol. 3, 2017 DYNAMIC CONVOLUTIONAL NEURAL NETWORK FOR IMAGE SUPER- RESOLUTION Anil Bhujel 1, Dibakar Raj Pant 2 1 Ministry of Information and
More informationSIMULATION-BASED MODEL CONTROL USING STATIC HAND GESTURES IN MATLAB
SIMULATION-BASED MODEL CONTROL USING STATIC HAND GESTURES IN MATLAB S. Kajan, J. Goga Institute of Robotics and Cybernetics, Faculty of Electrical Engineering and Information Technology, Slovak University
More informationProject Title: Sparse Image Reconstruction with Trainable Image priors
Project Title: Sparse Image Reconstruction with Trainable Image priors Project Supervisor(s) and affiliation(s): Stamatis Lefkimmiatis, Skolkovo Institute of Science and Technology (Email: s.lefkimmiatis@skoltech.ru)
More informationDISCUSSION. 12th IAPR International Workshop on Graphics Recognition Kyoto, Japan - November Josep Lladós
GREC2017 FINAL PANEL DISCUSSION 12th IAPR International Workshop on Graphics Recognition Kyoto, Japan - November 9-10 2017 Josep Lladós Statistics in GREC series Statistics in GREC series A traditional
More informationA Saturation-based Image Fusion Method for Static Scenes
2015 6th International Conference of Information and Communication Technology for Embedded Systems (IC-ICTES) A Saturation-based Image Fusion Method for Static Scenes Geley Peljor and Toshiaki Kondo Sirindhorn
More informationSingle Scale image Dehazing by Multi Scale Fusion
Single Scale image Dehazing by Multi Scale Fusion Mrs.A.Dyanaa #1, Ms.Srruthi Thiagarajan Visvanathan *2, Ms.Varsha Chandran #3 #1 Assistant Professor, * 2 #3 UG Scholar Department of Information Technology,
More informationEXIF Estimation With Convolutional Neural Networks
EXIF Estimation With Convolutional Neural Networks Divyahans Gupta Stanford University Sanjay Kannan Stanford University dgupta2@stanford.edu skalon@stanford.edu Abstract 1.1. Motivation While many computer
More informationDoppler-Radar Based Hand Gesture Recognition System Using Convolutional Neural Networks
Doppler-Radar Based Hand Gesture Recognition System Using Convolutional Neural Networks Jiajun Zhang, Jinkun Tao, Jiangtao Huangfu, Zhiguo Shi College of Information Science & Electronic Engineering Zhejiang
More informationBiologically Inspired Computation
Biologically Inspired Computation Deep Learning & Convolutional Neural Networks Joe Marino biologically inspired computation biological intelligence flexible capable of detecting/ executing/reasoning about
More informationTravel Photo Album Summarization based on Aesthetic quality, Interestingness, and Memorableness
Travel Photo Album Summarization based on Aesthetic quality, Interestingness, and Memorableness Jun-Hyuk Kim and Jong-Seok Lee School of Integrated Technology and Yonsei Institute of Convergence Technology
More informationarxiv: v1 [cs.cv] 24 Nov 2017
End-to-End Deep HDR Imaging with Large Foreground Motions Shangzhe Wu Jiarui Xu Yu-Wing Tai Chi-Keung Tang Hong Kong University of Science and Technology Tencent Youtu arxiv:1711.08937v1 [cs.cv] 24 Nov
More informationarxiv: v1 [cs.cv] 1 Nov 2017
Learning deep features for source color laser printer identification based on cascaded learning Do-Guk Kim a, Jong-Uk Hou b, Heung-Kyu Lee b, a Graduate School of Information Security, KAIST, b School
More informationStudy Impact of Architectural Style and Partial View on Landmark Recognition
Study Impact of Architectural Style and Partial View on Landmark Recognition Ying Chen smileyc@stanford.edu 1. Introduction Landmark recognition in image processing is one of the important object recognition
More informationComputational Photography
Computational photography Computational Photography Digital Visual Effects Yung-Yu Chuang wikipedia: Computational photography h refers broadly to computational imaging techniques that enhance or extend
More informationDeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition. ECE 289G: Paper Presentation #3 Philipp Gysel
DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition ECE 289G: Paper Presentation #3 Philipp Gysel Autonomous Car ECE 289G Paper Presentation, Philipp Gysel Slide 2 Source: maps.google.com
More informationSinusoidal wave generating network based on adversarial learning and its application: synthesizing frog sounds for data augmentation
Sinusoidal wave generating network based on adversarial learning and its application: synthesizing frog sounds for data augmentation Sangwook Park, David K. Han, and Hanseok Ko Abstract Simulators that
More informationTRANSFORMING PHOTOS TO COMICS USING CONVOLUTIONAL NEURAL NETWORKS. Tsinghua University, China Cardiff University, UK
TRANSFORMING PHOTOS TO COMICS USING CONVOUTIONA NEURA NETWORKS Yang Chen Yu-Kun ai Yong-Jin iu Tsinghua University, China Cardiff University, UK ABSTRACT In this paper, inspired by Gatys s recent work,
More informationAutomatic Content-Aware Color and Tone Stylization
Automatic Content-Aware Color and Tone Stylization Joon-Young Lee Adobe Research Kalyan Sunkavalli Adobe Research Zhe Lin Adobe Research Xiaohui Shen Adobe Research In So Kweon KAIST Abstract We introduce
More informationAesthetic-Driven Image Enhancement by Adversarial Learning
Aesthetic-Driven Image Enhancement by Adversarial Learning Yubin Deng CUHK-SenseTime Joint Laboratory, The Chinese University of Hong Kong dy5@ie.cuhk.edu.hk ABSTRACT We introduce EnhanceGAN, an adversarial
More informationWadehra Kartik, Kathpalia Mukul, Bahl Vasudha, International Journal of Advance Research, Ideas and Innovations in Technology
ISSN: 2454-132X Impact factor: 4.295 (Volume 4, Issue 1) Available online at www.ijariit.com Hand Detection and Gesture Recognition in Real-Time Using Haar-Classification and Convolutional Neural Networks
More informationarxiv: v1 [cs.cv] 2 Jan 2019
Transferred Painting Ancient Painting Ancient Painting to Natural Image: A New Solution for Painting Processing Tingting Qiao Weijing Zhang Miao Zhang Zixuan Ma Duanqing Xu Zhejiang University, China {qiaott,
More informationSuggested projects for EL-GY 6123 Image and Video Processing (Spring 2018) 360 Degree Video View Prediction (contact: Chenge Li,
Suggested projects for EL-GY 6123 Image and Video Processing (Spring 2018) Updated 2/6/2018 360 Degree Video View Prediction (contact: Chenge Li, cl2840@nyu.edu) Pan, Junting, et al. "Shallow and deep
More informationA Fuller Understanding of Fully Convolutional Networks. Evan Shelhamer* Jonathan Long* Trevor Darrell UC Berkeley in CVPR'15, PAMI'16
A Fuller Understanding of Fully Convolutional Networks Evan Shelhamer* Jonathan Long* Trevor Darrell UC Berkeley in CVPR'15, PAMI'16 1 pixels in, pixels out colorization Zhang et al.2016 monocular depth
More informationResearch on Hand Gesture Recognition Using Convolutional Neural Network
Research on Hand Gesture Recognition Using Convolutional Neural Network Tian Zhaoyang a, Cheng Lee Lung b a Department of Electronic Engineering, City University of Hong Kong, Hong Kong, China E-mail address:
More informationRecent Advances in Image Deblurring. Seungyong Lee (Collaboration w/ Sunghyun Cho)
Recent Advances in Image Deblurring Seungyong Lee (Collaboration w/ Sunghyun Cho) Disclaimer Many images and figures in this course note have been copied from the papers and presentation materials of previous
More informationList of Publications for Thesis
List of Publications for Thesis Felix Juefei-Xu CyLab Biometrics Center, Electrical and Computer Engineering Carnegie Mellon University, Pittsburgh, PA 15213, USA felixu@cmu.edu 1. Journal Publications
More informationarxiv: v1 [cs.cv] 19 Apr 2018
Survey of Face Detection on Low-quality Images arxiv:1804.07362v1 [cs.cv] 19 Apr 2018 Yuqian Zhou, Ding Liu, Thomas Huang Beckmann Institute, University of Illinois at Urbana-Champaign, USA {yuqian2, dingliu2}@illinois.edu
More informationConsistent Comic Colorization with Pixel-wise Background Classification
Consistent Comic Colorization with Pixel-wise Background Classification Sungmin Kang KAIST Jaegul Choo Korea University Jaehyuk Chang NAVER WEBTOON Corp. Abstract Comic colorization is a time-consuming
More informationMARCO PEDERSOLI. Assistant Professor at ETS Montreal profs.etsmtl.ca/mpedersoli
MARCO PEDERSOLI Assistant Professor at ETS Montreal profs.etsmtl.ca/mpedersoli RESEARCH INTERESTS Visual Recognition, Efficient Deep Learning, Learning with Reduced Supervision, Data Exploration ACADEMIC
More informationarxiv: v2 [cs.cv] 2 Jul 2018
Yubin Deng Chen Change Loy Xiaoou Tang CUHK-Sensetime Joint Laboratory, The Chinese University of Hong Kong CUHK-Sensetime Joint Laboratory, The Chinese University of Hong Kong CUHK-Sensetime Joint Laboratory,
More informationChangyin Zhou. Ph.D, Computer Science, Columbia University Oct 2012
Changyin Zhou Software Engineer at Google X Google Inc. 1600 Amphitheater Parkway, Mountain View, CA 94043 E-mail: changyin@google.com URL: http://www.changyin.org Office: (917) 209-9110 Mobile: (646)
More informationIMage demosaicing (a.k.a. color-filter-array interpolation)
1 Joint Demosaicing and Denoising with Perceptual Optimization on a Generative Adversarial Network Weisheng Dong, Member, IEEE, Ming Yuan, Xin Li, Guangming Shi, Senior member, IEEE arxiv:1802.04723v1
More informationContinuous Gesture Recognition Fact Sheet
Continuous Gesture Recognition Fact Sheet August 17, 2016 1 Team details Team name: ICT NHCI Team leader name: Xiujuan Chai Team leader address, phone number and email Address: No.6 Kexueyuan South Road
More informationMLP for Adaptive Postprocessing Block-Coded Images
1450 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 10, NO. 8, DECEMBER 2000 MLP for Adaptive Postprocessing Block-Coded Images Guoping Qiu, Member, IEEE Abstract A new technique
More informationSemi-Supervised Haptic Material Recognition for Robots using Generative Adversarial Networks
Semi-Supervised Haptic Material Recognition for Robots using Generative Adversarial Networks Zackory Erickson, Sonia Chernova, and Charles C. Kemp Institute for Robotics and Intelligent Machines Georgia
More informationThermal Image Enhancement Using Convolutional Neural Network
SEOUL Oct.7, 2016 Thermal Image Enhancement Using Convolutional Neural Network Visual Perception for Autonomous Driving During Day and Night Yukyung Choi Soonmin Hwang Namil Kim Jongchan Park In So Kweon
More informationA Novel Algorithm for Hand Vein Recognition Based on Wavelet Decomposition and Mean Absolute Deviation
Sensors & Transducers, Vol. 6, Issue 2, December 203, pp. 53-58 Sensors & Transducers 203 by IFSA http://www.sensorsportal.com A Novel Algorithm for Hand Vein Recognition Based on Wavelet Decomposition
More informationNeural Network Part 4: Recurrent Neural Networks
Neural Network Part 4: Recurrent Neural Networks Yingyu Liang Computer Sciences 760 Fall 2017 http://pages.cs.wisc.edu/~yliang/cs760/ Some of the slides in these lectures have been adapted/borrowed from
More informationMulti-Modal Spectral Image Super-Resolution
Multi-Modal Spectral Image Super-Resolution Fayez Lahoud, Ruofan Zhou, and Sabine Süsstrunk School of Computer and Communication Sciences École Polytechnique Fédérale de Lausanne {ruofan.zhou,fayez.lahoud,sabine.susstrunk}@epfl.ch
More informationDurham Research Online
Durham Research Online Deposited in DRO: 11 June 2018 Version of attached le: Accepted Version Peer-review status of attached le: Peer-reviewed Citation for published item: Dong, Z. and Kamata, S. and
More informationWeiran Wang, On Column Selection in Kernel Canonical Correlation Analysis, In submission, arxiv: [cs.lg].
Weiran Wang 6045 S. Kenwood Ave. Chicago, IL 60637 (209) 777-4191 weiranwang@ttic.edu http://ttic.uchicago.edu/ wwang5/ Education 2008 2013 PhD in Electrical Engineering & Computer Science. University
More informationFree-hand Sketch Recognition Classification
Free-hand Sketch Recognition Classification Wayne Lu Stanford University waynelu@stanford.edu Elizabeth Tran Stanford University eliztran@stanford.edu Abstract People use sketches to express and record
More informationComputer Vision Seminar
Computer Vision Seminar 236815 Spring 2017 Instructor: Micha Lindenbaum (Taub 600, Tel: 4331, email: mic@cs) Student in this seminar should be those interested in high level, learning based, computer vision.
More informationarxiv: v3 [cs.cv] 18 Dec 2018
Video Colorization using CNNs and Keyframes extraction: An application in saving bandwidth Ankur Singh 1 Anurag Chanani 2 Harish Karnick 3 arxiv:1812.03858v3 [cs.cv] 18 Dec 2018 Abstract In this paper,
More informationA Chinese License Plate Recognition System
A Chinese License Plate Recognition System Bai Yanping, Hu Hongping, Li Fei Key Laboratory of Instrument Science and Dynamic Measurement North University of China, No xueyuan road, TaiYuan, ShanXi 00051,
More informationComparing Time and Frequency Domain for Audio Event Recognition Using Deep Learning
Comparing Time and Frequency Domain for Audio Event Recognition Using Deep Learning Lars Hertel, Huy Phan and Alfred Mertins Institute for Signal Processing, University of Luebeck, Germany Graduate School
More informationToward Joint Image Generation and Compression using Generative Adversarial Networks
1 Toward Joint Image Generation and Compression using Generative Adversarial Networks Byeongkeun Kang, Subarna Tripathi, and Truong Q. Nguyen, Fellow, IEEE arxiv:1901.07838v1 [cs.cv] 23 Jan 2019 Abstract
More informationLANDMARK recognition is an important feature for
1 NU-LiteNet: Mobile Landmark Recognition using Convolutional Neural Networks Chakkrit Termritthikun, Surachet Kanprachar, Paisarn Muneesawang arxiv:1810.01074v1 [cs.cv] 2 Oct 2018 Abstract The growth
More informationMethod Of Defogging Image Based On the Sky Area Separation Yanhai Wu1,a, Kang1 Chen, Jing1 Zhang, Lihua Pang1
2nd Workshop on Advanced Research and Technology in Industry Applications (WARTIA 216) Method Of Defogging Image Based On the Sky Area Separation Yanhai Wu1,a, Kang1 Chen, Jing1 Zhang, Lihua Pang1 1 College
More informationComparing Computer-predicted Fixations to Human Gaze
Comparing Computer-predicted Fixations to Human Gaze Yanxiang Wu School of Computing Clemson University yanxiaw@clemson.edu Andrew T Duchowski School of Computing Clemson University andrewd@cs.clemson.edu
More informationSimulated Programmable Apertures with Lytro
Simulated Programmable Apertures with Lytro Yangyang Yu Stanford University yyu10@stanford.edu Abstract This paper presents a simulation method using the commercial light field camera Lytro, which allows
More informationarxiv: v1 [cs.cv] 29 Mar 2017
Learning High Dynamic Range from Outdoor Panoramas Jinsong Zhang Jean-François Lalonde Université Laval, Québec, Canada jinsong.zhang.1@ulaval.ca, jflalonde@gel.ulaval.ca arxiv:1703.10200v1 [cs.cv] 29
More informationDomain Adaptation & Transfer: All You Need to Use Simulation for Real
Domain Adaptation & Transfer: All You Need to Use Simulation for Real Boqing Gong Tecent AI Lab Department of Computer Science An intelligent robot Semantic segmentation of urban scenes Assign each pixel
More informationA2-RL: Aesthetics Aware Reinforcement Learning for Automatic Image Cropping
A2-RL: Aesthetics Aware Reinforcement Learning for Automatic Image Cropping Debang Li Huikai Wu Junge Zhang Kaiqi Huang NLPR, Institute of Automation, Chinese Academy of Sciences {debang.li, huikai.wu}@cripac.ia.ac.cn
More informationA Recognition of License Plate Images from Fast Moving Vehicles Using Blur Kernel Estimation
A Recognition of License Plate Images from Fast Moving Vehicles Using Blur Kernel Estimation Kalaivani.R 1, Poovendran.R 2 P.G. Student, Dept. of ECE, Adhiyamaan College of Engineering, Hosur, Tamil Nadu,
More informationSemantic Localization of Indoor Places. Lukas Kuster
Semantic Localization of Indoor Places Lukas Kuster Motivation GPS for localization [7] 2 Motivation Indoor navigation [8] 3 Motivation Crowd sensing [9] 4 Motivation Targeted Advertisement [10] 5 Motivation
More informationNew Techniques for Preserving Global Structure and Denoising with Low Information Loss in Single-Image Super-Resolution
New Techniques for Preserving Global Structure and Denoising with Low Information Loss in Single-Image Super-Resolution Yijie Bei Alex Damian Shijia Hu Sachit Menon Nikhil Ravi Cynthia Rudin Duke University
More informationIEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, TNSESI
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, TNSESI-28--35 AF-DCGAN: Amplitude Feature Deep Convolutional GAN for Fingerprint Construction in Indoor Localization System Qiyue LI, Member, IEEE,
More informationLow frequency extrapolation with deep learning Hongyu Sun and Laurent Demanet, Massachusetts Institute of Technology
Hongyu Sun and Laurent Demanet, Massachusetts Institute of Technology SUMMARY The lack of the low frequency information and good initial model can seriously affect the success of full waveform inversion
More informationarxiv: v1 [cs.cv] 20 Jul 2018
QIN, WEI, MANDUCHI: AUTOMATIC SEMANTIC CONTENT REMOVAL 1 arxiv:1807.07696v1 [cs.cv] 20 Jul 2018 Automatic Semantic Content Removal by Learning to Neglect Siyang Qin siqin@soe.ucsc.edu Jiahui Wei jwei19@ucsc.edu
More informationA Single Image Haze Removal Algorithm Using Color Attenuation Prior
International Journal of Scientific and Research Publications, Volume 6, Issue 6, June 2016 291 A Single Image Haze Removal Algorithm Using Color Attenuation Prior Manjunath.V *, Revanasiddappa Phatate
More information