3D-Assisted Image Feature Synthesis for Novel Views of an Object

3D-Assisted Image Feature Synthesis for Novel Views of an Object
Hao Su*, Fan Wang*, Li Yi, Leonidas Guibas
(* Equal contribution)

View-agnostic Image Retrieval
Retrieval using AlexNet features
[Figure: query image]

Cross-view Image Comparison
The comparison is between the underlying 3D objects

Reconstruct 3D and then compare?
Su et al., SIGGRAPH 14; Kar et al., CVPR 15; Huang et al., SIGGRAPH 15

Single-image based 3D Reconstruction is hard
Many dependencies. Not robust. Slow.
Common dependencies: fg/bg segmentation, keypoint detection, 2D image part segmentation, 3D shape part segmentation, 2D-3D correspondence, non-convex iterative optimization

Our Formulation: Novel View Feature Synthesis Observed view (HoG feature as an example)

Our Novel View Feature Synthesis Results (HoG feature as an example)

Outline: Motivation, Approach, Applications, Method Diagnosis, Conclusion

Key idea
Learn from a dataset of many objects with multi-view features
The dataset is generated by rendering large-scale 3D models (http://shapenet.cs.stanford.edu)

3D-assisted Feature Synthesis: Nearest Neighbour
Observed view image, novel view feature (HoG feature as an example)
Strong assumption: a very similar model exists
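
The nearest-neighbour baseline on this slide can be sketched as follows. This is a minimal illustration, not the authors' code: the function name `synthesize_nearest_neighbour`, the array layout, and the use of plain L2 distance are all assumptions made for the example. It assumes every database shape has been rendered at both the observed and the novel view and that one feature vector per view is stored.

```python
import numpy as np

def synthesize_nearest_neighbour(observed_feat, db_observed, db_novel):
    """Borrow the novel-view feature of the closest database shape.

    observed_feat: (d_obs,) feature of the query at the observed view.
    db_observed:   (n_shapes, d_obs) database features at the observed view.
    db_novel:      (n_shapes, d_novel) database features at the novel view.
    All names and shapes are hypothetical, chosen for this sketch.
    """
    # L2 distance from the query to every database shape at the observed view
    dists = np.linalg.norm(db_observed - observed_feat, axis=1)
    best = int(np.argmin(dists))
    # The synthesized feature is simply the neighbour's novel-view feature
    return db_novel[best], best
```

The result is only as good as the closest shape in the database, which is the strong assumption the slide points out.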

3D-assisted Feature Synthesis: Multiple Shapes Observed view image... Novel view feature (HoG feature as an example)

3D-assisted Feature Synthesis: Multiple Shapes Attention: Brain games start!

Pipeline
Observed view image, novel view feature (HoG feature as an example)
Locally Linear Reconstruction: the observed view is expressed as a weighted sum of database shapes (example weights 0.1 + 0.4 + 0.3 + ...)
Inter-shape relationship: the same weights are used to form the novel view feature
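
The locally linear reconstruction step can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the method as implemented by the authors: it works on whole feature vectors, picks neighbours by L2 distance, and uses the standard regularized weight solve from locally linear embedding. The names `lle_weights`, `synthesize_novel_view`, `k`, and `reg` are hypothetical.

```python
import numpy as np

def lle_weights(query, neighbours, reg=1e-3):
    """Weights summing to 1 that approximately reconstruct `query`
    as a linear combination of the rows of `neighbours`."""
    diffs = neighbours - query                    # (k, d)
    gram = diffs @ diffs.T                        # (k, k) local Gram matrix
    # Regularize so the solve is stable when k exceeds the feature dimension
    gram += (reg * np.trace(gram) + 1e-12) * np.eye(len(neighbours))
    w = np.linalg.solve(gram, np.ones(len(neighbours)))
    return w / w.sum()

def synthesize_novel_view(query_obs, db_obs, db_novel, k=3):
    """Reconstruct the query at the observed view from its k nearest
    database shapes, then reuse the weights at the novel view."""
    dists = np.linalg.norm(db_obs - query_obs, axis=1)
    idx = np.argsort(dists)[:k]
    w = lle_weights(query_obs, db_obs[idx])
    return w @ db_novel[idx], idx, w
```

Reusing the observed-view weights at the novel view is what the slides call the inter-shape relationship; how well that transfer works for a given pair of views is the question the surrogate suitability slides address.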

Surrogate Relationship Discovery
Observed view, shape collection, novel view
Can the inter-shape relationship found at the observed view be reused at the novel view?
Surrogate suitability matrix

Formal Definition of Surrogate Suitability
Shape collection; A is the observed view, B is the novel view
Assume A, B are discrete random variables
(a_1, b_1), (a_2, b_2), ... are i.i.d. samples of (A, B)
How well can the sameness at A predict the sameness at B? (Cross-view transfer of relationships)
Surrogate suitability: $\gamma(A; B) = \log P(b_1 = b_2 \mid a_1 = a_2)$
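
The definition above can be estimated directly by counting collisions among sample pairs. The sketch below is a naive plug-in estimator written for illustration only; it is not the theoretically optimal algorithm mentioned later in the talk, and the function name `surrogate_suitability` and the input format (a list of hashable `(a, b)` tuples) are assumptions.

```python
import math
from collections import Counter

def surrogate_suitability(pairs):
    """Plug-in estimate of gamma(A; B) = log P(b1 = b2 | a1 = a2)
    from a list of (a, b) samples with hashable entries."""
    def collisions(counter):
        # Number of unordered sample pairs that fall into the same bin
        return sum(c * (c - 1) // 2 for c in counter.values())

    same_a = collisions(Counter(a for a, _ in pairs))   # pairs agreeing on A
    same_ab = collisions(Counter(pairs))                # pairs agreeing on A and B
    if same_a == 0:
        raise ValueError("no two samples agree on A; estimate is undefined")
    if same_ab == 0:
        return float("-inf")
    return math.log(same_ab / same_a)
```

A value of 0 means agreement at A always implies agreement at B in the sample; more negative values mean A is a poorer surrogate for B.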

Estimation of Surrogate Suitability
Derivation shows an expression in terms of H_R, the Rényi entropy
Sample complexity: tight bound Θ(V_A + V_B), where V_A and V_B are the vocabulary sizes of A and B
A theoretically optimal algorithm is proposed that reaches the bound
Strong connection with mutual information
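
The formula shown on the slide did not survive in the transcript. One derivation that follows from the definition of surrogate suitability, assuming H_R denotes the order-2 Rényi (collision) entropy, is sketched below; the notation $H_2$ and $p(\cdot)$ is introduced here for illustration.

```latex
\begin{align*}
\gamma(A; B)
  &= \log P(b_1 = b_2 \mid a_1 = a_2)
   = \log \frac{P(a_1 = a_2,\; b_1 = b_2)}{P(a_1 = a_2)} \\
  &= \log \sum_{a,b} p(a,b)^2 \;-\; \log \sum_{a} p(a)^2 \\
  &= H_2(A) - H_2(A, B),
  \qquad \text{where } H_2(X) = -\log \sum_{x} p(x)^2 .
\end{align*}
```

Under this reading, $\gamma(A; B)$ is the collision-entropy analogue of the negative conditional entropy $-H(B \mid A) = H(A) - H(A, B)$. Since mutual information is $I(A; B) = H(B) - H(B \mid A)$, the quantity $H_2(B) + \gamma(A; B)$ plays a comparable role, which is one way to interpret the stated connection with mutual information.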

More Visualization of Surrogate Suitability Matrix
[Figure: surrogate suitability matrices; axes: novel view, observed view]

Review of Pipeline
Observed view image, novel view feature (reconstruction weights such as 0.1 + 0.4 + 0.3 + ...)
Intra-shape relationship: knowledge transfer from observed view to novel view
Inter-shape relationship: knowledge transfer from 3D shape database to new instance

Application: Cross-view localized image comparison

Cross-view Image Retrieval

Application: View-agnostic Image Retrieval
[Figure: retrieval results for HoG with L2 distance versus ours (combined HoG); labels: vertical bars, swivel base]

Part-based View-agnostic Image Retrieval

Generalizability to Many Feature Types
Task: fine-grained retrieval (images and annotations are from ImageNet)
Metric: Average Precision
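
For reference, Average Precision for a single query can be computed as below. This is the common textbook form, given as a sketch; the slides do not state which exact variant was used. The function name `average_precision` is hypothetical, and the code assumes every relevant item appears somewhere in the ranked list.

```python
def average_precision(ranked_relevance):
    """Average Precision for one query.

    ranked_relevance: iterable of truthy/falsy flags, one per retrieved
    item in rank order (best match first).
    """
    hits = 0
    precisions = []
    for k, rel in enumerate(ranked_relevance, start=1):
        if rel:
            hits += 1
            # Precision at the rank of each relevant item
            precisions.append(hits / k)
    # Mean over relevant items; 0.0 when nothing relevant was retrieved
    return sum(precisions) / len(precisions) if precisions else 0.0
```

For example, a ranking with relevant items at positions 1 and 3 gives (1/1 + 2/3) / 2.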

How many shapes are sufficient? 200 (Measured by Average Precision on Fine-grained retrieval for Chairs)

How many neighboring shapes for interpolation? 80 (Measured by Average Precision on Fine-grained retrieval for Chairs)

How well can one view predict another view?
Controlled diagnosis on renderings
Metric: cross-view retrieval rank
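
The slide does not define the cross-view retrieval rank. One plausible reading, given here purely as an assumption, is the position of the true object when candidates at the target view are sorted by distance to the synthesized feature. The function name `retrieval_rank` and the use of L2 distance are hypothetical.

```python
import numpy as np

def retrieval_rank(query_feat, candidate_feats, true_index):
    """1-based rank of the true match among candidates sorted by L2 distance.

    query_feat:      (d,) synthesized feature at the target view.
    candidate_feats: (n, d) features of rendered candidates at that view.
    true_index:      row of candidate_feats showing the same object.
    """
    dists = np.linalg.norm(candidate_feats - query_feat, axis=1)
    # Count candidates strictly closer than the true match
    return int(np.sum(dists < dists[true_index])) + 1
```

A rank of 1 means the synthesized feature retrieves the correct object first; averaging the rank over many objects and view pairs gives one number per pair of views.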

Conclusion
A novel framework for synthesizing object features at novel views.
A 3D shape database provides the knowledge for feature synthesis.
For relationship transfer, surrogate suitability is defined, a type of predictability between random variables.
A theoretically optimal estimator is proposed.

Thank you!