feihu zhang, victor prisacariu, ruigang yang, philip …title your title here: maybe add some...

5. SGM to SGA Layer : ➢ User-defined param ( 1 , 2 ) --> learnable weights ( 1 , … 4 ): - Learnable and adaptive in different scenes and locations. ➢ Second/internal “min” --> “max” selection: - Maximize the probability at the ground truth labels. - Avoid zeros and negatives, more effective. ➢ First “min” --> weighted “sum”: - Proven effective in [Springenberg, et al, 2014], no loss of accuracy. - Reduce fronto-parallel surfaces in large textureless regions. - Avoid zeros and negatives. 6. LGA Layer : ➢ Learn guided 3×× filtering kernel for each location/pixel. ➢ Locally refine thin structures and edges. ➢ Recover loss of accuracy caused by down-sampling. 1. Key Steps of Stereo Matching : ➢ Feature Extraction - Patches [Zbontar et al. 2015], Pyramid [Chang et al. 2018]. - Encoder-decoder [Kendall et al. 2017], etc. ➢ Matching Cost Aggregation - Feature based matching cost is often ambiguous. - Wrong matches easily have a lower cost than correct ones. ➢ Disparity Estimation - Classification loss, Disparity regression [Kendall et al. 2017]. 2. Problem and Target : Matching Cost Aggregation: ➢ Current Deep Neural Networks: - Only 2D/3D convolutions ➢ Traditional Methods: - Geometric, Optimization - SGM [Hirschmuller. 2008], CostFilter [Hosni et al. 2013], etc. Formulate Traditional Geometric & Optimization into Neural Networks. 3. Contributions : ➢ Semi-Global Aggregation (SGA) Layer - Differentiable SGM. - Aggregate Matching Costs Over Whole Image. ➢ Local Guided Aggregation (LGA) Layer - Learn Guided Filtering. - Refine Thin Structures and Edges. - Recover Loss of Accuracy Caused by Down-sampling. 4. Energy Function an d SGM : ➢ Approximate Solution: SGM GA-Net: Guided Aggregation Net for End-to-end Stereo Matching Feihu Zhang, Victor Prisacariu, Ruigang Yang, Philip H.S. Torr University of Oxford, Baidu Research Code: https://github.com/feihuzhang/GANet SGA Layer LGA Layer SGM Equation SGA Layer 7. Experimental Result s : Sum of Matching Costs Smoothness Penalties Ground Truth Our GANet-2 GC-Net [Kendall et al. 2017] Input Models 3D conv layers GA layers Avg. EPE (pixel) Error rate (%) GC-Net 19 - 1.80 15.6 PSMNet 35 - 1.09 12.1 GANet-15 15 5 0.84 9.9 GANet-deep 22 9 0.78 8.7 ➢ Evaluation and Comparisons on SceneFlow Dataset Models KITTI 2012 benchmark KITTI 2015 benchmark Non-Occluded All Area Non-Occluded All Area GC-Net 1.77 2.30 2.61 2.87 PSMNet 1.49 1.89 2.14 2.32 GANet-15 1.36 1.80 1.73 1.93 GANet-deep 1.19 1.60 1.63 1.81 ➢ Evaluation and Comparisons on KITTI Benchmarks Our GANet-15 PSMNet [Chang et al. 2018] Input GC-Net [Kendall et al. 2017] - User-defined parameters. - Hard to tune. - Fixed for all locations. - Produce only zeros. - Not immediately differentiable. - Produce fronto-parallel surfaces. Cannot be Used in Deep Neural Networks SGA ... … SGA SGA .. .. LGA max Cost Aggregation Feature Extraction Cost Volume Disparity Estimation

Upload: others

Post on 04-Jul-2020

1 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

TRANSCRIPT

5. SGM to SGA Layer:

➢ User-defined param (𝑃1, 𝑃2) --> learnable weights (𝑊1, … 𝑊4):

- Learnable and adaptive in different scenes and locations.

➢ Second/internal “min” --> “max” selection:

- Maximize the probability at the ground truth labels.

- Avoid zeros and negatives, more effective.

➢ First “min” --> weighted “sum”:

- Proven effective in [Springenberg, et al, 2014], no loss of accuracy.

- Reduce fronto-parallel surfaces in large textureless regions.

- Avoid zeros and negatives.

6. LGA Layer:

➢ Learn guided 3 × 𝑘 × 𝑘 filtering kernel for each location/pixel.

➢ Locally refine thin structures and edges.

➢ Recover loss of accuracy caused by down-sampling.

1. Key Steps of Stereo Matching:

➢ Feature Extraction

- Patches [Zbontar et al. 2015], Pyramid [Chang et al. 2018].

- Encoder-decoder [Kendall et al. 2017], etc.

➢ Matching Cost Aggregation

- Feature based matching cost is often ambiguous.

- Wrong matches easily have a lower cost than correct ones.

➢ Disparity Estimation

- Classification loss, Disparity regression [Kendall et al. 2017].

2. Problem and Target:

Matching Cost Aggregation:

➢ Current Deep Neural Networks:

- Only 2D/3D convolutions

➢ Traditional Methods:

- Geometric, Optimization

- SGM [Hirschmuller. 2008], CostFilter [Hosni et al. 2013], etc.

Formulate Traditional Geometric & Optimization into Neural Networks.

3. Contributions:

➢ Semi-Global Aggregation (SGA) Layer

- Differentiable SGM.

- Aggregate Matching Costs Over Whole Image.

➢ Local Guided Aggregation (LGA) Layer

- Learn Guided Filtering.

- Refine Thin Structures and Edges.

- Recover Loss of Accuracy Caused by Down-sampling.

4. Energy Function and SGM:

➢ Approximate Solution: SGM

GA-Net: Guided Aggregation Net for End-to-end Stereo MatchingFeihu Zhang, Victor Prisacariu, Ruigang Yang, Philip H.S. Torr

University of Oxford, Baidu Research

Code: https://github.com/feihuzhang/GANet

SGA Layer LGA Layer

SGM Equation SGA Layer

7. Experimental Results:

Sum of Matching Costs

Smoothness

Penalties

Ground TruthOur GANet-2

GC-Net [Kendall et al. 2017]Input

Models 3D conv layers GA layers Avg. EPE (pixel) Error rate (%)

GC-Net 19 - 1.80 15.6

PSMNet 35 - 1.09 12.1

GANet-15 15 5 0.84 9.9

GANet-deep 22 9 0.78 8.7

➢ Evaluation and Comparisons on SceneFlow Dataset

ModelsKITTI 2012 benchmark KITTI 2015 benchmark

Non-Occluded All Area Non-Occluded All Area

GC-Net 1.77 2.30 2.61 2.87

PSMNet 1.49 1.89 2.14 2.32

GANet-15 1.36 1.80 1.73 1.93

GANet-deep 1.19 1.60 1.63 1.81

➢ Evaluation and Comparisons on KITTI Benchmarks

Our GANet-15PSMNet [Chang et al. 2018]

Input GC-Net [Kendall et al. 2017]

- User-defined parameters.- Hard to tune.- Fixed for all locations.

- Produce only zeros.- Not immediately differentiable.- Produce fronto-parallel surfaces.

Cannot be Used in

Deep Neural Networks

... …

A .. .. LG

max

Cost AggregationFeature Extraction Cost Volume Disparity Estimation

https://github.com/feihuzhang/GANet

IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ...cryptography Feihu Xu, Marcos Curty, Bing Qi, and Hoi-Kwong Lo (Invited Paper) Abstract—In theory, quantum key distribution (QKD) provides

LATEST TRENDS in APPLIED andMircea Boşcoianu, Vasile Prisacariu, Ionică Cîrciu, Andrei Luchian, Calin Ciufudean. The Use of Ohmic Heating in Processing of Food Industry . 102

Online Global Non-rigid Registration for 3D Object ...graphics.cs.uh.edu/wp-content/papers/2018/2018-PG-3DObjectScanning.pdflar idea, Prisacariu et al. [KPM16] developed an online

Lista lucrari 2013 Mwww › publications › lucrari_2013.pdf49. Optimizing mechanical performance of nanostructured polyurethane elastomers and films C. Prisacariu, S. Coseri Chemicke

Yazi Zhong Feihu Chen 2,b - Atlantis PressStudy on the Uncertainty in Meishan Culture Park Construction and Art’s Randomness Yazi Zhong 1, a *, Feihu Chen 2,b 1 Dormitory 15 , South

Accurate 3D Pose Estimation From a Single Depth ImageAccurate 3D Pose Estimation From a Single Depth Image Mao Ye1 Xianwang Wang2 Ruigang Yang1 Liu Ren3 Marc Pollefeys4 University

1 Combining Approximate Geometry with VDTM – A Hybrid Approach to 3D Video Teleconferencing Celso Kurashima 2, Ruigang Yang 1, Anselmo Lastra 1 1 Department

Organization Structure - SolarPACES...Dr. Feihu Sun, Institute of Electrical Engineering, Chinese Academy of Sciences 10:15-10:35 Coffee Break 10:35-11:00 Cogeneration technology with

EVALUATION OF IMAGE QUALITY OF EXPERIENCE IN … · EVALUATION OF IMAGE QUALITY OF EXPERIENCE IN CONSIDERATION OF VIEWING DISTANCE (INVITED PAPER) Ruigang Fang, Dapeng Wu University

Restoring 2D Content from Distorted Documents Michael S. Brown,Mingxuan Sun, Ruigang Yang,Lin Yun, W.Brent Seales

Favre- and Reynolds-averaged velocity measurements ...Favre- and Reynolds-averaged velocity measurements: Interpreting PIV and LDA measurements in combustion M. Mustafa Kamal⇑, Ruigang

Ian Reid University of Oxfordadame/pdfs/2013_CVPR_dame.pdf · Dense Reconstruction Using 3D Object Shape Priors Amaury Dame, Victor A. Prisacariu, Carl Y. Ren University of Oxford

Segment Graph Based Image Filtering: Fast Structure ... · Segment Graph Based Image Filtering: Fast Structure-Preserving Smoothing Feihu Zhang1,2 Longquan Dai2 Shiming Xiang2 Xiaopeng

Using Adaboost to Detect and Segment Characters from ... · Using Adaboost to Detect and Segment Characters from Natural Scenes Kaihua Zhu Feihu Qi Renjie Jiang Li Xu Shanghai Jiao

Ian Reid University of Oxford...Dense Reconstruction Using 3D Object Shape Priors Amaury Dame, Victor A. Prisacariu, Carl Y. Ren University of Oxford {adame,victor,carl}@robots.ox.ac.ukIan

Robust 3D Hand Tracking for Human Computer Interaction 3D... · Robust 3D Hand Tracking for Human Computer Interaction Victor Adrian Prisacariu Department of Engineering Science University

arXiv:1410.0925v3 [cs.CV] 23 Oct 2014victor/infinitam/files/itmtech...2014/10/27 · * Olaf K¨ahler and Victor Adrian Prisacariu contributed equally to this work 1 arXiv:1410.0925v3

Supporting Information mimic a multienzyme-cascade ... · mimic a multienzyme-cascade antioxidative system Ruizhen Tian, XiaoTong Fan, Shengda liu, Feihu yang, Quan luo, Chunxi Hou,

Scanned Document - Ion Neculce, Iașiprimaria-ionneculce.ro/.../03/Prisacariu-Constantin... · Title: Scanned Document Created Date: 3/30/2017 9:44:51 AM

Uniﬁcation and Theories Research Report No. Sergiu Bursuc Cristian Prisacariu … › ... › bursuc.prisacariu10unification.pdf · Uniﬁcation and Matching in Complex Theories

Cristina Prisacariu - Home - Springer978-3-7091-0514-6/1.pdf · Foreword The World Wide Web oﬀers wonderful resources. This is a comfortable medium for today, especially for those

Exploiting Occlusion in Non-Line-of-Sight Active Imaging · Exploiting Occlusion in Non-Line-of-Sight Active Imaging Christos Thrampoulidis , Gal Shulkind , Feihu Xu, William T. Freeman

Cronfa - Swansea University Open Access Repository · Short communication Flow rates in perfusion bioreactors to maximise mineralisation in bone tissue engineering in vitro Feihu

of spectrally factorable biphotons - arXivEfﬁcient generation and characterization of spectrally factorable biphotons Changchen Chen,1 Cao Bo,2 Murphy Yuezhen Niu,1;3 Feihu Xu,1

Lecture 4 - robots.ox.ac.ukvictor/teaching/pdfs/B1... · Lecture 4 B1 Optimization -Trinity 2019 / Michaelmas 2020-V. A. Prisacariu. One (global) optima Multiple optima Cost Functions

Substorm Activity during CME and CIR Driven Storms Smitha Thampi, Diwakar Tiwari, Ruigang Wang, Hui Zhang, Ling Qian Zhang, Yihua Zheng Tutor: Robert L

arXiv:1811.02146v5 [cs.CV] 9 Apr 2019 · TrafﬁcPredict: Trajectory Prediction for Heterogeneous Trafﬁc-Agents Yuexin Ma 1;2, Xinge Zhu3, Sibo Zhang , Ruigang Yang1, Wenping Wang2,

Zhiyuan Tang , Zhongfa Liao, Feihu Xu, Bing Qi, Li Qian, and Hoi-Kwong Lo

Continuous Signed Distance Functions for 3D Vision · tation introduced by Prisacariu et al. [42,21,58]: storing objects as discrete SDFs with the space of shape variations built

Incremental Dense Semantic Stereo Fusion for Large-Scale …niessner/papers/2015/3... · 2017-03-29 · Victor A. Prisacariu Olaf K¨ahler David W. Murray Shahram Izadi Patrick P

InﬁniTAM v3: A Framework for Large-Scale 3D Reconstruction ... · InﬁniTAM v3: A Framework for Large-Scale 3D Reconstruction with Loop Closure Victor Adrian Prisacariu [email protected]

gSLICr: SLIC superpixels at over 250Hz2008/09/20 · Victor Adrian Prisacariu [email protected] University of Oxford Ian D Reid [email protected] University of Adelaide

3D Reconstruction for AR - ARM architecture › cfs-file › __key › communityserver... · 2018-08-19 · 3D Reconstruction for AR Prof Victor Adrian Prisacariu Active Vision Lab

An Introduction to Computer Graphicsdmn.netlab.uky.edu/~seales/cs100/lecture/Ruigang-2015.pdf · Wednesday, October 07, 2015 4 Definitions •The creation, display, and storage of

Integrating Object Detection with 3D Tracking …Integrating Object Detection with 3D Tracking Towards a Better Driver Assistance System Victor Adrian Prisacariu 1, Radu Timofte 2,