3d object classification via spherical projections · 6 90.87 88.95 24 91.02 89.23 12 91.36 89.45...

10
3D Object Classification via Spherical Projections Zhangjie Cao 1 , Qixing Huang 2 , and Ramani Karthik 3 1 School of Software Tsinghua University, China 2 Department of Computer Science University of Texas at Austin, USA 3 School of Mechanical Engineering Purdue University, USA International Conference on 3DVision, 2017 Z. Cao et al. Spherical Projections 3DV 2017 1/1

Upload: others

Post on 22-Jun-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 3D Object Classification via Spherical Projections · 6 90.87 88.95 24 91.02 89.23 12 91.36 89.45 contour-based 6 92.26 89.23 24 93.12 90.36 12 93.31 90.49 Accuracy w.r.t Elevation

3D Object Classification via Spherical Projections

Zhangjie Cao1, Qixing Huang2, and Ramani Karthik3

1School of SoftwareTsinghua University, China

2Department of Computer ScienceUniversity of Texas at Austin, USA

3School of Mechanical EngineeringPurdue University, USA

International Conference on 3DVision, 2017

Z. Cao et al. Spherical Projections 3DV 2017 1 / 1

Page 2: 3D Object Classification via Spherical Projections · 6 90.87 88.95 24 91.02 89.23 12 91.36 89.45 contour-based 6 92.26 89.23 24 93.12 90.36 12 93.31 90.49 Accuracy w.r.t Elevation

Motivation 3D Classification

Main-stream Methods

Two main-stream 3D classification methods: image-based and3D-based.

(a) Image-based (b) 3D-based

Spherical projections combine key advantages of these twomain-stream 3D classification methods.

Z. Cao et al. Spherical Projections 3DV 2017 2 / 1

Page 3: 3D Object Classification via Spherical Projections · 6 90.87 88.95 24 91.02 89.23 12 91.36 89.45 contour-based 6 92.26 89.23 24 93.12 90.36 12 93.31 90.49 Accuracy w.r.t Elevation

Spherical Projections Depth-based Projection

Depth-based Projection

CNN1

CNN1

CNN1

CNN1

ConcatMap CNN2 CNN3

60

60

Concat

fc layer

example shape 12 vertical stripe projection

1 horizontal stripe projection

convolution net for vertical stripe

cylindrical convolution net for

horizontal stripesoftmax loss

output of last fc layer of CNN2

output of last fc layer of CNN3

1 0 00

… …

30

x y

z

x y

z

Figure: Depth-based Projections and Networks

Z. Cao et al. Spherical Projections 3DV 2017 3 / 1

Page 4: 3D Object Classification via Spherical Projections · 6 90.87 88.95 24 91.02 89.23 12 91.36 89.45 contour-based 6 92.26 89.23 24 93.12 90.36 12 93.31 90.49 Accuracy w.r.t Elevation

Spherical Projections Depth-based Projection

Depth-based Projection

p

pq

q

d

d

1

2

1

2

1

2

(a) Depth-based Projection Method

kernel_sizekernel_size

copy

(b) Cylindrical Depth-based Projection

Figure: Details on Depth-based Projection

Depth values are recorded as the distance to the first hitting pointFirst compute depth values for vertices of a semi-regular quad-meshThen generate the depth value of other points by linear interpolation.

Z. Cao et al. Spherical Projections 3DV 2017 4 / 1

Page 5: 3D Object Classification via Spherical Projections · 6 90.87 88.95 24 91.02 89.23 12 91.36 89.45 contour-based 6 92.26 89.23 24 93.12 90.36 12 93.31 90.49 Accuracy w.r.t Elevation

Spherical Projections Contour-based Projection

Contour-based Projection

ConcatProjection CNN4

softmax loss

1

00

0

example shape contour projection

convolution net for contour projection

Figure: Contour-based Projections and Networks

Z. Cao et al. Spherical Projections 3DV 2017 5 / 1

Page 6: 3D Object Classification via Spherical Projections · 6 90.87 88.95 24 91.02 89.23 12 91.36 89.45 contour-based 6 92.26 89.23 24 93.12 90.36 12 93.31 90.49 Accuracy w.r.t Elevation

Experiments Setup

Experiments Setup

Datasets: ModelNet40, ShapeNetCoreParameter selection: cross-validation by jointly assessingMethods to compare with: Image-based methods: MVCNN,MVCNN-MultiRes; 3D-based methods: 3D ShapeNets, Voxnet,Volumetric CNN, OctNet; combined methods: FusionNet. All ofthese methods use the upright orientation but do not use the frontorientation.

Z. Cao et al. Spherical Projections 3DV 2017 6 / 1

Page 7: 3D Object Classification via Spherical Projections · 6 90.87 88.95 24 91.02 89.23 12 91.36 89.45 contour-based 6 92.26 89.23 24 93.12 90.36 12 93.31 90.49 Accuracy w.r.t Elevation

Experiments Results

Results

Accuracy of our approaches and the various baseline methods onModelNet40 and ShapeNetCore and two curated subsets.

Method ModelNet40 ShapeNetCore ModelNet40-SubI ShapeNetCore-SubI3D Shapenets 85.9 na 83.33 na

Voxnet 87.8 na 85.99 naFusionNet 90.80 na 89.54 na

Volumetric CNN 89.9 na 88.65 naMVCNN 92.31 88.93 91.22 88.64

MVCNN-MultiRes 93.8 90.01 92.60 90.00OctNet 87.83 88.03 86.45 87.85

depth-base pattern 91.36 89.45 90.25 89.13contour-based pattern 93.31 90.49 92.20 90.80

overall pattern 94.24 91.00 93.09 91.22

Z. Cao et al. Spherical Projections 3DV 2017 7 / 1

Page 8: 3D Object Classification via Spherical Projections · 6 90.87 88.95 24 91.02 89.23 12 91.36 89.45 contour-based 6 92.26 89.23 24 93.12 90.36 12 93.31 90.49 Accuracy w.r.t Elevation

Experiments Results

Results

Accuracy Before and After Pre-training on ModelNet40Method Before Pre-training After Pre-training

Accuracy (class) Accuracy (instance) Accuracy (class) Accuracy (instance)MVCNN 82.15 87.15 90.35 92.31

MVCNN-MultiRes 88.12 91.20 91.40 93.80depth-base pattern 80.44 86.09 87.32 91.36

contour-based pattern 88.33 91.48 91.16 93.31overall pattern 88.53 91.77 91.56 94.24

Accuracy Before and After Pre-training on ShapeNetCoreMethod Before Pre-training After Pre-training

Accuracy (class) Accuracy (instance) Accuracy (class) Accuracy (instance)MVCNN 67.84 84.55 78.79 88.93

MVCNN-MultiRes 75.34 88.4 79.01 90.01depth-base pattern 70.55 85.15 78.84 89.45

contour-based pattern 74.52 88.54 79.38 90.49overall pattern 75.60 88.87 80.38 91.00

Z. Cao et al. Spherical Projections 3DV 2017 8 / 1

Page 9: 3D Object Classification via Spherical Projections · 6 90.87 88.95 24 91.02 89.23 12 91.36 89.45 contour-based 6 92.26 89.23 24 93.12 90.36 12 93.31 90.49 Accuracy w.r.t Elevation

Experiments Results

AnalysisAccuracy w.r.t Number of Views for Depth and Contour Pattern onModelNet40 and ShapeNetCore

Pattern Number of Views ModelNet40 ShapeNetCore

depth-based6 90.87 88.9524 91.02 89.2312 91.36 89.45

contour-based6 92.26 89.2324 93.12 90.3612 93.31 90.49

Accuracy w.r.t Elevation degree of the strip parallel to the latitude

Elevation Degree15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90

Accu

racy

80

85

90

95ModelNet40ShapeNetCore

Z. Cao et al. Spherical Projections 3DV 2017 9 / 1

Page 10: 3D Object Classification via Spherical Projections · 6 90.87 88.95 24 91.02 89.23 12 91.36 89.45 contour-based 6 92.26 89.23 24 93.12 90.36 12 93.31 90.49 Accuracy w.r.t Elevation

Summary

Summary

We introduce a spherical representation exploiting both depthvariation and contour information which can capture geometric detailsand data dependencies across the entire object.We develop deep neural networks incorporating large-scale labeledimages for training to classify spherical representations of 3D objects.In the future, we plan to define convolutional kernels directly onspherical domains.

Z. Cao et al. Spherical Projections 3DV 2017 10 / 1