min sun: 3d object detection

3D object categorization, detection, and viewpoint classification

Final Presentation

Min SunEECS, University of Michigan at Ann Arbor

Mentor: Gary Bradski

Goal: Viewpoint ClassificationGoal: Detection and Categorization

Approaches

• Discriminative Codeword (Random Forest), using Random Forest

• Hough voting for each viewing region

Mouse, V1

Mouse, V2

First Step: System implementation

• Using OpenCV and octave to re-implement the system

• Old system in Matlab: slow and not open source

• New system is fast and open source

• Speed-up detection from ~2 minutes to ~5 seconds for single object class

• Create a ROS node (rf_detector) to recognize object online

Challenges

• Need good shape descriptor for objects with less texture

• Need to have a multi-class object detector to detect multiple object classes at the same time

Second Step: system upgrade

• Exploring different shape features:

1. Histogram Oriented Gradients (opencv)

2. Geometric Blur (geometric_blur in ROS)

3. Berkeley natural boundary (Nb) detector

Berkeley (Nb) OpenCV

Conclusion

• Hog has similar performance as Geometric blur+natural boundary detector(Nb)

• It takes 3 minutes to compute Natural boundary(Nb) for each image

• Hog is fast and almost the best

Recall Mouse Stapler

Gb+Nb 28% 37%

Hog+Nb 25% 45%

Hog 30% 35%

Second Step: system upgrade

• Multi-class Random Forest

Stapler

Mug & Mouse

Third Step: 3d information

• Using stereo depth to sample image patches corresponding to fix physical size to avoid scale search

• Using Dan’s shape spectral and spin image descriptors in descriptor_3d (ROS pkg)

• Combine both Hog and 3d descriptors

Data collection

• Table top object classes: mice, staplers, and mugs

• Collect aligned images and dense stereo point clouds

Multiple Views

Third Step: Results

staplermug

mouse stapler mug

Stapler

Third Step: Comparison

Average Precision

ClassificationAccuracy

spin 0.213 0.2

shape 0.138 0.4

hog 0.635 0.73

hog+spin 0.612 0.7

Hog+shape 0.67 0.72

Working system

• Texture_light_on_off node aligns images w/o texture light and dense stereo point clouds

• Table top object detector (t2obj) segments out the point clouds of table top objects

• Finally, rf_detector recognizes object locations, classes, and viewpoints.

Results: miceRecognition Table top segmentation

Results: mugsRecognition Table top segmentation

Results: staplersRecognition Table top segmentation

ResultsRecognition Table top segmentation

• Train on 3d+image, Test on image only

Use image patches of fix physical size to detect objects and infer the 3d position of the supporting image patches -> Object Pop-Up

• Vote for object center directly in 3d

Make the model fully rotational invariant and more compact

Future work

Thank you

min sun: 3d object detection

Documents

1 min-entropy latent model for weakly supervised object...

tourschedule - orange travel suriname...sun. june 09th...

xiaodong tang , zhe-min tan1, juan fang , y. qiang sun

sun certified java programmer 1 © cpu, university of...

cloud object storage | store & retrieve data …€¦ ·...

users sleeping time analysis based on micro-blogging data...

stars chapter 30. properties of the sun the sun is the...

lncs 6315 - depth-encoded hough voting for joint...

in the supreme court of guam edward kim, v. min sun cha,

sun-spot: a small programmable object technology - an...

staar grade 8 science administered may 2017 – released ·...

sun™ small programmable object technology (sun spot...

hierarchical selectivity for object-based visual...

min-sun kim curriculum vitae - university of hawaii ·...

tourschedule · sun. june 09th cultural markets € 39 min....

the sun by: kristel curameng and courtney lee. the sun the...

our earth, sun, and moon. the path an object takes as it...

the sun small programmable object technology (sun spot

object category classiﬁcation using occluding...

min-sun kim curriculum vitae - university of hawaii ·...