rgb-d mapping: using depth cameras for dense 3d modeling of indoor environments peter henry 1,...

23
RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1 , Michael Krainin 1 , Evan Herbst 1 , Xiaofeng Ren 2 , and Dieter Fox 1,2 1 University of Washington Computer Science and Engineering 2 Intel Labs Seattle 1

Upload: angelina-anderson

Post on 13-Dec-2015

214 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

1

RGB-D Mapping:Using Depth Cameras for Dense 3D Modeling of

Indoor Environments

Peter Henry1, Michael Krainin1, Evan Herbst1,Xiaofeng Ren2, and Dieter Fox1,2

1University of Washington Computer Science and Engineering2Intel Labs Seattle

Page 2: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

2

RGB-D Data

Page 3: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

3

Page 4: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

4

Related Work SLAM

[Davison et al, PAMI 2007] (monocular) [Konolige et al, IJR 2010] (stereo) [Pollefeys et al, IJCV 2007] (multi-view stereo) [Borrmann et al, RAS 2008] (3D laser) [May et al, JFR 2009] (ToF sensor)

Loop Closure Detection [Nister et al, CVPR 2006] [Paul et al, ICRA 2010]

Photo collections [Snavely et al, SIGGRAPH 2006] [Furukawa et al, ICCV 2009]

Page 5: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

5

System Overview

1. Frame-to-frame alignment

2. Loop closure detection and global consistency

3. Map representation

Page 6: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

6

Alignment (3D SIFT RANSAC)

SIFT features (from image) in 3D (from depth)

RANSAC alignment requires no initial estimate

Requires sufficient matched features

Page 7: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

7

RANSAC Failure Case

Page 8: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

8

Alignment (ICP)

Does not need visual features or color

Requires sufficient geometry and initial estimate

Page 9: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

9

ICP Failure Case

Page 10: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

10

Joint Optimization (RGBD-ICP)

Fixed feature matches Dense point-to-plane

Page 11: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

11

Loop ClosureSequential alignments accumulate error

Revisiting a previous location results in an inconsistent map

Page 12: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

12

Loop Closure DetectionDetect by running RANSAC against previous

frames

Pre-filter options (for efficiency):Only a subset of frames (keyframes)

Every nth frameNew Keyframe when RANSAC fails directly against

previous keyframeOnly keyframes with similar estimated 3D posePlace recognition using vocabulary tree

Page 13: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

13

Loop Closure CorrectionTORO [Grisetti 2007]:

Constraints between camera locations in pose graph

Maximum likelihood global camera poses

Page 14: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

14

Page 15: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

16

Map Representation: Surfels

Surface Elements [Pfister 2000, Weise 2009, Krainin 2010]

Circular surface patches

Accumulate color / orientation / size information

Incremental, independent updates

Incorporate occlusion reasoning

750 million points reduced to 9 million surfels

Page 16: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

17

Page 17: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

18

Experiment 1: RGBD-ICP16 locations marked in environment (~7m

apart)

Camera on tripod carried between and placed at markers

Error between ground truth distance and the accumulated distance from frame-to-frame alignment

α weight 0.0 (ICP) 0.2 0.5 0.8 1.0 (RANSAC)

Mean error (m)

0.22 0.19

0.16

0.18

1.29

Page 18: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

19

Experiment 2: Robot ArmCamera held in WAM arm for ground truth pose

Method

Mean Translation

al Error (m)

Mean Angular

Error (deg)

RANSAC .168 4.5

ICP .144 3.9

RGBD-ICP (α=0.5) .123 3.3

RGBD-ICP + TORO

.069 2.8

Page 19: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

20

Map Accuracy: Architectural Drawing

Page 20: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

21

Map Accuracy: 2D Laser Map

Page 21: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

22

ConclusionKinect-style depth cameras have recently

become available as consumer products

RGB-D Mapping can generate rich 3D maps using these cameras

RGBD-ICP combines visual and shape information for robust frame-to-frame alignment

Global consistency achieved via loop closure detection and optimization (RANSAC, TORO, SBA)

Surfels provide a compact map representation

Page 22: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

23

Open QuestionsWhich are the best features to use?

How to find more loop closure constraints between frames?

What is the right representation (point clouds, surfels, meshes, volumetric, geometric primitives, objects)?

How to generate increasingly photorealistic maps?

Autonomous exploration for map completeness?

Can we use these rich 3D maps for semantic mapping?

Page 23: RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1, Michael Krainin 1, Evan Herbst 1, Xiaofeng Ren 2, and Dieter

24

Thank You!