robot vision for the visually impaired

Robot Vision for the Visually ImpairedVivek Pradeep, Gerard Medioni, James Weiland

presented byPhongsathorn Eakamongul

Department of Computer ScienceAsian Institute of Technology

2010, December 7

Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 1 / 18

Outline

1 Abstracts

2 System Description

3 Result

Abstracts

head-mounted : wide-field information compare to shoulder or waist-mounteddesign in literature which require body rotations

stereo-vision

navigational assistance device

visual odometry : dense 3D with 2D elevation grids

metric-topological SLAM

build vicinity map

3D traversability analysis to steer subjects away from obstacles in the path

use microvibration motors provides cues for taking evasive action : they use tactilecues instead of audio since the latter impose greater cognitive load on the subject,and blind users rely on hearing to perform a wide variety of other tasks

experiment running at 10 Hz

Abstracts

stereo-vision

build vicinity map

Abstracts

stereo-vision

build vicinity map

Abstracts

stereo-vision

build vicinity map

Abstracts

stereo-vision

build vicinity map

Abstracts

stereo-vision

build vicinity map

Abstracts

stereo-vision

build vicinity map

Abstracts

stereo-vision

build vicinity map

Abstracts

stereo-vision

build vicinity map

Abstracts

stereo-vision

build vicinity map

Introduction

visual impairment : need long cane or guide dog

In US, 109,000 people : use long canes, 7,000 use dog guides

only 1,500 graduate from dog-guid user program

Electronic travel aids (ETAs), leveraging ultrasonic, laser, or vision sensors

Introduction

wearable array of microvibration motors provides a tactile cuesand guide user along safe path

Outline

1 Abstracts

3 Result

Online SLAM + obstacle detection

Stereo Vision Odometry

matched correspondences across (P t−1L ,P t−1

R ,P tL) or (P t−1

L ,P t−1R ,P t

R) can becomputed using three-point algorithm in RANSAC setting

for robustness, features matching and reprojection errors are measured acrossfour views

Sparse Bundle Adjustment

feature covariances can be propagated to get motion uncertainty for use in theSLAM filter

L ,P t−1R ,P t

Rao-Blackwellised particle filter (RBPF) in FastSLAM framework

which use KLT and SIFT trackingconstruct 2 maps

SLAM map : collection of sparse landmarks that propagated every frame to yieldconsistent camera pose estimates, for SLAM purpose onlytraversability map : dense 3D cloud from triangulation

Metric-Topological SLAM

serveral thousands of landmarks environmenttwo levels of environment representation

local, metric (submap) : estimates state informationsix dimensional camera trajectory st

sparse map mtfeature observations (KLT/SIFT) z t

camera motion estimates ut

RBPFp(st ,mt |z t , ut ) ≈ p(st |z t , ut )

∏i p(mt (i)|st , z t , ut )

mt (i) : ith landmark in the map represented by N(µi , σi )each time feature is observed, the corresponding lankmark is updated using EKFRBPF enables us to only update the observed landmark instead of the whole map

global topologicalmap is represents as a collection of submap

annotated graphG = (i Mi∈Ωt ,

baΛa,b∈Ωt )

i M : annotated submapsΩt : set of computed submapsbaΛ : coordinate transformations between adjacent maps

baΛa,b∈Ωt )

Traversability Map

5 radius sphere

multi-surface elevation map : point cloud is quantized into 2D grid

Traversability Map

5 radius sphere

Traversability Map

5 radius sphere

Prediction Motion and Cue Generation

if magnitude of translation respect to previous position exceeds certain threshold,the direction of motion and reference position are updated

little translation -> no update

cue generation : most continuous traversable path ( Green color in picture )

Outline

1 Abstracts

3 Result

Result

Green : travesibleRed : not

error of camera frame-to-frame heading (yaw), when compared withreadings from a commercially Inertial Measurement Unit (IMU)

camera motion : slow (< 5 degree/s), medium (5-20 degree/s), fast (20-30 degree/s)

SLAM result

Traversability Map

one frame exppatch that has thickness > 30 cm is labeled as vertical5 horizontal patches is labeled as traversable

Experiment

Manually generate cues : wireless remote control

Autonomous generate cues, like group 4

robot vision for the visually impaired

Technology

d traversability analysis

tactile cues

wide variety

waistmounted design

wideeld information

microvibration motors

evasive action

greater cognitive load