s7348: deep learning in ford's autonomous...

S7348: Deep Learning in Ford's Autonomous Vehicles

Bryan Goodman

Argo AI

9 May 2017

Today: examples from• Stereo image processing

• Object detection• Using RNN’s

• Motorsports

Ford’s 12 Year History in Autonomous Driving

Stereo Matching Problem

• Determining the correspondences in stereo images

• Calculating the disparities

• But what is the correct correspondence?

• Basic stereo matching algorithm− Compare pixels on the same

epipolar line in two images

− Choose the best match

Deep neural networks for stereo matching

• The brain can estimate the distance of an object using the visual information from two eyes.

• We can use deep neural networks

Right Stereo Camera

Deep Convolutional Neural Networks

Post-Processing

Left Stereo Camera

Distance Map Estimation

Proposed deep convolutional neural network• AV driving requires an intelligent distance map estimation, which filters out the

objects not of interest.• Network I

− General network

− Encoding and decoding layers

− Retain objects of interest in the training data sets

nv1 Conv5

Encoder Decoder

Loss Function

Proposed deep convolutional neural network II

− Specialized network

− Encoding and decoding layers

− The cross correlation layers force the network to look for correspondence on the epipolar line

− The weights in the encoding layers are shared

Conv4L

Encoder Decoder

Conv4R

Proposed deep convolutional neural network

• Cross correlation (CC) layer− Computes CC values between each pairs of

patches

− Outputs the CC values for each pair of patches

− Does not lose any information

• Loss function− In AV driving, closer objects are more important

than distant ones

− Assigns more weight to the closer objects

− The closer object distance is estimated more accurately

0.2 0.4 0.6 0.8 1

Performance on synthetic and real stereo data

• Synthetic data generation− Generate 14,000 pairs of RGB stereo images

− Synthetic distance maps are only generated for the objects of interest, e.g. cars or pedestrians

− Gaussian noise added to the stereo images

Performance on synthetic and real stereo data• Fine tuning with LIDAR data sets

− Project LIDAR point clouds onto the camera images

− The baseline and optic axes are not the same as the synthetic data

Left camera Right camera Network I Network II

Comparing Manual Annotation to DNN Model

Detection Result Original Image Enhanced Contrast

Network’s detection outperforms human labelerin low-contrast areas

Pedestrian detection Pedestrian misdetection Detected, but not labeled

Introducing Recurrence in Detection and Tracking

• Use RNN’s to detect occluded objects• Remember location of static objects

• Predict location of non-static objects

Image 0

FeatureMap

RNN Conv

Image 1

FeatureMap

Image 2

FeatureMap

RNN Conv RNN Conv

Detector Detector Detector

Orange = ground truth; Green = model prediction

Classifying NASCAR images

The Ford team reviews pictures during the race

Looking for damage and other performance indicators

Results –Boxing the Cars

Using ~2k images labeled

with boxes around the

vehicles, the model does

well detecting cars

Results –Boxing the Cars

Next –determine car

number:labeled ~30k

images

Outliers easy to find in review

Human: ???Model: 78

Confidence: 0.999

Human: ???Model: 42

Confidence: 0.985

Inspecting the Neural Network

Activated Filter Input Image

The Model is not a black box. We can see that it is detecting the numbers – important for robustness when the paint changes

Argo AI

• Argo AI is an artificial intelligence company, established to tackle one of the most challenging applications in computer science, robotics and artificial intelligence: self-driving vehicles

• Engineering hubs in Pittsburgh, Southeastern Michigan and the Bay Area of California

• For more information regarding Argo AI and its work, please talk to me at GTC or visit: www.argo.ai

s7348: deep learning in ford's autonomous...

Documents

autonomous blimp control using deep reinforcement learning

high-speed autonomous drifting with deep reinforcement

haer ford's richmond assembly plant

ford's marketing analysis

ford's most monumental moments

desmond ford's visual resume

development of deep learning autonomous car using raspberry...

mrs. ford's acrostic name poems

autonomous driving: a multi-objective deep reinforcement

deep learning for autonomous car driving

deep learning towards autonomous ship navigation and

s7348: deep learning in ford's autonomous vehicles ·...

ford's colony open house

deep learning-based autonomous scanning electron microscope

agile autonomous driving using end-to-end deep...

deep predictive autonomous driving using multi-agent joint

ford's theater powerpoint presentation

thomas - ford's theatre

deep learning lane detection for autonomous vehicle...

autonomous software-defined radio receivers for deep space