
Is Motion Planning Overrated?

Jeannette Bohg - Interactive Perception and Robot Learning Lab - Stanford


tl;dr; Yes


tl;dr; It depends

Robot System Design

Dynamic and Uncertain Environment

Real-Time Perception meets Reactive Motion Generation

Inverse Dynamics Controller

Robot

Torques

Joint Sensors

Robot State

Acceleration Policies

Camera

World

Interaction

World Model

Object Tracker

Arm Tracker

Motion Optimizer

Kappler et al. Real-Time Perception meets Reactive Motion Generation. RA-L + ICRA’18. Finalist 2018 Amazon Best Systems Paper

One time step in the system


Raw Sensory Data: 30Hz-1kHz

Processed Sensory Data: 15-30Hz

Local and optimised policies: 30kHz, 5-10Hz

Fused Policy: 30kHz

Robot Controlled at 1kHz

System-Level Evaluation


Sense-Plan-Act

Feedback Control

Full System

Static Pick and Place

Dynamic Pick and Place

Dynamic Grasping

Dynamic Pointing


tl;dr; It depends

Environment Complexity

Success of Feedback Control




Real-Time Perception meets Reactive Motion Generation. Kappler et al. ICRA’18 + RAL.




QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation. Kalashnikov et al. To appear at CoRL ’18.





Motion Planning

Reactive

Take-Home from Systems Paper


Higher complexity - More planning

More uncertainty and dynamics - Online Re-Planning

Controlling through contact

Object

Finger

Goal


Better Models, Better Feedback

Predictive Model

Sensory Observations

Action

Model

Predicted Effect

Model-Predictive Control

Predictive Model

Predicting physical effect

Goal

Keep Predicting

Goal
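The loop suggested by these slides (predict the effect of candidate actions, act, then keep predicting until the goal is reached) can be sketched as a very small model-predictive controller. This is only an illustration under stated assumptions: the 1-D pushing task, the action set, the horizon, and the names `predictive_model`, `true_world`, `plan_first_action` and `run_mpc` are all hypothetical and not taken from the talk. The model is deliberately wrong (it ignores slip) to show how re-planning at every step compensates for model error.

```python
import itertools

GOAL = 1.0
ACTIONS = (-0.1, -0.05, 0.0, 0.05, 0.1)  # hypothetical push displacements per step
HORIZON = 3                               # number of model steps to look ahead


def predictive_model(x, u):
    """Assumed model: the object moves exactly as far as the finger pushes."""
    return x + u


def true_world(x, u):
    """Stand-in for reality: the object slips and moves only 80% as far."""
    return x + 0.8 * u


def plan_first_action(x):
    """Roll the model forward for every action sequence; return the best first action."""
    best_cost, best_action = float("inf"), 0.0
    for seq in itertools.product(ACTIONS, repeat=HORIZON):
        x_pred, cost = x, 0.0
        for u in seq:
            x_pred = predictive_model(x_pred, u)
            cost += abs(x_pred - GOAL)  # accumulated predicted distance to goal
        if cost < best_cost:
            best_cost, best_action = cost, seq[0]
    return best_action


def run_mpc(x0, steps=20):
    """Apply only the first planned action, observe the result, and re-plan."""
    x, trajectory = x0, [x0]
    for _ in range(steps):
        u = plan_first_action(x)
        x = true_world(x, u)
        trajectory.append(x)
    return trajectory


trajectory = run_mpc(0.0)
```

Because only the first action of each plan is executed and the state is re-measured afterwards, the object still reaches the goal even though the model overestimates every push.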

Our Hypothesis

Physics Models + Learning = Generalization

Hybrid Model

Sensory Observations

Learned Model

Parameters

Action

Physics-based Model

Predicted Effect

End-to-End Loss on Effect

Extrapolation
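A minimal sketch of the hybrid idea, assuming a toy problem rather than the planar pushing model used in the talk: a learned model maps an observation to a physical parameter, a physics-based model turns that parameter and the action into a predicted effect, and training uses only a loss on the effect. The sliding-block physics, the single "roughness" feature, the exponential parameterization, and all function names are assumptions made for illustration.

```python
import numpy as np

G = 9.81  # gravitational acceleration in m/s^2


def physics_model(push_speed, friction):
    """Toy analytic model: sliding distance of a block released at push_speed."""
    return push_speed ** 2 / (2.0 * G * friction)


def learned_model(observation, weights):
    """Tiny learned model: maps an observation feature to a positive friction value."""
    return np.exp(weights[0] * observation + weights[1])


def hybrid_predict(observation, push_speed, weights):
    return physics_model(push_speed, learned_model(observation, weights))


def effect_loss(weights, obs, speed, true_effect):
    return np.mean((hybrid_predict(obs, speed, weights) - true_effect) ** 2)


def train(obs, speed, true_effect, lr=2.0, steps=5000):
    """Gradient descent on the effect loss; friction labels are never used."""
    weights = np.zeros(2)
    for _ in range(steps):
        pred = hybrid_predict(obs, speed, weights)
        # d(pred)/d(weights) = -pred * [obs, 1], because pred is proportional to 1/friction
        common = 2.0 * (pred - true_effect) * (-pred)
        weights -= lr * np.array([np.mean(common * obs), np.mean(common)])
    return weights


rng = np.random.default_rng(0)
obs = rng.uniform(0.0, 1.0, 200)     # hypothetical "surface roughness" feature
speed = rng.uniform(0.5, 1.5, 200)   # training push speeds
true_effect = physics_model(speed, np.exp(-1.5 + 1.0 * obs))

initial_loss = effect_loss(np.zeros(2), obs, speed, true_effect)
weights = train(obs, speed, true_effect)
final_loss = effect_loss(weights, obs, speed, true_effect)

# Query a push speed well outside the training range.
extrapolated = hybrid_predict(0.5, 3.0, weights)
expected = physics_model(3.0, np.exp(-1.5 + 0.5))
```

In this toy setting the push speed enters only through the analytic model, so a query at a speed outside the training range is handled by the physics rather than by the learned part.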

Testing Hypothesis on a Case Study

More than a Million Ways to Be Pushed. A High-Fidelity Experimental Dataset of Planar Pushing. Yu et al. IROS 2016.


K. M. Lynch, H. Maekawa, and K. Tanie, “Manipulation and active sensing by pushing using tactile feedback,” in Proc. IEEE/RSJ Int. Conf. Intelligent Robots and Systems, vol. 1, Jul 1992, pp. 416–421

Analytic Model

Stage 1

Stage 2

Compared Architectures

Raw Sensory Observations

Neural Network only: Learned Model, Action, Predicted Effect

Hybrid Model: Learned Model, Parameters, Physics-based Model, Action, Predicted Effect

Error Model: Learned Model, Parameters, Physics-based Model, Action, Predicted Effect + Learned Error

Training: End-to-End

Loss: Error between Predicted and Ground Truth Effect
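The error-model variant adds a learned correction to the output of the physics-based model. A rough sketch, assuming the same kind of toy sliding-block physics (not the planar pushing model from the talk) and a linear error term fitted by least squares in place of a neural network; the friction values, the features, and all names are hypothetical.

```python
import numpy as np

G = 9.81  # gravitational acceleration in m/s^2


def analytic_model(push_speed, friction):
    """Toy analytic model: sliding distance of a block released at push_speed."""
    return push_speed ** 2 / (2.0 * G * friction)


def features(push_speed):
    """Hypothetical inputs of the learned error term: a bias and the push speed."""
    return np.stack([np.ones_like(push_speed), push_speed], axis=1)


rng = np.random.default_rng(1)
speed = rng.uniform(0.5, 1.5, 200)
true_effect = analytic_model(speed, friction=0.3)     # stands in for ground truth
analytic_pred = analytic_model(speed, friction=0.5)   # model with a wrong friction value

# Fit the learned error by least squares on the residual of the analytic model.
error_weights, *_ = np.linalg.lstsq(
    features(speed), true_effect - analytic_pred, rcond=None
)
corrected_pred = analytic_pred + features(speed) @ error_weights

mse_analytic = np.mean((analytic_pred - true_effect) ** 2)
mse_corrected = np.mean((corrected_pred - true_effect) ** 2)
```

On the training data the corrected prediction cannot be worse than the analytic one, since a zero correction is always available to the fit; how such a correction behaves outside the training range is a separate question that this sketch does not address.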

Testing Data Efficiency

Testing Generalization

Alina Kloss et al., “Combining learned and analytical models for predicting action effects,” submitted, 2018. Preprint on arXiv.


New Pushing Angles & Contact Points

New Push Velocities

New Object Shapes

[Figures: training and testing sets for each condition, labelled Interpolation or Extrapolation]

Generalization to new push velocities

[Figures: training and testing results, labelled Extrapolation]

Don’t throw away structure

Learn to extract given state representation from raw data

Physics-based Model

Action


Learned Model


A Concrete Suggestion

Interpretability

Real and Predicted Box Position after 200 identical pushes

Representation interpretable

Compensation for Errors in Analytical Model

Wrong Friction Parameters of Analytical Model

Future Direction

Physics-based Model

Action

Learned Model

Sequence of Sensory Observations

Multistep Prediction

Backpropagation w.r.t. control
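Multistep prediction followed by backpropagation with respect to the controls can be illustrated on a deliberately simple differentiable model. The sketch below assumes hypothetical scalar linear dynamics and a cost on the final state only; the gradient is propagated backwards through the rollout by hand, which is what automatic differentiation would do for a learned or hybrid model.

```python
import numpy as np

A, B = 0.9, 1.0   # hypothetical linear dynamics: x_next = A * x + B * u
GOAL = 1.0


def rollout(x0, controls):
    """Multistep prediction: apply the one-step model repeatedly."""
    states = [x0]
    for u in controls:
        states.append(A * states[-1] + B * u)
    return states


def control_gradient(x0, controls):
    """Backpropagate the final-state cost (x_T - GOAL)^2 to every control."""
    states = rollout(x0, controls)
    adjoint = 2.0 * (states[-1] - GOAL)   # d cost / d x_T
    grad = np.zeros(len(controls))
    for t in reversed(range(len(controls))):
        grad[t] = B * adjoint             # d cost / d u_t
        adjoint = A * adjoint             # d cost / d x_t
    return grad


def optimize_controls(x0, horizon=5, lr=0.1, iterations=100):
    """Plain gradient descent on the control sequence."""
    controls = np.zeros(horizon)
    for _ in range(iterations):
        controls -= lr * control_gradient(x0, controls)
    return controls


controls = optimize_controls(0.0)
final_state = rollout(0.0, controls)[-1]
```

The step size and iteration count are chosen for this toy problem only; a real system would typically re-optimize from the latest state estimate at every control cycle.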

Models will never be perfect

Differentiable Recursive Filtering

Jonschkowski et al. RSS’18

Haarnoja et al. NIPS’16

Karkus et al. arXiv’18

Differentiable Particle Filters: End-to-End Learning with Algorithmic Priors. Jonschkowski et al. RSS’18

Learning Heteroscedastic Noise

On Learning Heteroscedastic Noise Models within Differentiable Filtering. Kloss and Bohg. Submitted to ICLR’19.

Picture adapted from: Differentiable Particle Filters: End-to-End Learning with Algorithmic Priors. Jonschkowski et al. RSS’18



Observation Noise

Process Noise

Four Non-Linear Filters

Extended Kalman Filter (EKF)

Unscented Kalman Filter (UKF)

Monte-Carlo UKF

Particle Filter

Process Model

Observation Model
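To make the role of heteroscedastic noise concrete, here is a sketch of one filter step in which the process noise depends on the current state and the observation noise depends on the current observation. It is an assumption-laden toy: the constant-velocity model, the two noise functions (simple hand-written stand-ins for learned networks), and all names are hypothetical. With the linear models used here the EKF reduces to a standard Kalman filter; for non-linear models `F` and `H` would be replaced by the Jacobians at the current estimate.

```python
import numpy as np

DT = 0.1
F = np.array([[1.0, DT], [0.0, 1.0]])   # constant-velocity process model (linear here)
H = np.array([[1.0, 0.0]])              # observation model: position only


def process_noise(state):
    """Stand-in for a learned network: more process noise at higher speed."""
    scale = 0.01 * (1.0 + abs(state[1]))
    return scale * np.eye(2)


def observation_noise(observation):
    """Stand-in for a learned network: noisier readings far from the origin."""
    return np.array([[0.05 * (1.0 + abs(observation[0]))]])


def ekf_step(mean, cov, observation):
    # Prediction with state-dependent process noise Q.
    mean_pred = F @ mean
    cov_pred = F @ cov @ F.T + process_noise(mean)
    # Update with observation-dependent noise R.
    innovation = observation - H @ mean_pred
    S = H @ cov_pred @ H.T + observation_noise(observation)
    K = cov_pred @ H.T @ np.linalg.inv(S)
    mean_new = mean_pred + K @ innovation
    cov_new = (np.eye(2) - K @ H) @ cov_pred
    return mean_new, cov_new, cov_pred


mean, cov = np.array([0.0, 1.0]), np.eye(2)
mean, cov, cov_pred = ekf_step(mean, cov, observation=np.array([0.2]))
```

Every operation in the step is differentiable, which is what allows the noise functions to be trained through the filter.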

Loss Function

Ground Truth State Sequence

Sequence of State Estimates (Mean and Covariance)

Network Weights

Negative log-likelihood of true state given belief

Estimation Error

Regularization
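The three loss terms named on the slide can be sketched as follows, assuming a Gaussian belief (mean and covariance) at each time step. The relative weights `w_error` and `w_reg`, the squared-error form of the estimation error, and the L2 form of the regularizer are assumptions for illustration; the slides do not specify them.

```python
import numpy as np


def gaussian_nll(true_state, mean, cov):
    """Negative log-likelihood of true_state under the Gaussian belief N(mean, cov)."""
    diff = true_state - mean
    _, logdet = np.linalg.slogdet(cov)
    mahalanobis = diff @ np.linalg.solve(cov, diff)
    return 0.5 * (mahalanobis + logdet + len(diff) * np.log(2.0 * np.pi))


def filter_loss(true_states, means, covs, weights, w_error=1.0, w_reg=1e-3):
    """NLL + estimation error + regularization, averaged over the sequence."""
    nll = np.mean([gaussian_nll(x, m, c) for x, m, c in zip(true_states, means, covs)])
    error = np.mean(np.sum((true_states - means) ** 2, axis=1))
    regularization = np.sum(weights ** 2)
    return nll + w_error * error + w_reg * regularization


# Tiny usage example: a two-step sequence of 2-D states with unit covariances.
true_states = np.zeros((2, 2))
covs = np.stack([np.eye(2), np.eye(2)])
perfect = filter_loss(true_states, np.zeros((2, 2)), covs, weights=np.zeros(3))
biased = filter_loss(true_states, np.ones((2, 2)), covs, weights=np.zeros(3))
```

The likelihood term rewards well-calibrated covariances, while the estimation-error term focuses only on the accuracy of the mean.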

Two Applications

KITTI Visual Odometry Task

Planar Pushing Task

Visual Odometry

Learned Constant Observation Noise

Learned Heteroscedastic Process Noise

Learned Constant Noises Mixed

Planar Pushing

Learned Constant Observation Noise

Learned Heteroscedastic Process Noise

Learned Constant Noises Mixed

Well-Tuned Noise

Badly-Tuned Noise

Take Home

Differentiable EKF is most accurate and robust to inaccurate noise models.

Particle filter gains most from heteroscedastic noise model.

UKF works best for more complex dynamics models.

Uncertainty bounds for long-term predictions.

Conclusions

Success of Feedback Control

Motion Planning

Better Predictive Models, Better Feedback

Thank you for your Attention!

IPRL @ Stanford, AMD @ MPI, CLMC @ USC

http://iprl.stanford.edu

https://am.is.tuebingen.mpg.de/
