swarms of small flying robots · 2020-04-20 · inertial measurement units • accelerometer,...

Swarms of Small Flying Robots

Vijay KumarProfessor and Nemirovsky Family DeanUniversity of Pennsylvania

Department of Mechanical and Aerospace EngineeringNew York University

Dec 2, 2019

Acknowledgements

Spin off companies

Trefo.ai

applications to agriculture

aerial mapping mining, asset mapping

acquired by health, disaster response

Non profit

3

A Brief History of Aerial Robotics

• International Aerial Robotics Competition (1991)• Early work at GRASP (< 2000)

4

There are no new ideas, only good ideas!

George de Bothezat’s Quadrotor, 1922Breguet-Richet Gyroplane No.1, 1907

1907, Paul Cornu: First to Hover?

Wired Magazine, A hundred years of hovering, 2007

Flying windmill, 2007

5


• International Aerial Robotics Competition (1991)• Early work at GRASP (< 2000)

6

Inertial Measurement Units

• Accelerometer, airbag sensors (Analog Devices), 1993

• MEMS gyros for electronic stability control (Bosch), 1997

• 3-axis accelerometers for Nintendo Wii, 2006• 3-axis accelerometer, iPhone (Apple), 2007• 3-axis accelerometer, 3-axis gyro, 3-axis

magnetometer, iPhone 4 (Apple), 2010inflexion point

7


• International Aerial Robotics Competition (1991)• Early work at GRASP• 2008-9 – small multi rotor aircrafts become practical

Ascending Technologies KMel Robotics

Daniel Mellinger and Alex Kushleyev

N. Michael, D. Mellinger, Q. Lindsey, and V. Kumar. The GRASP multiple micro UAV testbed. IEEE Robotics and Automation Magazine, Vol. 17, No. 3. 2010.

Daniel Mellinger

Auton Robot

Lee, T. (2011). Geometric tracking control of the attitude dynamics ofa rigid body on SO(3). In Proc. of the Amer. control conf., SanFrancisco, CA.

Lee, T., Leok, M., & McClamroch, N. H. (2010). Geometric trackingcontrol of a quadrotor UAV on SE(3). In Proc. of the IEEE conf.on decision and control, Atlanta, GA.

Mellinger, D., & Kumar, V. (2011). Minimum snap trajectory genera-tion and control for quadrotors. In Proc. of the IEEE intl. conf. onrobot. and autom., Shanghai, China.

Mellinger, D., Michael, N., & Kumar, V. (2010). Trajectory generationand control for precise aggressive maneuvers with quadrotors. InProc. of the intl. sym. on exp. robot., Delhi, India.

Mesbahi, M. (2005). On state-dependent dynamic graphs and theircontrollability properties. IEEE Transactions on Automatic Con-trol, 50(3), 387–392.

Michael, N., Mellinger, D., Lindsey, Q., & Kumar, V. (2010). TheGRASP multiple micro UAV testbed. IEEE Robotics & Automa-tion Magazine, 17(3), 56–65.

Nieuwstadt, M. J. V., & Murray, R. M. (1998). Real-time trajectorygeneration for differentially flat systems. International Journal ofRobust and Nonlinear Control, 8 (11), 995–1020.

Ogren, P., Fiorelli, E., & Leonard, N. (2002). Formations with a mis-sion: stable coordination of vehicle group maneuvers. In Proc.of intl. sym. on mathematical theory networks and syst., NotreDame, IN.

Olfati-Saber, R., & Murray, R. M. (2002). Distributed cooperative con-trol of multiple vehicle formations using structural potential func-tions. In Proc. of the IFAC world congress, Barcelona, Spain.

Olfati-Saber, R., & Murray, R. M. (2004). Consensus problems in net-works of agents with switching topology and time-delays. IEEETransactions on Automatic Control, 49 (9), 1520–1533.

Shim, D., Kim, H., & Sastry, S. (2003). Decentralized nonlinear modelpredictive control of multiple flying robots. In Decision andcontrol, 2003. Proceedings. 42nd IEEE conference on (Vol. 4,pp. 3621–3626). New York: IEEE.

Tabuada, P., Pappas, G. J., & Lima, P. (2001). Feasible formations ofmulti-agent systems. In Proc. of the Amer. control conf., Arling-ton, VA (pp. 56–61).

Tanner, H., Pappas, G. J., & Kumar, V. (2002). Input-to-state stabilityon formation graphs. In Proc. of the IEEE intl. conf. on robot. andautom., Las Vegas, NV (pp. 2439–2444).

Turpin, M., Michael, N., & Kumar, V. (2011). Trajectory design andcontrol for aggressive formation flight with quadrotors. In Proc.of the intl. sym. of robotics research, Flagstaff, AZ.

Vicsek, T., Czirók, A., Ben-Jacob, E., Cohen, I., & Shochet, O. (1995).Novel type of phase transition in a system of self-driven particles.Physical Review Letters, 75(6), 1226–1229.

Matthew Turpin is a Ph.D. candi-date in the Department of Mechan-ical Engineering and Applied Me-chanics at the University of Penn-sylvania. He works on formation de-sign and control of micro-aerial ve-hicles.

Nathan Michael is a Research As-sistant Professor in the Departmentof Mechanical Engineering and Ap-plied Mechanics at the University ofPennsylvania. He received his Ph.D.in Mechanical Engineering from theUniversity of Pennsylvania in 2008.He works in the areas of dynamics,estimation, and control for groundand aerial robot systems.

Vijay Kumar is the UPS Founda-tion Professor and the Deputy Deanfor Education in the School of En-gineering and Applied Science atthe University of Pennsylvania. Hereceived his Ph.D. in MechanicalEngineering from The Ohio StateUniversity in 1987. He has beenon the Faculty in the Departmentof Mechanical Engineering and Ap-plied Mechanics with a secondaryappointment in the Department ofComputer and Information Scienceat the University of Pennsylvaniasince 1987. He is a Fellow of the

American Society of Mechanical Engineers (ASME) and the Institu-tion of Electrical and Electronic Engineers (IEEE).

Author's personal copy

Alex Kushleyev

Jonathan Fink Nathan Michael

8

Aerial Robotics, 2009

Nathan Michael (CMU)

Jonathan Fink (ARL)

Daniel Mellinger, Nathan Michael, and Vijay Kumar. Trajectory Generation and Control for Precise Aggressive Maneuvers with Quadrotors.International Journal of Robotics Research, Apr. 2012.

9

9

Daniel Mellinger, and Vijay Kumar. “Minimum Snap Trajectory Generation and Control for Quadrotors,” Proc. IEEE International Conference onRobotics and Automation. Shanghai, China, May, 2011.

2012 – Swarm of 75 gm quadrotors

Daniel MellingerAlex Kushleyev

10

11

2015 – Drones everywhere!

12

Drone Racing

Drone Nationals, New York City, 2016

Yash Mulgaonkar

13

Beyond Quadrotors

S. Driessens and P. Pounds, “The triangular quadrotor: a more efficient quadrotor configuration,” 2015. Dario Brescianini and Raffaello

D’Andrea, “An omni-directional multirotor vehicle,” 2018.

Davide Falanga , Kevin Kleber, Stefano Mintchev , Dario Floreano , and Davide Scaramuzza, “The Foldable Drone: A Morphing Quadrotor That Can Squeeze and Fly,” 2019. David Saldana, Bruno Gabrich, Guanrui Li, Mark Yim, and

Vijay Kumar, “ModQuad: The Flying Modular Structure that Self-Assembles in Midair,” 2018.

David Saldana

14

Gartner Hype Cycle

20182017

2019 (10 years later)

Goldman Sachs Research

17%

70%

13%

military

b2b

consumer

Market Agriculture, Mining, Health

SmallSafe

SpeedSmartWe are here!

Swarms

Aerial Robotics Research (and Commercialization)

Autonomy

Size

3000 g

3500 g

75 g

201120 g

2014650 g 2015

740 g

2012

1750 g

2013

1850 g

2011

2016

250 g

2016

rely on external infrastructure

vision + IMU

2-D laser scanner

3-D laser

The FalconFast Aggressive Lightweight flight in CONstrained Environments

Y. Mulgaonkar, A. Makineni, K. Mohta, C.J. Taylor, and V. Kumar, 2016 Hilton Head Workshop

. . . and at high speeds

Search and Rescue

Five Challenges

• Perception Action Loops for Autonomy

• State Estimation

• Navigation in Cluttered Environments

• Scaling Down in Size, Weight

• Perception Action Communication Loops for Swarms

1. Nested Perception/Action Loops

Continuous

DiscretePlan Control

Perception

Plan Control System

Perception

HybridPlan Control

Perception

System

System

JTY Wen and K Kreutz-Delgado, The attitude controlproblem, IEEE Transactions on Automatic control, 1991.D Mellinger and V Kumar, “Minimum Snap TrajectoryGeneration and Control for Quadrotors,” Proc. IEEEInternational Conference on Robotics and Automation.Shanghai, China, May, 2011.T Lee, M Leok, NH McClamroch, “Nonlinear RobustTracking Control of a Quadrotor UAV on SE(3),” AsianJournal of Control 2012.

Nonlinear controllers on SO(3)/SE(3)

2. State Estimation (Stereo + IMU)

K. Sun, K. Mohta, B. Pfrommer, M. Watterson, S. Liu, Y. Mulgaonkar, C. J. Taylor, and V. Kumar, “Robust Stereo Visual Inertial Odometry for Fast Autonomous Flight”, RAL 2018

Augmented State

Model

Stereo Camera Measurement

Orientation

Gyro bias Velocity PositionAcc bias

Cam-IMU extrinsics

Ke Sun

Kartik Mohta

Stereo Multistate Constraint Kalman Filter (S-MCKF)

Fast autonomous flight (Top speed at 18m/s)

Autonomous flight in unstructured environment● Includes various scenes (warehouse, woods,

open field, etc).● Round trip over 700m● Final drift under 0.5%

Multi-Sensor Unscented

KalmanFilter (200 Hz)

velocity

State Estimation

Visual odometry

Visual odometry

Laser odometry

Altitude estimator

DownwardCamera (40Hz)

StereoCamera (40 Hz)

Altimeter (20 Hz)

Laser Scanner(20 Hz)

GPS(10Hz)

Controller(200 Hz)

Shaojie Shen, Yash Mulgaonkar, Nathan Michael and Vijay Kumar, “Multi-Sensor Fusion for Robust Autonomous Flight in Indoor and Outdoor Environments with a Rotorcraft MAV,” Proceedings of IEEE International Conference on Robotics and Automation (ICRA), 2014.

IMU(1000 Hz)

Shaojie Shen (HKUST)

3. Autonomy: Perception and Action for Navigation

Trajectory Generator (20 Hz)

Planner(20 Hz)

GPS (10Hz)

Local Map(40 Hz)

position

Pose Graph SLAM

Multi-Sensor Unscented

KalmanFilter (200 Hz)

velocity

Visual odometry

Visual odometry

Laser odometry

Altitude estimator

DownwardCamera (40Hz)

StereoCamera (40 Hz)

Altimeter (20 Hz)

Laser Scanner(20 Hz)

GPS(10Hz)

Controller(200 Hz)IMU

(1000 Hz)

DARPA Fast Lightweight Autonomy (FLA)

Cameras + IMU

Lidar

Yash Mulgaonkar

3 Planning in Cluttered Environments

– Relative degree 4 (input and state constraints)

– Non convex

– Safe corridors in different homology classes

– Partially known environment (limited field of view sensors)

X goal ⇢ X free

x = Ax(t) +Bu(t), u(t) 2 U , 8t 2 [0, T ]

x(0) = x0, x(T ) 2 X goal, x(t) 2 X free

minu(t), T

J(x(t), u(t)) + ⇢T

1. Optimal Control

Sarah Tang

Sikang Liu

SubhrajitBhattacharya

Mike Watterson

FLA (5-20 m/s)

S. Bhattacharya, M. Likhachev and V. Kumar, “Topological Constraints in Search-based Robot Path Planning.” Autonomous Robots, 33(3):273-290, 2012.S. Liu, M. Watterson, S. Tang, and V. Kumar, “High speed navigation for quadrotors with limited onboard sensing,” Robotics and Automation Letters, 2016.S. Liu, N. Atanasov, K. Mohta, and V. Kumar, “Search-based Motion Planning for Quadrotors using Linear Quadratic Minimum Time Control,” IROS 2017.

Sarah Tang

Sikang Liu

SubhrajitBhattacharya

Mike Watterson

Planning in Cluttered Environments

Minimum snap primitives

Search over induced discretization on state space

Results for different functionals

2. Search-Based Planning with Motion Primitives

S. Liu, N. Atanasov, K. Mohta, V. Kumar, Search-based motion planning for quadrotors using linear quadratic minimum time control, IROS 2017

Sikang Liu

Nikolay Atanasov(UCSD)

Resolution complete but …

p1

M

gpk(tk)

v = 20 m/s, max acceleration 1 g Stopping time ~ 2 sStopping distance ~ 20 m

limited field of view creates challenges

Safety Certificate

p1

M

gpk(tk)

pk+1(tk+1)

g0 k

�k

xk(tk) =

2

664

pkpkpk...p k

3

775 xk+1(tk+1) =

2

664

pk+1

pk+1

pk+1...p k+1

3

775xk+1(tk+1) =

2

664

pk+1

pk+1

pk+1...p k+1

3

775

k

xg0 =

2

664

g0

000

3

775

● Safety● Completeness● Suboptimality

Autonomous Flight in Unknown GPS-Denied Environment (5 m/s)

CJ Taylor

Yash Mulgaonkar

Sikang Liu

Mike Watterson

Ke Sun

Kartik Mohta

Search of Collapsed Buildings

S. S. Shivakumar, K. Mohta, B. Pfrommer, V. Kumar and C. J. Taylor, Guided Semi Global Optimization for Real Time Dense Depth Estimation, ICRA 2019.

Shreyas Shivakumar

CJ Taylor

Sikang Liu

Mike Watterson

Ke Sun

Kartik Mohta

Brandon Duick

Amilcar Cipriano Monica DeGuzman

Jason Derenick

Raghu Balasa Alex Burka

www.exyn.ai

Nader Elm

Pete Furlong Gray Greene Vinutha Kallem Nick Lynch

Billy Sisson James Sui Justin Thomas

Xuchu (Dennis) Ding

Denise Wong

A. Weinstein, A. Cho, and G. Loianno, and V. Kumar, “VIO-Swarm: A Swarm of 250 gram autonomous quadrotors ” ICRA 2018.

4. Light Weight Autonomy

250 gram quadrotor (2018)Qualcomm® Snapdragon Flight™ development board running Snapdragon Navigator™ flight controller and Machine Vision (MV) SDK

GiuseppeLoianno

Shreyas Shivakumar

Ke Sun

Kartik Mohta

CJ Taylor

S. S. Shivakumar, K. Mohta, B. Pfrommer, V. Kumar and C. J. Taylor, Guided Semi Global Optimization for Real Time Dense Depth Estimation, ICRA 2019.

1 kg quadrotor (2018)

Stereo camera synced with Vector NAV IMU, NVDIA Jetson TX2 + FPGA (low-level pixel-wise operations) – OSRF

TOF 3-D camera, 6m range, 100x65 deg, 60 Hz – PMD technologies

2.5 kg quadrotor (2017)Stereo camera synced with Vector NAV IMU, LiDar, Intel i7

42

Robustness to Collisions

250 gram quadrotorQualcomm® Snapdragon Flight™ board with Snapdragon Navigator™ flight controller

133 gram quadrotor capable of sustaining collisionsQualcomm® Snapdragon Flight™ board with forward-facing stereo cameras, a downward facing camera for VIO, onboard WiFi and GPS

TiercelWenxin Liu

Yash Mulgaonkar, Wenxin Liu, Dinesh Thakur, Kostas Daniilidis, Vijay Kumar, The Tiercel: A novel autonomous micro aerial vehicle that can map the environment by flying into obstacles, IEEE Robotics and Automation Letters, submitted (2020)

Yash Mulgaonkar

Fig. 4: Onboad LEDs and drive board mounted on the MAV (left). The lighting payload is equipped with A) LEDiL C16029STRADA-SQ-C lens and B) LEDiL C12727 STRADA-SQ-VSM lens. Cree XP-G3 illumination distribution is used for thelenses. The concentric rings on the test target indicate fields of view of 100� and 120� captured with bottom camera.

Fig. 5: Picture of mock-up PCV viewed from a camera located at X100B (left), Point cloud of test fixture captured viaKAARATA Stencil (center), OctoMap and automatically generated JPS 3D navigation path (right)

widest and most uniform central illumination spot while(LEDiL C12727 STRADA-SQ-VSM [15]) generated themost uniform illumination spot over the entire FOV of thecamera. The Cree LEDs have 125� directional viewing angle.The forward cameras have much narrower FOV of 90�, henceno lens were added.

We have designed and fabricated a custom LED drivecontroller board for use with the MAV. Constant current LEDdrivers are used in the lighting payload. A constant currentdriver generates consistent illumination throughout the flightand protects the LED from thermal runaway if the ambienttemperature or the temperature of the LED increases duringthe flight. The board is equipped with trim pot resistors tocontrol the Lux output of the LEDs. Fig. 4 shows the LEDdrive board with two different lenses and the correspondingillumination distribution and also a picture of the lightingpayload installed on the MAV platform. The lighting payloadadds only 8g to the MAV weight and draws a maximumcurrent of 1.5A at maximum light output of all three LEDs.

B. Waterproof System

Waterproofing the vehicle requires careful considerationof each component. A multi-step conformal coating processis performed on the Snapdragon Flight board, ElectronicSpeed Control (ESC) and the LED driver board. Specifically,

the pins, connectors, and cameras are removed and a firstconformal coating is applied with Arathane 5750-A/B (LV),which is a soft translucent urethane composite designed forinsulating printed circuit boards and electronic components.Once the initial coating process is completed, the connectorsare reattached, and the second coating process is performedusing RTV silicone. To protect the LEDs from the adverseeffects of dripping water, the front PCB surface of the LEDswas coated with RTV silicone. The downward-facing camerais also coated with a hydrophobic coating. Glass coverslipswith a hydrophobic coating were installed on the forward-facing stereo cameras. A waterproof coating is added to thebattery cell, balancing cable, and circuit.

III. AUTONOMOUS NAVIGATION

In this section, we describe the main algorithms enablingthe autonomous capabilities of the aerial vehicle in unknownand unstructured environments.

A. Control and EstimationThe system combines the Inertial Measurement Unit

(IMU) data and the VGA downward camera image data in anExtended Kalman Filter (EKF) framework in order to local-ize the vehicle in the environment [12]. The prediction stepis based on integration of the IMU data. The measurementupdate step is given by the standard 3D landmark perspective

Giuseppe Loianno

Dinesh Thakur

Autonomous Flight in Fukushima Daiichi Reactor Unit 1

Monica Garcia (SWRI), Richard Garcia (SWRI), Wataru Sato (TEPCO)

Giuseppe Loianno

Dinesh Thakur

Laura Jarin-Lipschitz

Wenxin Liu

Elijah Lee

5. Aerial Robot Swarms

Perception Action

• observations inform actions• move for better observations

Perception—Action Loops

5. Aerial Robot Swarms

Perception Action

Communication

• share private perceptions

• observations inform actions• move for better observations

• messages inform actions• move to communicate better

Task

Perception—Action—Communication Loops

• communicate to make better observations

Decentralized Multi-Robot Teams

• Decentralized, correct-by-construction policies available only for very simple cases

– simple communication and sensing models

– edge or cloud computation – point robots

A. Weinstein, A. Cho, and G. Loianno, and V. Kumar, “VIO-Swarm: A Swarm of 250 gram autonomous quadrotors ” ICRA 2018.

Tanner, Pappas and Jadbabaie, 2004Sensor CoverageCortes, 2004

Belta and Kumar, 2005

Aaron Weinstein

Giuseppe Loianno

• Centralized methods are not practical for real-world robot deployments (Turpin, ‘14)

– Partial observability by individual agents

– Limited communication

Distributed Learning: PAC Loops

Perception Action

Communication

Task

Ø Key Ideas:o Learn communication policieso Learn action policieso Learn planning policies

Jimmy Paulos

Ekaterina Tolstaya

Arbaaz Khan

Steven Chen

(with Professor Alejandro Ribeiro)

Dinesh Thakur


Graph Neural Networks

• Robots act on relative position and velocity information– Must stay close to each other– Must avoid collisions

– Must “align” themselves

ChoiceofParametrization

IThereisoverwhelmingempiricalandtheoreticaljustificationtochooseaneuralnetwork

IChallengeiswewanttolearnoverthisIAndwearegoodatlearningoverthis

ITheNeuralNetworksthatweknowgeneralizewellareConvolutionalNeuralNetworks

IWeknowhowtolearnovertimeandimages.Wedon’tknowhowtolearnoveragraph.

A.RibeiroLearningDecentralizedControllersforRobotSwarmswithGraphNeuralNetworks6/28

CNNs GNNs

Aggregate information at each node from neighboring node using graph adjacency properties

49

Aggregate over belief states of neighbors

z =KX

k=0

hkSkx(k) = H(S)x

<latexit sha1_base64="SY4t/wzZstDd8eWBFh8xN2m9gdM=">AAACOnicbVDLSsNAFJ34rPVVdelmsAjtpiQq6KZQdFNw06J9QNOGyXTSDpk8mJmINeS73PgV7ly4caGIWz/ASZuFth4YOJxzL3fOsUNGhdT1F21peWV1bT23kd/c2t7ZLeztt0UQcUxaOGAB79pIEEZ90pJUMtINOUGezUjHdq9Sv3NHuKCBfysnIel7aORTh2IklWQVmqaH5Nh24ocEVqEpIs+K3aqeDOLrBI4tF8am7cCbZOBOyb0ySm45qc70elLK/HJmw7xVKOoVfQq4SIyMFEGGhlV4NocBjjziS8yQED1DD2U/RlxSzEiSNyNBQoRdNCI9RX3kEdGPp9ETeKyUIXQCrp4v4VT9vREjT4iJZ6vJNKiY91LxP68XSeeiH1M/jCTx8eyQEzEoA5j2CIeUEyzZRBGEOVV/hXiMOMJStZ2WYMxHXiTtk4pxWtGbZ8XaZVZHDhyCI1ACBjgHNVAHDdACGDyCV/AOPrQn7U371L5mo0tatnMA/kD7/gEFlaxE</latexit>

50

FlockingGNN-learned control

policy (K=3)

(a) Average difference in velocities (b) Average minimum distance to a neighbor

(c) Flock positions using the GNN (d) Flock positions using the local controller

Figure 1. The GNN (K = 3) maintains a cohesive flock, while the local controller allows the flock to scatter.

using 400 trajectories, each of length 200 steps total. For testing, 20 trajectories of length 200 wereobserved by using the learned controller only.

In practice, following the optimal policy to collect training data results in a distribution of statesthat is not representative of those seen at test time. To resolve this we use the Dataset Aggregation(DAgger) algorithm and follow the learner’s policy instead of the expert’s with probability 1�bwhen collecting training trajectories [33]. The probability b of choosing the expert action whiletraining is decayed by a factor of 0.993 after each trajectory to a minimum of 0.5.

5 Results

We report results comparing (11) and (10) for point masses with fully controllable accelerations inSection 5.1. This simple setting allows for an exploration of the effect of different system parameterssuch as initial velocity or communication radius, to determine experimentally the scenarios on whichthe aggregation GNN offers good performance. In Section 5.2 we study the case of transfer learning,where we train the model in one network but test it in another (for example, with different numberof agents), and also by exporting the trained architecture to other physical models beyond the point-mass model, as shown in the AirSim simulator (Sec. 7.2).

5.1 Learning to flock with point masses

First, we compare the performance of the GNN controller with K = 3 to the local controller u†i

(11). Figure 1a depicts the magnitude of velocity differences between agents over the course of atrajectory in terms of the population mean and standard devision. The GNN converges much morerapidly and, unlike the local controller, approaches a perfect velocity consensus. Part of the reasonfor this is explained in Figure 1b, which plots agents’ minimum distance to any neighbor overtime. The GNN control approaches a uniform flock spacing, but the local controller fails to stopagents from dispersing quickly enough. Soon the local controller’s network has become completelydisconnected as agent distances exceed their communication range of R = 1. One flock trajectory isdepicted for the GNN controller in Fig. 1c and the local controller in Fig. 1d. Each diagram showsthe initial agent positions and velocities at time n = 0 and then at n = 300, qualitatively illustratingthe stable flocking of the GNN and failure of the local controller.

6

Tanner, Pappas and Jadbabaie, 2004

Graph Policy Gradients

• Train GNNs on a small number of robots– Information from k-hop neighbors is aggregated by each robot

– Local controllers are learned by each robot– Centralized reward used to train the robots

• Extend to swarms with larger numbers– Transfer of policies to larger groups with similar “local” graph properties

Arbaaz Khan

Arbaaz Khan, Vijay Kumar, and Alejandro Ribeiro, Graph Policy Gradients for Large Scale Unlabeled Motion Planning with Constraints , IEEE International Conference on Robotics and Automation, submitted (2020)

Graph Policy Gradients for Large Scale Formation Control

53

Conclusion

• Autonomy using smartphone grade processors/sensors

• 10x improvement in performance/price • Applications to search and rescue and precision

agriculture• Integration of model-based and data-driven methods

AI 1.0 AI 2.0 AI 3.0

Ed Steager Dinesh Thakur


Ty NguyenYash Mulgaonkar

Avi Cohen

Ekaterina Tolstaya

Monroe Kennedy

Ke Sun

Jim Keller

Mickey Whitzer

Sambeeta Das

Chao Qu

Jnaneshwar DasElizabeth Beattie

Sarah Tang

Kelsey SaulnierTolga Özaslan

Kartik MohtaRebecca Li

James SvachaRattanachai Ramaithitima

Steven Chen

Giuseppe Loianno

http://kumar.grasp.upenn.edu/

Mike Watterson

Luis Guerrero

Arbaaz Khan Elijah Lee Sikang Liu Wenxin Liu

Ian Miller

Dan Mox

Daigo ShishikaDavid Saldana Aaron Weinstein

Jimmy Paulos

http://kumar.grasp.upenn.edu/

swarms of small flying robots · 2020-04-20 · inertial measurement units • accelerometer,...

Documents