september 23, modeling of gradient-based controllers ii
DESCRIPTION
Multi-Robot SystemsTRANSCRIPT
![Page 1: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/1.jpg)
Multi-Robot Systems
CSCI 7000-006Wednesday, September 23, 2009
Nikolaus Correll
![Page 2: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/2.jpg)
So far (Modeling)
• Deterministic models for deliberative systems• Gradient-based controllers for reactive
systems– Generating controllers by performing gradient
descent on a cost-function– From global to local optimization problems using
Voronoi partitions
![Page 3: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/3.jpg)
Today
• More gradient-based control: – Shape formation– Flocking
• Introduction to hybrid systems
![Page 4: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/4.jpg)
Gradient-based control
• Convergence to minimal sets of a cost function over robot positions
• Minimal sets can also be shapes or isocontours
• Minimal sets can also be temporary and local
![Page 5: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/5.jpg)
Gradient-based approach for shape formation
• Goal: distribute all robots along a 2D curve
• Applications: construction, perimeter surveillance
• “Minimum Set” given by an implicit function s(x,y)=0 on a 3D surfaceL. Chaimowicz, Michael, N., and V. Kumar, "Controlling Swarms of Robots Using Interpolated Implicit Functions" Proceedings of the 2005 IEEE International Conference on Robotics and Automation, pp. 2498-2503, Barcelona, Spain, April 2005.
![Page 6: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/6.jpg)
Shape formation: Controller
• Let f be a suitable convex function with the desired shape as isocontour with value 0
• Let qi=[xi,yi] be the robot position
• Let vi=qi’ be the robot speed and ui=vi’ its acceleration
• Let Fc and Fr be forces repelling robots from each other
![Page 7: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/7.jpg)
Stability
• Lyapunov candidate V(q,q’)>=0
• V(q,q’)<0• Course Question: what did we not prove?
![Page 8: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/8.jpg)
Problems
• What about the repulsive terms?
• What about too few robots?• What about too many
robots?• Further reading
M. A. Hsieh, V. Kumar and L. Chaimowicz. Decentralized Controllers for Shape Generation with Robotic Swarms. Robotica, Vol. 26, Issue 5, September 2008, pp 691-701.
![Page 9: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/9.jpg)
Shape generation
• f could be a sum of Radial Basis Functions given a set of constraint points
• Constraint• RBF i is centered around pi
• Find set of weights wi so that all constraints are satisfied
![Page 10: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/10.jpg)
From theory to practice• Simulation
– Robots get stuck in local minima– Unreachable shapes (inside of letter P, e.g.) depending on initial
position• Real robots
– No local range and bearing– Constraints non-holonomic
![Page 11: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/11.jpg)
Example: Herding/Flocking
• Agents are attracted to their neighbors
• Agents are repelled by their neighbors
• Agents move voluntarily (random or informed)
![Page 12: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/12.jpg)
Model
• Kinematic model:
€
x i = (x,y, ˙ x , ˙ y )Artificial Potential field
Agent-to-agent forceRandom noise
M. Schwager, C. Detweiler, I. Vasilescu, D. M. Anderson, D. Rus - Data-Driven Identification of Group Dynamics for Motion Prediction and Control, Journal of Field Robotics 25(6-7):305-324, 2008.
![Page 13: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/13.jpg)
What can you do with this model?
• Numerical simulation– Initialize positions– Calculate agent-to-agent interaction forces
between all agents– Update positions
• Gradient controller?– Yes! Only speed is updated– Can we formulate this as a
cost function?
![Page 14: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/14.jpg)
Generalized Coverage Control
• Cost to service point q in Q:• New: Team-based cost
• Mixing function: encodes collaboration
• New cost function
Q
M. Schwager, A Gradient Optimization Approach to Adaptive Multi-Robot Control, Ph.D. Thesis, Massachusetts Institute of Technology, Department of Mechanical Engineering, September, 2009.
![Page 15: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/15.jpg)
Properties of the Mixing function
• Tells how information from different robots should be combined to sense at q
• Course question: What happens for
€
α → −∞
![Page 16: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/16.jpg)
Mixing function
• For
• Results in standard Voronoi cost function (Monday)
€
α → −∞
€
minf i
![Page 17: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/17.jpg)
Mixing function
• Let
• Cost function
• Result:
Q
![Page 18: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/18.jpg)
From generalized coverage to flocking
• Cost function
• Let agent-to-agent force be
• Take gradient:
![Page 19: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/19.jpg)
Hybrid Systems
• So far: all robots behave according to the same dynamical system
• Hybrid systems: robot dynamics are a function of discrete states
X’=f1(X) X’=f2(X)
Logic
Logic
![Page 20: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/20.jpg)
Example: Cow Herding
• Continuous part*– Artificial potential field:
• Far-field attraction• Near-field repulsion
– Gaussian noise added to force estimates
• Discrete part– Cows can be in two states: Grazing
and Stressed.– Different potential fields for each
state*M. Schwager, C. Detweiler, I. Vasilescu, D. Anderson, and D. Rus, “Data-driven identification of group dynamics for motion prediction and control,” Journal of Field Robotics, 2008.
![Page 21: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/21.jpg)
Behavioral Hypothesis
We theoretically study the influence of two potential social effects:
1. Animals tend to aggregate more when under stress due to a stimulus
2. Stress propagates within the herd [Butler, 2006]
R
R
These hypotheses are implemented in a hybrid dynamical model and tested in simulation.
![Page 22: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/22.jpg)
System Description• Cows and Environment
– Hereford and Hereford x Brangus
– USDA experimental range, 466ha paddock
• Sensors– GPS– Accelerometer
• Communication– 900Mhz radio
• Actuators– Stereo headphones– Electrical stimulation
![Page 23: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/23.jpg)
Formal descriptionState-space of agent i
State transition probabilities
Stress propagationControl input (stimulus)
Artificial Potential field
Agent-to-agent forceRandom noise
R4
![Page 24: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/24.jpg)
Simulation Environment
• Dynamical simulation• Experiment
– Initial condition: N cows grazing inside a circular fence of 25m diameter (random distribution)
– Fence moves northwards with constant 20m/h (open loop)
– After 5h simulated time the experiment is stopped
• Investigate different values for a and R
• Speed-up of about x15 between real experiment and dynamical simulation
![Page 25: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/25.jpg)
Sample Result: Impact of Increased Gregarious Behavior during Stress
For constant stimulus, a(x=S)>a(x=G) necessary condition for aggregation to work
R= 0 mR= 5 m
R= 10 m50 simulations per data point
![Page 26: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/26.jpg)
Sample Result: Impact of Stress Propagation
R= 0 mR= 5 m
R= 10 m
Success: >50% of population within fence
Moderate stress propagation increases control performance, but potentially leads to instable systems
![Page 27: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/27.jpg)
Hybrid Systems
• Analysis of individual dynamics, but unclear what state the other robots are in
• Analysis of discrete dynamics, e.g. Markov chain
• Verification using numerical tools
• OverviewGoebel, Rafal; Sanfelice, Ricardo G.; Teel, Andrew R. (2009), "Hybrid dynamical systems", IEEE Control Systems Magazine 29 (2): 28–93
![Page 28: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/28.jpg)
Summary
• Gradient descent approaches are a versatile tool for– Shape formation– Flocking– Coverage
• Community is moving unified theory for controller analysis and synthesis
• Analysis of discrete-continuous systems still in its infancy
![Page 29: September 23, Modeling of Gradient-Based Controllers II](https://reader033.vdocuments.us/reader033/viewer/2022051609/547a48d3b4af9f4b688b4595/html5/thumbnails/29.jpg)
Next Week
• Discussion of course projects– “develop”, “study”, “explore” are all words that
should NOT be in your research objective– formulate a hypothesis that leads to your method
• Probabilistic Models for reactive and deliberative systems
• Assignment of teams