2014-designing collective behavior in a termite-inspired robot construction team-(science)

5
Designing Collective Behavior in a Termite-Inspired Robot Construction Team Justin Werfel, 1 * Kirstin Petersen, 1,2 Radhika Nagpal 1,2 Complex systems are characterized by many independent components whose low-level actions produce collective high-level results. Predicting high-level results given low-level rules is a key open challenge; the inverse problem, finding low-level rules that give specific outcomes. We present a multi-agent construction system inspired by mound-building termites, solving such an inverse problem. A user specifies a desired structure, and the system automatically generates low-level rules for independent climbing robots that guarantee production of that structure. Robots use only local sensing and coordinate their activity via the shared environment. We demonstrate the approach via a physical realization with three autonomous climbing robots limited to onboard sensing. This work advances the aim of engineering complex systems that achieve specific human-designed goals. I n contrast to the careful preplanning and regi- mentation that characterize human construc- tion projects, animals that build in groups do so in a reactive and decentralized way. The most striking examples are mound-building termites, colonies of which comprise millions of indepen- dently behaving insects that build intricate struc- tures orders of magnitude larger than themselves (1, 2) (Fig. 1, A and B). These natural systems inspire us to envision artificial ones operating via similar principles (3, 4), with independent agents 1 Wyss Institute for Biologically Inspired Engineering, Harvard Uni- versity, Cambridge, MA 02138, USA. 2 School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138, USA. *Corresponding author. E-mail: [email protected] 14 FEBRUARY 2014 VOL 343 SCIENCE www.sciencemag.org 754 REPORTS

Upload: deon-botha

Post on 05-May-2017

219 views

Category:

Documents


1 download

TRANSCRIPT

Designing Collective Behaviorin a Termite-Inspired RobotConstruction TeamJustin Werfel,1* Kirstin Petersen,1,2 Radhika Nagpal1,2

Complex systems are characterized by many independent components whose low-level actions producecollective high-level results. Predicting high-level results given low-level rules is a key open challenge;the inverse problem, finding low-level rules that give specific outcomes. We present a multi-agentconstruction system inspired by mound-building termites, solving such an inverse problem. A userspecifies a desired structure, and the system automatically generates low-level rules for independentclimbing robots that guarantee production of that structure. Robots use only local sensing andcoordinate their activity via the shared environment. We demonstrate the approach via a physicalrealization with three autonomous climbing robots limited to onboard sensing. This work advances theaim of engineering complex systems that achieve specific human-designed goals.

In contrast to the careful preplanning and regi-mentation that characterize human construc-tion projects, animals that build in groups do

so in a reactive and decentralized way. The moststriking examples are mound-building termites,colonies of which comprise millions of indepen-dently behaving insects that build intricate struc-tures orders of magnitude larger than themselves(1, 2) (Fig. 1, A and B). These natural systemsinspire us to envision artificial ones operating viasimilar principles (3, 4), with independent agents

1Wyss Institute for Biologically Inspired Engineering, Harvard Uni-versity, Cambridge, MA 02138, USA. 2School of Engineering andApplied Sciences, Harvard University, Cambridge,MA 02138, USA.

*Corresponding author. E-mail: [email protected]

14 FEBRUARY 2014 VOL 343 SCIENCE www.sciencemag.org754

REPORTS

acting together to build elaborate large-scalestructures, guided by reacting to the local situa-tions they encounter. Such systems could enableconstruction in settings where human presenceis dangerous or problematic, as in disaster areasor extraterrestrial environments.

Engineering an automated construction sys-tem that operates by termite-like principles ratherthan human-like ones requires an ability to designcomplex systems with desired collective behavior(e.g., producing a particular user-specified build-ing). The hallmark of complex systems of inde-pendent agents (5–7) is unexpected collectivebehavior that emerges from their joint actions,not readily predictable from knowledge of agentrules. If a specific collective behavior is desired,no method in general is known to find agent rulesthat will produce it.

We present a decentralized multi-agent sys-tem for automated construction of user-specifiedstructures, thereby providing a solution to such aproblem of complex system design. An arbitrarynumber of independent robots follow an iden-tical set of simple, local rules that collectivelyproduce a specific structure requested by a user(Fig. 1, C and D). The rules are automaticallygenerated from a high-level representation of thefinal target structure and provide provable guar-antees of correct completion of that structure. Thechallenges associated with engineering a complexsystem are addressed by using principles drawnfrom social insects—in particular, indirect coor-dination through manipulation and sensing of ashared environment (stigmergy), and behavioralregularities that constrain the space of possibleoutcomes—which together make analysis andexecution tractable. We first present the theoreticalfoundation for this work, followed by a physical

implementation with three independent robotsdemonstrating autonomous construction.

The independence of individual robots standsin contrast to other work on automating construc-tion with single (8–10) or multiple (11–14) robotswith centralized sensing and/or control. Central-ized systems that provide a global computingauthority and/or precise positioning informationduring run time, in settings where such featuresare feasible, can have advantages in aspects suchas efficiency and run-time flexibility. Conversely,decentralization provides advantages including op-portunities for greater scalability (no coordinatingauthority that can become overloaded) and robust-ness (no single point of failure).

We distinguish between two types of build-ing processes (Fig. 2). A system may produce apredetermined outcome, in which many possiblesystem trajectories all lead to the same guaran-teed final state. Alternatively, variation during theprocess may lead to a variable outcome, in whichthe final state is determined during the course ofconstruction and can change if the process isrerun. In the context of human construction, sin-gle buildings are built via the first type of pro-cess, in which the order of operations might varybut the final result always matches a blueprint;cities develop via the second type of process, inwhich choices are contingent on previous deci-sions such that many distinct results are possible.Here, we focus on designing processes with fixedoutcomes, but also show how our system can beused to generate structures that vary each timerobots construct them.

Our system design is motivated by the goalof relatively simple, independent robots withlimited capabilities (15), able to autonomouslybuild a large class of nontrivial structures using

a single type of prefabricated building material(solid “bricks”). We require a robot to be able tomove forward, move back, and turn in place;climb up or down a step the height of one brick;and pick up one brick, move while carrying it,and attach it directly in front of itself at its ownlevel. Robots can build staircases of bricks toclimb to higher levels. Robots are limited to localsensing, able to perceive only bricks and otherrobots in their immediate vicinity. Informationabout the current state of the overall structure andthe actions of more distant robots is not avail-able.Robots obtain information aboutwhere brickshave been attached only through direct inspec-tion; after they leave an area, this information isliable to become outdated as other robots modifythe structure. The structure, built from square bricksin a nonoverlapping grid pattern, provides a ref-erence that robots can use to keep track of theirrelative movement around it. A single “seed”brick, the initiation point from which the contig-uous structure is built, provides a unique landmark.

We take an approach derived from the classi-cally insect-inspired notion of stigmergy (2, 3, 16),in which, instead of any explicit broadcast or one-to-one communication between agents, all com-munication is implicit via the joint manipulationof a shared environment. In particular, we focuson qualitative stigmergy (2) in which actions aretriggered by qualitatively different stimuli, suchas distinct arrangements of building material.Robots in our system add bricks to the structurein response to existing configurations of bricks.In doing so, the rules they follow must be con-structed in such a way that correct completion ofthe target structure is guaranteed, despite staleinformation about other parts of the structure, andirrespective of the (potentially variable) number

Fig. 1. Natural and artificial collective construction. (A and B) Complexmeter-scale termite mounds (A) are built by millimeter-scale insects (B), which actindependently with local sensing and limited information. (C) Physical implemen-tation of our system, with independent climbing robots that build using specialized

bricks. (D) System overview for building a specific predetermined result (Fig. 2, Aand C): A user specifies a desired final structure; an offline compiler converts it to a“structpath” representation (Fig. 3), which is provided to all robots; robots followlocal rules that guarantee correct completion of the target structure (movie S1).

www.sciencemag.org SCIENCE VOL 343 14 FEBRUARY 2014 755

REPORTS

of other robots and the order and timing of theirown actions. The fact that all robots follow thesame rules (17)—as all insects in the same colonyfollow consistent behavior patterns, or other ani-mals obey intraspecific social conventions (18)—helps to constrain outcomes and restricts the spaceof possible situations that robots typically en-counter and must be able to handle.

To ensure a predetermined outcome, robotrules are based on sensed brick configurationsplus a static internal representation of the targetstructure. A user provides a picture or other high-level representation of the desired final structure,specifying what sites are ultimately meant to beoccupied by bricks. An offline compilation stepconverts this to a “structpath” representation thatprovides movement guidelines for robots at eachlocation—in a sense, traffic laws appropriate forthat structure (19) (Fig. 3). Robots then follow afixed set of simple rules (19), referring to the staticstructpath representation and otherwise identicalfor any target structure; these ensure the growthof that structure to completion in a way consist-ent with robot capabilities (movie S1). The rulesrely on locally available information, preserve a

robot’s ability to move freely over the structureand opportunities for parallelism, and preventdeadlocks and other situations where the actionsof one robot interfere with those of another.Robots act reactively (20, 21); they do not preplantheir actions, as is appropriate in this decen-tralized multirobot approach in which a robotsetting out to perform a specific task might findit already completed by the time it gets there.

The structpath representation specifies a setof paths that robots can follow through the struc-ture layout while respecting their movementconstraints. In particular, all paths start from theseed and require climbing up or down a heightof at most one brick at a time. The structpathspecifies a fixed direction for robots to travel be-tween each pair of neighboring sites; off thestructure, robots follow its perimeter strictly counter-clockwise. This directional restriction smoothstraffic flow, ensures a flow of material into thegrowing structure (avoiding excessive backtrack-ing from, e.g., laden robots making way for un-laden ones to exit), and allows regularities instructure growth that let local rules ensure thepreservation of global invariants. Paths may split

and merge; a robot may have more than oneway of leaving or entering a site. A multiplicityof possible paths helps the system exploit theparallelism of the swarm. The compiler performsa recursive search to identify a set of paths with-out cycles that meets these requirements, or todetermine that none exists.

Individual robots then repeat the followingroutine: With a brick, circle the structure perim-eter until reaching the seed; climb onto the struc-ture and move along any legal path, keepingtrack of relative position with respect to the seed;attach the brick at any vacant site whose localneighborhood satisfies a fixed set of geometricrequirements (19); continue to follow the path offthe structure; obtain a new brick. These rules canbe shown to guarantee successful completion ofthe target structure while ensuring that no inter-mediate state calls for a robot to perform tasksbeyond its capabilities—in particular, climb ordescend a height of greater than one brick, attacha brick at a higher or lower level than itself, orforce a brick into place directly between twoothers (a mechanically difficult operation re-quiring high precision) (19). Direct interaction

Fig. 2. Two types of building process. (A) Different possible sequences alllead to the same predetermined endpoint. (B) Different sequences lead todifferent results, which are determined only during the course of construction.(C) Example of the first type of process, building a step pyramid modeled afterthe main temple at Chichen Itza [photo by Kyle Simourd, CC BY 2.0]. Upperand lower panels show snapshots from different possible sequences (movie

S2), at approximately 10%, 25%, 50%, 67%, and 80% completion of thecommon final structure. (D) Example of the second type of process, building aset of one-brick-high ramifying paths. (E) A hybrid system can combineelements of both types; in this example, paths of stochastically determinedlengths lead to buildings chosen randomly from a set of predefined structures.See (19) for details of agent rules for all cases.

14 FEBRUARY 2014 VOL 343 SCIENCE www.sciencemag.org756

REPORTS

between robots is limited to each one yieldingto the one ahead of it in this physical loop.

Because robots may take multiple possiblepaths through a structure, the ordering of the build-ing process can occur in many different ways.

Accordingly, the structure will emerge in differ-ent ways in different instances of building withthe same structpath, with intermediate structuresthat may be observed in one instance but notanother; however, the agent rules guide the

process to always end in the same final structure(movie S2).

In addition to this approach for producingpredetermined structures, the same robots can usedifferent local rules to build structures whose de-tailed form emerges from the construction pro-cess. Multiple structures built with the samerules share qualitative features but differ in de-tail. Such a rule set could, for example, be used togenerate a randomized street layout for a buildingcomplex. Figure 2E shows an example of a hy-brid system built by such a rule set (19), wherebuildings chosen randomly from a set of pre-defined types are positioned at the ends of lanesof stochastically determined lengths. The robotsagain use stigmergy to coordinate their actions;for example, particular configurations of bricksconstitute cues to agree on which building typeshould be constructed at the end of a given lane.

To demonstrate the feasibility of such a de-centralized multirobot construction system, wepresent a proof-of-concept implementation in hard-ware (19) (Fig. 1C and Fig. 4). Design choiceswere driven by the requisite primitive operationsthat robots must perform: pick up a brick from acache; attach a brick directly in front of them-selves; detect nearby robots; when on the struc-ture, move forward one site (while staying at thesame level or climbing up or down one brick)or turn in place 90° left or right; when off thestructure, circle its perimeter. For locomotion,we equipped robots with whegs [hybrid wheel-legs (22)], chosen for their empirical effective-ness in climbing (23). Each robot is equipped withseven active infrared sensors to detect black-and-white patterns on the bricks and ground fornavigation; an accelerometer to register tilt anglefor climbing and descent; an arm to lift and lower

Fig. 3. Target struc-tures and correspond-ing structpaths. For eachpredefined target structureat left, the correspondingstructpath representationat right is generated bythe offline compiler (19).From top to bottom: a sim-ple structure with a uniquestructpath if the seed lo-cation is given; the templeof Fig. 2C, showing one ofmany possible structpaths;a structure enclosing inter-nal courtyards. Sites in thestructpath are shaded ac-cording to height (darker =higher); a dot marks theseed brick. Directions arecolor-coded to clarify flows(red, left; blue, right; green,up; yellow, down).

Fig. 4. Hardware demonstration. Independent autonomous robots with purely onboard sensing collectively work on prespecified structures. (A) A castle-likestructure (movie S3). (B) A sequence of overhead snapshots building a branching structure (movie S4).

www.sciencemag.org SCIENCE VOL 343 14 FEBRUARY 2014 757

REPORTS

bricks, with a spring-loaded gripper to hold themsecurely while carried; and five ultrasound sonarunits that let each robot evaluate and maintain itsdistance from the structure perimeter, as well asdetect other sonar-using robots nearby. The ro-bot footprint (17.5 cm × 11.0 cm) is smallerthan that of the bricks, facilitating their maneu-verability atop a wall one brick wide. Bricks(21.5 cm × 21.5 cm × 4.5 cm) are made fromexpanded urethane foam, with physical featuresto achieve self-alignment and neodymium mag-nets for attachment.

This hardware system demonstrates multiplesimultaneously active, independent robots exe-cuting the full algorithm with entirely on-boardsensing. supplementary movies show fully autono-mous robots working on different user-specifiedstructures (Fig. 4 and movies S3 and S4), addingbricks both atop the structure and on the ground,climbing over the structure as they build it, andadapting to one another’s presence and actions,without human intervention beyond reloading thebrick cache. The reactive nature of the approachcan be demonstrated via externally imposed changesmade to a structure while robots work on it(movie S5). Many approaches to mobile roboticsdeal with the continuous, noisy real world byprobabilistically modeling its uncertainty (24);our system instead uses carefully engineered hard-ware to effectively discretize robot actions on thestructure. Sensor feedback and brick featuresmatched to the robots allow reasonable reliabilitywith simple control (19). Minor deviations fromideal behavior are corrected by compensatoryroutines and/or passive mechanical features; forexample, drift in position atop a structure is re-duced both by robots checking their pose withrespect to the brick markings, and by indentationson brick upper faces that guide the robots tostay within tolerance. Although extending thisresearch prototype to a full production systemwould require solving many additional engineer-ing challenges (19), our work demonstrates thatphysical hardware can allow the discretized theoryto sufficiently represent the continuous reality.

This work provides an example of anengineered complex system, with multiple au-tonomous robots following simple, local rulesand collectively achieving a specific desiredresult. Tools drawn from the social insects thatinspire our approach—the exploitation of regu-larities that arise from identical programming inmulti-agent systems, and the use of the envi-ronment as a means of implicit coordination—make these results possible. Future progress inour ability to design complex systemswill advanceour capacity to engineer systems that work asnature does (25–27), with large numbers of func-tionally limited, interchangeable parts, individ-ually unreliable, collectively robust.

References and Notes1. J. S. Turner, Nat. Hist. 111, 62–67 (2002).2. S. Camazine et al., Self-Organization in Biological

Systems (Princeton Univ. Press, Princeton, NJ,2001).

3. G. Theraulaz, E. Bonabeau, Science 269, 686–688 (1995).4. J. Werfel, thesis, Massachusetts Institute of Technology

(2006).5. P. W. Anderson, Science 177, 393–396 (1972).6. Y. Bar-Yam, Dynamics of Complex Systems

(Westview, Boulder, CO, 1997).7. M. Mitchell, Complexity: A Guided Tour (Oxford Univ.

Press, New York, 2009).8. B. Khoshnevis, Autom. Constr. 13, 5–19 (2004).9. S. Wismer, G. Hitz, M. Bonani, A. Gribovski, S. Magnenat,

Autonomous construction of a roofed structure:Synthesizing planning and stigmergy on a mobilerobot. In Proceedings of the 2012 IEEE/RSJ InternationalConference on Intelligent Robots and Systems(7 to 12 October 2012, Vilamoura, Algarve, Portugal),pp. 5436–5437.

10. F. Nigl, S. Li, J. E. Blum, H. Lipson, IEEE Robot. Autom.Mag. 20, 60–71 (2013).

11. Q. Lindsey, D. Mellinger, V. Kumar, Construction of cubicstructures with quadrotor teams. In Robotics: Science andSystems VII, H. Durrant-Whyte, N. Roy, P. Abbeel, Eds.(MIT Press, Cambridge, MA, 2012), pp. 177–184.

12. J. Willmann et al., Int. J. Archit. Comput. 10, 439–460(2012).

13. A. Bolger et al., Experiments in decentralized robotconstruction with tool delivery and assembly robots.In Proceedings of the 2010 IEEE/RSJ InternationalConference on Intelligent Robots and Systems(18 to 22 October 2010, Taipei), pp. 5085–5092.

14. J. Worcester, J. Rogoff, M. A. Hsieh, Constrained taskpartitioning for distributed assembly. In Proceedings ofthe 2011 IEEE/RSJ International Conference on IntelligentRobots and Systems (25 to 30 September 2011, SanFrancisco), pp. 4790–4796.

15. R. A. Brooks, A. M. Flynn, J. Br. Interplanet. Soc. 42,478–485 (1989).

16. P.-P. Grassé, Insectes Soc. 6, 41–81 (1959).

17. Y. Shoham, M. Tennenholtz, Artif. Intell. 73, 231–252 (1995).18. J. Maynard Smith, J. Theor. Biol. 47, 209–221 (1974).19. See supplementary materials on Science Online.20. R. Brooks, IEEE J. Robot. Autom. 2, 14–23 (1986).21. R. Arkin, Behavior-Based Robotics (MIT Press, Cambridge,

MA, 1998).22. R. D. Quinn et al., Insect designs for improved robot

mobility. In Proceedings of the 4th InternationalConference on Climbing and Walking Robots (24 to 26September 2001, Karlsruhe, Germany), pp. 69–76.

23. K. Petersen, R. Nagpal, J. Werfel, TERMES: Anautonomous robotic system for three-dimensionalcollective construction. In Robotics: Science and SystemsVII, H. Durrant-Whyte, N. Roy, P. Abbeel, Eds. (MIT Press,Cambridge, MA, 2012), pp. 257–264

24. S. Thrun, W. Burgard, D. Fox, Probabilistic Robotics(MIT Press, Cambridge, MA, 2005).

25. E. Bonabeau, M. Dorigo, G. Théraulaz, SwarmIntelligence: From Natural to Artificial Systems(Oxford Univ. Press, New York, 1999).

26. A. J. Ijspeert, A. Martinoli, A. Billard, L. M. Gambardella,Auton. Robots 11, 149–171 (2001).

27. M. Brambilla, E. Ferrante, M. Birattari, M. Dorigo, SwarmIntell. 7, 1–41 (2013).

Acknowledgments: Supported by the Wyss Institute forBiologically Inspired Engineering.

Supplementary Materialswww.sciencemag.org/content/343/6172/754/suppl/DC1Materials and MethodsFigs. S1 to S12Table S1References (28–34)Movies S1 to S5

11 September 2013; accepted 30 December 201310.1126/science.1245842

14 FEBRUARY 2014 VOL 343 SCIENCE www.sciencemag.org758

REPORTS