making offensive play predictable - using a graph convolutional … · 2021. 5. 4. · milner,...

19
1 Making Offensive Play Predictable - Using a Graph Convolutional Network to Understand Defensive Performance in Soccer Michael Stöckl, Thomas Seidl, Daniel Marley & Paul Power | Stats Perform 1. Introduction 1.1 Measuring defensive quality in soccer The art of good defending is to prevent something from happening before it has even happened. Virgil Van Dijk is considered one of the best defenders in world soccer as he has the ability to prevent a pass being made to an open attacker to shoot by forcing the ball carrier to pass somewhere else less dangerous. However, while we know this is great defending, in today’s stats, Van Dijk would not receive any acknowledgement. A defender’s contribution is simply measured by the number of tackles or interceptions they make. But what if we were able to measure actions that have been prevented before they were made? The aim of a defense and a defender is to make offensive play predictable. For example, Jürgen Klopp’s Liverpool, press the opposition with the aim of forcing them to give the ball away in specific areas of the pitch by limiting the number of passing options available in dangerous areas. If the art of good defending is to make play predictable, then it should be measurable. Given enough data, we should be able to predict where a player will pass the ball, the likelihood of that pass being completed and whether this pass will result in a scoring opportunity. It therefore stands that we should be able to measure if a defender forces an attacker to change their mind or to prevent an attacker from even becoming an option. Figure 1 shows a situation from a match between Liverpool vs Bayern Munich in the 2018/19 UEFA Champions League that leads to Mané (red 10) scoring. Our model identifies that Milner (red 7) is the primary target for Van Dijk (red 4) in the first instance. However, due to the combination of Gnabry (blue 22) closing down Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the defence, Mané becomes both the most likely receiver and a high threat for scoring. This demonstrates our ability to model how players decision making is influenced and how a situation can move from low threat to high threat by the off-ball actions of attackers and defenders. [LINK TO VIDEO]. In this paper we present a novel Graph Convolutional Neural Network (GNN) which is able to deal with highly unstructured and variable tracking data to make predictions in real time. This allows us to accurately model defensive behaviour and its effect on attacking behaviour, i.e., preventing actions before they have occurred. To do this we trained the following models: - xReceiver: Predicts the likelihood of every player becoming the pass receiver at any moment within a player possession. - xThreat: Predicts the probability of a shot occurring in the next 10 seconds if a pass was played to an attacker. - xPass: Predicts how likely a pass would be completed to each attacker off the ball at any moment within a player possession.

Upload: others

Post on 13-Aug-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

1

MakingOffensivePlayPredictable-UsingaGraphConvolutionalNetworktoUnderstandDefensive

PerformanceinSoccer

MichaelStöckl,ThomasSeidl,DanielMarley&PaulPower|StatsPerform

1.Introduction1.1MeasuringdefensivequalityinsoccerTheartofgooddefendingistopreventsomethingfromhappeningbeforeithasevenhappened.VirgilVanDijkisconsideredoneofthebestdefendersinworldsoccerashehastheabilitytopreventapassbeingmadetoanopenattackertoshootbyforcingtheballcarriertopasssomewhereelselessdangerous.However,whileweknowthisisgreatdefending,intoday’sstats,VanDijkwouldnotreceiveanyacknowledgement.Adefender’scontributionissimplymeasuredbythenumberoftacklesorinterceptionstheymake.Butwhatifwewereabletomeasureactionsthathavebeenpreventedbeforetheyweremade?The aimof a defense and a defender is tomakeoffensive playpredictable. For example, JürgenKlopp’sLiverpool,presstheoppositionwiththeaimofforcingthemtogivetheballawayinspecificareasofthepitchbylimitingthenumberofpassingoptionsavailableindangerousareas. Iftheartofgooddefendingistomakeplaypredictable,thenitshouldbemeasurable.Givenenoughdata,weshouldbeabletopredictwhereaplayerwillpasstheball,thelikelihoodofthatpassbeingcompletedandwhetherthispasswillresultinascoringopportunity.Itthereforestandsthatweshouldbeabletomeasureifadefenderforcesanattackertochangetheirmindortopreventanattackerfromevenbecominganoption.Figure 1 shows a situation from a match between Liverpool vs Bayern Munich in the 2018/19UEFAChampionsLeaguethatleadstoMané(red10)scoring.OurmodelidentifiesthatMilner(red7)istheprimarytargetforVanDijk(red4)inthefirstinstance.However,duetothecombinationofGnabry(blue22)closingdownMilner,Lewandowski(blue9)closingdownVanDijkandManémakinganactiverun,behindthedefence,Manébecomesboththemostlikelyreceiverandahighthreatforscoring.Thisdemonstratesourabilitytomodelhowplayersdecisionmakingisinfluencedandhowasituationcanmovefromlowthreattohighthreatbytheoff-ballactionsofattackersanddefenders.[LINKTOVIDEO].InthispaperwepresentanovelGraphConvolutionalNeuralNetwork(GNN)whichisabletodealwithhighlyunstructuredandvariabletrackingdatatomakepredictionsinrealtime.Thisallowsustoaccuratelymodeldefensivebehaviouranditseffectonattackingbehaviour,i.e.,preventingactionsbeforetheyhaveoccurred.Todothiswetrainedthefollowingmodels:

- xReceiver:Predictsthelikelihoodofeveryplayerbecomingthepassreceiveratanymomentwithinaplayerpossession.

- xThreat:Predictstheprobabilityofashotoccurringinthenext10secondsifapasswasplayedtoanattacker.

- xPass:Predictshowlikelyapasswouldbecompletedtoeachattackerofftheballatanymomentwithinaplayerpossession.

Page 2: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

2

andintroducenewdefensiveconcepts:

- Player Availability: Using the outputs from xReceiver and xPass we infer how available everyattackerisofftheballateachframe.

- Defensive Impact: We are able to detect high level defensive concepts such as ball and manorientateddefending,defensivepositionplayandoffballruns.

- Disruption Maps: Global visual representations of defending teams’ ability to disrupt theoppositionsattackingstrategy.

Figure1:Wetrainthreemodels(xPass,xReceiver&xThreat)tobetterunderstanddefensiveandoffensiveofftheballbehaviour,suchasman-orientateddefending,ball-orientateddefendingandactiveoffballruns.ThisletsusbetterunderstandhowSadioManéscoreda1-0leadvsBayernMunichduringLiverpool’swaytotheUCLfinalin2017/18.LINKTOVIDEO

Page 3: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

3

RelatedWork

1.2.1 DealingwithunstructureddataTrackingdataishighlyunstructuredandcanbedifficulttomodelduetomostmachinelearningtechniquesrequiringtabulardatasetswherefeaturesareinsertedinaspecificorder.TosolvethisorderingissueLuceyetal.[1]presentedtheconceptofaligningplayerstoaformationtemplate[1,2].However,thismethodhasseverallimitations.Firstly,insoccer,teamsusedifferentformationssoaplayeratrole10forateamusing433wouldbeverydifferenttoateamplaying352.Therefore,itisdifficulttocomparepredictionsbetweentheseplayers.Inaddition,thismethodisreliantuponteamshavingthesamenumberofplayersonthepitch(11perteam)meaningdifferentmodelshavetobelearnedwhenplayershavebeensentoffforexample.Withregardstorealtimeinferenceafurtherlimitationisthespeedatwhichplayersneedtobealignedbeforecalculatingfeaturesforinference.Fernandez[3]andBrefeld[4]useConvolutionalNeutralNetworks(CNNs)onanimagerepresentationoftracking data to circumvent the ordering/alignment issue while predicting probabilistic pitch controlsurfaces.Convertingtrackingdatadirectlytoimagesissuboptimalasonegivesupaverylow-dimensionaldata set and converts it into a high-dimensional sparse representation. Tracking data has an irregularstructureduetomissingplayersandalackofclearschemetoorderplayersinasequenceorframe[19].Bothtechniques are also time consuming and could cause timing issues for feature generation in real-timeapplications.Insteadofusinganimage-basedrepresentationweusedGraphNeuralNetworks(GNNs)which1)neglectstheneedfororderingfeatures,2)cancopewithvaryingnumberofplayersonthepitchand3)learns local and higher scale features directly from the tracking data. Horton applied a set-learningframeworktomodelpassinginfootballwhichissimilartoasimplegraph(noedges)[18].1.2.2 EvaluatingandPredictingFutureActionsinSportsConceptssuchasxThreatandxPassarenotnewwithpreviousresearchusingtrackingdatatopredictthelikelihoodofapassbeingcompleteoragoalbeingscoredafteraspecificaction [2,4,6]. Inaddition tomeasuringthevalueofanaction,theconceptofvaluingaplayers’offballposition,hasalsobeeninvestigated[2,3]. Spearman [9] alsodevelopedamodel to evaluateoff-ball scoringopportunities in soccer. ThesemodelscreateasurfaceareacombiningxThreatandxPassvaluestounderstandhowdangerousateam’sorplayer’scurrentpossessionisandalsowhatspacetheycontrol.Wei [7, 8] modelled the probability of where the next action will go in tennis and soccer. Franks [10]predicted defensive match ups and measured the influence of defenders on the offenses shootingperformance.Ghosting[11,12]hallucinatedwhereateamofdefenderswillmovetonextbasedonwheretheattackershavemoved,and theball ismoving to. Thispotentiallyprovidesuseful tools toassess thedefensive strategyof teamsbyevaluating thedifference inExpectedGoal (xG)orPossessionValue (PV)comparedtoaglobalbaseline.2.Method

2.1DataTotrainandvalidatethethreemodels(xTransition,xThreat,andxReceiver)weused1,200gamesoftrackingdatafrommultipleseasonsofTop5Europeanfootballleaguessampledat10Hzpersecond.Trackingdataconsistsof(x,y)positionsforeachplayerandtheball,theteamandplayerids,time,half,andeventidateachframe.Intotal,thedatasetconsistedofonemillionpasseswhichwassplitintoa90/10trainandtestset.

Page 4: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

4

ForlearningaxThreatmodelonlyframesrelatingtothemomentofpassingeventswereconsidered.ForthexTransitionandxReceivermodelsweincludedtrackingdatafromnotonlytheindividualpassesbutalsotrackingdatafromonehalf-second(5frames)andonesecond(10frames)beforethepass.Includingthesetwoadditionalmomentsprior toeachpasseventallowedustoachieveasemanticregularizationduringtrainingpreventingthemodel tooverfit to thepassmomentwhereplayers’movementsalready indicatewheretheballwillbeplayedtosomedegree.

2.2GraphConvolutionalNetworkTorepresentthetrackingdatainawell-definedstructurethatavoidsorderingissues,weusedagraph.AgraphG(V,E,U) isdefinedbynodesV,edgesE,andglobalfeaturesU.Inourrepresentation,asshowninFigure2,thenodesrepresenttheplayerandballtrackingdata,andtheedgescontaininformationabouttherelationshipbetweenthenodes.Noglobalfeatureswereincludedinthisapproach.Theedgeseijaredirectedandconnectasendingnodevitoareceivingnodevj.Tolearntherelationshipbetweenthegraphinputandoutputs,weusedaGNN.Specifically,weapplythespatialGNNapproachthat includesseparateoperations,knownasblocks,ontheedgesandnodesof thegraph[13].Anedgeblockisdefinedbyaneuralnetworkthattakesinputsfromtheedgefeatures,sendingnodefeatures,receivingnodefeaturesandoutputsanewedgeembedding.Similarly,anodeblockisdefinedbyaneuralnetworkthattakesinputsfromthenodefeatures,aggregatedsending edge features, aggregated receiving edge features and outputs a new node embedding. Apermutation invariant function isrequiredtoaggregatethesendingandreceivingedge features,e.g., themeanorsumofthosefeatures.WeusedasimilarGNNarchitectureforeachofthethreemodels:anedgeblock followedbyanodeblock.Eachblockhasamultilayerperceptron(MLP)with three layersandthenumberofunitsineachlayervariesbetweeneachtask.UsinganedgeblockintheGNNallowedthenetworktolearntherelationshipsbetweendifferentnodesforeachtask.ThisflexibilityisnotpresentinstandardapproachessuchasMLPsorconvolutions,wheretheconnectionsbetweeninputsaredefined.Eachmodeloutputsapredictionforeachplayerfromthefinalnodeblock.Thepredictiononeachnodeisthelikelihoodofaplayerreceivingthenextpass(xReceiver),thepasswillbecompleted(xPass),andiftherewillbeashotongoalwithinthenext10seconds(xThreat).

Figure2:Sketchof the graph representationused for thetrackingdata.Individualplayersandtheballareshownasnodesinthegraphwithdirectededgesconnectingthem.Individual players and the ball are shown asnodesinthegraphconnectedbydirectededges.Edgesareweighted to allow themodel to learnwhichnodesareonwhichteam.

Page 5: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

5

2.3FeaturesAllthreeGNNmodelsusethesamesetofinputfeatures.NodefeaturesincludeplayerXYposition,speed,acceleration,angleofmotion,distanceandangletotheattackinggoal,distancetotheballcarrier,differenceintheangleofmotiontotheballcarrier,andaflagthatindicateswhethertheplayeristheballcarrier.Theedgefeaturesincludeaflagdefiningtherelationshipbetweenthetwonodes(teammate2,oropponent1),thedistancebetweenthetwoplayers,andthedifferenceintheangleofmotion.2.4TrainingWetrainedauniqueGNNforeachofthexTransition,xThreat,andxReceivermodels1.Acrossthedifferenttasks, thenodesareweightedinthe losscalculationtostabilizethetraining.Thenodesrepresentingthedefensiveplayersweremaskedoutforallmodels.DuringthetrainingofthexTransitionandxThreatmodel,the only players consideredwere those that received the pass or were the intended receivers. For thexReceivermodelthe(intended)receiverofapasshasaweightof1.0andallotherteammatesoftheballcarrierhaveaweightof0.1tobalancethesignaltobackground.2.5ModelTrainingResultsWecomparedthetrainedgraphmodelsxReceiver,xPassandxThreatagainstrespectivebaselinemodelswetrainedearlier.ThebaselinemodelswereMLPsthatwerebasedonmanyhandcraftedfeaturesconsideringthe players’ motion characteristics (speed, acceleration, moving direction), relationships between theplayersshownbydifferencesinthemotionfeaturesandwealsoconsideredtheballinformation(whereitisplayedwithwhichspeed)asseenin[11,12].ThelossandaccuracyofallthreeGNNmodelswerebetterthanorthesameasthemetricsoftherespectivebaselinemodel.Whereastheaccuracywascalculatedthesameforallmodels,theloglossisnotcomparableforthexReceivermodels.Asdescribedabove,theloglossoftheGNNxReceivercontainsallteammatesofthepasser,eventhoughmostofthemwithsmallweights*.However,theloglossforthebaselinexReceiveriscalculatedonlyconsideringtheplayerapasswasplayedto.Intotal,theGNNmodelsarebetteroratleastofthesamequalityasthebaselinemodelsalthoughmuchfewerhandcraftedfeatureswereusedsincethegraphwas able to learn the spatial relationships between the players on the pitch. Interestingly the baselinexReceivermodelhadthespeedofpass,angleofpassanddistancetoreceivercalculatedforthepassmoment.However, these featureswerenot included in theGNN. This indicates that theGNN is able to learn thecomplexdynamiccharacteristicsoffootballbasedontheplayermotionfeaturesatthenodelevelcombinedwiththeinter-playermotiondynamicsandteamidentityattheedge.Thisiswhatallowsustouseasimplerfeaturerepresentationallowingfasterendtoendprediction(0.05s).SeeAppendixA1fordetails.Table1:TrainingmetricsofthethreeGNNmodels

Accuracy loglossbaselinexPass 0.85 0.29GNNxPass 0.86 0.28

baselinexReceiver 0.73 0.70GNNxReceiver 0.83 0.07*baselinexThreat 0.95 0.16GNNxThreat 0.95 0.16

1Weuseacustom,pure-TensorFlowpackagebasedontheDeepMindgraph_netslibrary[https://github.com/deepmind/graph_nets]totrainourGNNs.

Page 6: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

6

Figure3:Exampleofhowweareabletocaptureemergentbehaviour.A-EshowspecificmomentswithinapossessionofBayernMunich(blueteam).FshowsframelevelestimatesforxReceiver(lowvalues:blue,highvalues: red) for a subsetof offensiveplayers throughout thepossession.Players are identifiedby jerseynumber. Grey ellipsoids show collectiveman orientated defendingwhereas orange ellipsoids show ballorientateddefendingfromLiverpool(redteam).Greenellipsoidsshowactiverunsfromtheblueteam.LINKTOVIDEO

Page 7: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

7

3.MeasuringtheUnmeasurableWe introduce a new defensive toolbox,Defensive Impact, which analytically determines howmuch thedefendingteamdisruptstheopposition’splay.Wecouldmeasurethisbylookingatwhereteamsinterceptthe ball, however as discussed in Section 1 defending is aboutpreventingwhat hasn’t happened yet.Wedemonstrate that the xReceivermodelnot only accuratelypredictswhowill receive theball next at themomentofapass,butwhenappliedattheframelevel,captureshumandecisionmaking.Wefindthemodelcanpredictwhowillreceivetheballinadvanceofapassbeingmadeandalso(moreimpressively)whenaplayerchangeshismindandmoreimportantly,why.OurxThreatandxPassmodelsallowustovaluenotjustwhatdidhappenbutwhatcouldhavehappenedormoreaccuratelywhatwasprevented(Figure3linktovideo).InthefollowingsectionwewilldemonstratehowwecreatedthefeaturesthatpowerDefensiveImpactandthenewinsightsthatarenowpossibletobegeneratedusinganexamplegamebetweenLaziovsJuventusfrom2018/19SerieAseason.3.1DisruptionMapsWhileweareabletocreateverygranularinsightswithDefensiveImpact,theabilitytofindsummaryinsightsisequallyimportant.ToprovidecompressedrepresentationsofourinsightsweintroduceDisruptionMaps.ADisruptionMapisaweighted2ddistributionthatshowswhereateam,positivelyornegatively,disruptedtheopposition’soffballoptions.TocalculateaDisruptionMap,wefirstgeneratea‘spatialidentity’(Figure4 top left) for each team for their xReceiver, xThreat and xPass model output. This acts as a globalrepresentationofateam(seeAppendixA2foroutputsforallteamsin2018/19SerieAseason).Wethencalculatethesamesurfaceatagamelevel(Figure4topmiddle)andsubtractthetwosurfacestocreatetheDisruptionMap(Figure4topright).Thisfinalimagerevealswheretheoppositiondisrupts(ornotinsomecases)theopposition’snormalstrategy/flow2.Wecannowusethesemapstodeterminewhichteamhadthebiggestimpactontheiropposition’sattackingstyleandefficiency.Wearealsoabletobreakthisdownataplayerleveltoseewhowastargetbothonandofftheball.3.2Lazio’s“Stellungsspiel”-DefensivePositionPlayDespite Lazio being beaten 1-2, Juventus’ Giorgio Chiellini stated post game "Itwas theworst Juventusperformanceoftheseasonforthefirst60minutes”[14].WeusedourDisruptionMapstounderstandwhatLazio did to affect Juventus somuch andwhich players they targeted Figure 4 bottom shows Juventus’DisruptionMapsforthegame.WecanclearlyseeLazioconsiderablydecreasedthexThreatfromtheirleftsideofthepenaltybox(Ronaldo’sside),witha10%decreaseinJuventus’xThreatcomparedtoallotherJuventusgamesinthatseason.Inaddition,wecanalsoseehowLazioincreasedtheriskofcompletingapasstoanyoptioninthewidechannelsbetweenthetouchlineandsix-yardboxandinthe“secondpenaltyarea”outsidethepenaltyareaby20%.ThesearethemostdangerouszoneswhereshotsandgoalsarecreatedandthespacesthatJuventus’threeprimaryattackersoperate.Withregardstowhereplayerswerelikelytobeatargetforapass,wecanseethatxReceiverwashigherthanaveragearoundLazio’spenaltyboxhowever,asdiscussed,theseareashadasignificantlyhigherriskofcompetitionandmoreimportantlysignificantlylowerprobabilityofleadingtogoaleveniftheywerecompleted.

2DisruptionMapsestimatetacticaldeviationfromanaveragestyle,wheredifferencescanbeattributedtooppositionand/orchangesofyourownteam’stacticforaspecificgame.Todifferentiatebetweenthetwoisoutofthescopeofthispaper.

Page 8: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

8

Figure4. ToprowshowshowDisruptionMapsarecomposed; theyare thedifferencebetweenaTeamIdentityandaGameIdentity;BottomrowshowsJuventus’DisruptionMapsforxReceiver,xPass,andxThreatTore-emphasisehowpoorJuventus’performancewasviewed,theGermanfootballmagazineKickerstatedinthefirsthalf,“CristianoRonaldowasoutofthegame,andDybalaalsofailedtogivemomentumtotheBianconeri'soffense.” [14]. OurplayerDisruptionMapsbackthisup(Figure5).Ronaldo’sprobabilityofreceivingapasswasbelowaverageacrossthepitchinfirsthalfespeciallyinrightchannels.Thismarginallyimprovedinthesecondhalfbutonlyinthedeeprightchannel.Histhreatdidn’texistinthefirsthalf(Figure5lowerleft)however,hetookupdangerouspositionsontheleftchannel.But,asshowninhissecondhalfxReceiverDisruptionMap(Figure5upperright)hewasneverviewedasagoodoptiontopassto.Dybalatookupsignificantlymoredangerouspositionsontheleftofthepenaltyareaandaroundthepenaltyspot.Theriskofcompletingthepasswassignificantlyloweraswellindicatinghetookboththreateningandlow risk positions in attacking areas. However, hewas rarely viewed as a realistic passing option (lowxReceiver)withhimonlybeingaboveaverageontheleftflankandthemiddleofthepitch.Thisisthecriticalinsightwearenowabletosurfacecomparedtoothermethods.Wearenotjustabletomeasureifaplayerisinagoodposition(highthreat/lowrisk),wecanmeasureiftheyarearealisticoptiontoreceivetheball.TounderstandwhyDybalawasnotaviableoptiondespitehisgoodpositioningweneedtogoadeeperlevelofanalysiswhichisoutofscopeforthispaper.

Page 9: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

9

Figure5:PlayerxReceiverandxThreatDisruptionMapsbyhalf:LeftRonaldo,RightDybala.Blueareasarebelowaveragevalueswhereasredindicatesaboveaverageregions.4.GoingDeeper–MeasuringDecisionMakingAprimarydefensivetacticinsocceristotargetkeyplayersinpossession.Thesemaybeplayerswhoareimportanttotheteam’sbuilduporconverselyplayerswhoaremorelikelytogivetheballawayandcausetransitionmoments.Targetingcanhappeninthreeways,ballorientatedwheredefenderswillspecificallytargettheplayerinpossessionthroughpressure;manorientated,wheretheballcarrierisnottargetedbuttheirpassingoptionsare;oracombinationofbothwiththeaimofeithercausingatransitionorfortheballtobeplayedtoalessdangerousarea[16](Figure3).Thesehigh-levelcomplexcoachingconceptsareincrediblydifficulttocaptureandrelyondomainexpertswatchinghoursofvideotofindandanalyze.However,byusingtheoutputfromthexReceivermodel,wecancreateadetector to find thesemomentsby simply findingwhen theprimary target changes for theballcarrier.WedefinetheprimarytargetastheattackerwiththehighestxReceivervalueateachframe.Ifthereisachangeintheprimarytarget,wemakeanassumptionthataproactiveactionhasoccurred.Thiscouldbearunfromasupportingattacker,theplayerinpossessionmovingwiththeballoradefender(s)actions.Anobviousnextstepcouldbetotrainamodeltopredict thesesituations. However, these labelsdonotcurrentlyexistandtoaskasetofdomainexpertstoannotatethousandsofexampleswouldbehighlytimeconsumingandexpensive.Instead,weusedaprogrammaticlabellingfunctionsapproach[17]andaskedadomain expert to identify simple functions that capture the expected behaviour of players for differentdefensivecontexts(seeAppendixA3forasample).Basedonobservingourinitialdetectedmomentswithadomainexpert,wedefinedthreedefensiveandtwoattackingsituations(Table3).

Page 10: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

10

Table3:Defensiveoffballcontextsandtheirdefinitionsasdefinedbythedomainexpert.

4.1Ronaldo:NullandVoidWecansee fromtheDisruptionMapsthat in the firsthalfLaziowereable tonullifyRonaldoasaviablepassingoption.However,howdidtheydothis?UsingourDefensiveImpacttoolbox,wecannowassessifRonaldowastargetedwheninpossession(ballorientateddefending)andoutofpossession(manorientateddefending).Inaddition,wecanassessifRonaldomadeanyactiverunstobecomemoreavailableforateammate.4.1.1 TargetingtheSupplyLineTounderstandwhyRonaldowentmissinginthefirsthalf,weidentifiedtheprimaryplayerswhopasstheballtoRonaldofromthelast5games(Table5).Wechosethelast5gamesasthisishowoppositionscoutswoulddoit.WemeasuredhowoftenRonaldowasidentifiedasaprimarytargetfortheseplayers(Table6).WeclearlyseethattherelationshipbetweenMatuidiandRonaldoisheavilytargeted,withRonaldoonlybeingconsideredanoptionforMatuidionceduringtheLaziomatch.WealsoseethedifferenceinRonaldo’sperformancebetweenthefirstandsecondhalf,withRonaldoonlybeingidentified5timesinthefirstand8timesinthesecond.ThisfurthersupportstheobservationsinKicker.4.1.2 TargetingRonaldoinPossession

Ronaldo was targeted 8 times in the gamewhen in possession with Lazio applying both ball andmanorientateddefending.Figure6(right)showsaprimaryexampleofhowLaziocloseddownRonaldo’soptionsbyapplyingpressuretoRonaldoashecarriedtheballandalsoapplyingpressuretohisprimarytargets;Matuidi(red14)andSandro(red12).WecanseehowSandromadeanactiveruntobecometheprimarytargetandhowCorrea(blue11)attemptedtoapplymanorientatedpressure.However,thedefensiverunofLuisAlberto(blue10)topressRonaldowasresponsibleforthedecreaseinSandro’sxReceivervalue.AtthesametimeMatuidi(red14)becamethemostlikelyreceiver(xR0.98),however,duetothecombinedpressureofMilinković-Savić(blue21)andParolo(blue16)hisxPasswasonly0.66,meaningtherewasahighchanceofatransition.

Situations Definition

Ballorientateddefending Defendermovesclosertotheballcarrierathighspeedtoincreasethechanceoftheballbeinggivenawayorpasstolessdangerousarea.

Manorientateddefending Defendermovesclosertotheprimarytargettoreducetheprobabilityofthembeingareceiver.

Ballandmanorientateddefending Combinationoftheprevioustwo.

ActiveoffBallRuns Anattackermovesathighspeedtoincreasetheirprobabilityofbeingareceiver.

Page 11: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

11

Table5.Top3passerswhostartedthegamefromtheprevious5matches

Table6.Ronaldo’savailabilityforhisthreeprimarysuppliers

Table7.TheavailabilityofRonaldo’sthreeprimarysupplierswhenheisinpossession

Option RonaldoInPossessionFirstHalf

RonaldoInPossessionSecondHalf

PauloDybala 4 4BlaiseMatuidi 2 1AlexSandro 3 6

WehavedemonstratedhoweffectiveLazio’sdefensiveplaywasinnullifyingRonaldoandhowtheydidit.Theotheraspectwecanexamineis ifRonaldoactivelytriedtomakehimselfavailable. Ronaldomade7activeruns(Figure6RightLinktoVideo)howeveronly3wereattackingforwardruns.IfwecomparethistoLazio’sforwards,ImmobileandLuisAlberto(Figure7Left),weseetheyweremoredangerouswiththeirruns.Figure6(Left)showsanexampleofRonaldomakinganactiverunintothepenaltyareatobecomeaprimary target forDouglasCosta. WhenCosta received theballRonaldoonlyhadanxReceiverof0.22.However,aswefollowhisrun,wecanseehimbecomingtheprimarytargetattheedgeofthepenaltyarea.Interestingly,theplotrevealshewasalwaysinahighlythreateningpositionfortheentiretyofthemovebyplayingontheshoulderofthelastdefender.

Passer PassTotal GamesPlayedTogether PassPer90

AlexSandro 31 4 7.23

Matuidi 22 4 5.31Dybala 22 4 5.29

Supplier RonaldoTargetFirstHalf

RonaldoTargetSecondHalf

PauloDybala 1 5BlaiseMatuidi 1 0AlexSandro 3 3

Page 12: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

12

Figure6.Left:ExampleofRonaldo(7)makinganactiveoffballruntobecomethemostlikelyreceiver.HistrailshowsbothhowdangeroushisrunwasandhowhisxReceiverincreased.Right:Ronaldobeingtargetedwithcombinedmanorientatedandballorientateddefending.LINKTOVIDEO

Figure7.ActiveRunMapsforLazio’s(CiroImmobileandLuisAlberto)andJuventus’(PauloDybalaandChristianoRonaldo)mainattackers.

Page 13: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

13

4.SummaryWeproposedanovelGNNarchitecturethatallowstodealwithunstructureddata.TheGNNallowsustodirectlylearnfromalightweightfeaturerepresentationthecollectivebehaviourofthesoccerplayersonthepitch.Thiscircumventstheneedtoorderplayersandforheavyfeaturecraftingwhichallowsustoapplyanend-to-endinferencepipelineinreal-timeapplications.Basedonthemodeloutputs,weintroducedDefensiveImpact,atoolboxtomeasuretheinfluenceofdefensivestrategy on the opposition at a team and player level. Disruption Maps create a compact visualrepresentationofateam’seffectontheopposition’sxThreat,xPassandxReceivervaluescomparedtotheirglobaloverage. Itallowsusers todeterminewhereadefencehasbeensuccessfulorstruggledbasedonreducing/increasinganopposition’sthreat,passriskandplayeravailability.Wewereabletoidentifydifferentdefensivestyles(manandballorientateddefending)andoffballruns.Duetothelackoflabelleddataweutilisedataskprogrammingapproach–creatinglabellingfunctionsbasedonadomainexpert’sinsights.Movingforwardanactivelearningapproachwherelabelsaregenerated,trainedagainstandthenassessedwouldbearecommendednextstep.Thisresearchenablestheevaluationofdefensivebehaviourandprovidestoolsandinsightsforcoachesandfanstoengagewith.

Page 14: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

14

5.References[1]XinyuW.,LongS.,Lucey,P.,Morgan,S.,&Sridharan,S..Large-scaleanalysisofformationsinsoccer.In

DigitalImageComputing:TechniquesandApplications(DICTA),2013InternationalConferenceon,pages1–8.IEEE,2013.

[2]Power,P.,Ruiz,H.,Wei,X.,&Lucey,P.(2017).“Notallpassesarecreatedequal:”Objectivelymeasuringtheriskandrewardofpassesinsoccerfromtrackingdata.ProceedingsoftheACMSIGKDDInternationalConferenceonKnowledgeDiscoveryandDataMining.

[3]Fernández,J.,&Bornn,L.(2020).SoccerMap:ADeepLearningArchitectureforVisually-InterpretableAnalysisinSoccer.EuropeanConferenceonMachineLearningandPrinciplesandPracticeofKnowledgeDiscoveryinDatabases(ECML-PKDD).

[4]Brefeld,U.,Lasek,J.,&Mair,S.(2019).Probabilisticmovementmodelsandzonesofcontrol.MachineLearning,108(1),127–147.https://doi.org/10.1007/s10994-018-5725-1

[5]Kim,K.,Grundmann,M.,Shamir,A.,Matthews,I.,Hodgins,J.,&Essa,I.(2010).Motionfieldstopredictplayevolutionindynamicsportscenes.ProceedingsoftheIEEEComputerSocietyConferenceonComputerVisionandPatternRecognition,840–847

[6]Cervone,D.,D’amour,A.,Bornn,L.,&Goldsberry,K.(2014).POINTWISE:PredictingPointsandValuingDecisionsinRealTimewithNBAOpticalTrackingData.MITSloanSportsAnalyticsConference.

[7]Wei,X.,Lucey,P.,Morgan,S.,Reid,M.,&Sridharan,S.(2016).“TheThinEdgeoftheWedge”:AccuratelyPredictingShotOutcomesinTennisusingStyleandContextPriors.MITSloanSportsAnalyticsConference.

[8]Wei,X.,Lucey,P.,Vidas,S.,Morgan,S.,&Sridharan,S.(2015).Forecastingeventsusinganaugmentedhiddenconditionalrandomfield.LectureNotesinComputerScience(IncludingSubseriesLectureNotesinArtificialIntelligenceandLectureNotesinBioinformatics),9006,569–582.https://doi.org/10.1007/978-3-319-16817-3_37

[9]Spearman,W.(2019).BeyondExpectedGoals.MITSloanSportsAnalyticsConference.[10]Franks,A.,Miller,A.,Bornn,L.,&Goldsberry,K.(2015).Counterpoints:AdvancedDefensiveMetrics

forNBABasketball.MITSloanSportsAnalyticsConference.[11]Seidl,T.,Cherukumudi,A.,Hartnett,A.,Carr,P.,&Lucey,P.(2018).Bhostgusters:RealtimeInteractive

PlaySketchingwithSynthesizedNBADefenses.MITSloanSportsAnalyticsConference.[12]Le,H.M.,Carr,P.,Yue,Y.,&Lucey,P.(2017).DataDrivenGhostingusingDeepImitationLearning.MIT

SloanSportsAnalyticsConference.[13]Battaglia,P,etal.(2018).Relationalinductivebiases,deeplearning,andgraphnetworks.[14]https://www.espn.com/soccer/report?gameId=522613[15]https://www.kicker.de/lazio-gegen-juventus-2019-serie-a-4509058/spielbericht(in German)[16]https://spielverlagerung.com/2014/07/07/counterpressing-variations/ [17]Sun,J.J.,Kennedy,A.,Eric,Z.,Yue,Y.,&Perona,P.(2020).TaskProgramming:LearningDataEfficient

BehaviorRepresentations.arXivPreprint,(2011.13917).[18]Horton,M.(2020).LearningFeatureRepresentationsfromFootballTracking.MITSloanSports

AnalyticsConference.[19]Mehrasa,N.,Zhong,Y.,Tung,F.,Bornn,L.,Mori,G.(2018)DeepLearningofPlayerTrajectory

RepresentationsforTeamActivityAnalysis.MITSloanSportsAnalyticsConference

Page 15: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

15

A. AppendixA1.SpeedofInferenceOurthreeGNNmodelsusethesamefeaturesandthenumberoffeaturesisrathersmallasdescribedabove.Thisraisedourinterestwhetherthesemodelsareapplicableinreal-timeoratleastclosetoreal-time.Weranatesthowlongthefeatureengineering(custompythoncode)andaGNNmodelinferenceofasingleframetakeonanoff-the-shelfnotebook3,respectively.Table2showsthatonaveragethewholeprocessfromcraftingthefeaturesuntiltheinferenceofthemodelismadetooklessthan0.05sforonetrackingtimestamponourtestnotebook.ThisenablesustouseourGNNmodelsnearlyinreal-time.TableA.1:Speedtestshowlongfeaturecraftingandthemodelinferencetakesinseconds[s];descriptiveresultsof1000attempts

Featurecrafting(s) Inference(s) Total(s)mean(s) 0.044 0.001 0.045

std 0.004 0.0002 0.006

min 0.038 0.0007 0.04

max 0.068 0.0029 0.08

3 MacBookPro with a 2.3 GHz Intel Core i5 processor and 16GB RAM

Page 16: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

16

A2.SpatialidentitymapsforallteamsfromSerieAseason2018/19orderedbyleaguetable

FigureA.1:“SpatialIdentityforxReceiver”forallteamsfromSerieA2018/19.Teamsplayingfromlefttoright.

Page 17: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

17

FigureA.2:“SpatialIdentityforxPass”forallteamsfromSerieA2018/19.Teamsplayingfromlefttoright.

Page 18: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

18

FigureA.3:“SpatialIdentityforxThreat”forallteamsfromSerieA2018/19.Teamsplayingfromlefttoright.

Page 19: Making Offensive Play Predictable - Using a Graph Convolutional … · 2021. 5. 4. · Milner, Lewandowski (blue 9) closing down Van Dijk and Mané making an active run, behind the

19

A3.SamplepseudocodeforaprogrammaticlabellingfunctionBasedondiscussionswithdomainexpertsthetaskofidentifyingmanorientateddefendingwastranslatedintothefollowingpseudocode:“Defendermovesclosertotheprimarytargettoreducetheprobabilityofthembeingareceiver.”

FigureA.4:Pseudocodeforprogrammaticlabellingofmanorientateddefending.