cvpr 2020 tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · deep...

71
Incremental Learning Karteek Alahari Inria, France CVPR 2020 Tutorial Towards Annotation-Efficient Learning

Upload: others

Post on 24-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

IncrementalLearning

KarteekAlahariInria,France

CVPR2020TutorialTowardsAnnotation-EfficientLearning

Page 2: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

IncrementalLearning?

•  Continuallearning

•  Lifelonglearning

•  Sequentiallearning

•  Never-endingLearning2KA:IncrementalLearning

Page 3: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

AnIncrementalLearningScenario

•  GrowingupinIndia

3KA:IncrementalLearning

Page 4: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

AnIncrementalLearningScenario

•  Andthenduringtravels

4KA:IncrementalLearning

Page 5: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

AnIncrementalLearningScenario

•  Andthenduringtravels

Butcanstillrememberholycows!

5KA:IncrementalLearning

Page 6: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Slidecredit:Hung-yiLee6KA:IncrementalLearning

Page 7: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

TRAIN–VALIDATION–TEST

Allsampledfromthesamedistribution->benchmarksandacademicdatasets->real-worldsystems->embodiedlearning

StandardMachineLearning

Slidecredit:T.Tuytelaars7KA:IncrementalLearning

Page 8: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Slidecredit:T.Tuytelaars8KA:IncrementalLearning

Page 9: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

-  Task-incrementallearning-  Class-incrementallearning-  Domain-incrementallearning

IncrementalLearningSetup

Slidecredit:T.Tuytelaars9KA:IncrementalLearning

Page 10: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

IncrementalLearning•  Aclassicalprobleminmachinelearning,e.g.,

[Carpenteretal.’92,CauwenberghsandPoggio’00,Polikaretal.’01,SchlimmerandFisher’86,Thrun’96]

•  Somemethods

–  Zero-shotlearning,e.g.,[Lampertetal.’13]Notrainingstepforunseenclasses

–  Continuouslyupdatethetrainingset,e.g.,[Chenetal.’13]Keepdataandretrain

–  Useafixeddatarepresentation,e.g.,[Mensinketal.’13]Simplifythelearningproblem 10KA:IncrementalLearning

Page 11: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Originalmodel

Jointtraining“golden”baseline

Figuresfrom[LiandHoiem2016]

BruteForceSolution(non-incremental)

11KA:IncrementalLearning

Page 12: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Retrainfullmodelwithbotholdandnewdata• Computationallyexpensive

• Needsaccesstoolddata•  Storagecapacitylimitations•  Privacyissues•  Scalabilityissues

FeatureExtraction

Classification

BruteForceSolution(non-incremental)

Slidecredit:T.Tuytelaars 12KA:IncrementalLearning

Page 13: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Whynotbruteforce?

•  Noaccesstoallthedata

•  Cannotstoreallthedata

•  Accesstoonlyapreviouslylearnedmodel,e.g.,trainedbyothers

13KA:IncrementalLearning

Page 14: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Originalmodel

Featureextraction

Figuresfrom[LiandHoiem2016]

NaïveSolution1

14KA:IncrementalLearning

Page 15: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

•  Finetuneonlylastlayerusingnewdataonly–  Leadstosuboptimalresults

FeatureExtraction

Classification

NaïveSolution1

Slidecredit:T.Tuytelaars15KA:IncrementalLearning

Page 16: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Originalmodel

Finetuning

Figuresfrom[LiandHoiem2016]

NaïveSolution2

16KA:IncrementalLearning

Page 17: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

NaïveSolution2

•  Finetunethenetworkusingnewdataonly–  Leadstocatastrophicforgetting

FeatureExtraction

Classification

Slidecredit:T.Tuytelaars17KA:IncrementalLearning

Page 18: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

IncrementalLearning:ComputerVisionTask

A

B

18

Page 19: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Baseline,i.e.,trainingwithalltheclasses

Trainingwiththeinitialsetofclasses

Trainingwiththenewsetofclasses

HowwelldoesnetworkBperform?

12.868.4

64.571.3

38.769.8

Noguidanceforretainingtheoldclasses19KA:IncrementalLearning[Catastrophicforgetting:McCloskeyandCohen1989,Ratcliff1990]

Page 20: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

•  Learnonetaskaftertheother•  Withoutstoring(many)datafromprevioustasks•  Withoutmemoryfootprintgrowing(significantly)overtime•  Without(completely)forgettingoldtasks

IncrementalLearning:TheRules!

Slidecredit:T.Tuytelaars

Task1 Task2 Task3 Taskn…

20KA:IncrementalLearning

Page 21: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Whatelsewillweseetoday?•  Flavourofdifferentapproaches:

1.  Regularizationbased:LwF,EBLL,EWC,SI,MAS,IMM,…2.  Rehearsal/Replay:iCaRL,DGR,GEM,…3.  Architecturebased:PackNet,progressivenets,HAT,…

•  Morethanclassification?

•  Takeaways21KA:IncrementalLearning

Page 22: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Whatelsewillweseetoday?•  Flavourofdifferentapproaches:

1.   Regularizationbased:LwF,EBLL,EWC,SI,MAS,IMM,…2.  Rehearsal/Replay:iCaRL,DGR,GEM,…3.  Architecturebased:PackNet,progressivenets,HAT,…

•  Morethanclassification?

•  Takeaways22KA:IncrementalLearning

Page 23: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Regularization-basedModels

•  Whentraininganewtask,–  addaregularizationtermtotheloss–  i.e.,termtopenalizecatastrophicforgetting

•  R1:data-focusedmethods•  R2:model/prior-focusedmethods

Slidecredit:T.Tuytelaars23KA:IncrementalLearning

Page 24: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Data-focusedRegularization:LearningwithoutForgetting

•  Knowledgedistillationloss–  i.e.,preservationofresponses

FeatureExtraction

Classification

…Newtaskinput

Newtaskannotations

Previousmodel’soutputforoldtasks

24KA:IncrementalLearning Slidecredit:T.Tuytelaars[Li&Hoiem2016]

Page 25: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Data-focusedRegularization:LearningwithoutForgetting

25KA:IncrementalLearning[Li&Hoiem2016]

Simplemethod;goodresultsforrelatedtasks

Poorresultsforunrelatedtasks

? Needtostoretheoldmodel

Page 26: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

•  Penalizechangesto‘important’parameters

Model-focusedRegularization

26KA:IncrementalLearning

Lossonnewtask(s) Regularization

Differentvariantspossiblefor“importance”andregularization

Page 27: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

•  Elasticweightconsolidation[Kirkpatricketal.,2017]–  Indiv.penaltyforeachprevioustask–  Fisherinformationmatrixfor

Model-focusedRegularization

27KA:IncrementalLearningFigurefrompaper

Page 28: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

•  Elasticweightconsolidation[Kirkpatricketal.,2017]–  Indiv.penaltyforeachprevioustask–  Fisherinformationmatrixfor

Model-focusedRegularization

28KA:IncrementalLearning

Agnostictoarchitecture;Goodresultsempirically

Onlyvalidlocally

? Needtostoreimportanceweights

Page 29: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

•  Memoryawaresynapses[Aljundietal.,2018]–  Considersonlytheprevioustask–  Changeingradientsfor

Model-focusedRegularization

29KA:IncrementalLearningFigurefrompaper

Page 30: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

•  Memoryawaresynapses[Aljundietal.,2018]–  Considersonlytheprevioustask–  Changeingradientsfor

Model-focusedRegularization

30KA:IncrementalLearning

Agnostictoarchitecture;Leveragesdata&output

Onlyvalidlocally

? Needtostoreimportanceweights

Page 31: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

•  Twoexamples–  Elasticweightconsolidation[Kirkpatricketal.,2017]–  Memoryawaresynapses[Aljundietal.,2018]

•  Otheralternatives–  PathIntegral/SynapticIntelligence:largechangesduringtraining[Zenkeetal.,2017]

–  Momentmatching[Leeetal.,2017]–  Pathnet[Fernandoetal.,2017]–  …

Model-focusedRegularization

31KA:IncrementalLearning

Page 32: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Whatelsewillweseetoday?•  Flavourofdifferentapproaches:

1.  Regularizationbased:LwF,EBLL,EWC,SI,MAS,IMM,…2.   Rehearsal/Replay:iCaRL,DGR,GEM,…3.  Architecturebased:PackNet,progressivenets,HAT,…

•  Morethanclassification?

•  Takeaways32KA:IncrementalLearning

Page 33: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Rehearsal/Replay-basedmethods

•  Storeacoupleofexamplesfromprevioustasks

•  Orproducesamplesfromagenerativemodel

•  But–  Howmany?–  Howtoselectthem?–  Howtousethem?

33KA:IncrementalLearning

Page 34: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

iCaRL:Incrementalclassifierandrepresentationlearning

•  Selectssamplesthatareclosesttothefeaturemeanofeachclass

•  Knowledgedistillationloss[Hintonetal.’14]

•  Cleveruseofavailablememory(seethefollowing)

34KA:IncrementalLearning[Rebuffietal.2017]

Page 35: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

iCaRL:Incrementalclassifierandrepresentationlearning

35KA:IncrementalLearning[Rebuffietal.2017]

Splittheprobleminto:•  learningfeatures,andthen•  usingNCMclassifier

Page 36: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

iCaRL:Incrementalclassifierandrepresentationlearning

36KA:IncrementalLearning[Rebuffietal.2017]

Page 37: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

iCaRL:Incrementalclassifierandrepresentationlearning[Rebuffietal.’17]

37KA:IncrementalLearning[Rebuffietal.2017]

Classificationloss

Distillationloss:Comparingoldvsnew

Page 38: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

iCaRL:Incrementalclassifierandrepresentationlearning

38KA:IncrementalLearning[Rebuffietal.2017]

Cleveruseofavailablememory

Potentialissueswithstoringdata,e.g.,privacy

Limitedbythememorycapacity(themorethebetter)

Page 39: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Whatelsewillweseetoday?•  Flavourofdifferentapproaches:

1.  Regularizationbased:LwF,EBLL,EWC,SI,MAS,IMM,…2.   Rehearsal/Replay:iCaRL,DGR,GEM,…3.  Architecturebased:PackNet,progressivenets,HAT,…

•  Morethanclassification?

•  Takeaways39KA:IncrementalLearning

Page 40: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

●  Themodel“Scholar”iscomposedof:●  agenerator+asolver(classifier)

●  Thegeneratorandthesolverareupdatedineveryincrementalstep

DeepGenerativeReplay

[Shinetal.2017]Figurefromthepaper

Scholar

40KA:IncrementalLearning

Page 41: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Trainingprocedure:

●  Attaskt,wetrainanewScholar●  withdatafromthetaskt,and

●  datageneratedbythepreviouslytrainedScholarattaskt-1

DeepGenerativeReplay

[Shinetal.2017]Figurefromthepaper Slidecourtesy:A.Massenet41KA:IncrementalLearning

Page 42: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

DeepGenerativeReplay

Slidecourtesy:A.Massenet[Shinetal.2017]Figurefromthepaper

Trainingprocedure(Generator):

●  Withdatafromtaskt,and

●  datageneratedbythepreviouslytrainedScholarfortaskt-1

42KA:IncrementalLearning

Page 43: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Target = Label

DeepGenerativeReplay

Slidecourtesy:A.Massenet[Shinetal.2017]Figurefromthepaper

Trainingprocedure(Solver):

●  Withdatafromtaskt,and

●  DatafromgeneratorandsolverofthepreviouslytrainedScholarfortaskt-1

43KA:IncrementalLearning

Page 44: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

DeepGenerativeReplay

Slidecourtesy:A.Massenet

Avoidsmemoryissues

Accumulationoferrors

Nocontrolovertheclassofthegeneratedsamples

[Shinetal.2017]Figurefromthepaper 44KA:IncrementalLearning

Page 45: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Whatelsewillweseetoday?•  Flavourofdifferentapproaches:

1.  Regularizationbased:LwF,EBLL,EWC,SI,MAS,IMM,…2.  Rehearsal/Replay:iCaRL,DGR,GEM,…3.   Architecturebased:PackNet,progressivenets,HAT,…

•  Morethanclassification?

•  Takeaways45KA:IncrementalLearning

Page 46: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Architecture-based

PackNet[Mallya&Lazebnik’17]Figurefromthepaper 46KA:IncrementalLearning

Page 47: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Fixedmemoryconsumption

Needsthetotalnumberoftasks

Avoidsforgetting

Architecture-based

PackNet[Mallya&Lazebnik’17] 47KA:IncrementalLearning

Page 48: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

AComparativeAnalysis•  TinyImagenet:small,balanced,class-incremental•  iNaturalist:large-scale,unbalanced,task-incremental

•  Fairwayofsettinghyperparameters(stability-plasticitytradeoff)

48KA:IncrementalLearning[Langeetal.,2020]

Page 49: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

ComparativeEvaluation(TinyImagenet)

Imagecredit:[Langeetal.,2020] 49KA:IncrementalLearning

Regularization&Architecturebased

Page 50: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

ComparativeEvaluation(TinyImagenet)

50KA:IncrementalLearning

Rehearsal/ReplaybasedImagecredit:[Langeetal.,2020]

Page 51: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

GeneralTrends

•  Rehearsal/replaybasedmethodsonlypayoffwhenstoringsignificantamountofexemplars

•  PackNetresultsinno-forgettingandproducestopresults

•  MASmorerobustthanEWC

Slidecredit:T.Tuytelaars 51KA:IncrementalLearning

Page 52: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

WhatkindofmodelshouldIuse?

•  Largermodelsgivemorecapacity(but:overfitting)•  Wideisbetterthandeep

•  Regularizationmayinterferewithincrementallearning•  Dropoutusuallybetterthanweightdecay

52KA:IncrementalLearningSlidecredit:T.Tuytelaars

Page 53: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Whatelsewillweseetoday?•  Flavourofdifferentapproaches:

1.  Regularizationbased:LwF,EBLL,EWC,SI,MAS,IMM,…2.  Rehearsal/Replay:iCaRL,DGR,GEM,…3.  Architecturebased:PackNet,progressivenets,HAT,…

•  Morethanclassification?

•  Takeaways53KA:IncrementalLearning

Page 54: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

MitigateCatastrophicForgetting•  Learningwithoutforgetting[LiandHoiem2016]

•  iCaRL[Rebuffietal.2017]

old model’s TasksdefinedonanewdatasetFocusonimageclassification(rareco-occurrenceofoldandnew)

DecoupleclassifierandfeaturelearningRelyonasubsetoftheolddata

54KA:IncrementalLearning

Page 55: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

MitigateCatastrophicForgetting•  Elasticweightconsolidation[Kirkpatricketal.,2017]–  Selectivelyslowingdownlearningonweights

•  Otherattempts,e.g.,[Aljundietal.,2018,Jungetal.,2016,MallyaandLazebnik,2017,Risinetal.,2014,Rusuetal.,2016]

LimitedtospecificsettingsFocusonimageclassification(rareco-occurrenceofoldandnew)

55KA:IncrementalLearning

Page 56: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

MitigateCatastrophicForgetting•  Elasticweightconsolidation[Kirkpatricketal.,2017]–  Selectivelyslowingdownlearningonweights

•  Otherattempts,e.g.,[Jungetal.,2016,MallyaandLazebnik,2017,Risinetal.,2014,Rusuetal.,2016]

LimitedtospecificsettingsFocusonimageclassification(rareco-occurrenceofoldandnew)

Lackofmethodsforincrementallearningofobjectdetectors

56KA:IncrementalLearning

Page 57: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Anapproach

•  IncrementalLearningofObjectDetectorswithoutCatastrophicForgetting[Shmelkovetal.,2017]

57KA:IncrementalLearning

Page 58: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

OurApproach

•  Buildsonknowledgedistillation[Hintonetal.’14]

•  Knowledgedistillationoriginallyfor:–  learningasimplernetequivalenttoacomplexone,and–  producesadifferentlystructurednetwork

•  Wedistillthe“old”netwhenlearningnewclasses

[Shmelkov,Schmid,Alahari,ICCV2017] 58KA:IncrementalLearning

Page 59: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

OurApproach•  Copyofnetwork(A)toevaluateproposals&memorizeoutputs

•  Newcomponent(B)tolearnthenewclasses•  Learnwithacombinationoflosses

[FastRCNN:Girshick2015]59KA:IncrementalLearning

Page 60: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

•  MinimizeOurApproach

Lossoverclasses Localizationloss

Comparingthetwonetcomponents

60KA:IncrementalLearning

Page 61: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

•  MinimizeOurApproach

Lossoverclasses Localizationloss

61KA:IncrementalLearning

Logitresponses Localization

Page 62: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Experimentalsetup•  Datasets:PASCALVOC2007andMSCOCO

•  A(n-m):initialtrainingonclassesfromntom(old)•  B(n-m):incrementaltrainingonclassesntom(new)addedall

atonce•  B(n)(m):incrementaltrainingforclassn,thenm,addedone

afteranother

•  Unbiaseddistillation:distillationproposalsareselectedcompletelyrandomly

62KA:IncrementalLearning

Page 63: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Additionof10classes

•  VOC2007validationset

63.212.8

63.164.5

63KA:IncrementalLearning

Page 64: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Additionof10classes

•  VOC2007validationset

31.663.2

64KA:IncrementalLearning

Page 65: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Additionof5classes

•  VOC2007validationset

66.068.4

65KA:IncrementalLearning

Page 66: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Additionof5classes

•  VOC2007validationset

66.045.8

66KA:IncrementalLearning

Page 67: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Additionof40classes

•  COCOminival(first5000validationimages)

67KA:IncrementalLearning

Page 68: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

IntermediateSummary

•  Nocatastrophicforgettingwith– distillationloss–  inparticular,biaseddistillation

•  AtECCV2018:Balancedfine-tuning

68KA:IncrementalLearning

Page 69: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Summary•  Flavourofdifferentapproaches:

1.  Regularizationbased:LwF,EBLL,EWC,SI,MAS,IMM,…2.  Rehearsal/Replay:iCaRL,DGR,GEM,…3.  Architecturebased:PackNet,progressivenets,HAT,…

•  Morethanclassification?

•  Takeaways69KA:IncrementalLearning

Page 70: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Lookingtothefuture

•  Desiderata–  Constantmemory–  Taskagnostic:Somerecentadvances[Raoetal.,NeurIPS’19]–  Forgettinggracefully–  Datasets

70KA:IncrementalLearning

“Idon’tlikedatasets,it’smoreaproblemthanasolution”–heardatICCV2019

Page 71: CVPR 2020 Tutorial - thoth.inrialpes.frthoth.inrialpes.fr/~alahari/teaching/inclearn.pdf · Deep Generative Replay Slide courtesy: A. Massenet [Shin et al. 2017] Figure from the paper

Summary•  Flavourofdifferentapproaches:

1.  Regularizationbased:LwF,EBLL,EWC,SI,MAS,IMM,…2.  Rehearsal/Replay:iCaRL,DGR,GEM,…3.  Architecturebased:PackNet,progressivenets,HAT,…

•  Morethanclassification?

•  Takeaways71KA:IncrementalLearning

Seealso:Workshopon14thJuneContinualLearninginComputerVision