Page 1
© 2018 Kiwisoft, all rights reserved
Deep Computer Vision for Manufacturing
Aurélien Géron
May 23rd, 2018
Strata Data Conference, London
Page 2
Page 3
Agenda
● Machine Vision
● Introduction to Convolutional Neural Networks
● A Tour of Deep Computer Vision
● Challenges in Manufacturing
Page 4
Machine Vision
Page 5
Machine Vision > Vision System
Page 6
Machine Vision > 3D-Laser
Page 7
Machine Vision > Applications
● Sorting
● Inspection
● Analytics
● Robot Guidance
● Security
● More!
Page 13
Machine Vision > Classical Algorithms
Page 14
Machine Vision > Edge Detection
Page 15
Machine Vision > Filters
Page 16
Machine Vision > Filters
-1 -2 -1
 0  0  0
 1  2  1
Convolution (cross-correlation)
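As a minimal sketch of the operation named above, the following pure-Python code slides a 3×3 kernel over a small image and computes a weighted sum at each position (cross-correlation, that is, convolution without flipping the kernel). It assumes the nine kernel values on the slide read row by row, which gives a Sobel-style horizontal-edge filter. The 5×5 test image and the names `cross_correlate`, `KERNEL`, `IMAGE` and `RESULT` are illustrative and not taken from the talk.

```python
# Sketch: "valid" 2D cross-correlation (no padding, stride 1), standard library only.

# Assumption: the slide's nine values, read row by row.
KERNEL = [
    [-1, -2, -1],
    [0, 0, 0],
    [1, 2, 1],
]


def cross_correlate(image, kernel):
    """Slide the kernel over the image; each output pixel is a weighted sum."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    output = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            total = 0
            for di in range(kh):
                for dj in range(kw):
                    total += image[i + di][j + dj] * kernel[di][dj]
            row.append(total)
        output.append(row)
    return output


# Hypothetical test image: dark top rows, bright bottom rows (one horizontal edge).
IMAGE = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [9, 9, 9, 9, 9],
    [9, 9, 9, 9, 9],
    [9, 9, 9, 9, 9],
]

RESULT = cross_correlate(IMAGE, KERNEL)
for row in RESULT:
    print(row)
```

The output is large where the window straddles the dark-to-bright edge and zero where the window is uniformly bright, which is the behaviour expected of an edge filter.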
Page 32
Machine Vision > Filters > Zero Padding
-1 -2 -1
 0  0  0
 1  2  1
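A rough sketch of zero padding, assuming a one-pixel border of zeros is added on every side so that a 3×3 filter produces an output the same size as the input. The helper name `zero_pad` and the 3×3 sample image are hypothetical.

```python
# Sketch: surround an image with a border of zeros, standard library only.


def zero_pad(image, pad=1):
    """Return a copy of the image with `pad` rows/columns of zeros on each side."""
    width = len(image[0]) + 2 * pad
    padded = [[0] * width for _ in range(pad)]          # top border
    for row in image:
        padded.append([0] * pad + list(row) + [0] * pad)  # left/right border
    padded.extend([0] * width for _ in range(pad))      # bottom border
    return padded


# Hypothetical 3x3 image.
IMAGE = [
    [1, 2, 3],
    [4, 5, 6],
    [7, 8, 9],
]

PADDED = zero_pad(IMAGE, pad=1)

# With a 3x3 kernel and stride 1, the "valid" output of the padded image
# has the same height as the original image.
OUTPUT_HEIGHT = len(PADDED) - 3 + 1
```

Without padding, a 3×3 filter would shrink this 3×3 image to a single output value; with the border, the output stays 3×3.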
Page 35
Machine Vision > Filters
Page 36
Machine Vision > Blob Discovery
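One possible way to find blobs in a binary image is connected-component labeling. The sketch below uses a simple breadth-first flood fill with 4-connectivity rather than the two-pass algorithm, purely to keep the example short; it is not necessarily the method shown in the talk. The function name `label_blobs` and the sample image are hypothetical.

```python
# Sketch: label connected foreground regions ("blobs") with a flood fill.
from collections import deque


def label_blobs(binary):
    """Return (labels, count); labels[i][j] is 0 for background, else a blob id."""
    height, width = len(binary), len(binary[0])
    labels = [[0] * width for _ in range(height)]
    count = 0
    for i in range(height):
        for j in range(width):
            if binary[i][j] and labels[i][j] == 0:
                count += 1                      # start a new blob
                labels[i][j] = count
                queue = deque([(i, j)])
                while queue:
                    y, x = queue.popleft()
                    # Visit the 4 direct neighbours (up, down, left, right).
                    for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                        inside = 0 <= ny < height and 0 <= nx < width
                        if inside and binary[ny][nx] and labels[ny][nx] == 0:
                            labels[ny][nx] = count
                            queue.append((ny, nx))
    return labels, count


# Hypothetical binary image with three separate blobs.
BINARY = [
    [1, 1, 0, 0, 0],
    [1, 0, 0, 1, 1],
    [0, 0, 0, 1, 0],
    [0, 1, 0, 0, 0],
]

LABELS, BLOB_COUNT = label_blobs(BINARY)
```

Once each blob has its own label, per-blob measurements such as area or bounding box can be computed by grouping pixels by label.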
Page 40
Introduction to ConvNets
Page 41
CNNs > Visual Cortex
Visual Cortex (V2)
Page 42
CNNs > Visual Cortex > V1, V2...
Page 43
CNNs > Visual Cortex > Receptive Fields
Page 44
CNNs > Convolutional Layers
Page 45
CNNs > Receptive Field Size & Padding
Page 46
CNNs > Stride
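As a small worked example of how receptive field size, padding and stride interact, the sketch below computes the output size of a convolutional layer along one dimension using the usual formula floor((n + 2p - k) / s) + 1. The function name `conv_output_size` and the example sizes are illustrative assumptions.

```python
# Sketch: output size of a convolution along one dimension.


def conv_output_size(input_size, kernel_size, stride=1, padding=0):
    """floor((n + 2p - k) / s) + 1, for input n, kernel k, padding p, stride s."""
    return (input_size + 2 * padding - kernel_size) // stride + 1


# Hypothetical examples with a 3-wide kernel.
NO_PADDING = conv_output_size(5, 3)                     # output shrinks
SAME_PADDING = conv_output_size(5, 3, padding=1)        # output keeps the input size
STRIDE_TWO = conv_output_size(7, 3, stride=2)           # output roughly halves
STRIDE_TWO_PADDED = conv_output_size(7, 3, stride=2, padding=1)
```

A larger stride skips positions, so the layer's output gets smaller and cheaper to compute, at the cost of spatial resolution.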
Page 47
CNNs > Color Channels
Page 49
CNNs > Satellite Images
Page 50
CNNs > Medical Images
Diffusion MRI
Page 51
CNNs > Convolutional Layers
Page 52
CNNs > Pooling Layers
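A minimal sketch of one common kind of pooling layer, assuming 2×2 max pooling with stride 2 (the slide does not say which variant is shown). The function name `max_pool` and the 4×4 feature map are hypothetical.

```python
# Sketch: max pooling over a 2D feature map, standard library only.


def max_pool(feature_map, size=2, stride=2):
    """Keep the maximum of each size x size window, moving by `stride`."""
    pooled = []
    for i in range(0, len(feature_map) - size + 1, stride):
        row = []
        for j in range(0, len(feature_map[0]) - size + 1, stride):
            window = [
                feature_map[i + di][j + dj]
                for di in range(size)
                for dj in range(size)
            ]
            row.append(max(window))
        pooled.append(row)
    return pooled


# Hypothetical 4x4 feature map.
FEATURE_MAP = [
    [1, 3, 2, 0],
    [4, 2, 1, 5],
    [0, 1, 7, 2],
    [3, 2, 1, 4],
]

POOLED = max_pool(FEATURE_MAP)
```

Pooling has no weights to learn; it only shrinks the feature map, here from 4×4 to 2×2, while keeping the strongest response in each window.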
Page 53
CNNs > ConvNet Architecture
Tree
Temple
Page 57
A Tour of Deep Computer Vision
Page 58
Deep Computer Vision > Classification
Page 59
Deep Computer Vision > Classification
Sheep
Page 60
Deep Computer Vision > ResNet Architecture
Page 61
Deep Computer Vision > Classification with Localization
Sheep
Page 62
Deep Computer Vision > OverFeat
Fully Connected, 4 units
Page 63
Deep Computer Vision > Object Detection
Sheep 1
Sheep 2 Sheep 3
Page 64
Deep Computer Vision > YOLO
Page 83
Deep Computer Vision > YOLO
Sheep 1
Sheep 2 Sheep 3
Page 84
Deep Computer Vision > YOLO
Page 85
Deep Computer Vision > Semantic Segmentation
Sheep
Grass
Road
Page 86
Deep Computer Vision > Fully Convolutional Network
Source: “Fully Convolutional Networks for Semantic Segmentation”, by Jonathan Long, Evan Shelhamer, Trevor Darrell, https://arxiv.org/abs/1411.4038 (Figure 1).
Page 87
Deep Computer Vision > Instance Segmentation
Grass
Road
Sheep 1
Sheep 2
Sheep 3
Page 88
Deep Computer Vision > Mask R-CNN
Source: “Mask R-CNN”, by Kaiming He, Georgia Gkioxari, Piotr Dollár, Ross Girshick, https://arxiv.org/abs/1703.06870 (Figure 1).
Page 89
Deep Computer Vision > Mask R-CNN
Page 90
Deep Computer Vision > Anomaly Detection
Page 93
Deep Computer Vision > Anomaly Detection
One Class SVM
Page 96
Challenges
Page 97
Challenges > Pushback
Page 98
Challenges > Pushback
Page 99
Challenges > Hardware
Page 100
Challenges > Framing the Problem
Page 105
Challenges > Training Set
Page 107
Challenges > Model Rot
[Plot: precision over time]
Page 108
Challenges > High Volume, Low Latency
Page 109
Challenges > Model Interpretation
Page 113
Questions?
Page 114
Media Credits
Page 115
Credits
● Slide 1: public domain
● Slide 2: https://youtu.be/qrzQ_AB1DZk
● Slide 5: http://www.hte.net/solutions/machine-vision/smart-vision-cameras (reproduced with the kind authorization of HTE)
● Slide 6: https://en.wikipedia.org/wiki/Glossary_of_machine_vision#/media/File:3D-Laserscanner_on_tripod.jpg (CC BY-SA 3.0, by user Dr. Schorsch)
● Slide 6: https://en.wikipedia.org/wiki/Glossary_of_machine_vision#/media/File:Engine_movingparts.jpg (CC BY-SA 3.0, by user Wapcaplet)
● Slides 7, 8, 9: https://upload.wikimedia.org/wikipedia/commons/9/9e/Stemilt%27s_SweeTango_croo.jpg (CC-BY-2.0, by Stemilt Growers)
● Slide 10: https://en.wikipedia.org/wiki/Industrial_robot#/media/File:Automation_of_foundry_with_robot.jpg (public domain)
● Slides 11, 12: https://pixabay.com/en/metal-press-industrial-equipment-853694/ (CC0 Creative Commons)
● Slide 13: https://en.wikipedia.org/wiki/File:Valve_original_%281%29.PNG (CC BY-SA 3.0, by Simpsons contributor)
● Slide 14: https://en.wikipedia.org/wiki/Canny_edge_detector#/media/File:Valve_monochrome_canny_(6).PNG (CC BY-SA 3.0, by Simpsons contributor)
● Slides 15-34 and 36-39: https://en.wikipedia.org/wiki/Connected-component_labeling#/media/File:Two-pass_connected_component_labeling.svg (CC BY-SA 4.0, assumed by user Wereon)
● Slide 41: https://commons.wikimedia.org/wiki/Brodmann_areas (CC BY-SA 2.1 jp, by Database Center for Life Science (DBCLS))
● Slide 42: https://commons.wikimedia.org/wiki/File:Brain_circuits_for_visually_guided_saccades.jpg (CC-BY-4.0, by user Robert H. Wurtz)
● Slide 49: https://www.flickr.com/photos/gsfc/4591847715 (public domain)
● Slide 50: https://en.wikipedia.org/wiki/Medical_image_computing#/media/File:DiffusionMRI_glyphs.png (CC BY-SA 3.0, by user Tucania)
● Slide 84: https://youtu.be/MPU2HistivI
● Slide 89: https://youtu.be/OOT3UIXZztE
● Slide 90: https://pixabay.com/en/wolf-in-sheep-s-clothing-wolf-sheep-2577813/ (CC0 Creative Commons)
● Slide 97: https://en.wikipedia.org/wiki/Luddite#/media/File:FrameBreaking-1812.jpg (public domain)
● Slide 98: https://pixabay.com/en/atm-withdraw-cash-map-ec-card-1524870/ (CC0 Creative Commons)
● Slide 99: https://commons.wikimedia.org/wiki/File:Inspection_system.jpg (CC BY-SA 4.0, by user Yingz)
● Slide 108: https://commons.wikimedia.org/wiki/File:Millville,_New_Jersey_-_Glass_bottles._(Numerous_bottles_in_foreground_and_bottles_on_a_conveyor_belt.)_-_NARA_-_518673.jpg (public domain)