visual processing sparks a new class of processors · videantis con!dential | copyright videantis...

49
Copyright videantis GmbH videantis condential | Visual processing sparks a new class of processors [email protected]

Upload: others

Post on 08-Jul-2020

7 views

Category:

Documents


1 download

TRANSCRIPT

Copyright videantis GmbH videantis confidential |

Visual processing sparks a new class of processors

[email protected]

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

People like to capture visual information 19 July 2015 10

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

People like to consume visual information 19 July 2015 11

• Global Internet Video Traffic   64% of consumer traffic in 2014   62% in 2013   57% in 2012

• Global Mobile Video Traffic   55% of mobile traffic in 2014   52% in 2013   50% in 2012

Mary Meeker’s Internet trends: http://www.kpcb.com/internet-trends

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Display 19 July 2015 12

1995 2005 2015 6x 8x

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Capture 19 July 2015 13

1995 2005 2015 6x 8x

Apple Quicktake VGA

Nokia N90 2MPixel

Samsung S6 16Mpixel

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Processing 19 July 2015 14

Intel Pentium 0.35um

Intel Pentium D 65nm

Intel Core i5 14nm

~25x ~25x

More and more processing available per pixel

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Altering pictures 19 July 2015 15

bokeh tilt-shift HDR

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Computational photography – 360 cameras 19 July 2015 16

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Computational imaging – camera arrays 20 July 2015 17

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Computational imaging – lensless camera 20 July 2015 18

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

3D cameras 20 July 2015 19

• Time of Flight • Structured light • Stereo cameras • Structure from Motion

Copyright videantis GmbH videantis confidential |

cameras being used more often for machine vision

19 July 2015 20

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Driverless vehicles

Life saver, time saver, cost saver

Today >2020

19 July 2015 21

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Driverless vehicles

WHO: 1.2M deaths per year world-wide

Today >2020

Life saver, time saver, cost saver

19 July 2015 22

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Driverless vehicles

USA: ~5 years of your life in a car

Today >2020

Life saver, time saver, cost saver

19 July 2015 23

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Driverless vehicles

USA insurance: ~$0.10/mile

Today >2020

Life saver, time saver, cost saver

19 July 2015 24

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Driverless vehicles

90% autonomous: doubles road capacity

Today >2020

Life saver, time saver, cost saver

19 July 2015 25

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Only autonomous vehicle today… 19 July 2015 26

Copyright videantis GmbH videantis confidential |

Hype cycle

"Hype-Cycle-General" by NeedCokeNow

20 July 2015 27

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Big leap from current cars to driverless

Legislation, R&D, maps, V2V/V2I, … Two key components: image sensors and image processing architectures

… and change will be gradual

19 July 2015 28

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Automotive camera markets 19 July 2015 29

•  1 wide angle camera •  Viewing: rear view, dewarp •  Analytics: object detection, distance, etc

Rear view camera

•  2-4 cameras •  Viewing: surround / top view, dewarp, stitch •  Analytics: automatic parking assist, object detect

Surround and mirror cameras

•  1 or 2 cameras •  Viewing: not needed •  Analytics: monitoring driver status

Driver monitoring

•  1-3 cameras •  Analytics: lane detection, pedestrian detection,

vehicle detection, traffic sign recognition, headlight control

Front camera

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Capture: 3D camera (e.g. Google Tango)

measure environment

gesture interfaces

augmented reality

20 July 2015 30

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Microsoft Kinect 20 July 2015 31

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Searching and indexing 19 July 2015 32

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Apple’s fingerprint recognition 20 July 2015 33

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Capture – always on

Dashcam Google Glass

20 July 2015 34

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Capture – rugged

Camera pil GoPro

20 July 2015 35

Copyright videantis GmbH videantis confidential |

Capture: fly and follow 20 July 2015 36

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Failures 20 July 2015 37

• Android’s face unlock • Samsung Galaxy’s gesture interfaces • Amazon Fire’s eye tracking • Amazon’s camera-based search app • Google Goggles • Google Glass

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

http://image-sensors-world.blogspot.nl/2015/01/yole-updates-cmos-sensor-market-reports.html

Image sensors outgrow semi industry by 2x (8-10% CAGR)

20 July 2015 38

Copyright videantis GmbH videantis confidential |

What does visual processing look like?

19 July 2015 39

Copyright videantis GmbH videantis confidential |

Feature Detection

1.  Sobel in x 2.  Sobel in y 3. Derivative calc 4. Box Filter 5. Harris calc

6. Max location 7. Threshold 8. Dilate 9.  Select

3 3 2

3 3 2

2 2

M = w(x, y)Ix2 Ix Iy

Ix Iy Iy2

!

"

##

$

%

&&

x,y∑

(λ0,λ1) = eigenvalues(M )

λ1

λ0

λ0 ≈ λ1big

λ1 >> λ0

λ0 >> λ1λ0 ≈ λ1small

“edge”

“corner”

“edge”

“flat”

Find edges in horizontal and

vertical directions

Find edges in horizontal and

vertical directions

1

Two strong gradient

directions? ! found corner

2

Select corners R > threshold K (local maxima)

3

20 July 2015 40

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

HOG/SVM Pedestrian Detection 19 July 2015 41

Grayscale image

Multiscale image pyramid

Gamma normalization

Gradient calculation (angle and magnitude)

Histogram per block

SVM per window position

Non-max suppression

128

64

16 16

8 8 8

8

4 histograms of 9 cells

Gradient direction & magnitude

Dalal & Triggs, “Histograms of Oriented Gradients for Human Detection”, INRIA (France), 2005

Feature vector: •  105 (16x16 positions) •  x4 (8x8 blocks) •  X9 (histogram bins) ! 3780 elements Multiply SVM vector ! pedestrian yes/no

1

2

3

4

5

6

7

Copyright videantis GmbH videantis confidential |

•  Binocular

•  Stereopsis •  Convergence

•  Monocular

•  Motion parallax •  Depth from motion •  Kinetic depth effect •  Aerial & curvilinear

perspective, size, accommodation, occlusion, texture gradient, lighting and shading, defocus blur, elevation

Perceiving 3D

Structure from

motion

Humans primarily use monocular vision

to sense depth

20 July 2015 42

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

19 July 2015 43

Structure from motion algorithm

+ camera origin and direction

+ calibrated camera

Structure from Motion

Feature detect and track in 2D image domain

  Calculates camera pose and 3D point cloud of the scene using a single standard 2D camera

  Target applications: automated parking, obstacle detection, object segmentation, etc

3D point cloud

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Deep learning / convolutional neural nets 20 July 2015 44

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Machines are beating humans at identifying objects

20 July 2015 45

http://www.sbs.com.au/news/article/2015/06/22/comment-dreams-googles-ai-are-equal-parts-amazing-and-disturbing

Copyright videantis GmbH videantis confidential |

•  50% of the brain used for vision •  Body uses 100W •  Brain consumes 20W •  ! about 10W for vision analysis

Half our brain is used for visual processing

•  Challenge: beat the human •  Build machines that are

faster, safer, cheaper, last longer, more accurate, etc.

20 July 2015 46

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Half of mobile SOCs for visual processing 19 July 2015 47

Samsung Exynos 5 die shot

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Half of mobile SOCs for visual processing 19 July 2015 48

Samsung Exynos 5 die shot

50M+ lines of

code (Android

is 30M)

<1M LOC

10K

~100K

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

mobile

Computer vision

Video coding

home automotive

Target markets 19 July 2015 49

Processor IP for visual processing

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

What does videantis do? 19 July 2015 50

Low-power, high-performance video/vision processor IP

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

10 years of videantis – company highlights

  Customers in Europe, US, Asia   Close ties to automotive algorithm ISVs,

semis, Tier 1s, and OEMs Fourth generation scalable unified video/vision processing IP Lowest power, highest performance video/image processing platform

2004 2006 2008 2010 2012 2014

10 years In business

1st Video IP Licensee

1st Vision

Licensee

1st Automotive

Volume

Deloitte Fast 50

3rd Gen Chip

1st Gen Chip

2005 2007 2009 2011 2013

1st Automotive

Licensee

1997 ..

R&D at Leibniz University

2nd Gen Chip

20 July 2015 51

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

• Heterogenous v-MP4280HDX arch:   4th generation (v-MP4000HDX series)   2 v-SP cores •  Enables bitstream processing high

quality video encode and decode   8 v-MP cores •  High-performance VLIW/SIMD

computer vision and video codec signal processing cores

• Software programmable video/vision:   H.264 encode or decode (and 15

other video codecs) •  1080p60, high-quality, low delay

  High-performance vision processing •  >384GOPS •  >64GMAC/s

videantis unified video/vision processor 19 July 2015 52

v-SP and v-MP for H.264 video codec

8x v-MP processor for vision processing

Highly-optimized, unified, programmable architecture for vision and video processing

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Architecture for computer vision processing CPU GPU Imaging DSP videantis

ILP Superscalar

power hungry undisclosed Needs CPU

>4-issue NOPs and code size increase

2-issue VLIW right trade off

SIMD 128-bit

requires second pipeline, RF, etc.

wide array control flow not efficient

>256-bit SIMD control flow not efficient

64/128-bit right trade off

Multicore

1-4 cores cache coherency

introduces overhead

Many cores with many restrictions

1 core not enough

performance

1-8+ cores scales: low to high end apps

Processor frequency

2GHz+ long pipelines

cause overhead

~1GHz medium/long

pipelines

500MHz-1GHz medium pipeline

500MHz-1GHz medium pipeline

Cache & DMA

multi-level caches

multi-level caches

cache single DMA

No I/D caches DMA each core

19 July 2015 53

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Videantis: more performance than 192-core GPU 19 July 2015 54

NVIDIA K1 Algorithm: HOG/SVM Pedestrian detect

Videantis v-MP4280HDX Algorithm: HOG/SVM Pedestrian detect

Performance: VGA at 3.4fps

Performance: VGA at >40fps

Power: 4W+ 1W/frame

Power at 40fps: 40mW ~1mW/frame

• Same algorithm

• >10x performance

• <1000x lower power

http://youtu.be/nCoMaLD56wQ http://youtu.be/grwzV2Umx-U

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Half of mobile SOCs for visual processing 19 July 2015 55

videantis 10-core video DSP •  Computer vision •  4K video coding •  Computational photography Samsung Exynos 5 die shot

Copyright videantis GmbH videantis confidential | Copyright videantis GmbH videantis confidential |

Conclusion 19 July 2015 56

• Visual computing field is growing quickly   Computer vision drives many new applications

• Visual computing processor is a fragmented field today   CPU, GPUs  Hard-wired  DSPs, vision processors

• Time is right for new class of processors that are specifically designed for visual processing

Thank you !