Adaptive Neural Trees · 2019-06-07
TRANSCRIPT
Adaptive Neural Trees
Ryutaro Tanno, Kai Arulkumaran, Daniel C. Alexander, Antonio Criminisi, Aditya Nori
Poster #82
Two Paradigms of Machine Learning
Deep Neural Networks 『hierarchical representation of data』 · Decision Trees 『hierarchical clustering of data』
[Figure: ImageNet classifiers with CNNs (Zeiler and Fergus, ECCV 2014): low-level features (oriented edges & colours) → mid-level features (textures & patterns) → high-level features (object parts) → trainable classifier]
[Figure: super-resolution of dMR brain images with a DT (Alexander et al., NeuroImage 2017); tissue labels: water, grey matter, white matter]
Two Paradigms of Machine Learning

Deep Neural Networks 『hierarchical representation of data』
+ learn features of data
+ scalable learning with stochastic optimisation
- architectures are hand-designed
- heavy-weight inference, engaging every parameter of the model for each input

Decision Trees 『hierarchical clustering of data』
+ architectures are learned from data
+ lightweight inference, activating only a fraction of the model per input
- operate on hand-designed features
- limited expressivity with simple splitting functions
Joining the Paradigms: Adaptive Neural Trees
+ learn features of data
+ scalable learning with stochastic optimisation
+ architectures are learned from data
+ lightweight inference, activating only a fraction of the model per input
ANTs unify the two paradigms, 『hierarchical representation of data』 and 『hierarchical clustering of data』, and generalise previous work.
What are ANTs?
• ANTs consist of two key designs:
(1) DTs that use NNs in every path transformation and routing decision.
(2) DT-like architecture growth using SGD: a target node is either (a) split OR (b) deepened.
[Figure: ANT computation graph on input x; growth operations at a target node: (a) split OR (b) deepen]
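A minimal sketch of design (1), under assumed toy details not in the talk: every module is a single linear layer (transformers with ReLU, sigmoid routers, softmax solvers), weights are random rather than trained, and soft multi-path inference mixes the two children at each routing node. The `Node` class and its sizes are illustrative, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def relu(z):
    return np.maximum(z, 0.0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

class Node:
    """One toy ANT module: a transformer (small NN computing features),
    plus either a router (internal node) or a solver (leaf classifier)."""

    def __init__(self, d_in, d_out, n_classes=None):
        self.W_t = rng.normal(scale=0.1, size=(d_in, d_out))  # transformer
        self.W_r = rng.normal(scale=0.1, size=d_out)          # router
        self.W_s = (rng.normal(scale=0.1, size=(d_out, n_classes))
                    if n_classes is not None else None)       # solver
        self.left = None
        self.right = None

    def predict(self, x):
        h = relu(x @ self.W_t)                   # transformer: learned features
        if self.left is None:                    # leaf: solver emits class probs
            return softmax(h @ self.W_s)
        p_left = sigmoid(h @ self.W_r)[:, None]  # router: soft binary decision
        # multi-path (soft) inference: router-weighted mixture of children
        return (p_left * self.left.predict(h)
                + (1.0 - p_left) * self.right.predict(h))

# depth-1 ANT: one routing node, two leaf solvers (toy sizes, random weights)
root = Node(4, 8)
root.left = Node(8, 8, n_classes=3)
root.right = Node(8, 8, n_classes=3)

x = rng.normal(size=(5, 4))   # batch of 5 four-dimensional inputs
probs = root.predict(x)       # (5, 3): a distribution over 3 classes per input
```

Because each leaf outputs a softmax and the routers form convex weights over leaves, every row of `probs` is a valid class distribution, which is what makes the whole tree trainable end-to-end with SGD.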
Conditional Computation
[Figure: multi-path vs single-path inference for ANTs 1–3: errors on MNIST (%), CIFAR10 (%), SARCOS (mse) and number of parameters; model size drops under single-path inference]
• Single-path inference enables efficient inference without compromising accuracy.
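The efficiency claim can be made concrete with a toy sketch of single-path inference: harden each router to its more probable branch, so only the parameters along one root-to-leaf path are ever evaluated. All names and sizes here are illustrative assumptions, not the authors' code; the parameter counter just demonstrates why model size drops.

```python
import numpy as np

rng = np.random.default_rng(1)

def relu(z):
    return np.maximum(z, 0.0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def make_node(d_in, d_out, n_classes=None):
    """Toy ANT node: transformer weights plus router (internal) or solver (leaf)."""
    return {
        "W_t": rng.normal(scale=0.1, size=(d_in, d_out)),      # transformer
        "W_r": rng.normal(scale=0.1, size=d_out),              # router
        "W_s": (rng.normal(scale=0.1, size=(d_out, n_classes))
                if n_classes is not None else None),           # solver
        "left": None, "right": None,
    }

def single_path_predict(node, x):
    """Greedy (hard) routing: at each internal node follow the more probable
    branch, so only parameters on one root-to-leaf path are evaluated."""
    params_used = 0
    while node["left"] is not None:
        x = relu(x @ node["W_t"])
        params_used += node["W_t"].size + node["W_r"].size
        go_left = sigmoid(x @ node["W_r"]) > 0.5
        node = node["left"] if go_left else node["right"]
    x = relu(x @ node["W_t"])
    params_used += node["W_t"].size + node["W_s"].size
    return x @ node["W_s"], params_used

# depth-1 toy tree: root router plus two leaf solvers
root = make_node(4, 8)
root["left"] = make_node(8, 8, n_classes=3)
root["right"] = make_node(8, 8, n_classes=3)

total_params = (root["W_t"].size + root["W_r"].size
                + 2 * (root["left"]["W_t"].size + root["left"]["W_s"].size))

logits, used = single_path_predict(root, rng.normal(size=4))
# `used` covers the root plus a single leaf, i.e. a fraction of `total_params`
```

In a deeper tree the saving compounds: a balanced binary tree with L leaves visits only one of L paths, so the fraction of parameters touched shrinks as the tree grows.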
Adaptive Model Complexity
Models are trained on subsets of 50, 250, 500, 2.5k, 5k, 25k, and 45k examples.
• ANTs can tune the architecture to the availability of training data.
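The adaptive behaviour follows from the growth procedure: candidates (split or deepen a node) are kept only while they improve validation performance. The sketch below is a deliberately crude stand-in; in the real method each candidate is trained with SGD and scored on a held-out set, whereas here `val_loss` is a made-up analytic function whose overfitting term merely illustrates why growth halts earlier on small datasets.

```python
def val_loss(tree_size, n_train):
    # hypothetical stand-in for validation loss after refining a candidate:
    # larger trees fit better (first term) but overfit small datasets (second)
    return 1.0 / (1 + tree_size) + tree_size / n_train

def grow(n_train, max_steps=100):
    """Greedy growth: keep enlarging the tree (split or deepen a leaf) while
    the candidate improves validation loss; stop at the first non-improvement."""
    size = 1
    loss = val_loss(size, n_train)
    for _ in range(max_steps):
        trial = val_loss(size + 1, n_train)   # candidate: split OR deepen
        if trial >= loss:                     # no improvement: stop growing
            break
        size, loss = size + 1, trial
    return size

small = grow(n_train=50)       # little data: growth halts early
large = grow(n_train=45_000)   # lots of data: a much larger tree
```

The stopping rule, not any explicit size penalty, is what matches model complexity to the dataset: the same procedure run on 50 vs 45k examples returns trees of very different sizes.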
Please come & see me at poster #82 for details!
Unsupervised Hierarchical Clustering