assessing uncertainty in deep learning techniques that...

22
Assessing Uncertainty in Deep Learning Techniques that Identify Atmospheric Rivers in Climate Simulations Ankur Mahesh 1,2,3 , Travis O’Brien 1,4 ,Mayur Mudigonda 1,2 , Karthik Kashinath 1 , Sookyung Kim 5 , Samira Kahou 7 , Vincent Michalski 6 , Dean Williams 5 , Yunjie Liu 1 , Prabhat 1,2 , Burlen Loring 1 , William D. Collins 1,2 1 Lawrence Berkeley National Lab 2 University of California, Berkeley 3 Undergraduate, Department of Electrical Engineering and Computer Science 4 University of California, Davis 5 Lawrence Livermore National Laboratory 6 Montreal Institute for Learning Algorithms 7 Microsoft Research EARTH AND ENVIRONMENTAL SCIENCES LAWRENCE BERKELEY NATIONAL LABORATORY

Upload: others

Post on 07-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

Assessing Uncertainty in DeepLearning Techniques that

Identify Atmospheric Rivers in Climate Simulations

Ankur Mahesh1,2,3, Travis O’Brien1,4,Mayur Mudigonda1,2, Karthik

Kashinath1, Sookyung Kim5, Samira Kahou7, Vincent Michalski6, Dean

Williams5, Yunjie Liu1, Prabhat1,2, Burlen Loring1, William D. Collins1,2

1Lawrence Berkeley National Lab

2University of California, Berkeley

3Undergraduate, Department of Electrical Engineering and Computer Science

4University of California, Davis

5Lawrence Livermore National

Laboratory 6Montreal Institute for

Learning Algorithms 7Microsoft

Research

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 2: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

Why Deep Learning for Climate Science?

• Atmospheric Rivers (ARs): ”long, narrow, and transient corridors of strong horizontal water vapor transport…” (AMS Glossary)

• No community-accepted standard for identifying atmospheric rivers“You know one when you see one” – ARTMIP 2018 Participant

• Poses a unique machinelearning problem: uncertainty with ground truth ”label” data

An AR off the coast of California. Source: TheGuardian.

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 3: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

Why Deep Learning for Climate Science?

Logistic Regression

K-Nearest Neighbor

Support Vector Machine

Random Forest

ConvNet (a

type of deep

learning

model)

Train Test Train Test Train Test Train Test Train Test

Tropical Cyclone

96.8 95.85 98.1 97.85 97.0 95.85 99.2 99.4 99.3 99.1

Atmospheric Rivers

81.97 82.65 79.7 81.7 81.6 83.0 87.9 88.4 90.5 90.0

Weather Fronts

84.9 89.8 72.46 76.45 84.35 90.2 80.97 87.5 88.7 89.4

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 4: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

• Feature Learning: a filter slides, or convolves, over the imageand extracts features

• Classification: probabilistically map the features to the likelihoodthat an image belongs to aclass

Convolutional Neural Networks

Source: MathWorks

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 5: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

Transfer Learning

• Transfer learning: take a model trained to solve one problem and use it to solve a different problem

• When trained with millions ofimages, neural networks are generic feature extractors

• In transfer learning, neural networks use one dataset totrain the feature learning part of themodel

• Using this feature learning strategy, neural networks classify imagesin another dataset

• Reduces the need for large labelled training datasets in climate science

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 6: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

Architecture Uncertainty

• Tested several architectures with 1, 2, 3, and 16 layers for classifyingimages of ARs(16-layer model=VGGNet)

• How a neural network is trained: minimization of a loss function that quantifies model performance

• 16 layer architecture usedtransfer learning, which led to higher accuracy and more rapid convergence

• Uncertainty: which type of architecture yields best results?

1

0.8

0.6

0.4

0.2

0

VGGNet 3-Conv w/

Augmentation

3-Conv w/o

Augmentation2-Conv 1-Conv

Model Accuracy (higher isbetter)

TrainAccuracy ValidationAccuracy TestAccuracy

Acc

ura

cyLo

ss(E

rror)

Transfer learning:Best Model

Transfer learning: ConvergesRapidly

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 7: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

Transfer Learning for Classifying ARs

• 16-layer model pre-trained on ImageNet: 92%accuracy• ImageNet: dataset with millions of ordinary images (i.e. dogs, cats, benches,etc.)

An image of an atmospheric river, correctly classified by the model.

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 8: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

A Pre-Trained Model for Classifying ARs

• 16 Layer Model Pre-Trained on ImageNet: 92%accuracy• ImageNet: dataset with millions of ordinary images (i.e. dogs, cats, benches,etc.)

An image of an atmospheric river, correctly classified by the model.

Set some pixels to 0 and record if the model classifies the image as anAR

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 9: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

A Pre-Trained Model for Classifying ARs

• 16 Layer Model Pre-Trained on ImageNet: 92%accuracy• ImageNet: dataset with millions of ordinary images (i.e. dogs, cats, benches,etc.)

An image of an atmospheric river, correctly classified by the model.

Set some pixels to 0 and record if the model classifies the image as anAR

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 10: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

A Pre-Trained Model for Classifying ARs

• 16 Layer Model Pre-Trained on ImageNet: 92%accuracy• ImageNet: dataset with millions of ordinary images (i.e. dogs, cats, benches,etc.)

An image of an atmospheric river, correctly classified by the model.

Set some pixels to 0 and record if the model classifies the image as anAR

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 11: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

A Pre-Trained Model for Classifying ARs

• 16 Layer Model Pre-Trained on ImageNet: 92%accuracy• ImageNet: dataset with millions of ordinary images (i.e. dogs, cats, benches,etc.)

An image of an atmospheric river, correctly classified by the model.

Set some pixels to 0 and record if the model classifies the image as anAR

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 12: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

A Pre-Trained Model for Classifying ARs

• 16 Layer Model Pre-Trained on ImageNet: 92%accuracy• ImageNet: dataset with millions of ordinary images (i.e. dogs, cats, benches,etc.)

An image of an atmospheric river, correctly classified by the model.

Set some pixels to 0 and record if the model classifies the image as anAR

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 13: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

A Pre-Trained Model for Classifying ARs

• 16 Layer Model Pre-Trained on ImageNet: 92%accuracy• ImageNet: dataset with millions of ordinary images (i.e. dogs, cats, benches,etc.)

An image of an atmospheric river, correctly classified by the model.

Set some pixels to 0 and record if the model classifies the image as anAR

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 14: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

A Pre-Trained Model for Classifying ARs

• 16 Layer Model Pre-Trained on ImageNet: 92%accuracy• ImageNet: dataset with millions of ordinary images (i.e. dogs, cats, benches,etc.)

Heat Map: when the bottom left portion of

this image is set to 0, the model does not

think the image is an AR(RED)

If the top left or bottom right portion is set

to 0, then the model still thinks the image is

an AR(GREEN)

An image of an atmospheric river, correctly classified by the model.

Set some pixels to 0 and record if the model classifies the image as anAR

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 15: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

A Pre-Trained Model for Classifying ARs

• 16 Layer Model Pre-Trained on ImageNet: 92%accuracy• ImageNet: dataset with millions of ordinary images (i.e. dogs, cats, benches,etc.)

• Conclusion: the model identified the features that make this image anAR!

Heat Map: when the bottom left portion of

this image is set to 0, the model does not

think the image is an AR(RED)

If the top left or bottom right portion is set

to 0, then the model still thinks the image is

an AR(GREEN)

An image of an atmospheric river, correctly classified by the model.

Set some pixels to 0 and record if the model classifies the image as anAR

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 16: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

Classification vs. Segmentation

• Classification: classify each image as amember of a class

• Semantic Segmentation: classify each pixel as a member of class

• Semantic segmentation does not distinguish between multiple instances of the sameclass

Top: this is a picture of acar

Bottom: this is a picture of acrowd

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

People, bicycles, sidewalk, signposts,

roads, and cars are all recognized

Classification Segmentation

Source: Kundu, et al. Feature Space Optimization

for Semantic Video Segmentation, 2016.

Page 17: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

Label Uncertainty

• ARs: Isolate areas 1500 km longwith 95th percentile IntegratedVapor Transport

• Tropical Cyclones: Use theToolkit for Extreme ClimateAnalysis (TECA) to generatelabels

• There is uncertainty with these labels, which rely on arbitrary thresholds

Integrated Water Vapor with LabelledStorms

Atmospheric Rivers

(RED)

Tropical Cyclones

(BLUE)

ARs and TCs oftenoverlap

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 18: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

Segmentation Model Results

• Model has successfully learned the structure of ARs andTCs

• Segmentations are smoother than current “ground-truth” labelling methodologies

• Model predictions remove reliance on arbitrary thresholdsby finding patterns from thousands of training images

• The model can detect TCs and ARs, despite their close proximity

GroundTruth ModelPredictions

Segmentation of ARs (RED) and

TCs (BLUE) in an IWV image

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Diff

ere

ntLoca

tions/

Tim

es

Page 19: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

Segmentation Results: Metric Uncertainty

• Overall accuracy: 92%

• The “ground truth” labels were generated using much more information than the modelwas provided

• Ground-truth-labelling input: integrated vapor transport, geopotential height, wind velocity, and sea surface temperature

• Model input: integrated watervapor

• Metric Uncertainty: how do we evaluate the model whenground truth is imperfect?

Segmentation Confusion Matrix

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 20: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

Future Work

• Investigate how to represent architecture, label, and metric uncertainty

• Ensemble-based extreme event detection• Use different labelling strategies to generate multiple ground

truth datasets

• Train a neural network on each ground truth dataset

• Have each network vote on whether or not an image is a particular type of extreme

• Test neural networks with an expert-hand-labelled dataset

• Use neural networks to detect other classes of extremes

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Page 21: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

Explicit Uncertainty in Training CNNs

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY

Example Training Data: Average AR Mask from ARTMIP algorithms.

Possible approach:

• Modify loss function used in training CNNs

• Explicitly account foruncertainty in trainingdata

• Applicable to expert-labeleddatasets w/ input from multiple experts

Page 22: Assessing Uncertainty in Deep Learning Techniques that ...cw3e.ucsd.edu/IARC_2018/AR_Tracking/OBrien_IARC2018.pdf · Assessing Uncertainty in Deep Learning Techniques that Identify

Thank You

This work was supported in part by the U.S. Department ofEnergy, (DOE) Office of Science, Office of WorkforceDevelopment for Teachers and Scientists (WDTS) under theScience Undergraduate Laboratory Internship (SULI) program,and the DOE Regional and Global Climate Modeling Program aspart of the Calibrated And Systematic Characterization,Attribution, and Detection of Extremes Scientific Focus Area.

For more information, contact:

TravisO’[email protected]

Ankur [email protected]

EARTH AND ENVIRONMENTAL SCIENCES • LAWRENCE BERKELEY NATIONAL LABORATORY