application des réseaux de neurones aux environnements ... · our dataset is derived from a subset...

48
Application des réseaux de neurones aux environnements planétaires P. Garnier, M. Bouayad, Q. Lenouvel, T. Noël, N. Vachon, V. Génot, J. Lasue, S. Toledo, J. Inglada, B. Renard

Upload: others

Post on 02-Aug-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Application des réseaux de neurones aux environnements

planétaires

P. Garnier, M. Bouayad, Q. Lenouvel, T. Noël, N. Vachon, V. Génot, J. Lasue, S. Toledo, J. Inglada, B. Renard

Page 2: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

SupernovaGalaxy

Gravitationnal waves Exoplanets

Space weather : solar activity

impact Moon craters

Polar auroras

Astronomy/Astrophysics Planetology/Aeronomy

Applications of deep learning in astrophysics

A recent and growing approach in planetology / aeronomy

Page 3: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

IRAP PEPS major scientific objectives :• Formation and evolution of planetary environments• Which influence of the Sun on the planets and the

atmospheric escape ?

Page 4: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Which data / constraints / objectives for us ?

Type / amount of data ?

- Images/scalars/multi-dimensional- Temporal / spatial data- Few/Many data

Specific constraints

- Strong temporal dynamics- Uncertainty even among experts- Physics is still present !!!!!

Main objectives

- Classification/Regression/Detection- Provide real-time catalogs and do science- Provide added-value to the community

that uses shared database

Page 5: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

3 example studies involving deep learning

Comet’s dustEarth magnetic reconnection

Martian shocks

Page 6: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Deep learning for comets dust

Comets are among the most pristine objects of solar system

Studying these icy dust balls brings constraints on solar system formation

Credit : Johannes-Kepler-Observatory, April 1997

Dust tail

Ion tail

Page 7: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

• Rosetta : first mission to follow a comet’s nucleus (67P/Churyumov-Gerasimenko), with a lander and an orbiter

• Dust in situ analysis with several instruments :

– COmetary Secondary Ion Mass Analyzer (COSIMA) [50µm to > 500µm] (Kisselet al., 2007)

– Micro-Imaging Dust Analysis System (MIDAS) [< 20µm] (Riedler et al., 2007)

Deep learning for comets dust

Structure of the dust Constraints on comet formation

Page 8: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

COSIMA target 2CF (Langevin et al., 2016)

Classification of dust

aggregates proposed by

COSIMA team :

(a) Compact particle

(b) Shattered cluster

(c) Glued cluster

(d) Rubble pile

Deep learning for comets dust

Our objective : automatically classify all collected dust aggregates based on a Convolutional Neural Network (CNN)

BUT few labeled agregates to train : 176 !!

Page 9: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Deep learning for comets dust

Log and ReLU

Polar mapping

72 imagesfor each

aggregate

Data augmentation

……

Preprocessing of the dust aggregates images

Increasing the contrast (Log and ReLU) Data augmentation (rotation + left/right images) Polar mapping due to radial

symmetry assumed Adding a « no aggregate » class

11232 images for

training the CNN

Page 10: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Deep learning for comets dust

A CNN architecture inspired by SAR techniques

Conv196 channels3x3 kernel

Conv296 channels3x3 kernel

MaxPool12x2 kernel

Conv3256

channels3x3 kernel

MaxPool22x2 kernel

80x80 pixels

The classifier (MLP)

CompactShatteredGluedRubble-pileNone

400 neurons5 neurons

Page 11: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Deep learning for comets dust

Very difficult to recognize the classes (intrinsic confusion between classes)

Performance of CNN classification on test set…

Page 12: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Deep learning for comets dust

Very difficult to recognize the classes (intrinsic confusion between classes)

Manual “blind” classification by J. Lasue and myself :<60% global accuracy vs original paper !!!! (and 60% among us 2)

Performance of CNN classification on test set…

… before we check ourselves the classification by COSIMA team« Experts vs experts »

Detailed analysis reveals more a continuum of dust structures than distinct classes

Page 13: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Deep learning for comets dust

BUT the « no aggregate class » works perfectly !

Change of objective : automatic detection

Segmentation to select aggregates Now : statistical analysis of the

size/volume distribution

Page 14: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Earth reconnection

Earth

Page 15: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Earth reconnection

Earth

Reconnection process

Core of reconnection process : Electron Diffusion Region (EDR)

Page 16: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Earth reconnection

Magnetospheric Multi Scale Mission (NASA)

Launch 2015, 4 spacecraft

Main objective : study reconnectionprocess, in particular the EDR region

Excellent time resolution (order of ms)

32 EDR events published to date : complex to identify, no thresholdapproach possible

Automatic detection of EDR regions based on neural network to provide catalogs and better understand reconnection process

Page 17: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Earth reconnection

Detailed and debated analysis of variables (features selection) to be included in the neural network

Box plots for feature selection

Definition of new features for the EDR detection to reveal agyrotropy of electrons

Supervised classification by a MLP deep neural network of temporal data

Definition of 5 classes to classify the whole Earth plasma environment, but imbalanced datasets with few EDRs : weighted classes

Velocity distribution function of electrons (agyrotropic left, gyrotropic right)

Page 18: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Earth reconnection

Promising results since good recall for EDRs (rare event detection)

Page 19: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Earth reconnection

Promising results since good recall for EDRs (rare event detection)

Real Classes

Predicted classes

Test on independant continous event

EDR correctly recognized !NEXT :

- refinement of model- propose real time

detection of possible EDRs to MMS team

Page 20: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Martian shocks and plasma regions

MARS

Full zoology of « boundaries »shock, Magnetic Pile-Up Boundary or Induced Magnetospheric Boundary….

Strong dynamics of the plasma regions and boundaries

Study this dynamics to betterunderstand the martian escape history

MagnetosheathSolar wind

Page 21: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Shock detection• Not a rare event (twice

every orbit) : 11000 for Mars Express mission

• Classic approach based on threshold detection

• Aim : use similar supervised deep neural network for several space missions and planets

Martian shocks and plasma regions

Electrons

Ions velocity

Ions temperature

Magnetic field

Distance to planet

Page 22: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Specific problems

Martian shocks and plasma regions

• Published catalogs for supervised learning provide a single time for shock crossings, we want to keep fine structure

• From an author to another, several minutes of difference depending on their method

• For any expert, uncertainty of serveral minutes ; the shock is not a boundary but a transition region

Need to add pre/post processing steps

Time difference for shock detection between 2 published datasets

Page 23: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Feature selection and creation to enhance time variations

Rupture algorithm for change point detection

+ k-fold+ HyperOpt bayesian

optimization

Layers : 8:40:38:3

Page 24: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Martian shocks and plasma regions

Metrics with k-foldMetrics including time tolerance

Good results for shock detection, assuming some temporal tolerance(better than threshold method by Hall et al. 2016)

Same algorithm being applied to Mars Express / MAVEN / Venus Express

Page 25: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Martian shocks and plasma regions

What about physics ?

Analysis of metrics coherent withexperts knowledge : shockdetection difficult away from the nose due to plasma turbulence

Algorithm keeps traces of fine structure with e.g. oscillatingshocks that are interesting for us

Solar Wind

Shock

Planetaryenvironment

Page 26: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Automatic data labeling for the community in the CDPP/AMDA planetary database : prototype of user added value

Discrete class labelling : planetary environment (0), shock (1), Solar Wind (2)

Martian shocks and plasma regions

Mars Express electrons fluxes

Or with continuous class value given for finer interpretation :

MAVEN electrons fluxes

Page 27: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

THANK YOU !!

Page 28: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →
Page 29: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →
Page 30: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Building the dataset

Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800)

→ measurements in a ±15 min window around shock crossings

→ ~1 800 000 data vectors

→ labeled in 3 classes (Solar wind, Bow shock, Close environment)

→ the class of interest represents 6.2% of the dataset

Page 31: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Detailed algorithm layout

- Classification with the MLP classifier (param. : architecture, loss, opt.)- Reduction of the number of vectors by extracting the class variations - Post-processing : correction of the variations (param. : Dt_corr)- Extracting the bow shock crossings from the variations- Post-processing : merging the crossings that are separated by less than

Dt (param. : Dt_cross)- Returns the list of timed crossings

v1

v2

c0

Page 32: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

MLP classification (no post-processing)

- Temporality ignored (the algorithm is trained vector by vector)- Loss function : jaccard distance- Optimizer : adam

Results example :

Page 33: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Evaluation of identified crossings

Now that we identified the bow shock crossings, we have to evaluate the accuracy and recall with respect to the crossings training list.

We then define a tolerance window of ±Dt around the true crossings, and evaluate the performance of the algorithm for a fixed Dt.

Page 34: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Ruptures library- Change point detection for non-stationary signals:ex:

Without

With

Shock position estimationfor sample labeling:

Change point Change point Change point Change point

beginningend

Page 35: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Grusbeck catalog

Fangcatalog

catalogsunion

Rupture relocation

AMDAdata

To dataset

samplelabeling

Page 36: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Rupture relocation and sample labeling

Raw data based on catalog

Rupture prediction relocates shock position

TALK ABOUT SCALAR PRODUCT PREDICTION ?

0 1 2

Different zones are labeled

To dataset

Page 37: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Discussion: Detection delta timeDistribution of shock date estimations for each catalog compared with ruptures’ estimations.

Grusbeck:mean = -13sstdev = 135s

Fang:mean = 132sstdev = 376s

Page 38: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Postprocessing

Simple moving average on probabilities for each class.

True classRaw predictionPost processed predictionAbove

In

Below

Page 39: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Optimization: hyperopt library

Results:Every point in the same 2%

No significant improvements even if batch size is included in the optimization space

Page 40: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

ANN structure

Feedforward multilayer perceptron

Input layer: 8 neurons [relu]Hidden layer 1: 40 neurons [relu]Hidden layer 2: 38 neurons [tanh]Output layer: 3 neurons [softmax]

Page 41: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →

Extrapolation to Venus Express

Similar measurements on Venus Express

Similar metrics with a MAVEN trained algorithm on VEX data

Page 42: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →
Page 43: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →
Page 44: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →
Page 45: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →
Page 46: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →
Page 47: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →
Page 48: Application des réseaux de neurones aux environnements ... · Our dataset is derived from a subset of the catalog identified by Hall (~1500 bow shock crossings out of ~11800) →