probabilistic graphical models: introduction and general information · 2015. 10. 1. · polycopi e...

13
Probabilistic graphical models: Introduction and general information Simon Lacoste-Julien, INRIA/ENS Francis Bach, INRIA/ENS Guillaume Obozinski, ENPC M2 MVA 2015-2016 Probabilistic graphical models: Intro 1/13

Upload: others

Post on 24-Aug-2021

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Probabilistic graphical models: Introduction and general information · 2015. 10. 1. · Polycopi e : The course will be based on the book in preparation of Michael Jordan (UC Berkeley)

Probabilistic graphical models:Introduction and general information

Simon Lacoste-Julien, INRIA/ENS

Francis Bach, INRIA/ENS

Guillaume Obozinski, ENPC

M2 MVA 2015-2016

Probabilistic graphical models: Intro 1/13

Page 2: Probabilistic graphical models: Introduction and general information · 2015. 10. 1. · Polycopi e : The course will be based on the book in preparation of Michael Jordan (UC Berkeley)

Algorithm

Criterion/Formulation

Model Principles

A prioriknowledge

Donnees

Task + lear-ning modality

• Max entropy• Max likelihood• Min risk• MDL• Bayesian calculus• Max margin• Min regret

• Supervision ?• On/off line• Active/Passive• Sequentiel minimax• Reinforcement

• Function spaces− RKHS (kernels)− Wavelets•Functional rel.• Probabilistic rel.

• Optimization• Expectation computa-tion

• Dynamic progr• (Linear Algebra)

Probabilistic graphical models: Intro 2/13

Page 3: Probabilistic graphical models: Introduction and general information · 2015. 10. 1. · Polycopi e : The course will be based on the book in preparation of Michael Jordan (UC Berkeley)

Structured problems in HD

Probabilistic graphical models: Intro 3/13

Page 4: Probabilistic graphical models: Introduction and general information · 2015. 10. 1. · Polycopi e : The course will be based on the book in preparation of Michael Jordan (UC Berkeley)

Sequence modelling

How to model the distribution of DNA sequences of length k ?

Naive model→ 4n − 1 parameters

Independant model → 3n parameters

x1 x2 x3 x4

First order Markov chain :

x1 x2 x3 x4

Second order Markov chain :

x1 x2 x3 x4

Number of parameters O(n) for chains of length n.

Probabilistic graphical models: Intro 4/13

Page 5: Probabilistic graphical models: Introduction and general information · 2015. 10. 1. · Polycopi e : The course will be based on the book in preparation of Michael Jordan (UC Berkeley)

Models for speech processing

Speech modelled by a sequence of unobserved phonemes

For each phoneme a random sound is produced following adistribution which characterizes the phoneme

Hidden Markov Model : HMM (Modele de Markov cache)

zn−1 zn zn+1

xn−1 xn xn+1

z1 z2

x1 x2

→ Latent variable models

Probabilistic graphical models: Intro 5/13

Page 6: Probabilistic graphical models: Introduction and general information · 2015. 10. 1. · Polycopi e : The course will be based on the book in preparation of Michael Jordan (UC Berkeley)

Modelling image structures

Markov Random Field(Champ de Markov cache)

Segmentation→ oriented graphical model vs non oriented

Probabilistic graphical models: Intro 6/13

Page 7: Probabilistic graphical models: Introduction and general information · 2015. 10. 1. · Polycopi e : The course will be based on the book in preparation of Michael Jordan (UC Berkeley)

Anaesthesia alarm (Beinlich et al., 1989)

“The ALARM Monitoring system”

http://www.bnlearn.com/documentation/networks/

CVP central venous pressurePCWP pulmonary capillary wedge pressureHIST historyTPR total peripheral resistanceBP blood pressureCO cardiac outputHRBP heart rate / blood pressure.HREK heart rate measured by an EKG monitorHRSA heart rate / oxygen saturation.PAP pulmonary artery pressure.SAO2 arterial oxygen saturation.FIO2 fraction of inspired oxygen.PRSS breathing pressure.ECO2 expelled CO2.MINV minimum volume.MVS minimum volume setHYP hypovolemiaLVF left ventricular failureAPL anaphylaxisANES insufficient anesthesia/analgesia.PMB pulmonary embolusINT intubationKINK kinked tube.DISC disconnectionLVV left ventricular end-diastolic volumeSTKV stroke volumeCCHL catecholamineERLO error low outputHR heart rate.ERCA electrocauterSHNT shuntPVS pulmonary venous oxygen saturationACO2 arterial CO2VALV pulmonary alveoli ventilationVLNG lung ventilationVTUB ventilation tubeVMCH ventilation machine

Probabilistic graphical models: Intro 7/13

Page 8: Probabilistic graphical models: Introduction and general information · 2015. 10. 1. · Polycopi e : The course will be based on the book in preparation of Michael Jordan (UC Berkeley)

Probabilistic model

1

2

3

7

8

6

4

5

9

f12 f34 f45

f678

f9

f56f37f23

p(x1, x2, . . . , x9) = f12(x1, x2) f23(x2, x3) f34(x3, x4) f45(x4, x5) . . .

f56(x5, x6) f37(x3, x7) f678(x6, x7, x8) f9(x9)

Probabilistic graphical models: Intro 8/13

Page 9: Probabilistic graphical models: Introduction and general information · 2015. 10. 1. · Polycopi e : The course will be based on the book in preparation of Michael Jordan (UC Berkeley)

Abstact models vs concrete ones

Abstracts models

Linear regression

Logistic regression

Mixture model

Principal Component Analysis

Canonical Correlation Analysis

Independent Component analysis

LDA (Multinomiale PCA)

Naive Bayes Classifier

Mixture of experts

Concrete Models

Markov chains

HMM

Tree-structured models

Double HMMs

Oriented acyclic models

Markov Random Fields

Star models

Constellation Model

Probabilistic graphical models: Intro 9/13

Page 10: Probabilistic graphical models: Introduction and general information · 2015. 10. 1. · Polycopi e : The course will be based on the book in preparation of Michael Jordan (UC Berkeley)

Operations on graphical models

Probabilistic inference

Computing a marginal distr. p(xi ) ou p(xi |x1 = 3, x7 = 0)

Decoding (MAP inference)

What is the most likely instance ?

argmaxzp(z |x)

zn−1 zn zn+1

xn−1 xn xn+1

z1 z2

x1 x2

Learning (or Estimation)

Soit p(x ;θ) = 1Z(θ)

∏C ψ(xC , θC ), we want to find

argmaxθ

n∏i=1

p(x (i);θ) = argmaxθ1

Z (θ)

n∏i=1

∏C

ψ(x(i)C , θC )

Probabilistic graphical models: Intro 10/13

Page 11: Probabilistic graphical models: Introduction and general information · 2015. 10. 1. · Polycopi e : The course will be based on the book in preparation of Michael Jordan (UC Berkeley)

Course outline

Lecture 1IntroductionMaximum likelihoodLinear regressionLogistic regressionGenerative classification (Fisherdiscriminant)

Lecture 2K-meansEMGaussian mixturesGraph Theoretic aspects

Lecture 3Unoriented graphical modelsOriented graphical models

Lecture 4Exponential familiesInformation Theory

Lecture 5Gaussian VariablesFactorial Analysis

Lecture 6Sum-product algorithmHMM

Lecture 7Approximate inference

Lecture 8Approximate inference

Lecture 9Bayesian methodsModel selection

Probabilistic graphical models: Intro 11/13

Page 12: Probabilistic graphical models: Introduction and general information · 2015. 10. 1. · Polycopi e : The course will be based on the book in preparation of Michael Jordan (UC Berkeley)

General informationEvery Wed 9am-12pm amphi Curie until Dec 2.Except Nov 11 : no lectureGrading :

Homework 1 (15%)Homework 2 (15%)Homework 3 (15%)Exam (25%)Project (30%)

Programming :All Hwk+ Project involve programmingYou may choose the programming language you wantWe recommend you choose a vector oriented PL such as Python, RMatlab.

Polycopie :The course will be based on the book in preparation of MichaelJordan (UC Berkeley).

Scribes :For some lectures, the students are encouraged to volunteer asscribes to contribute to the lecture notes.

Probabilistic graphical models: Intro 12/13

Page 13: Probabilistic graphical models: Introduction and general information · 2015. 10. 1. · Polycopi e : The course will be based on the book in preparation of Michael Jordan (UC Berkeley)

To start : models with 1 and 2 nodes...Regression and classification

t

xx0

2σy(x0,w)

y(x,w)

p(t|x0,w, β)

−6 −4 −2 0 2 4 6−6

−4

−2

0

2

4

6

Mixture models

xn

zn

N

µ

σ2

W

(a)

0 0.5 1

0

0.5

1 (b)

0 0.5 1

0

0.5

1

Probabilistic graphical models: Intro 13/13